Back to Benchmarks

llama-3.1-405b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
vertex-ai
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
replicate
provider icon
together-ai
Learn more about how we are collecting this data here
5.32 $/1M tks
16 $/1M tks
29.98 tks/sec
1753.3 ms
33.35 ms
36808.79 ms
0 sec
5.33 $/1M tks
16 $/1M tks
6.85 tks/sec
832.52 ms
145.95 ms
6962.32 ms
0 sec
5.32 $/1M tks
16 $/1M tks
13.91 tks/sec
2156.99 ms
71.9 ms
12222.77 ms
0 sec
1.79 $/1M tks
1.79 $/1M tks
25.83 tks/sec
667.56 ms
38.72 ms
39695.15 ms
0 sec
3 $/1M tks
3 $/1M tks
84.88 tks/sec
628.64 ms
11.78 ms
12634.22 ms
0 sec
Not computed
Not computed
Not computed
Not computed
Not computed
Not computed
Not computed
9.5 $/1M tks
9.5 $/1M tks
14.08 tks/sec
1389.91 ms
71.02 ms
5296.02 ms
0 sec
3.5 $/1M tks
3.5 $/1M tks
34.97 tks/sec
1437.04 ms
28.6 ms
2066.17 ms
0 sec