Back to Benchmarks

llama-3.1-405b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
vertex-ai
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
replicate
provider icon
together-ai
Learn more about how we are collecting this data here
5.32 $/1M tks
16 $/1M tks
21.82 tks/sec
1773.13 ms
45.82 ms
3560.28 ms
0 sec
5.33 $/1M tks
16 $/1M tks
10.01 tks/sec
817.62 ms
99.94 ms
95256.32 ms
0 sec
5.32 $/1M tks
16 $/1M tks
13.98 tks/sec
1765.45 ms
71.53 ms
8990.32 ms
0 sec
1.79 $/1M tks
1.79 $/1M tks
26.29 tks/sec
690.84 ms
38.03 ms
48039.62 ms
0 sec
3 $/1M tks
3 $/1M tks
86.52 tks/sec
629.55 ms
11.56 ms
12650.08 ms
0 sec
Not computed
Not computed
Not computed
Not computed
Not computed
Not computed
Not computed
9.5 $/1M tks
9.5 $/1M tks
17.56 tks/sec
1398.87 ms
56.94 ms
30493.81 ms
0 sec
3.5 $/1M tks
3.5 $/1M tks
87.95 tks/sec
780.48 ms
11.37 ms
12651.39 ms
0 sec