Back to Benchmarks

llama-3.1-70b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
2.68 $/1M tks
3.54 $/1M tks
26.41 tks/sec
385.92 ms
37.86 ms
7995.93 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
32.35 tks/sec
758.37 ms
30.91 ms
7125.58 ms
0 sec
0.35 $/1M tks
0.4 $/1M tks
24.28 tks/sec
349.64 ms
41.18 ms
7639.32 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
108.49 tks/sec
575.92 ms
9.22 ms
2382.55 ms
0 sec
0.8 $/1M tks
0.8 $/1M tks
72.14 tks/sec
536.78 ms
13.86 ms
938.79 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
70.97 tks/sec
456.63 ms
14.09 ms
3613.05 ms
0 sec
1 $/1M tks
1 $/1M tks
52.16 tks/sec
476.17 ms
19.17 ms
4731.99 ms
0 sec
0.88 $/1M tks
0.88 $/1M tks
61.55 tks/sec
701.23 ms
16.25 ms
4145.42 ms
0 sec
0.59 $/1M tks
0.79 $/1M tks
242.44 tks/sec
414.77 ms
4.12 ms
1454.22 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
36.13 tks/sec
1169.9 ms
27.68 ms
7563.57 ms
0 sec