Back to Benchmarks

llama-3.1-70b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
2.68 $/1M tks
3.54 $/1M tks
21.67 tks/sec
459.82 ms
46.14 ms
3920.44 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
31.26 tks/sec
916.81 ms
31.99 ms
7155.49 ms
0 sec
0.35 $/1M tks
0.4 $/1M tks
20.83 tks/sec
578.12 ms
48.01 ms
9987.4 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
237.04 tks/sec
472.64 ms
4.22 ms
1430.28 ms
0 sec
0.8 $/1M tks
0.8 $/1M tks
57.96 tks/sec
506.87 ms
17.25 ms
3405.48 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
71.5 tks/sec
550.63 ms
13.99 ms
3207.98 ms
0 sec
1 $/1M tks
1 $/1M tks
52.92 tks/sec
565.21 ms
18.9 ms
4571.11 ms
0 sec
0.88 $/1M tks
0.88 $/1M tks
50.11 tks/sec
545.42 ms
19.96 ms
3958.03 ms
0 sec
0.59 $/1M tks
0.79 $/1M tks
237.43 tks/sec
549.78 ms
4.21 ms
1371.08 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
32.83 tks/sec
1244.72 ms
30.46 ms
7092.25 ms
0 sec