llama-3.1-70b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
azure-ai
aws-bedrock
deepinfra
fireworks-ai
lepton-ai
octoai
perplexity-ai
together-ai
groq
vertex-ai
| Provider | Input cost ($/1M tks) | Output cost ($/1M tks) | Output speed (tks/sec) | TTFT (ms) | ITL (ms) | E2E latency (ms) | Cold start (sec) |
|---|---|---|---|---|---|---|---|
| azure-ai | 2.68 | 3.54 | 31.66 | 517.92 | 31.59 | 32771.23 | 0 |
| aws-bedrock | 0.99 | 0.99 | 44.39 | 776.44 | 22.53 | 1046.76 | 0 |
| deepinfra | 0.35 | 0.4 | 25.33 | 415.92 | 39.48 | 1955.7 | 0 |
| fireworks-ai | 0.9 | 0.9 | 172.65 | 626.35 | 5.79 | 6070.76 | 0 |
| lepton-ai | 0.8 | 0.8 | 57.65 | 905.79 | 17.35 | 16639.83 | 0 |
| octoai | 0.9 | 0.9 | 72.04 | 504.76 | 13.88 | 13719.56 | 0 |
| perplexity-ai | 1 | 1 | 56.87 | 470.74 | 17.58 | 3600.74 | 0 |
| together-ai | 0.88 | 0.88 | 72.3 | 619.91 | 13.83 | 14659.24 | 0 |
| groq | 0.59 | 0.79 | 244.56 | 543.62 | 4.09 | 4693.93 | 0 |
| vertex-ai | 0.99 | 0.99 | 45.1 | 1112.03 | 22.18 | 1200.73 | 0 |
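The figures above can be sanity-checked with a little arithmetic. The sketch below assumes that each provider's block of seven values follows the order of the provider list, that the `tks/sec` value is output throughput, and that the second `ms` value is inter-token latency; none of these labels appear in the listing itself, so treat them as assumptions. Under those assumptions, inter-token latency should be roughly `1000 / throughput`, and the helper `find_mismatches` (a hypothetical name) reports any provider where the published pair disagrees by more than a rounding margin.

```python
# Assumed pairing: provider -> (throughput in tks/sec, inter-token latency in ms).
# Values are copied from the listing; the column meanings are an assumption.
ROWS = {
    "azure-ai": (31.66, 31.59),
    "aws-bedrock": (44.39, 22.53),
    "deepinfra": (25.33, 39.48),
    "fireworks-ai": (172.65, 5.79),
    "lepton-ai": (57.65, 17.35),
    "octoai": (72.04, 13.88),
    "perplexity-ai": (56.87, 17.58),
    "together-ai": (72.3, 13.83),
    "groq": (244.56, 4.09),
    "vertex-ai": (45.1, 22.18),
}


def implied_itl_ms(tokens_per_sec: float) -> float:
    """Milliseconds per token implied by a throughput in tokens per second."""
    return 1000.0 / tokens_per_sec


def find_mismatches(rows, tolerance_ms: float = 0.02):
    """Return providers whose listed latency differs from 1000 / throughput.

    The tolerance allows for both values having been rounded to two decimals.
    """
    return [
        name
        for name, (tps, itl_ms) in rows.items()
        if abs(implied_itl_ms(tps) - itl_ms) > tolerance_ms
    ]


if __name__ == "__main__":
    for name, (tps, itl_ms) in ROWS.items():
        print(f"{name:14s} listed={itl_ms:6.2f} ms  implied={implied_itl_ms(tps):6.2f} ms")
    print("mismatches:", find_mismatches(ROWS))
```

If the check reports no mismatches, the two values carry the same information, so either one is enough when comparing providers on generation speed.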