Back to Benchmarks

llama-3.1-70b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
2.68 $/1M tks
3.54 $/1M tks
31.9 tks/sec
1443.32 ms
31.35 ms
7776.59 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
32.95 tks/sec
982.94 ms
30.35 ms
7568.15 ms
0 sec
0.35 $/1M tks
0.4 $/1M tks
20.65 tks/sec
652.8 ms
48.43 ms
9515.59 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
89.32 tks/sec
688.39 ms
11.2 ms
2916.4 ms
0 sec
0.8 $/1M tks
0.8 $/1M tks
54.53 tks/sec
685.43 ms
18.34 ms
3913.19 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
72.15 tks/sec
712.72 ms
13.86 ms
3069.08 ms
0 sec
1 $/1M tks
1 $/1M tks
23.59 tks/sec
704.26 ms
42.39 ms
9901.87 ms
0 sec
0.88 $/1M tks
0.88 $/1M tks
23.55 tks/sec
2036.7 ms
42.47 ms
11719.45 ms
0 sec
0.59 $/1M tks
0.79 $/1M tks
323.06 tks/sec
985.59 ms
3.1 ms
1490.13 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
32.05 tks/sec
1496.55 ms
31.2 ms
7206.67 ms
0 sec