Back to Benchmarks

llama-3.1-8b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.61 $/1M tks
66.24 tks/sec
584.68 ms
15.1 ms
3332.34 ms
0 sec
0.22 $/1M tks
0.22 $/1M tks
86.59 tks/sec
730.49 ms
11.55 ms
2994.12 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
47.96 tks/sec
583.12 ms
20.85 ms
937.59 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
272.06 tks/sec
658.38 ms
3.68 ms
1353.08 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
217.16 tks/sec
680.78 ms
4.6 ms
786.69 ms
0 sec
0.15 $/1M tks
0.15 $/1M tks
193.55 tks/sec
678.32 ms
5.17 ms
1809.84 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
494.11 tks/sec
1238 ms
2.02 ms
1272.4 ms
0 sec
0.18 $/1M tks
0.18 $/1M tks
149.14 tks/sec
760.79 ms
6.71 ms
1974.42 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
281.78 tks/sec
1615.86 ms
3.55 ms
2339.83 ms
0 sec
0.22 $/1M tks
0.22 $/1M tks
39.97 tks/sec
1314.72 ms
25.02 ms
6218.08 ms
0 sec