Back to Benchmarks

llama-3.1-8b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.61 $/1M tks
53.22 tks/sec
407.3 ms
18.79 ms
989.76 ms
0 sec
0.22 $/1M tks
0.22 $/1M tks
92.88 tks/sec
559.69 ms
10.77 ms
2788.38 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
72.22 tks/sec
440.01 ms
13.85 ms
3126.27 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
294.96 tks/sec
768.14 ms
3.39 ms
1469.92 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
197 tks/sec
530.96 ms
5.08 ms
1536.02 ms
0 sec
0.15 $/1M tks
0.15 $/1M tks
198.02 tks/sec
685.32 ms
5.05 ms
1725.63 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
193.82 tks/sec
562.32 ms
5.16 ms
1583.89 ms
0 sec
0.18 $/1M tks
0.18 $/1M tks
104.11 tks/sec
578.65 ms
9.61 ms
1596.79 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
221.85 tks/sec
517.06 ms
4.51 ms
1423.07 ms
0 sec
0.22 $/1M tks
0.22 $/1M tks
126.19 tks/sec
1244.69 ms
7.92 ms
2782.01 ms
0 sec