llama-3.1-8b-chat

text-generation
Uploaded: 24.07.2024
Providers
azure-ai
aws-bedrock
deepinfra
fireworks-ai
lepton-ai
octoai
perplexity-ai
together-ai
groq
vertex-ai
Provider       $/1M tks  $/1M tks  tks/sec  ms       ms     ms       sec
azure-ai       0.3       0.61      32.18    455.52   31.07  1045.9   0
aws-bedrock    0.22      0.22      113.16   479.58   8.84   629.81   0
deepinfra      0.06      0.06      74.8     306.52   13.37  774.44   0
fireworks-ai   0.2       0.2       263.44   474.35   3.8    1127.25  0
lepton-ai      0.07      0.07      186.24   373.38   5.37   1576.13  0
octoai         0.15      0.15      208.46   935.39   4.8    1961.96  0
perplexity-ai  0.2       0.2       196.58   471.97   5.09   1346.93  0
together-ai    0.18      0.18      77.59    478.48   12.89  2850.02  0
groq           0.05      0.08      387.12   425.72   2.58   970.77   0
vertex-ai      0.22      0.22      42.46    1100.39  23.55  6139.95  0
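
A quick arithmetic check on the figures above, offered as a sketch rather than a description of how the data was produced: for every provider, the second millisecond figure is within rounding of 1000 divided by the tks/sec figure, which suggests it is a per-token time. The page does not show column names, so that reading is an assumption, as is the pairing of each group of figures with a provider by listing order. The names `ROWS`, `ms_per_token`, and `check_rows` are illustrative only.

```python
# (provider, throughput in tks/sec, second ms figure), copied from the figures above.
# Assumption: the groups of figures follow the order of the provider list.
ROWS = [
    ("azure-ai", 32.18, 31.07),
    ("aws-bedrock", 113.16, 8.84),
    ("deepinfra", 74.8, 13.37),
    ("fireworks-ai", 263.44, 3.8),
    ("lepton-ai", 186.24, 5.37),
    ("octoai", 208.46, 4.8),
    ("perplexity-ai", 196.58, 5.09),
    ("together-ai", 77.59, 12.89),
    ("groq", 387.12, 2.58),
    ("vertex-ai", 42.46, 23.55),
]


def ms_per_token(tokens_per_second: float) -> float:
    """Convert a throughput in tokens per second to milliseconds per token."""
    return 1000.0 / tokens_per_second


def check_rows(rows=ROWS, tolerance=0.01):
    """Map each provider to True if its reported ms figure matches 1000 / throughput."""
    return {
        name: abs(ms_per_token(tps) - reported_ms) <= tolerance
        for name, tps, reported_ms in rows
    }


if __name__ == "__main__":
    for name, ok in check_rows().items():
        print(f"{name}: {'consistent' if ok else 'mismatch'}")
```

The tolerance of 0.01 ms allows for both figures having been rounded to two decimals before display.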