Back to Benchmarks

llama-3-8b-chat

text-generation
Uploaded: 18.04.2024
Seq Length:
Providers
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
groq
provider icon
lepton-ai
provider icon
replicate
provider icon
together-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.6 $/1M tks
81.21 tks/sec
462.68 ms
12.31 ms
2642.24 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
113.07 tks/sec
368.91 ms
8.84 ms
1651.32 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
178.75 tks/sec
352.92 ms
5.59 ms
1516.57 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
374.77 tks/sec
349.53 ms
2.67 ms
904.53 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
101.68 tks/sec
691.82 ms
9.84 ms
2599.85 ms
0 sec
0.05 $/1M tks
0.25 $/1M tks
19.04 tks/sec
1480.75 ms
52.52 ms
8623.17 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
355.21 tks/sec
712.91 ms
2.82 ms
1298.48 ms
0 sec