Back to Benchmarks

llama-3-8b-chat

text-generation
Uploaded: 18.04.2024
Seq Length:
Providers
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
groq
provider icon
lepton-ai
provider icon
replicate
provider icon
together-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.6 $/1M tks
85.72 tks/sec
552.36 ms
11.67 ms
902.32 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
105.91 tks/sec
358.21 ms
9.44 ms
8685.74 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
146.31 tks/sec
377.72 ms
6.83 ms
904 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
351.04 tks/sec
502.75 ms
2.85 ms
3391.31 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
97.49 tks/sec
688.72 ms
10.26 ms
10812.65 ms
0 sec
0.05 $/1M tks
0.25 $/1M tks
17.79 tks/sec
1406.16 ms
56.2 ms
8767.98 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
290.46 tks/sec
535.77 ms
3.44 ms
818.09 ms
0 sec