Back to Benchmarks

llama-3-8b-chat

text-generation
Uploaded: 18.04.2024
Seq Length:
Providers
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
groq
provider icon
lepton-ai
provider icon
replicate
provider icon
together-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.6 $/1M tks
80.72 tks/sec
526.89 ms
12.39 ms
3177.92 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
113.19 tks/sec
410.68 ms
8.83 ms
2151.08 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
195.09 tks/sec
446.9 ms
5.13 ms
1461.83 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
378.41 tks/sec
449.62 ms
2.64 ms
938.52 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
100.11 tks/sec
702.48 ms
9.99 ms
2410.68 ms
0 sec
0.05 $/1M tks
0.25 $/1M tks
14.63 tks/sec
1552.96 ms
68.34 ms
3329.91 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
337.2 tks/sec
776.5 ms
2.97 ms
1381.49 ms
0 sec