Back to Benchmarks

llama-3-8b-chat

text-generation
Uploaded: 18.04.2024
Seq Length:
Providers
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
groq
provider icon
lepton-ai
provider icon
replicate
provider icon
together-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.6 $/1M tks
90.78 tks/sec
565.87 ms
11.02 ms
819.23 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
96.9 tks/sec
434.61 ms
10.32 ms
610.05 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
174.26 tks/sec
588.04 ms
5.74 ms
6751.22 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
333.26 tks/sec
564.79 ms
3 ms
3229.36 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
100.59 tks/sec
788.22 ms
9.94 ms
1494.05 ms
0 sec
0.05 $/1M tks
0.25 $/1M tks
17.35 tks/sec
1622.34 ms
57.64 ms
5080.5 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
281.56 tks/sec
627.01 ms
3.55 ms
939.56 ms
0 sec