Back to Benchmarks

llama-3.1-8b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
0.3 $/1M tks
0.61 $/1M tks
30.47 tks/sec
658.02 ms
32.82 ms
3021.28 ms
0 sec
0.22 $/1M tks
0.22 $/1M tks
96.08 tks/sec
785.88 ms
10.41 ms
1035.67 ms
0 sec
0.06 $/1M tks
0.06 $/1M tks
72.71 tks/sec
617.31 ms
13.75 ms
11028.66 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
228.8 tks/sec
655.84 ms
4.37 ms
1800.94 ms
0 sec
0.07 $/1M tks
0.07 $/1M tks
95.76 tks/sec
940.6 ms
10.44 ms
9065.22 ms
0 sec
0.15 $/1M tks
0.15 $/1M tks
162.31 tks/sec
700.93 ms
6.16 ms
1070.59 ms
0 sec
0.2 $/1M tks
0.2 $/1M tks
408.55 tks/sec
944.25 ms
2.45 ms
988.31 ms
0 sec
0.18 $/1M tks
0.18 $/1M tks
123.82 tks/sec
1852.64 ms
8.08 ms
9363.42 ms
0 sec
0.05 $/1M tks
0.08 $/1M tks
339.66 tks/sec
809.84 ms
2.94 ms
3700.94 ms
0 sec
0.22 $/1M tks
0.22 $/1M tks
90.87 tks/sec
1349.04 ms
11 ms
11473.36 ms
0 sec