
llama-3.1-8b-chat

text-generation
Uploaded: 24.07.2024
Providers
azure-ai
aws-bedrock
deepinfra
fireworks-ai
lepton-ai
octoai
perplexity-ai
together-ai
groq
vertex-ai
Provider        $/1M tks  $/1M tks  tks/sec  ms       ms     ms        sec
azure-ai        0.3       0.61      45.37    508.05   22.04  16290.41  0
aws-bedrock     0.22      0.22      89.42    532.07   11.18  12275.06  0
deepinfra       0.06      0.06      76.18    317.11   13.13  737.18    0
fireworks-ai    0.2       0.2       237.73   367.07   4.21   396.51    0
lepton-ai       0.07      0.07      160.82   373.94   6.22   7188.92   0
octoai          0.15      0.15      170.56   1141.11  5.86   1334.59   0
perplexity-ai   0.2       0.2       163.68   468.27   6.11   7366.06   0
together-ai     0.18      0.18      102.43   573.95   9.76   10063.64  0
groq            0.05      0.08      367.74   539.43   2.72   3106.47   0
vertex-ai       0.22      0.22      119.97   1153.8   8.34   2604.22   0
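
The page does not name its metric columns, but the per-provider figures above show one relationship that can be checked directly: the second millisecond value in each row matches 1000 divided by the tks/sec value, to two decimals. Reading that value as the average time per generated token is an assumption, not something the page states. The sketch below, with hypothetical names `ROWS`, `ms_per_token`, and `check_rows`, copies the figures and verifies the relationship for every provider.

```python
# Figures copied from the page, in provider order. Column meanings are not
# stated in the source; only the units are known:
# (provider, $/1M tks, $/1M tks, tks/sec, ms, ms, ms)
ROWS = [
    ("azure-ai",      0.30, 0.61,  45.37,  508.05, 22.04, 16290.41),
    ("aws-bedrock",   0.22, 0.22,  89.42,  532.07, 11.18, 12275.06),
    ("deepinfra",     0.06, 0.06,  76.18,  317.11, 13.13,   737.18),
    ("fireworks-ai",  0.20, 0.20, 237.73,  367.07,  4.21,   396.51),
    ("lepton-ai",     0.07, 0.07, 160.82,  373.94,  6.22,  7188.92),
    ("octoai",        0.15, 0.15, 170.56, 1141.11,  5.86,  1334.59),
    ("perplexity-ai", 0.20, 0.20, 163.68,  468.27,  6.11,  7366.06),
    ("together-ai",   0.18, 0.18, 102.43,  573.95,  9.76, 10063.64),
    ("groq",          0.05, 0.08, 367.74,  539.43,  2.72,  3106.47),
    ("vertex-ai",     0.22, 0.22, 119.97, 1153.80,  8.34,  2604.22),
]


def ms_per_token(tokens_per_sec: float) -> float:
    """Convert a throughput in tokens/second to milliseconds per token."""
    return 1000.0 / tokens_per_sec


def check_rows(rows, tolerance_ms: float = 0.01):
    """Return providers whose second ms value differs from 1000 / (tks/sec)."""
    mismatches = []
    for provider, _, _, tps, _, second_ms, _ in rows:
        if abs(ms_per_token(tps) - second_ms) > tolerance_ms:
            mismatches.append(provider)
    return mismatches


if __name__ == "__main__":
    for provider, _, _, tps, _, second_ms, _ in ROWS:
        print(f"{provider:14s} 1000/{tps} = {ms_per_token(tps):.2f} ms (listed: {second_ms})")
    print("mismatches:", check_rows(ROWS))
```

With the tolerance set to the rounding precision of the listed values, `check_rows(ROWS)` returns an empty list, so that column carries no information beyond the throughput figure.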