Back to Benchmarks

llama-3.1-70b-chat

text-generation
Uploaded: 24.07.2024
Seq Length:
Providers
provider icon
azure-ai
provider icon
aws-bedrock
provider icon
deepinfra
provider icon
fireworks-ai
provider icon
lepton-ai
provider icon
octoai
provider icon
perplexity-ai
provider icon
together-ai
provider icon
groq
provider icon
vertex-ai
Learn more about how we are collecting this data here
2.68 $/1M tks
3.54 $/1M tks
32.09 tks/sec
774.66 ms
31.16 ms
29319.6 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
29.81 tks/sec
1023.81 ms
33.54 ms
26684.41 ms
0 sec
0.35 $/1M tks
0.4 $/1M tks
23.64 tks/sec
733.84 ms
42.31 ms
42238.58 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
162.39 tks/sec
790.77 ms
6.16 ms
6782.37 ms
0 sec
0.8 $/1M tks
0.8 $/1M tks
38.44 tks/sec
792.49 ms
26.01 ms
1130.65 ms
0 sec
0.9 $/1M tks
0.9 $/1M tks
31.13 tks/sec
779.41 ms
32.13 ms
843.66 ms
0 sec
1 $/1M tks
1 $/1M tks
54.3 tks/sec
729.08 ms
18.42 ms
17304.88 ms
0 sec
0.88 $/1M tks
0.88 $/1M tks
36.27 tks/sec
1017.39 ms
27.57 ms
29330.95 ms
0 sec
0.59 $/1M tks
0.79 $/1M tks
89.25 tks/sec
907.52 ms
11.21 ms
941.13 ms
0 sec
0.99 $/1M tks
0.99 $/1M tks
96.31 tks/sec
1524.57 ms
10.38 ms
1856.84 ms
0 sec