Developers
Learn
Company
Sign in
Chat
Chat with and directly compare LLM endpoints
Benchmarks
Compare LLM endpoints with live performance benchmarks
Documentation
Learn how to use the Unify API
Blog
Read about LLM deployment infrastructure
Newsletter
Stay up to date with the latest in AI
Paper Readings
Join our discussions around cuttin-edge AI research
Talks
Dive deep with us into the AI landscape
Careers
Join our team and let’s Unify AI!
Contact
Reach out to our team
Privacy & Cookies
How we treat your navigation data
Terms Of Service
General requirements for using our Service
Socials
Follow us through our social accounts:
Sign in
Chat
Chat with and directly compare LLM endpoints
Benchmarks
Compare LLM endpoints with live performance benchmarks
Documentation
Learn how to use the Unify API
Blog
Read about LLM deployment infrastructure
Newsletter
Stay up to date with the latest in AI
Paper Readings
Join our discussions around cuttin-edge AI research
Talks
Dive deep with us into the AI landscape
Careers
Join our team and let’s Unify AI!
Contact
Reach out to our team
Privacy & Cookies
How we treat your navigation data
Terms Of Service
General requirements for using our Service
Socials
Follow us through our social accounts:
Back to Benchmarks
mixtral-8x7b-instruct-v0.1
text-generation
Uploaded: 09.01.2024
⏱️ Benchmarks
📑 API Docs
✨ Query this model
Region:
Hong Kong
Belgium
Iowa
Seq Length:
Short
Long
Providers
together-ai
octoai
replicate
mistral-ai
perplexity-ai
anyscale
fireworks-ai
lepton-ai
deepinfra
aws-bedrock
Learn more about how we are collecting this data
here
Input Cost
Output Cost
Output Tks / Sec
P
90
_{P90}
P
9
0
TTFT
P
90
_{P90}
P
9
0
ITL
P
90
_{P90}
P
9
0
E2E Latency
P
90
_{P90}
P
9
0
Cold Start
0.6 $/1M tks
0.6 $/1M tks
238.88 tks/sec
542.26 ms
4.19 ms
1212.05 ms
0 sec
0.3 $/1M tks
0.5 $/1M tks
75 tks/sec
720.9 ms
13.33 ms
2840.93 ms
0 sec
0.3 $/1M tks
1 $/1M tks
278.45 tks/sec
877.19 ms
3.59 ms
1426.66 ms
0 sec
0.7 $/1M tks
0.7 $/1M tks
106.16 tks/sec
311.38 ms
9.42 ms
2035.14 ms
0 sec
Not computed
Not computed
Not computed
Not computed
Not computed
Not computed
Not computed
0.5 $/1M tks
0.5 $/1M tks
61.08 tks/sec
1181.47 ms
16.37 ms
3964.5 ms
0 sec
0.5 $/1M tks
0.5 $/1M tks
325.63 tks/sec
453.45 ms
3.07 ms
1079.94 ms
0 sec
0.5 $/1M tks
0.5 $/1M tks
143.74 tks/sec
871.24 ms
6.96 ms
2019.11 ms
0 sec
0.27 $/1M tks
0.27 $/1M tks
87.06 tks/sec
585.07 ms
11.49 ms
2721.51 ms
0 sec
0.45 $/1M tks
0.7 $/1M tks
66.51 tks/sec
713.96 ms
15.03 ms
3435.29 ms
0 sec