Developers
Learn
Company
Chat
Chat with and directly compare LLM endpoints
Benchmarks
Compare LLM endpoints with live performance benchmarks
Documentation
Learn how to use the Unify API
Blog
Read about LLM deployment infrastructure
Newsletter
Stay up to date with the latest in AI
Paper Readings
Join our discussions around cuttin-edge AI research
Talks
Dive deep with us into the AI landscape
Careers
Join our team and letβs Unify AI!
Contact
Reach out to our team
Privacy & Cookies
How we treat your navigation data
Terms Of Service
General requirements for using our Service
Socials
Follow us through our social accounts:
Chat
Chat with and directly compare LLM endpoints
Benchmarks
Compare LLM endpoints with live performance benchmarks
Documentation
Learn how to use the Unify API
Blog
Read about LLM deployment infrastructure
Newsletter
Stay up to date with the latest in AI
Paper Readings
Join our discussions around cuttin-edge AI research
Talks
Dive deep with us into the AI landscape
Careers
Join our team and letβs Unify AI!
Contact
Reach out to our team
Privacy & Cookies
How we treat your navigation data
Terms Of Service
General requirements for using our Service
Socials
Follow us through our social accounts:
43 sets
Contribute
Category
compilers
compression
hardware
serving
supported-hardware
eco-system
compilers
mlir
inference-optimizer
llvm
Show hot cards π₯
Name - Ascending
Name - Descending
Sort by name...
π₯
TensorRT-LLM
apache-2.0
compilers
inference-optimizer
llm
π₯
SHARK
amd
apache-2.0
compilers
mlir
π₯
tinygrad
compilers
framework
mit
π₯
streaming-llm
compilers
framework
inference-optimizer
mit
π₯
MLIR
apache-2.0
compilers
llvm
mlir
π₯
MLC-LLM
apache-2.0
compilers
llm
π₯
IREE
apache-2.0
compilers
edge
mlir
π₯
BigDL
apache-2.0
compression
distillation
intel
π₯
CoreML
compression
palettization
proprietary
pruning
π₯
gptq
apache-2.0
compression
quantization
π₯
TensorLy
bsd-3-clause
compression
jax
mxnet
π₯
Torch-Pruning
compression
mit
open-source
pruning
π₯
only_train_once
compression
mit
open-source
pruning
π₯
bitsandbytes
compression
mit
quantization
π₯
TensorLy-Torch
bsd-3-clause
compression
pytorch
tensorization
π₯
Built In Pytorch Compression
bsd-3-clause
compression
pruning
quantization
π₯
Neural Compressor
apache-2.0
compression
distillation
mxnet
π₯
Pruna-AI
compression
distillation
proprietary
pruning
1
2
3