Compression

Tensorization: Breaking Through The Ranks

Tensorization is a model compression technique that breaks down weight tensors of deep neural networks into smaller, lower rank tensors to reveal underlying patterns and reduce...

Compiler

Compilers: Talking to The Hardware

One of the key drivers behind the rapid expansion in machine learning growth is the technological progress made in the development of...

Model Compression: A Survey of Techniques, Tools, and Libraries‍

Machine learning has witnessed a surge in interest in recent years driven by several factors. including the availability of large datasets, advancements in transfer learning...

Model Serving: A Multi-Layered Landscape

Machine learning (ML) models are becoming more complex and data-intensive, requiring more expensive hardware and infrastructure to train and run. As a result...

Guillermo Sanchez-Brizuela & Shyngyskhan Abilkassov

December 22, 2023

10 min read

Engineering

AI Deployment: A Cambrian Explosion

AI has been in an ever expanding renaissance for the past decade, with the success of AlexNet being a pivotal moment which sparked a new wave of deep learning research in 2012. Since then, there have been several step change moments, such as the success of reinforcement learning algorithms on...

Daniel Lenton

November 4, 2023

10 min read