Premium & Rewards Members Earn Double Stamps Shop Now Ends 7/5Premium & Rewards Members Earn Double Stamps Shop Now Ends 7/5

Hands-On LLM Serving and Optimization: Hosting LLMs at Scale

Paperback
$79.99

Premium Members get an additional 10% off now through 07/05/26, Premium & Rewards Members Earn Double Stamps! 10 stamps = $5 reward.

Promotion message icon
Premium Members save an extra 10% and all Members collect stamps to save with Rewards. 10 stamps = $5.Learn More
In stock
This item is currently out of stock online.
Free standard shipping on orders over $60
Select a store to view item availability.
Large language models (LLMs) are rapidly becoming the backbone of AI-driven applications. Without proper optimization, however, LLMs can be expensive to run, slow to serve, and prone to performance bottlenecks. As the demand for real-time AI applications grows, along comes Hands-On Serving and Optimizing LLM Models, a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.

In this hands-on book, authors Chi Wang and Peiheng Hu take a real-world approach backed by ...