Vision Language Models: Building VLMs with Hugging Face

Paperback
$79.99
This item will be released on Jul 21, 2026
Vision language models (VLMs) combine computer vision and natural language processing to create systems that can interpret, generate, and respond in multimodal contexts. Vision Language Models is a hands-on guide to building real-world VLMs using the most up-to-date stack of machine learning tools from Hugging Face, Meta (PyTorch), NVIDIA (CUDA), OpenAI (CLIP), and others. It is written by leading researchers and practitioners Merve Noyan, Miquel Farré, Andrés Marafioti, and Orr Zohar....