Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS

Master the art of training vision and large language models with conceptual fundaments and industry-expert guidance. Learn about AWS services and design patterns, with relevant coding examples


Key Features


Learn to develop, train, tune, and apply foundation models with optimized end-to-end pipelines


Explore large-scale distributed training for models and datasets with AWS and SageMaker examples


Evaluate, deploy, and operationalize your custom models with bias detection and pipeline monitoring


Book Description


Foundation models have forever changed machine learning. From BERT to ChatGPT, CLIP to Stable Diffusion, when billions of parameters are combined with large datasets and hundreds to thousands of GPUs, the result is nothing short of record-breaking. The recommendations, advice, and code samples in this book will help you pretrain and fine-tune your own foundation models from scratch on AWS and Amazon SageMaker, while applying them to hundreds of use cases across your organization.


With advice from seasoned AWS and machine learning expert Emily Webber, this book helps you learn everything you need to go from project ideation to dataset preparation, training, evaluation, and deployment for large language, vision, and multimodal models. With step-by-step explanations of essential concepts and practical examples, you’ll go from mastering the concept of pretraining to preparing your dataset and model, configuring your environment, training, fine-tuning, evaluating, deploying, and optimizing your foundation models.


You will learn how to apply the scaling laws to distributing your model and dataset over multiple GPUs, remove bias, achieve high throughput, and build deployment pipelines.


By the end of this book, you’ll be well equipped to embark on your own project to pretrain and fine-tune the foundation models of the future.


What you will learn


Find the right use cases and datasets for pretraining and fine-tuning


Prepare for large-scale training with custom accelerators and GPUs


Configure environments on AWS and SageMaker to maximize performance


Select hyperparameters based on your model and constraints


Distribute your model and dataset using many types of parallelism


Avoid pitfalls with job restarts, intermittent health checks, and more


Evaluate your model with quantitative and qualitative insights


Deploy your models with runtime improvements and monitoring pipelines


Who this book is for


If you’re a machine learning researcher or enthusiast who wants to start a foundation modelling project, this book is for you. Applied scientists, data scientists, machine learning engineers, solution architects, product managers, and students will all benefit from this book. Intermediate Python is a must, along with introductory concepts of cloud computing. A strong understanding of deep learning fundamentals is needed, while advanced topics will be explained. The content covers advanced machine learning and cloud techniques, explaining them in an actionable, easy-to-understand way.

1143577132
Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS

Master the art of training vision and large language models with conceptual fundaments and industry-expert guidance. Learn about AWS services and design patterns, with relevant coding examples


Key Features


Learn to develop, train, tune, and apply foundation models with optimized end-to-end pipelines


Explore large-scale distributed training for models and datasets with AWS and SageMaker examples


Evaluate, deploy, and operationalize your custom models with bias detection and pipeline monitoring


Book Description


Foundation models have forever changed machine learning. From BERT to ChatGPT, CLIP to Stable Diffusion, when billions of parameters are combined with large datasets and hundreds to thousands of GPUs, the result is nothing short of record-breaking. The recommendations, advice, and code samples in this book will help you pretrain and fine-tune your own foundation models from scratch on AWS and Amazon SageMaker, while applying them to hundreds of use cases across your organization.


With advice from seasoned AWS and machine learning expert Emily Webber, this book helps you learn everything you need to go from project ideation to dataset preparation, training, evaluation, and deployment for large language, vision, and multimodal models. With step-by-step explanations of essential concepts and practical examples, you’ll go from mastering the concept of pretraining to preparing your dataset and model, configuring your environment, training, fine-tuning, evaluating, deploying, and optimizing your foundation models.


You will learn how to apply the scaling laws to distributing your model and dataset over multiple GPUs, remove bias, achieve high throughput, and build deployment pipelines.


By the end of this book, you’ll be well equipped to embark on your own project to pretrain and fine-tune the foundation models of the future.


What you will learn


Find the right use cases and datasets for pretraining and fine-tuning


Prepare for large-scale training with custom accelerators and GPUs


Configure environments on AWS and SageMaker to maximize performance


Select hyperparameters based on your model and constraints


Distribute your model and dataset using many types of parallelism


Avoid pitfalls with job restarts, intermittent health checks, and more


Evaluate your model with quantitative and qualitative insights


Deploy your models with runtime improvements and monitoring pipelines


Who this book is for


If you’re a machine learning researcher or enthusiast who wants to start a foundation modelling project, this book is for you. Applied scientists, data scientists, machine learning engineers, solution architects, product managers, and students will all benefit from this book. Intermediate Python is a must, along with introductory concepts of cloud computing. A strong understanding of deep learning fundamentals is needed, while advanced topics will be explained. The content covers advanced machine learning and cloud techniques, explaining them in an actionable, easy-to-understand way.

39.99 In Stock
Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS

Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS

Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS

Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS

eBook

$39.99 

Available on Compatible NOOK devices, the free NOOK App and in My Digital Library.
WANT A NOOK?  Explore Now

Related collections and offers


Overview

Master the art of training vision and large language models with conceptual fundaments and industry-expert guidance. Learn about AWS services and design patterns, with relevant coding examples


Key Features


Learn to develop, train, tune, and apply foundation models with optimized end-to-end pipelines


Explore large-scale distributed training for models and datasets with AWS and SageMaker examples


Evaluate, deploy, and operationalize your custom models with bias detection and pipeline monitoring


Book Description


Foundation models have forever changed machine learning. From BERT to ChatGPT, CLIP to Stable Diffusion, when billions of parameters are combined with large datasets and hundreds to thousands of GPUs, the result is nothing short of record-breaking. The recommendations, advice, and code samples in this book will help you pretrain and fine-tune your own foundation models from scratch on AWS and Amazon SageMaker, while applying them to hundreds of use cases across your organization.


With advice from seasoned AWS and machine learning expert Emily Webber, this book helps you learn everything you need to go from project ideation to dataset preparation, training, evaluation, and deployment for large language, vision, and multimodal models. With step-by-step explanations of essential concepts and practical examples, you’ll go from mastering the concept of pretraining to preparing your dataset and model, configuring your environment, training, fine-tuning, evaluating, deploying, and optimizing your foundation models.


You will learn how to apply the scaling laws to distributing your model and dataset over multiple GPUs, remove bias, achieve high throughput, and build deployment pipelines.


By the end of this book, you’ll be well equipped to embark on your own project to pretrain and fine-tune the foundation models of the future.


What you will learn


Find the right use cases and datasets for pretraining and fine-tuning


Prepare for large-scale training with custom accelerators and GPUs


Configure environments on AWS and SageMaker to maximize performance


Select hyperparameters based on your model and constraints


Distribute your model and dataset using many types of parallelism


Avoid pitfalls with job restarts, intermittent health checks, and more


Evaluate your model with quantitative and qualitative insights


Deploy your models with runtime improvements and monitoring pipelines


Who this book is for


If you’re a machine learning researcher or enthusiast who wants to start a foundation modelling project, this book is for you. Applied scientists, data scientists, machine learning engineers, solution architects, product managers, and students will all benefit from this book. Intermediate Python is a must, along with introductory concepts of cloud computing. A strong understanding of deep learning fundamentals is needed, while advanced topics will be explained. The content covers advanced machine learning and cloud techniques, explaining them in an actionable, easy-to-understand way.


Product Details

ISBN-13: 9781804612545
Publisher: Packt Publishing
Publication date: 05/31/2023
Sold by: Barnes & Noble
Format: eBook
Pages: 258
File size: 9 MB

About the Author

Emily Webber is a Principal Machine Learning Specialist Solutions Architect at Amazon Web Services. She has assisted hundreds of customers on their journey to ML in the cloud, specializing in distributed training for large language and vision models. She mentors Machine Learning Solution Architects, authors countless feature designs for SageMaker and AWS, and guides the Amazon SageMaker product and engineering teams on best practices in regards around machine learning and customers. Emily is widely known in the AWS community for a 16-video YouTube series featuring SageMaker with 160,000 views, plus a Keynote at O’Reilly AI London 2019 on a novel reinforcement learning approach she developed for public policy.

Table of Contents

Table of Contents
  1. An Introduction to Pretraining Foundation Models
  2. Dataset Preparation: Part One
  3. Model Preparation
  4. Containers and Accelerators on the Cloud
  5. Distribution Fundamentals
  6. Dataset Preparation: Part Two, the Data Loader
  7. Finding the Right Hyperparameters
  8. Large-Scale Training on SageMaker
  9. Advanced Training Concepts
  10. Fine-Tuning and Evaluating
  11. Detecting, Mitigating, and Monitoring Bias
  12. How to Deploy Your Model
  13. Prompt Engineering
  14. MLOps for Vision and Language
  15. Future Trends in Pretraining Foundation Models
From the B&N Reads Blog

Customer Reviews