Building Generative AI Services with FastAPI: A Practical Approach to Developing Context Rich Generative AI Applications
Ready to build production-grade applications with generative AI? This practical guide takes you through designing and deploying AI services using the FastAPI web framework. Learn how to integrate models that process text, images, audio, and video while seamlessly interacting with databases, filesystems, websites, and APIs. Whether you're a web developer, data scientist, or DevOps engineer, this book equips you with the tools to build scalable, real-time AI applications.

Author Alireza Parandeh provides clear explanations and hands-on examples covering authentication, concurrency, caching, and retrieval-augmented generation (RAG) with vector databases. You'll also explore best practices for testing AI outputs, optimizing performance, and securing microservices. With containerized deployment using Docker, you'll be ready to launch AI-powered applications confidently in the cloud.

  • Build generative AI services that interact with databases, filesystems, websites, and APIs

  • Manage concurrency in AI workloads and handle long-running tasks

  • Stream AI-generated outputs in real time via WebSocket and server-sent events

  • Secure services with authentication, content filtering, throttling, and rate limiting

  • Optimize AI performance with caching, batch processing, and fine-tuning techniques

Visit the Book's Website.

1146455312
Building Generative AI Services with FastAPI: A Practical Approach to Developing Context Rich Generative AI Applications
Ready to build production-grade applications with generative AI? This practical guide takes you through designing and deploying AI services using the FastAPI web framework. Learn how to integrate models that process text, images, audio, and video while seamlessly interacting with databases, filesystems, websites, and APIs. Whether you're a web developer, data scientist, or DevOps engineer, this book equips you with the tools to build scalable, real-time AI applications.

Author Alireza Parandeh provides clear explanations and hands-on examples covering authentication, concurrency, caching, and retrieval-augmented generation (RAG) with vector databases. You'll also explore best practices for testing AI outputs, optimizing performance, and securing microservices. With containerized deployment using Docker, you'll be ready to launch AI-powered applications confidently in the cloud.

  • Build generative AI services that interact with databases, filesystems, websites, and APIs

  • Manage concurrency in AI workloads and handle long-running tasks

  • Stream AI-generated outputs in real time via WebSocket and server-sent events

  • Secure services with authentication, content filtering, throttling, and rate limiting

  • Optimize AI performance with caching, batch processing, and fine-tuning techniques

Visit the Book's Website.

59.99 In Stock
Building Generative AI Services with FastAPI: A Practical Approach to Developing Context Rich Generative AI Applications

Building Generative AI Services with FastAPI: A Practical Approach to Developing Context Rich Generative AI Applications

by Ali Parandeh
Building Generative AI Services with FastAPI: A Practical Approach to Developing Context Rich Generative AI Applications

Building Generative AI Services with FastAPI: A Practical Approach to Developing Context Rich Generative AI Applications

by Ali Parandeh

Paperback

$59.99 
  • SHIP THIS ITEM
    In stock. Ships in 1-2 days.
  • PICK UP IN STORE

    Your local store may have stock of this item.

Related collections and offers


Overview

Ready to build production-grade applications with generative AI? This practical guide takes you through designing and deploying AI services using the FastAPI web framework. Learn how to integrate models that process text, images, audio, and video while seamlessly interacting with databases, filesystems, websites, and APIs. Whether you're a web developer, data scientist, or DevOps engineer, this book equips you with the tools to build scalable, real-time AI applications.

Author Alireza Parandeh provides clear explanations and hands-on examples covering authentication, concurrency, caching, and retrieval-augmented generation (RAG) with vector databases. You'll also explore best practices for testing AI outputs, optimizing performance, and securing microservices. With containerized deployment using Docker, you'll be ready to launch AI-powered applications confidently in the cloud.

  • Build generative AI services that interact with databases, filesystems, websites, and APIs

  • Manage concurrency in AI workloads and handle long-running tasks

  • Stream AI-generated outputs in real time via WebSocket and server-sent events

  • Secure services with authentication, content filtering, throttling, and rate limiting

  • Optimize AI performance with caching, batch processing, and fine-tuning techniques

Visit the Book's Website.


Product Details

ISBN-13: 9781098160302
Publisher: O'Reilly Media, Incorporated
Publication date: 06/03/2025
Pages: 400
Product dimensions: 7.00(w) x 9.19(h) x 0.00(d)

About the Author

Alireza Parandeh is a chartered engineer (CEng) with the UK engineering council, a Microsoft and Google Certified Developer, Data Engineer and Data Scientist. He has a strong background in web development, data science and machine learning having led engineering teams at large multinational consultancies and tech startups in London. Ali's portfolio of clients include Network Rail, High-Speed Train 2, Transport for London, International Fertilizer's Association and the Department for Transport.

As a passionate educator, Ali dedicates his free time to teaching data science and web development through meetups and online platforms. In 2019, he founded London's Beginners Machine Learning (BML) group, a Microsoft-sponsored meetup aimed at helping professionals break into the field of Data Science & AI and obtain cloud certifications which has since grown to over 1,500 members.

From the B&N Reads Blog

Customer Reviews