Reinforcement Learning from Human Feedback: Alignment and post-training of LLMs
Paperback
$59.99
Premium Members save an extra 10% and all Members collect stamps to save with Rewards. 10 stamps = $5.
Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.
This is the authoritative guide to reinforcement learning from human feedback (RLHF), alignment, and post-training of LLMs. In this book, author Nathan Lambert blends diverse perspectives from fields like philosophy and economics with the core mathematics and computer science of RLHF to provide a practical ...


