×

Uh-oh, it looks like your Internet Explorer is out of date.

For a better shopping experience, please upgrade now.

Hadoop Operations: A Guide for Developers and Administrators
  • Alternative view 1 of Hadoop Operations: A Guide for Developers and Administrators
  • Alternative view 2 of Hadoop Operations: A Guide for Developers and Administrators
     

Hadoop Operations: A Guide for Developers and Administrators

5.0 1
by Eric Sammer
 

See All Formats & Editions

If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in

Overview

If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.

Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical deployments.

  • Get a high-level overview of HDFS and MapReduce: why they exist and how they work
  • Plan a Hadoop deployment, from hardware and OS selection to network requirements
  • Learn setup and configuration details with a list of critical properties
  • Manage resources by sharing a cluster across multiple groups
  • Get a runbook of the most common cluster maintenance tasks
  • Monitor Hadoop clusters—and learn troubleshooting with the help of real-world war stories
  • Use basic tools and techniques to handle backup and catastrophic failure

Product Details

ISBN-13:
9781449327293
Publisher:
O'Reilly Media, Incorporated
Publication date:
09/26/2012
Sold by:
Barnes & Noble
Format:
NOOK Book
Pages:
298
File size:
6 MB

Meet the Author

Eric Sammer is currently a Principal Solution Architect at Cloudera where he helps customers plan, deploy, develop for, and use Hadoop and the related projects at scale. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He's been involved in the open source community and has contributed to a large number of projects over the last decade.

Customer Reviews

Average Review:

Post to your social network

     

Most Helpful Customer Reviews

See all customer reviews

Hadoop Operations 5 out of 5 based on 0 ratings. 1 reviews.
FantasyRider More than 1 year ago
Excellent Technical Book for Operations Hadoop Operations by Eric Sammer (O'Reilly Media) is a thoughtfully organized book that guides the operational and architectural reader into a viable Hadoop-centric solution. In his book, Sammer spends a reasonable amount of time providing the reader with enough Hadoop background to be able to move onto the more complex considerations and actions needed to implement high quality Hadoop clusters in an operations environment. Sammer provides some very specific information in his books that puts it into my “must have” collection for Hadoop. First, instead of trying to cover what Hadoop can do in all flavors and colors, Sammer describes configurations that will meet the needs of a general operational implementation. This allows the reader to focus on the key concepts of installing, configuring, and operating a Hadoop cluster instead of learning the many Hadoop features that most shops will never use. Secondly, Sammer spends an appropriate amount of time discussing ways that an operational team can monitor and troubleshoot Hadoop clusters. Very few authors cover the areas needed so that a solution can move from “proof of concept” into a “production-level” implementation. Third, Sammer looks at products that work around Hadoop to either add features or allow for better maintainability/management of the system. This gives the reader the ability to see how Hadoop fits into the larger operational model. Finally, Sammer approaches the chapters in the book from the view of someone that has actually implemented Hadoop clusters by providing suggestion, tips, and tricks that allow the reader to bypass many of the more common challenges that Hadoop adopters can face. I highly recommend Hadoop Operations by Eric Sammer for the operational and architectural readers that want to get a highly viable solution as soon as possible.