Hadoop in Action

Hadoop in Action

by Chuck Lam
     
 

View All Available Formats & Editions

The massive datasets required for most modern businesses are too large to safely store and efficiently process on a single server. Hadoop is an open source data processing framework that provides a distributed file system that can manage data stored across clusters of servers and implements the MapReduce data processing model so that users can effectively query and

Overview

The massive datasets required for most modern businesses are too large to safely store and efficiently process on a single server. Hadoop is an open source data processing framework that provides a distributed file system that can manage data stored across clusters of servers and implements the MapReduce data processing model so that users can effectively query and utilize big data. The new Hadoop 2.0 is a stable, enterprise-ready platform supported by a rich ecosystem of tools and related technologies such as Pig, Hive, YARN, Spark, Tez, and many more.

Hadoop in Action, Second Edition, provides a comprehensive introduction to Hadoop and shows how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show how Hadoop can be used in more complex data analysis tasks. It covers how YARN, new in Hadoop 2, simplifies and supercharges resource management to make streaming and real-time applications more feasible. Included are best practices and design patterns of MapReduce programming. The book expands on the first edition by enhancing coverage of important Hadoop 2 concepts and systems, and by providing new chapters on data management and data science that reinforce a practical understanding of Hadoop.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

Product Details

ISBN-13:
9781935182191
Publisher:
Manning Publications Company
Publication date:
12/15/2010
Pages:
325
Sales rank:
597,344
Product dimensions:
7.30(w) x 9.10(h) x 0.80(d)

Meet the Author

Chuck Lam has been working with Hadoop since its earliest days. He is a serial startup veteran and the original author of Hadoop in Action.

Mark Davis have been working with Hadoop since its earliest days. He founded the Hadoop analytics company, Kitenga and is now a Distinguished Big Data Analytics Engineer for Dell and the Big Data Lead for the IEEE Cloud Computing Initiative.

Ajit Gaddam is a technologist, serial entrepreneur, and an information security expert. He is a frequent speaker at high-profile conferences and is an active participant in various open source and security architecture standards bodies.

Customer Reviews

Average Review:

Write a Review

and post it to your social network

     

Most Helpful Customer Reviews

See all customer reviews >