×

Uh-oh, it looks like your Internet Explorer is out of date.

For a better shopping experience, please upgrade now.

Hadoop in Practice
     

Hadoop in Practice

by Alex Holmes
 

See All Formats & Editions

Summary

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop.

Overview

Summary

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the Book

It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available.

Readers need to know a programming language like Java and have basic familiarity with Hadoop.

What's Inside

  • Thoroughly updated for Hadoop 2
  • How to write YARN applications
  • Integrate real-time technologies like Storm, Impala, and Spark
  • Predictive analytics using Mahout and RR
  • Readers need to know a programming language like Java and have basic familiarity with Hadoop.

About the Author

Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.

Table of Contents

    PART 1 BACKGROUND AND FUNDAMENTALS
  1. Hadoop in a heartbeat
  2. Introduction to YARN
  3. PART 2 DATA LOGISTICS
  4. Data serialization—working with text and beyond
  5. Organizing and optimizing data in HDFS
  6. Moving data into and out of Hadoop
  7. PART 3 BIG DATA PATTERNS
  8. Applying MapReduce patterns to big data
  9. Utilizing data structures and algorithms at scale
  10. Tuning, debugging, and testing
  11. PART 4 BEYOND MAPREDUCE
  12. SQL on Hadoop
  13. Writing a YARN application

Product Details

ISBN-13:
9781617292224
Publisher:
Manning Publications Company
Publication date:
10/31/2014
Pages:
512
Sales rank:
719,211
Product dimensions:
7.30(w) x 9.10(h) x 1.20(d)

Meet the Author

Alex Holmes is a software engineer, author, speaker and blogger specializing in large-scale Hadoop projects and solving tough Big Data problems. Alex blogs at grepalex.com.

Customer Reviews

Average Review:

Post to your social network

     

Most Helpful Customer Reviews

See all customer reviews