Perfect for beginners, this book's approach will also appeal to experienced practitioners who want to brush up on their skills. Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data. As you work through several exercises, you'll also learn how to use Apache Pig to process data.
- Learn the necessary mechanics of working with Hadoop, including how data and computation move around the cluster
- Dive into map/reduce mechanics and build your first map/reduce job in Python
- Understand how to run chains of map/reduce jobs in the form of Pig scripts
- Use a real-world datasetbaseball performance statisticsthroughout the book
- Work with examples of several analytic patterns, and learn when and where you might use them
Perfect for beginners, this book's approach will also appeal to experienced practitioners who want to brush up on their skills. Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data. As you work through several exercises, you'll also learn how to use Apache Pig to process data.
- Learn the necessary mechanics of working with Hadoop, including how data and computation move around the cluster
- Dive into map/reduce mechanics and build your first map/reduce job in Python
- Understand how to run chains of map/reduce jobs in the form of Pig scripts
- Use a real-world datasetbaseball performance statisticsthroughout the book
- Work with examples of several analytic patterns, and learn when and where you might use them

Big Data for Chimps: A Guide to Massive-Scale Data Processing in Practice
217
Big Data for Chimps: A Guide to Massive-Scale Data Processing in Practice
217Product Details
ISBN-13: | 9781491923948 |
---|---|
Publisher: | O'Reilly Media, Incorporated |
Publication date: | 10/22/2015 |
Pages: | 217 |
Product dimensions: | 6.90(w) x 9.10(h) x 0.60(d) |