Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools
Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.



What You Will Learn:

• Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5

• Run a MapReduce job

• Store data with Apache Hive, and Apache HBase

• Index data in HDFS with Apache Solr

• Develop a Kafka messaging system

• Stream Logs to HDFS with Apache Flume

• Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop

• Create a Hive table over Apache Solr

• Develop a Mahout User Recommender System

Who This Book Is For:

Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.
1133118488
Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools
Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.



What You Will Learn:

• Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5

• Run a MapReduce job

• Store data with Apache Hive, and Apache HBase

• Index data in HDFS with Apache Solr

• Develop a Kafka messaging system

• Stream Logs to HDFS with Apache Flume

• Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop

• Create a Hive table over Apache Solr

• Develop a Mahout User Recommender System

Who This Book Is For:

Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.
54.99 In Stock
Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools

Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools

by Deepak Vohra
Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools

Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools

by Deepak Vohra

Paperback(1st ed.)

$54.99 
  • SHIP THIS ITEM
    In stock. Ships in 1-2 days.
  • PICK UP IN STORE

    Your local store may have stock of this item.

Related collections and offers


Overview

Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.



What You Will Learn:

• Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5

• Run a MapReduce job

• Store data with Apache Hive, and Apache HBase

• Index data in HDFS with Apache Solr

• Develop a Kafka messaging system

• Stream Logs to HDFS with Apache Flume

• Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop

• Create a Hive table over Apache Solr

• Develop a Mahout User Recommender System

Who This Book Is For:

Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.

Product Details

ISBN-13: 9781484221983
Publisher: Apress
Publication date: 10/01/2016
Edition description: 1st ed.
Pages: 421
Product dimensions: 6.90(w) x 9.80(h) x 1.10(d)

About the Author

Deepak Vohra is a coder, developer, programmer, book author, and technical reviewer.

Table of Contents

Part I. Fundamentals.- Introduction.- 1. HDFS and MapReduce.- Part II Storing & Querying.- 2. Apache Hive.- 3. Apache HBase.- Part III Bulk Transferring & Streaming.- 4. Apache Sqoop.- 5. Apache Flume.- Part IV Serializing.- 6. Apache Avro.- 7. Apache Parquet.- Part V Messaging & Indexing.- 8. Apache Kafka.- 9. Apache Solr.- 10.Apache Mahout.

From the B&N Reads Blog

Customer Reviews