Each chapter introduces a different topic—such as core technologies or data transfer—and explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, you’ll have a good grasp of the playing field.
Topics include:
- Core technologies—Hadoop Distributed File System (HDFS), MapReduce, YARN, and Spark
- Database and data management—Cassandra, HBase, MongoDB, and Hive
- Serialization—Avro, JSON, and Parquet
- Management and monitoring—Puppet, Chef, Zookeeper, and Oozie
- Analytic helpers—Pig, Mahout, and MLLib
- Data transfer—Scoop, Flume, distcp, and Storm
- Security, access control, auditing—Sentry, Kerberos, and Knox
- Cloud computing and virtualization—Serengeti, Docker, and Whirr
Each chapter introduces a different topic—such as core technologies or data transfer—and explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, you’ll have a good grasp of the playing field.
Topics include:
- Core technologies—Hadoop Distributed File System (HDFS), MapReduce, YARN, and Spark
- Database and data management—Cassandra, HBase, MongoDB, and Hive
- Serialization—Avro, JSON, and Parquet
- Management and monitoring—Puppet, Chef, Zookeeper, and Oozie
- Analytic helpers—Pig, Mahout, and MLLib
- Data transfer—Scoop, Flume, distcp, and Storm
- Security, access control, auditing—Sentry, Kerberos, and Knox
- Cloud computing and virtualization—Serengeti, Docker, and Whirr
Field Guide to Hadoop: An Introduction to Hadoop, Its Ecosystem, and Aligned Technologies
130Field Guide to Hadoop: An Introduction to Hadoop, Its Ecosystem, and Aligned Technologies
130Product Details
ISBN-13: | 9781491947937 |
---|---|
Publisher: | O'Reilly Media, Incorporated |
Publication date: | 03/23/2015 |
Pages: | 130 |
Product dimensions: | 6.00(w) x 8.80(h) x 0.40(d) |