Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data

by Dayong Du

eBook

$25.99 


Overview

This book takes you on a fantastic journey to discover the attributes of big data using Apache Hive.


Key Features

- Grasp the skills needed to write efficient Hive queries to analyze big data
- Discover how Hive can coexist and work with other tools within the Hadoop ecosystem
- Use practical, example-oriented scenarios to cover all the newly released features of Apache Hive 2.3.3

Book Description


In this book, we prepare you for your journey into big data by first introducing you to the background of the big data domain, along with the process of setting up and getting familiar with your Hive working environment.


Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language efficiently. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.


By the end of the book, you will be familiar with Hive and able to work efficiently to find solutions to big data problems.


What you will learn

- Create and set up the Hive environment
- Discover how to use Hive's definition language to describe data
- Discover interesting data by joining and filtering datasets in Hive
- Transform data by using Hive sorting, ordering, and functions
- Aggregate and sample data in different ways
- Boost Hive query performance and enhance data security in Hive
- Customize Hive to your needs by using user-defined functions and integrate it with other tools

Who this book is for


If you are a data analyst, developer, or simply someone who wants to quickly get started with Hive to explore and analyze big data in Hadoop, this is the book for you. Since Hive's query language is SQL-like, some previous experience with SQL will help you get the most out of this book.



Product Details

ISBN-13: 9781789136517
Publisher: Packt Publishing
Publication date: 06/30/2018
Sold by: Barnes & Noble
Format: eBook
Pages: 210
File size: 3 MB

About the Author

Dayong Du is a big data practitioner, author, and coach with over 10 years' experience in technology consulting, designing, and implementing enterprise big data architecture and analytics in various industries, including finance, media, travel, and telecoms. He has a master's degree in computer science from Dalhousie University and is a Cloudera certified Hadoop developer. He is a cofounder of the Toronto Big Data Professional Association and the founder of the DataFiber website.

Table of Contents

  1. Overview of Big Data and Hive
  2. Setting Up the Hive Environment
  3. Data Definition and Description
  4. Data Correlation and Scope
  5. Data Manipulation
  6. Data Aggregation and Sampling
  7. Extensibility Considerations
  8. Working with Other Tools
  9. Performance Considerations
  10. Security Considerations