Oracle Big Data Handbook

by Tom Plunkett, Brian Macdonald, Bruce Nelson, Mark Hornick
     
 


Overview

Transform Big Data into Insight

"In this book, some of Oracle's best engineers and architects explain how you can make use of big data. They'll tell you how you can integrate your existing Oracle solutions with big data systems, using each where appropriate and moving data between them as needed." -- Doug Cutting, co-creator of Apache Hadoop

Cowritten by members of Oracle's big data team, Oracle Big Data Handbook provides complete coverage of Oracle's comprehensive, integrated set of products for acquiring, organizing, analyzing, and leveraging unstructured data. The book discusses the strategies and technologies essential for a successful big data implementation, including Apache Hadoop, Oracle Big Data Appliance, Oracle Big Data Connectors, Oracle NoSQL Database, Oracle Endeca, Oracle Advanced Analytics, and Oracle's open source R offerings. Best practices for migrating from legacy systems and integrating existing data warehousing and analytics solutions into an enterprise big data infrastructure are also included in this Oracle Press guide.

  • Understand the value of a comprehensive big data strategy
  • Maximize the distributed processing power of the Apache Hadoop platform
  • Discover the advantages of using Oracle Big Data Appliance as an engineered system for Hadoop and
    Oracle NoSQL Database
  • Configure, deploy, and monitor Hadoop and Oracle NoSQL Database using Oracle Big Data Appliance
  • Integrate your existing data warehousing and analytics infrastructure into a big data architecture
  • Share data among Hadoop and relational databases using Oracle Big Data Connectors
  • Understand how Oracle NoSQL Database integrates into the Oracle Big Data architecture
  • Deliver faster time to value using in-database analytics
  • Analyze data with Oracle Advanced Analytics (Oracle R Enterprise and Oracle Data Mining), Oracle R Distribution, ROracle, and Oracle R Connector for Hadoop
  • Analyze disparate data with Oracle Endeca Information Discovery
  • Plan and implement a big data governance strategy and develop an architecture and roadmap

Product Details

ISBN-13:
9780071827263
Publisher:
McGraw-Hill Professional Publishing
Publication date:
09/04/2013
Pages:
464
Sales rank:
1,122,753
Product dimensions:
7.30(w) x 9.00(h) x 1.00(d)

Meet the Author

Tom Plunkett is a Senior Sales Consultant with Oracle. Tom also teaches graduate-level computer science courses for Virginia Tech as an adjunct instructor and distance learning instructor. Tom helped win several industry honors for a big data project in which Oracle and the Frederick National Laboratory for Cancer Research collaborated to analyze relationships between genomes and cancer subtypes: the project won the 2012 Government Big Data Solution Award, was an ACT-IAC finalist for best pilot/start-up project, and was nominated for a 2013 Computer World Honor Award for Innovation. Tom has spoken internationally at over 40 conferences on the subject of Big Data since leading a team that won a Big Data project from the Office of the Secretary of Defense in 2009. Tom is the lead author of several books, including Oracle Big Data Handbook and Oracle Exalogic Elastic Cloud Handbook. Previously, Tom worked for IBM and practiced patent law for Fliesler Meyer. Tom has a BA and a JD from George Mason University, and an MS in computer science from Virginia Tech.

Brian Macdonald is a Distinguished Solution Consultant and certified Oracle Enterprise Architect with Oracle. He has more than 20 years of experience creating architectures and implementing analytic platforms to address a wide range of customer needs including data warehousing, business intelligence, OLAP, Hadoop, Master Data Management, and ETL technologies.

Bruce Nelson is the Oracle Big Data lead for the Western U.S. and has more than 24 years of experience in the IT industry with a focus on Hadoop, NoSQL, Oracle Database, Oracle RAC, and Oracle Exadata.

Mark Hornick is a Director in the Oracle Database Advanced Analytics group focusing on Oracle R Enterprise (ORE), Oracle R Connector for Hadoop (ORCH), and Oracle R Distribution (ORD). He also works with internal and external customers on applying R to scalable applications in Oracle Database, Exadata, and the Big Data Appliance, and on SAS-to-R conversion and performance benchmarking. Mark is co-author of Java Data Mining: Strategy, Standard, and Practice. He joined Oracle’s Data Mining Technologies group in 1999 through the acquisition of Thinking Machines Corp. Mark was a founding member of and currently serves as an Oracle Advisor to the IOUG Business Intelligence Warehousing and Analytics (BIWA) SIG. He has conducted training sessions on R, ORE, and ORCH in the US, EMEA, and APAC, and has presented at conferences including Oracle OpenWorld, Collaborate, BIWA Summit, and the R user conference useR! Mark holds a bachelor’s degree from Rutgers University and a master’s degree from Brown University, both in computer science.

Customer Reviews


Most Helpful Customer Reviews


The Oracle Big Data Handbook 5 out of 5 based on 0 ratings. 2 reviews.
Anonymous More than 1 year ago
Big data management is a hot topic today, because big data delivers tremendous new value to the enterprise, across a wide variety of industries and verticals. Though the value of big data is now well accepted and understood, big data processing is still in its infancy. There is still a lot to learn; that's why I believe this handbook is so timely and useful. A big data practitioner will benefit from the complete and detailed descriptions and excellent explanations of various components and the tradeoffs of cost, design complexity, programming paradigm, and how it all fits together. I have been associated with and worked at Oracle for a long time. Besides the excellent technology that Oracle produces, I continue to be amazed and impressed by Oracle's ability to repeatedly adapt to new trends and embrace new concepts and technologies to deliver compelling data management solutions to the market. This is perhaps the most important reason for Oracle's enduring success in the market. Perhaps, this is also the most important reason why anyone interested in big data should read this book. If you are a user of Oracle technologies, you'll learn about how to extend your Oracle investment to encompass big data processing. If you're not an Oracle user, you might discover important technologies and approaches to enhance your big data projects. In either case, you will benefit! This book is about Oracle's big data offerings. As usual, Oracle has delivered a well thought-out, well integrated and comprehensive solution to the market and the authors have done a stellar job of describing Oracle's big data solutions. Processing big data involves several complementary technologies, each requiring different skills and expertise. This is partly reflected in the long list of authors for this book - no single person could have done justice to the topic of big data. 
Each author has done a masterful job of describing the product or topic in his area of expertise and describing how these components relate to each other. The book is an excellent resource for a very relevant, present-day topic. Though big data encompasses a lot of complementary technologies, the authors have done a great job of organizing the material into an introductory section, followed by chapters that discuss the important technology components individually and in depth, followed by a multi-chapter discussion of how to use these technologies to analyze data in various ways. The last section of the book covers important issues like privacy, security and big data governance. The book is organized so that it can be read from beginning to end to get a complete and comprehensive understanding of Oracle's big data offerings; on the other hand, each chapter is relatively self-contained and can be read "standalone" if the reader chooses. I liked the big data appliance chapter because it highlights the benefits of pre-built, tested, optimized and production-ready engineered systems for big data processing. In my opinion, big data processing is a complex topic along multiple dimensions, including data processing expertise, server configuration and management, and optimizing data flows and computation in a distributed environment. The big data appliance minimizes the headaches associated with building and maintaining servers, which can be a great help in getting a big data project up and running quickly and efficiently. What sets big data processing apart from other kinds of processing is the variety, volume and velocity of data as well as the plethora of analytic techniques needed in order to derive valuable information. The book describes the various data management platforms available, as well as the different kinds of analyses that make up big data processing, including discovery analytics, text and data analytics, spatial analytics and graph analytics.
The book also outlines how one might go about using big data processing in an enterprise, starting with collecting data from diverse sources, doing "experiments" to understand how that data might be used, and then using those results to drive business decisions. Of course, this is an iterative process, where each iteration results in new understanding of the data. Written by a team of experts, this book provides a comprehensive and complete guide to the big data practitioner. Definitely a must read for anyone serious about big data.
Anonymous More than 1 year ago
Excellent intro and how-to for a hot topic. Demystifies big data and helps readers understand how to apply it to their business, across many industries. Nice job!