Building the Unstructured Data Warehouse: Architecture, Analysis, and Design

Learn essential techniques from data warehouse legend Bill Inmon on how to build the reporting environment your business needs now!

Answers for many valuable business questions hide in text. How well can your existing reporting environment extract the necessary text from email, spreadsheets, and documents, and put it in a useful format for analytics and reporting? Transforming the traditional data warehouse into an efficient unstructured data warehouse requires additional skills from the analyst, architect, designer, and developer. This book will prepare you to successfully implement an unstructured data warehouse and, through clear explanations, examples, and case studies, you will learn new techniques and tips to successfully obtain and analyze text.

Master these ten objectives:

  • Build an unstructured data warehouse using the 11-step approach
  • Integrate text and describe it in terms of homogeneity, relevance, medium, volume, and structure
  • Overcome challenges including blather, the Tower of Babel, and lack of natural relationships
  • Avoid the Data Junkyard and combat the “Spider’s Web”
  • Reuse techniques perfected in the traditional data warehouse and Data Warehouse 2.0,including iterative development
  • Apply essential techniques for textual Extract, Transform, and Load (ETL) such as phrase recognition, stop word filtering, and synonym replacement
  • Design the Document Inventory system and link unstructured text to structured data
  • Leverage indexes for efficient text analysis and taxonomies for useful external categorization
  • Manage large volumes of data using advanced techniques such as backward pointers
  • Evaluate technology choices suitable for unstructured data processing, such as data warehouse appliances

1028403297
Building the Unstructured Data Warehouse: Architecture, Analysis, and Design

Learn essential techniques from data warehouse legend Bill Inmon on how to build the reporting environment your business needs now!

Answers for many valuable business questions hide in text. How well can your existing reporting environment extract the necessary text from email, spreadsheets, and documents, and put it in a useful format for analytics and reporting? Transforming the traditional data warehouse into an efficient unstructured data warehouse requires additional skills from the analyst, architect, designer, and developer. This book will prepare you to successfully implement an unstructured data warehouse and, through clear explanations, examples, and case studies, you will learn new techniques and tips to successfully obtain and analyze text.

Master these ten objectives:

  • Build an unstructured data warehouse using the 11-step approach
  • Integrate text and describe it in terms of homogeneity, relevance, medium, volume, and structure
  • Overcome challenges including blather, the Tower of Babel, and lack of natural relationships
  • Avoid the Data Junkyard and combat the “Spider’s Web”
  • Reuse techniques perfected in the traditional data warehouse and Data Warehouse 2.0,including iterative development
  • Apply essential techniques for textual Extract, Transform, and Load (ETL) such as phrase recognition, stop word filtering, and synonym replacement
  • Design the Document Inventory system and link unstructured text to structured data
  • Leverage indexes for efficient text analysis and taxonomies for useful external categorization
  • Manage large volumes of data using advanced techniques such as backward pointers
  • Evaluate technology choices suitable for unstructured data processing, such as data warehouse appliances

44.95 In Stock
Building the Unstructured Data Warehouse: Architecture, Analysis, and Design

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design

Building the Unstructured Data Warehouse: Architecture, Analysis, and Design

Paperback(TECHNICS PUBLICATIONS LLC)

$44.95 
  • SHIP THIS ITEM
    In stock. Ships in 1-2 days.
  • PICK UP IN STORE

    Your local store may have stock of this item.

Related collections and offers


Overview

Learn essential techniques from data warehouse legend Bill Inmon on how to build the reporting environment your business needs now!

Answers for many valuable business questions hide in text. How well can your existing reporting environment extract the necessary text from email, spreadsheets, and documents, and put it in a useful format for analytics and reporting? Transforming the traditional data warehouse into an efficient unstructured data warehouse requires additional skills from the analyst, architect, designer, and developer. This book will prepare you to successfully implement an unstructured data warehouse and, through clear explanations, examples, and case studies, you will learn new techniques and tips to successfully obtain and analyze text.

Master these ten objectives:

  • Build an unstructured data warehouse using the 11-step approach
  • Integrate text and describe it in terms of homogeneity, relevance, medium, volume, and structure
  • Overcome challenges including blather, the Tower of Babel, and lack of natural relationships
  • Avoid the Data Junkyard and combat the “Spider’s Web”
  • Reuse techniques perfected in the traditional data warehouse and Data Warehouse 2.0,including iterative development
  • Apply essential techniques for textual Extract, Transform, and Load (ETL) such as phrase recognition, stop word filtering, and synonym replacement
  • Design the Document Inventory system and link unstructured text to structured data
  • Leverage indexes for efficient text analysis and taxonomies for useful external categorization
  • Manage large volumes of data using advanced techniques such as backward pointers
  • Evaluate technology choices suitable for unstructured data processing, such as data warehouse appliances


Product Details

ISBN-13: 9781935504047
Publisher: Technics Publications, LLC
Publication date: 01/28/2011
Edition description: TECHNICS PUBLICATIONS LLC
Pages: 216
Product dimensions: 7.00(w) x 9.90(h) x 0.60(d)

About the Author

Bill Inmon, the father of data warehousing, has written 52 books translated into 9 languages. Bill has written over 1000 articles and conducted seminars and spoken at conferences on every continent except Antarctica. Bill holds three software patents and his latest company is Forest Rim Technology, a company dedicated to the access and integration of unstructured data into the structured world.

Krish Krishnan is a recognized thought leader in Data Warehouse Performance and Architecture. Krish writes and teaches Social Intelligence across the world and is a frequent speaker at industry conferences. He provides consulting advice to CxO’s on DW Strategy and is an Independent Analyst covering the Data Warehouse and Business Intelligence Industry.

Table of Contents

SECTION I: Unstructured Data Warehouse Essentials 17
CHAPTER 1: Exploring our Unstructured World 19
CHAPTER 2: Managing Unstructured Data 27
CHAPTER 3: Evolving to the Unstructured Data Warehouse 39
CHAPTER 4: Extracting, Transforming, and Loading Text 67
CHAPTER 5: Developing the Unstructured Data Warehouse 99
SECTION II: Unstructured Data Warehouse Advanced Topics 109
CHAPTER 6: Inventorying and Linking Text 111
CHAPTER 7: Using Indexes 121
CHAPTER 8: Leveraging Taxonomies 141
CHAPTER 9: Coping with Large Amounts of Data 153
Chapter 10: Selecting Technology 165
SECTION III: Unstructured Data Warehouse Case Studies 187
CHAPTER 11: The Ablatz Medical Group 189
CHAPTER 12: The Eastern Hills Oil Company 199
CHAPTER 13: The Amber Oil Company 203
From the B&N Reads Blog

Customer Reviews