Data Simplification: Taming Information With Open Source Tools
Data Simplification: Taming Information With Open Source Tools addresses the simple fact that modern data is too big and complex to analyze in its native form. Data simplification is the process whereby large and complex data is rendered usable. Complex data must be simplified before it can be analyzed, but the process of data simplification is anything but simple, requiring a specialized set of skills and tools. This book provides data scientists from every scientific discipline with the methods and tools to simplify their data for immediate analysis or long-term storage in a form that can be readily repurposed or integrated with other data. Drawing upon years of practical experience, and using numerous examples and use cases, Jules Berman discusses the principles, methods, and tools that must be studied and mastered to achieve data simplification, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data, natural language processing and machine translation as a tool to simplify data, and data summarization and visualization and the role they play in making data useful for the end user. - Discusses data simplification principles, methods, and tools that must be studied and mastered - Provides open source tools, free utilities, and snippets of code that can be reused and repurposed to simplify data - Explains how to best utilize indexes to search, retrieve, and analyze textual data - Shows the data scientist how to apply ontologies, classifications, classes, properties, and instances to data using tried and true methods
1132569691
Data Simplification: Taming Information With Open Source Tools
Data Simplification: Taming Information With Open Source Tools addresses the simple fact that modern data is too big and complex to analyze in its native form. Data simplification is the process whereby large and complex data is rendered usable. Complex data must be simplified before it can be analyzed, but the process of data simplification is anything but simple, requiring a specialized set of skills and tools. This book provides data scientists from every scientific discipline with the methods and tools to simplify their data for immediate analysis or long-term storage in a form that can be readily repurposed or integrated with other data. Drawing upon years of practical experience, and using numerous examples and use cases, Jules Berman discusses the principles, methods, and tools that must be studied and mastered to achieve data simplification, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data, natural language processing and machine translation as a tool to simplify data, and data summarization and visualization and the role they play in making data useful for the end user. - Discusses data simplification principles, methods, and tools that must be studied and mastered - Provides open source tools, free utilities, and snippets of code that can be reused and repurposed to simplify data - Explains how to best utilize indexes to search, retrieve, and analyze textual data - Shows the data scientist how to apply ontologies, classifications, classes, properties, and instances to data using tried and true methods
59.95 In Stock
Data Simplification: Taming Information With Open Source Tools

Data Simplification: Taming Information With Open Source Tools

by Jules J. Berman
Data Simplification: Taming Information With Open Source Tools

Data Simplification: Taming Information With Open Source Tools

by Jules J. Berman

eBook

$59.95 

Available on Compatible NOOK devices, the free NOOK App and in My Digital Library.
WANT A NOOK?  Explore Now

Related collections and offers


Overview

Data Simplification: Taming Information With Open Source Tools addresses the simple fact that modern data is too big and complex to analyze in its native form. Data simplification is the process whereby large and complex data is rendered usable. Complex data must be simplified before it can be analyzed, but the process of data simplification is anything but simple, requiring a specialized set of skills and tools. This book provides data scientists from every scientific discipline with the methods and tools to simplify their data for immediate analysis or long-term storage in a form that can be readily repurposed or integrated with other data. Drawing upon years of practical experience, and using numerous examples and use cases, Jules Berman discusses the principles, methods, and tools that must be studied and mastered to achieve data simplification, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data, natural language processing and machine translation as a tool to simplify data, and data summarization and visualization and the role they play in making data useful for the end user. - Discusses data simplification principles, methods, and tools that must be studied and mastered - Provides open source tools, free utilities, and snippets of code that can be reused and repurposed to simplify data - Explains how to best utilize indexes to search, retrieve, and analyze textual data - Shows the data scientist how to apply ontologies, classifications, classes, properties, and instances to data using tried and true methods

Product Details

ISBN-13: 9780128038543
Publisher: Morgan Kaufmann Publishers
Publication date: 03/10/2016
Sold by: Barnes & Noble
Format: eBook
Pages: 398
File size: 7 MB

About the Author

Jules Berman holds two Bachelor of Science degrees from MIT (in Mathematics and in Earth and Planetary Sciences), a PhD from Temple University, and an MD from the University of Miami. He was a graduate researcher at the Fels Cancer Research Institute (Temple University) and at the American Health Foundation in Valhalla, New York. He completed his postdoctoral studies at the US National Institutes of Health, and his residency at the George Washington University Medical Center in Washington, DC. Dr. Berman served as Chief of anatomic pathology, surgical pathology, and cytopathology at the Veterans Administration Medical Center in Baltimore, Maryland, where he held joint appointments at the University of Maryland Medical Center and at the Johns Hopkins Medical Institutions. In 1998, he transferred to the US National Institutes of Health as a Medical Officer and as the Program Director for Pathology Informatics in the Cancer Diagnosis Program at the National Cancer Institute. Dr. Berman is a past President of the Association for Pathology Informatics and is the 2011 recipient of the Association's Lifetime Achievement Award. He is a listed author of more than 200 scientific publications and has written more than a dozen books in his three areas of expertise: informatics, computer programming, and pathology. Dr. Berman is currently a freelance writer.
Jules Berman holds two Bachelor of Science degrees from MIT (in Mathematics and in Earth and Planetary Sciences), a PhD from Temple University, and an MD from the University of Miami. He was a graduate researcher at the Fels Cancer Research Institute (Temple University) and at the American Health Foundation in Valhalla, New York. He completed his postdoctoral studies at the US National Institutes of Health, and his residency at the George Washington University Medical Center in Washington, DC. Dr. Berman served as Chief of anatomic pathology, surgical pathology, and cytopathology at the Veterans Administration Medical Center in Baltimore, Maryland, where he held joint appointments at the University of Maryland Medical Center and at the Johns Hopkins Medical Institutions. In 1998, he transferred to the US National Institutes of Health as a Medical Officer and as the Program Director for Pathology Informatics in the Cancer Diagnosis Program at the National Cancer Institute. Dr. Berman is a past President of the Association for Pathology Informatics and is the 2011 recipient of the Association’s Lifetime Achievement Award. He is a listed author of more than 200 scientific publications and has written more than a dozen books in his three areas of expertise: informatics, computer programming, and pathology. Dr. Berman is currently a freelance writer.

Table of Contents

1. The Simple Life2. Structuring Text3. Indexing Text4. Understanding Your Data5. Identifying and Deidentifying Data6. Giving Meaning to Data7. Object-oriented data8. Problem simplification

What People are Saying About This

From the Publisher

This comprehensive book teaches readers how to collect, categorize, simplify, and make sense of data using a step -by-step methodology that includes data simplication methods, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data.

From the B&N Reads Blog

Customer Reviews