Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications [NOOK Book]

Overview

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications brings together all the information, tools and methods a professional will need to efficiently use text mining applications and statistical analysis.

Winner of a 2012 PROSE Award in Computing and Information Sciences from the Association of American Publishers, this book presents a comprehensive how-to reference that shows the user how to conduct text mining and statistically analyze ...

See more details below
Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications

Available on NOOK devices and apps  
  • NOOK Devices
  • NOOK HD/HD+ Tablet
  • NOOK
  • NOOK Color
  • NOOK Tablet
  • Tablet/Phone
  • NOOK for Windows 8 Tablet
  • NOOK for iOS
  • NOOK for Android
  • NOOK Kids for iPad
  • PC/Mac
  • NOOK for Windows 8
  • NOOK for PC
  • NOOK for Mac
  • NOOK Study
  • NOOK for Web

Want a NOOK? Explore Now

NOOK Book (eBook)
$79.95
BN.com price

Overview

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications brings together all the information, tools and methods a professional will need to efficiently use text mining applications and statistical analysis.

Winner of a 2012 PROSE Award in Computing and Information Sciences from the Association of American Publishers, this book presents a comprehensive how-to reference that shows the user how to conduct text mining and statistically analyze results. In addition to providing an in-depth examination of core text mining and link detection tools, methods and operations, the book examines advanced preprocessing techniques, knowledge representation considerations, and visualization approaches. Finally, the book explores current real-world, mission-critical applications of text mining and link detection using real world example tutorials in such varied fields as corporate, finance, business intelligence, genomics research, and counterterrorism activities.

The world contains an unimaginably vast amount of digital information which is getting ever vaster ever more rapidly. This makes it possible to do many things that previously could not be done: spot business trends, prevent diseases, combat crime and so on. Managed well, the textual data can be used to unlock new sources of economic value, provide fresh insights into science and hold governments to account. As the Internet expands and our natural capacity to process the unstructured text that it contains diminishes, the value of text mining for information retrieval and search will increase dramatically.



-Extensive case studies, most in a tutorial format, allow the reader to 'click through' the example using a software program, thus learning to conduct text mining analyses in the most rapid manner of learning possible

-Numerous examples, tutorials, power points and datasets available via companion website on Elsevierdirect.com

-Glossary of text mining terms provided in the appendix

-CD included 

Read More Show Less

Editorial Reviews

From the Publisher
"They’ve done it again. From the same industry leaders who brought you the "bible" of data mining comes the definitive, go-to text mining resource. This book empowers you to dig in and seize value, with over two dozen hands-on tutorials that drive an incredible range of applications such as predicting marketing success and detecting customer sentiment, criminal lies, writing authorship, and patient schizophrenia. These step-by-step tutorials immediately place you in the practitioner’s driver’s seat, executing on text analytics. Beyond this, 17 more chapters cover the latest methods and the leading tools, making this the most comprehensive resource, and earning it a well-deserved place on your desk aside the authors’ data mining handbook." — Eric Siegel, Ph.D., Founder, Predictive Analytics World, Text Analytics World and Prediction Impact, Inc.

“Of the number of statistics books that are published each year, only a few stand out as really being important, meaning that they positively influence how future research is done in the subject area of the text. I believe that Practical Text Mining is just such a book.” — Joseph M. Hilbe, JD, PhD, Arizona State University and Jet Propulsion Laboratory

“When you want real help extracting insight from the mountains of text that you’re facing, this is the book to turn to for immediate practical advice.” — Karl Rexer, PhD, President, Rexer Analytics, Boston, MA

"The underlying premise is that almost all data in databases takes the form of unstructured text, or summaries of unstructured text, and that historians, marketers, crime investigators, and others need to know how to search that text for meaningful patterns — a very different process than reading. Contributors in a range of fields share their insights and experience with the process. After setting out the principles, they present tutorials and case studies, then move on to advanced topics." — Reference and Research Book News, Inc. "The authors of Practical Text Mining and Statistical Analysis for Nonstructured Text Data Applications have managed to produce three books in one. First, in 17 chapters they give a friendly yet comprehensive introduction to the huge field of text mining, a field comprising techniques from several different disciplines and a variety of different tasks. Miner and his colleagues have produced a readable overview of the area that is sure to help the practitioner navigate this large and unruly ocean of techniques. Second, the authors provide a comprehensive list and review of both the commercial and free software available to perform most text data mining tasks. Finally, and most importantly, the authors have also provided an amazing collection of tutorials and case studies. The tutorials illustrate various text mining scenarios and paths actually taken by researchers, while the case studies go into even more depth, showing both the methodology used and the business decisions taken based on the analysis. These practical step-by-step guides are impressive not only in the breadth of their applications but in the depth and detail that each case study delivers. The studies are authored by several guest authors in addition to the book authors and are built on real problems with real solutions. These case studies and tutorials alone make the book worth having. I have never seen such a collection of real business problems published in any field, much less in such a new field as text mining. These, together with the explanations in the chapters, should provide the practitioner wishing to get a broad view of the text mining field an invaluable resource for both learning and practice. — Richard De Veaux Professor of Statistics; Dept. of Mathematics and Statistics; Williams College; Williamstown MA 01267 "In writing Practical Text Mining and Statistical Analysis for Nonstructured Text Data Applications, the six authors (Miner, Delen, Elder, Fast, Hill, and Nisbet) accepted the daunting task of creating a cohesive operational framework from the disparate aspects and activities of text mining, an emerging field that they appropriately describe as the "Wild West" of data mining. Tapping into their unique expertise and applying a wide cross-application lens, they have succeeded in their mission. Rather than listing the facets of text mining simply as independent academic topics of discussion, the book leans much more to the practical, presenting a conceptual road map to assist users in correlating articulated text mining techniques to categories of actual commonly observed business needs. To finish out the job, summaries for some of the most prevalent commercial text mining solutions are included, along with examples. In this way, the authors have uniquely presented a text mining resource with value to readers across that breadth of business applications." — Gerard Britton, J.D. V.P., GRC Analytics, Opera Solutions LLC "Text Mining is one of those phrases people throw around as though it describes something singular. As the authors of Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications show us, nothing could be further from the truth. There is a rich, diverse ecosystem of text mining approaches and technologies available. Readers of this book will discover a myriad of ways to use these text mining approaches to understand and improve their business. Because the authors are a practical bunch the book is full of examples and tutorials that use every approach, multiple commercial and open source tools, and that show the power and trade-offs each involves. The case studies are worked through in detail by the authors so you can see exactly how things would be done and learn how to apply it to your own problems. If you are interested in text mining, and you should be, this book will give you a perspective that is broad, deep and approachable." — James Taylor CEO Decision Management Solutions

Read More Show Less

Product Details

  • ISBN-13: 9780123870117
  • Publisher: Elsevier Science
  • Publication date: 1/25/2012
  • Sold by: Barnes & Noble
  • Format: eBook
  • Pages: 1000
  • Sales rank: 1,277,438
  • File size: 37 MB
  • Note: This product may take a few minutes to download.

Meet the Author

Dr. Gary Miner received a B.S. from Hamline University, St. Paul, Minnesota, with Biology, Chemistry, and Education majors; an M.S. in Zoology and Population Genetics from the University of Wyoming; and a Ph.D. in biochemical genetics from the University of Kansas as the recipient of a NASA predoctoral fellowship. During the doctoral study years, he also studied mammalian genetics at the Jackson Laboratory, Bar Harbor, Maine, under a College Training Program on an NIH award; another College Training Program at the Bermuda Biological Station, St. George’s West, Bermuda, in a Marine Developmental Embryology course, on an NSF award; and a third College Training Program held at the University of California, San Diego, at the Molecular Techniques in Developmental Biology Institute, again on an NSF award. Following that he studied as a postdoctoral student at the University of Minnesota in behavioral genetics, where, along with research in schizophrenia and Alzheimer’s disease, he learned what was involved in writing books from assisting in editing two book manuscripts of his mentor Irving Gottesman, Ph.D.

Dr. John Elder heads the United States’ leading data mining consulting team, with offices in Charlottesville, Virginia; Washington, D.C.; and Baltimore, Maryland (www.datamininglab.com). Founded in 1995, Elder Research, Inc. focuses on investment, commercial, and security applications of advanced analytics, including text mining, image recognition, process optimization, cross-selling, biometrics, drug efficacy, credit scoring, market sector timing, and fraud detection. John obtained a B.S. and an M.E.E. in electrical engineering from Rice University and a Ph.D. in systems engineering from the University of Virginia, where he’s an adjunct professor teaching Optimization or Data Mining. Prior to 16 years at ERI, he spent five years in aerospace defense consulting, four years heading research at an investment management firm, and two years in Rice's Computational & Applied Mathematics Department.

Dr. Andrew Fast leads research in text mining and social network analysis at Elder Research. Dr. Fast graduated magna cum laude from Bethel University and earned an M.S. and a Ph.D. in computer science from the University of Massachusetts Amherst. There, his research focused on causal data mining and mining complex relational data such as social networks. At ERI, Andrew leads the development of new tools and algorithms for data and text mining for applications of capabilities assessment, fraud detection, and national security. Dr. Fast has published on an array of applications, including detecting securities fraud using the social network among brokers and understanding the structure of criminal and violent groups. Other publications cover modeling peer-to-peer music file sharing networks, understanding how collective classification works, and predicting playoff success of NFL head coaches (work featured on ESPN.com).

Thomas Hill received his Vordiplom in psychology from Kiel University in Germany and earned an M.S. in industrial psychology and a Ph.D. in psychology and quantitative methods from the University of Kansas. He was associate professor (and then research professor) at the University of Tulsa from 1984 to 2009, where he taught data analysis and data mining courses. He also has been vice president for Research and Development and then Analytic Solutions at StatSoft Inc., where he has been involved for over 20 years in the development of data analysis, data and text mining algorithms, and the delivery of analytic solutions. Dr. Hill has received numerous academic grants and awards from the National Science Foundation, the National Institute of Health, the Center for Innovation Management, the Electric Power Research Institute, and other institutions. He has completed diverse consulting projects with companies from practically all industries and has worked with the leading financial services, insurance, manufacturing, pharmaceutical, retailing, and other companies in the United States and internationally on identifying and refining effective data mining and predictive modeling solutions for diverse applications.

Dr. Nisbet was trained initially in ecosystems analysis. He has over 30 years of experience in complex systems analysis and modeling as a researcher (University of California, Santa Barbara). He entered business in 1994 to lead the team that developed the first data mining models of customer response for AT&T and NCR Corporation. While at NCR Corporation and Torrent Systems, he pioneered the design and development of configurable data mining applications for retail sales forecasting and Churn, Propensity-to-buy, and Customer Acquisition in Telecommunications and Insurance. In addition to data mining, he has expertise in data warehousing technology for Extract, Transform, and Load (ETL) operations; business intelligence reporting; and data quality analyses. He is lead author of the Handbook of Statistical Analysis & Data Mining Applications (Academic Press, 2009). Currently, he functions as a data scientist and independent data mining consultant.

Dr. Dursun Delen is the William S. Spears Chair in Business Administration and Associate Professor of Management Science and Information Systems in the Spears School of Business at Oklahoma State University (OSU). He received his Ph.D. in industrial engineering and management from OSU in 1997. Prior to his appointment as an assistant professor at OSU in 2001, he worked for a privately owned research and consultancy company, Knowledge Based Systems Inc., in College Station, Texas, as a research scientist for five years, during which he led a number of decision support and other information systems-related research projects funded by federal agencies, including DoD, NASA, NIST and DOE.

Read More Show Less

Table of Contents

Part I Basic Text Mining Principles 1. The History of Text Mining 2. The Seven Practice Areas of Text Analytics 3. Conceptual Foundations of Text Mining and Preprocessing Steps 4. Applications and Use Cases for Text Mining 5. Text Mining Methodology 6. Three Common Text Mining Software Tools

Part II Introduction to the Tutorial and Case Study Section of This Book AA. CASE STUDY: Using the Social Share of Voice to Predict Events That Are about to Happen BB. Mining Twitter for Airline Consumer Sentiment A. Using STATISTICA Text Miner to Monitor and Predict Success of Marketing Campaigns Based on Social Media Data B. Text Mining Improves Model Performance in Predicting Airplane Flight Accident Outcome C. Insurance Industry: Text Analytics Adds “Lift” to Predictive Models with STATISTICA Text and Data Miner D. Analysis of Survey Data for Establishing the “Best Medical Survey Instrument” Using Text Mining E. Analysis of Survey Data for Establishing “Best Medical Survey Instrument” Using Text Mining: Central Asian (Russian Language) Study Tutorial 2: Potential for Constructing Instruments That Have Increased Validity F. Using eBay Text for Predicting ATLAS Instrumental Learning G. Text Mining for Patterns in Children’s Sleep Disorders Using STATISTICA Text Miner H. Extracting Knowledge from Published Literature Using RapidMiner I. Text Mining Speech Samples: Can the Speech of Individuals Diagnosed with Schizophrenia Differentiate Them from Unaffected Controls? J. Text Mining Using STM, CART, and TreeNet from Salford Systems: Analysis of 16,000 iPod Auctions on eBay K. Predicting Micro Lending Loan Defaults Using SAS Text Miner L. Opera Lyrics: Text Analytics Compared by the Composer and the Century of CompositiondWagner versus Puccini M. CASE STUDY: Sentiment-Based Text Analytics to Better Predict Customer Satisfaction and Net Promoter Score Using IBM SPSS Modeler N. CASE STUDY: Detecting Deception in Text with Freely Available Text and Data Mining Tools O. Predicting Box Office Success of Motion Pictures with Text Mining P. A Hands-On Tutorial of Text Mining in PASW: Clustering and Sentiment Analysis Using Tweets from Twitter Q. A Hands-On Tutorial on Text Mining in SAS: Analysis of Customer Comments for Clustering and Predictive Modeling R. Scoring Retention and Success of Incoming College Freshmen Using Text Analytics S. Searching for Relationships in Product Recall Data from the Consumer Product Safety Commission with STATISTICA Text Miner T. Potential Problems That Can Arise in Text Mining: Example Using NALL Aviation Data U. Exploring the Unabomber Manifesto Using Text Miner V. Text Mining PubMed: Extracting Publications on Genes and Genetic Markers Associated with Migraine Headaches from PubMed Abstracts W. CASE STUDY: The Problem with the Use of Medical Abbreviations by Physicians and Health Care Providers X. Classifying Documents with Respect to “Earnings” and Then Making a Predictive Model for the Target Variable Using Decision Trees, MARSplines, Naïve Bayes Classifier, and K-Nearest Neighbors with STATISTICA Text Miner Y. CASE STUDY: Predicting Exposure of Social Messages: The Bin Laden Live Tweeter Z. The InFLUence Model: Web Crawling, Text Mining, and Predictive Analysis with 2010e2011 Influenza GuidelinesdCDC, IDSA, WHO, and FMC

Part III Advanced Topics 7. Text Classification and Categorization 8. Prediction in Text Mining: The Data Mining Algorithms of Predictive Analytics 9. Entity Extraction 10. Feature Selection and Dimensionality Reduction 11. Singular Value Decomposition in Text Mining 12. Web Analytics and Web Mining 13. Clustering Words and Documents 14. Leveraging Text Mining in Property and Casualty Insurance 15. Focused Web Crawling 16. The Future of Text and Web Analytics 17. Summary

GLOSSARY

INDEX

HOW TO USE THE DATA SETS AND THE TEXT MINING SOFTWARE ON THE DVD OR ON LINKS FOR PRACTICAL TEXT MINING

Read More Show Less

Customer Reviews

Be the first to write a review
( 0 )
Rating Distribution

5 Star

(0)

4 Star

(0)

3 Star

(0)

2 Star

(0)

1 Star

(0)

Your Rating:

Your Name: Create a Pen Name or

Barnes & Noble.com Review Rules

Our reader reviews allow you to share your comments on titles you liked, or didn't, with others. By submitting an online review, you are representing to Barnes & Noble.com that all information contained in your review is original and accurate in all respects, and that the submission of such content by you and the posting of such content by Barnes & Noble.com does not and will not violate the rights of any third party. Please follow the rules below to help ensure that your review can be posted.

Reviews by Our Customers Under the Age of 13

We highly value and respect everyone's opinion concerning the titles we offer. However, we cannot allow persons under the age of 13 to have accounts at BN.com or to post customer reviews. Please see our Terms of Use for more details.

What to exclude from your review:

Please do not write about reviews, commentary, or information posted on the product page. If you see any errors in the information on the product page, please send us an email.

Reviews should not contain any of the following:

  • - HTML tags, profanity, obscenities, vulgarities, or comments that defame anyone
  • - Time-sensitive information such as tour dates, signings, lectures, etc.
  • - Single-word reviews. Other people will read your review to discover why you liked or didn't like the title. Be descriptive.
  • - Comments focusing on the author or that may ruin the ending for others
  • - Phone numbers, addresses, URLs
  • - Pricing and availability information or alternative ordering information
  • - Advertisements or commercial solicitation

Reminder:

  • - By submitting a review, you grant to Barnes & Noble.com and its sublicensees the royalty-free, perpetual, irrevocable right and license to use the review in accordance with the Barnes & Noble.com Terms of Use.
  • - Barnes & Noble.com reserves the right not to post any review -- particularly those that do not follow the terms and conditions of these Rules. Barnes & Noble.com also reserves the right to remove any review at any time without notice.
  • - See Terms of Use for other conditions and disclaimers.
Search for Products You'd Like to Recommend

Recommend other products that relate to your review. Just search for them below and share!

Create a Pen Name

Your Pen Name is your unique identity on BN.com. It will appear on the reviews you write and other website activities. Your Pen Name cannot be edited, changed or deleted once submitted.

 
Your Pen Name can be any combination of alphanumeric characters (plus - and _), and must be at least two characters long.

Continue Anonymously

    If you find inappropriate content, please report it to Barnes & Noble
    Why is this product inappropriate?
    Comments (optional)