Pub. Date:
Data Mining Methods and Models / Edition 1

Data Mining Methods and Models / Edition 1

by Daniel T. Larose


Current price is , Original price is $139.0. You

Temporarily Out of Stock Online

Please check back later for updated availability.

Product Details

ISBN-13: 9780471666561
Publisher: Wiley
Publication date: 01/30/2006
Pages: 344
Product dimensions: 6.48(w) x 9.57(h) x 0.88(d)

About the Author

DANIEL T. LAROSE, PhD, received his PhD in statistics from the University of Connecticut. An associate professor of statistics at Central Connecticut State University, he developed and directs Data Mining@CCSU, the world's first online master of science program in data mining. He has also worked as a data mining consultant for Connecticut-area companies. He is the author of Discovering Knowledge in Data: An Introduction to Data Mining (Wiley), and is currently working on the third book of his three-volume set on data mining: Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage (with Zdravko Markov, PhD), scheduled to be published by Wiley in 2006.

Table of Contents


1. Dimension Reduction Methods.

Need for Dimension Reduction in Data Mining.

Principal Components Analysis.

Factor Analysis.

User-Defined Composites.

2. Regression Modeling.

Example of Simple Linear Regression.

Least-Squares Estimates.

Coefficient or Determination.

Correlation Coefficient.

The ANOVA Table.

Outliers, High Leverage Points, and Influential Observations.

The Regression Model.

Inference in Regression.

Verifying the Regression Assumptions.

An Example: The Baseball Data Set.

An Example: The California Data Set.

Transformations to Achieve Linearity.

3. Multiple Regression and Model Building.

An Example of Multiple Regression.

The Multiple Regression Model.

Inference in Multiple Regression.

Regression with Categorical Predictors.


Variable Selection Methods.

An Application of Variable Selection Methods.

Mallows’ C p Statistic.

Variable Selection Criteria.

Using the Principal Components as Predictors in Multiple Regression.

4. Logistic Regression.

A Simple Example of Logistic Regression.

Maximum Likelihood Estimation.

Interpreting Logistic Regression Output.

Inference: Are the Predictors Significant?

Interpreting the Logistic Regression Model.

Interpreting a Logistic Regression Model for a Dichotomous Predictor.

Interpreting a Logistic Regression Model for a Polychotomous Predictor.

Interpreting a Logistic Regression Model for a Continuous Predictor.

The Assumption of Linearity.

The Zero-Cell Problem.

Multiple Logistic Regression.

Introducing Higher Order terms to Handle Non-Linearity.

Validating the Logistic Regression Model.

WEKA: Hands-On Analysis Using Logistic Regression.

5. Naïve Bayes and Bayesian Networks.

The Bayesian Approach.

The Maximum a Posteriori (MAP) Classification.

The Posterior Odds Ratio.

Balancing the Data.

Naïve Bayes Classification.

Numeric Predictors for Naïve Bayes Classification.

WEKA: Hands-On Analysis Using Naïve Bayes.

Bayesian Belief Networks.

Using the Bayesian Network to Find Probabilities.

WEKA: Hands-On Analysis Using Bayes Net.

6. Genetic Algorithms.

Introduction to Genetic Algorithms.

The Basic Framework of a Genetic Algorithm.

A Simple Example of Genetic Algorithms at Work.

Modifications and Enhancements: Selection.

Modifications and enhancements: Crossover.

Genetic Algorithms for Real-Valued Variables.

Using Genetic Algorithms to Train a Neural Network.

WEKA: Hands-On Analysis Using Genetic Algorithms.

7. Case Study: Modeling Response to Direct-Mail Marketing.

The Cross-Industry Standard Process for Data Mining: CRISP-DM.

Business Understanding Phase.

Data Understanding and Data Preparation Phases.

The Modeling Phase and the Evaluation Phase.


What People are Saying About This

From the Publisher

"..the book is interesting to read, and the methods will be useful for data mining researchers…" (Computing, August 17, 2007)

"…an excellent problem-solving resource..." (CHOICE, June 2007)

"…the latest techniques…insight into how data mining algorithms work…" (Materials World, April 2007)

Customer Reviews

Most Helpful Customer Reviews

See All Customer Reviews