Data Analysis Using Regression and Multilevel/Hierarchical Models

Hardcover (Print)
Buy New
Buy New from
Used and New from Other Sellers
Used and New from Other Sellers
from $112.76
Usually ships in 1-2 business days
(Save 16%)
Other sellers (Hardcover)
  • All (9) from $112.76   
  • New (6) from $126.06   
  • Used (3) from $112.76   


Data Analysis Using Regression and Multilevel/Hierarchical Models is a comprehensive manual for the applied researcher who wants to perform data analysis using linear and nonlinear regression and multilevel models. The book introduces a wide variety of models, whilst at the same time instructing the reader in how to fit these models using available software packages. The book illustrates the concepts by working through scores of real data examples that have arisen from the authors’ own applied research, with programming codes provided for each one. Topics covered include causal inference, including regression, poststratification, matching, regression discontinuity, and instrumental variables, as well as multilevel logistic regression and missing-data imputation. Practical tips regarding building, fitting, and understanding are provided throughout. Author resource page:

Read More Show Less

Editorial Reviews

From the Publisher
"Data Analysis Using Regression and Multilevel/Hierarchical Models … careful yet mathematically accessible style is generously illustrated with examples and graphical displays, making it ideal for either classroom use or self-study. It appears destined to adorn the shelves of a great many applied statisticians and social scientists for years to come."
Brad Carlin, University of Minnesota

"Gelman and Hill have written what may be the first truly modern book on modeling. Containing practical as well as methodological insights into both Bayesian and traditional approaches, Data Analysis Using Regression and Multilevel/Hierarchical Models provides useful guidance into the process of building and evaluating models. For the social scientist and other applied statisticians interested in linear and logistic regression, causal inference, and hierarchical models, it should prove invaluable either as a classroom text or as an addition to the research bookshelf."
Richard De Veaux, Williams College

"The theme of Gelman and Hill's engaging and nontechnical introduction to statistical modeling is 'Be flexible.' Using a broad array of examples written in R and WinBugs, the authors illustrate the many ways in which readers can build more flexibility into their predictive and causal models. This hands-on textbook is sure to become a popular choice in applied regression courses."
Donald Green, Yale University

"Simply put, Data Analysis Using Regression and Multilevel/Hierarchical Models is the best place to learn how to do serious empirical research. Gelman and Hill have written a much needed book that is sophisticated about research design without being technical. Data Analysis Using Regression and Multilevel/Hierarchical Models is destined to be a classic!"
Alex Tabarrok, George Mason University

"a detailed, carefully written exposition of the modelling challenge, using numerous convincing examples, and always paying careful attention to the practical aspects of modeling. I recommend it very warmly."
Journal of Applied Statistics

"Gelman and Hill's book is an excellent intermediate text that would be very useful for researchers interested in multilevel modeling... This book gives a wealth of information for anyone interested in multilevel modeling and seems destined to be a classic."
Brandon K. Vaughn, Journal of Eductional Measurement

"With their new book, Data Analysis Using Regression and Multilevel/Hierarchical Models, Drs. Gelman and Hill have raised the bar for what a book on applied statistical modeling should seek to accomplish. The book is extraordinarily broad in scope, modern in its approach and philosophy, and ambitious in its goals... I am tremendously impressed with this book and highly recommend it. Data Analysis Using Regression and Multilevel/Hierarchical Models deserves to be widely read by applied statisticians and practicing researchers, especially in the social sciences. Instructors considering textbooks for courses on the practice of statistical modeling should move this book to the top of their list."
Daniel B. Hall, Journal of the American Statistical Association

"Data Analysis Using Regression and Multilevel/Hierarchical Models is the book I wish I had in graduate school."
Timothy Hellwig, The Political Methodologist

Read More Show Less

Product Details

  • ISBN-13: 9780521867061
  • Publisher: Cambridge University Press
  • Publication date: 9/1/2007
  • Series: Analytical Methods for Social Research Series
  • Edition description: First Edition
  • Pages: 648
  • Sales rank: 552,293
  • Product dimensions: 7.01 (w) x 10.00 (h) x 1.38 (d)

Meet the Author

Andrew Gelman is Professor of Statistics and Professor of Political Science at Columbia University. He has published over 150 articles in statistical theory, methods, and computation, and in applications areas including decision analysis, survey sampling, political science, public health, and policy. His other books are Bayesian Data Analysis (1995, second edition 2003) and Teaching Statistics: A Bag of Tricks (2002).

Jennifer Hill is Assistant Professor of Public Affairs in the Department of International and Public Affairs at Columbia University. She has co-authored articles that have appeared in the Journal of the American Statistical Association, American Political Science Review, American Journal of Public Health, Developmental Psychology, the Economic Journal and the Journal of Policy Analysis and Management, among others.

Read More Show Less

Table of Contents

List of examples     xvii
Preface     xix
Why?     1
What is multilevel regression modeling?     1
Some examples from our own research     3
Motivations for multilevel modeling     6
Distinctive features of this book     8
Computing     9
Concepts and methods from basic probability and statistics     13
Probability distributions     13
Statistical inference     16
Classical confidence intervals     18
Classical hypothesis testing     20
Problems with statistical significance     22
55,000 residents desperately need your help!     23
Bibliographic note     26
Exercises     26
Single-level regression     29
Linear regression: the basics     31
One predictor     31
Multiple predictors     32
Interactions     34
Statistical inference     37
Graphical displays of data and fitted model     42
Assumptions and diagnostics     45
Prediction and validation     47
Bibliographic note     49
Exercises     49
Linear regression:before and after fitting the model     53
Linear transformations     53
Centering and standardizing, especially for models with interactions     55
Correlation and "regression to the mean"     57
Logarithmic transformations     59
Other transformations     65
Building regression models for prediction     68
Fitting a series of regressions     73
Bibliographic note     74
Exercises     74
Logistic regression     79
Logistic regression with a single predictor     79
Interpreting the logistic regression coefficients     81
Latent-data formulation     85
Building a logistic regression model: wells in Bangladesh     86
Logistic regression with interactions     92
Evaluating, checking, and comparing fitted logistic regressions     97
Average predictive comparisons on the probability scale     101
Identifiability and separation     104
Bibliographic note     105
Exercises     105
Generalized linear models     109
Introduction     109
Poisson regression, exposure, and overdispersion     110
Logistic-binomial model      116
Probit regression: normally distributed latent data     118
Ordered and unordered categorical regression     119
Robust regression using the t model     124
Building more complex generalized linear models     125
Constructive choice models     127
Bibliographic note     131
Exercises     132
Working with regression inferences     135
Simulation of probability models and statistical inferences     137
Simulation of probability models     137
Summarizing linear regressions using simulation: an informal Bayesian approach     140
Simulation for nonlinear predictions: congressional elections     144
Predictive simulation for generalized linear models     148
Bibliographic note     151
Exercises     152
Simulation for checking statistical procedures and model fits     155
Fake-data simulation     155
Example: using fake-data simulation to understand residual plots     157
Simulating from the fitted model and comparing to actual data     158
Using predictive simulation to check the fit of a time-series model     163
Bibliographic note     165
Exercises     165
Causal inference using regression on the treatment variable     167
Causal inference and predictive comparisons     167
The fundamental problem of causal inference     170
Randomized experiments     172
Treatment interactions and poststratification     178
Observational studies     181
Understanding causal inference in observational studies     186
Do not control for post-treatment variables     188
Intermediate outcomes and causal paths     190
Bibliographic note     194
Exercises     194
Causal inference using more advanced models     199
Imbalance and lack of complete overlap     199
Subclassification: effects and estimates for different subpopulations     204
Matching: subsetting the data to get overlapping and balanced treatment and control groups     206
Lack of overlap when the assignment mechanism is known: regression discontinuity     212
Estimating causal effects indirectly using instrumental variables     215
Instrumental variables in a regression framework     220
Identification strategies that make use of variation within or between groups     226
Bibliographic note     229
Exercises     231
Multilevel regression      235
Multilevel structures     237
Varying-intercept and varying-slope models     237
Clustered data: child support enforcement in cities     237
Repeated measurements, time-series cross sections, and other non-nested structures     241
Indicator variables and fixed or random effects     244
Costs and benefits of multilevel modeling     246
Bibliographic note     247
Exercises     248
Multilevel linear models: the basics     251
Notation     251
Partial pooling with no predictors     252
Partial pooling with predictors     254
Quickly fitting multilevel models in R     259
Five ways to write the same model     262
Group-level predictors     265
Model building and statistical significance     270
Predictions for new observations and new groups     272
How many groups and how many observations per group are needed to fit a multilevel model?     275
Bibliographic note     276
Exercises     277
Multilevel linear models: varying slopes, non-nested models, and other complexities     279
Varying intercepts and slopes     279
Varying slopes without varying intercepts      283
Modeling multiple varying coefficients using the scaled inverse-Wishart distribution     284
Understanding correlations between group-level intercepts and slopes     287
Non-nested models     289
Selecting, transforming, and combining regression inputs     293
More complex multilevel models     297
Bibliographic note     297
Exercises     298
Multilevel logistic regression     301
State-level opinions from national polls     301
Red states and blue states: what's the matter with Connecticut?     310
Item-response and ideal-point models     314
Non-nested overdispersed model for death sentence reversals     320
Bibliographic note     321
Exercises     322
Multilevel generalized linear models     325
Overdispersed Poisson regression: police stops and ethnicity     325
Ordered categorical regression: storable votes     331
Non-nested negative-binomial model of structure in social networks     332
Bibliographic note     342
Exercises     342
Fitting multilevel models     343
Multilevel modeling in Bugs and R: the basics     345
Why you should learn Bugs      345
Bayesian inference and prior distributions     345
Fitting and understanding a varying-intercept multilevel model using R and Bugs     348
Step by step through a Bugs model, as called from R     353
Adding individual- and group-level predictors     359
Predictions for new observations and new groups     361
Fake-data simulation     363
The principles of modeling in Bugs     366
Practical issues of implementation     369
Open-ended modeling in Bugs     370
Bibliographic note     373
Exercises     373
Fitting multilevel linear and generalized linear models in Bugs and R     375
Varying-intercept, varying-slope models     375
Varying intercepts and slopes with group-level predictors     379
Non-nested models     380
Multilevel logistic regression     381
Multilevel Poisson regression     382
Multilevel ordered categorical regression     383
Latent-data parameterizations of generalized linear models     384
Bibliographic note     385
Exercises     385
Likelihood and Bayesian inference and computation     387
Least squares and maximum likelihood estimation      387
Uncertainty estimates using the likelihood surface     390
Bayesian inference for classical and multilevel regression     392
Gibbs sampler for multilevel linear models     397
Likelihood inference, Bayesian inference, and the Gibbs sampler: the case of censored data     402
Metropolis algorithm for more general Bayesian computation     408
Specifying a log posterior density, Gibbs sampler, and Metropolis algorithm in R     409
Bibliographic note     413
Exercises     413
Debugging and speeding convergence     415
Debugging and confidence building     415
General methods for reducing computational requirements     418
Simple linear transformations     419
Redundant parameters and intentionally nonidentifiable models     419
Parameter expansion: multiplicative redundant parameters     424
Using redundant parameters to create an informative prior distribution for multilevel variance parameters     427
Bibliographic note     434
Exercises     434
Prom data collection to model understanding to model checking     435
Sample size and power calculations     437
Choices in the design of data collection     437
Classical power calculations: general principles, as illustrated by estimates of proportions     439
Classical power calculations for continuous outcomes     443
Multilevel power calculation for cluster sampling     447
Multilevel power calculation using fake-data simulation     449
Bibliographic note     454
Exercises     454
Understanding and summarizing the fitted models     457
Uncertainty and variability     457
Superpopulation and finite-population variances     459
Contrasts and comparisons of multilevel coefficients     462
Average predictive comparisons     466
R[superscript 2] and explained variance     473
Summarizing the amount of partial pooling     477
Adding a predictor can increase the residual variance!     480
Multiple comparisons and statistical significance     481
Bibliographic note     484
Exercises     485
Analysis of variance     487
Classical analysis of variance     487
ANOVA and multilevel linear and generalized linear models     490
Summarizing multilevel models using ANOVA     492
Doing ANOVA using multilevel models     494
Adding predictors: analysis of covariance and contrast analysis      496
Modeling the variance parameters: a split-plot latin square     498
Bibliographic note     501
Exercises     501
Causal inference using multilevel models     503
Multilevel aspects of data collection     503
Estimating treatment effects in a multilevel observational study     506
Treatments applied at different levels     507
Instrumental variables and multilevel modeling     509
Bibliographic note     512
Exercises     512
Model checking and comparison     513
Principles of predictive checking     513
Example: a behavioral learning experiment     515
Model comparison and deviance     524
Bibliographic note     526
Exercises     527
Missing-data imputation     529
Missing-data mechanisms     530
Missing-data methods that discard data     531
Simple missing-data approaches that retain all the data     532
Random imputation of a single variable     533
Imputation of several missing variables     539
Model-based imputation     540
Combining inferences from multiple imputations     542
Bibliographic note      542
Exercises     543
Appendixes     545
Six quick tips to improve your regression modeling     547
Fit many models     547
Do a little work to make your computations faster and more reliable     547
Graphing the relevant and not the irrelevant     548
Transformations     548
Consider all coefficients as potentially varying     549
Estimate causal inferences in a targeted way, not as a byproduct of a large regression     549
Statistical graphics for research and presentation     551
Reformulating a graph by focusing on comparisons     552
Scatterplots     553
Miscellaneous tips     559
Bibliographic note     562
Exercises     563
Software     565
Getting started with R, Bugs, and a text editor     565
Fitting classical and multilevel regressions in R     565
Fitting models in Bugs and R     567
Fitting multilevel models using R, Stata, SAS, and other software     568
Bibliographic note     573
References     575
Author index     601
Subject index     607
Read More Show Less

Customer Reviews

Average Rating 2.5
( 2 )
Rating Distribution

5 Star


4 Star


3 Star


2 Star


1 Star


Your Rating:

Your Name: Create a Pen Name or

Barnes & Review Rules

Our reader reviews allow you to share your comments on titles you liked, or didn't, with others. By submitting an online review, you are representing to Barnes & that all information contained in your review is original and accurate in all respects, and that the submission of such content by you and the posting of such content by Barnes & does not and will not violate the rights of any third party. Please follow the rules below to help ensure that your review can be posted.

Reviews by Our Customers Under the Age of 13

We highly value and respect everyone's opinion concerning the titles we offer. However, we cannot allow persons under the age of 13 to have accounts at or to post customer reviews. Please see our Terms of Use for more details.

What to exclude from your review:

Please do not write about reviews, commentary, or information posted on the product page. If you see any errors in the information on the product page, please send us an email.

Reviews should not contain any of the following:

  • - HTML tags, profanity, obscenities, vulgarities, or comments that defame anyone
  • - Time-sensitive information such as tour dates, signings, lectures, etc.
  • - Single-word reviews. Other people will read your review to discover why you liked or didn't like the title. Be descriptive.
  • - Comments focusing on the author or that may ruin the ending for others
  • - Phone numbers, addresses, URLs
  • - Pricing and availability information or alternative ordering information
  • - Advertisements or commercial solicitation


  • - By submitting a review, you grant to Barnes & and its sublicensees the royalty-free, perpetual, irrevocable right and license to use the review in accordance with the Barnes & Terms of Use.
  • - Barnes & reserves the right not to post any review -- particularly those that do not follow the terms and conditions of these Rules. Barnes & also reserves the right to remove any review at any time without notice.
  • - See Terms of Use for other conditions and disclaimers.
Search for Products You'd Like to Recommend

Recommend other products that relate to your review. Just search for them below and share!

Create a Pen Name

Your Pen Name is your unique identity on It will appear on the reviews you write and other website activities. Your Pen Name cannot be edited, changed or deleted once submitted.

Your Pen Name can be any combination of alphanumeric characters (plus - and _), and must be at least two characters long.

Continue Anonymously
Sort by: Showing all of 2 Customer Reviews
  • Anonymous

    Posted February 9, 2014

    This is a wonderful book but the ebook version has figures that

    This is a wonderful book but the ebook version has figures that are so small as to be unreadable. I think it is fair to call it defective. It should not be for sale. 

    Was this review helpful? Yes  No   Report this review
  • Anonymous

    Posted March 26, 2015

    No text was provided for this review.

Sort by: Showing all of 2 Customer Reviews

If you find inappropriate content, please report it to Barnes & Noble
Why is this product inappropriate?
Comments (optional)