Statistical Bioinformatics: For Biomedical and Life Science Researchers / Edition 1

Paperback (Print)
Buy New
Buy New from
Used and New from Other Sellers
Used and New from Other Sellers
from $86.03
Usually ships in 1-2 business days
(Save 28%)
Other sellers (Paperback)
  • All (3) from $86.03   
  • New (2) from $91.43   
  • Used (1) from $86.03   


This book provides an essential understanding of statistical concepts necessary for the analysis of genomic and proteomic data using computational techniques. The author presents both basic and advanced topics, focusing on those that are relevant to the computational analysis of large data sets in biology. Chapters begin with a description of a statistical concept and a current example from biomedical research, followed by more detailed presentation, discussion of limitations, and problems. The book starts with an introduction to probability and statistics for genome-wide data, and moves into topics such as clustering, classification, multi-dimensional visualization, experimental design, statistical resampling, and statistical network analysis.

  • Clearly explains the use of bioinformatics tools in life sciences research without requiring an advanced background in math/statistics
  • Enables biomedical and life sciences researchers to successfully evaluate the validity of their results and make inferences
  • Enables statistical and quantitative researchers to rapidly learn novel statistical concepts and techniques appropriate for large biological data analysis
  • Carefully revisits frequently used statistical approaches and highlights their limitations in large biological data analysis
  • Offers programming examples and datasets
  • Includes chapter problem sets, a glossary, a list of statistical notations, and appendices with references to background mathematical and technical material
  • Features supplementary materials, including datasets, links, and a statistical package available online

Statistical Bioinformatics is an ideal textbook for students in medicine, life sciences, and bioengineering, aimed at researchers who utilize computational tools for the analysis of genomic, proteomic, and many other emerging high-throughput molecular data. It may also serve as a rapid introduction to the bioinformatics science for statistical and computational students and audiences who have not experienced such analysis tasks before.

Read More Show Less

Product Details

  • ISBN-13: 9780471692720
  • Publisher: Wiley
  • Publication date: 3/1/2010
  • Series: Methods of Biochemical Analysis Series
  • Edition description: New Edition
  • Edition number: 1
  • Pages: 370
  • Sales rank: 1,449,066
  • Product dimensions: 6.10 (w) x 9.10 (h) x 0.90 (d)

Meet the Author

Jae K. Lee, Ph.D., is a professor of biostatistics and epidemiology in the Department of Health Evaluation Sciences at the University of Virginia School of Medicine, where he designed and teaches a course on Statistical Bioinformatics in Medicine. He earned his doctorate in statistical genetics from the University of Wisconsin, Madison. He was previously a research scientist in the Laboratory of Molecular Pharmacology, National Cancer Institute. Among his current research interests is the integration of statistical and genomic information for the analysis of microarray data.

Read More Show Less

Table of Contents

Preface xi

Contributors xiii

1 Road Statistical Bioinformatics 1

Challenge 1 Multiple-Comparisons Issue 1

Challenge 2 High-Dimensional Biological Data 2

Challenge 3 Small-n and Large-p problem 3

Challenge 4 Noisy High-Throughput Biological Data 3

Challenge 5 Integration of multiple, Heterogeneous Biological Data Information References 5

2 Probability Concepts and Distributions for analyzing Large Biological Data 7

2.1 Introduction 7

2.2 Basic Concepts 8

2.3 Conditional Probability and Independence 10

2.4 Random Variables 13

2.5 Expected Value and Variance 15

2.6 Distributions of Random Variable 19

2.7 Joint and Marginal Distribution 39

2.8 Multivariate Distribution 42

2.9 Sampling Distribution 46

2.10 Summary 54

3 Quality Control of High-Throughput Biological Data 57

3.1 Sources of Error in High-Throughput Biological Experiments 57

3.2 Statistical Techniques for Quality Control 59

3.3 Issues specific to Microarray Gene Expression Experiments 66

3.4 Conclusion 69

References 69

4 Statistical Testing and Significance for Large Biological Data Analysis 71

4.1 Introduction 71

4.2 Statistical Testing 72

4.3 Error Controlling 78

4.4 Real Data Analysis 81

4.5 Concluding Remarks 87

Acknowledgement 87

References 87

5 Clustering: Unsupervised Learning in Large Biological Data 89

5.1 Measure of Similarity 90

5.2 Clustering 99

5.3 Assessment of Cluster Quality 115

5.4 Conclusion 123

References 123

6 Classification: Supervised Learning with High-Dimensional Biological Data 129

6.1 Introduction 129

6.2 Classification and Prediction Methods 132

6.3 Feature Selection and Ranking 140

6.4 Cross-Validation 144

6.5 Enhancement of Class Prediction by Ensemble Voting Methods 145

6.6 Comparison of Classification Methods Using High-Dimension Data 147

6.7 Software Examples for Classification Methods 150

References 154

7 Multidimensional Analysis and Visualization on Large Biomedical Data 157

7.1 Introduction 157

7.2 Classical Multidimensional Visualization Techniques 158

7.3 Two-Dimensional Projections 161

7.4 Issues and Challenges 165

7.5 Systematic Exploration of Low Dimensional Projections 166

7.6 One-Dimensional Histogram Ordering 170

7.7 Two-Dimensional Histogram Ordering 174

7.8 Conclusion 181

References 182

8 Statistical Models, Inferences, and Algorithms for Large Biological Data Analysis 185

8.1 Introduction 185

8.2 Statistical/Problematic Models 187

8.3 Estimation Methods 189

8.4 Numerical Algorithms 191

8.5 Examples 192

8.9 Conclusion 198

References 199

9 Expoerimental Designs on High-Throughput Biological Experiments 201

9.1 Randomization 201

9.2 Replication 202

9.3 Pooling 209

9.4 Blocking 210

9.5 Design for Classifications 214

9.6 Design for Time Course Experiments 215

9.7 Design for eQTL Studies 215

Reference 216

10 Statistical Resampling Techniques for Large Biological Data Analysis 219

10.1 Introduction 219

10.2 Resampling Methods for Prediction Error Assessment and Model Selection 221

10.3 Feature Selection 225

10.4 Resampling-Based Classification Algorithms 226

10.5 Practical Example: Lymphoma 226

10.6 Resampling Methods 227

10.7 Bootstrap Methods 232

10.8 Sample Size Issues 233

10.9 Loss Functions 235

10.10 Bootstrap Resampling for Quantifying Uncertainty 236

10.11 Markov Chain Monte Carlo Methods 238

10.12 Conclusion 240

References 247

11 Statistical Network Analysis for Biological Systems and Pathways 249

11.1 Introduction 249

11.2 Boolean Network Modeling 250

11.3 Bayesian Belief Network 259

11.4 Modeling of Metabolic Networks 273

References 279

12 Trends and Statistical Challenges in Genomewide Association Studies 283

12.1 Introduction 283

12.2 Alles, Linkage Disequilibrium, and Haplotype 283

12.3 International Hap Map Project 285

12.4 Genotyping Platforms 286

12.5 Overview of Current GWAS Results 287

12.6 Statistical Issues in GWAS 290

12.7 Haplotype Analysis 296

12.8 Homozygosity and Admixture Mapping 298

12.9 Gene x Gene and Gene x Environmental Interactions 298

12.10 Gene and Pathway-Based Analysis 299

12.11 Disease Risk Estimates 301

12.12 Meta-Analysis 301

12.13 Rare Variants and Sequence-Based Analysis 302

12.14 Conclusions 303

Acknowledgment 303

References 303

13 Rand Bioconductor Packages in Bioinformatics: Towards System Biology 309

13.1 Introduction 309

13.2 Brief Overview of the Bioconductor Project 310

13.3 Experimental Data 311

13.4 Annotation 318

13.5 Models of Biological Sytems 328

13.6 Conclusion 335

13.7 Acknowledgment 336

Refernces 336

Index 339

Read More Show Less

Customer Reviews

Be the first to write a review
( 0 )
Rating Distribution

5 Star


4 Star


3 Star


2 Star


1 Star


Your Rating:

Your Name: Create a Pen Name or

Barnes & Review Rules

Our reader reviews allow you to share your comments on titles you liked, or didn't, with others. By submitting an online review, you are representing to Barnes & that all information contained in your review is original and accurate in all respects, and that the submission of such content by you and the posting of such content by Barnes & does not and will not violate the rights of any third party. Please follow the rules below to help ensure that your review can be posted.

Reviews by Our Customers Under the Age of 13

We highly value and respect everyone's opinion concerning the titles we offer. However, we cannot allow persons under the age of 13 to have accounts at or to post customer reviews. Please see our Terms of Use for more details.

What to exclude from your review:

Please do not write about reviews, commentary, or information posted on the product page. If you see any errors in the information on the product page, please send us an email.

Reviews should not contain any of the following:

  • - HTML tags, profanity, obscenities, vulgarities, or comments that defame anyone
  • - Time-sensitive information such as tour dates, signings, lectures, etc.
  • - Single-word reviews. Other people will read your review to discover why you liked or didn't like the title. Be descriptive.
  • - Comments focusing on the author or that may ruin the ending for others
  • - Phone numbers, addresses, URLs
  • - Pricing and availability information or alternative ordering information
  • - Advertisements or commercial solicitation


  • - By submitting a review, you grant to Barnes & and its sublicensees the royalty-free, perpetual, irrevocable right and license to use the review in accordance with the Barnes & Terms of Use.
  • - Barnes & reserves the right not to post any review -- particularly those that do not follow the terms and conditions of these Rules. Barnes & also reserves the right to remove any review at any time without notice.
  • - See Terms of Use for other conditions and disclaimers.
Search for Products You'd Like to Recommend

Recommend other products that relate to your review. Just search for them below and share!

Create a Pen Name

Your Pen Name is your unique identity on It will appear on the reviews you write and other website activities. Your Pen Name cannot be edited, changed or deleted once submitted.

Your Pen Name can be any combination of alphanumeric characters (plus - and _), and must be at least two characters long.

Continue Anonymously
Sort by: Showing 1 Customer Reviews
  • Anonymous

    Posted November 26, 2010

    This book does NOT contain all that is advertised - Poor textbook

    The book is supposed to include:

    The description of this book promised:
    . Offers programming examples and datasets
    . Includes chapter problem sets, a glossary, a list of statistical notations, and appendices with references to background mathematical and technical material
    . Features supplementary materials, including datasets, links, and a statistical package available online

    While there are occasional programming examples and datasets, there are NO chapter problem sets, gloosary, statistical notations or appendices. There is NO link provided to an online site with supplementary materials, such as datasets, links and statistical package.

    Was this review helpful? Yes  No   Report this review
Sort by: Showing 1 Customer Reviews

If you find inappropriate content, please report it to Barnes & Noble
Why is this product inappropriate?
Comments (optional)