Data Mashups in R: A Case Study in Real-World Data Analysis
How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia.

This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis.

  • Parse messy data from public foreclosure auction postings
  • Plot the data using R's PBSmapping package
  • Import US Census data to add context to foreclosure data
  • Use R's lattice and latticeExtra packages for data visualization
  • Create multidimensional correlation graphs with the pairs() scatterplot matrix package
1142962840
Data Mashups in R: A Case Study in Real-World Data Analysis
How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia.

This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis.

  • Parse messy data from public foreclosure auction postings
  • Plot the data using R's PBSmapping package
  • Import US Census data to add context to foreclosure data
  • Use R's lattice and latticeExtra packages for data visualization
  • Create multidimensional correlation graphs with the pairs() scatterplot matrix package
14.99 In Stock
Data Mashups in R: A Case Study in Real-World Data Analysis

Data Mashups in R: A Case Study in Real-World Data Analysis

by Jeremy Leipzig, Xiao-Yi Li
Data Mashups in R: A Case Study in Real-World Data Analysis

Data Mashups in R: A Case Study in Real-World Data Analysis

by Jeremy Leipzig, Xiao-Yi Li

Paperback

$14.99 
  • SHIP THIS ITEM
    Qualifies for Free Shipping
  • PICK UP IN STORE
    Check Availability at Nearby Stores

Related collections and offers


Overview

How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia.

This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis.

  • Parse messy data from public foreclosure auction postings
  • Plot the data using R's PBSmapping package
  • Import US Census data to add context to foreclosure data
  • Use R's lattice and latticeExtra packages for data visualization
  • Create multidimensional correlation graphs with the pairs() scatterplot matrix package

Product Details

ISBN-13: 9781449303532
Publisher: O'Reilly Media, Incorporated
Publication date: 03/18/2011
Pages: 36
Product dimensions: 6.70(w) x 9.00(h) x 0.30(d)

About the Author

Jeremy Leipzig is a bioinformatics software developer at DuPont Crop Genetics. He has conducted academic research in viral integration, metagenomics, schizophrenia, and alternative splicing. While a graduate student, he developed one of the first faculty-review websites and wrote "Work Issues in Software Engineering", a survey-based study of "death march" projects.

Xiao-Yi Li is a biostatistician with an M.Sc. from University of Michigan. In fact, her entire education experience has be revolving statistics, a percentile or otherwise. Currently, she works in the bioinformatics group at DuPont as a statistical consultant. Her work consists mostly of design of experiments and analysis for phenotypic screens, quality control in microarrays, and association mapping.

Table of Contents

Introduction; Chapter 1: Mapping Foreclosures; 1.1 Messy Address Parsing; 1.2 Shaking the XML Tree; 1.3 The Many Ways to Philly (Latitude); 1.4 Exceptional Circumstances; 1.5 Taking Shape; 1.6 Developing the Plot; 1.7 Turning Up the Heat; Chapter 2: Statistics of Foreclosure; 2.1 Importing Census Data; 2.2 Descriptive Statistics; 2.3 Descriptive Plots; 2.4 Correlation; 2.5 Final Thoughts; Getting Started; Obtaining R; Quick and Dirty Essentials of R; O’Reilly Resources;

Jeremy Leipzig is a bioinformatics software developer at DuPont Crop Genetics. He has conducted academic research in viral integration, metagenomics, schizophrenia, and alternative splicing. While a graduate student, he developed one of the first faculty-review websites and wrote "Work Issues in Software Engineering", a survey-based study of "death march" projects.

Xiao-Yi Li is a biostatistician with an M.Sc. from University of Michigan. In fact, her entire education experience has be revolving statistics, a percentile or otherwise. Currently, she works in the bioinformatics group at DuPont as a statistical consultant. Her work consists mostly of design of experiments and analysis for phenotypic screens, quality control in microarrays, and association mapping.

From the B&N Reads Blog

Customer Reviews