Back to Search

Data Mashups in R: A Case Study in Real-World Data Analysis

AUTHOR Li, Xiao-Yi; Leipzig, Jeremy
PUBLISHER O'Reilly Media (04/19/2011)
PRODUCT TYPE Paperback (Paperback)

Description

How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia.

This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis.

  • Parse messy data from public foreclosure auction postings
  • Plot the data using R's PBSmapping package
  • Import US Census data to add context to foreclosure data
  • Use R's lattice and latticeExtra packages for data visualization
  • Create multidimensional correlation graphs with the pairs() scatterplot matrix package
Show More
Product Format
Product Details
ISBN-13: 9781449303532
ISBN-10: 1449303536
Binding: Paperback or Softback (Trade Paperback (Us))
Content Language: English
More Product Details
Page Count: 36
Carton Quantity: 111
Product Dimensions: 6.70 x 0.20 x 9.10 inches
Weight: 0.20 pound(s)
Feature Codes: Price on Product, Maps, Illustrated
Country of Origin: US
Subject Information
BISAC Categories
Computers | Data Science - Data Modeling & Design
Computers | Software Development & Engineering - Computer Graphics
Computers | Languages - General
Dewey Decimal: 006.633
Library of Congress Control Number: 2011282889
Descriptions, Reviews, Etc.
publisher marketing

How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia.

This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis.

  • Parse messy data from public foreclosure auction postings
  • Plot the data using R's PBSmapping package
  • Import US Census data to add context to foreclosure data
  • Use R's lattice and latticeExtra packages for data visualization
  • Create multidimensional correlation graphs with the pairs() scatterplot matrix package
Show More

Author: Li, Xiao-Yi
Xiao-Yi Li is a biostatistician with an M.Sc. from University of Michigan. In fact, her entire education experience has be revolving statistics, a percentile or otherwise. Currently, she works in the bioinformatics group at DuPont as a statistical consultant. Her work consists mostly of design of experiments and analysis for phenotypic screens, quality control in microarrays, and association mapping.
Show More
List Price $14.99
Your Price  $14.84
Paperback