I’m planning to teach a new course on statistical graphics next spring.
Graphical methods play several key roles in statistics:
– “Exploratory data analysis”: finding patterns in raw data. This can be a challenge, especially with complex data sets.
– Understanding and making sense of a set of statistical analyses (that is, finding patterns in “cooked data”)
– Clear presentation of results to others (and oneself!)
Compared to other areas of statistics, graphical methods require new ways of thinking and also new tools.
The borders of “statistical graphics” are not precisely defined. Neighboring fields include statistical computing, statistical communication, and multivariate analysis. Neighboring fields outside statistics include computer programming and graphics, visual perception, data mining with the use of data capture tools, and graphical presentation.
Structure of the course:
Class meetings will include demonstrations, discussions of readings, and lectures. Depending on their individual interests, different students will have to master different in-depth topics. All students will learn to make clear and informative graphs for data exploration, substantive research, and presentation to self and others.
Students will work in pairs on final projects. A final project can be a new graphical analysis of a research topic of interest, an innovative graphical presentation of important data or data summaries, an experiment investigating the effectiveness of some graphical method, or a computer program implementing a useful graphical method. Each final project should take the form of a publishable article.
The primary textbook will be R Graphics, by Paul Murrell (to be published Summer, 2005).
Tentative list of topics and readings:
(There’s more in the readings than any one student would read; I imagine that many would simply serve as references, with particularly relevant excerpts photocopied.)
1. General practice:
1a. What do good (and bad) graphs look like?
The Visual Display of Quantitative Information and Envisioning Information (E. Tufte).
Visual Revelations (H. Wainer).
“How to display data badly” (H. Wainer). The American Statistician 38, 137-147 (1984).
1b. How to make useful and good-looking graphs. Making graphs in the open-source statistical package R.
R Graphics (P. Murrell).
An R and S-Plus Companion to Applied Regression (J. Fox).
Webpage of resources on statistical graphics (M. Friendly). http://www.math.yorku.ca/SCS/StatResource.html#DataVis
1c. What should be graphed?
The Elements of Graphing Data and Visualizing Data (W. Cleveland).
2. Graphics as a tool in scientific inference:
2a. Graphics as a primary data-analysis tool.
“Why are American Presidential election campaigns so variable when votes are so predictable?” (A. Gelman and G. King). British Journal of Political Science 23, 409-451.
2b. Graphics that enhance the understanding of a fitted model.
Generalized Additive Models (T. Hastie and R. Tibshirani).
“Enhancing democracy through legislative redistricting” (A. Gelman and G. King). American Political Science Review 88, 541-559.
“The B-K plot: making Simpson’s paradox clear to the masses” (H. Wainer). Chance 15 (3), 60-62.
2c. More elaborate graphical methods (multidimensional scaling, other multivariate models, dynamic graphics, interactive graphics, . . .)
“Multidimensional scaling, tree-fitting, and clustering” (R. Shepard). Science 210, 390-398.
“Computational methods for high-dimensional rotations in data visualization” (A. Buja, D. Cook, D. Asimov, and C. Hurley. In Handbook of Statistics, ed. E. Wegman and C. Rao (2005).
3. Statistical theories of graphics:
3a. Graphs as model-free exploratory data analysis.
Exploratory Data Analysis (J. Tukey).
3b. Graphics and models.
“All maps of parameter estimates are misleading” (A. Gelman and P. Price). Statistics in Medicine 18, 3221-3234 (1999).
“Exploratory data analysis for complex models” (A. Gelman, with discussion by A. Buja). Journal of Computational and Graphical Statistics (2004).
3c. Graphical methods as algorithms.
The Grammar of Graphics (L. Wilkinson).
4a. Cognitive models of how people read and understand numbers and graphs.
“Spatial Schemas in Depictions” (B. Tversky). In Spatial Schemas and Abstract Thought, ed. M. Gattis, ed (2001).
“A theory of graph comprehension” (S. Pinker). In Artificial intelligence and the future of testing, ed. R. Freedle (1990).
4b. Experimental studies of effectiveness of different graphical displays.
“Understanding charts and graphs” (S. Kosslyn). Applied Cognitive Psychology 3, 185-223 (1989).
“Conceptual limitations in comprehending line graphs” (P. Shah and P. Carpenter). Journal of Experimental Psychology: General 124, 43-61 (1995).
4c. Some examples of graphs as actually used in psychological research.
The Statistical Sleuth, second edition (F. Ramsay and D. Schafer).
5. Communication (to self and to others):
“Let’s practice what we preach: turning tables into graphs” (A. Gelman, C. Pasarica, and R. Dodhia). American Statistician 56, 121-130 (2002).
“The science of scientific writing” (G. Gopen and J. Swan). American Scientist 78, 550-558 (1990).
How to Lie with Statistics (D. Huff).
(I need a more updated reference here also—-something along the lines of “Statistics for Journalists”)