Comment on W.S. Cleveland, A Model for Studying Display Methods of Statistical Graphics. Leland Wilkinson SYSTAT, Inc. and Northwestern University

Similar documents
Framework for Comparative Research on Relational Information Displays

Object vision (Chapter 4)

V1 (Chap 3, part II) Lecture 8. Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Princeton University, Fall 2017

IAT 355 Perception 1. Or What You See is Maybe Not What You Were Supposed to Get

ID# Exam 1 PS 325, Fall 2003

Neural circuits PSY 310 Greg Francis. Lecture 05. Rods and cones

IAT 355 Visual Analytics. Encoding Information: Design. Lyn Bartram

Fundamentals of Cognitive Psychology, 3e by Ronald T. Kellogg Chapter 2. Multiple Choice

Competing Frameworks in Perception

Competing Frameworks in Perception

Today: Visual perception, leading to higher-level vision: object recognition, word perception.

M Cells. Why parallel pathways? P Cells. Where from the retina? Cortical visual processing. Announcements. Main visual pathway from retina to V1

Sensation and Perception

(Visual) Attention. October 3, PSY Visual Attention 1

Sensation and Perception

Visual Perception. Agenda. Visual perception. CS Information Visualization January 20, 2011 John Stasko. Pre-attentive processing Color Etc.

9.65 Sept. 12, 2001 Object recognition HANDOUT with additions in Section IV.b for parts of lecture that were omitted.

Sensation vs. Perception

l3;~~?~~~,'0~'~~t~t:~:~~~~~~~~~~!,1

EDGE DETECTION. Edge Detectors. ICS 280: Visual Perception

Unit 7 Comparisons and Relationships

Perception. Chapter 8, Section 3

Supporting Information

Definition Slides. Sensation. Perception. Bottom-up processing. Selective attention. Top-down processing 11/3/2013

= add definition here. Definition Slide

Does scene context always facilitate retrieval of visual object representations?

Lecture 2.1 What is Perception?

UNIT 1CP LAB 1 - Spaghetti Bridge

Chapter 5: Perceiving Objects and Scenes

VISUAL PERCEPTION & COGNITIVE PROCESSES

Dikran J. Martin. Psychology 110. Name: Date: Making Contact with the World around Us. Principal Features

Psychological Perspectives on Visualizing Uncertainty. Barbara Tversky Stanford University

Chapter 8: Visual Imagery & Spatial Cognition

The Perceptual Experience

Visual Perception. Agenda. Visual perception. CS Information Visualization August 26, 2013 John Stasko. Pre-attentive processing Color Etc.

Natural Scene Statistics and Perception. W.S. Geisler

Principals of Object Perception

Chapter 5: Perceiving Objects and Scenes

A Direct Object of Perception

Laws of Attraction: From Perceived Forces to Conceptual Similarity

Fundamentals of Psychophysics

Unit 4: Sensation and Perception

To What Extent Can the Recognition of Unfamiliar Faces be Accounted for by the Direct Output of Simple Cells?

Pattern Recognition. Organization of Lectures. Complexities of Perception

Goodness of Pattern and Pattern Uncertainty 1

OPTO 5320 VISION SCIENCE I

Spatial Prepositions in Context: The Semantics of near in the Presence of Distractor Objects

General recognition theory of categorization: A MATLAB toolbox

SENSES: VISION. Chapter 5: Sensation AP Psychology Fall 2014

How Far Away Is That? It Depends on You: Perception Accounts for the Abilities of Others

ID# Exam 1 PS 325, Fall 2007

Visual Design. Simplicity, Gestalt Principles, Organization/Structure

CS Information Visualization September 7, 2016 John Stasko. Identify visual features that are and are not pre-attentive

Recognizing Scenes by Simulating Implied Social Interaction Networks

Human Perception. Topic Objectives. CS 725/825 Information Visualization Fall Dr. Michele C. Weigle.

Psychology of Perception Psychology 4165, Spring 2015 Laboratory 1 Noisy Representations: The Oblique Effect

Edward Tufte s Principles & Guidelines of Information Design

Chapter 7: Descriptive Statistics

Perception Outline Chapter 6, Psychology, David G Meyers, 7 th Edition

Practice Test Questions

Significance of Natural Scene Statistics in Understanding the Anisotropies of Perceptual Filling-in at the Blind Spot

CHAPTER 2: PERCEPTION, SELF, AND COMMUNICATION

Visual Selection and Attention

Architectural Visualization

Automaticity of Number Perception

Review #6 ( )

LEA Color Vision Testing

Accounting for cognitive and perceptual biases in computational models of human movement

Vision Seeing is in the mind

Reveal Relationships in Categorical Data

Unit 1 Exploring and Understanding Data

Visual Search: A Novel Psychophysics for Preattentive Vision

JUDGMENTAL MODEL OF THE EBBINGHAUS ILLUSION NORMAN H. ANDERSON

Note:- Receptors are the person who receives any images from outer environment.

August 30, Alternative to the Mishkin-Ungerleider model

Psicológica ISSN: Universitat de València España

Lecturer: Rob van der Willigen 11/9/08

An experimental investigation of consistency of explanation and graph representation

7. Sharp perception or vision 8. The process of transferring genetic material from one cell to another by a plasmid or bacteriophage

Using Perceptual Grouping for Object Group Selection

AP PSYCHOLOGY 2012 SCORING GUIDELINES

Psychology Unit 3 Test

Lecturer: Rob van der Willigen 11/9/08

Lecture 9: Lab in Human Cognition. Todd M. Gureckis Department of Psychology New York University

Classic experiments in sensation and perception

What is mid level vision? Mid Level Vision. What is mid level vision? Lightness perception as revealed by lightness illusions

Stats 95. Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures

Paper Airplanes & Scientific Methods

Gathering and Repetition of the Elements in an Image Affect the Perception of Order and Disorder

Learning Classifier Systems (LCS/XCSF)

Reading Assignments: Lecture 5: Introduction to Vision. None. Brain Theory and Artificial Intelligence

2/3/17. Visual System I. I. Eye, color space, adaptation II. Receptive fields and lateral inhibition III. Thalamus and primary visual cortex

Structure of the eye and retina

The lowest level of stimulation that a person can detect. absolute threshold. Adapting one's current understandings to incorporate new information.

Spatial Orientation Using Map Displays: A Model of the Influence of Target Location

Psych 333, Winter 2008, Instructor Boynton, Exam 2

Cognitive Psychology. Robert J. Sternberg EDITION. Yak University THOIVISOISI * WADSWORTH

Auditory scene analysis in humans: Implications for computational implementations.

Cognitive Processes PSY 334. Chapter 2 Perception

Caveats for the Spatial Arrangement Method: Supplementary Materials

Transcription:

Comment on W.S. Cleveland, A Model for Studying Display Methods of Statistical Graphics Leland Wilkinson SYSTAT, Inc. and Northwestern University Revised version published in Journal of Computational and Graphical Statistics, 2, 1993, 355-360. The social psychologist Susan Fiske has shown that commonsense notions about human behavior are as often wrong as right. For example, the popular maxim "opposites attract" is generally false. Numerous experiments by another psychologist, Amos Tversky, have demonstrated that we -- novices and Bayesian statisticians alike -- are poor judges of quantity and chance. Finally, perceptual experiments, including some by Bill Cleveland himself, have shown that statisticians and other humans succumb to visual illusions when viewing statistical graphs. Martin Gardner, Richard Feynman, and the Amazing Randi have all shown how easy it is for scientists to fool themselves. Because we all think we are expert psychologists, we are at greatest risk when we study ourselves and our perceptions. Bill Cleveland has done a service to statisticians by grounding discussions about graphics in experimentation. His ideas on graphical elements, based on a series of experiments (Cleveland, 1985), have influenced statistics packages (e.g. S +, STATA, SYSTAT) and have inspired further experimentation in graphical perception (see Spence and Lewandowsky, 1990). An experimental viewpoint should not diminish the work of graphic designers and others who have creative instincts for good displays. Bertin and Tufte, for example, have shown that effective graphs need not be dull or lack style. We must be suspicious of any design prescription that is not supported by experimental results, however. Good design does not always lead to effective statistical graphics. Most of Cleveland's early experiments concerned graphical elements -- lines, angles, areas, volumes, colors. This paper applies his thinking to graphical composites -- reference grids, plotting symbols, and aspect ratios. Although its title is "A Model for Studying Display Methods of Statistical Graphics," it is really more concerned with a variety of approaches to evaluating the use of these composites. While I cannot disagree with most of his conclusions about good usage, I find the model itself somewhat restrictive. Cleveland wishes to stay grounded in perceptual psychology, but the topics he discusses also involve areas of higher cognition. Cognitive psychologists such as Pinker (1990), Simken and Hastie (1987), and Kosslyn (1989) have proposed process models for graphical perceptual processing. Statisticians, on the other hand, like to think of the meaning of a graph as predefined: construct a graph properly and its meaning will be self-evident. A schematic (box) plot, for example, is a summary of the first few letter values of a batch. Viewers don't always see it that way, however. They may read meaning into the width of the box even when it is constant in every box they see.

A process model of graphical information processing covers all the stages in the graphical communication event, from the statistician who has information to communicate, to the viewer who makes a judgment about that information. These stages are: 1. The quantitative/qualitative information 2. The retinal image 3. The decomposition in the visual cortex 4. Integration and transformation via temporary storage in short term memory and schemas accessed in long term memory Each of these stages in the process model has implications for some of the points covered in Cleveland's paper. Here are a few. 1. Quantitative/qualitative information. Cleveland mentions that "There are two types of information in the data region of a graph - quantitative and categorical." Actually, quantitative and categorical information is part of what the statistician intends to convey to the viewer. The actual information in the data region is a more complex arrangement of texture, form, edge, and other features. Sometimes categorical information is mapped onto quantitative features, such as angles or line lengths and quantitative information is mapped onto categorical features such as numerals. This distinction between the information the statistician intends to convey and the actual organization of the graph is important because most of the formal arguments about graphs involve this stage. The statistician must select data features to highlight before constructing the graph. A single graph cannot always reveal everything about the data. For example, Figure 1 shows data from a study of neural firings in a cat's retina (Levine et al., 1987).. The cat figure parallels Cleveland's Figure 1. The lower graph in the cat figure uses Cleveland's median absolute slopes procedure to set the physical scale optimally for local segments in the plot. As Cleveland has noted, however, the slopes procedure does not specify which frequency to highlight in the series. The upper cat plot is scaled to reveal a low frequency (approximately 2.5 second) wave in the firings. This frequency component was caused by the cat's respiration, which affected blood oxygen and moderated the firings. Figure 2 shows how information can be emphasized by the addition of a feature. Cleveland's Figure 2 contains a dot plot of agricultural data. Adding connecting lines highlights two crossovers in the Grand Rapids and University Farm plots. This highlighting can interfere with the perception of other aspects of the graph, however. A well constructed graph from a formal point of view may nevertheless be misperceived due to cognitive processes in the later stages. Let's review how these come into play. 2. The retinal image. The initial image of the perceived graph is on the retina. This image differs in important ways from the physical image because lighting, viewing position, and other factors can create different retinal images of the same graph. Black and white graphs with high contrast are often preferable to ornate colored ones because these produce more constant retinal images under different lighting and viewing conditions.

3. The visual cortex. Retinal images are transformed in the visual pathway by decomposing them into features such as orientation and texture. Cleveland's discussion about core-cue symbols is based on this stage of visual processing. The operations at this level are highly parallel and organized to extract spatial frequency, orientation, and other features needed to construct complex visual scenes. I believe Cleveland is appropriate in grounding his discussion in the experimental literature on texture perception and discrimination. Cleveland's selection of core-cue symbols is based on studies of pairwise discriminability, however. There are likely configural effects when more than two of these symbols are used in a plot. A global recommendation may not apply in all cases. While we're in the business of recommending symbols, however, I propose an alternative set. The symbols Cleveland offers are discriminable mainly because they stimulate different sets of feature detectors in the visual cortex, namely verticals, horizontals, and diagonals. Why not use a set based on the elements which stimulate these feature detectors more exclusively? The set 'o',' ','-','x' is used in Figure 3. Compare the results with Cleveland's Figure 5. This set, while not shown in Cleveland's Table 1, performs as well as any in texture experiments. Incidentally, some of Cleveland's results in earlier studies are due to the operation of vertical, horizontal, and diagonal feature detectors in the visual cortex. Lines near verticals, horizontals, and 45 degree diagonals tend to be resolved toward these canonical positions. This is one reason there are visual biases near these orientations. These biases become even more pronounced as information is stored in long term memory. 4. Integration and transformation through schemas. Making a judgment about a graph involves integrating the features detected in the visual cortex by making use of a short term memory store -- sometimes verbal, sometimes iconic -- and schemas residing in long term memory. While the visual cortical operations are highly parallel, the operations at this stage are both parallel and serial. Short term memory allows temporary (less than half a minute) storage of information in order to perform serial operations. For example, what Cleveland calls "table look-up" is a process of short term memory access which occurs at this stage. The viewer stores perceived information temporarily in order to construct higher-order comparisons such as scale references. Because of limitations in this store, only a few (five to ten) distinct pieces of information can be stored simultaneously. This is why higher order interaction plots are difficult, if not impossible, to interpret. The number of comparisons required to understand interactions increase exponentially. Cleveland's gridding task is a good example in which processing at this stage depends on props (such as grid lines) that help in the temporary storage of information for subsequent use in higher order comparisons such as the ranking of minima among curves. Information is stored and accessed in short term memory via schemas available in long term memory. This is where some fascinating biases enter into the judgment of statistical graphs. For example, Barbara Tversky and others have found that features in maps can distort Euclidean distance judgments. Two towns divided by a river, for example, are judged to be farther apart than two towns separated by the same distance on uniform terrain. These distortions have nothing to do with the visual illusions which distort area, angle, and hue perceptions which Cleveland and others have demonstrated in statistical maps and other graphs. They have more to do with what Amos Tversky has called "framing," in which schemas assembled from a lifetime of experiences serve as

templates for higher order judgments. Rivers are difficult to cross. Bar charts look like stacks of building blocks, so they have bases on the ground (at zero). Many of the arguments concerning realism in graphs (e.g. Becker and Cleveland, 1991) hinge on the effects of "meaning" on the perception of quantity. Realism is not an unalloyed blessing in graphics because it can invoke schemas which are useful for decoding real world images but wholly inappropriate for quantitative graphical displays. Chernoff argued this point for perception of real vs cartoon faces and the psychologists Haber and Biederman have done the same for real versus artificial scenes. In conclusion, Cleveland has provided some useful guidelines for selecting display methods based on controlled experimentation. By placing Cleveland's model in the context of the more general information processing model favored by most psychologists today, we can help to understand how and why distortions occur in the perception of graphs.

References Becker, R., and Cleveland, W.S. (1991). Take a broader view of scientific vixualization. Pixel, 2, 42-44. Cleveland, W.S. (1985). The Elements of Graphing Data. Monterey, CA: Wadsworth. Kosslyn, S.M. (1989). Understanding charts and graphs. Applied Cognitive Psychology, 3, 185-226. Levine, M.W., Frishman, L.J., and Enroth-Cugell, C. (1987). Interactions between the rod and the cone pathways in the cat retina. Vision Research, 27, 1093-1104. Pinker, S. (1990). A theory of graph comprehension. In R. Friedle, (Ed.), Artificial Intelligence and the Future of Testing. Norwood, NJ: Ablex, 73-126. Simkin, D., and Hastie, R. (1987). An information-processing analysis of graph perception. Journal of the American Statistical Association, 82, 454-465. Spence, I., and Lewandowsky, S. (1990). Graphical percption. In Modern Methods of Data Analysis. Beverly Hills, CA: Sage Publications.

Figure 1: Neural firing data from Levine et al.(1987)

Figure 2: Barley yields with line enhancement

Figure 3: Core-cue symbols