Alan R. S. Ashworth III. Air Force Research Laboratory. Brooks AFB Texas & Yale University, USA. Quoc C. Vuong*

Similar documents
for Biological Cybernetics, T bingen, Germany d Unit Cognition et D veloppement & Laboratoire de Neurophysiologie,

Face Inversion Disproportionately Impairs the Perception of Vertical but not Horizontal Relations Between Features

Nonlinear relationship between holistic processing of individual faces and picture-plane rotation: evidence from the face composite illusion

Cognition xx (0000) xxx xxx. Brief article. A holistic account of the own-race effect in face recognition: evidence from a cross-cultural study

Viewpoint dependent recognition of familiar faces

Short article Detecting objects is easier than categorizing them

Viewpoint-dependent recognition of familiar faces

Expert Face Processing: Specialization and Constraints

The influence of feature conjunction on object inversion and conversion effects

Featural and configurational processes in the recognition of faces of different familiarity

Picture-plane inversion leads to qualitative changes of face perception

Recognition of Faces of Different Species: A Developmental Study Between 5 and 8 Years of Age

The respective role of low and high spatial frequencies in supporting configural and featural processing of faces

Configural information is processed differently in perception and recognition of faces

HOW DOES PERCEPTUAL LOAD DIFFER FROM SENSORY CONSTRAINS? TOWARD A UNIFIED THEORY OF GENERAL TASK DIFFICULTY

Interaction Between Social Categories in the Composite Face Paradigm. Wenfeng Chen and Naixin Ren. Chinese Academy of Sciences. Andrew W.

Discrimination and Generalization in Pattern Categorization: A Case for Elemental Associative Learning

Unraveling Mechanisms for Expert Object Recognition: Bridging Brain Activity and Behavior

BECOMING A GREEBLE EXPERT: EXPLORING MECHANISMS FOR FACE RECOGNITION

The Perception of Gender in Human Faces Samantha Haseltine Gustavus Adolphus College Faculty Advisor: Patricia Costello. Perception of Gender 1

Testing Conditions for Viewpoint Invariance in Object Recognition

Does Face Inversion Change Spatial Frequency Tuning?

Object Recognition & Categorization. Object Perception and Object Categorization

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Replacing the frontal lobes? Having more time to think improve implicit perceptual categorization. A comment on Filoteo, Lauritzen & Maddox, 2010.

The Disproportionate Face Inversion Effect in Recognition Memory

Context influences holistic processing of nonface objects in the composite task

Congruency Effects with Dynamic Auditory Stimuli: Design Implications

The obligatory nature of holistic processing of faces in social judgments

Differences of Face and Object Recognition in Utilizing Early Visual Information

Priming of Depth-Rotated Objects Depends on Attention and Part Changes

Hierarchical attention in discriminating objects at different levels of specificity

Face Perception - An Overview Bozana Meinhardt-Injac Malte Persike

Supplementary materials for: Executive control processes underlying multi- item working memory

Journal of Experimental Psychology: Animal Behavior Processes

Giulia Righi a, Jessie J. Peissig b & Michael J. Tarr c a Children's Hospital Boston, Harvard Medical School,

Learning to classify integral-dimension stimuli

Orientation Specific Effects of Automatic Access to Categorical Information in Biological Motion Perception

Grouped Locations and Object-Based Attention: Comment on Egly, Driver, and Rafal (1994)

Word Count: Please address correspondence to: James Tanaka, Department of Psychology, University

EFFECTS OF NOISY DISTRACTORS AND STIMULUS REDUNDANCY ON VISUAL SEARCH. Laurence D. Smith University of Maine

Object Substitution Masking: When does Mask Preview work?

Face detection: Mapping human performance

ARTICLE IN PRESS. Vision Research xxx (2008) xxx xxx. Contents lists available at ScienceDirect. Vision Research

Separating Cue Encoding From Target Processing in the Explicit Task- Cuing Procedure: Are There True Task Switch Effects?

The Role of Spatial Frequency in Expert Object Recognition

Mental Imagery. What is Imagery? What We Can Imagine 3/3/10. What is nature of the images? What is the nature of imagery for the other senses?

Rapid fear detection relies on high spatial frequencies

Task and object learning in visual recognition

Two Routes to Face Perception: Evidence from Psychophysics and Computational Modeling

Object detection and basic-level categorization: Sometimes you know it is there before you know what it is

SHORT REPORT The Development of Perceptual Expertise for Faces and Objects in Autism Spectrum Conditions

A FRÖHLICH EFFECT IN MEMORY FOR AUDITORY PITCH: EFFECTS OF CUEING AND OF REPRESENTATIONAL GRAVITY. Timothy L. Hubbard 1 & Susan E.

COGNITIVE FACTORS IN AIRCRAFT IDENTIFICATION 149

Anatomical limitations in mental transformations of body parts

The Thatcher illusion and face processing in infancy

PSYCHOLOGICAL SCIENCE. Research Article. Materials

Learning Within-Category Attribute Correlations in a One-Attribute Visual Search Classification Paradigm

Mental Rotation is Not Easily Cognitively Penetrable

Semantic Contamination of Visual Similarity Judgments

The Impact of Schemas on the Placement of Eyes While Drawing.

Does momentary accessibility influence metacomprehension judgments? The influence of study judgment lags on accessibility effects

Visual Transformation of Size

Does Facial Processing Prioritize Change Detection? Change Blindness Illustrates Costs and Benefits of Holistic Processing

The impact of item clustering on visual search: It all depends on the nature of the visual search

THE SPATIAL EXTENT OF ATTENTION DURING DRIVING

Evidence for holistic processing of faces viewed as photographic negatives

The Role of Feedback in Categorisation

Individual differences in the mixture ratio of rotation and nonrotation trials during rotated mirror/normal letter discriminations

Absolute Identification is Surprisingly Faster with More Closely Spaced Stimuli

Interpreting Instructional Cues in Task Switching Procedures: The Role of Mediator Retrieval

Observational Category Learning as a Path to More Robust Generative Knowledge

the remaining half of the arrays, a single target image of a different type from the remaining

Framework for Comparative Research on Relational Information Displays

Goodness of Pattern and Pattern Uncertainty 1

Interference with spatial working memory: An eye movement is more than a shift of attention

Who Needs Cheeks? Eyes and Mouths are Enough for Emotion Identification. and. Evidence for a Face Superiority Effect. Nila K Leigh

To what extent do unique parts influence recognition across changes in viewpoint?

Differing views on views: comments on Biederman and Bar (1999)

Spatial Orientation Using Map Displays: A Model of the Influence of Target Location

IAT 355 Perception 1. Or What You See is Maybe Not What You Were Supposed to Get

Are there Hemispheric Differences in Visual Processes that Utilize Gestalt Principles?

General recognition theory of categorization: A MATLAB toolbox

The Training and Transfer of Real-World Perceptual Expertise

A Comparison of Three Measures of the Association Between a Feature and a Concept

The Meaning of the Mask Matters

The Role of Color and Attention in Fast Natural Scene Recognition

Accounts for the N170 face-effect: a reply to Rossion, Curran, & Gauthier

Satiation in name and face recognition

The path of visual attention

THE DEVELOPMENT OF PERCEPTUAL EXPERTISE FOR FACES AND OBJECTS IN AUTISM SPECTRUM DISORDERS. Cara Damiano. Thesis. Submitted to the Faculty of the

Object recognition under sequential viewing conditions: evidence for viewpoint-specific recognition procedures

CONCEPT LEARNING WITH DIFFERING SEQUENCES OF INSTANCES

Does scene context always facilitate retrieval of visual object representations?

Intentional and Incidental Classification Learning in Category Use

To What Extent Can the Recognition of Unfamiliar Faces be Accounted for by the Direct Output of Simple Cells?

Translating From Perceptual to Cognitive Coding

The influence of irrelevant information on speeded classification tasks*

PERCEPTUAL EXPERTISE AFFECTS VISUAL SHORT-TERM MEMORY AND THE TIME-COURSE OF PERCEPTUAL ENCODING. Kim M. Curby

Transcription:

Recognizing rotated faces and Greebles: Is the inversion effect unique to faces? Alan R. S. Ashworth III Air Force Research Laboratory Brooks AFB Texas & Yale University, USA Quoc C. Vuong* Department of Cognitive & Computational Psychophysics, Max Planck Institute for Biological Cybernetics, Germany Bruno Rossion Unité Cognition et Développement & Laboratoire de Neurophysiologie, Université Catholique de Louvain, Belgium Michael J. Tarr Department of Cognitive & Linguistic Sciences, Brown University, USA Running Head: Face inversion * Corresponding author: Mailing address: Max Planck Institute for Biological Cybernetics, Spemannstrasse 38, 72076 Tuebingen, Germany; phone: +49 (0)7071 601-630; fax: +49 (0)7071 601-616; email: quoc.vuong@tuebingen.mpg.de

Face inversion 1 Abstract The fact that faces are strongly affected by picture-plane inversion has often been cited as evidence for face-specific mechanisms. It is unclear, however, whether this face inversion effect is restricted to faces or results from general mechanisms that compensate for pictureplane rotations. To address this issue, in Experiment 1 observers learned upright faces and were then tested with faces across a range of orientations for comparison with previous studies using non-face objects. Experiments 2 and 3 directly compared the recognition of faces and matched novel stimuli (Greebles) in a naming task using either single or multiple trained orientations, respectively. In all three experiments, performance systematically decreased with increasing misorientation from either the upright (Experiments 1 and 2) or nearest trained orientation (Experiment 3). Taken together, these results suggest that recognizing inverted faces is difficult because faces, as a stimulus class, have several properties known to prompt orientation dependence.

Face inversion 2 Introduction Images inverted in the picture plane are often more difficult to recognize than their upright counterparts, regardless of whether the images depict faces (Yin, 1969), common non-face objects (Jolicoeur, 1985) or novel 2D shapes (Tarr & Pinker, 1989). However, inversion appears to affect the recognition of faces disproportionately more than the recognition of many other non-face objects, and this effect has become known as the face inversion effect in the face recognition literature (e.g., Diamond & Carey, 1986; Scapinello & Yarmey, 1970; Valentine & Bruce, 1986; Yin, 1969). One potential difference between the processing of faces and non-face objects is the dominant visual information that is used for recognition purposes. In particular, several researchers have advocated that the processing of spatial relationships between facial features (such as the eyes and mouth), in addition to the features themselves, is critical for face recognition (see Maurer, Grand, & Mondloch, 2002, for a review). By comparison, non-face objects may be predominantly recognized on the basis of their component features and their qualitative, rather than quantitative, relations (e.g., Biederman, 1987). Following an early proposal by Yin (1969), researchers have further reported that inversion in the picture plane impairs configural rather than featural processing (e.g., Collishaw & Hole, 2002; Freire, Lee, & Symons, 2000; Goffaux & Rossion, under revision; Leder, Candrian, Huber, & Bruce, 2000; Tanaka & Farah, 1993). Others have argued that inversion leads to quantitative rather than qualitative changes in how faces are processed (e.g., Sekuler, Gasper, Gold, & Bennett, 2004; Riesenhuber, Jarudi, Gilad, & Sinha, 2004). In either case, observers are disproportionately more impaired at recognizing inverted faces than inverted non-face objects. Regardless of one s preferred account, the inversion effect observed for faces may partly be a consequence of specific properties of faces, which may or may not be shared by

Face inversion 3 other non-face categories. In particular, faces form a highly visually homogenous class. That is, they have a single dominant orientation with a main vertical axis and multiple parts arranged in similar configurations across individuals. In the present study, we focus on the possibility that the recognition of exemplars from a category sharing some of these properties may also be impaired by picture-plane inversion. To test this hypothesis, we directly compared the recognition of faces and novel Greebles (Gauthier & Tarr, 1997a). We used Greebles because they are mono-oriented with a main vertical axis, and they share a configuration of visually-similar parts: both factors that may play some role in the occurrence of large inversion effects. Although earlier studies have compared the effect of inversion on faces and other homogeneous object categories (e.g., houses, airplanes, dogs), they have almost all used a binary upright versus inverted contrast. Such a design makes it easy to attribute quantitative differences between categories to qualitative differences in processing. That is, if any difference in the effect of inversion between categories is found, it may appear as if different mechanisms are recruited by the two when inverted. Yet it is equally likely that the same mechanism is used to compensate for inversion for both categories, but is affected to different extents. To address this concern, we included misoriented faces and Greebles shown across a wide range of picture plane orientations. Thus, we can know whether the same quantitative patterns arise for the two categories regardless of the overall magnitudes of the effects. The logic here is similar to that prevalent in the object recognition literature, where the systematic rotation of images in the picture plane has provided key insights regarding the encoding and representation of objects, such as possible mechanisms to generalize from familiar to unfamiliar views (e.g., Jolicoeur, 1985; Tarr & Pinker, 1989); the role of visual similarity in recognition (e.g., Lawson & Jolicoeur, 1999; Murray, 1998); the importance of the level at which objects are recognized (e.g., Hamm & McMullen, 1998); and the effects of practice

Face inversion 4 with a stimulus set (e.g., Jolicoeur, 1985; Jolicoeur & Milliken, 1989; Murray, Jolicoeur, McMullen, Ingleton, 1993; Tarr & Pinker, 1989). Considered in toto these studies reveal a consistent pattern of results. Most notably, Jolicoeur (1985) demonstrated that familiar common objects rotated in the picture plane become increasingly more difficult to name as the stimulus is rotated further from an upright orientation (typically with respect to gravity). This initial orientation effect diminishes substantially with repeated exposure to the same stimuli, sometimes as soon as the second presentation of the stimuli (e.g., Jolicoeur & Milliken, 1989; Murray et al., 1993). Similar performance patterns have also been found for novel 2D shapes (e.g., Gauthier & Tarr, 1997b; Tarr & Pinker, 1989). A similar finer-grain analysis of the orientation effect for faces and how the effect may change with learning will aid in determining whether qualitatively similar processes are used for faces and Greebles across different picture-plane orientations. A number of studies of face recognition that have explored the recognition of faces rotated in the picture plane. Valentine and Bruce (1988) reported several experiments using faces rotated in 45 increments (0, 45, 90, 135, and 180 ). In one experiment, they measured the time subjects required to correctly identify famous faces rotated in the picture plane. In a second experiment, they measured the time subjects required to judge whether a sequence of two faces, a canonical upright face immediately followed by a rotated face, were the same or different. Both experiments revealed a linear relationship between response time and the magnitude of rotation from upright, suggesting that the recognition of rotated faces may involve normalization processes qualitatively similar to those used for the recognition of other objects. Following Valentine and Bruce (1988), several investigators have also presented faces at a larger number of orientations in conjunction with other manipulations, such as blurring or featural and configural changes (Bruyer, Galvez, & Prairial, 1993; Collishaw & Hole, 2002;

Face inversion 5 Lewis, 2001; Murray, Yong, & Rhodes, 2000; Sjoberg & Windes, 1992; Stürzel & Spillman, 2000). Not surprisingly, in these studies, there are again similar response patterns dependent on the magnitude of rotation of faces away from the canonically upright orientation rather than an all-or-none decrease of performance or increase in response times for upside-down faces. In the majority of studies, as in Valentine and Bruce s initial study, this pattern is linear. Although some studies have reported some non-linearity in the data, this non-linearity seems to be localized to particular orientations and specific tasks involving categorical decisions (e.g., Murray et al., 2000; Stürzel & Spillman, 2000; see Lewis, 2001). A further issue when comparing orientation effects on faces and non-face objects is the default level of identification at which stimuli are recognized. Faces are typically recognized at the individual level. By comparison, objects are recognized at a basic or entry level, the level at which object shapes are best distinguished from one another (Rosch, Mervis, Gray, Johnson, & Boyes-Braem, 1976). Studies have shown that recognizing objects at the subordinate level (e.g., collies vs. poodles) prompts the largest orientation effects, whereas recognizing objects at the basic level (e.g., dog vs. cat) or superordinate level (e.g., animal vs. artifact) incurs little or no orientation effects (Hamm & McMullen, 1998). To take this factor into account, subjects in the current study named individual faces and Greebles. To summarize, the literature suggests that the recognition of both faces and non-face objects shows similar linear response patterns across picture-plane rotations. However, no study has directly compared orientation effects for unfamiliar faces and appropriately matched novel non-face objects (Brooks, Rosielle, & Cooper, 2002, compared priming in faces and common objects). Moreover, no previous studies have examined how the orientation effect changes with learning for faces in non-upright orientations (other than the inverted orientation; Robbins & McKone, 2003), or have compared learning effects with faces and matched non-face objects.

Face inversion 6 Experiment 1 In Experiment 1, we did not directly compare faces and Greebles. Rather, the goal here was to first determine the systematic effect of picture-plane rotation on the recognition of unfamiliar faces using a standard old/new recognition task (Yin, 1969). Although previous investigators systematically manipulated the orientation of faces, they did not directly address how long-term representations of unfamiliar faces are formed and later recognized. For example, they used a famous/not-famous discrimination task which requires access to preexperimental familiarity with the faces (e.g., Bruyer et al., 1993; Collishaw & Hole, 2002; Valentine & Bruce, 1988). Furthermore, they have often degraded or otherwise deformed their face stimuli (e.g., Collishaw & Hole, 2002; Murray et al., 2000; Stürzel & Spillman, 2001). Method Subjects. The 30 participants in Experiment 1 were undergraduate students and other members of the Yale University community. The undergraduates participated to fulfill a class requirement while the others were paid for their participation. All subjects had normal or corrected-to-normal vision. All subjects used their dominant hand for responding. None of the subjects participated in any of the other experiments reported here. Material and Stimuli A total of 144 digitized male faces were used as stimuli. The stimuli were scanned from full face photographs taken from a recent Harvard Business School yearbook. All scans were 256 shades of gray. Faces with distinguishing characteristics or accessories such as facial hair, scars, or glasses were excluded. The faces were normalized so that the eyes were

Face inversion 7 horizontal, and were rescaled to be approximately the same size. A circular region was superimposed on the central portion of each face, thus excluding most of the featural extremities such as ears, hair, and bottom of the chin. The region enclosed by the circle was then removed from the original background and placed into a white circular background within a black surround. Therefore, upon presentation the silhouette of each face appeared round, and the face itself was centered in a white circle. The stimuli were presented on a high resolution color monitor, driven by an IBM compatible 80486DX-33 computer. Subjects used a chinrest that was 45 cm from the monitor. The stimuli had a diameter of 10 cm, resulting in a visual angle of approximately 12. Design and Procedure An old/new recognition paradigm was used in Experiment 1. Subjects first studied a series of faces and then were asked to recognize these faces from a group consisting of both studied and unstudied faces. Experiment 1 consisted of three identical blocks. Each block consisted of a learning phase followed by a testing phase. Learning phase. In the learning phase, all of the targets were presented in the canonical upright (vertical) orientation, and subjects were instructed to keep their heads vertical. The subjects were explicitly informed that they would be asked to recognize the target faces later. They were not, however, informed that the faces would be rotated in the picture plane during the testing phase. Subjects were then presented with a set of 24 different faces to study. Each face was presented individually for 5 sec. The entire set was presented three times, and subjects performed study tasks designed to enhance memory for the faces (Bower & Karlin, 1974). On the first pass through the set, subjects simply viewed the faces. On the second pass, they rapidly verbalized personality assessments of each face. On the third

Face inversion 8 pass, they rated each face for likeability on a 1-7 Likert scale using the keyboard. The learning phase was followed immediately by the test phase. Test phase. During the test phase, stimuli were presented at 12 orientations in the picture plane, beginning at 0, and moving clockwise at intervals of 30. The 24 studied faces were mixed with 24 unstudied faces. Two studied and two unstudied faces were shown at each of the 12 orientations. All faces were shown only once per block, making 48 trials per block. As noted above, there were three blocks using the same procedure. For each of the three blocks, a new set of studied and unstudied faces were used. Trial order was randomized for each block and for each subject. At test, the subjects task was to identify each presented face as either old or new depending upon whether the face in question had been presented or not in the preceding learning phase. Stimulus presentation was preceded and followed by a pattern mask that subtended the same visual angle as the stimuli. The mask was a gradient fill from black to white, starting with black in the center and becoming progressively lighter along the radius, thus giving the effect of a dark centered starburst that blended from black to gray to white across concentric circles. Responses were made by pressing keys labeled old or new on a standard IBM keyboard. The face remained displayed until the subject responded. Accuracy and response times were recorded for each response. There was a short rest break between each block. Results In Experiment 1 and in the subsequent experiments reported, three common analyses were conducted. First, we analyzed the effect of orientation in the picture plane on response times (RTs). Second, we regressed RTs against orientation to determine the slope of the bestfit line. The slope provides a quantitative measure of the effect of orientation across different

Face inversion 9 stimuli and experimental procedures. Third, we analyzed the effect of orientation on accuracy to ensure that subjects were not trading speed for accuracy. For all analyses of RTs reported in the present study, we calculated median values from correct trials for each subject. In addition, both RT and accuracy data were collapsed across distance from 0. That is, the data were folded around 180 with 150 and 210 averaged, 120 and 240 averaged, and so on. However, to better represent subject performance, we have plotted our dependent measures at all orientations. An α=0.05 was adopted to test for any significant effects. Response times. The top panel of Figure 1 plots the relationship between mean RT and orientation in the picture plane. The point plotted at 360 is a duplicate of that at 0 and is included for clarity (this is also the case for subsequent experiments). The RT data were entered into an Analysis of Variance (ANOVA) with orientation (0-180 ) as a repeated factor, and a linear contrast was computed for orientation. As evident in the top panel of Figure 1, there was a significant effect for orientation in both the omnibus analysis, F(6,174)=16.78, p<.001, and the linear contrast, F(1,29)=39.46, p<.001. After computing the linear contrast, there was no significant residual variance associated with the orientation factor. ------ Insert Figure 1 about here ------ Regression. The regression analysis showed that the slope of the orientation function in Experiment 1 was 2.78 ms/. Accuracy. The accuracy data were used in a signal detection analysis. The bottom panel of Figure 1 shows the sensitivity (d ) as a function of orientation in the picture plane. The shape of the sensitivity function mirrors that of the response time function, with the

Face inversion 10 lowest sensitivity occurring approximately at 180. Although low, sensitivity at the inverted orientation and the two adjacent orientations remained significantly above chance: 150 orientation, t(29)=3.69, p<.001; 180 orientation, t(29)=4.70, p<.001; 210 orientation, t(29)=6.17, p<.001. The d data were entered into an ANOVA with orientation as a repeated factor. A linear contrast was also computed for the orientation factor. There was a significant effect for orientation in both the omnibus analysis, F(6,174)=38.76, p<.001, and the linear contrast, F(1,29)=149.05, p<.001. After computing the linear contrast, there was no significant residual variance associated with the orientation factor. There was no speed/accuracy tradeoff. Discussion Consistent with previous studies, the results of Experiment 1 indicate that there was a linear relationship between recognition performance and orientation (Bruyer et al., 1993; Collishaw & Hole, 2002; Valentine & Bruce, 1988). These results help to extend earlier studies to non-degraded unfamiliar faces and to a larger set of orientations in the picture plane. In some earlier studies, a deviation from linearity was observed, either taking the form of a switch at around 90-120 (Murray et al., 2000; Stürzel & Spillman, 2000) or a small dip between 150 and 180 of rotation (Lewis, 2001). This non-linearity was taken as evidence for qualitatively different mechanisms specific to faces. However, these effects were all obtained using face stimuli with eyes and mouth inverted ( Thatcherized faces, Thompson, 1980) which observers judged for grotesqueness. The conjunction of abnormal stimuli and a binary judgment task may have prompted the subjects to select a boundary associated with their criterion of where there was a change (Lewis, 2001). Moreover, a dip at 180 is often observed for rotated non-face objects (e.g., Jolicoeur, 1985).

Face inversion 11 Our results are also similar to object recognition studies using non-face objects. In particular, the slope of our orientation function (2.78 ms/ ) fell within the range typically reported for the recognition of misoriented non-face objects (e.g., 1.79-2.57 ms/ for nonsense two-dimensional shapes, Cooper, 1975; 2.9 ms/ for letters and numerals, Cooper & Shepard, 1973; 1.79-2.94 ms/ for two-dimensional polygons, Cohen & Kubovy, 1993; 2.3 ms/ for line drawings of common objects, Jolicoeur, 1985; 2.07 ms/ for simple letter-like line drawings, Tarr & Pinker, 1989). Overall, the linear response pattern in both response times and accuracy suggests that the mechanisms used to compensate for misorientation of face stimuli are similar to those used in normal object recognition and, consequently, there is no reason to assume that configural information is disrupted any more by inversion in the case of faces as compared to other object categories (Valentine & Bruce, 1988). Experiment 2 In Experiment 1, we compared performance with faces to previously published results with non-face objects. In Experiment 2, we directly compare the recognition of faces and appropriately-matched Greebles using a naming task. Like faces, Greebles are visually similar, they are symmetric with smooth surfaces, they have a canonical upright orientation with a main vertical axis, and exemplars are distinguished on the basis of both parts and configurations of parts. However, they are not perceived as face stimuli (Gauthier, Behrmann, & Tarr, 2004). Second, we used a naming task to ensure that both faces and Greebles were recognized at the individual level. The naming task also allowed us to measure practice effects for repeated stimuli. Finally, although we did not equate each observer s level of expertise with faces and Greebles, we provided observers with significant experience recognizing both unfamiliar faces and Greebles.

Face inversion 12 Method Subjects The 72 subjects participating in Experiment 2 were drawn from the same populations as Experiment 1. None of the subjects participated in any of the other experiments reported here. Material and Stimuli Two types of stimuli were used for Experiment 2. The first type consisted of faces from Experiment 1. A total of 18 faces were used. Three sets of stimuli were made such that each set contained six different target faces, with the remaining twelve faces used as distractors. In this manner all faces served as targets and distractors equally across subjects. Novel Greebles comprised the second type of stimuli. Examples of these objects in their upright orientation are shown in Figure 2. The Greebles served as controls for the faces, and they were equated with the faces along several stimulus dimensions. Importantly, all Greebles share the same major features arranged in the same overall configuration, thus forcing discrimination among them to be based on the subtle differences among common features (Gauthier & Tarr, 1997a). A total of 15 Greebles were used. Three sets were made such that each set contained five different Greebles as named targets, with the remaining ten used as distractors. In this manner all Greebles served as targets and distractors equally across subjects. Note that the silhouettes of the Greebles are not as homogeneous as those of faces. We expect this potential additional source of information might facilitate recognition of the Greebles. The apparatus was the same as Experiment 1. ------ Insert Figure 2 about here ------

Face inversion 13 Design and Procedure In Experiment 2, subjects first learned the names of target faces or Greebles. The names were attached to keys on the keyboard, and during the experiment subjects pressed the key with the name that corresponded to the stimuli presented. Thirty-six subjects participated in the face group, while 36 subjects participated in the Greeble group. For both stimulus types, there was learning phase followed by a testing phase. In the learning phase, all of the target stimuli to be learned were presented in the canonical upright orientation and subjects were instructed to keep their heads vertical at all times. Subjects were explicitly informed that later in the experiment they would be asked to identify each of the target stimuli by pressing the key with its corresponding name. They were not, however, informed that the targets would be rotated during the testing phase. Learning phase. During the learning phase, the stimuli were only presented in their upright orientation. Within this phase, there were two blocks. The purpose of these blocks was to enable the subjects to learn the name for each stimulus, and map the name of each stimulus onto the correct response key. For faces, there were five presentations of each of the six target faces with its corresponding name for a total of 30 trials on the first learning block. Subjects were instructed to study each face for the entire 5 sec presentation, and then press the key with the appropriate name after the stimulus cleared the display. The names used were Bob, Dan, Jim, Lee, Ray, and Wes. Next, there were 20 presentations of the six target faces without its name for a total of 120 trials on the second learning block. For these learning trials, subjects were instructed to respond accurately yet quickly. Subjects wore headphones through which they were given accuracy feedback. For a correct response, there was no sound. Incorrect responses were followed by a beep. Presentation order was randomized for each of these learning trial groups for each subject.

Face inversion 14 For Greebles, subjects were presented with five repetitions of the five target Greebles with its corresponding name for a total of 25 trials on the first learning block. The names used were Bob, Dan, Jim, Lee, and Ray. Next, there were 20 repetitions of the five target Greebles for a total for 100 trials on the second learning block. Test phase. Following the two learning blocks, there were two identical test blocks in the test phase, separated by a short break. Each of the six target faces was shown twice at each of the twelve orientations (0-330, in 30 steps) for a total of 144 trials. Each of the 12 distractor faces was shown at four orientations for a total of 48 trials. The 48 distractor trials were distributed equally across the twelve orientations. Thus, there were 192 trials per block in total. Similarly, each of the five target Greebles was shown twice at each of the twelve orientations for a total of 120 trials. Each of the ten distractor Greebles was shown at four orientations for a total of 40 trials. The 40 distractor trials were distributed as equally as possible across the twelve orientations. Thus, there were 160 trials per block. For both faces and Greebles, 75% of the trials were target trials and 25% of the trials were distractor trials. The presentation order was randomized for each testing block for each subject. The subjects task was to accurately and quickly identify each face by pressing the key with the appropriate name. There was a key labeled NA for none of the above that was to be pressed upon presentation of a distractor. Stimulus presentation was preceded and followed by the same pattern mask used in Experiment 1, and the trials were response terminated. Accuracy feedback was provided through headphones, with subjects hearing a beep for incorrect responses.

Face inversion 15 Results Response times. The top panels of Figures 3 and 4 show the relationship between mean RT, orientation, and block, separately for faces and Greebles. The RT data were entered into a mixed-design ANOVA with stimulus type (face, Greebles) as a between-subjects factor, and orientation (0-180 ) and block (1-2) as repeated factors. A linear contrast was also computed for orientation. ------ Insert Figures 3 and 4 about here ------ All main effects were significant: stimulus type, F(1,70)=18.54, p<.001; orientation, F(6,420)=29.56, p<.001; and block, F(1,70)=165.00, p<.001. There was also a significant stimulus type x block interaction, F(1,70)=17.17, p<.001. No other interactions were significant. In addition, the linear contrast of orientation was also significant, and only interacted with block: linear contrast, F(1,420)=87.27, p<.001; linear contrast x block interaction, F(1,70)=4.91, p<.05. After computing the linear contrast, there was no significant residual variance associated with the orientation factor. Recall that, as originally postulated by Yin (1969), inversion (180 rotation) should disproportionately affect faces more than non-face objects. Thus, we also analyzed response time at 0 and 180 averaged across blocks for each stimulus type in a mixed-designed ANOVA. For this analysis, there was a significant effect of stimulus type, F(1,70)=7.75, p<0.001; orientation, F(1,70)=69.42, p<0.001; and no interaction between stimulus type and orientation, F<1. Thus, although subjects were generally slower with Greebles, rotating the image by 180 was equally detrimental to both faces and Greebles (as indicated by the lack of interactions with stimulus type).

Face inversion 16 Regression. For each stimulus type and each block, response time was regressed on orientation to determine the slopes of the regression line relating the response time to orientation. For faces, the slope for Block 1 was 0.99 ms/, while the slope for Block 2 was 0.58 ms/. For Greebles, the slope for Block 1 was 0.93 ms/, while the slope for Block 2 was 0.58 ms/. In calculating the regression line for the first block of the face stimuli, the 180 orientation was excluded, as this point did not conform to the strong linear trend in RT found across the other orientations for each stimulus type and each block (e.g., see Jolicoeur, 1985, for a similar deflection in RT at 180 ). Note, however, that this point was included for all of the analysis of variance calculations for Experiment 2. Accuracy. Percentage of incorrect responses per orientation constituted the error data for Experiment 2. The error rates are plotted on the bottom panels of Figures 3 and 4 for faces and Greebles. In the same manner as the response time data, the error data were collapsed across distance from the canonical orientation, and then submitted to the same ANOVA. There were only main effects of block, F(1,70)=47.23, p<0.001; and orientation, F(6,420)=13.41, p<0.001. There was only a significant interaction between block and orientation, F(6,420)=9.09, p<0.001; although the interaction between stimulus type and orientation was marginally significant, F(6,420)=1.87, p=0.085. The linear contrast was significant, F(1,70)=33.02, p<0.001 (with a residual cubic trend, F(1,70)=6.94, p=0.01). Lastly, there was a significant block by linear contrast interaction, F(1,70)=26.15, p<0.001, suggesting that the orientation function was reduced on the second block. There was no evidence of any speed/accuracy tradeoffs. Discussion The results of Experiment 2 revealed a high degree of similarity in how subjects named faces and Greebles across changes in orientation. First, both faces and Greebles

Face inversion 17 yielded linear effects across orientations. Note that for faces, the average slope in this experiment was smaller than the average slope found in Experiment 1 presumably because recognition memory and naming recruit somewhat different sorts of information. Second, with increasing familiarity, subjects showed a decrease in the orientation function for both faces and Greebles (slope in the final block: faces: 0.41 ms/ ; Greebles: 0.35 ms/ ). That is, with practice, and as found for other object categories (Tarr & Pinker, 1989), misorientations in the picture plane had less of an effect on recognizing both faces and Greebles. Finally and critically, we did not find a larger effect of inversion for Greebles than for faces on either block of trials (cf. Yin, 1969). Given that we equated both stimulus types along several relevant dimensions (Gauthier & Tarr, 1997a) and, in particular in terms of the level of identification, it appears that it is not being a face per se that produces processing difficulties at the inverted orientation (or any other orientation). Experiment 3 One of the enduring findings of experiments involving the recognition of stimuli rotated in the picture plane is that practice effects are found as the subjects proceed through the experiment (e.g., Eley, 1982; Jolicoeur, 1985; Tarr & Pinker, 1989). That is, subjects are both faster and more accurate across all familiar orientations at the end of the experiment than at the beginning. With practice, observers can learn orientation-invariant diagnostic features (e.g., Eley, 1982; Jolicoeur & Milliken, 1989) or orientation-specific templates (e.g., Tarr & Pinker, 1989). In either case, practice effects seem to be specific to the learned stimuli, and generalize only to new stimuli that are visually similar (Gauthier & Tarr, 1997b). This high degree of view specificity is often revealed by surprising subjects with novel orientations following training (e.g., Tarr & Pinker, 1989). In Experiment 3, we tested whether learning individual faces and Greebles at multiple orientations show similar behavioral patterns across

Face inversion 18 multiple learned views. Thus, Experiment 3 was similar to Experiment 2 except that subjects received two additional hours of training with target faces or Greebles at the upright orientation and other specific orientations prior to testing. Method Subjects The 72 subjects participating in Experiment 3 were drawn from the same populations as Experiments 1 and 2. None of the subjects participated in any of the other experiments reported here. Material and Stimuli The faces for Experiment 3 were identical in construction to those of Experiments 1 and 2. A total of 114 faces were used. Of this total, 18 faces were used as targets, while the remainder served as distractors. The targets were blocked into three groups of six faces, with each group being used equally often across subjects. There was a total of 30 Greebles, similar in construction to those used in Experiment 2. Eighteen of these served as targets. The targets were blocked into three groups of six Greebles, with each group being used equally often across subjects. Thus, for both faces and Greebles, each subject only learned six target stimuli. The equipment used for stimulus presentation was identical to that used in Experiments 1 and 2. Design and Procedure Experiment 3 was similar to Experiment 2 in that a naming task was used in which subjects learned to associate names with the target stimuli. The names were attached to keys

Face inversion 19 on the keyboard, and during testing the subjects pressed the key with the name that corresponded to the stimulus currently being presented. Thirty-six subjects participated in the face group, while 36 subjects participated in the Greeble group. In contrast to the previous experiment, Experiment 3 added two one-hour training sessions on consecutive days prior to the testing session on the third day. During the two training days, subjects practiced naming the target stimulus at three of the twelve possible orientations used at test. Subjects practiced naming the target at 0, 150 and 240. At test, the same target stimuli were presented at all 12 orientations in the picture plane (0-330 in 30 steps). Training sessions. For faces, each training session was divided into three blocks of training trials preceded by two blocks of learning trials. For Greebles, the sessions were divided into two blocks of training trials preceded by two blocks of learning trials. The purpose of the first two learning blocks was to enable subjects to learn the name for each face or Greeble, and map the name of the target stimulus onto the correct response key. During these two blocks, the faces were always presented upright. As in Experiment 2, accuracy feedback was provided through headphones, with subjects hearing a beep for incorrect responses. For both faces and Greebles, the first learning block contained six presentations of each of the six targets with its corresponding name for a total of 36 trials. For these trials, the subjects were instructed to study each stimulus for the entire 5 sec presentation, then to press the key with the appropriate name after the stimulus was removed. The second learning block contained 12 presentations of each of the six targets without its name for a total of 72 trials. For these trials, the subjects were instructed to respond accurately yet quickly. During these two learning blocks, all stimuli were presented upright and subjects were instructed to keep their heads vertical. Subjects were explicitly informed that later in the

Face inversion 20 experiment they would be asked to identify each target by pressing the key with its corresponding name. They were not informed that the stimuli would sometimes be rotated in the picture plane. Presentation order was randomized for each of these learning groups for each subject. Following each of the two-hour training sessions subjects ran in either three blocks of training trials for faces or two blocks of training trials for Greebles. During these blocks, three of the six target stimuli were shown at 0 and 150, and the other three were shown at 0 and 240. The first set of items is referred to as the 150-Set, while the latter is referred to as the 240-Set. The distractor stimuli were presented at the same orientations as the target faces. A different set of 12 distractors were used for the different training blocks for faces. By comparison, the same set of 12 Greeble distractors was used given the limited number of these stimuli. During the training blocks, the three face or Greeble stimuli from the 150-Set were presented 12 times at 0 and 12 times at 150 for a total of 72 trials. Likewise, the three stimuli of each type comprising the 240-Set were presented the same number of times at 0 and 240 for a total of 72 trials. The 12 distractor faces and Greebles were presented twice at 0, once at 150, and once at 240 for a total of 48 trials. Therefore each training block contained 192 trials, 75% of which were targets and 25% of which were distractors. Presentation order was randomized for each training block for each subject. During these blocks, the subjects task was to accurately and quickly identify each face by pressing the key with the appropriate name. There was a key labeled NA for none of the above that was to be pressed upon presentation of a distractor. Stimulus presentation was preceded and followed by a pattern mask, and the trials were response terminated. Accuracy feedback was provided through headphones, with subjects hearing a beep for incorrect responses. Accuracy and response times were recorded for each response.

Face inversion 21 Testing session. After the two days of training, subjects returned the next day for the final test session. They were told that this last session was no different from the previous two. Like the two training sessions, the subjects were initially instructed to view the target stimulus in the canonical orientation (0 ), and to practice naming the faces or Greebles. Following next was a refresher block of trials and then two blocks of test trials in which the main body of data for Experiment 3 was collected. Other than being abbreviated, the refresher block of trials was identical to the training trials. The subjects trained with the same 150-Set and 240 Set of stimuli that were used during training. The 150-Set was presented six times at 0 and six times at 150, and the 240-Set was presented six times at 0 and six times at 240 for a total of 72 target trials. For the twelve distractor faces used, all were shown once at 0 ; whereas six were shown at 150 and six were shown at 240 for a total of 24 trials. Of the 96 total trials, the 72 trials during which target faces were shown represents 75% of the total trials. The final two blocks of trials represented the main body of data for the experiment. These two blocks were identical in all respects except for the mapping of distractors to orientation. In each of these blocks, the same six stimuli with which the subjects had trained were presented. However, unlike the training sessions the stimuli were now shown at all twelve orientations in the picture plane (0-330 in 30 steps). Therefore, each face and each Greeble was shown in ten novel orientations and in two previously trained orientations (including the upright orientation). Each of the six targets was shown three times at each of the twelve orientations for a total of 216 trials per block. The twelve distractors were divided into two groups of six. In the first test block, one group of distractors was shown six times at each of the following orientations: 0, 60, 120, 180, 240, and 300. The other group of distractors was shown six times at each of the following orientations: 30, 90, 150, 210, 270, and 330.

Face inversion 22 Therefore, there were 72 distractor trials in all. For the second test block, the orientations were switched between groups of distractors. Of the 288 trials per block, the 216 trials during which target faces were shown represents 75% of the total trials. For each of the two test blocks, the subjects task was to accurately and quickly identify each target by pressing the key with the appropriate name. There was a key labeled NA for none of the above that was to be pressed upon presentation of a distractor. Stimulus presentation was preceded and followed by the same pattern mask used in Experiments 1 and 2, and the trials were response terminated. Results Response times. For Experiment 3, we were primarily interested in naming times for targets during the test trials as a function of the angular distance to a trained orientation. The top panels of Figures 5 and 6 plot naming times as a function of block and orientation, separately for faces and Greebles. ------ Insert Figures 5 and 6 about here ------ In Experiment 3, subjects learned faces or Greebles at two orientations (0, and either 150 or 240 ). Therefore, the data were collapsed across distance from familiar orientation (either the canonical orientation or the trained orientations, whichever was closer). For example, to calculate the value that represents 30 from a familiar orientation, the following points were averaged: the points on the 150-Set function that were 30 from 150 (120 and 180 ); the points on the 240-Set function that were 30 from 240 (210 and 270 ); the points on either function that were 30 from 0 (30 and 330 from both the 150-Set and the 240- Set).

Face inversion 23 The data were entered into a mixed-design ANOVA with stimulus type (faces, Greebles) as a between-subjects factor; and orientation (0-120 ) and block (1-2) as repeated factors. As before, a linear contrast was computed for the orientation factor. There was a significant effect of orientation, F(4,140)=29.19, p<.001, and no effect for block. There was a marginal orientation x block interaction, F(4,140)=2.11, p=.08. There was also a significant effect for the linear contrast computed for the orientation factor, F(1,35)=51.98, p<.001, and a significant linear contrast x block interaction, F(1,35)=4.27, p<.05, indicating a difference in the slope of the orientation functions from Block 1 to Block 2. After computing the linear contrast, there was no significant residual variance associated with the orientation factor. As evident in Figures 5 and 6, there is a decrease in RT at 150 for the 150-Set. The peak RT for this set of faces and Greebles is at 240, and thus shifted 60 away from the 180 orientation. Similarly, there is a decrease in RT at 240 for the 240-Set. In contrast to the naming times for stimuli from the 150-Set, the peak RT for this set of faces and Greebles was shifted 60, but in the opposite direction, to the 120 orientation. Regression. The collapsed RT data, per stimulus type and block, were regressed against orientation to determine the orientation function. For faces, the slope for Block 1 was 0.98 ms/, while the slope for Block 2 was 0.71 ms/. For Greebles, the slope for Block 1 was 1.37 ms/, while the slope for Block 2 was 0.88 ms/. Analyses revealed that the slopes were not significantly different between faces and Greebles, F(1,70)=0.96; but the rates of rotation were significantly different across blocks, F(1,70)=5.11, p<0.05. Accuracy. The proportion of incorrect responses per orientation constituted the error data for Experiment 3. The error rates for faces and Greebles are plotted in the bottom panels of Figures 5 and 6. As with RTs, the error data was submitted to a mixed-design ANOVA with stimulus type (faces, Greebles) as a between-subjects factor, and with block (1-2) and orientation (0-180 ) as repeated factors. There were significant main effects of block,

Face inversion 24 F(1,70)=10.40, p<0.001; and orientation, F(6,420)=11.51, p<0.001. The main effect of stimulus type was marginally significant, F(1,70)=3.62, p=0.061. Overall, there was no evidence of a speed/accuracy tradeoff. Discussion As in Experiment 2, there were no apparent differences in how faces and Greebles were recognized when they were rotated in the picture plane. First, following extensive training at specific orientations in the picture plane, subjects named faces and Greebles quickest at their trained orientations (including the upright orientation). Second, naming times increased in a linear manner as the stimuli were rotated away from trained orientations. Third, additional practice at novel orientations resulted in a decrease in the orientation effect. As in Tarr and Pinker (1989; see also Jolicoeur & Milliken, 1989, Experiment 1), the speed benefit associated with training with a face or Greeble at a specific orientation does not generalize to other stimuli of the same class, or to trained stimuli shown at novel orientations. Only stimuli trained at 150 showed a benefit at 150 during test, and only stimuli trained at 240 showed a benefit at 240 during test. This finding is consistent with Tarr and Pinker s assertion that object representations are both orientation and object specific. Faces and Greebles do not appear to be exceptions to this rule. We note, however, that Jolicoeur and Milliken (1989) found evidence of orientation generalization with familiar objects. In their second experiment, subjects named familiar objects that were shown upright in the context of other objects shown rotated in the picture plane. In a surprise block, the upright-only objects were also shown at non-upright orientations. They found that subjects were not affected by orientation for these objects, suggesting that subjects could generalize to novel orientations. In Jolicoeur and Milliken s study, observers named visually distinctive objects at the basic level (see also Hamm &

Face inversion 25 McMullen, 1998). The orientation and stimulus specificity found in this experiment may therefore depend on the visual homogeneity of the stimulus class (as with the homogenous 2D figures used by Tarr & Pinker, 1989) and/or subordinate-level recognition (e.g., specificity may be increased by prompting subjects to attend to subtle or more complex features). Lastly, the results of Experiment 3 extend the findings of Experiment 2 and those of previous studies on the recognition of misoriented faces (e.g., Bruyer et al., 1993; Collishaw & Hole, 2002; Lewis, 2001; Murray et al., 2000; Valentine & Bruce, 1988). In particular, we find that representations of faces include specific orientation information about those faces, and that this information can generalize, in an orientation-sensitive manner, to a limited extent to novel orientations. One possibility is that the inclusion of orientation-specific information may be related to subjects reliance on metric spatial relations between facial features (e.g., the distance between the eyes), which have been shown to be a critical factor for recognizing individual faces (e.g., Collishaw & Hole, 2002). General Discussion The majority of studies on the face inversion effect have compared the recognition of upright and inverted faces and non-face objects (e.g., Yin, 1969). This comparison leaves unanswered potential similarities and differences in how faces and non-face objects are processed, for example, the types of visual features that may be critical for recognition. Although investigators have systematically examined how faces at other picture-plane orientations are recognized (Bruyer et al., 1993; Collishaw & Hole, 2002; Lewis, 2001; Murray et al., 2000; Sjoberg & Windes, 1992; Stürzel & Spillman, 2000; Valentine & Bruce, 1988), they have not directly compared faces and non-face objects that were matched to faces along various stimulus dimensions. Here we systematically misoriented faces and Greebles in the picture plane and measured how subjects recognized them across this continuum.

Face inversion 26 Our main findings are as follows. First, we found a strong linear dependence of response times on orientation for faces using both an old/new recognition task (Experiment 1) and a naming task (Experiments 2 and 3). Second, we found similar orientation effects for both faces and Greebles (Experiments 2 and 3). Third, we found that this orientation effect diminished with practice at familiar orientations for both categories. Taken as a whole, the data reported here suggest that disproportionate effects of inversion on faces with respect to non-face objects do not provide a strong argument in favor of separable mechanisms for faces and for non-face objects, rather they point towards a single mechanism that suffices for face and object recognition (Tarr & Cheng, 2003; Valentine & Bruce, 1988). In the face recognition literature, researchers have shown that inversion disrupts configural processing (e.g., Collishaw & Hole, 2002; Goffaux & Rossion, under revision; Freire et al., 2000; Leder et al., 2000; Tanaka & Farah, 1993). Consequently, subjects may be more likely to rely on individual local features for inverted faces. However, the recognition of some non-face upright objects may also rely on the same configural processes as the recognition of faces. As our data suggest, discriminating between individuals within a homogeneous stimulus class with a single orientation may be sufficient to recruit configural processes and, hence, be equally disrupted by inversion. It is important to note that one critical dimension that often varies between faces and non-face objects is the level of expertise (Gauthier, Tarr, Anderson, Skudlarski, & Gore, 1999; Gauthier & Logothetis, 2000; Diamond & Carey, 1986; Tanaka & Farah, 1993). For example, Diamond and Carey found a large inversion effect using human faces for both the dog experts and novices but only the dog experts showed a large inversion effect for pictures of dogs. However, we obtained comparable inversion effects for faces and Greebles, which suggest that expertise is not a necessary condition to obtain comparable effects of misorientation in the recognition of faces and non-face objects. Alternatively, given the

Face inversion 27 limited number of stimuli in Experiment 2 and the additional training in Experiment 3, observers may have gained some degree of expertise with the novel Greebles tested. These factors may lead to similar inversion effects found for faces and Greebles in the present study. Future work is needed in this area to determine the extent that expertise may contribute to inversion effects for faces and non-face objects (Gauthier & Tarr, 1997a). Lastly, from a theoretical perspective, the results reported here are consistent with a view-based model of object recognition (see Tarr & Bülthoff, 1998, for a review). According to this approach, objects are represented by a collection view- and orientation-specific features and metric relationships between these features. Objects at novel orientations are recognized by a generalization process that matches the stimulus input to stored representations. In terms of the neural instantiation of such a model, Perrett, Oram, and Ashbridge (1998) have proposed that populations of inferior temporal cortical neurons with different stimulus-orientation tuning functions accumulate information over time to match novel orientations of an object to stored orientation-specific representations. The current results impose at least three important constraints on this view-based model that we wish to emphasize: First, both configural and featural information may be differentially processed; second, these cues may have different generalization time courses; and third, both configural and component features are learned in a highly orientation-dependent manner. Critically, our direct comparison between faces and Greebles indicates that how quickly and accurately configural and featural mechanisms compensate for misorientations seems to depend on the homogeneity of the stimulus class. In sum, the degree to which misorientation in the picture plane affects recognition depends on at least the visual homogeneity of the stimulus class and whether exemplars have a single dominant orientation. The combination of these two factors may engage processes that are highly sensitive to the configurations of parts of faces and non-face objects alike.

Face inversion 28 Inverting a stimulus may severely impair this sort of processing, and make observers rely more on specific component features. Importantly, both configural and featural processes may be instantiated in a single neural mechanism (e.g., Perrett et al., 1998). Thus, it may be that misoriented faces are more difficult to recognize than most other objects because faces constitute a highly homogeneous and mono-oriented stimulus category, not because there is a special or separable face processing mechanism that is particularly sensitive to inversion.

Face inversion 29 Acknowledgments Funding for this research was provided to Alan Ashworth by the Intelligent Systems Branch, Armstrong Laboratory, Brooks Air Force Base, San Antonio, Texas, and to Michael J. Tarr by the Air Force Office of Scientific Research grant #F49620-91-J-0169. We thank Scott Yu for creating the novel objects used as control stimuli in Experiment 2 and Isabel Gauthier for her help in generating these stimuli.

Face inversion 30 References Biederman I. (1987). Recognition-by-components: A theory of human image understanding. Psychological Review, 94, 115-147. Bower, G. H., & Karlin, M. B. (1974). Depth of processing pictures of faces and recognition memory. Journal of Experimental Psychology, 103, 751-757. Brooks, B. E., Rosielle, L. J., & Cooper, E. E. (2002). The priming of face recognition after metric transformations. Perception, 31, 297-313. Bruyer, R., Galvez, C., & Prairial, C. (1993). Effect of disorientation on visual analysis, familiarity decision and semantic decision on faces. British Journal of Psychology, 84, 433-441. Cohen, D., & Kubovy, M. (1993). Mental rotation, mental representation, and flat slopes. Cognitive Psychology, 25, 351-382. Collishaw, S. M., & Hole, G. J. (2002). Is there a linear or a nonlinear relationship between rotation and configural processing of faces? Perception, 31, 287-296. Cooper, L. A. (1975). Mental rotation of random two-dimensional shapes. Cognitive Psychology, 7, 20-43. Cooper, L. A., & Shepard, R. N. (1973). Chronometric studies of the rotation of mental images. In W. G. Chase (Ed.), Visual information processing (pp. 75-176). New York: Academic Press. Diamond, R., & Carey, S. (1986). Why faces are and are not special: An effect of expertise. Journal of Experimental Psychology: General, 115, 107-117. Eley, M. G. (1982). Identifying rotated letter-like symbols. Memory & Cognition, 10, 25-32. Freire, A., Lee, K., & Symons, L. A. (2000). The face-inversion effect as a deficit in the encoding of configural information: Direct evidence. Perception, 29, 159-170.

Face inversion 31 Gauthier, I., Behrmann, M., & Tarr, M. J. (2004). Are Greebles like faces? Using the neuropsychological exception to test the rule. Neuropsychologia, 42, 1961-1970. Gauthier, I., & Logothetis, N. (2000). Is face recognition no so unique after all? Cognitive Neuropsychology, 17, 125-142. Gauthier, I., & Tarr, M. J. (1997a). Becoming a Greeble expert: Exploring the face recognition mechanism. Vision Research, 37, 1673-1682. Gauthier, I, & Tarr, M. J. (1997b). Orientation priming of novel shapes in the context of viewpoint-dependent recognition. Perception, 26, 51-73. Gauthier, I., Tarr, M. J., Anderson, A. W., Skudlarski, P., & Gore, J. C. (1999). Activation of the middle fusiform face area increases with expertise in recognizing novel objects. Nature Neuroscience, 2, 568-573. Goffaux, V., & Rossion, B. (under revision). Face inversion disproportionately impairs the perception of vertical but not horizontal relations between features. Journal of Experimental Psychology: Human Perception & Performance. Hamm, J. P., & McMullen, P. A. (1998). Effects of orientation on the identification of rotated objects depend on the level of identity. Journal of Experimental Psychology: Human Perception & Performance, 24, 413-426. Jolicoeur, P. (1985). The time to name disoriented natural objects. Memory & Cognition, 13, 289-303. Jolicoeur, P., & Milliken, B. (1989). Identification of disoriented objects: Effects of context of prior presentation. Journal of Experimental Psychology: Learning, Memory, & Cognition, 15, 200-210. Lawson, R., & Jolicoeur, P. (1999). The effect of prior experience on recognition thresholds for plane-disoriented pictures of familiar objects. Memory & Cognition, 27, 751-758.

Face inversion 32 Leder, H., Candrian, G., Huber, O., & Bruce, V. (2001). Configural features in the context of upright and inverted faces. Perception, 30, 73-83. Lewis, M. B. (2001). The Lady s not for turning: Rotation of the Thatcher illusion. Perception, 30, 769-774. Maurer D., Grand, R. L., & Mondloch, C. J. (2002).The many faces of configural processing. Trends in Cognitive Sciences, 6, 255-260. Murray, J. E. (1998). Is entry-level recognition viewpoint invariant or viewpoint dependent? Psychonomic Bulletin & Review, 5, 300-304. Murray, J. E., Jolicoeur, P., McMullen, P. A., Ingleton, M. (1993). Orientationinvariant transfer of training in the identification of rotated natural objects. Memory & Cognition, 21, 604-610. Murray, J. E., Yong, E., & Rhodes, G. (2000). Revisiting the perception of upsidedown faces. Psychological Science, 11, 492-496. Perrett, D. I., Oram, M. W., & Ashbridge, E. (1998). Evidence accumulation in cell populations responsive to faces: an account of generalisation of recognition without mental transformations. Cognition, 67, 111-145. Riesenhuber, M., Jarudi, I., Gilad, S., & Sinha, P. (2004). Face processing in humans is compatible with a simple shape-based model of vision. Proceedings of the Royal Society: Biological Sciences, 271, S448-S450. Robbins, R., & McKone, E. (2003). Can holistic processing be learned for inverted faces? Cognition, 88, 79-107. Rosch, E., Mervis, C. B., Gray, W. D., Johnson, D. M., & Boyes-Braem, P. (1976). Basic objects in natural categories. Cognitive Psychology, 8, 382-439. Scapinello, K. F., & Yarmey, A. D. (1970). The role of familiarity and orientation in immediate and delayed recognition of pictorial stimuli. Psychonomic Science, 21, 329-330.

Face inversion 33 Sekuler, A. B., Gaspar, C. M., Gold, J. M., & Bennett, P. J. (2004). Inversion leads to quantitative, not qualitative, changes in face processing. Current Biology, 14, 391-396. Sjoberg, W., Windes, J. (1992). Recognition times for rotated normal and Thatcher faces. Perceptual & Motor Skills, 75, 1176-1178. Stürzel, F., & Spillman, L. (2000). Thatcher illusion: Dependence on angle of rotation. Perception, 29, 937-942. Tanaka, J.W., & Farah, M. J. (1993). Parts and wholes in face recognition. Quarterly Journal of Experimental Psychology, 46A, 225-245. Tarr, M. J., & Bülthoff, H. H. (1998). Image-based object recognition in man, monkey and machine. Cognition, 67, 1-20. Sekuler, A. B., Gaspar, C. M., Gold, J. M., & Bennett, P. J. (2004). Inversion leads to quantitative, not qualitative, changes in face processing. Current Biology, 14, 391-396. Tarr, M. J., & Cheng, Y. D. (2003). Learning to see faces and objects. Trends in Cognitive Sciences, 7, 23-30. Tarr, M. J., & Pinker, S. (1989). Mental rotation and orientation-dependence in shape recognition. Cognitive Psychology, 21, 233-282. Thompson, P. (1980). Margaret Thatcher - A new illusion. Perception, 9, 483-484. Valentine, T., & Bruce, V. (1986). The effect of race, inversion and encoding activity upon face recognition. Acta Psychologica, 61, 259-273. Valentine, T., & Bruce, V. (1988). Mental rotation of faces. Memory & Cognition, 16, 556-566. Yarmey, A. D. (1971). Recognition memory for familiar 'public' faces: effects of orientation and delay. Psychonomic Science, 24, 286-288. Yin, R. K. (1969). Looking at upside-down faces. Journal of Experimental Psychology, 81, 141-145.

Face inversion 34 Figure Captions Figure 1. Mean response times and sensitivity averaged across subjects in Experiment 1 as a function of orientation away from vertical upright (0 ). Error bars in this and subsequent figures are standard errors of the mean (SEM). Figure 2. Examples of Greeble stimuli. Each row shows Greebles from a different family (same body shape). Each column shows exemplars from a family (different parts). Figure 3. Mean response times and error rates averaged across subjects in Experiment 2 for faces as a function of orientation and block. Figure 4. Mean response times and error rates averaged across subjects in Experiment 2 for Greebles as a function of orientation and block. Figure 5. Mean response times and error rates averaged across subjects in Experiment 3 for faces as a function of orientation, block, and training set. Faces from the 150-set were trained at 0 and 150, whereas those from the 240-set were trained at 0 and 240. Figure 6. Mean response times and error rates averaged across subjects in Experiment 3 for Greebles as a function of orientation, block, and training set. Greebles from the 150-set were trained at 0 and 150, whereas those from the 240-set were trained at 0 and 240.

Face inversion 35 1800 1600 Response Time (ms) 1400 1200 1000 800 0 30 60 90 120 150 180 210 240 270 300 330 360 Orientation (deg) 2.50 2.00 Sensitivity (d') 1.50 1.00 0.50 0.00 0 30 60 90 120 150 180 210 240 270 300 330 360 Orientation (deg) Figure 1

Figure 2 Face inversion 36