Neurophysiologically Driven Image Triage: A Pilot Study

Similar documents
Neurophysiologically Driven Image Triage: A Pilot Study

Neurotechnology for Image Analysis: Searching For Needles in Haystacks Efficiently

ABSTRACT 1. INTRODUCTION 2. ARTIFACT REJECTION ON RAW DATA

Optimising the Number of Channels in EEG-Augmented Image Search

NeuroSky s esense Meters and Detection of Mental State

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL.?, NO.?, JUNE

The impact of numeration on visual attention during a psychophysical task; An ERP study

Beyond Blind Averaging: Analyzing Event-Related Brain Dynamics. Scott Makeig. sccn.ucsd.edu

Supporting Information

Effects of Light Stimulus Frequency on Phase Characteristics of Brain Waves

Quality of evidence for perceptual decision making is indexed by trial-to-trial variability of the EEG

DATA MANAGEMENT & TYPES OF ANALYSES OFTEN USED. Dennis L. Molfese University of Nebraska - Lincoln

From Single-trial EEG to Brain Area Dynamics

One Class SVM and Canonical Correlation Analysis increase performance in a c-vep based Brain-Computer Interface (BCI)

A Brain Computer Interface System For Auto Piloting Wheelchair

NeuroImage 64 (2013) Contents lists available at SciVerse ScienceDirect. NeuroImage. journal homepage:

Practicalities of EEG Measurement and Experiment Design YATING LIU

Simultaneous Real-Time Detection of Motor Imagery and Error-Related Potentials for Improved BCI Accuracy

Event Related Potentials: Significant Lobe Areas and Wave Forms for Picture Visual Stimulus

ISSN: (Online) Volume 3, Issue 7, July 2015 International Journal of Advance Research in Computer Science and Management Studies

Linear spatial integration for single trial detection in encephalography

EBCC Data Analysis Tool (EBCC DAT) Introduction

EEG-Rhythm Dynamics during a 2-back Working Memory Task and Performance

HST 583 fmri DATA ANALYSIS AND ACQUISITION

Chapter 9 Cortically-Coupled Computer Vision

The Role of Color and Attention in Fast Natural Scene Recognition

Biomedical Research 2013; 24 (3): ISSN X

From single-trial EEG to brain area dynamics

Amy Kruse, Ph.D. Strategic Analysis, Inc. LCDR Dylan Schmorrow USN Defense Advanced Research Projects Agency

Transcranial direct current stimulation modulates shifts in global/local attention

Cortical origins of response time variability during rapid discrimination of visual objects

Novel single trial movement classification based on temporal dynamics of EEG

Animal Detection in Natural Images: Effects of Color and Image Database

Towards natural human computer interaction in BCI

A micropower support vector machine based seizure detection architecture for embedded medical devices

Tracking the Development of Automaticity in Memory Search with Human Electrophysiology

Center Speller ( )

An Overview of BMIs. Luca Rossini. Workshop on Brain Machine Interfaces for Space Applications

Prestimulus Alpha as a Precursor to Errors in a UAV Target Orientation Detection Task

The overlap of neural selectivity between faces and words: evidences

Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011

Hybrid EEG-HEG based Neurofeedback Device

Title of Thesis. Study on Audiovisual Integration in Young and Elderly Adults by Event-Related Potential

Introduction to Computational Neuroscience

Curiosity Dublin City University Executive Summary

Characterization and Robust Classification of EEG Signal from Image RSVP Events with Independent Time- Frequency Features

Neural Correlates of Human Cognitive Function:

Support Vector Machine Classification and Psychophysiological Evaluation of Mental Workload and Engagement of Intuition- and Analysis-Inducing Tasks

Augmented Cognition to enhance human sensory awareness, cognitive functioning and psychic functioning: a research proposal in two phases

Prediction of Successful Memory Encoding from fmri Data

Manuscript under review for Psychological Science. Direct Electrophysiological Measurement of Attentional Templates in Visual Working Memory

Introduction to Computational Neuroscience

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING 1

Evidence-Based Filters for Signal Detection: Application to Evoked Brain Responses

PHYSIOLOGICAL RESEARCH

Supplemental Information: Task-specific transfer of perceptual learning across sensory modalities

Quick Guide - eabr with Eclipse

Linear Spatial Integration for Single-Trial Detection in Encephalography

Recognition of Sleep Dependent Memory Consolidation with Multi-modal Sensor Data

Submitted report on Sufi recordings at AAPB 2013 in Portland. Not for general distribution. Thomas F. Collura, Ph.D. July, 2013

Top-Down versus Bottom-up Processing in the Human Brain: Distinct Directional Influences Revealed by Integrating SOBI and Granger Causality

by Laurie Gibson, Jon Touryan, Anthony Ries, Kaleb McDowell, Hubert Cecotti, and Barry Giesbrecht ARL-TR-6055 July 2012

Classification Accuracy Improvement of Chromatic and High Frequency Code Modulated Visual Evoked Potential Based BCI

Neural Strategies for Selective Attention Distinguish Fast-Action Video Game Players

Classifying P300 Responses to Vowel Stimuli for Auditory Brain-Computer Interface

Emotion Detection Using Physiological Signals. M.A.Sc. Thesis Proposal Haiyan Xu Supervisor: Prof. K.N. Plataniotis

Brain Computer Interface. Mina Mikhail

Outline of Talk. Introduction to EEG and Event Related Potentials. Key points. My path to EEG

Predicting Perceptual Performance From Neural Activity

This presentation is the intellectual property of the author. Contact them for permission to reprint and/or distribute.

Morton-Style Factorial Coding of Color in Primary Visual Cortex

Classification of Epileptic Seizure Predictors in EEG

Supplementary materials for: Executive control processes underlying multi- item working memory

Address for correspondence: School of Psychological Sciences, Zochonis Building, University of Manchester, Oxford Road, M139PL, Manchester, UK

This is a repository copy of Facial Expression Classification Using EEG and Gyroscope Signals.

Figure 1. Source localization results for the No Go N2 component. (a) Dipole modeling

EEG/ERP Control Process Separation

Supplementary material

(Visual) Attention. October 3, PSY Visual Attention 1

MENTAL WORKLOAD AS A FUNCTION OF TRAFFIC DENSITY: COMPARISON OF PHYSIOLOGICAL, BEHAVIORAL, AND SUBJECTIVE INDICES

DISCRETE WAVELET PACKET TRANSFORM FOR ELECTROENCEPHALOGRAM- BASED EMOTION RECOGNITION IN THE VALENCE-AROUSAL SPACE

International Journal of Neurology Research

ERP Components and the Application in Brain-Computer Interface

Supplementary Information on TMS/hd-EEG recordings: acquisition and preprocessing

VISUAL PERCEPTION OF STRUCTURED SYMBOLS

Modifying the Classic Peak Picking Technique Using a Fuzzy Multi Agent to Have an Accurate P300-based BCI

Analysis of in-vivo extracellular recordings. Ryan Morrill Bootcamp 9/10/2014

The neurolinguistic toolbox Jonathan R. Brennan. Introduction to Neurolinguistics, LSA2017 1

EEG changes accompanying learned regulation of 12-Hz EEG activity

CS/NEUR125 Brains, Minds, and Machines. Due: Friday, April 14

Neuro Q no.2 = Neuro Quotient

Entrainment of neuronal oscillations as a mechanism of attentional selection: intracranial human recordings

What do you notice? Woodman, Atten. Percept. Psychophys., 2010

EEG Analysis on Brain.fm (Focus)

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Information Processing During Transient Responses in the Crayfish Visual System

The Neural Basis of Somatosensory Remapping Develops in Human Infancy

NeuroImage 59 (2012) Contents lists available at ScienceDirect. NeuroImage. journal homepage:

Attentional Blink Paradigm

Transcription:

Neurophysiologically Driven Image Triage: A Pilot Study Santosh Mathan Honeywell Laboratories 3660 Technology Dr Minneapolis, MN 55418 USA santosh.mathan@honeywell.com Stephen Whitlow Honeywell Laboratories 3660 Technology Dr Minneapolis, MN 55418 USA stephen.whitlow@honeywell.com Abstract Effective analysis of complex imagery is a critical aspect of important domains such as medical diagnosis. As technological developments lower the cost of gathering and storing imagery, the cost of searching through large image sets for important information has been growing substantially. This paper demonstrates the feasibility of using neurophysiological signals associated with early perceptual processing to identify critical information within large image sets efficiently. Brain signals called evoked response potentials, detected in conjunction with rapid serial presentation of images, show promise as a human computer interaction modality for screening high volumes of imagery accurately and efficiently. Keywords Rapid Serial Visual Presentation, EEG, Visual Search, Brain Computer Interface Copyright is held by the author/owner(s). CHI 2006, April 22 27, 2006, Montreal, Canada. ACM 1-xxxxxxxxxxxxxxxxxx. ACM Classification Keywords H.5.2. Information interfaces and Presentation: User Interfaces: Interaction Styles H.3.3 Information Search and Retrieval: Information Filtering Introduction The problem of searching for targets in vast collections of imagery is one that affects practitioners in a variety of

2 Figure 1. Baseline EEG (top) EEG segment containing an Evoked Response Potential (bottom) domains from medical diagnosis to intelligence image analysis. Advances in imaging and storage technology have served to lower the cost of collecting and storing high volumes of imagery. However, the cost of searching through large sets of imagery for important information can often be substantial. In many domains, such as medical diagnosis and intelligence analysis, effective search currently requires the expertise of highly skilled analysts who search though sequences of images in a relatively slow manner. Unfortunately, the availability of skilled analysts is simply insufficient to cope with the volume of imagery to be analyzed. For example, the military reports that most intelligence imagery goes without being examined by analysts [4]. The problems just highlighted have led to calls for effective triage techniques that can be used to rapidly screen high volumes of imagery and identify a subset of images that merit careful scrutiny by an image analyst [4]. A triage process trades off specificity in favor of sensitivity the search may result in several false positives, but also most, or all, of the targets in an image set. Computer vision systems have been employed towards this end. However, in many contexts, these systems fall short of the sensitivity and specificity that humans display. They also fail to generalize to the extent that human analysts do. An ideal triage system might be one that leverages human visual processing capabilities in the role of a target detector, while raising the efficiency associated with the manual search process. One method for realizing an efficient triage platform that exploits human perceptual capabilities may lie in a combination of rapid serial visual presentation (RSVP) of images and electroencephalogram (EEG) recordings collected as a user views images flashed briefly at rates of 100 milliseconds or less per image. For example, Thorpe and colleagues [8] asked participants to detect images of animals in a sequence of nature scenes presented for 20 milliseconds per image. Using EEG sensors, researches were able to detect a brain signal known as an evoked response potential (ERP) within 150 ms of the onset of target stimuli. Evoked Response Potentials Evoked response potentials refer to a morphological change in EEG waveforms in response to task-relevant stimuli. They are typically measured by inspecting EEG activity within a window of several hundred milliseconds following critical events. Figure 1 shows EEG activity at a particular sensor following a non task relevant stimulus (distractor) and a task relevant stimulus (target). The x- axis depicts the progression of time following the stimulus in milliseconds the zero point corresponds to the onset of a stimulus. The wave form associated with the target shows a pronounced amplitude perturbation following stimulus onset. Research suggests that ERPs reflect the activity of underlying cognitive processes necessary for processing and coordinating a response to task relevant stimuli. The brain s response to critical events, such as the presence of targets, may begin in frontal areas generating topdown, intent information and propagate to sensorimotor areas triggering events that regulate bottom up information transmission through sensory and response selection areas. [5] ERPs are difficult to detect. These signals typically range in amplitude from approximately 1 to 10 microvolts, while background EEG activity may range from 10 to 100 microvolts. Common events such as eye blinks or facial

3 Figure 2. Experimental design. Participants viewed trials with or without targets. 50% of trial blocks contained targets. Fixation screen separated trial blocks. muscle activity can completely obscure ERPs. In order to deal with such an inherently low signal to noise ratio, ERP detection has relied on a strategy of trial averaging. Under this strategy, an experimental stimulus is presented to a subject several times. The waveforms elicited by each stimulus are averaged. Background EEG washes out in the averaging process, and the event induced activity becomes prominent. While integrating information across repeated presentations of a stimulus is an effective way to identify ERPs, it is an impractical strategy for application domains, such as a triage platform. Repeated presentation of stimuli compromises the efficiency of the search process. In domains where efficient ERP detection is critical, accurate detection of ERPs within a single trial becomes necessary. However, single trail detection of ERPs requires a robust signal processing and classification approach to overcome the problems imposed by the inherently low signal-to-noise ratio. Recently, researchers have developed promising approaches for single-trial ERP detection [3, 6]. Instead of integrating sensor data over time, they rely on integrating information spatially, across high density EEG sensors. While these studies are promising, several issues limit their practical relevance. First, they rely on large electrode arrays of over 60 elements this is both expensive and cumbersome. Second, the ability of classifiers to generalize across individuals and sessions spanning an hour or more has not been explored. Research to date has largely focused on data collected over the span of sessions under 10 minutes. However, analysts anecdotally report analyzing imagery for spans of approximately an hour. Third, existing research has focused on within-subject analysis of classification results. However, the ability of an ERP detector to generalize across subjects has practical implications. An approach that generalizes across individuals would reduce the need for individualized calibration and allow a greater proportion of each analysis session to be devoted to image analysis. The pilot study described in this paper focuses on ERP classification in the context of an RSVP based triage task. Using data collected with a 32 electrode EEG system, this research examines the capacity of an ERP detection system to generalize over the span of an hour. It also examines the ability of the system to generalize across individuals. Method Participants Two male volunteers participated in this pilot study. Both had normal corrected vision. EEG data was collected over the course of little over an hour as participants performed a target detection task in the context of an RSVP presentation. The experimental duration was broken into three sessions of approximately 20 minutes and produced three corresponding datasets for each subject. There was a rest period of approximately five minutes between sessions. Data from the first session was intended for use as training data for a classifier, data from the second session was intended for classifier validation, and the third dataset was collected with the objective of being used for classifier testing Task Participants were asked to locate objects within sequences of grayscale satellite imagery provided by the National Geospatial Intelligence Agency (NGA). Objects of interest, referred to as targets, consisted of satellite photographs of ships or boats in the midst of a pool of satellite images around a port scene. Both the target and distractor images

4 Figure 3. Average spatio temporal pattern of electrical activity over the scalp following target (left) and distractor images (right) were drawn from a common high-resolution, broad-area, satellite image. All imagery was presented using the Rapid Serial Visual Presentation (RSVP) paradigm [7]. Images were presented in rapid succession for durations of 50 or 100 milliseconds, per image. These images were grouped into sets of approximately 50 images, referred to as a trial (see Figure 2). Half the trials included a single target image inserted randomly within the sequence of images (target trials), while half the trials did not include a target (distractor trials). Participants were asked to indicate the presence of a target with a button press. Consecutive trials were separated by a fixation screen lasting several seconds in order to break monotony and minimize possible eye strain. Participants viewed tens of thousands of images over the course of the experiment. Display Images were presented on a 21 inch, CRT monitor. Images were 400 x 400 pixels in size and presented on a screen of 1240 x 768 pixel resolution. Participants were able to position themselves at a comfortable distance from the screen. All images shared a relatively similar level of luminance and were presented using a script developed for Presentation, a stimulus presentation tool developed by Neurobehavioral Systems. Data Acquisition EEG data was collected using the BioSemi Active Two system using a 32 channel EEG cap and eye electrodes. Data was sampled at 256 Hz. Triggers sent by the Presentation script to mark the onset of target and distractor stimuli were received by the BioSemi system over a parallel port and recorded concurrently with EEG signals. EEG was bandpass filtered between 1 Hz and 30 Hz, using an 8th order Butterworth filter. An adaptive linear filter was used to correct EEG signals affected by eye blinks. Results Following the experiment, EEG data was segmented into epochs. In the case of target trials each epoch consisted of a two second segment of EEG one second before, and one second after the onset of target stimuli. For distractor trials (no target trials) epochs were extracted around the trigger associated with the middle image of each trial block. Each epoch served to provide a picture of spatiotemporal electrical activity across brain regions. Each twenty minute session yielded approximately 80 to 90 target and 80 to 90 distractor epochs each. Each independent channel of each epoch (indexed by row in the epoch matrix) was normalized using the mean and standard deviation of channel values preceding the image trigger. Qualitative Analysis MEAN HEAD PLOTS Epoch associated with distractor and target images were averaged and rendered using EEGLab [1]. The visual rendering of spatio-temporal electrical activity reveals clear patterns of activation that help discriminate between target and distractor trials for both participants. For example, as Figure 3 based on data from Participant 1 shows, target trials were characterized by strong, positive, frontal activity and negative parietal activity at approximately 150 milliseconds. This pattern changes to strong negative frontal activity and positive parietal activation 350 milliseconds after stimulus onset. In contrast, activations remain relatively neutral and stable in the distractor conditions. Clear patterns of discriminatory activity are evident well beyond the 450 millisecond window depicted in Figure 3.

5 Figure 4. Electrical activity preceding and following target images (top) and distractor images (bottom) at a single EEG electrode Y-axis in each plot indexes individual trials or epochs; the x- axis represents time. The 0 point on the x-axis (bold black line) represents time of stimulus onset. Color in each graph represents the polarity and amplitude of electrical activity at a particular point in time. There is a consistent pattern of sustained, high amplitude activity following stimulus onset at practically every trial containing a target. While the mean head plots depicted in Figure 3 reveal a clear spatio-temporal pattern of activity that could serve to discriminate between targets and distractors, they provide no information about the trial-by-trial variability of activity patterns across time at particular channels. A high degree of variability could make it difficult to discriminate between targets and distractors on a trial-by-trial basis. To assess the variability of patterns that form features for classification, electrical activity associated with every trial was plotted for several channels. Figure 4 shows a channel ERP image for Participant 1 plotted using EEGLab. The channel ERP image depicted in Figure 4 shows electrical activity associated with every epoch in target and distractor conditions, at a particular frontal sensor location. Target epochs contained broad bands of high amplitude activity at various latencies with respect to stimulus onset. By comparison, sustained patterns of high amplitude activity are missing in the distractor epochs. ERP Classification OBJECTIVE ERP data was also used in the context of a classification analysis. There were two objectives associated with the analysis. First, the analysis would serve to establish whether it was possible to generalize learning over relatively large time windows up the span of an hour. Second, the analysis would help assess whether the features used to classify ERPs were consistent enough to generalize across individuals. A support vector machine (SVM) [2] was used in conjunction with this effort. Support vector machines are a widely-used linear machine learning technique that relies on ideas from statistical learning theory to provide good generalization performance. Support vector machines can also be used in the context of problems that are not linearly seperable by projecting data into a higher dimensional space where the data may be linearly seperable. A non-linear support vector machine with a radial basis function kernel was used in this study. The metric used to evaluate classification performance is the Area under the Receiver Operating Characteristic (ROC) curve [2]. ROC curves plot the ratio of True Positives to False Positives as a threshold for discriminating between targets and distractors is varied. It is widely used to evaluate human and machine signal detection capabilities. Perfect classification produces an Area under the curve value (Az) of 1.0, while chance performance produces an Az value of 0.5. GENERALIZATION OVER TIME, WITHIN SUBJECT As mentioned earlier, many single trial ERP classification studies have relied on a leave-one-out testing procedure to assess their classification approach. Such an approach largely circumvents well known problems associated with changes in the statistical properties of biological signals over time. In contrast, this study examined the capacity of a classifier to generalize over experimental sessions that span over an hour. A classifier trained on data from the first 20 minute session generalized well to data from a 20 minute test session separated by over a twenty minute gap. For Participant 1, an SVM produced an area under the receiver operating curve (Az) of 0.90. Results for Participant 2 were similar an SVM produced an Az value of 0.91. These results suggest that the features that help discriminate between EEG in epochs with target and distractors remain stable over the course of sessions distributed over the span of an hour.

6 GENERALIZATION ACROSS INDIVIDUALS In order to test the ability of the ERP detection approach described here to generalize across participants both the training and test data for Participant 1 were used as training data for the classifier. Both the training and testing data for Participant 2 were combined into a test data set. An SVM trained and tested with this data produced an Az value of 0.84. Similar results were obtained when the training and test dataset were reversed(training and test data for Participant 2 were aggregated into a training set, while training and test data for Participant 1 were aggregated into a test set). An Az value of 0.85 was obtained when training and test data were reversed. These results suggest that there is a stable and similar pattern of spatio temporal electrical activity induced by targets and distractors in both individuals. Whether these cross-subject classification results are sufficient for a triage platform will depend on the application domain and the cost of misclassification. Discussion The results presented here suggest that evoked response potentials detected in conjunction with rapid serial visual presentation of images offer a viable interaction modality for searching through complex imagery efficiently and accurately. Classification results suggest that features that serve to discriminate among conditions remain stable over time and may be shared among individuals. Additionally, the results reported here suggest that an array of 32 EEG electrodes may provide a sufficient set of features for detecting ERPs with a high degree of accuracy. Future work will assess the efficacy of the image triage approach described here across a wider group of participants. Acknowledgements This research was funded by the Defense Advanced Research Projects Agency (DARPA) and the National Geospatial Intelligence Agency (NGA). This paper has been reviewed by DARPA and NGA and Approved for Public Release, Distribution Unlimited. References [1] Delorme A and Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics J Neuroscience Methods, 134:9-21, 2004 [2] Duda, R.O, Hart, R. E. and Stork D. G. Pattern Classification, Second Edition. John Wiley & Sons, New York, 2001 [3] Gerson, A.D., Parra, L.C., and Sajda, P. (2005) Cortical Origins of Response Time Variability during Rapid Discrimination of Visual Objects, NeuroImage, 28 (2) 326-341. [4] Kenyon, H. S. (2003). Unconventional Information Operations Shorten Wars. Signal Magazine. Armed Forces Communications and Electronics Association. [5] Makeig S, Westerfield M, Jung T-P, Enghoff S, Townsend J (2002) Dynamic brain sources of visual evoked responses. Science 295: 690 693 [6] Parra, L., Alvino, C., Tang, A., Pearlmutter, B., Yeung, N., Osman, A., and Sajda, P. (2003), Single Trial Detection in EEG and MEG: Keeping it Linear. Neurocomputing. (52-54) 177-183. [7] Spence, R., Witkowski, M., Craft, B., de Bruijn, O. & Fawcett, C. (2004) International Conference Advanced Visual Interfaces (AVI-2004). pp. 141-149 [8] Thorpe,S., Fize, D. & Marlot, C.(1996). Speed of processing in the human visual system. Nature, 381, 520-522.