Topics. Experiment Terminology (Part 1)

Similar documents
Designing A User Study

Empirical Research Methods for Human-Computer Interaction. I. Scott MacKenzie Steven J. Castellucci

Human Computer Interaction - An Introduction

Empirical Research Methods for Human-Computer Interaction

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

Last week: Introduction, types of empirical studies. Also: scribe assignments, project details

Designing Experiments... Or how many times and ways can I screw that up?!?

Research Landscape. Qualitative = Constructivist approach. Quantitative = Positivist/post-positivist approach Mixed methods = Pragmatist approach

Who? What? What do you want to know? What scope of the product will you evaluate?

CHAPTER 8 EXPERIMENTAL DESIGN

Single-Factor Experimental Designs. Chapter 8

The hearing aid that opens up your world

Getting the Design Right Daniel Luna, Mackenzie Miller, Saloni Parikh, Ben Tebbs

Contents. The language of disability 1. Suggested terminology 2. Experience of disability 3 (change to hearing aid/earplugs) Sighted guide 4

HCI Lecture 1: Human capabilities I: Perception. Barbara Webb

Designing Psychology Experiments: Data Analysis and Presentation

Two Experiments on Co-located Mobile Groupware

Correlating Trust with Signal Detection Theory Measures in a Hybrid Inspection System

Lecture 3. Previous lecture. Learning outcomes of lecture 3. Today. Trustworthiness in Fixed Design Research. Class question - Causality

The future of face to face interviewing. Michael F. Schober

Touch Behavior Analysis for Large Screen Smartphones

User Guide V: 3.0, August 2017

Human Performance Model. Designing for Humans. The Human: The most complex of the three elements. The Activity

Measuring the User Experience

Where No Interface Has Gone Before: What Can the Phaser Teach Us About Label Usage in HCI?

Learning Objectives. AT Goals. Assistive Technology for Sensory Impairments. Review Course for Assistive Technology Practitioners & Suppliers

Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN

Evaluation: Scientific Studies. Title Text

Best Practice Principles for teaching Orientation and Mobility skills to a person who is Deafblind in Australia

HUMAN-COMPUTER INTERACTION EXPERIMENTAL DESIGN

9 research designs likely for PSYC 2100

Evaluation: Controlled Experiments. Title Text

Unit 1: Introduction to the Operating System, Computer Systems, and Networks 1.1 Define terminology Prepare a list of terms with definitions

Foundations of experimental research

Evaluating Tactile Feedback in Graphical User Interfaces

OFFICE WORKSTATION DESIGN

RESTore TM. Clinician Manual for Single User. Insomnia and Sleep Disorders. A step by step manual to help you guide your clients through the program

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

GYMTOP USB PROFESSIONAL 20143

myphonak app User Guide

What is Research? Independent research proves our Internet service is the fastest and most reliable period.

ACCESSIBILITY FOR THE DISABLED

Designing Psychology Experiments: Data Analysis and Presentation

Quantitative Evaluation

Two-Way Independent ANOVA

Experiments. Outline. Experiment. Independent samples designs. Experiment Independent variable

Modeling Visual Search Time for Soft Keyboards. Lecture #14

VPAT Summary. VPAT Details. Section Telecommunications Products - Detail. Date: October 8, 2014 Name of Product: BladeCenter HS23

Improving Informed Consent to Clinical Research. Charles W. Lidz Ph.D.

ITU-T. FG AVA TR Version 1.0 (10/2013) Part 3: Using audiovisual media A taxonomy of participation

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

MCC Human Machine lnterface

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

Interpretype Video Remote Interpreting (VRI) Subscription Service White Paper September 2010

James L. Pehringer, Au.D. The Top 10 Things You Must Know Before Choosing Your. Audiologist. Hearing Solutions Group

Difficulty judging body positioning in relation to objects in the environment

ChildFit. Widex Baby. Compass quick guide

PSY 3393 Experimental Projects Spring 2008

PERSONAL COMPUTER WORKSTATION CHECKLIST

ReSound Forte and ReSound Smart 3D App For Android Users Frequently Asked Questions

Audibel A3i The Made for iphone hearing aid

Ergonomic Equipment Guide

Making things work: Visual Structure

UK Biobank. Hearing Speech-in-Noise Test. Version August 2012

Modeling and Implementing an Adaptive Human-Computer Interface Using Passive Biosensors

Hearing Control App User Guide

Free Data! Using Publicly- Available Data for Your Research Project

Measuring impact. William Parienté UC Louvain J PAL Europe. povertyactionlab.org

Thrive Hearing Control Application

Human Abilities 1. Understanding the user

Human Information Processing. CS160: User Interfaces John Canny

SAMPLE. 1. Explain how you would carry out an experiment into the effect playing video games has on alertness.

1. Before starting the second session, quickly examine total on short form BDI; note

Final Exam: PSYC 300. Multiple Choice Items (1 point each)

NEXTGEN ICD10 TIPS DEMONSTRATION

20. Experiments. November 7,

Human Information Processing

TruLink Hearing Control App User Guide

Psychology Research Process

Data Management System (DMS) User Guide

Predictive Model for Group Selection Performance on Touch Devices

Experimental Research I. Quiz/Review 7/6/2011

Music. listening with hearing aids

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Introduction Can we improve memory for nonsense pics w/ cue? Method Study: See 28 pics (w/ or w/o label) Test: Immediate recall (draw)

Wi Series Wireless Hearing Products

Appendix B. Nodulus Observer XT Instructional Guide. 1. Setting up your project p. 2. a. Observation p. 2. b. Subjects, behaviors and coding p.

A longitudinal study of text entry by gazing and smiling. Titta Rintamäki

UK Biobank. Arterial Pulse-Wave Velocity. Version th April 2011

Elluminate and Accessibility: Receive, Respond, and Contribute

Tips for Youth Group Leaders

TriAuto mini Compact, Cordless Endodontic Handpiece

the human 1 of 3 Lecture 6 chapter 1 Remember to start on your paper prototyping

VIEW AS Fit Page! PRESS PgDn to advance slides!

TruLink Hearing Control App User Guide

Marking Period 3-5 week course- Activity Selection. Marking Period 1- One Marking Period dedicated to Health

Issues with Designing Dementia-Friendly Interfaces

Video Captioning Basics

Dräger Alcotest Handheld Product Suite

Transcription:

Topics The what, why, and how of empirical research Group participation in a real experiment Observations and measurements Research questions Experiment terminology Experiment design Hypothesis testing (ANOVA, etc.) Parts of a research paper 44 Experiment Terminology (Part 1) Terms to know Participant Independent variable (test conditions) Dependent variable (measured behaviors) Control and random variables Confounding variable Task Procedure Within subjects vs. between subjects Counterbalancing Latin square 45

Participant The people participating in an experiment are referred to as participants (the term subjects is also acceptable 1 ) When referring specifically to the experiment, use participants (e.g., all participants exhibited a high error rate ) General discussion on the problem or conclusions may use other terms (e.g., these results suggest that users are less likely to ) Report the selection criteria and give relevant demographic information or prior experience 1 APA. (2010). Publication Manual of the American Psychological Association (6 th ed.) Washington, DC: APA, p. 73. 46 How Many Participants Use the same number of participants as used in similar research 1 Too many participants and you get statistically significant results for differences of no practical significance Too few participants and you fail to get statistically significant results when there really is an inherent difference between the test conditions 1 Martin D.W. (2004). Doing psychology experiments (6 th ed.). Belmont, CA: Wadsworth, p. 234. 47

Independent Variable An independent variable (IV) is a circumstance or characteristic that is manipulated in an experiment to elicit a change in a human response while interacting with a computer. Independent because it is independent of participant behavior (i.e., there is nothing a participant can do to influence an independent variable) Examples: interface, device, feedback mode, button layout, visual layout, age, gender, background noise, expertise, etc. The terms independent variable and factor are synonymous 48 Test Conditions An independent variable (IV) must have at least two levels The levels, values, or settings for an IV are the test conditions Name both the factor (IV) and its levels (test conditions): Factor (IV) Levels (test conditions ) Device Feedback mode Task Visualization Search interface mouse, trackball, joystick audio, tactile, none pointing, dragging 2D, 3D, animated Google, custom 49

Human Characteristics Human characteristics are naturally occurring attributes Examples: Gender, age, height, weight, handedness, grip strength, finger width, visual acuity, personality trait, political viewpoint, first language, shoe size, etc. They are legitimate independent variables, but they cannot be manipulated in the usual sense Causal relationships are difficult to obtain due to unavoidable confounding variables 50 How Many IVs? An experiment must have at least one independent variable Possible to have 2, 3, or more IVs But the number of effects increases rapidly with the size of the experiment: Advice: Keep it simple (1 or 2 IVs, 3 at the most) (see HCI:ERP for further discussion) 51

Dependent Variable A dependent variable is a measured human behaviour (related to an aspect of the interaction involving an independent variable) Dependent because it depends on what the participant does Examples: task completion time, speed, accuracy, error rate, throughput, target re-entries, task retries, presses of backspace, etc. Dependent variables must be clearly defined Research must be reproducible! 52 Unique DVs Any observable, measurable behaviour is a legitimate dependent variable (provided it has the potential to reveal differences among the test conditions) So, feel free to roll your own Example: negative facial expressions 1 Application: user difficulty with mobile games Events logged included frowns, head shaking Counts used in ANOVA, etc. Clearly defined reproducible 1 Duh, H. B.-L., Chen, V. H. H., & Tan, C. B. (2008). Playing different games on different phones: An empirical study on mobile gaming. Proc MobileHCI 2008, 391-394. 53

Data Collection Obviously, the data for dependent variables must be collected in some manner Ideally, engage the experiment software to log timestamps, key presses, button clicks, etc. Planning and pilot testing important Ensure conditions are identified, either in the filenames or in the data columns Examples: (next two slides) 54 FittsTouch-P01-S01-B01-G01-C01-1D.sd1 The "sd1" file is low-level raw data with one row for each trial. (Note: "sd" = summary data) The "sd2" file summaries each sequence of trials in a single line (next slide). 55

FittsTouch-P01-S01-B01-G01-C01-1D.sd2 The data above are full-precision, comma-delimited. Here are the data in Excel: Click here to see all the data from this experiment collected together in Excel. Click here to see the final published paper. 1 1 MacKenzie, I. S. (2015). Fitts' throughput and the remarkable case of touch-based target selection. Proc HCII 2015, pp. 238-249. 56 Control Variable A control variable is any circumstance (not under investigation) that is kept constant while testing the effect of an independent variable on a dependent variable More control means the experiment is less generalizable (i.e., less applicable to other people and other situations) Consider an experiment on the effect of font color and background color on reading comprehension Independent variables: font color, background color Dependent variable: comprehension test score Control variables Font size (e.g., 12 point) Font family (e.g., Times) Ambient lighting (e.g., fluorescent ceiling light) Etc. 57

Random Variable A random variable is a circumstance that is allowed to vary randomly More variability is introduced in the measures (that s bad!), but the results are more generalizable (that s good!) Research question: Does user stance affect performance while playing Guitar Hero? Independent variable: stance (standing, sitting) Dependent variable: score on songs Random variables Prior experience playing a real musical instrument Prior experience playing Guitar Hero Amount of coffee consumed prior to testing Etc. 58 Control vs. Random Variables There is a trade-off which can be examined in terms of internal validity and external validity (see below) 59

Confounding Variable A confounding variable is a circumstance that varies systematically with an independent variable Should be controlled or randomized to avoid misleading results Consider a study comparing the target selection performance of a mouse vs. a gamepad where all participants are mouse experts, but gamepad novices A performance difference may emerge, but Prior experience is a confounding variable No reliable conclusions can be made on the inherent differences between the mouse and gamepad 60 Topics The what, why, and how of empirical research Group participation in a real experiment Observations and measurements Research questions Experiment terminology Experiment design Hypothesis testing (ANOVA, etc.) Parts of a research paper 61

Experiment Design Experiment design is the process of deciding What variables to use What tasks and procedures to use How many participants to use and how to solicit them Etc. Let s continue with some terminology 62 Experiment Terminology (Part 2) Terms to know Participant Independent variable (test conditions) Dependent variable (measured behaviors) Control and random variables Confounding variable Task Procedure Within-subjects vs. between-subjects Counterbalancing Latin square 63

Experiment Task The task is what participants do Recall the definition of an independent variable: a circumstance or characteristic that is manipulated in an experiment to elicit a change in a human response while interacting with a computer The experiment task must elicit a change Qualities of a good task: represent, discriminate Represent activities people do with the interface Improves external validity Discriminate among the test conditions Improves internal validity (i.e., increases likelihood of a statistically significant outcome) 64 Task Examples Usually the task is self-evident (follows directly from the research idea) Research idea a new graphical method for entering equations in a spreadsheet Experiment task insert an equation using (a) the graphical method and (b) the conventional method Research idea an auditory feedback technique for programming a GPS device Experiment task program a destination location into a GPS device using (a) the auditory feedback method and (b) the conventional method 65

Knowledge-based Tasks Most experiment tasks are performance-based or skillbased (e.g., inserting an equation, programming a destination location; see previous slide) Some tasks are knowledge-based (e.g., Use an Internet search interface to find the birth date of Albert Einstein. ) In this case, participants become contaminated (in a sense) after the first run of task, since they have acquired the knowledge Experimentally, this poses problems (beware!) A creative approach is needed (e.g., for the other test condition, slightly change the task; of William Shakespeare ) 66 Procedure The procedure encompasses everything that occurs with participants The procedure includes the experiment task (obviously), but everything else as well Arriving, welcoming Signing a consent form Instructions given to participants about the experiment task (next slide) Demonstration trials, practice trials Rest breaks Administering of a questionnaire or an interview 67

Instructions Very important (best to prepare in advance; write out) Typical instructions: proceed as quickly and accurately as possible but at a pace that is comfortable Other instructions are fine, as per the goal of the experiment or the nature of the tasks, but Give the same instructions to all participants If a participant asks for clarification, do not change the instructions in a way that may cause the participant to behave differently from the other participants 68 Within-subjects, Between-subjects (1) Two ways to assign conditions to participants: Within-subjects each participant is tested on each condition (aka repeated measures) Between-subjects each participant is tested on one condition only Examples: Within-subjects Between-subjects 69

Within-subjects, Between-subjects (2) Within-subjects advantages Fewer participants (easier to recruit, schedule, etc.) Less variation due to participants No need to balance groups (because there is only one group!) Within-subjects disadvantage Order effects (i.e., interference between conditions) Between-subjects advantage No order effects (i.e., no interference between conditions) Between-subjects disadvantage More participants (harder to recruit, schedule, etc.) More variation due to participants Need to balance groups (to ensure they are more or less the same) 70 Within-subjects, Between-subjects (3) Sometimes A factor must be assigned within-subjects Examples: block, session (if learning is the IV) A factor must be assigned between-subjects Examples: gender, handedness There is a choice In this case, the balance tips to within-subjects (see previous slide) With two factors, there are three possibilities: both factors within-subjects both factors between-subjects one factor within-subjects + one factor between-subjects (this is a mixed design) 71

Counterbalancing For within-subjects designs, participants may benefit from the first condition and consequently perform better on the second condition we don t want this! To compensate, the order of presenting conditions is counterbalanced Participants are divided into groups, and a different order of administration is used for each group The order is best governed by a Latin Square (next slide) Group, then, is a between subjects factor Was there an effect for group? Hopefully not! 72 Latin Square The defining characteristic of a Latin Square is that each condition occurs only once in each row and column Examples: 3 X 3 Latin Square 4 x 4 Latin Square 4 x 4 Balanced Latin Square A B C B C A C A B A B C D B C D A C D A B D A B C A B C D B D A C D C B A C A D B Note: In a balanced Latin Square each condition both precedes and follows each other condition an equal number of times 73

Succinct Statement of Design 3 x 2 within-subjects design An experiment with two factors, having three levels on the first, and two levels on the second There are six test conditions in total Both factors are within-subjects, meaning each participants was tested on all conditions A mixed design is also possible The levels for one factor are administered to all participants (within-subjects), while the levels for another factor are administered to separate groups of participants (between-subjects) 74