Making the Best of Imperfect Data: Reflections on an Ideal World. Mike Ambinder, PhD CHI PLAY 2014 October 20 th, 2014

Similar documents
MS&E 226: Small Data

Lecture 5 Conducting Interviews and Focus Groups

You must answer question 1.

More about inferential control models

Epidemiological study design. Paul Pharoah Department of Public Health and Primary Care

Psychology of Dysfunctional Behaviour RESEARCH METHODS

Scientific Inquiry Section 1: Length & Measurement ruler or meter stick: equipment used in the lab to measure length in millimeters, centimeters or

Research in Real-World Settings: PCORI s Model for Comparative Clinical Effectiveness Research

Seminar 3: Basics of Empirical Research

The Importance of Exercises in Your Business Continuity and Crisis Management Programs B3

RESEARCH METHODS. Winfred, research methods, ; rv ; rv

CSC2130: Empirical Research Methods for Software Engineering

RESEARCH METHODS. Winfred, research methods,

PSYC1024 Clinical Perspectives on Anxiety, Mood and Stress

Psych 1Chapter 2 Overview

Fitting the Method to the Question

MODULE 3 APPRAISING EVIDENCE. Evidence-Informed Policy Making Training

Evidence Informed Practice Online Learning Module Glossary

Overview of Study Designs

Behavioural Indicators That Could Be Used To Detect Interviewee Deception In The Conduct Of Investigative Interviews

Module 4 Introduction

About Reading Scientific Studies

MATH& 146 Lesson 6. Section 1.5 Experiments

Hypothesis-Driven Research

Previous Example. New. Tradition

Fitting the Method to the Question

CHECKLIST FOR EVALUATING A RESEARCH REPORT Provided by Dr. Blevins

Chapter 1. Introduction

Introduction to Applied Research in Economics

Chapter-2 RESEARCH DESIGN

The research process is a fascinating method for increasing the understanding of our world.

MSc Psychological Research Methods/ MPsych Advanced Psychology Module Catalogue / 2018

What Is Science? Lesson Overview. Lesson Overview. 1.1 What Is Science?

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Research Landscape. Qualitative = Constructivist approach. Quantitative = Positivist/post-positivist approach Mixed methods = Pragmatist approach

In this chapter we discuss validity issues for quantitative research and for qualitative research.

Observation process of obtaining info by using senses

The essential focus of an experiment is to show that variance can be produced in a DV by manipulation of an IV.

How Does Analysis of Competing Hypotheses (ACH) Improve Intelligence Analysis?

The Regression-Discontinuity Design

How rational are humans? Many important implications hinge. Qu a r t e r ly Jo u r n a l of. Vol. 15 N o Au s t r i a n.

Intro to HCI evaluation. Measurement & Evaluation of HCC Systems

COUNSELING PSYCHOLOGY. Spring 2004

Phil 12: Logic and Decision Making (Winter 2010) Directions and Sample Questions for Final Exam. Part I: Correlation

Endogeneity is a fancy word for a simple problem. So fancy, in fact, that the Microsoft Word spell-checker does not recognize it.

Experimental Validity

HUMAN-COMPUTER INTERACTION EXPERIMENTAL DESIGN

Process of Designing & Implementing a Research Project

An Overview of a MEG Study

Linking Theoretical and Empirical Accounting Research Part I

Measuring the User Experience

Does the knowledge of Synthetic Biology effect its acceptance?

You can use this app to build a causal Bayesian network and experiment with inferences. We hope you ll find it interesting and helpful.

Experimental Political Science and the Study of Causality

Who? What? What do you want to know? What scope of the product will you evaluate?

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER

Is therapy a realistic option at the present time? Felipe Fregni LEASE DO NOT COPY. Spauding Neuromodulation Center Harvard Medical School

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

Psychology 205, Revelle, Fall 2014 Research Methods in Psychology Mid-Term. Name:

DAY 2 RESULTS WORKSHOP 7 KEYS TO C HANGING A NYTHING IN Y OUR LIFE TODAY!

Design of Experiments & Introduction to Research

Cross-Lagged Panel Analysis

A. Indicate the best answer to each the following multiple-choice questions (20 points)

Active User Affect Recognition and Assistance

Chapter 11: Behaviorism: After the Founding

Chapter 11: Experiments and Observational Studies p 318

Tutorial. Understanding the Task. People don t often read editorials critically, believing the writer may know more about the subject than they do.

Introduction to Observational Studies. Jane Pinelis

Actor (and network) models

1. What is the I-T approach, and why would you want to use it? a) Estimated expected relative K-L distance.

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

CHAPTER 2 APPLYING SCIENTIFIC THINKING TO MANAGEMENT PROBLEMS

Lecture 9 Internal Validity

UMbRELLA interim report Preparatory work

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

investigate. educate. inform.

Conflict Dynamics Profile

Progress in Risk Science and Causality

Structural Approach to Bias in Meta-analyses

Experimental Psychology

Penn State MPH Program Competencies 2013

Healthcare lessons learned from Fantasy Football. Joshua W. Axene, ASA, FCA, MAAA

Conducting Research in the Social Sciences. Rick Balkin, Ph.D., LPC-S, NCC

Chapter 13 Summary Experiments and Observational Studies

Chapter 02 Producing Knowledge About Sports in Society: What is the Role of Research and Theory?

The validity of inferences about the correlation (covariation) between treatment and outcome.

Dr. Allen Back. Oct. 7, 2016

PLANNING THE RESEARCH PROJECT

Statistical Issues in Road Safety Part I: Uncertainty, Variability, Sampling

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

Strategies for handling missing data in randomised trials

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Measurement and meaningfulness in Decision Modeling

State-of-the-art Strategies for Addressing Selection Bias When Comparing Two or More Treatment Groups. Beth Ann Griffin Daniel McCaffrey

Research Design. Beyond Randomized Control Trials. Jody Worley, Ph.D. College of Arts & Sciences Human Relations

Fundamental Clinical Trial Design

Checklist for Randomized Controlled Trials. The Joanna Briggs Institute Critical Appraisal tools for use in JBI Systematic Reviews

Transcription:

Making the Best of Imperfect Data: Reflections on an Ideal World Mike Ambinder, PhD CHI PLAY 2014 October 20 th, 2014

Making the Best of Imperfect Data Turning the information we gather into a less biased, more complete, and wider-ranging set of meaningful conclusions.

Reflections on an Ideal World If we think about where our problems lie, we don t ignore them, and we improve our understanding. If we think about what could be possible, then we re innovating and expanding the scope of our profession. If we understand how to link between these two approaches, then we re making progress.

Overview The Current State of Affairs The Path Between The Future State of Affairs

The Current State of Affairs How we get data now Issues we face Information we lack

Data Collection Playtests Direct Observation Think Aloud Usability Q&A Metrics Descriptive statistics Quantifying behavior http://imgarcade.com/1/research-bias/

Data Collection - Alternative Methodologies Surveys Measurement of Opinion Eyetracking - Measurement of Attention Physiological Signals Measurement of Emotion Experiments Measurement of Effect

Data Analysis Statistical Tests T-Tests, Correlations, Regressions, ANOVAs, Frequentist vs. Bayesian Data Mining - Exploratory Machine Learning - Automated

http://qe-design.com/machine-learning/

Issues Biases in data Lack of resources Experimentation shortfalls

Data Interpretation What biases are present? What biases might be present? What biases can be removed?

https://www.youtube.com/watch?v=hefjkqkcvpo

Playtest Biases Playtesters know the company Playtesters know the game Playtesters know the genre Playtesters play in groups Playtesters are not unbiased Want to impress Want to profess WE are not unbiased

Data Source Biases Forums Emails In-game surveys Contaminated samples Expectations http://xkcd.com/386/

Metric Biases Incomplete/Missing Confounded Incorrect Violates statistical assumptions

Resource Constraints Time Money Equipment Expertise Hardware/Storage space Capability to eliminate issues

Tradeoffs Which questions do we focus on and why? Where do we allocate resources? How do we decide to do so?

Experimentation Correlational vs. Causal Retrospective vs. Prospective Do we have a valid control? Do we have a valid test condition? http://prologuegames.com/surface-pro-2/

Experimentation Failures Confounds Conflation of effects Inability to isolate effects Results-oriented experimentation Lack of iteration Not operationalizing success Lack of statistical power

What Information Are We Missing? Player sentiment Rationales for player choice Purchasing decisions Factors which contribute to game popularity Measuring game balance Operationalizing fun Operationalizing immersion...

Incomplete Information Behavioral correlates For sentiment For choice For explanations Measurable behavioral correlates

The Path Between the Present and the Future Fixing the problems we face New data to gather New methodologies to try http://www.scilogs.com/from_the_lab_bench/out-of-hiding-the-artsy-scientists-mid-life-crisis/

Data Collection Clean up our datasets Match up methodologies with suitable data/questions Identify confounds http://tay.kotaku.com/5-reasonably-serious-step-to-not-going-bankrupt-this-st-1593698005

Data Analysis Add technical ability Improve the usability of statistical tools Add automatization of analysis

Data Interpretation Be cognizant of biases Honest about their existence And work to remove them

Experiments The Scientific Method Understand the limitations of what we re doing Retrospective vs. Prospective Correlational vs. Causal Iteration Add theories/knowledge to the world

http://www.tomatosphere.org/teacher-resources/teachers-guide/principal-investigation/scientific-method.cfm

New Data Real-time analysis More complete data Experimental groups

Realtime Analysis Contextual surveys In-game manipulations Player sentiment

Complete Datasets Increases in computing power Smarter inferences Spatial analysis

Experimental Groups Segmentation of players Sample sizes in the millions Valid controls Valid test conditions http://pando.com/2013/10/21/stuck-in-the-social-media-matrix-pop-that-red-pill/

Novel Methodologies - Physiological Biofeedback Measurements of arousal Measurements of valence Eyetracking Posture? Gestures? Smell? EEGs?

Novel Methodologies Analytical Structured Q&As? Improved Surveys? Remove bias from verbal reports Letting people play for as long as they want Maybe the single biggest change we could implement Ways to test multiplayer What haven t we thought of?

The Future State of Affairs The perfect world Are any problems unsolvable? Is any data ungatherable? http://www.exacttarget.com/blog/wp-content/uploads/2014/06/advertising_3_w10241.jpeg

The Perfect World Our data is clean Our data is complete Our data is free from bias Our data is adaptable We can measure anything

The Perfect World Data Collection All data is collected Instantaneously available for analysis Confounds are readily identifiable People give more honest responses

The Perfect World Data Analysis Appropriate analyses are readily identifiable and easily applied Tools are available to easily explore datasets Automated processes are constantly pattern matching

The Perfect World Data Interpretation We can account for biases We can remove biases We can prevent biases http://nonparibus.files.wordpress.com/2013/04/hutz-hearsay.gif

The Perfect World Constraint Removal Time Money Equipment Expertise Hardware/Storage Space Capability

Questions We Can Answer Player sentiment Player choice Purchase decisions Game popularity Game balance Fun Immersion

Questions We Can t Answer

What Problems Remain? This is a theoretical exercise Constraints will always exist Tools will never be perfect Data will always be biased

What Data Remains? Behavioral correlates Access to introspective processes Measurement of situational factors

The Quickest Path Be aware of biases Acquire statistical abilities Information exchanges Never stop questioning

The Biggest Wins Let people play as long as they want Be aware of biases Measure physiological signals Record more data Experiment

If it can be destroyed by the truth, it deserves to be destroyed by the truth - Carl Sagan

Thanks! mikea@valvesoftware.com