Crowdsourcing & Human Computation. Matt Lease School of Information University of Texas at Austin

Crowdsourcing & Human Computation Matt Lease School of Information University of Texas at Austin ml@ischool.utexas.edu

Amazon Remembers
J. Pontin. Artificial Intelligence, With Help From the Humans. NY Times (March 25, 2007)

Amazon Mechanical Turk (MTurk)
- Micro-task crowdsourcing marketplace
- On-demand, scalable, real-time workforce
- Online since 2005 (and still in "beta")
- Programmer's API & Dashboard GUI

From Outsourcing to Crowdsourcing
- http://www.mturk-tracker.com (P. Ipeirotis '10)
- From 1/09 to 4/10: 7M HITs from 10K requesters worth $500,000 USD (significant under-estimate)

Who are the workers?
- A. Baio. November 2008. The Faces of Mechanical Turk.
- P. Ipeirotis. March 2010. The New Demographics of Mechanical Turk.
- J. Ross, et al. Who are the Crowdworkers?... CHI 2010.

Road Map
- Example and Introduction
- Crowdsourcing
- Human Computation
- Rethinking Application Design
- The Road Ahead
- Wrap-up

Crowdsourcing

Crowdsourcing
- Take a job traditionally performed by a known agent (often an employee)
- Outsource it to an undefined, generally large group of people via an open call
- New application of principles from the open source movement

Other Crowdsourcing Examples

Human Computation

The Mechanical Turk
The original, constructed and unveiled in 1770 by Wolfgang von Kempelen (1734-1804)

The Turing Test (Alan Turing, 1950)

The Turing Test (Alan Turing, 1950)

What is a Computer?

What was old becomes new
- Crowdsourcing: A New Branch of Computer Science (March 29, 2011)
- Princeton University Press, 2005

Davis et al. (2010). The HPU.

Human Computation
People become computists once more
Do tasks computers cannot (do well)
1. Detect robots (CAPTCHA: "reverse Turing test")
2. Micro-tasks and data labeling (at scale)
   Game changer for improving practical AI: starving for data
3. Rethink what is possible in application design
   Integrate CPU + HPU = new capabilities

Blending Automation & Human Computation

Soylent: A Word Processor with a Crowd Inside
Bernstein et al., UIST 2010

Translation by monolingual speakers
C. Hu, CHI 2009

fold.it
S. Cooper et al. (2010).

Invisible By-product
L. von Ahn et al. (2008). reCAPTCHA. In Science.

CrowdSearch and mCrowd
T. Yan, MobiSys 2010

Wisdom of Crowds Computing
Pre-conditions
- Diversity
- Independence
- Decentralization
- Aggregation
Input: large, diverse sample (increases likelihood of overall pool quality)
Output: consensus, selection, distribution
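
The aggregation step can be illustrated with a minimal sketch, assuming the simplest possible rule: majority voting over categorical labels from independent workers. The function names, the tie-breaking rule, and the toy judgments are all hypothetical and are not taken from the slides. The sketch produces two of the listed outputs, a consensus label and a label distribution.

```python
from collections import Counter


def label_distribution(labels):
    """Return each label's share of the votes for one item."""
    if not labels:
        raise ValueError("need at least one label")
    counts = Counter(labels)
    return {label: n / len(labels) for label, n in counts.items()}


def majority_vote(labels):
    """Return the most frequent label; ties go to the alphabetically first label."""
    dist = label_distribution(labels)
    # sorted() fixes the iteration order, and max() keeps the first maximum it sees.
    return max(sorted(dist), key=dist.get)


# Hypothetical judgments: item id -> labels from independent workers.
judgments = {
    "doc1": ["relevant", "relevant", "not_relevant"],
    "doc2": ["not_relevant", "not_relevant", "not_relevant"],
}
consensus = {item: majority_vote(labels) for item, labels in judgments.items()}
print(consensus)
```

Majority voting treats every worker as equally reliable; weighting votes by estimated worker accuracy is a common refinement but is outside what the slide states.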

Unreasonable Effectiveness of Data
- Massive free Web data changed how we train learning systems
  Banko and Brill (2001). Human Language Tech.
  Halevy et al. (2009). IEEE Intelligent Systems.
- How might access to cheap & plentiful labeled data change the balance again?

CrowdForge: MapReduce for Automation + Human Computation
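
The MapReduce analogy can be sketched as follows, assuming a three-step flow in which a complex task is partitioned into small units, each unit is handled independently, and the partial results are merged. This is an illustration of the general idea only, not CrowdForge's actual interface. Every function name here is hypothetical, and the "workers" are local Python functions standing in for human tasks.

```python
def crowd_mapreduce(task, partition, map_step, reduce_step):
    """Run a partition -> map -> reduce flow over one complex task."""
    subtasks = partition(task)                      # split into small units
    partials = [map_step(sub) for sub in subtasks]  # one unit per worker
    return reduce_step(partials)                    # merge partial results


# Stand-ins for human work (hypothetical): a real system would post each
# step to a crowd marketplace and wait for workers' answers.
def outline_topic(topic):
    return [f"{topic}: history", f"{topic}: applications"]


def write_paragraph(heading):
    return f"[paragraph about {heading}]"


def assemble(paragraphs):
    return "\n".join(paragraphs)


article = crowd_mapreduce("crowdsourcing", outline_topic, write_paragraph, assemble)
print(article)
```

Because each step is just a callable, any of them could be swapped for an automated function, which is one way to read "Automation + Human Computation" in the slide title.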

Wrap-up

Conclusions
- Shift in practice that's here to stay
- Fast, cheap, easy: collect data or do work
- Emerging phenomenon to study and guide
- New capabilities in application design from automation + human computation
- Hot area, fast changing, many open problems

Thank You!
Students
- Catherine Grady (ischool)
- Hyunjoon Jung (ECE)
- Adriana Kovashka (CS)
- Abhimanu Kumar (CS)
Omar Alonso, Microsoft Bing
Support
- John P. Commons
ir.ischool.utexas.edu/crowd
UT Mechanical Turk & Crowdsourcing Google Group

Resources & Upcoming Events
- Special issue of Springer's Information Retrieval journal on Crowdsourcing (papers due May 6, 2011)
Upcoming Conferences & Workshops
- 3rd HCOMP at AAAI 2011
- CHI 2011 Workshop (proceedings online)
- SIGIR 2011 Workshop
- TREC 2011 Crowdsourcing Track
- CrowdConf 2011 (TBA)

Data Collection

Snow et al. (2008). EMNLP
5 Tasks
- Affect recognition
- Word similarity
- Recognizing textual entailment
- Event temporal ordering
- Word sense disambiguation
22K labels for $26
High agreement between Turk annotations and expert gold labels
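
To put the cost figure in perspective, the short sketch below works out the per-label price implied by "22K labels for $26" and shows one simple way agreement with gold labels could be measured. Treating 22K as exactly 22,000 is an approximation, and the use of plain percent agreement is an assumption made for illustration, not necessarily the measure used in the paper. The toy labels are hypothetical.

```python
total_cost_usd = 26.0
num_labels = 22_000  # "22K" on the slide, so treat the result as approximate

cost_per_label = total_cost_usd / num_labels
labels_per_dollar = num_labels / total_cost_usd
print(f"~${cost_per_label:.4f} per label, ~{labels_per_dollar:.0f} labels per dollar")


def percent_agreement(crowd, gold):
    """Fraction of items where the crowd label equals the expert label."""
    if len(crowd) != len(gold) or not gold:
        raise ValueError("need two non-empty label lists of equal length")
    return sum(c == g for c, g in zip(crowd, gold)) / len(gold)


# Hypothetical toy labels, not data from the paper.
crowd_labels = ["yes", "no", "yes", "yes"]
gold_labels = ["yes", "no", "no", "yes"]
agreement = percent_agreement(crowd_labels, gold_labels)
print(agreement)
```

At roughly a tenth of a cent per label, collecting several redundant labels per item and aggregating them stays inexpensive.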

Example: Dialect Identification

Example: Spelling correction

Kovashka & Lease, CrowdConf '10

The Crowd

Models & Incentives
- Pay (e.g. MTurk)
- Fun (or avoid boredom)
- Socialize
- Earn acclaim/prestige
- Altruism
- Learn something new (e.g. English)
- Invisible by-product (e.g. reCAPTCHA)
- Create self-serving resource (e.g. Wikipedia)
Multiple incentives are often offered in tandem

Altruism
- Contribute knowledge
- Help others (who need knowledge)
- Help workers (e.g. SamaSource)
- Charity

Games with a Purpose (L. von Ahn)
- Players have fun, creators get data as by-product
- Distinct from Serious Gaming / Edutainment
  Player learning / training / education is by-product

Worker Demographics
2008-2009 studies found the workforce less global and diverse than previously thought
- US
- Female
- Educated
- Bored
- Money is secondary

2010 shows increasing diversity
47% US, 34% India, 19% other (P. Ipeirotis, March 2010)