Big Data and Sentiment Quantification: Analytical Tools and Outcomes

1 Big Data and Sentiment Quantification: Analytical Tools and Outcomes
Fabrizio Sebastiani, Istituto di Scienza e Tecnologie dell'Informazione, Consiglio Nazionale delle Ricerche, Pisa, IT (fabrizio.sebastiani@isti.cnr.it)
October 11, European University Institute, Firenze, IT
Download these slides at

2 Classification: A Primer
Classification (aka "categorization") is the task of assigning data items to groups ("classes") whose existence is known in advance; e.g., assigning newspaper articles to one or more of Home News, Politics, Economy, Lifestyles, Sports, or assigning comments about products to exactly one of Excellent, Good, Average, Poor, Disastrous.
Classification requires subjective judgment: assigning natural numbers to either Prime or NonPrime is not classification.
(Automatic) classification is usually tackled via supervised machine learning: a general-purpose learning algorithm trains (using a set of manually classified items) a classifier to recognize the characteristics an item should have in order to be attributed to a given class.

3 What is quantification? [1]
[1] Dodds, Peter et al. Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter. PLoS ONE, 6(12).

4 What is quantification? (cont'd)

5 What is quantification? (cont'd)
In many applications of classification, the real goal is determining the relative frequency (or: prevalence) of each class in the unlabelled data (quantification, a.k.a. supervised prevalence estimation). E.g.: Among the tweets about the next presidential elections, what is the fraction of pro-Democrat ones? Among the posts about the Apple Watch 3 posted on forums, what is the fraction of "very negative" ones? How have these percentages evolved over time?
This task has been studied within IR, ML, DM, and NLP, and has given rise to learning methods and evaluation measures specific to it.

6 The paradox of quantification
Is "classify and count" the optimal quantification strategy? No! A perfect classifier is also a perfect quantifier (i.e., estimator of class prevalence), but a good classifier is not necessarily a good quantifier (and vice versa).
[Table: FP and FN for Classifier A and Classifier B]
Paradoxically, we should choose quantifier B rather than quantifier A, since A is biased: when counting, false positives and false negatives offset each other, so what matters is how balanced they are rather than how few they are. This means that quantification should be studied as a task in its own right.
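The paradox can be illustrated with a small sketch. The FP/FN counts below are hypothetical (they are not the figures from the slide's table), and the helper name `classify_and_count` is introduced here only for illustration. The function derives the true prevalence, the prevalence estimated by counting positive predictions, and the classification accuracy from a contingency table.

```python
def classify_and_count(tp, fp, fn, tn):
    """Return (true prevalence, estimated prevalence, accuracy) for one classifier.

    The estimate is the fraction of items the classifier labels positive,
    i.e. the "classify and count" estimate.
    """
    total = tp + fp + fn + tn
    true_prev = (tp + fn) / total   # fraction of truly positive items
    est_prev = (tp + fp) / total    # fraction of items predicted positive
    accuracy = (tp + tn) / total
    return true_prev, est_prev, accuracy


# Hypothetical counts on 1000 items, 200 of which are truly positive.
a = classify_and_count(tp=180, fp=0, fn=20, tn=800)    # 20 errors, all false negatives
b = classify_and_count(tp=180, fp=20, fn=20, tn=780)   # 40 errors, FP equal to FN

for name, (true_prev, est_prev, acc) in (("A", a), ("B", b)):
    print(f"{name}: accuracy={acc:.3f} true={true_prev:.3f} estimated={est_prev:.3f}")
```

Under these assumed counts, A makes fewer classification errors but underestimates the prevalence, while B makes more errors yet recovers the prevalence exactly because its two error types cancel.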

7 Vapnik's Principle
Key observation: classification is a more general problem than quantification.
Vapnik's principle: "If you possess a restricted amount of information for solving some problem, try to solve the problem directly and never solve a more general problem as an intermediate step. It is possible that the available information is sufficient for a direct solution but is insufficient for solving a more general intermediate problem."
This suggests solving quantification directly (without solving classification as an intermediate step), with the goal of achieving higher quantification accuracy than if we opted for the indirect solution.

8 What is quantification? (cont'd)
Quantification may also be defined as the task of approximating a true distribution by a predicted distribution.
[Figure: bar chart over the classes Very Negative, Negative, Neutral, Positive, Very Positive]
As a result, evaluation measures for quantification are divergences, which evaluate how much a predicted distribution diverges from the true distribution.
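As one concrete instance of a divergence, here is a minimal sketch of the Kullback-Leibler divergence (the KLD measure reported in the evaluation tables of this deck). The two five-class distributions are hypothetical, and the `eps` floor that guards against a zero predicted probability is an assumption of this sketch, not a smoothing scheme taken from the slides.

```python
import math


def kld(true_dist, pred_dist, eps=1e-12):
    """Kullback-Leibler divergence KL(true || predicted).

    Both arguments are equal-length sequences of class prevalences.
    `eps` (an assumption of this sketch) avoids division by zero when a
    predicted prevalence is exactly 0.
    """
    total = 0.0
    for p, q in zip(true_dist, pred_dist):
        if p > 0:  # terms with p == 0 contribute nothing
            total += p * math.log(p / max(q, eps))
    return total


# Hypothetical prevalences over five ordered sentiment classes.
true_dist = [0.10, 0.20, 0.40, 0.20, 0.10]
pred_dist = [0.05, 0.15, 0.45, 0.25, 0.10]

print(kld(true_dist, pred_dist))   # small positive value: the prediction is close
print(kld(true_dist, true_dist))   # a perfect prediction has zero divergence
```

Lower values mean a better quantifier, and the measure is 0 only when the predicted distribution coincides with the true one.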

9 Distribution drift
The need to perform quantification arises because of distribution drift, i.e., the presence of a discrepancy between the class distribution of Tr (the training set) and that of Te (the test set).
Distribution drift may arise when: the environment is not stationary across time and/or space and/or other variables, and the testing conditions are irreproducible at training time; the process of labelling training data is class-dependent (e.g., "stratified" training sets); the labelling process introduces bias in the training set (e.g., if active learning is used).
Distribution drift clashes with the IID assumption, on which standard ML algorithms are instead based.
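The effect of drift on a naive count can be sketched numerically. The sketch assumes that the classifier's true positive rate and false positive rate stay fixed while the class prevalence in Te changes; this is a simplifying assumption, and the rates used are hypothetical. Under it, the expected fraction of items labelled positive is prevalence × tpr + (1 − prevalence) × fpr.

```python
def expected_cc(prevalence, tpr, fpr):
    """Expected fraction of items labelled positive by a classifier with the
    given true/false positive rates, when the true prevalence is `prevalence`."""
    return prevalence * tpr + (1.0 - prevalence) * fpr


tpr, fpr = 0.8, 0.1  # hypothetical rates, assumed unchanged under drift

for p in (0.1, 0.3, 0.5, 0.7, 0.9):
    est = expected_cc(p, tpr, fpr)
    print(f"true prevalence={p:.1f}  expected count-based estimate={est:.2f}  error={est - p:+.2f}")
```

With these assumed rates, the count-based estimate is pulled toward the middle: it overestimates rare classes and underestimates frequent ones, and the error grows as the test prevalence moves away from the point where the two error types balance.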

10 Applications of quantification
A number of fields where classification is used are not interested in individual data, but in data aggregated across spatio-temporal contexts and according to other variables (e.g., gender, age group, religion, job type, ...); e.g.,
Social sciences: studying indicators concerning society and the relationships among individuals within it [2]. "[Others] may be interested in finding the needle in the haystack, but social scientists are more commonly interested in characterizing the haystack." (Hopkins and King, 2010)
Political science: e.g., predicting election results by estimating the prevalence of blog posts (or tweets) supporting a given candidate or party.
[2] D. Hopkins and G. King. A Method of Automated Nonparametric Content Analysis for Social Science. American Journal of Political Science 54(1), 2010.

11 Applications of quantification (cont'd)
Epidemiology: concerned with tracking the incidence and the spread of diseases; e.g., estimating pathology prevalence from clinical reports where pathologies are diagnosed, or estimating the prevalence of different causes of death from verbal accounts of symptoms.
Market research: concerned with estimating the distribution of consumers' attitudes about products, product features, or marketing strategies; e.g., quantifying customers' attitudes from verbal responses to open-ended questions.
Others: e.g., estimating the proportion of no-shows within a set of bookings, or estimating the proportions of different types of cells in blood samples.

12 Quantification methods
Quantification methods belong to two classes:
1. Aggregative: they require the classification of individual items as a basic step.
2. Non-aggregative: quantification is performed without performing classification.
Aggregative methods may be further subdivided into:
1a. Methods using general-purpose learners (i.e., originally devised for classification); these can use any supervised learning algorithm that returns posterior probabilities.
1b. Methods using special-purpose learners (i.e., especially devised for quantification).
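The aggregative family can be sketched with three simple estimators built on the output of any classifier that returns posterior probabilities. The names follow the way the acronyms CC, PCC and ACC are commonly expanded in the quantification literature (classify and count, probabilistic classify and count, adjusted classify and count); the posteriors, the 0.5 threshold, and the tpr/fpr values are all hypothetical, and this is an illustration rather than the implementation evaluated in these slides.

```python
def cc(labels):
    """Classify and count: fraction of items predicted positive (labels are 0/1)."""
    return sum(labels) / len(labels)


def pcc(posteriors):
    """Probabilistic classify and count: mean posterior of the positive class."""
    return sum(posteriors) / len(posteriors)


def acc(cc_estimate, tpr, fpr):
    """Adjusted classify and count: correct the CC estimate using true/false
    positive rates estimated on held-out labelled data, clipped to [0, 1]."""
    if tpr == fpr:  # the correction is undefined; fall back to the raw estimate
        return cc_estimate
    return min(1.0, max(0.0, (cc_estimate - fpr) / (tpr - fpr)))


# Hypothetical posterior probabilities returned by a classifier on 8 unlabelled items.
posteriors = [0.9, 0.8, 0.6, 0.4, 0.3, 0.2, 0.1, 0.1]
labels = [1 if p >= 0.5 else 0 for p in posteriors]  # assumed 0.5 decision threshold

print("CC :", cc(labels))
print("PCC:", pcc(posteriors))
print("ACC:", acc(cc(labels), tpr=0.8, fpr=0.1))     # tpr/fpr are assumed values
```

All three are aggregative methods of type 1a: they reuse a general-purpose classifier and differ only in how its individual decisions are aggregated into a prevalence estimate.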

13 Evaluating quantification methods
Quantification accuracy is often analysed by class prevalence...
Table: Accuracy as measured in terms of KLD on the 5148 test sets of RCV1-v2 grouped by class prevalence in Tr
RCV1-v2 VLP LP HP VHP All
SVM(KLD) 2.09E E E E E-03
PACC 2.16E E E E E-03
ACC 2.17E E E E E-03
MAX 2.16E E E E E-03
CC 2.55E E E E E-03
X 3.48E E E E E-03
PCC 1.04E E E E E-03
MM(PP) 1.76E E E E E-02
MS 1.98E E E E E-02
T E E E E E-02
MM(KS) 2.00E E E E E

14 Evaluating quantification methods (cont'd)
... or by amount of drift...
Table: Accuracy as measured in terms of KLD on the 5148 test sets of RCV1-v2 grouped into quartiles homogeneous by distribution drift
RCV1-v2 VLD LD HD VHD All
SVM(KLD) 1.17E E E E E-03
PACC 1.92E E E E E-03
ACC 1.70E E E E E-03
MAX 2.20E E E E E-03
CC 2.43E E E E E-03
X 3.89E E E E E-03
PCC 8.92E E E E E-03
MM(PP) 1.26E E E E E-02
MS 1.37E E E E E-02
T E E E E E-02
MM(KS) 1.41E E E E E

15 Evaluating quantification methods (cont'd)
... or along the temporal dimension.

16 Sentiment quantification

17 Sentiment analysis
Sentiment quantification is a part of sentiment analysis (SA), a set of tasks concerned with the analysis of texts according to the sentiments / opinions / emotions / judgments expressed in them.
SA is the "Holy Grail" of market research, opinion research, and online reputation management. It is mostly concerned with analysing user-generated content in online media, such as product reviews or (micro-)blog posts.

18 How Difficult is Sentiment Analysis?
Sentiment analysis is inherently difficult, because in order to express opinions / emotions / etc. we often use a wide variety of sophisticated expressive means (e.g., metaphor, irony, sarcasm, allegation, understatement, etc.):
"At that time, Clint Eastwood had only two facial expressions: with the hat and without it." (from an interview with Sergio Leone)
"She runs the gamut of emotions from A to B." (on Katharine Hepburn in The Lake, 1934)
"If you are reading this because it is your darling fragrance, please wear it at home exclusively, and tape the windows shut." (from a 2008 review of parfum Amarige, Givenchy)
Sentiment analysis has been characterised as an "NLP-complete" problem.

19 Sentiment quantification
An interesting instance of quantification is sentiment quantification, i.e., supervised prevalence estimation for sentiment-related classes [3].
Sentiment quantification (and classification) may be binary (Positive, Negative), ternary (Positive, Negative, Neutral), or ordinal (e.g., Excellent, Good, Average, Poor, Disastrous). Each such case has its own learning algorithms and evaluation measures.
[3] A. Esuli and F. Sebastiani. Sentiment Quantification. IEEE Intelligent Systems, 25(4):72-75.

20 Sentiment quantification (cont'd)
It is often the case that we perform sentiment classification with quantification in mind.
Example 1 (CRM): "How satisfied are you with our mobile phone services?" Asked by: telecom company. Class of interest: MayDefectToCompetition. Goal: classification (at the individual level).
Example 2 (MR): "How do you like the recent ad for product X?" Asked by: MR agency. Class of interest: LovedTheCampaign. Goal: quantification (at the aggregate level).

21 Sentiment quantification (cont'd)
A typical medium for which sentiment classification is performed with quantification in mind is Twitter.
The 2016 and 2017 editions of the SemEval task "Sentiment Analysis in Twitter" each include two subtasks (ternary and ordinal) on quantification.
Experiments show that methods specifically aimed at quantification estimate class prevalences more accurately than standard CC in both the ternary [4] and ordinal [5] cases.
[4] W. Gao and F. Sebastiani. From Classification to Quantification in Tweet Sentiment Analysis. Social Network Analysis and Mining, 6(19).
[5] G. Da San Martino, W. Gao, and F. Sebastiani. Ordinal Text Quantification. Proceedings of SIGIR.

22 Conclusion
Quantification: a new and still relatively unexplored task, with many research problems still open.
There is growing awareness that (sentiment) quantification is going to be more and more important; given the advent of big data, application contexts will spring up in which we will simply be happy with analysing data at the aggregate (rather than at the individual) level.

23 Questions?

24 Thank you! For any question, me at


More information

Modes of Measurement. Outline. Modes of Measurement. PSY 395 Oswald

Modes of Measurement. Outline. Modes of Measurement. PSY 395 Oswald Modes of Measurement PSY 395 Oswald Outline Modes of Measurement Direct questioning Paper-and-pencil Face-to-face Telephone Interviews Newer Modes Internet Experience Sampling Implicit measures Third-party

More information

Funnelling Used to describe a process of narrowing down of focus within a literature review. So, the writer begins with a broad discussion providing b

Funnelling Used to describe a process of narrowing down of focus within a literature review. So, the writer begins with a broad discussion providing b Accidental sampling A lesser-used term for convenience sampling. Action research An approach that challenges the traditional conception of the researcher as separate from the real world. It is associated

More information

Social Network Data Analysis for User Stress Discovery and Recovery

Social Network Data Analysis for User Stress Discovery and Recovery ISSN:2348-2079 Volume-6 Issue-2 International Journal of Intellectual Advancements and Research in Engineering Computations Social Network Data Analysis for User Stress Discovery and Recovery 1 R. Ragavi,

More information

Survey Research. We can learn a lot simply by asking people what we want to know... THE PREVALENCE OF SURVEYS IN COMMUNICATION RESEARCH

Survey Research. We can learn a lot simply by asking people what we want to know... THE PREVALENCE OF SURVEYS IN COMMUNICATION RESEARCH Survey Research From surveys we can learn how large groups of people think and act. To trust generalizations made on the basis of surveys, however, the sample must be representative, the response rate

More information

Observation Studies, Sampling Designs and Bias

Observation Studies, Sampling Designs and Bias Observation Studies, Sampling Designs and Bias Study / memorize this Observation Study: is a study wherein the researcher passively observes individuals or objects and measures / records some characteristic

More information

Methodological skills

Methodological skills Methodological skills rma linguistics, week 3 Tamás Biró ACLC University of Amsterdam t.s.biro@uva.nl Tamás Biró, UvA 1 Topics today Parameter of the population. Statistic of the sample. Re: descriptive

More information

The British Psychological Society. Promoting excellence in psychology. Belonging to DHP.

The British Psychological Society. Promoting excellence in psychology. Belonging to DHP. The British Psychological Society Promoting excellence in psychology Belonging to DHP www.bps.org.uk/dhp Member Networks further the development of psychology in specialist areas through the exchange of

More information

A MODIFIED FREQUENCY BASED TERM WEIGHTING APPROACH FOR INFORMATION RETRIEVAL

A MODIFIED FREQUENCY BASED TERM WEIGHTING APPROACH FOR INFORMATION RETRIEVAL Int. J. Chem. Sci.: 14(1), 2016, 449-457 ISSN 0972-768X www.sadgurupublications.com A MODIFIED FREQUENCY BASED TERM WEIGHTING APPROACH FOR INFORMATION RETRIEVAL M. SANTHANAKUMAR a* and C. CHRISTOPHER COLUMBUS

More information

Taking advantage of Twitter data to investigate sentiments towards environmental issues during the 2016 U.S. presidential election.

Taking advantage of Twitter data to investigate sentiments towards environmental issues during the 2016 U.S. presidential election. Taking advantage of Twitter data to investigate sentiments towards environmental issues during the 2016 U.S. presidential election. Big Data in Psychology Conference Trier, Germany 08.06.2018 Dr. Christian

More information

Using Social Media to Understand Cyber Attack Behavior

Using Social Media to Understand Cyber Attack Behavior Using Social Media to Understand Cyber Attack Behavior Amy Sliva 1(&), Kai Shu 2, and Huan Liu 2 1 Charles River Analytics, 625 Mount Auburn Street, Cambridge MA 02138, USA asliva@cra.com 2 School of Computing,

More information

Research Prospectus. Your major writing assignment for the quarter is to prepare a twelve-page research prospectus.

Research Prospectus. Your major writing assignment for the quarter is to prepare a twelve-page research prospectus. Department of Political Science UNIVERSITY OF CALIFORNIA, SAN DIEGO Philip G. Roeder Research Prospectus Your major writing assignment for the quarter is to prepare a twelve-page research prospectus. A

More information

TTI Personal Talent Skills Inventory Coaching Report

TTI Personal Talent Skills Inventory Coaching Report TTI Personal Talent Skills Inventory Coaching Report "He who knows others is learned. He who knows himself is wise." Lao Tse Mason Roberts District Manager YMCA 8-1-2008 Copyright 2003-2008. Performance

More information

Chapter 1 Introduction to I/O Psychology

Chapter 1 Introduction to I/O Psychology Chapter 1 Introduction to I/O Psychology 1. I/O Psychology is a branch of psychology that in the workplace. a. treats psychological disorders b. applies the principles of psychology c. provides therapy

More information

Research on Social Psychology Based on Network Big Data

Research on Social Psychology Based on Network Big Data 2017 2nd International Conference on Mechatronics and Information Technology (ICMIT 2017) Research on Social Psychology Based on Network Big Data Fuhong Li Department of psychology, Weifang Medical University,

More information

Sound is the. spice of life

Sound is the. spice of life Sound is the spice of life Let sound spice up your life Safran sharpens your hearing When you think about it, sound is like the spice of life. It adds touches of flavour and colour, enhancing the mood

More information

ADMS Sampling Technique and Survey Studies

ADMS Sampling Technique and Survey Studies Principles of Measurement Measurement As a way of understanding, evaluating, and differentiating characteristics Provides a mechanism to achieve precision in this understanding, the extent or quality As

More information

Unconscious Bias: From Awareness to Action!

Unconscious Bias: From Awareness to Action! Unconscious Bias: From Awareness to Action! Did you know that we all have unconscious bias, and that we each have a responsibility to prevent its unintended consequences? In this course, we will explore

More information

Observation and Assessment. Narratives

Observation and Assessment. Narratives Observation and Assessment Session #4 Thursday March 02 rd, 2017 Narratives To understand a child we have to watch him at play, study him in his different moods; we cannot project upon him our own prejudices,

More information

All Possible Regressions Using IBM SPSS: A Practitioner s Guide to Automatic Linear Modeling

All Possible Regressions Using IBM SPSS: A Practitioner s Guide to Automatic Linear Modeling Georgia Southern University Digital Commons@Georgia Southern Georgia Educational Research Association Conference Oct 7th, 1:45 PM - 3:00 PM All Possible Regressions Using IBM SPSS: A Practitioner s Guide

More information

PDRF About Propensity Weighting emma in Australia Adam Hodgson & Andrey Ponomarev Ipsos Connect Australia

PDRF About Propensity Weighting emma in Australia Adam Hodgson & Andrey Ponomarev Ipsos Connect Australia 1. Introduction It is not news for the research industry that over time, we have to face lower response rates from consumer surveys (Cook, 2000, Holbrook, 2008). It is not infrequent these days, especially

More information

Progress in Risk Science and Causality

Progress in Risk Science and Causality Progress in Risk Science and Causality Tony Cox, tcoxdenver@aol.com AAPCA March 27, 2017 1 Vision for causal analytics Represent understanding of how the world works by an explicit causal model. Learn,

More information

GfK Verein. Detecting Emotions from Voice

GfK Verein. Detecting Emotions from Voice GfK Verein Detecting Emotions from Voice Respondents willingness to complete questionnaires declines But it doesn t necessarily mean that consumers have nothing to say about products or brands: GfK Verein

More information

What is the impact of mode effect on non-response survey usability?

What is the impact of mode effect on non-response survey usability? What is the impact of mode effect on non-response survey usability? Caroline Vandenplas*, Dominique* Joye,Michèle Ernst Stähli** et AlexandrePollien ** * University of Lausanne, Switzerland ** FORS Swiss

More information

Sound Off DR. GOOGLE S ROLE IN PRE-DIAGNOSIS THROUGH TREATMENT. Ipsos SMX. June 2014

Sound Off DR. GOOGLE S ROLE IN PRE-DIAGNOSIS THROUGH TREATMENT. Ipsos SMX. June 2014 Sound Off DR. GOOGLE S ROLE IN PRE-DIAGNOSIS THROUGH TREATMENT June 2014 Ipsos SMX : Sound bits (of advice) and bites (of research) from Ipsos SMX Ipsos social media research division, dedicated to providing

More information

Chris A. Jones, D.Phil., M.Sc. Assistant Professor of Surgery and Economics Director, Global Health Economics Unit Vermont Center for Clinical and

Chris A. Jones, D.Phil., M.Sc. Assistant Professor of Surgery and Economics Director, Global Health Economics Unit Vermont Center for Clinical and Chris A. Jones, D.Phil., M.Sc. Assistant Professor of Surgery and Economics Director, Global Health Economics Unit Vermont Center for Clinical and Translational Science 1. Early work 2. Data-driven personalized

More information

Experimental Validity

Experimental Validity Experimental Validity 2/16/2003 P365 Experimental Validity 1 What We Will Cover in This Section Overview Internal Validity. External Validity. Key Learning Points. 2/16/2003 P365 Experimental Validity

More information

Understanding Science Conceptual Framework

Understanding Science Conceptual Framework 1 Understanding Science Conceptual Framework This list of conceptual understandings regarding the nature and process of science are aligned across grade levels to help instructors identify age-appropriate

More information

From Codes to Conclusions: Strategies for Analyzing Qualitative Data

From Codes to Conclusions: Strategies for Analyzing Qualitative Data From Codes to Conclusions: Strategies for Analyzing Qualitative Data Lindsay Bayham Department of Sociology, UC Berkeley October 18 th, 2017? Lindsay Bayham 1 Existing theory / knowledge The Research Process

More information

Americans Views on Moonshot Initiative and Cancer Research

Americans Views on Moonshot Initiative and Cancer Research Americans Views on Moonshot Initiative and Cancer Research January 2016 National public opinion survey commissioned by Research!America National Survey Methodology This nationwide online survey was conducted

More information

Accessible Social Media. Danny Housley IDEAS 2018

Accessible Social Media. Danny Housley IDEAS 2018 Accessible Social Media Danny Housley IDEAS 2018 AMAC Accessibility AMAC creates practical solutions that work, with a focus on utility, ease of use, and high quality. Accessibility Consulting focuses

More information

Predictive Analytics and machine learning in clinical decision systems: simplified medical management decision making for health practitioners

Predictive Analytics and machine learning in clinical decision systems: simplified medical management decision making for health practitioners Predictive Analytics and machine learning in clinical decision systems: simplified medical management decision making for health practitioners Mohamed Yassine, MD Vice President, Head of US Medical Affairs

More information

Correlation Neglect in Belief Formation

Correlation Neglect in Belief Formation Correlation Neglect in Belief Formation Benjamin Enke Florian Zimmermann Bonn Graduate School of Economics University of Zurich NYU Bounded Rationality in Choice Conference May 31, 2015 Benjamin Enke (Bonn)

More information

A Practical Introduction to Content Analysis. Catherine Corrigall-Brown Department of Sociology

A Practical Introduction to Content Analysis. Catherine Corrigall-Brown Department of Sociology A Practical Introduction to Content Analysis Catherine Corrigall-Brown Department of Sociology corrigall.brown@uwo.ca Data Most data in social sciences comes from: Opinion surveys, Censuses, Interviews

More information

The Heart Wants What It Wants: Effects of Desirability and Body Part Salience on Distance Perceptions

The Heart Wants What It Wants: Effects of Desirability and Body Part Salience on Distance Perceptions Student Publications Student Scholarship Spring 2018 The Heart Wants What It Wants: Effects of Desirability and Body Part Salience on Distance Perceptions Celeste M. Campbell Gettysburg College Follow

More information

Detecting Cognitive States Using Machine Learning

Detecting Cognitive States Using Machine Learning Detecting Cognitive States Using Machine Learning Xuerui Wang & Tom Mitchell Center for Automated Learning and Discovery School of Computer Science Carnegie Mellon University xuerui,tom.mitchell @cs.cmu.edu

More information

A Pragmatic Approach to Implementation of Emotional Intelligence in Machines

A Pragmatic Approach to Implementation of Emotional Intelligence in Machines A Pragmatic Approach to Implementation of Emotional Intelligence in Machines Michal Ptaszynski Rafal Rzepka Kenji Araki Language Media Laboratory Graduate School of Information Science and Technology Hokkaido

More information

Political Science 15, Winter 2014 Final Review

Political Science 15, Winter 2014 Final Review Political Science 15, Winter 2014 Final Review The major topics covered in class are listed below. You should also take a look at the readings listed on the class website. Studying Politics Scientifically

More information