Correcting Popularity Bias by Enhancing Recommendation Neutrality

Similar documents
Future directions of Fairness-aware Data Mining

Recommendation Independence

Considerations on Fairness-aware Data Mining

Fairness-aware Learning through Regularization Approach

The Long Tail of Recommender Systems and How to Leverage It

What Case Study means? Case Studies. Case Study in SE. Key Characteristics. Flexibility of Case Studies. Sources of Evidence

HOW TO IDENTIFY A RESEARCH QUESTION? How to Extract a Question from a Topic that Interests You?

Beyond Parity: Fairness Objectives for Collaborative Filtering

Towards More Confident Recommendations: Improving Recommender Systems Using Filtering Approach Based on Rating Variance

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Balanced Neighborhoods for Fairness-aware Collaborative Recommendation

Overcoming Accuracy-Diversity Tradeoff in Recommender Systems: A Variance-Based Approach

POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY

A Social Curiosity Inspired Recommendation Model to Improve Precision, Coverage and Diversity

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

1. The Role of Sample Survey Design

Analysis of Human-Human and Human-Computer Agent Interactions from the Viewpoint of Design of and Attribution to a Partner

Survey Research. We can learn a lot simply by asking people what we want to know... THE PREVALENCE OF SURVEYS IN COMMUNICATION RESEARCH

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Studying and Modeling the Connection between People s Preferences and Content Sharing

Quality Management Guideline for Suppliers. Appendix. Instructions for the Preparation of 8D-Reports

EXAMINING THE RELATIONSHIP BETWEEN ORGANIZATIONAL JUSTICE AND EFFECTIVENESS OF STRATEGY IMPLEMENTATION AT FOOD INDUSTRIES IN ARDABIL PROVINCE

Lecture II: Difference in Difference. Causality is difficult to Show from cross

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

Environmental, Health and Safety

Selecting a research method

Lose It! Weight Loss App Competative Analysis Report

Systematic Reviews and Meta- Analysis in Kidney Transplantation

Consumer Assessment of Wrigley s Alpine Gum

PLANNING THE RESEARCH PROJECT

THIS CHAPTER COVERS: The importance of sampling. Populations, sampling frames, and samples. Qualities of a good sample.

On the Combination of Collaborative and Item-based Filtering

De-Biasing User Preference Ratings in Recommender Systems

GATE CAT Diagnostic Test Accuracy Studies

Framework for Comparative Research on Relational Information Displays

Exploiting Implicit Item Relationships for Recommender Systems

Chapter 3 Tools for Practical Theorizing: Theoretical Maps and Ecosystem Maps

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

FDAP/ADFAM Drug & Alcohol Family Worker Professional Certification Workplace Assessment

Remarks on Bayesian Control Charts

A Brief Introduction to Bayesian Statistics

U.S. News Best Lawyers

Chapter 3. Producing Data

Recent trends in health care legislation have led to a rise in

More about inferential control models

Measuring Focused Attention Using Fixation Inner-Density

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data

Minimizing Uncertainty in Property Casualty Loss Reserve Estimates Chris G. Gross, ACAS, MAAA

Examining the Psychometric Properties of The McQuaig Occupational Test

Distributions and Samples. Clicker Question. Review

On the usefulness of the CEFR in the investigation of test versions content equivalence HULEŠOVÁ, MARTINA

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

CHAPTER 4 THE QUESTIONNAIRE DESIGN /SOLUTION DESIGN. This chapter contains explanations that become a basic knowledge to create a good

BIOSTATISTICAL METHODS

Sample Report. Sample Report Report. Fa c i l i tat or s (05/13) 180

Measurement and meaningfulness in Decision Modeling

Chapter IR:VIII. VIII. Evaluation. Laboratory Experiments Logging Effectiveness Measures Efficiency Measures Training and Testing

Sound Preference Development and Correlation to Service Incidence Rate

1 of 16 24/05/ :06

DEVELOPING THE RESEARCH FRAMEWORK Dr. Noly M. Mascariñas

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings.

Final Report on Females in the Super Smash Brother's Gaming Community

Chapter 4 DESIGN OF EXPERIMENTS

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Item Analysis Explanation

Foundation Competencies CHILD WELFARE EPAS Core

Module 28 - Estimating a Population Mean (1 of 3)

UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4

Technical Specifications

Unconscious Gender Bias in Academia: from PhD Students to Professors

Drugs, Alcohol and Substance Misuse Policy

Causal Research Design- Experimentation

A Field Experiment on Eyewitness Report

A proposal for collaboration between the Psychometrics Committee and the Association of Test Publishers of South Africa

Trial Designs. Professor Peter Cameron

The Perils of Empirical Work on Institutions

Readings: Textbook readings: OpenStax - Chapters 1 4 Online readings: Appendix D, E & F Online readings: Plous - Chapters 1, 5, 6, 13

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Cerner COMPASS ICD-10 Transition Guide

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Psy2005: Applied Research Methods & Ethics in Psychology. Week 14: An Introduction to Qualitative Research

Population-adjusted treatment comparisons Overview and recommendations from the NICE Decision Support Unit

Chapter-2 RESEARCH DESIGN

Ursuline College Accelerated Program

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Chapter 1: Exploring Data

Systematic Mapping Studies

LOVEMEAD GROUP PRACTICE PATIENT PARTICIPATION GROUP YEAR END REPORT 2013/14. Introduction

A Study determining the Consumption Pattern for Milk. Priti Bhanushali, Payal Bhanushali and Prof.Donna Sebastian

Recent advances in non-experimental comparison group designs

An Efficient Service Rating Forecasting by Exploring Social Mobile User s Geographical Locations

Chapter 25. Paired Samples and Blocks. Copyright 2010 Pearson Education, Inc.

Social Research (Complete) Agha Zohaib Khan

Lecture 2. Key Concepts in Clinical Research

Collaborative Filtering with Multi-component Rating for Recommender Systems

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Transcription:

Correcting Popularity Bias by Enhancing Recommendation Neutrality Toshihiro Kamishima *, Shotaro Akaho *, Hideki Asoh *, and Jun Sakuma ** *National Institute of Advanced Industrial Science and Technology (AIST), Japan ** University of Tsukuba, Japan The 8th ACM Conference on Recommender Systems, Poster @ Silicon Valley (Foster City), U.S.A., Oct. 8, 2014 START 1

Outline Overview Recommendation neutrality viewpoint feature, recommendation neutrality Experiments popularity bias, histogram of predicted ratings, accuracy vs neutrality Applications excluding information unwanted by a user, fair treatment of content providers, adherence of laws or regulations Information-neutral Recommender System formalization, neutrality term Recommendation neutrality vs recommendation diversity Recommendation diversity, differences between neutrality and diversity Conclusion 2

Overview Providing neutral information is important in recommendation excluding information unwanted by a user fair treatment of content suppliers or item providers adherence to laws and regulations in recommendation Information-neutral Recommender System The absolutely neutral recommendation is intrinsically infeasible, because recommendation is always biased in a sense that it is arranged for a specific user This system makes recommendation so as to enhance neutrality with respect to a viewpoint feature 3

Viewpoint Feature As in a case of standard recommendation, we use random variables X: a user, Y: an item, and R: a rating We adopt an additional variable for recommendation neutrality V : viewpoint feature It is specified by a user depending on his/her purpose Recommendation results are neutral with respect to this viewpoint Its value is determined depending on a user and an item Ex. viewpoint = movie s popularity / user s gender In this presentation, a viewpoint feature is restricted to a binary type 4

Recommendation Neutrality [Kamishima 12, Kamishima 13] Recommendation Neutrality Recommendation results are neutral if no information about a viewpoint feature influences the results The status of the viewpoint feature is explicitly excluded from the inference of the recommendation results the statistical independence between a result, R, and a viewpoint feature, V Pr[R V ]=Pr[R] R? V Ratings are predicted under this constraint of recommendation neutrality 5

Experimental Results 6

Popularity Bias [Celma 08] Popularity Bias the tendency for popular items to be recommended more frequently Flixster data The degree popularity of an item is measured by the number of users who rated the item [Jamali+ 10] short-head (top 1%) share in ratings: 47.2% mean rating: 3.71 long-tail (bottom 99%) share in ratings: 52.8% mean rating: 3.53 Short-head items are frequently and highly rated By specifying this popularity as a viewpoint feature, enhancing recommendation neutrality corrects this unwanted bias 7

Histograms of Predicted Ratings standard recommender two distributions are largely diverged neutrality enhanced distributions become close by enhancing neutrality dislike like dislike like each bin of histograms of short-head and long-tail data are arranged Unwanted information about popularity was successfully excluded by enhancing recommendation neutrality 8

Accuracy vs Neutrality accuracy (MAE) 0.01 neutrality (NMI) 0.70 more accurate 0.65 more neutral 0.005 0.002 0.60 0.01 0.1 1 10 100 0.001 0.01 0.1 1 10 100 neutrality parameter η : the lager value enhances the neutrality more As the increase of a neutrality parameter η, accuracy was slightly worsened, but neutrality was successfully improved Neutrality was enhanced, and thus a popularity bias was corrected, without sacrificing recommendation accuracy 9

Applications of Recommendation Neutrality 10

Application Excluding Unwanted Information [TED Talk by Eli Pariser, http://www.filterbubble.com/] Information unwanted by a user is excluded from recommendation Filter Bubble: To fit for Pariser s preference, conservative people are eliminated from his friend recommendation list in FaceBook viewpoint = a political conviction of a friend candidate Information about a candidate is conservative or progressive does not influence whether he/she is included in a friend list or not 11

Application Fair Treatment of Content Providers System managers should fairly treat their content providers Ranking in a list retrieved by search engines [Bloomberg] The US FTC has been investigating Google to determine whether the search engine ranks its own services higher than those of competitors Content providers are managers customers For marketplace sites, their tenants are customers, and these tenants must be treated fairly when recommending the tenants products viewpoint = a content provider of a candidate item Information about who provides a candidate item is ignored, and providers are treated fairly 12

Application Adherence to Laws and Regulations Recommendation services must be managed while adhering to laws and regulations suspicious placement keyword-matching advertisement [Sweeney 13] Advertisements indicating arrest records were more frequently displayed for names that are more popular among individuals of African descent than those of European descent Socially discriminative treatments must be avoided viewpoint = users socially sensitive demographic information Legally or socially sensitive information can be excluded from the inference process of recommendation 13

Information-neutral Recommender System 14

Information-neutral PMF Model information-neutral version of a PMF model adjust ratings according to the state of a viewpoint incorporate dependency on a viewpoint variable enhance the neutrality of a score from a viewpoint add a neutrality function as a constraint term adjust ratings according to the state of a viewpoint viewpoint feature ˆr(x, y, v) =µ (v) + b (v) x + c (v) y + p (v) x q (v) > y Multiple models are built separately, and each of these models corresponds to the each value of a viewpoint feature When predicting ratings, a model is selected according to the value of viewpoint feature 15

Neutrality Term and Objective Function enhance the neutrality of a rating from a viewpoint feature neutrality term, neutral(r, V ) : quantify the degree of neutrality It depends on both ratings and view point features The larger value of the neutrality term indicates that the higher level of the neutrality Objective Function of an Information-neutral PMF Model neutrality parameter to control the balance between the neutrality and accuracy regularization parameter P D (r i ˆr(x i,y i,v i )) 2 neutral(r, V )+ k k 2 2 squared loss function neutrality term L2 regularizer Parameters are learned by minimizing this objective function 16

Neutrality Term Calders&Verwer s Score (CV Score) make two distributions of R given V = 0 and 1 similar k Pr[R V = 0] Pr[R V = 1]k mean match method (Mean D (0)[ˆr] Mean D (1)[ˆr]) 2 Matching means of predicted ratings for each data set where V=1 and V=0 it matches only the first moment, but it empirically works well analytically differentiable and efficient in optimization 17

Recommendation Neutrality vs Recommendation Diversity 18

Recommendation Diversity Recommendation Diversity Similar items are not recommended in a single list, to a single user, to all users, or in a temporally successive lists recommendation list [Ziegler+ 05, Zhang+ 08, Latha+ 09, Adomavicius+ 12] similar items excluded Diversity Items that are similar in a specified metric are excluded from recommendation results The mutual relations among results Neutrality Information about a viewpoint f e a t u re i s e x c l u d e d f ro m recommendation results The relations between results and viewpoints 19

Neutrality vs Diversity Diversity Depending on the definition of similarity measures Neutrality Depending on the specification of viewpoint Similarity A function of two items Viewpoint A function of a pair of an item and a user Because a viewpoint depends on a user, neutrality can be applicable for coping with users factor, such as, users gender or age, which cannot be straightforwardly dealt by using diversity 20

Neutrality vs Diversity standard recommendation short-head diversified recommendation short-head long-tail long-tail Because a set of recommendations are diversified by abandoning short-head items, predicted scores are still biased Prediction scores themselves are unbiased by enhancing neutrality 21

Conclusion Our Contributions We formulate the recommendation neutrality from a specified viewpoint feature We developed a recommendation technique that can enhance the recommendation neutrality We applied this technique to correct popularity bias, and the effectiveness of this is empirically shown Future Work Developing a neutrality term that can more precisely approximate distributions without losing its computational efficiency Program codes: http://www.kamishima.net/inrs/ Acknowledgements We would like to thank for providing a data set for Dr. Mohsen Jamali This work is supported by MEXT/JSPS KAKENHI Grant Number 16700157, 21500154, 24500194, and 25540094 22