Confluence: Conformity Influence in Large Social Networks

Similar documents
From Sentiment to Emotion Analysis in Social Networks

MoodCast: Emotion Prediction via Dynamic Continuous Factor Graph Model

Computational Models for Social Influence and Diffusion

UNDERSTANDING THE EMOTIONS BEHIND SOCIAL IMAGES: INFERRING WITH USER DEMOGRAPHICS

Evaluating Classifiers for Disease Gene Discovery

Part 3: Case Studies. McGlohon, Faloutsos ICWSM

Predicting Sleep Using Consumer Wearable Sensing Devices

Causal Knowledge Modeling for Traditional Chinese Medicine using OWL 2

How Long Will She Call Me? Distribution, Social Theory and Duration Prediction

Toward Collective Behavior Prediction via Social Dimension Extraction

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

CASINO: Towards Conformity-aware Social Influence Analysis in Online Social Networks

PGT: Measuring Mobility Relationship using Personal, Global and Temporal Factors

Heterogeneous Data Mining for Brain Disorder Identification. Bokai Cao 04/07/2015

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

A Biostatistics Applications Area in the Department of Mathematics for a PhD/MSPH Degree

Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning

Presented by: Eric Gavaletz

Case Studies of Signed Networks

An Improved Algorithm To Predict Recurrence Of Breast Cancer

MRI Image Processing Operations for Brain Tumor Detection

Deep Learning Models for Time Series Data Analysis with Applications to Health Care

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India

Rumor Detection on Twitter with Tree-structured Recursive Neural Networks

Stage-Specific Predictive Models for Cancer Survivability

Temporal Analysis of Online Social Graph by Home Location

On The Role Of Centrality In Information Diffusion In Social Networks

Generalized additive model for disease risk prediction

Supplementary Figure 1

Predicting Breast Cancer Survival Using Treatment and Patient Factors

Agent-Based Models. Maksudul Alam, Wei Wang

Predicting Heart Attack using Fuzzy C Means Clustering Algorithm

Panel: Machine Learning in Surgery and Cancer

The Long Tail of Recommender Systems and How to Leverage It

Event Classification and Relationship Labeling in Affiliation Networks

May All Your Wishes Come True: A Study of Wishes and How to Recognize Them

An Examination of Factors Affecting Incidence and Survival in Respiratory Cancers. Katie Frank Roberto Perez Mentor: Dr. Kate Cowles.

Identifying Parkinson s Patients: A Functional Gradient Boosting Approach

Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem

Modelling and Application of Logistic Regression and Artificial Neural Networks Models

MACHINE learning-as-a-service has seen an explosion

Forecasting Potential Diabetes Complications

Brain Tumor segmentation and classification using Fcm and support vector machine

Methods for Predicting Type 2 Diabetes

Sentiment Classification of Chinese Reviews in Different Domain: A Comparative Study

IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. 3 Issue 2, February

Dynamic Sensory Gating Mechanism. data. Conditional Reactive Hierarchical Memory

Development of novel algorithm by combining Wavelet based Enhanced Canny edge Detection and Adaptive Filtering Method for Human Emotion Recognition

Enhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation

Exploring Social Influence via Posterior Effect of Word-of-Mouth Recommendations

FUNNEL: Automatic Mining of Spatially Coevolving Epidemics

Introduction to Computational Neuroscience

Analyzing Spammers Social Networks for Fun and Profit

A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction

A Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

How Opinions are Received by Online Communities: A Case Study on Amazon.com Helpfulness Votes

DANIEL KARELL. Soc Stats Reading Group. Princeton University

Using AUC and Accuracy in Evaluating Learning Algorithms

Large-Scale Statistical Modelling via Machine Learning Classifiers

Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods

YOUR EMOTIONAL FINGERPRINT

Automatic Classification of Perceived Gender from Facial Images

Introduction to Sentiment Analysis

Social Effects in Blau Space:

Representing Association Classification Rules Mined from Health Data

Technical Specifications

BlurMe: Inferring and Obfuscating User Gender Based on Ratings

The Open Access Institutional Repository at Robert Gordon University

arxiv: v1 [stat.ml] 24 Aug 2017

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

CSE 255 Assignment 9

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

Reader s Emotion Prediction Based on Partitioned Latent Dirichlet Allocation Model

Enhance micro-blogging recommendations of posts with an homophily-based graph

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Introduction to Social Network Analysis for Dissemination and Implementation Research

Investigating the performance of a CAD x scheme for mammography in specific BIRADS categories

Detecting Stress Based on Social Interactions in Social Networks

הרשת היא המסר, תפקידם של קשרים חברתיים בהערכת אמינותם של תכנים מקוונים

arxiv: v1 [cs.si] 19 Jun 2012

Mammogram Analysis: Tumor Classification

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Machine Gaydar : Using Facebook Profiles to Predict Sexual Orientation

基于循环神经网络的序列推荐 吴书智能感知与计算研究中心模式识别国家重点实验室中国科学院自动化研究所. Spe 17, 中国科学院自动化研究所 Institute of Automation Chinese Academy of Sciences

An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

EECS 433 Statistical Pattern Recognition

J2.6 Imputation of missing data with nonlinear relationships

Intelligent Edge Detector Based on Multiple Edge Maps. M. Qasim, W.L. Woon, Z. Aung. Technical Report DNA # May 2012

GIANT: Geo-Informative Attributes for Location Recognition and Exploration

Hierarchical Convolutional Features for Visual Tracking

Performance Based Evaluation of Various Machine Learning Classification Techniques for Chronic Kidney Disease Diagnosis

Predicting Kidney Cancer Survival from Genomic Data

Multi Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 *

A Bayesian Network Model of Knowledge-Based Authentication

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

Social Network Analysis: When Social Relationship is the Dependent Variable. Anabel Quan Haase Faculty of Information and Media Studies Sociology

Transcription:

Confluence: Conformity Influence in Large Social Networks Jie Tang *, Sen Wu *, and Jimeng Sun + * Tsinghua University + IBM TJ Watson Research Center 1

Conformity Conformity is the act of matching attitudes, opinions, and behaviors to group norms. [1] Kelman identified three major types of conformity [2] Compliance is public conformity, while possibly keeping one's own original beliefs for yourself. Identification is conforming to someone who is liked and respected, such as a celebrity or a favorite uncle. Internalization is accepting the belief or behavior, if the source is credible. It is the deepest influence on people and it will affect them for a long time. [1] R.B. Cialdini, & N.J. Goldstein. Social influence: Compliance and conformity. Annual Review of Psych., 2004, 55, 591 621. [2] H.C. Kelman. Compliance, Identification, and Internalization: Three Processes of Attitude Change. Journal of Conflict 2 Resolution, 1958, 2 (1): 51 60.

Love Obama I love Obama I hate Obama, the worst president ever Obama is fantastic Obama is great! No Obama in 2012! He cannot be the next president! Positive Negative 3

Conformity Influence Analysis I love Obama Obama is fantastic 3. Group conformity A Obama is great! D 1. Peer conformity B Positive C Negative 2. Individual conformity 4

Related Work Conformity Conformity theory Compliance, identification, and internalization [Kelman 1958] A theory of conformity based on game theory [Bernheim 1994] Influence and conformity Conformity-aware influence analysis [Li-Bhowmick-Sun 2011] Applications Social influence in social advertising [Bakshy-el-al 2012] 5

Related Work social influence Input: coauthor network Social influence anlaysis Output: topic-based social influences θ i1 =.5 θ i2 =.5 Frank George 2 Topic distribution 2 4 Eve 2 3 Ada Topics: Topic 1: Data mining Topic 2: Database 1 David 1 Carol 3 Bob θ i1 θ i2 George Frank Eve Topic distribution Ada David Node factor function a z r z g(v 1,y 1,z) Edge factor function f (y i,y j, z) Carol Bob Output Topic 1: Data mining George Frank Eve Topic 2: Database George Frank Ada Ada Bob Eve David 6 Influence test and quantification Influence and correlation [Anagnostopoulos-et-al 2008] Distinguish influence and homophily [Aral-et-al 2009, La Fond-Nevill 2010] Topic-based influence measure [Tang-Sun-Wang-Yang 2009, Liu-et-al 2012] Learning influence probability [Goyal-Bonchi-Lakshmanan 2010] Influence diffusion model Linear threshold and cascaded model [Kempe-Kleinberg-Tardos 2003] Efficient algorithm [Chen-Wang-Yang 2009]...

Challenges How to formally define and differentiate different types of conformities? How to construct a computational model to learn the different conformity factors? How to validate the proposed model in real large networks? 7

Problem Formulation and Methodologies 8

Four Datasets Network #Nodes #Edges Behavior #Actions Weibo 1,776,950 308,489,739 Flickr 1,991,509 208,118,719 Gowalla 196,591 950,327 ArnetMiner 737,690 2,416,472 Tweet on popular topics Comment on a popular photo Check-in some location Publish in a specific domain 6,761,186 3,531,801 6,442,890 1,974,466 All the datasets are publicly available for research. 9

A concrete example in Gowalla Legend Alice Alice s friend Other users 1 1 1 1 If Alice s friends check in this location at time t Will Alice also check in nearby? 10

Notations Time t Node/user: v i User Group: c ij Time t-1, t-2 Attributes: x i - location, gender, age, etc. Action/Status: y i - e.g., Love Obama G =(V, E, C, X) A = {(a,v i,t)} a,i,t each (a, v i, t) represents user v i performed action a at time t 11

Conformity Definition Three levels of conformities Individual conformity Peer conformity Group conformity 12

Individual Conformity The individual conformity represents how easily user v s behavior conforms to her friends A specific action performed by user v at time t Exists a friend v who performed the same action at time t All actions by user v 13

Peer Conformity The peer conformity represents how likely the user v s behavior is influenced by one particular friend v A specific action performed by user v at time t User v follows v to perform the action a at time t All actions by user v 14

Group Conformity The group conformity represents the conformity of user v s behavior to groups that the user belongs to. τ-group action: an action performed by more than a percentage τ of all users in the group C k A specific τ-group action User v conforms to the group to perform the action a at time t All τ-group actions performed by users in the group C k 15

For an example Conformity in the Co-Author Network Individual Conformity Peer Conformity 0.025 0.003 0.02 0.0025 0.002 0.015 0.0015 0.01 0.001 0.0005 0.005 0 2000 2005 2010 0 KDD ICDM CIKM Peer Random KDD Group Conformity 0.0025 0.002 0.0015 0.001 0.0005 0 Clustering Influence Recommendation Topic Model 16 KDD ICDM CIKM

Now our problem becomes How to incorporate the different types of conformities into a unified model? Input: G=(V, E, C, X), A Output: F: f(g, A) ->Y (t+1) 17

Confluence A conformity-aware factor graph model Group 2: C 2 Confluence model Input Network v 1 Group 1: C 1 v 2 g(y 1, gcf (v 1, C 1 )) y y 4 2 y 7 y 3 y 5 y 1 y 6 g(y 1, y 3, pcf (v 1, v 3 )) Group conformity factor function v 3 y 1 =a g(v 1, icf (v 1 )) Peer conformity factor function Random variable y: Action v 4 v5 v 6 Group 3: C 3 v 7 v 2 v 4 v 7 v 3 v 1 v 5 v 6 Individual conformity factor function Users 18

Model Instantiation Individual conformity factor function Peer conformity factor function Group conformity factor function 19

Opinion leader [1] General Social Features Whether the user is an opinion leader or not Structural hole [2] Whether the user is a structural hole spanner Social ties [3] Whether a tie between two users is a strong or weak tie Social balance [4] People in a social network tend to form balanced (triad) structures (like my friend s friend is also my friend ). [1] X. Song, Y. Chi, K. Hino, and B. L. Tseng. Identifying opinion leaders in the blogosphere. In CIKM 06, pages 971 974, 2007. [2] T. Lou and J Tang. Mining Structural Hole Spanners Through Information Diffusion in Social Networks. In WWW'13. pp. 837-848. [3] M. Granovetter. The strength of weak ties. American Journal of Sociology, 78(6):1360 1380, 1973. [4] D. Easley and J. Kleinberg. Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University 20 Press, 2010.

Distributed Model Learning Unknown parameters to estimate (1) Master (2) Slave (3) Master 21

Distributed Learning Master Global update Slave Compute local gradient via random sampling Graph Partition by Metis Master-Slave Computing Inevitable loss of correlation factors! 22

23 Experiments

Data Set and Baselines 24 Network #Nodes #Edges Behavior #Actions Weibo 1,776,950 308,489,739 Post a tweet 6,761,186 Flickr 1,991,509 208,118,719 Add comment 3,531,801 Gowalla 196,591 950,327 Check-in 6,442,890 ArnetMiner 737,690 2,416,472 Publish paper 1,974,466 Baselines - Support Vector Machine (SVM) - Logistic Regression (LR) - Naive Bayes (NB) - Gaussian Radial Basis Function Neural Network (RBF) - Conditional Random Field (CRF) Evaluation metrics - Precision, Recall, F1, and Area Under Curve (AUC)

Prediction Accuracy 25 t-test, p<<0.01

Effect of Conformity Confluence base stands for the Confluence method without any social based features Confluence base +I stands for the Confluence base method plus only individual conformity features Confluence base +P stands for the Confluence base method plus only peer conformity features Confluence base +G stands for the Confluence base method plus only group conformity 26

Scalability performance Achieve 9 speedup with 16 cores 27

Conclusion Study a novel problem of conformity influence analysis in large social networks Formally define three conformity functions to capture the different levels of conformities Propose a Confluence model to model users actions and conformity Our experiments on four datasets verify the effectiveness and efficiency of the proposed model 28

Future work Connect the conformity phenomena with other social theories e.g., social balance, status, and structural hole Study the interplay between conformity and reactance Better model the conformity phenomena with other methodologies (e.g., causality) 29

Confluence: Conformity Influence in Large Social Networks Jie Tang *, Sen Wu *, and Jimeng Sun + * Tsinghua University + IBM TJ Watson Research Center Data and codes are available at: http://arnetminer.org/conformity/ 30

31 Qualitative Case Study

I love Obama Positive Negative 1. Peer Conformity 2. Individual Conformity 1 32

I love Obama Positive Negative Obama is great! 1. Peer conformity 2. Individual 2 Conformity 33

I love Obama Positive Negative Obama is great! 1. Peer conformity 2. Individual 3 conformity 34

I love Obama Positive Negative Obama is fantastic 3. Group conformity Obama is great! 1. Peer conformity 2. Individual 4 conformity 35