Multiple equilibria in social interaction models: estimating a model of adolescent smoking behavior.

Similar documents
NBER WORKING PAPER SERIES PEER EFFECTS AND MULTIPLE EQUILIBRIA IN THE RISKY BEHAVIOR OF FRIENDS. David Card Laura Giuliano

Multi-level approaches to understanding and preventing obesity: analytical challenges and new directions

HANDBOOK OF SOCIAL ECONOMICS

11/24/2017. Do not imply a cause-and-effect relationship

Social and Behavioral Sciences for Tobacco Use

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

Size Matters: the Structural Effect of Social Context

Technical Considerations: the past, present and future of simulation modeling of colorectal cancer

Exploring Network Effects and Tobacco Use with SIENA

Econometric Game 2012: infants birthweight?

Fruit and Vegetable Choices of African American Youths

A Bayesian Approach to Characterizing Heterogeneity of Rank-Dependent Expected Utility Models of Lottery Choices

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

Identifying Endogenous Peer Effects in the Spread of Obesity. Abstract

Tutorial #7A: Latent Class Growth Model (# seizures)

Addendum: Multiple Regression Analysis (DRAFT 8/2/07)

Russian Journal of Agricultural and Socio-Economic Sciences, 3(15)

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Generalized Estimating Equations for Depression Dose Regimes

Improving ecological inference using individual-level data

A Brief Introduction to Bayesian Statistics

NORTH SOUTH UNIVERSITY TUTORIAL 2

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Small-area estimation of mental illness prevalence for schools

NBER WORKING PAPER SERIES ASYMMETRIC SOCIAL INTERACTION IN ECONOMICS: CIGARETTE SMOKING AMONG YOUNG PEOPLE IN THE UNITED STATES,

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth

Advanced IPD meta-analysis methods for observational studies

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Inflation Persistence in the Euro Area

Big Data Challenges & Opportunities. L. Miriam Dickinson, PhD

Score Tests of Normality in Bivariate Probit Models

Macroeconometric Analysis. Chapter 1. Introduction

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Treatment Effects Using Inverse Probability Weighting and Contaminated Treatment Data

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of

Methods for treating bias in ISTAT mixed mode social surveys

Unit 1 Exploring and Understanding Data

Instrumental Variables Estimation: An Introduction

The Demand for Cigarettes in Tanzania and Implications for Tobacco Taxation Policy.

IMPACTS OF SOCIAL NETWORKS AND SPACE ON OBESITY. The Rights and Wrongs of Social Network Analysis

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

Experimental Methods. Policy Track

Cancer survivorship and labor market attachments: Evidence from MEPS data

Underuse, Overuse, Comparative Advantage and Expertise in Healthcare

Online Appendix A. A1 Ability

Peer Effects, Tobacco Control Policies, and Youth Smoking Behavior

Reinforcement Learning : Theory and Practice - Programming Assignment 1

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies

Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors

TREATMENT CHOICE WHEN TREATMENT RESPONSE IS PARTIALLY IDENTIFIED. Charles F. Manski Northwestern University

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health

Issues Surrounding the Normalization and Standardisation of Skin Conductance Responses (SCRs).

SCHOOL OF MATHEMATICS AND STATISTICS

Lecture II: Difference in Difference and Regression Discontinuity

THE GOOD, THE BAD, & THE UGLY: WHAT WE KNOW TODAY ABOUT LCA WITH DISTAL OUTCOMES. Bethany C. Bray, Ph.D.

Inference with Difference-in-Differences Revisited

Chapter 11 Regression with a Binary Dependent Variable

Causal Mediation Analysis with the CAUSALMED Procedure

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Rationed Fertility: Theory and Evidence

How well targeted are soda taxes?

How to analyze correlated and longitudinal data?

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

Bayesian and Frequentist Approaches

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

A re-randomisation design for clinical trials

Statistics and Probability

Journal of Health Economics

Introduction to Bayesian Analysis 1

THE EMPIRICAL CONTENT OF MODELS WITH SOCIAL INTERACTIONS

Susan Kaai, PhD University of Waterloo, Ontario, Canada NBATC Webinar Slides February 10, 2015

Estimating price elasticity for tobacco in Canada s Aboriginal communities

P E R S P E C T I V E S

Statistical Analysis of Complete Social Networks

Motivation Empirical models Data and methodology Results Discussion. University of York. University of York

Health of the City & Community Health Improvement Planning. Raynard Washington, PhD, MPH Chief Epidemiologist

Introduction to Program Evaluation

This PDF is a selection from a published volume from the National Bureau of Economic Research

Cointegration: the Engle and Granger approach

Regression Discontinuity Designs: An Approach to Causal Inference Using Observational Data

A Multilevel Approach to Model Weight Gain: Evidence from NLSY79 Panel

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Limited dependent variable regression models

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

Potential Consequences of E-Cigarette Use: Is Youth Health Going Up in Smoke? October 11, 2016

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Chapter 3: Describing Relationships

Non-parametric methods for linkage analysis

Comprehensive Tobacco Control Programs: Economic Review. Page 1 of 26. Study Info Intervention Characteristics Population Characteristics

Individual preference heterogeneity, targeting and welfare effects of soda taxes

Analyzing diastolic and systolic blood pressure individually or jointly?

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Classical Psychophysical Methods (cont.)

MEA DISCUSSION PAPERS

Bisakha Sen University of Central Florida. Abstract

Transcription:

Multiple equilibria in social interaction models: estimating a model of adolescent smoking behavior. Andrea Moro, Vanderbilt University (with Alberto Bisin and Giorgio Topa) Michigan State University, April 3, 213

Question Social interaction problems typically display multiple equilibria Problem often ignored in empirical applications Want to quantitatively assess the importance of multiplicity in a model of smoking with social interactions Introduction

Identification problems in social interaction models Manski s reflection problem Distinguish endogenous group effects from contextual effects individual effects Correlated unobservables Selection into group membership Multiplicity Introduction

Multiplicity is policy-relevant What equilibrium are we playing? 1% 45% Avg. smoking Introduction

Multiplicity is policy-relevant What equilibrium are we playing? The equilibrium is policy-relevant 1% 45% Avg. smoking Introduction

Multiplicity is policy-relevant What equilibrium are we playing? The equilibrium is policy-relevant Extrapolating results from a linear specification might be misleading 1% 45% Avg. smoking Introduction

Outline (of the paper) Present a general framework for models with multiple equilibria in economies with social interactions Introduce a consistent and computationally feasible two-step estimator Show Monte Carlo analysis of the estimator s properties Present an application to teenage smoking using Add Health data Introduction

Estimation of models with multiple equilibria Various approaches: Impose conditions for equilibrium uniqueness Characterize eq m structure and make assumptions about equilibrium selection Exploit information from data to learn about equilibrium Nakajima (27), Soetevent-Kooreman (27), Jovanovic (1989) - Dagsvik-Jovanovic (1994) - Moro (22), Fang (25) I/O: Bresnahan and Reiss (1991) - Tamer (23) - Bajari-Hong-Ryan (27) - Aguirregabiria-Mira (27), Bajari-Hong-Krainer-Nekipelov (26); Introduction

Brock and Durlauf s binary choice model Smoking is a binary choice y i { 1, 1} Model

Brock and Durlauf s binary choice model Smoking is a binary choice y i { 1, 1} School-wide interactions π = average smoking in school Model

Brock and Durlauf s binary choice model Smoking is a binary choice y i { 1, 1} School-wide interactions π = average smoking in school Agents maximize max V (y i, X i, Z, π, ε i ) = y i (cx i + Jπ) + ε i (y i ) y i { 1,1} Model

Brock and Durlauf s binary choice model Smoking is a binary choice y i { 1, 1} School-wide interactions π = average smoking in school Agents maximize max V (y i, X i, Z, π, ε i ) = y i (cx i + Jπ) + ε i (y i ) y i { 1,1} ε i depends on choice y i and follows an extreme-value distribution: Pr (ε i ( 1) ε i (1) z) = 1 1 + exp ( z) Model

Model solution From the First Order Conditions, Pr (y i = 1) = 1 1 + exp ( 2 (cx i + Jπ)) Assuming that #I is large enough that a LLN applies, we obtain the following characterization of equilibrium: π = tanh (cx i + Jπ). i I Model

Equilibrium mapping π = tanh (cx i + Jπ) i I The map moves down as c decreases and flattens as J decreases 1% 45% Avg. smoking Model

Estimation First step: obtain π n from fraction of smokers in each school n Estimation

Estimation First step: obtain π n from fraction of smokers in each school n Second step: use π in the FOC s. Log likelihood: log L n = i 1+yi 2 log 1 + exp 2 c n X in + J n π n + 1 y i 2 log 1 + exp 2 c n X in + J n π n Estimation

Estimation First step: obtain π n from fraction of smokers in each school n Second step: use π in the FOC s. Log likelihood: log L n = i 1+yi 2 log 1 + exp 2 c n X in + Jn Gπ n + Jn Lπ g(i) + 1 y i 2 log 1 + exp 2 c n X in + Jn Gπ n + Jn Lπ g(i) Can account for local interactions π g(i) Estimation

Selection into group membership prob((i, k) are friends) = W ik γ + u ik prob(i smokes k is a friend) = (model) Exclusion restrictions: 1. k s characteristics and difference in i and k s characteristics 2. k s friends average characteristics (Bramouille ) Selection model can be estimated with standard parametric or non-parametric methods Estimation

Identification Agents choose y i { 1, 1} to maximize y i (c + J π n )+shock Single school: intercept in c not identified Estimation

Identification Agents choose y i { 1, 1} to maximize y i (cx i + J π n )+shock Single school: intercept in c not identified Slopes in c identified from interactions J from variation in smoking behavior between people with different X (J affects all) Estimation

Identification Agents choose y i { 1, 1} to maximize y i (cx i + J π n )+shock Single school: intercept in c not identified Slopes in c identified from interactions J from variation in smoking behavior between people with different X (J affects all) With multiple schools: intercept in c is identified (constant affects all schools equally, J proportionally to the average smoking π n ) Estimation

Teen smoking and friendship data Data from the National Longitudinal Study of Adolescent Health ( Add Health ) Nationally representative sample of adolescents in grades 7-12 in the US during the 1994-95 school year Combines longitudinal survey data on respondents attributes with contextual data on the family, neighborhood, school, individual networks, and peer groups. Our current data consist of about 46,6 observations in 45 schools We define smoking as smoking at least once or twice a week schools ( 18% of students) Results: teen smoking behavior

Specifications Preference parameters c n J n 1. School-by-school estimation Results: teen smoking behavior

Specifications Preference parameters c n = α + α k Zn k J n = γ + γ k Zn k k k 1. School-by-school estimation 2. Multiple schools, parameters depend on school or neighborhood attributes Z Results: teen smoking behavior

Specifications Preference parameters c n = α + α n, with α n N(,σ α ) J n = γ + γ n, with γ n N(,σ γ ) 1. School-by-school estimation 2. Multiple schools, parameters depend on school or neighborhood attributes Z 3. Multiple schools with random coefficients in parameters Results: teen smoking behavior

Specifications Preference parameters c n = α + α k Zn k + α n, with α n N(,σ α ) k J n = γ + γ k Zn k + γ n, with γ n N(,σ γ ) k 1. School-by-school estimation 2. Multiple schools, parameters depend on school or neighborhood attributes Z 3. Multiple schools with random coefficients in parameters 4. Multiple schools with random parameters that depend on school+neighborhood attributes Results: teen smoking behavior

Specifications Preference parameters c n = α + α k Zn k + α n, with α n N(,σ α ) k J n = γ + γ k Zn k + γ n, with γ n N(,σ γ ) k 1. School-by-school estimation 2. Multiple schools, parameters depend on school or neighborhood attributes Z 3. Multiple schools with random coefficients in parameters 4. Multiple schools with random parameters that depend on school+neighborhood attributes We estimated versions with school-wide interactions only, as well as both school-wide and local interactions Results: teen smoking behavior

Results, separate schools Variable Mean Median Black -.74 -.67 Asian -.6 -.24 Hispanic -.2 -.17 Female.12.18 Age.11.8 Not belong to club.34.29 GPA -.33 -.32 Mom college -.2 -.16 Dad at home -.2 -.21 Local Inter..8.82 Constant -.73 -.85 One additional smoking friend increases probability of smoking by 12% Results: teen smoking behavior

4 2 1.2 1.8.6.4.2 Black 4 2 1.5.5 1 1.5 Hispanic 1 5.5.1.15.2.25.3 Age 1 5.6.5.4.3.2.1 GPA 1 5.6.4.2.2 Dad at home 15 1.5 1.5.5 1 Asian 1 5.6.4.2.2.4 Female 15 1 5.5 1 1.5 No Club 1 5 1.5.5 Mom college 6 Distribution of parameter estimates Results: teen smoking behavior 1 5.4.6.8 1 1.2 1.4 J n (local) 4 2 4 3 2 1 1 2 Constant

Multiple schools, parameters fct of Z Female Age No club Intercept 1.494 (.13) -.37 (.67) -.5867 (.2113) % Urban.34 (.61).333 (.8) -.122 (.726) % Poverty -1.6764 (.3644).1857 (.4).4575 (.4327) % College over 25.6127 (.3485).37 (.238).2228 (.4275) Female Labor Force P.R. -1.4867 (.2978) -.9 (.133).9968 (.5351) Male Labor Force P.R. -.8336 (.1841).16 (.76).2586 (.31) Tobacco training -.888 (.379).223 (.58).122 (.482) Tobacco student policy.922 (.88) -.179 (.68).256 (.14) Tobacco staff policy -.1237 (.882).388 (.61) -.189 (.956) Positive relationship between age and Results: teen smoking behavior

Multiple schools, parameters fct of Z Female Age No club Intercept 1.494 (.13) -.37 (.67) -.5867 (.2113) % Urban.34 (.61).333 (.8) -.122 (.726) % Poverty -1.6764 (.3644).1857 (.4).4575 (.4327) % College over 25.6127 (.3485).37 (.238).2228 (.4275) Female Labor Force P.R. -1.4867 (.2978) -.9 (.133).9968 (.5351) Male Labor Force P.R. -.8336 (.1841).16 (.76).2586 (.31) Tobacco training -.888 (.379).223 (.58).122 (.482) Tobacco student policy.922 (.88) -.179 (.68).256 (.14) Tobacco staff policy -.1237 (.882).388 (.61) -.189 (.956) Female smoking related to socio-economic status Results: teen smoking behavior

Multiple schools, parameters fct of Z Female Age No club Intercept 1.494 (.13) -.37 (.67) -.5867 (.2113) % Urban.34 (.61).333 (.8) -.122 (.726) % Poverty -1.6764 (.3644).1857 (.4).4575 (.4327) % College over 25.6127 (.3485).37 (.238).2228 (.4275) Female Labor Force P.R. -1.4867 (.2978) -.9 (.133).9968 (.5351) Male Labor Force P.R. -.8336 (.1841).16 (.76).2586 (.31) Tobacco training -.888 (.379).223 (.58).122 (.482) Tobacco student policy.922 (.88) -.179 (.68).256 (.14) Tobacco staff policy -.1237 (.882).388 (.61) -.189 (.956) Positive relationship between age and smoking stronger in poor areas Results: teen smoking behavior

Multiple schools, parameters fct of Z (cont.) GPA Mom college Dad Home Intercept.1427 (.325) -.3879 (.172) -.2364 (.966) % Urban.1 (.37).1338 (.73) -.185 (.77) % Poverty.711 (.174) -.321 (.399).3613 (.3915) % College over 25 -.967 (.1322) -.437 (.3739).2692 (.3856) Female Labor Force P.R. -.47 (.734) -.49 (.4452) -.6471 (.2632) Male Labor Force P.R. -.183 (.449).9316 (.2464).412 (.1495) Tobacco training -.128 (.245) -.264 (.457).121 (.485) Tobacco student policy -.375 (.386) -.75 (.114) -.1367 (.996) Tobacco staff policy.16 (.42) -.25 (.165).1388 (.923) Female smoking related to socio-economic status Results: teen smoking behavior

Multiple schools, parameters fct of Z (cont.) GPA Mom college Dad Home Intercept.1427 (.325) -.3879 (.172) -.2364 (.966) % Urban.1 (.37).1338 (.73) -.185 (.77) % Poverty.711 (.174) -.321 (.399).3613 (.3915) % College over 25 -.967 (.1322) -.437 (.3739).2692 (.3856) Female Labor Force P.R. -.47 (.734) -.49 (.4452) -.6471 (.2632) Male Labor Force P.R. -.183 (.449).9316 (.2464).412 (.1495) Tobacco training -.128 (.245) -.264 (.457).121 (.485) Tobacco student policy -.375 (.386) -.75 (.114) -.1367 (.996) Tobacco staff policy.16 (.42) -.25 (.165).1388 (.923) Dad at home -> Less smoking; more so when Fem LFPR high Results: teen smoking behavior

Multiple schools, parameters fct of Z (cont./2) Local Inter. Global Inter. Constant Intercept.3394 (.1373) 2.5743 (.1562).1882 (.199) % Urban -.148 (.517).959 (.1878) % Poverty -.2967 (.3193) 4.4323 (.7985) % College over 25 -.294 (.2953) 1.2494 (.69) Female Labor Force P.R..5911 (.3383) -6.843 (.3243) Male Labor Force P.R..58 (.216) -.8632 (.1961) Tobacco training -.43 (.397).6166 (.1487) Tobacco student policy.38 (.821) -.655 (.1554) Tobacco staff policy -.951 (.816) 1.344 (.156) School policies can have large impact on school-wide interaction Results: teen smoking behavior

The prevalence of multiplicity Specification: only global interaction, no school covariates, only random effects Counter-factuals

The prevalence of multiplicity Specification: only global interaction, no school covariates, only random effects Multiplicity in 4 out of 41 cases In 35 out or 4, intermediate equilibrium Counter-factuals

The prevalence of multiplicity Specification: only global interaction, no school covariates, only random effects Multiplicity in 4 out of 41 cases In 35 out or 4, intermediate equilibrium Example: School N J Percentage of smokers Data Equil. 1 Equil. 2 Equil. 3 41 156 2.67 5.9 5.9 17.2 99.8 56 193 6.5 18.7.2 18.4 1. 72 899.14 24.4 24.8 If all school were in the lowest equilibrium smoking would be 14.3 percentage points lower (from 6 to 32 percentage points) Counter-factuals

The effect of social interactions The mapping flattens as J increases Counter-factuals

Counterfactual: reduction in global interaction School ID 41 Interactions Percentage of smokers Data Equil 1 Equil 2 Equil 3 Baseline 5.9 5.9 17.2 99.8 J reduced 3% 9.1 11.8 99.8 J reduced 5% 99.8 J reduced 1% 99.8 Counter-factuals

Counterfactual: reduction in global interaction School ID 41 56 Interactions Percentage of smokers Data Equil 1 Equil 2 Equil 3 Baseline 5.9 5.9 17.2 99.8 J reduced 3% 9.1 11.8 99.8 J reduced 5% 99.8 J reduced 1% 99.8 Baseline 18.7.2 18.4 1. J reduced 3%.3 17. 1. J reduced 5%.4 16. 1. J reduced 1%.9 13. 1. Counter-factuals

What explains the variation of smoking? Most school in the intermediate equilibrium No correlation between interaction par. J n and smoking π n Counter-factuals

What explains the variation of smoking? Most school in the intermediate equilibrium No correlation between interaction par. J n and smoking π n Variation in c x (school heterogeneity), or student heterogeneity Standard deviation of smoking across schools Simulation Eq. 1 Eq. 2 Eq. 3 Original estimates 2.7 4.8.1 All schools same c, J.6 3.. All students like school #56 2.2 6..2 (school heterogeneity more important) Counter-factuals

Counterfactual with both local and global interactions Here we need to simulate the Markov process letting students update their smoking state based on configuration of smoking in the previous period Let the process run many periods until convergence Results do not change if alllow random agent to change state in each period Multiplicity: different starting percentage of smokers Various exercises using school #56 as baseline: changes in J G, J L, number of friends, utility cost of smoking. arrangement of students within school: perfect integration vs. segregation. Counter-factuals

Multiplicity and nonlinearities: local interactions Local interaction parameter 3J 2J 1.5J 1.2J 1.1J 1.5J 1.1J.99J J.95J.9J.8J.5J.2J.1J.5J.9.12.13.15.16.16.16.18.2.24.47.3 Percentage of smokers in low-level equil. Length of line represents the basin of attraction of low-level equilibrium.94 1.22 1.39 1.5 5 1 15 2 25 3 1 Initial percentage of smokers Counter-factuals

Global interactions Global interaction parameter 1.5G 2G 3G 1.2G 1.1G 1.5G 1.1G.99G G.95G.9G.8G.5G.2G.1G.5G.53 2.55.9.14.16.18.29.2.5 Only high-level equilibrium (1% smokers) Only high-level equilibrium (1% smokers) Only high-level equilibrium (99.99% smokers) Only high-level equilibrium (99.98% smokers) Only high-level equilibrium (99.96% smokers) 5 1 15 2 25 3 35 4 45 Initial percentage of smokers Counter-factuals

Number of friends Number of friends Baseline One fewer 2 fewer 3 fewer.7 4 fewer 1.99.16.29.48 5 1 15 2 25 3 Initial percentage of smokers Counter-factuals

What we learn about interactions Multiple equilibria are pervasive, nonlinearities are important Reducing strength of global or local interaction, or the number of friends makes low-level equilibria less likely Interactions lead to less smoking Counter-factuals

Direct utility from smoking Utility from smoking C i + 2med(C) C i + med(c) C i +.5med(C) C i +.2med(C) C i +.1med(C) C i +.5med(C) C i.5med(c) C i.1med(c) C i.2med(c) C i.5med(c) C i med(c) C i max(c) C i = Only high-level equilibrium (1% smokers).86.3.22.19.14.12.9.4.1 1 2 3 4 Initial percentage of smokers Counter-factuals

Artificial school School similar to #56. 8 students on a circle, with friends on her right Baseline: 4 friends (median in that school) Initially: homogenous students (median student: white 16yo female, 3.GPA...) Change demographics Change number of friends and patterns of friendships Counter-factuals

Integrated school: effect of friendship patterns L: least likely to smoke (based on individual characteristics M: most likely LLLLLLLM 1 friend LLLLLLM 4 friends LLLLLLM 8 friends LLLM 1 friend LLLM 4 friends LLLM 8 friends LMLM 1 friend LMLM 4 friends LMLM 8 friends.33.33.33.33.33.33.33.33.33 More friends: less smoking More integration of smokers: more smoking 5 1 15 2 25 3 Initial percentage of smokers Counter-factuals

Segregated school Fig 17 in the paper Two subgroups (least and most likely to smoke), local interactions only within groups A new equilibrium appears with the fraction of smokers equal to the proportion of high smokers The arrangement of students within the school and the composition of the students personal network matters Counter-factuals

Conclusions Nonlinearities matter, multiplicities matter Dangerous to extrapolate policy conclusions from a linear specification Conclusions