Small-area estimation of mental illness prevalence for schools

Similar documents
Small-area estimation of prevalence of serious emotional disturbance (SED) in schools. Alan Zaslavsky Harvard Medical School

County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS)

Jeremy Aldworth, Neeraja Sathe, Misty Foster. July 29, Cornwallis Road P.O. Box Research Triangle Park, North Carolina, USA 27709

U.S. National Data on Prevalence of Mental Disorders in Children

Middle-aged and Older Adults: Findings from the National Survey of American Life

PRIMARY MENTAL HEALTH CARE MINIMUM DATA SET. Scoring the Kessler 10 Plus

Data Analysis Using Regression and Multilevel/Hierarchical Models

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Selection of Linking Items

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

JSM Survey Research Methods Section

Comparing Multiple Imputation to Single Imputation in the Presence of Large Design Effects: A Case Study and Some Theory

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine

Module Overview. What is a Marker? Part 1 Overview

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

Bayesian hierarchical modelling

Supplementary Materials:

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Weight Adjustment Methods using Multilevel Propensity Models and Random Forests

Model-based Small Area Estimation for Cancer Screening and Smoking Related Behaviors

Can Family Strengths Reduce Risk of Substance Abuse among Youth with SED?

Clinical trials with incomplete daily diary data

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

THE UNIVERSITY OF OKLAHOMA HEALTH SCIENCES CENTER GRADUATE COLLEGE A COMPARISON OF STATISTICAL ANALYSIS MODELING APPROACHES FOR STEPPED-

Chapter 1: Exploring Data

Multiple Mediation Analysis For General Models -with Application to Explore Racial Disparity in Breast Cancer Survival Analysis

An Introduction to Bayesian Statistics

The Late Pretest Problem in Randomized Control Trials of Education Interventions

Advanced IPD meta-analysis methods for observational studies

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis

A Nonlinear Hierarchical Model for Estimating Prevalence Rates with Small Samples

Propensity Score Methods with Multilevel Data. March 19, 2014

On Regression Analysis Using Bivariate Extreme Ranked Set Sampling

Nonresponse Adjustment Methodology for NHIS-Medicare Linked Data

Statistical Tolerance Regions: Theory, Applications and Computation

investigate. educate. inform.

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Analyzing data from educational surveys: a comparison of HLM and Multilevel IRT. Amin Mousavi

Improving ecological inference using individual-level data

Disparities in Tobacco Product Use in the United States

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties

CHAPTER 3 RESEARCH METHODOLOGY

Analyzing diastolic and systolic blood pressure individually or jointly?

Title: Estimates and Projections of Prevalence of Type Two Diabetes Mellitus in the US 2001, 2011, 2021: The Role of Demographic Factors in T2DM.

Ecological Statistics

Sample size calculation for a stepped wedge trial

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Bayesian Joint Modelling of Benefit and Risk in Drug Development

An Empirical Study to Evaluate the Performance of Synthetic Estimates of Substance Use in the National Survey of Drug Use and Health

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

Using PIAAC Data for Producing Regional Estimates Follow #PIAAC_Invitational

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

Colorectal Cancer in Idaho November 2, 2006 Chris Johnson, CDRI

On cows and test construction

Applying Bivariate Meta-analyses when Within-study Correlations are Unknown

LIHS Mini Master Class Multilevel Modelling

META-ANALYSIS OF DEPENDENT EFFECT SIZES: A REVIEW AND CONSOLIDATION OF METHODS

T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design

Bayesian and Frequentist Approaches

Women with Co-Occurring Serious Mental Illness and a Substance Use Disorder

CIGARETTE SMOKING AMONG ADOLESCENTS AND ADULTS IN U.S. STATES AND THE DISTRICT OF COLUMBIA IN 1997 AND WHAT EXPLAINS THE RELATIONSHIP?

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Ordinal Data Modeling

Nonresponse Rates and Nonresponse Bias In Household Surveys

Multivariate Multilevel Models

Mediation Analysis With Principal Stratification

Finding Gold: Results from National Outcome Measures for Healthy Transition Initiative

Supplementary Appendix

The Effect of Extremes in Small Sample Size on Simple Mixed Models: A Comparison of Level-1 and Level-2 Size

California 2,287, % Greater Bay Area 393, % Greater Bay Area adults 18 years and older, 2007

A Comparison of Methods for Determining HIV Viral Set Point

Impact of Nonresponse on Survey Estimates of Physical Fitness & Sleep Quality. LinChiat Chang, Ph.D. ESRA 2015 Conference Reykjavik, Iceland

arxiv: v2 [stat.ap] 7 Dec 2016

The Impact of Adverse Childhood Experiences on Psychopathology and Suicidal Behaviour in the Northern Ireland Population

Jonathan D. Sugimoto, PhD Lecture Website:

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

Bayesian Dose Escalation Study Design with Consideration of Late Onset Toxicity. Li Liu, Glen Laird, Lei Gao Biostatistics Sanofi

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

THE IMPACT OF MISSING STAGE AT DIAGNOSIS ON RESULTS OF GEOGRAPHIC RISK OF LATE- STAGE COLORECTAL CANCER

Design for Targeted Therapies: Statistical Considerations

An Introduction to Multiple Imputation for Missing Items in Complex Surveys

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model

Estimating Reliability in Primary Research. Michael T. Brannick. University of South Florida

Technical Specifications

Business Statistics Probability

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

Remarks on Bayesian Control Charts

Assessing spatial heterogeneity of MDR-TB in a high burden country

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait.

Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy

National Findings on Mental Illness and Drug Use by Prisoners and Jail Inmates. Thursday, August 17

An informal analysis of multilevel variance

COMPARISON OF TWO BWARIATE STRATIFICATION SCHEMES Alan Roshwalb and Scott Regnier Georgetown University, Washington, D.C

Technical appendix Strengthening accountability through media in Bangladesh: final evaluation

Transcription:

Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March 6, 2012

Introduction Mental disorders account for 15% of overall burden of diseases in U.S.

Introduction Mental disorders account for 15% of overall burden of diseases in U.S. Early onset leads research focus on children and adolescents.

Introduction Mental disorders account for 15% of overall burden of diseases in U.S. Early onset leads research focus on children and adolescents. Significant variation exists in prevalence of mental illness across schools and geographical regions.

Introduction Mental disorders account for 15% of overall burden of diseases in U.S. Early onset leads research focus on children and adolescents. Significant variation exists in prevalence of mental illness across schools and geographical regions. Information about prevalence of serious emotional disturbance (SED) among youth in small areas (states, counties) are valuable for mental health policy planning.

Introduction Mental disorders account for 15% of overall burden of diseases in U.S. Early onset leads research focus on children and adolescents. Significant variation exists in prevalence of mental illness across schools and geographical regions. Information about prevalence of serious emotional disturbance (SED) among youth in small areas (states, counties) are valuable for mental health policy planning. Carrying out surveys to obtain direct estimates of SED in small areas are prohibitively expensive.

Introduction Mental disorders account for 15% of overall burden of diseases in U.S. Early onset leads research focus on children and adolescents. Significant variation exists in prevalence of mental illness across schools and geographical regions. Information about prevalence of serious emotional disturbance (SED) among youth in small areas (states, counties) are valuable for mental health policy planning. Carrying out surveys to obtain direct estimates of SED in small areas are prohibitively expensive. Most of current literature is based on synthetic estimation (reweight national surveys).

Motivation Data on short screening scales for SED, e.g., the K6, are collected in three major national health tracking surveys -

Motivation Data on short screening scales for SED, e.g., the K6, are collected in three major national health tracking surveys - (1) National Health Interview Survey (NHIS) (2) CDC s Behavioral Risk Factors Surveillance Survey (BRFSS) (3) SAMHSA s National Household Survey on Drug Use and Health (NSDUH)

Motivation Data on short screening scales for SED, e.g., the K6, are collected in three major national health tracking surveys - (1) National Health Interview Survey (NHIS) (2) CDC s Behavioral Risk Factors Surveillance Survey (BRFSS) (3) SAMHSA s National Household Survey on Drug Use and Health (NSDUH) Short screening scales are known to be correlated with clinical diagnosis. Can we utilize such information?

Motivation Data on short screening scales for SED, e.g., the K6, are collected in three major national health tracking surveys - (1) National Health Interview Survey (NHIS) (2) CDC s Behavioral Risk Factors Surveillance Survey (BRFSS) (3) SAMHSA s National Household Survey on Drug Use and Health (NSDUH) Short screening scales are known to be correlated with clinical diagnosis. Can we utilize such information? Motivated by the National Comorbidity Survey - Adolescent (NCS-A): 1st nationally representative population survey to evaluate mental health of adolescents.

Motivation Data on short screening scales for SED, e.g., the K6, are collected in three major national health tracking surveys - (1) National Health Interview Survey (NHIS) (2) CDC s Behavioral Risk Factors Surveillance Survey (BRFSS) (3) SAMHSA s National Household Survey on Drug Use and Health (NSDUH) Short screening scales are known to be correlated with clinical diagnosis. Can we utilize such information? Motivated by the National Comorbidity Survey - Adolescent (NCS-A): 1st nationally representative population survey to evaluate mental health of adolescents. Data on both clinical diagnosis and short screening scale are available.

NCS-A NCS-A school sample: 9244 adolescents from 318 schools, 42 sampling strata.

NCS-A NCS-A school sample: 9244 adolescents from 318 schools, 42 sampling strata.

NCS-A NCS-A school sample: 9244 adolescents from 318 schools, 42 sampling strata. Each individual is measured by 1. Short screening scale - K6: 6 questions about mental health during last 30 days, measured on 0-4 scale.

NCS-A NCS-A school sample: 9244 adolescents from 318 schools, 42 sampling strata. Each individual is measured by 1. Short screening scale - K6: 6 questions about mental health during last 30 days, measured on 0-4 scale. 2. Clinical diagnostic score of SED: CIDI-A.

NCS-A NCS-A school sample: 9244 adolescents from 318 schools, 42 sampling strata. Each individual is measured by 1. Short screening scale - K6: 6 questions about mental health during last 30 days, measured on 0-4 scale. 2. Clinical diagnostic score of SED: CIDI-A. Individual socio-demographic and school information is also collected.

NCS-A NCS-A school sample: 9244 adolescents from 318 schools, 42 sampling strata. Each individual is measured by 1. Short screening scale - K6: 6 questions about mental health during last 30 days, measured on 0-4 scale. 2. Clinical diagnostic score of SED: CIDI-A. Individual socio-demographic and school information is also collected. Goal: develop a methodology to provide small area estimates of a gold-standard measure of SED from short screening scale, together with socio-demographic variables.

Small-area estimation Small-area : domains with inadequate sample sizes for precise direct estimates.

Small-area estimation Small-area : domains with inadequate sample sizes for precise direct estimates. Small-area estimation (SAE): borrow strength across domains.

Small-area estimation Small-area : domains with inadequate sample sizes for precise direct estimates. Small-area estimation (SAE): borrow strength across domains. Multivariate models improve small-area estimation for each outcome (De Souza, 1992).

Small-area estimation Small-area : domains with inadequate sample sizes for precise direct estimates. Small-area estimation (SAE): borrow strength across domains. Multivariate models improve small-area estimation for each outcome (De Souza, 1992). In most existing multivariate SAE, all outcomes are observed in each domain.

Small-area estimation Small-area : domains with inadequate sample sizes for precise direct estimates. Small-area estimation (SAE): borrow strength across domains. Multivariate models improve small-area estimation for each outcome (De Souza, 1992). In most existing multivariate SAE, all outcomes are observed in each domain. Our study is different: except for a relatively small calibration sample, only one outcome (the K6) is observed.

Small-area estimation Small-area : domains with inadequate sample sizes for precise direct estimates. Small-area estimation (SAE): borrow strength across domains. Multivariate models improve small-area estimation for each outcome (De Souza, 1992). In most existing multivariate SAE, all outcomes are observed in each domain. Our study is different: except for a relatively small calibration sample, only one outcome (the K6) is observed. Our goal: using a model estimated from the calibration sample, we predict the small area quantities of another (missing) outcome (SED) from the observed one.

Outline for small-area prediction 1. Build a model on the bivariate (both K6 and SED) NCS-A data and estimate model parameters.

Outline for small-area prediction 1. Build a model on the bivariate (both K6 and SED) NCS-A data and estimate model parameters. 2. Derive formulas for predicting the small-area means of SED from K6 given the estimated parameters.

Outline for small-area prediction 1. Build a model on the bivariate (both K6 and SED) NCS-A data and estimate model parameters. 2. Derive formulas for predicting the small-area means of SED from K6 given the estimated parameters. 3. For a new data set with only K6, collect auxiliary information (e.g., socio-demographic) for each individual.

Outline for small-area prediction 1. Build a model on the bivariate (both K6 and SED) NCS-A data and estimate model parameters. 2. Derive formulas for predicting the small-area means of SED from K6 given the estimated parameters. 3. For a new data set with only K6, collect auxiliary information (e.g., socio-demographic) for each individual. 4. Plug in parameter estimates from the NCS-A and the auxiliary information of the new sample into small area prediction formulas derived in Step 2.

Data and notations We focus on two-level hierarchical structure with individual (i, j) for j = 1,..., J i belonging to cluster i for i = 1,..., I.

Data and notations We focus on two-level hierarchical structure with individual (i, j) for j = 1,..., J i belonging to cluster i for i = 1,..., I. Each individual has two continuous outcomes: Y 1, Y 2. For example, in the NCS-A: Y 1 is sum of the K6 scores; Y 2 is the linear clinical diagnostic score of SED.

Data and notations We focus on two-level hierarchical structure with individual (i, j) for j = 1,..., J i belonging to cluster i for i = 1,..., I. Each individual has two continuous outcomes: Y 1, Y 2. For example, in the NCS-A: Y 1 is sum of the K6 scores; Y 2 is the linear clinical diagnostic score of SED.

Data and notations We focus on two-level hierarchical structure with individual (i, j) for j = 1,..., J i belonging to cluster i for i = 1,..., I. Each individual has two continuous outcomes: Y 1, Y 2. For example, in the NCS-A: Y 1 is sum of the K6 scores; Y 2 is the linear clinical diagnostic score of SED. Auxiliary information (covariates): individual-level X ij1, X ij2 ; cluster-level Z i1, Z i2.

Data and notations We focus on two-level hierarchical structure with individual (i, j) for j = 1,..., J i belonging to cluster i for i = 1,..., I. Each individual has two continuous outcomes: Y 1, Y 2. For example, in the NCS-A: Y 1 is sum of the K6 scores; Y 2 is the linear clinical diagnostic score of SED. Auxiliary information (covariates): individual-level X ij1, X ij2 ; cluster-level Z i1, Z i2. Objective: provide small area prediction of second-level mean of Y 2 from Y 1 for a distinct new sample where only Y 1 and X, Z are observed.

Bivariate model for NCS-A: two continuous outcomes Two-level bivariate random effects model: Y ij1 = X ij1 β 1 + Z i1 α 1 + v i1 + e ij1, Y ij2 = X ij2 β 2 + Z i2 α 2 + v i2 + e ij2, (1)

Bivariate model for NCS-A: two continuous outcomes Two-level bivariate random effects model: Y ij1 = X ij1 β 1 + Z i1 α 1 + v i1 + e ij1, Y ij2 = X ij2 β 2 + Z i2 α 2 + v i2 + e ij2, (1) with (v i1, v i2 ) N(0, Σ v ), (e ij1, e ij2 ) N(0, Σ e ), and ( σ Σ e = e1 2 ρ e σ e1 σ e2 ρ e σ e1 σ e2 σ 2 e2 ) (, Σ v = σ 2 v1 ρ v σ v1 σ v2 ρ v σ v1 σ v2 σ 2 v2 ).

Bivariate model for NCS-A: two continuous outcomes Two-level bivariate random effects model: Y ij1 = X ij1 β 1 + Z i1 α 1 + v i1 + e ij1, Y ij2 = X ij2 β 2 + Z i2 α 2 + v i2 + e ij2, (1) with (v i1, v i2 ) N(0, Σ v ), (e ij1, e ij2 ) N(0, Σ e ), and ( σ Σ e = e1 2 ρ e σ e1 σ e2 ρ e σ e1 σ e2 σ 2 e2 ) (, Σ v = σ 2 v1 ρ v σ v1 σ v2 ρ v σ v1 σ v2 σ 2 v2 ). For a continuous and a binary outcome, we can assume a probit model for the binary, implementation is similar.

Model Fitting: Hierarchical Bayes We adopt the hierarchical Bayes approach to fit model (1).

Model Fitting: Hierarchical Bayes We adopt the hierarchical Bayes approach to fit model (1). Assume uninformative uniform priors for β, α: β 1, α 1.

Model Fitting: Hierarchical Bayes We adopt the hierarchical Bayes approach to fit model (1). Assume uninformative uniform priors for β, α: β 1, α 1. Assume inverse Wishart prior for Σ: Σ 1 Wishart(b 0, b 0 Σ 1 0 ), b 0 : prior sample size; Σ 0 : prior covariance matrix.

Model Fitting: Hierarchical Bayes We adopt the hierarchical Bayes approach to fit model (1). Assume uninformative uniform priors for β, α: β 1, α 1. Assume inverse Wishart prior for Σ: Σ 1 Wishart(b 0, b 0 Σ 1 0 ), b 0 : prior sample size; Σ 0 : prior covariance matrix. Prior ignorance: set b 0 = 3 and Σ 0 diagonal.

Model Fitting: Hierarchical Bayes We adopt the hierarchical Bayes approach to fit model (1). Assume uninformative uniform priors for β, α: β 1, α 1. Assume inverse Wishart prior for Σ: Σ 1 Wishart(b 0, b 0 Σ 1 0 ), b 0 : prior sample size; Σ 0 : prior covariance matrix. Prior ignorance: set b 0 = 3 and Σ 0 diagonal. Sensitivity analysis show the results are robust to the above prior settings, with a slightly conservative estimation of correlation.

Model Fitting: Hierarchical Bayes We adopt the hierarchical Bayes approach to fit model (1). Assume uninformative uniform priors for β, α: β 1, α 1. Assume inverse Wishart prior for Σ: Σ 1 Wishart(b 0, b 0 Σ 1 0 ), b 0 : prior sample size; Σ 0 : prior covariance matrix. Prior ignorance: set b 0 = 3 and Σ 0 diagonal. Sensitivity analysis show the results are robust to the above prior settings, with a slightly conservative estimation of correlation. Model is fitted by the software MLwiN 2.02 (Browne, 2005)

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated.

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated. Target population: a distinct new population of clusters that we want to estimate cluster-mean Y 2, but with only data on

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated. Target population: a distinct new population of clusters that we want to estimate cluster-mean Y 2, but with only data on Survey sample: outcome Y 1 and auxiliary variables X 1, X 2 are available.

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated. Target population: a distinct new population of clusters that we want to estimate cluster-mean Y 2, but with only data on Survey sample: outcome Y 1 and auxiliary variables X 1, X 2 are available. Both target population and survey sample are within the same cluster, but there are two different scenarios:

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated. Target population: a distinct new population of clusters that we want to estimate cluster-mean Y 2, but with only data on Survey sample: outcome Y 1 and auxiliary variables X 1, X 2 are available. Both target population and survey sample are within the same cluster, but there are two different scenarios: 1. In-sample: the target population is equal to or a subset of the survey sample.

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated. Target population: a distinct new population of clusters that we want to estimate cluster-mean Y 2, but with only data on Survey sample: outcome Y 1 and auxiliary variables X 1, X 2 are available. Both target population and survey sample are within the same cluster, but there are two different scenarios: 1. In-sample: the target population is equal to or a subset of the survey sample. 2. Out-of-sample: the target population is distinct from the survey sample.

Small-area prediction: general settings Calibration sample: both Y 1, Y 2 are observed, where the parameters (α, β, Σ e, Σ v ) in model (1) are estimated. Target population: a distinct new population of clusters that we want to estimate cluster-mean Y 2, but with only data on Survey sample: outcome Y 1 and auxiliary variables X 1, X 2 are available. Both target population and survey sample are within the same cluster, but there are two different scenarios: 1. In-sample: the target population is equal to or a subset of the survey sample. 2. Out-of-sample: the target population is distinct from the survey sample. Lead to different small-area prediction formulas.

Small-area prediction formulas For a in-sample case, the key is to find the distribution of s i = v i2 + ē i2 conditional on Y 1, X 1, Z 1.

Small-area prediction formulas For a in-sample case, the key is to find the distribution of s i = v i2 + ē i2 conditional on Y 1, X 1, Z 1. After some algebra, we can show s i Y i 1, X i 1, Z i1, θ N( µ si, σ 2 si ), where µ si = c si (Ȳi1 X i1 β 1 ), and σ si 2 = σv2 2 + σ2 e2 (ρ vσ v1 σ v2 + ρ e σ e1 σ e2 / Ji1 J i2 ) 2 J i2 σv1 2 + σ2 e1 /J, i1

Small-area prediction formulas For a in-sample case, the key is to find the distribution of s i = v i2 + ē i2 conditional on Y 1, X 1, Z 1. After some algebra, we can show s i Y i 1, X i 1, Z i1, θ N( µ si, σ 2 si ), where µ si = c si (Ȳi1 X i1 β 1 ), and σ si 2 = σv2 2 + σ2 e2 (ρ vσ v1 σ v2 + ρ e σ e1 σ e2 / Ji1 J i2 ) 2 J i2 σv1 2 + σ2 e1 /J, i1 with prediction coefficient c si = ρ vσ v1 σ v2 + ρ e σ e1 σ e2 / J i1 J i2 σv1 2 + σ2 e1 /J. i1

Small-area prediction Prediction for the cluster-level mean of a continuous outcome: E(Ȳi 2 Y i1, X i1, Z i1, X i2, Z i2, θ) X i2 β 2 + Z i2 α 2 + µ si where parameters θ as given (the estimates from model (1) and the NCS-A data).

Small-area prediction Prediction for the cluster-level mean of a continuous outcome: E(Ȳi 2 Y i1, X i1, Z i1, X i2, Z i2, θ) X i2 β 2 + Z i2 α 2 + µ si where parameters θ as given (the estimates from model (1) and the NCS-A data). Variance of the estimates can be obtained numerically.

Small-area prediction Prediction for the cluster-level mean of a continuous outcome: E(Ȳi 2 Y i1, X i1, Z i1, X i2, Z i2, θ) X i2 β 2 + Z i2 α 2 + µ si where parameters θ as given (the estimates from model (1) and the NCS-A data). Variance of the estimates can be obtained numerically. The key to the out-of-sample case is to obtain conditional distribution of the cluster-level random effects v i2 - the formulas have the same form but simpler.

Small-area prediction Prediction for the cluster-level mean of a continuous outcome: E(Ȳi 2 Y i1, X i1, Z i1, X i2, Z i2, θ) X i2 β 2 + Z i2 α 2 + µ si where parameters θ as given (the estimates from model (1) and the NCS-A data). Variance of the estimates can be obtained numerically. The key to the out-of-sample case is to obtain conditional distribution of the cluster-level random effects v i2 - the formulas have the same form but simpler. Formulas for binary outcome prediction are also developed.

Measure of reliability We measure accuracy of the prediction by ratio of the posterior to prior variance of the cluster-average random effects/erros: r 2 = Var(v i2 + ē i2 Y i 1, θ). Var(v i2 + ē i2 θ)

Measure of reliability We measure accuracy of the prediction by ratio of the posterior to prior variance of the cluster-average random effects/erros: r 2 = Var(v i2 + ē i2 Y i 1, θ). Var(v i2 + ē i2 θ) Reliability parameter ζ (variance reduction): ζ = 1 r 2.

Measure of reliability We measure accuracy of the prediction by ratio of the posterior to prior variance of the cluster-average random effects/erros: r 2 = Var(v i2 + ē i2 Y i 1, θ). Var(v i2 + ē i2 θ) Reliability parameter ζ (variance reduction): ζ = 1 r 2. ζ = 1: perfect prediction; ζ = 0: SAE data for Y 1 completely uninformative.

Measure of reliability Reliability of in-sample case: ζ i = [ρ v + ρ e σ e1 σ e2 /(J i1 σ v1 σ v2 )] 2 [1 + σ 2 e1 /(J i1σ 2 v1 )][1 + σ2 e2 /(J i2σ 2 v2 )].

Measure of reliability Reliability of in-sample case: ζ i = [ρ v + ρ e σ e1 σ e2 /(J i1 σ v1 σ v2 )] 2 [1 + σ 2 e1 /(J i1σ 2 v1 )][1 + σ2 e2 /(J i2σ 2 v2 )]. High correlation ρ v, ρ e, large observed cluster sample size J i, large ratio σ 2 v/σ 2 e all contribute to increased reliability of the results.

Measure of reliability Reliability of in-sample case: ζ i = [ρ v + ρ e σ e1 σ e2 /(J i1 σ v1 σ v2 )] 2 [1 + σ 2 e1 /(J i1σ 2 v1 )][1 + σ2 e2 /(J i2σ 2 v2 )]. High correlation ρ v, ρ e, large observed cluster sample size J i, large ratio σ 2 v/σ 2 e all contribute to increased reliability of the results. As J i increases, reliability converges to its upper bound ρ 2 v.

Measure of reliability Reliability of in-sample case: ζ i = [ρ v + ρ e σ e1 σ e2 /(J i1 σ v1 σ v2 )] 2 [1 + σ 2 e1 /(J i1σ 2 v1 )][1 + σ2 e2 /(J i2σ 2 v2 )]. High correlation ρ v, ρ e, large observed cluster sample size J i, large ratio σ 2 v/σ 2 e all contribute to increased reliability of the results. As J i increases, reliability converges to its upper bound ρ 2 v. For the same cluster sample size, in-sample cases have higher reliability than out-of-sample cases.

Application to NCS-A Schools are leading providers of mental health services to children and adolescents in the US.

Application to NCS-A Schools are leading providers of mental health services to children and adolescents in the US. We focus on the NCS-A school sample of the NCS-A to predict school-level SED prevalence from K6.

Application to NCS-A Schools are leading providers of mental health services to children and adolescents in the US. We focus on the NCS-A school sample of the NCS-A to predict school-level SED prevalence from K6. Schools with less than 10 students are dropped, resulting in 9022 students from 282 schools.

Application to NCS-A Schools are leading providers of mental health services to children and adolescents in the US. We focus on the NCS-A school sample of the NCS-A to predict school-level SED prevalence from K6. Schools with less than 10 students are dropped, resulting in 9022 students from 282 schools. Socio-demographic variables: age, sex, race/ethnicity, and age at entrance into primary school. School-level predictors include school size and public/private school status.

Application to NCS-A Schools are leading providers of mental health services to children and adolescents in the US. We focus on the NCS-A school sample of the NCS-A to predict school-level SED prevalence from K6. Schools with less than 10 students are dropped, resulting in 9022 students from 282 schools. Socio-demographic variables: age, sex, race/ethnicity, and age at entrance into primary school. School-level predictors include school size and public/private school status. The diagnostic instrument is a modification of the WHO s Composite International Diagnostic Interview (CIDI) appropriate for adolescents (CIDI-A).

The outcomes in NCS-A Y 2 - Probability of SED: calculated as a function of variables in the CIDI-A, from a probit model estimated from a small validation example.

The outcomes in NCS-A Y 2 - Probability of SED: calculated as a function of variables in the CIDI-A, from a probit model estimated from a small validation example. Y 1 - The K6 (Kessler and Mroczec, 1994): six short questions on mental health during the past 30 days - how often did you feel (1) nervous; (2) hopeless; (3) restless or fidgety; (4) so depressed that nothing could cheer you up; (5) everything was an effort; (6) worthless?

The outcomes in NCS-A Y 2 - Probability of SED: calculated as a function of variables in the CIDI-A, from a probit model estimated from a small validation example. Y 1 - The K6 (Kessler and Mroczec, 1994): six short questions on mental health during the past 30 days - how often did you feel (1) nervous; (2) hopeless; (3) restless or fidgety; (4) so depressed that nothing could cheer you up; (5) everything was an effort; (6) worthless? Each item uses a frequency response scale coded from 0 (never) to 4 (all the time). Sum of all six items.

The outcomes in NCS-A Y 2 - Probability of SED: calculated as a function of variables in the CIDI-A, from a probit model estimated from a small validation example. Y 1 - The K6 (Kessler and Mroczec, 1994): six short questions on mental health during the past 30 days - how often did you feel (1) nervous; (2) hopeless; (3) restless or fidgety; (4) so depressed that nothing could cheer you up; (5) everything was an effort; (6) worthless? Each item uses a frequency response scale coded from 0 (never) to 4 (all the time). Sum of all six items. Y 1 - Augmented K6 (Green et al., 2010): K6 supplemented by five additional CIDI items that specifically assessed behavior disorders.

Results of model fitting Augmented K6 SED Predictor Est SE Est SE intercept 2.635* (0.165) -2.369* (0.044) sex(male) -0.643* (0.068) -0.107* (0.016) age 14-year 0.266* (0.115) 0.079* (0.028) age 15-year 0.513* (0.124) 0.148* (0.030) age 16-year 0.373* (0.125) 0.150* (0.031) age 17-year 0.550* (0.127) 0.219* (0.032) age 18-year 0.496* (0.171) 0.205* (0.042) black 0.597* (0.101) 0.037 (0.025) hispanic 0.440* (0.104) 0.114* (0.026) other race 0.920* (0.151) 0.091* (0.036) start schl at 7 0.252* (0.075) 0.066* (0.018) start schl > 7 0.765* (0.151) 0.193* (0.036) schl size 0.153 (0.091) 0.082* (0.025) public schl 0.288* (0.135) 0.135* (0.037) Table: Estimates of coefficients of two-level models with posterior standard deviation (*Significant at 0.05 level two sided test).

Variance Components K6 and SED Aug. K6 and SED school individual school individual Variance (K6) 0.316 10.531 0.019 0.371 Variance (SED) 0.037 0.596 0.037 0.597 K6-SED correlation 0.695 0.516 0.845 0.544 Table: Estimated variance components of two-level models.

Variance Components K6 and SED Aug. K6 and SED school individual school individual Variance (K6) 0.316 10.531 0.019 0.371 Variance (SED) 0.037 0.596 0.037 0.597 K6-SED correlation 0.695 0.516 0.845 0.544 Table: Estimated variance components of two-level models. School-level correlation is strong despite modest individual-level correlation.

Variance Components K6 and SED Aug. K6 and SED school individual school individual Variance (K6) 0.316 10.531 0.019 0.371 Variance (SED) 0.037 0.596 0.037 0.597 K6-SED correlation 0.695 0.516 0.845 0.544 Table: Estimated variance components of two-level models. School-level correlation is strong despite modest individual-level correlation. Augmented K6 further improves school-level correlation.

Figure: Scatterplot of school-level K6 and the augmented K6 versus predicted SED for schools with more than 25 screened students.

Predictive models In a school-wide screening, the target population is exactly the survey sample: in-sample scenario with J i1 = J i2 = J i, X 1 = X 2 = X.

Predictive models In a school-wide screening, the target population is exactly the survey sample: in-sample scenario with J i1 = J i2 = J i, X 1 = X 2 = X. The prediction coefficients c si and reliability depend on the cluster sample size J i. Table: Reliability of small-area prediction 1 30 200 ζ out 0.00 0.28 0.60 0.72 ζ in 0.31 0.54 0.67 0.72

Predictive models In a school-wide screening, the target population is exactly the survey sample: in-sample scenario with J i1 = J i2 = J i, X 1 = X 2 = X. The prediction coefficients c si and reliability depend on the cluster sample size J i. Table: Reliability of small-area prediction 1 30 200 ζ out 0.00 0.28 0.60 0.72 ζ in 0.31 0.54 0.67 0.72 Bivariate analysis enables c si s to be extrapolated in practical cases with large school, but univariate analysis cannot.

Model Validation Use posterior predictive checks (Gelman, Meng and Stern, 1996) to check modeling fitting. Generate copies of the NCS-A data using 1000 posterior draws of the parameters from model (1) and calculate posterior predictive p-values. Table: Summary statistics from observed and simulated data. Posterior predictive p-values are shown in parenthesis. K6 Φ 1 (SED) summary statistics obs sim obs sim Individual mean -1.72-1.73 (0.41) -2.08-2.07 (0.37) Individual s.d. 0.63 0.63 (0.47) 0.81 0.81 (0.46) Mean of school means -1.72-1.72 (0.50) -2.08-2.07 (0.58) S.d. of school means 0.20 0.21 (0.58) 0.27 0.26 (0.88) Individual correlation with K6 - - 0.56 0.53 (0.57) School correlation with K6 - - 0.79 0.60 (0.85)

Comparison with the observed school means The NCS-A have both SED and K6, we can compare the model-based predictions with the observed SED.

Comparison with the observed school means The NCS-A have both SED and K6, we can compare the model-based predictions with the observed SED. We predict school SED prevalence from K6 in the NCS-A as if only K6 had been measured.

Comparison with the observed school means The NCS-A have both SED and K6, we can compare the model-based predictions with the observed SED. We predict school SED prevalence from K6 in the NCS-A as if only K6 had been measured. Average observed and predicted school prevalence is 6.1% and 5.9% respectively.

Comparison with the observed school means The NCS-A have both SED and K6, we can compare the model-based predictions with the observed SED. We predict school SED prevalence from K6 in the NCS-A as if only K6 had been measured. Average observed and predicted school prevalence is 6.1% and 5.9% respectively. The predicted and observed distributions of school means are well matched except in the upper tail (observed prevalence > 0.10)

Comparison with the observed school means The NCS-A have both SED and K6, we can compare the model-based predictions with the observed SED. We predict school SED prevalence from K6 in the NCS-A as if only K6 had been measured. Average observed and predicted school prevalence is 6.1% and 5.9% respectively. The predicted and observed distributions of school means are well matched except in the upper tail (observed prevalence > 0.10) The observed SED prevalence of 254 out of 282 (90.7%) schools falls into the 95% in-sample predictive intervals.

Comparison with the observed school means The NCS-A have both SED and K6, we can compare the model-based predictions with the observed SED. We predict school SED prevalence from K6 in the NCS-A as if only K6 had been measured. Average observed and predicted school prevalence is 6.1% and 5.9% respectively. The predicted and observed distributions of school means are well matched except in the upper tail (observed prevalence > 0.10) The observed SED prevalence of 254 out of 282 (90.7%) schools falls into the 95% in-sample predictive intervals. Despite the under-estimation, our method identifies 72.4% ( 21 out of 29) schools with the top 10% observed prevalence.

Comparison with the direct estimates A second check: compare direct and model-based estimates for aggregates of schools.

Comparison with the direct estimates A second check: compare direct and model-based estimates for aggregates of schools. Collapse the 42 geographical strata into 14 larger strata.

Comparison with the direct estimates A second check: compare direct and model-based estimates for aggregates of schools. Collapse the 42 geographical strata into 14 larger strata. Direct estimates: averages of the observed school means of SED probability within each strata weighted by the strata size.

Comparison with the direct estimates A second check: compare direct and model-based estimates for aggregates of schools. Collapse the 42 geographical strata into 14 larger strata. Direct estimates: averages of the observed school means of SED probability within each strata weighted by the strata size. In 10 out 14 strata, the bivariate predictions fall within the 95% confidence intervals of the direct estimates.

Comparison with the direct estimates A second check: compare direct and model-based estimates for aggregates of schools. Collapse the 42 geographical strata into 14 larger strata. Direct estimates: averages of the observed school means of SED probability within each strata weighted by the strata size. In 10 out 14 strata, the bivariate predictions fall within the 95% confidence intervals of the direct estimates. The same strata were identified as having the two highest observed and predicted prevalences.

Comparison with alternative methods (A) A standard regression-synthetic model: first regress individual-level SED score on covariates without K6, and then calculate individual predictions and thus predict school means (e.g., Hudson, 2010).

Comparison with alternative methods (A) A standard regression-synthetic model: first regress individual-level SED score on covariates without K6, and then calculate individual predictions and thus predict school means (e.g., Hudson, 2010). (B) A similar regression-synthetic model including K6 as a predictor.

Comparison with alternative methods (A) A standard regression-synthetic model: first regress individual-level SED score on covariates without K6, and then calculate individual predictions and thus predict school means (e.g., Hudson, 2010). (B) A similar regression-synthetic model including K6 as a predictor. (C) A univariate random-effects logistic model without K6 as a predictor. Y ij2 = X ij β + v i + e ij, v i N(0, σ 2 v), e ij N(0, σ 2 e).

Comparison with alternative methods (A) A standard regression-synthetic model: first regress individual-level SED score on covariates without K6, and then calculate individual predictions and thus predict school means (e.g., Hudson, 2010). (B) A similar regression-synthetic model including K6 as a predictor. (C) A univariate random-effects logistic model without K6 as a predictor. Y ij2 = X ij β + v i + e ij, v i N(0, σ 2 v), e ij N(0, σ 2 e). (D) The same as (C) but including K6 as a predictor. Y ij2 = Y ij1 α + X ij β + v i + e ij, v i N(0, σ 2 v), e ij N(0, σ 2 e).

Comparison with alternative methods Table: Comparison of errors of prediction of school-level SED prevalences in NCS-A from different SAE models. Model MSE ( 10 3 ) MAE ( 10 2 ) (A) Synthetic without K6 2.35 2.94 (B) Synthetic with K6 1.90 2.55 (C) Univariate without K6 2.44 2.95 (D) Univariate with K6 1.67 2.38 Bivariate multilevel 1.54 2.30 MSE = mean squared error, MAE = mean absolute error.

Discussion Develop a methodology for bivariate small-area prediction based on multilevel models.

Discussion Develop a methodology for bivariate small-area prediction based on multilevel models. Apply to NCS-A to provide small-area estimation of SED prevalence of schools from the short screening scale K6.

Discussion Develop a methodology for bivariate small-area prediction based on multilevel models. Apply to NCS-A to provide small-area estimation of SED prevalence of schools from the short screening scale K6. Provide evidence: K6 feasible alternative to clinical diagnosis at school level.

Discussion Develop a methodology for bivariate small-area prediction based on multilevel models. Apply to NCS-A to provide small-area estimation of SED prevalence of schools from the short screening scale K6. Provide evidence: K6 feasible alternative to clinical diagnosis at school level. Ongoing work: 1. Explore more flexible models (e.g., copula models with T distribution) to improve prediction in the upper tail of the prevalence distribution.

Discussion Develop a methodology for bivariate small-area prediction based on multilevel models. Apply to NCS-A to provide small-area estimation of SED prevalence of schools from the short screening scale K6. Provide evidence: K6 feasible alternative to clinical diagnosis at school level. Ongoing work: 1. Explore more flexible models (e.g., copula models with T distribution) to improve prediction in the upper tail of the prevalence distribution. 2. Develop open-source software packages for public usage.

Acknowledgements Harvard University HCP: Ron Kessler, Michael Gruber, Nancy Sampson Boston University: Jennifer Green The National Institute for Mental Health (NIMH), the National Institute on Drug Abuse (NIDA), the Substance Abuse and Mental Health Services Administration (SAMHSA), the Robert Wood Johnson Foundation, and the John W. Alden Trust.

Key References 1. Green, JG, Gruber, MJ, Kessler, RC, Sampson, NA, and Zaslavsky, AM. (2010). Improving the K6 Short Scale to Predict Serious Emotional Distress in Adolescents in the US. International Journal of Methods in Psychiatric Research, 19 (Supplement 1), 23-35. 2. Hudson, C. (2010). Validation of a model for estimating state and local prevalence of serious mental illness. International Journal of Methods in Psychiatric Research, 18(4). 3. Kessler, R. and Mroczec, D. (1994). Final Versions of Our Non-specific Psychological Distress Scale, University of Michigan, Survey Research Centre of the Institute for Social Research. 4. Li, F and Zaslavsky, AM. (2010). Using a short screening scale for small-area estimation of prevalence of mental illness prevalence for schools. Journal of the American Statistical Association 105(492), 1323-1332. 5. Li, F, Green, JG, Zaslavsky, AM, and Kessler, R. (2010). Estimating prevalence of serious emotional disturbance in schools using a brief screening scale. International Journal of Methods in Psychiatric Research 19 (Supplement 1), 88-98. 6. Merikangas, KR, Avenevoli, S, Costello, EJ, Koretz, D, and Kessler, RC. (2008). The National Comorbidity Survey Adolescent Supplement (NCS-A): I. Background and Measures. Journal of the American Academy of Child and Adolescent Psychiatry, 48, 367-369.