Assessing the efficacy of mobile phone interventions using randomised controlled trials: issues and their solutions. Dr Emma Beard

Similar documents
Validity and reliability of measurements

Validity and reliability of measurements

Disentangling the gateway hypothesis: does e-cigarette use cause subsequent smoking in adolescents?

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

Instrument for the assessment of systematic reviews and meta-analysis

Exploring the Impact of Missing Data in Multiple Regression

Module 5. The Epidemiological Basis of Randomised Controlled Trials. Landon Myer School of Public Health & Family Medicine, University of Cape Town

Version No. 7 Date: July Please send comments or suggestions on this glossary to

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch.

Controlled Trials. Spyros Kitsiou, PhD

Programme Name: Climate Schools: Alcohol and drug education courses

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Help! Statistics! Missing data. An introduction

CAN EFFECTIVENESS BE MEASURED OUTSIDE A CLINICAL TRIAL?

Module 14: Missing Data Concepts

Using Propensity Score Matching in Clinical Investigations: A Discussion and Illustration

CONSORT: missing missing data guidelines, the effects on HTA monograph reporting Yvonne Sylvestre

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

QUASI-EXPERIMENTAL HEALTH SERVICE EVALUATION COMPASS 1 APRIL 2016

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Conducting Strong Quasi-experiments

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study

Appendix 1. Sensitivity analysis for ACQ: missing value analysis by multiple imputation

bivariate analysis: The statistical analysis of the relationship between two variables.

Analysis of TB prevalence surveys

Propensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018

A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases

The effectiveness of smoking cessation treatments used in the "real world"

Lecture 3. Previous lecture. Learning outcomes of lecture 3. Today. Trustworthiness in Fixed Design Research. Class question - Causality

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision

Section on Survey Research Methods JSM 2009

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

The RoB 2.0 tool (individually randomized, cross-over trials)

The ROBINS-I tool is reproduced from riskofbias.info with the permission of the authors. The tool should not be modified for use.

Week 10 Hour 1. Shapiro-Wilks Test (from last time) Cross-Validation. Week 10 Hour 2 Missing Data. Stat 302 Notes. Week 10, Hour 2, Page 1 / 32

Example 7.2. Autocorrelation. Pilar González and Susan Orbe. Dpt. Applied Economics III (Econometrics and Statistics)

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

Propensity Score Analysis Shenyang Guo, Ph.D.

EFFECTIVENESS OF PHONE AND LIFE- STYLE COUNSELING FOR LONG TERM WEIGHT CONTROL AMONG OVERWEIGHT EMPLOYEES

Problem solving therapy

Alcohol interventions in secondary and further education

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

TRIPLL Webinar: Propensity score methods in chronic pain research

Longitudinal data monitoring for Child Health Indicators

Clinical trial design issues and options for the study of rare diseases

GRADE. Grading of Recommendations Assessment, Development and Evaluation. British Association of Dermatologists April 2018

Just what research design am I using? Shona J. Kelly Professor of Interdisciplinary Research

Supplementary Online Content

Challenges of Observational and Retrospective Studies

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Glossary. Ó 2010 John Wiley & Sons, Ltd

Introduction to Observational Studies. Jane Pinelis

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

How to analyze correlated and longitudinal data?

Supplementary Appendix

Evidence Based Medicine

Joseph W Hogan Brown University & AMPATH February 16, 2010

Evaluating Survey Data: Mediation Analysis and Propensity Score Matching

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha

OHDSI Tutorial: Design and implementation of a comparative cohort study in observational healthcare data

Endovascular or open repair for ruptured abdominal aortic aneurysm: 30-day outcomes from

ON THE MAGNITUDE AND PERSISTENCE OF THE HAWTHORNE EFFECT EVIDENCE FROM FOUR FIELD STUDIES

Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews

Recent advances in non-experimental comparison group designs

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Methods for treating bias in ISTAT mixed mode social surveys

The role of Randomized Controlled Trials

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy

BLISTER STATISTICAL ANALYSIS PLAN Version 1.0

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies

Use of GEEs in STATA

Systematic reviews and meta-analyses of observational studies (MOOSE): Checklist.

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA

Missing Data and Imputation

Results. NeuRA Hypnosis June 2016

Matched Cohort designs.

Ec331: Research in Applied Economics Spring term, Panel Data: brief outlines

Moving beyond regression toward causality:

Online Supplementary Material

d =.20 which means females earn 2/10 a standard deviation more than males

SMART Clinical Trial Designs for Dynamic Treatment Regimes

Multiple imputation for multivariate missing-data. problems: a data analyst s perspective. Joseph L. Schafer and Maren K. Olsen

Models for potentially biased evidence in meta-analysis using empirically based priors

CONSORT 2010 checklist of information to include when reporting a randomised trial*

BIOSTATISTICAL METHODS

GLOSSARY OF GENERAL TERMS

Strategies for handling missing data in randomised trials

PubH 7405: REGRESSION ANALYSIS. Propensity Score

Systematic Reviews. Simon Gates 8 March 2007

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives

Examining Relationships Least-squares regression. Sections 2.3

Dealing with uncertainty in the economic evaluation of health care technologies

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Transcription:

Assessing the efficacy of mobile phone interventions using randomised controlled trials: issues and their solutions Dr Emma Beard 2 nd Behaviour Change Conference Digital Health and Wellbeing 24 th -25 th Feb 2016

Background Efficacy of mobile phone interventions are generally assessed using exploratory methods (e.g., Pilot studies and proof of concept) In an era of evidence based practice calls have been to made to evaluate them with more rigour Randomised controlled trials Introduces a number of methodological and statistical analysis issues

Statistical issues Missing data multiple imputation Assessing engagement composite score and survival analysis Randomisation failure regression adjustment and propensity score matching Confounding effect of time random and fixed effect coefficients Time-series data ARIMA modelling Underpowered Bayes Factors

Missing data Missing data because of loss to follow-up or nonengagement Deletion methods Listwise deletion (complete case analysis) Reduces statistical power Doesn t use all the information Estimates are biased if data are not Missing Completely at Random (missing value does not depend on X or Y) Gender Age SES Female 18 High Male. Low Female 25. Single imputation methods (mean/mode substitution) Reduces variability Weakens covariance and correlation estimates in the data (ignores relationships between variables) Solution: model-based methods such as multiple imputation

Missing data: Multiple imputation Data are fill in with imputed values using a specified regression model This step is repeated m times, resulting in a separate dataset each time Analyses are performed on each dataset Results are pooled into one estimate (e.g. Rubin s Rules) Variability is more accurate for each missing value Considers variability due to sampling and variability due to imputation A primer on multiple imputation (Schafer, 1999) How may imputations are needed? (Graham et al, 2007) Potential and pitfalls (Sterne et al, 2009) Example: Mobile phone test messages for improving adherence to antiretroviral therapy (ART) (Mbuagbaw et al (2013)

Assessing engagement Need to better understand factors influencing engagement (Bailey et al, 2015) But how do we assess engagement? Viewing duration Program visits Composite measure: Analysis of Cobb et al (2007) Web-based QuitNet smoking cessation program used the product of the number of logins and duration per login in minutes (assumes each measure weighted equally) Statistical weighting methods: multiple correspondence analysis and principle components analysis Handbook on constructing composite indicators (http://www.oecd.org/std/42495745.pdf)

Assessing engagement Sometimes, we are interested in how group assignment is associated with engagement and non-engagement t-test (group(control/exp)~engagement (yes/no) But, sometimes we are interested in how group assignment affects time till non-engagement (duration of engagement) Why not linear regression? Group ~ time till non-engagement Survival times are typically positive numbers Cannot handle censoring of observations (Observations of time till an event are incomplete. For example, the study may end before the person has disengaged) Survival analysis (e.g., Epton et al, 2013) Traditionally used to assess time to an event eg., death, onset of a disease, length of stay in hospital Cox proportional hazard model (Fisher et al, 1999)

Randomisation failure Sometimes randomisation breaks down Subversion bias: manipulation of assignment to groups Schulz et al (1995) badly concealed allocation led to increased effect sizes (cheating?) Technical bias: allocation system breaks down COMET 1 trial (a trial of epidural anaesthetics for women in labour) program computer fault Attrition bias: attrition differs between groups Problem: confounding participant characteristics which may be associated with the outcome are not evenly distributed

Solution to: randomisation failure Regression adjustment Regress the outcome on the covariates including an indicator variable for treatment Statistically significant coefficient of treatment indicates a treatment effect Sometimes fails to adequately adjust for differences in observed covariates (Baser, 2007) What about unobserved covariates? Propensity score matching (Rosenbaum & Rubin, 1983) Propensity score provides a single metric that summarises all the information from explanatory variables such as disease severity and comorbidity It estimates the probability of a subject receiving the intervention of interest given his or her clinical status/sociodemographic characteristics Create a group of treated and control units that have similar characteristics Both methods can be combined (Austin, 2011; Austin, 2014)

Derive a propensity score using variables measured at baseline (so they have not been affected by treatment) Match participants on their propensity score Analyse these two matched groups Can reduce sample size due to unmatched participants Instead of one-to-one matching use one-to-two or one-to-three matching

Confounding effect of time In cross-sectional surveys of mobile phone application users, data may be collected at different times Need to adjust for time in pooled analyses Responses at time 1 may be different to responses at time 2 as a consequence of external unrelated factors For example, the smoking ban in public places reduced population prevalence rates introduced confounding if measurements for a mobile phone intervention took place before and immediately proceeding the ban. Solutions: Include a fixed effect covariate in our regression model i.e. Y = β 0 + time 1 X + treatment 2 X + e Include a random effect for time i.e use multi-level modelling If we assume that participants measured at one point in time are more similar (i.e. there is clustering) See Modugno et al (2013) A multilevel approach for repeated cross-sectional data

Time-series data Mobile phone applications can also provide a proliferation of time-series data Killingsworth and Gibert (2010) iphone application which tracks various behaviors, cognitions, and affect over time Database contained almost a quarter of a million psychological measurements from individuals in 83 countries. Such data generally cannot be analysed with traditional regression based methods due to internal structure e.g. autocorrelation (correlation for a variable with itself at a previous time period) and seasonality

Time-series data Need to look at use of more advanced methods from economics and financial arenas Dynamic regression models with Autoregressive integrative moving average (ARIMA) terms (Hyndman and Athanasopoulos, 2014): A regression model can be first fit to the data for explanatory or descriptive modelling ARIMA terms can be fit to the residuals in order to account for any remaining autocorrelation Prior values (AR terms) Lingering effects of prior unobserved shocks e.g. white noise (MA terms) Can also include seasonal ARIMA terms to account for seasonality

Power of a study Chance of finding a significant effect if one exists (usually set to 80%) Higher for smaller standard deviations, larger sample sizes and larger effect sizes Before you conduct an RCT you need to calculate the sample size required to detect a clinically important effect Sometimes obtaining an adequate sample size can be hard!

Impact of being under powered If your study is underpowered you may fail to detect an effect even if one is present (type 2 error) The boy who cried wolf The second error the villagers did (when they didn't believe him) was type 2 error. Cannot interpret a non-significant effect: Data are insensitive or evidence for the null hypothesis

Solution: Bayes Factors Strength of evidence for some theory versus another If BF > 1 then the data support your theory over the null If Bf<1 then the data support the null over your theory If B=about 1, your experiment was not sensitive Jeffreys (1939): BF s more than 3 or less than 1/3 rd provide substantial evidence B>3 substantial support for theory B<1/3 substantial support for the null

To calculate a Bayes Factor Online calculator: Zoltan Dienes (http://www.lifesci.sussex.ac.uk/home/zoltan_dienes/inf erence/bayes.htm) Baguley and Kaye (2010) provide equivalent R code Dienes (2014) Using Bayes to get the most out of nonsignificant results

Acknowledgements Thank you for listening My salary is funded by the NIHR School for Public Health Research and Cancer Research UK (CRUK)