Palo. Alto Medical WHAT IS. combining segmented. Regression and. intervention, then. receiving the. over. The GLIMMIX. change of a. follow.
|
|
- Brittney Harmon
- 5 years ago
- Views:
Transcription
1 Regression and Stepped Wedge Designs Eric C. Wong, Po-Han Foundation Researchh Institute, Palo Alto, CA ABSTRACT Impact evaluation often equires assessing the impact of a new policy, intervention, product, or service in real-world, observational, or quasi-experimental situations. Often, these interventions are rolled out in phasess at different points over time making them good candidates for stepped wedge designs.. Segmented regression is one method for Chen, Dorothy Hung Palo Alto Medical measuring the change in a time series before and after an intervention. In this paper, we propose combining segmented regression and stepped wedge designs to analyze phased interventions over time. Specifically, we propose the use of generalized linear mixed models for a non-randomized, stepped wedged observational study. We start by describing segmented regression for an interrupted time series in location receiving the intervention, then extend the approach to N locations receiving the intervention sequentially over time. Lastly, we discuss accounting for temporal autocorrelation and relevant clustering of individuals within locations. INTRODUCTION Many pragmatic interventions are deployed outside the controlled research world. This is common among real-worldimprovement. process improvement initiatives where an organizationn may be deploying new practices aimed at Often, the primary question of interest is whether the new changes altered the trajectory of measured outcomes over time. Is there a new normal? In the simplest case, there are many methods used to analyze the before and after change in an observational study. However, the real-world can be more complex. Itt may be strategic for an organization to sequentially deploy an intervention, rolling it out in several phases across all its locations. Moreover, an organization with many locations may experience clustered effects where the intervention behaves more similarly among individuals within a location but differentially across locations. In this paper, we propose using generalized linear mixed models (as implemented by The GLIMMIX Procedure), and describe several examples of increasing complexity. Throughout, we will use an example of a healthcare organization with multiple clinic locations. Each location has several departments and many physicians working in each department. A process improvement was phased-in across departments over time. WHAT IS SEGMENTED REGRESSION? A popular approach to evaluating the introduction of a new policy or intervention is to observe the change of a measurable outcome over time. The intervention interrupts this timee series of the outcome, and one is usually interested in how the path of the time series before the interruption compares to the path of the time series after the interruption. The changes to the time series may either be immediatee or gradual over time. Linear regression is a simple approach to modeling data over time, and segmented regression is a particular case. It is a piecewise regression where through model specification, the estimates from a single regression describess the segment before the interruption (e.g. the pre-intervention period) andd the change from the pre- after the interruption (e.g. the post-interventionn period). The figure below illustrates some of the model specifications to follow. post-intervention slope β time _aft_int (+β time ) intervention effect pre-intervention slope β β intervention β time (+β )
2 The original data is plotted in the first panel of the figure above. The gray shaded area marks the time period after the interventionn began. The second and third panel showss relevant corresponding algebra derived from the model below. We start with a basic linear regression model. Although there are twoo segments, there is only one regression model. Y = β + β time x T + β intervention I + β x T Y: β : T: I: T : β time : β intervention : β : WHAT ARE STEPPED WEDGEE DESIGNS? In situations when randomization of receipt of an intervention cannott be done, for example, when it is unethical to withhold an efficacious intervention from locations, randomization of the startt time of an intervention can be considered instead. In these designs, all locations eventually receivee the intervention, and also have varying amounts of baselinee and follow-up data. Stepped wedge designs are a form of one-way cross- locations. When locations or groups of locations begin an intervention at different time points, the timelines visually resemble a stepped wedge, from which it derives its name. Often, phased interventions in the real world can be framed in this paradigm, however in the real world, the start time of the intervention is often determined strategicallyy or over designs and can be used with these phased intervention designs across multiple opportunistically by organizational leadership and is not randomized.. When using this paradigm to evaluate phased interventions, one should be diligent and cautious about bias when interpreting the results, as with any quasi-experiment. outcome model intercept time from the start of the observation period (,, ) intervention status ( before intervention, after intervention) time after start of the intervention, otherwise (,,,,,, 3, ) model parameter representing the pre-intervention n slope model parameter representing the immediate intervention effect, additive to intercept model parameter representing the change in pre-intervention slope after the intervention, and is interpreted as gradual changes to the time series following the intervention. The value of the post- intervention slope is ( β time + β time_aft_i nt). By adding additional terms to this model, one can account for additional locations, or hierarchical structures such as when data on individuals within locations is available, described in detail later. Figure : Schematic of stepped wedge designs. REAL WORLD SCENARIOS In the real world, one could be analyzing phased intervention in one or many locations. The availability of data can be at the location level or smaller level such as department or individual. We consider several of these permutations in the following scenarios with description, input data set formats, SAS code, output, and interpretation. Table : Scenarios by Number of Locations and Data Availabilityy Scenario # of Data Availability Locations Location Individuals Location 3 N Locations 4 N Individuals Locations 5 N Individuals Departments Locations Segmented Regression Stepped Wedge No No SCENARIO : # OF LOCATIONS =. AVAILABILITY = LOCATION. This example can be solved using a traditional segmented regression for interrupted time series. Since there is only a single location with data available by location, a stepped wedged design is not yet necessary. A number of regression procedures can be used, and PROC GLIMMIX is one of them, shownn below. PROC GLIMMIX is appropriate for generalized linear mix models of which there is ample literature. If one is interested in adjusting for correlation of measures over time across subjects, a RANDOM statement with _RESIDUAL_, SUBJECT= and TYPE= options can be used, described in later scenarios.
3 Figure : Scenario Plot Table : Scenario Input Data time time interv. // // // // 3 // 4 // Y PROC GLIMMIX DATA= ; MODEL Y = time intervention time aft_int / SOLUTION; SCENARIO : # OF LOCATIONS =. AVAILABILITY = (INDIVIDUALS LOCATION). In some real world scenarios, one may still be studying one location, but have data from multiple individuals within the application. Segmented regression for interrupted time series is still an appropriate method. Since there is still only one location, stepped wedge designs are not yet leveraged in this scenario. We extend the previous methodology by accounting for individuals nested within locations. Too account for repeated measures one may use the PROC GLIMMIX RANDOM statement with SUBJECT=, and TYPE= options as appropriate. Figure 3: Scenario Plot Table 3: Scenario Input Data Person ID time time interv. // // // // 3 // 4 //3 36 // // Y PROC GLIMMIX DATA= ; CLASS person_id; MODEL Y = time intervention time aft_int / SOLUTION; RANDOM _RESIDUAL_ / SUBJECT=person_id TYPE=AR(); Table 4: Scenario Output Solutions for Fixed Effects Effect time intervention Estimate SE DF t Pr > t <..343 <. <. The intervention seems to have both an immediate and gradual effect. The significant coefficient for interventionn indicates an immediate upward shift to the trend at the start of the intervention, with the post-segment starting at (β +β interventio on). The significant coefficient indicates a change to the slope after the start of the intervention, with a post-slope value of (β time + β time_af ft_int). 3
4 SCENARIO 3: # OF LOCATIONS = N. AVAILABILITY = LOCATIONS. When studying more than one location in a phased implementation, one has the opportunity to use the stepped wedge framework. First, one should update the input data set with information about locations (a location identifier) and when they began the intervention respectively. Using segmentedd regression ass above, this means modifying the interventionn and time after invention () variables to reflectt location-specific time points of implementation. In this real world scenario, one is often interested in how each location perform before and after the intervention, as well as how the entire organization might perform before and after the intervention. Using generalized linear mixed models, we can estimate the location-specific impact as well as organization-wide impact throughh specifying fixed and random effects in the model. As before, we will also account for repeated measures. Figure 4: Scenario 3 Plot location N Table 5: Scenario 3 Input Data date time interv. // // // // 3 // 4 //3 36 // //3 7 3//3 8 // Y Now that the data has been simply expanded and modified to reflect t the stepped wedged design through implementation dates (time, intervention, and ) that vary by location, one can model the scenario by using mixed effects modeling below. PROC GLIMMIX DATA=work.scenario3; CLASS location; MODEL Y = time intervention time aft_int / SOLUTION; RANDOM INT intervention n time_aft int / SUBJECT=location SOLUTION; RANDOM _RESIDUAL_ / SUBJECT=location TYPE=AR(); We interpret the fixed effects of (time, intervention, and ) as the effects common to all locations, namely the organization-wide effects. Then, we allow individual locations to have unique starting points as well as immediate and gradual effects of the intervention through random intercepts, intervention and terms. We assume before the intervention, every location has the same common pre-slope (from the fixed effects). Lastly, we add a RANDOM _RESIDUAL_ statement to model the subject and autoregressive relationship within locations. 4
5 Table 6: Scenario 3 Output: Fixed and Random Effects Estimates Solutions for Fixed Effects Effect time intervention Estimate SE DF t Pr > t Effect intervention intervention intervention Solution for Random Effects Subject Estimate SE Pred DF t Pr > t Location Location Location Location Location Location Location Location Location Matching the figure, the solutions for the fixed effects have significant immediate impact of the intervention (P<.46). Visually this coincides with a positive vertical increase at the start of the intervention for each location. The amount of the increase varies and is measured by also considering the random effects, which for example in Location 3 shows a near-significant attenuation (P<.64) relative to the other locations. In similar manner, the increase in slope after the intervention () in Location is reflected in a near-significant value (P<.543) for the random effect. SCENARIO 4: # OF LOCATIONS = N. AVAILABILITY = (INDIVIDUALS LOCATION). When individuals are nested within location, the input data set and models can be scaled like before. An identifier for individual is added to the data set, and options are added to the model to account for nested relationship. Figure 5: Scenario 4 Plot 5
6 Regression and Stepped Wedge Designs, continued Table 7: Scenario 4 Input Data location Person ID date time interv. Y // 8. // // 9. N PROC GLIMMIX DATA=work.scenario4; CLASS location person_id; MODEL Y = time intervention / SOLUTION; RANDOM INT intervention / SUBJECT=location SOLUTION; RANDOM _RESIDUAL_ / SUBJECT=person_id(location) TYPE=AR(); The only modification to the code is the RANDOM statement option SUBJECT=person_id(location). The tables report the same conclusion as before noting a strong immediate effect of the intervention across all locations, and location specific differences in Location s post-slope and in Location 3 at the start of the intervention. Table 8: Scenario 4 Output: Fixed and Random Effects Estimates Solutions for Fixed Effects Effect Estimate SE DF t Pr > t time intervention Solution for Random Effects Effect Subject Estimate SE Pred DF t Pr > t Location intervention Location Location Location intervention Location Location Location intervention Location Location SCENARIO 5: # OF LOCATIONS = N. AVAILABILITY = (INDIVIDUALS DEPARTMENTS LOCATIONS). For larger organizations with mature data collection infrastructure, it may be realistic that interventions are phased across many locations, affecting individuals within departments within locations. In our example of a healthcare organization, this could mean an intervention phased across multiple clinic locations affecting doctors within departments within locations. The outcome of interest could be physician productivity, measured for each physician monthly, and one may be interested in how locations were affected before and after the intervention by individual location and also over the organization. As before, identifiers are added to the data set for individual, department, and location. 6
7 Figure 6: Scenario 5 Plot Table 9: Scenario 5 Input Data location N N department N Person ID N date time interv. // //3 36 // 4 Y The PROC GLIMMIX syntax is modifiedd to accommodate the nestedd structure and also report by department within locations. PROC GLIMMIX DATA=work.scenario5; CLASS location department person id; MODEL Y = time intervention time aft_int / SOLUTION; RANDOM INT intervention n time_aft int / SUBJECT=department(location) SOLUTION; RANDOM _RESIDUAL_ / SUBJECT=person_id(department location) TYPE=AR(); 7
8 Regression and Stepped Wedge Designs, continued Table : Scenario 5 Output: Fixed and Random Effects Estimates Solutions for Fixed Effects Effect Estimate SE DF t Pr > t <. time intervention Solution for Random Effects Effect Subject Estimate SE Pred DF t Pr > t Dept, Location intervention Dept, Location Dept, Location <. Dept, Location intervention Dept, Location Dept, Location Dept, Location intervention Dept, Location Dept, Location Dept, Location intervention Dept, Location Dept, Location Dept, Location intervention Dept, Location <. Dept, Location Dept, Location intervention Dept, Location Dept, Location The fixed effects (organization-wide trend) are supportive of an immediate increase in the outcome following the start of the intervention. Additionally, Department and from Location have noteworthy gradual increases, i.e. changes in the slope following the intervention. And Department from Location 3 has a noteworthy immediate decrease in the outcome following the start of the intervention. One should be remember that the model parameters do not estimate the post- values directly, but instead they estimate the change from pre-, so they must be interpreted together with pre- values to calculate the post- coefficient values (e.g. intercept or slope). By interrogating the fixed and random effects, one can construct organization-wide and local interpretations about the intervention impact. LIMITATIONS & FUTURE WORK Many real-world scenarios are likely to use the non-randomized phased interventions and are subject to selection biases and clustering that should be examined and accounted for by the user. The methods in this paper are described for non-randomized stepped wedge designs and the results should always be interpreted and treated with caution. Future work could expand on this application to compare common methods used to account for selection bias. Proponents of generalized estimating equations (GEE) may offer an alternative approach to generalized linear mixed models as a direct method for building population average models (marginal distribution). After initial considerations, we developed a method based on GLMMs here, with advantages for measuring subject-specific models (conditional distribution), and being able to make simultaneous interpretations of the organization-wide and location-specific effects of an intervention. Future work may consider whether and/or develop a GEE approach (with code) within the segmented regression and stepped wedge paradigms. Only the general question of whether the intervention was associated with an impact to the outcome was described here. Stepped wedge designs provide opportunities for many other types of within- or betweencomparisons described well in the literature. Depending on the intervention, a training/learning period may precede the start of the intervention. This can be modeled separately for more accurate estimates of the post-intervention period by modifying the input data set and including the new terms in the model indicating the training period. 8
9 Regression and Stepped Wedge Designs, continued CONCLUSIONS Phased interventions in the real-world can be examined using segmented regression and non-randomized stepped wedge designs together. This paper presents several real-world scenarios of measuring interventions over time according to varying amounts of data availability, with data set, code, and output examples. The user may follow the scenario most appropriate. Large organizations with a nested hierarchy and frequently captured data could use this methodology to make interpretations about the effect of a process improvement intervention while accounting for the phased nature of the intervention, nested hierarchies, and repeated measures. ACKNOWLEDGMENTS The authors thank Dr. Alice Pressman for early discussions about this methodology. RECOMMENDED READING Hussey M, Hughes J. Design and analysis of stepped wedge cluster randomized trials. Contemporary Clinical Trials. 8 (7): 8-9. Handley M, Schillinger D, Shiboski S. Quasi-Experimental Designs in Practice-based Research Settings: Design and Implementation Considerations. J Am Board Fam Med. ; 4: Gebski V, Ellingson K, Edwards J, Jernigan J, Kleinbaum D. Modelling interrupted time series to evaluate prevention and control of infection in healthcare. Epidemiol. Infect. 4 (): 3-4. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the author at: Eric C. Wong Palo Alto Medical Foundation Research Institute 795 El Camino Real Palo Alto, CA 943 wonge@pamfri.org SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 9
How to analyze correlated and longitudinal data?
How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines
More informationTHE UNIVERSITY OF OKLAHOMA HEALTH SCIENCES CENTER GRADUATE COLLEGE A COMPARISON OF STATISTICAL ANALYSIS MODELING APPROACHES FOR STEPPED-
THE UNIVERSITY OF OKLAHOMA HEALTH SCIENCES CENTER GRADUATE COLLEGE A COMPARISON OF STATISTICAL ANALYSIS MODELING APPROACHES FOR STEPPED- WEDGE CLUSTER RANDOMIZED TRIALS THAT INCLUDE MULTILEVEL CLUSTERING,
More informationGeneralized Estimating Equations for Depression Dose Regimes
Generalized Estimating Equations for Depression Dose Regimes Karen Walker, Walker Consulting LLC, Menifee CA Generalized Estimating Equations on the average produce consistent estimates of the regression
More information1.4 - Linear Regression and MS Excel
1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear
More informationA Comparison of Linear Mixed Models to Generalized Linear Mixed Models: A Look at the Benefits of Physical Rehabilitation in Cardiopulmonary Patients
Paper PH400 A Comparison of Linear Mixed Models to Generalized Linear Mixed Models: A Look at the Benefits of Physical Rehabilitation in Cardiopulmonary Patients Jennifer Ferrell, University of Louisville,
More informationData Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine
Data Analysis in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Multilevel Data Statistical analyses that fail to recognize
More informationMethodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA
PharmaSUG 2014 - Paper SP08 Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA ABSTRACT Randomized clinical trials serve as the
More informationTreatment Adaptive Biased Coin Randomization: Generating Randomization Sequences in SAS
Adaptive Biased Coin Randomization: OBJECTIVES use SAS code to generate randomization s based on the adaptive biased coin design (ABCD) must have approximate balance in treatment groups can be used to
More informationLinear Regression in SAS
1 Suppose we wish to examine factors that predict patient s hemoglobin levels. Simulated data for six patients is used throughout this tutorial. data hgb_data; input id age race $ bmi hgb; cards; 21 25
More informationMultiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple
More informationParameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX
Paper 1766-2014 Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX ABSTRACT Chunhua Cao, Yan Wang, Yi-Hsin Chen, Isaac Y. Li University
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationBaseline Mean Centering for Analysis of Covariance (ANCOVA) Method of Randomized Controlled Trial Data Analysis
MWSUG 2018 - Paper HS-088 Baseline Mean Centering for Analysis of Covariance (ANCOVA) Method of Randomized Controlled Trial Data Analysis Jennifer Scodes, New York State Psychiatric Institute, New York,
More informationPropensity Score Methods for Causal Inference with the PSMATCH Procedure
Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly
More informationIntroduction to Multilevel Models for Longitudinal and Repeated Measures Data
Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this
More information6. Unusual and Influential Data
Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the
More informationCLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS
- CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS SECOND EDITION Raymond H. Myers Virginia Polytechnic Institute and State university 1 ~l~~l~l~~~~~~~l!~ ~~~~~l~/ll~~ Donated by Duxbury o Thomson Learning,,
More informationLIHS Mini Master Class Multilevel Modelling
LIHS Mini Master Class Multilevel Modelling Robert M West 9 November 2016 Robert West c University of Leeds 2016. This work is made available for reuse under the terms of the Creative Commons Attribution
More information1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA.
LDA lab Feb, 6 th, 2002 1 1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA. 2. Scientific question: estimate the average
More informationCHAPTER 3 RESEARCH METHODOLOGY
CHAPTER 3 RESEARCH METHODOLOGY 3.1 Introduction 3.1 Methodology 3.1.1 Research Design 3.1. Research Framework Design 3.1.3 Research Instrument 3.1.4 Validity of Questionnaire 3.1.5 Statistical Measurement
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationWDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?
WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters
More informationBiostatistics II
Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,
More informationChapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy
Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Jee-Seon Kim and Peter M. Steiner Abstract Despite their appeal, randomized experiments cannot
More information2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%
Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of
More informationLecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009
Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences p. 1/3 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences Lecture 14: Adjusting
More informationReveal Relationships in Categorical Data
SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction
More informationREPEATED MEASURES DESIGNS
Repeated Measures Designs The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Markus Brauer (University of Wisconsin-Madison) Target word count: 1000 - Actual word count: 1071 REPEATED
More informationIntroduction to Multilevel Models for Longitudinal and Repeated Measures Data
Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this
More informationChapter 3 CORRELATION AND REGRESSION
CORRELATION AND REGRESSION TOPIC SLIDE Linear Regression Defined 2 Regression Equation 3 The Slope or b 4 The Y-Intercept or a 5 What Value of the Y-Variable Should be Predicted When r = 0? 7 The Regression
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!
More informationCertificate Program in Practice-Based. Research Methods. PBRN Methods: Clustered Designs. Session 8 - January 26, 2017
Certificate Program in Practice-Based L. Miriam Dickinson, PhD Professor, University of Colorado School of Medicine Department of Family Medicine Research Methods PBRN Methods: Clustered Designs Session
More informationRegression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions
MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals
More informationKnowledge is Power: The Basics of SAS Proc Power
ABSTRACT Knowledge is Power: The Basics of SAS Proc Power Elaina Gates, California Polytechnic State University, San Luis Obispo There are many statistics applications where it is important to understand
More informationConditional Distributions and the Bivariate Normal Distribution. James H. Steiger
Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal
More informationMULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES
24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter
More informationThe Use of Piecewise Growth Models in Evaluations of Interventions. CSE Technical Report 477
The Use of Piecewise Growth Models in Evaluations of Interventions CSE Technical Report 477 Michael Seltzer CRESST/University of California, Los Angeles Martin Svartberg Norwegian University of Science
More informationSTATISTICS & PROBABILITY
STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness
More informationExample 7.2. Autocorrelation. Pilar González and Susan Orbe. Dpt. Applied Economics III (Econometrics and Statistics)
Example 7.2 Autocorrelation Pilar González and Susan Orbe Dpt. Applied Economics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Example 7.2. Autocorrelation 1 / 17 Questions.
More informationAnalytic Strategies for the OAI Data
Analytic Strategies for the OAI Data Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF ACR October 2008 Outline 1. Introduction and examples. 2. General analysis
More informationExamining Relationships Least-squares regression. Sections 2.3
Examining Relationships Least-squares regression Sections 2.3 The regression line A regression line describes a one-way linear relationship between variables. An explanatory variable, x, explains variability
More informationLab 8: Multiple Linear Regression
Lab 8: Multiple Linear Regression 1 Grading the Professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these
More informationHPS301 Exam Notes- Contents
HPS301 Exam Notes- Contents Week 1 Research Design: What characterises different approaches 1 Experimental Design 1 Key Features 1 Criteria for establishing causality 2 Validity Internal Validity 2 Threats
More informationImpact of guideline dissemination strategies among Network chiropractors: Interrupted time series with segmented regression analysis
Impact of guideline dissemination strategies among Network chiropractors: Interrupted time series with segmented regression analysis André Bussières DC, FCCS (C), MSc, PhD Anne Sales RN, PhD Timothy Ramsay,
More informationMultiple Linear Regression Analysis
Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started
More informationSimple Linear Regression the model, estimation and testing
Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.
More informationMeta-analysis using HLM 1. Running head: META-ANALYSIS FOR SINGLE-CASE INTERVENTION DESIGNS
Meta-analysis using HLM 1 Running head: META-ANALYSIS FOR SINGLE-CASE INTERVENTION DESIGNS Comparing Two Meta-Analysis Approaches for Single Subject Design: Hierarchical Linear Model Perspective Rafa Kasim
More informationBayes Linear Statistics. Theory and Methods
Bayes Linear Statistics Theory and Methods Michael Goldstein and David Wooff Durham University, UK BICENTENNI AL BICENTENNIAL Contents r Preface xvii 1 The Bayes linear approach 1 1.1 Combining beliefs
More informationClass 7 Everything is Related
Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational
More informationAn Introduction to Modern Econometrics Using Stata
An Introduction to Modern Econometrics Using Stata CHRISTOPHER F. BAUM Department of Economics Boston College A Stata Press Publication StataCorp LP College Station, Texas Contents Illustrations Preface
More informationSimple Linear Regression
Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis
More informationCurrent Directions in Mediation Analysis David P. MacKinnon 1 and Amanda J. Fairchild 2
CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE Current Directions in Mediation Analysis David P. MacKinnon 1 and Amanda J. Fairchild 2 1 Arizona State University and 2 University of South Carolina ABSTRACT
More informationMODELING HIERARCHICAL STRUCTURES HIERARCHICAL LINEAR MODELING USING MPLUS
MODELING HIERARCHICAL STRUCTURES HIERARCHICAL LINEAR MODELING USING MPLUS M. Jelonek Institute of Sociology, Jagiellonian University Grodzka 52, 31-044 Kraków, Poland e-mail: magjelonek@wp.pl The aim of
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationQuestion 1(25= )
MSG500 Final 20-0-2 Examiner: Rebecka Jörnsten, 060-49949 Remember: To pass this course you also have to hand in a final project to the examiner. Open book, open notes but no calculators or computers allowed.
More informationEcological Statistics
A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents
More informationStatistics and Probability
Statistics and a single count or measurement variable. S.ID.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). S.ID.2: Use statistics appropriate to the shape
More informationShrimp adjust their sex ratio to fluctuating age distributions
Evolutionary Ecology Research, 2002, 4: 239 246 Shrimp adjust their sex ratio to fluctuating age distributions Eric L. Charnov 1,2 and Robert W. Hannah 3 1 Department of Biology, The University of New
More informationSCATTER PLOTS AND TREND LINES
1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as
More informationGPA vs. Hours of Sleep: A Simple Linear Regression Jacob Ushkurnis 12/16/2016
GPA vs. Hours of Sleep: A Simple Linear Regression Jacob Ushkurnis 12/16/2016 Introduction As a college student, life can sometimes get extremely busy and stressful when there is a lot of work to do. More
More informationinvestigate. educate. inform.
investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced
More informationAnalyzing diastolic and systolic blood pressure individually or jointly?
Analyzing diastolic and systolic blood pressure individually or jointly? Chenglin Ye a, Gary Foster a, Lisa Dolovich b, Lehana Thabane a,c a. Department of Clinical Epidemiology and Biostatistics, McMaster
More informationSINGLE-CASE RESEARCH. Relevant History. Relevant History 1/9/2018
SINGLE-CASE RESEARCH And Small N Designs Relevant History In last half of nineteenth century, researchers more often looked at individual behavior (idiographic approach) Founders of psychological research
More informationTransitions in Depressive Symptoms After 10 Years of Follow-up Using PROC LTA
PharmaSUG 2015 Paper QT25 Transitions in Depressive Symptoms After 10 Years of Follow-up Using PROC LTA Seungyoung Hwang, Johns Hopkins University Bloomberg School of Public Health ABSTRACT PROC LTA is
More informationIntroduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018
Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this
More informationStepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality
Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,
More information(a) y = 1.0x + 0.0; r = ; N = 60 (b) y = 1.0x + 0.0; r = ; N = Lot 1, Li-heparin whole blood, HbA1c (%)
cobas b system - performance evaluation Study report from a multicenter evaluation of the new cobas b system for the measurement of HbAc and lipid panel Introduction The new cobas b system provides a point-of-care
More informationCarrying out an Empirical Project
Carrying out an Empirical Project Empirical Analysis & Style Hint Special program: Pre-training 1 Carrying out an Empirical Project 1. Posing a Question 2. Literature Review 3. Data Collection 4. Econometric
More informationAnalyzing Healthcare Costs with SAS: An Intern s Experience Ben Keefer, The Regence Group, Portland, OR
Analyzing Healthcare Costs with SAS: An Intern s Experience Ben Keefer, The Regence Group, Portland, OR Abstract The goal of this analysis was to measure the effect of Regence s wellness program on healthcare
More informationSupplementary Figure 1. Recording sites.
Supplementary Figure 1 Recording sites. (a, b) Schematic of recording locations for mice used in the variable-reward task (a, n = 5) and the variable-expectation task (b, n = 5). RN, red nucleus. SNc,
More informationThere are, in total, four free parameters. The learning rate a controls how sharply the model
Supplemental esults The full model equations are: Initialization: V i (0) = 1 (for all actions i) c i (0) = 0 (for all actions i) earning: V i (t) = V i (t - 1) + a * (r(t) - V i (t 1)) ((for chosen action
More informationImpact of Response Variability on Pareto Front Optimization
Impact of Response Variability on Pareto Front Optimization Jessica L. Chapman, 1 Lu Lu 2 and Christine M. Anderson-Cook 3 1 Department of Mathematics, Computer Science, and Statistics, St. Lawrence University,
More informationEdinburgh Research Explorer
Edinburgh Research Explorer Effect of time period of data used in international dairy sire evaluations Citation for published version: Weigel, KA & Banos, G 1997, 'Effect of time period of data used in
More informationSUPPLEMENTAL MATERIAL
1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across
More information8/24/2011. Study Goal. Study Design. Patient Attributes Influencing Pain and Pain Management in Postoperative Total Knee Arthroplasty Patients
Patient Attributes Influencing Pain and Pain Management in Postoperative Total Knee Arthroplasty Patients Concurrent Session 1F Deborah L. Gentile, PhD, RN-BC Research Scientist Aurora Health Care Milwaukee,
More informationNORTH SOUTH UNIVERSITY TUTORIAL 2
NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate
More informationEvaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)
Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank) Attribution The extent to which the observed change in outcome is the result of the intervention, having allowed
More informationA framework for evaluating public health interventions for obesity prevention. An IOM committee report
A framework for evaluating public health interventions for obesity prevention An IOM committee report Harold C. Sox, M.D., MACP The Dartmouth Institute Dartmouth Medical School Bridging the Evidence Gap
More informationTutorial #7A: Latent Class Growth Model (# seizures)
Tutorial #7A: Latent Class Growth Model (# seizures) 2.50 Class 3: Unstable (N = 6) Cluster modal 1 2 3 Mean proportional change from baseline 2.00 1.50 1.00 Class 1: No change (N = 36) 0.50 Class 2: Improved
More informationPropensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018
Propensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018 Institute Institute for Clinical for Clinical Evaluative Evaluative Sciences Sciences Overview 1.
More informationAP Stats Chap 27 Inferences for Regression
AP Stats Chap 27 Inferences for Regression Finally, we re interested in examining how slopes of regression lines vary from sample to sample. Each sample will have it s own slope, b 1. These are all estimates
More informationSupplementary Materials. Instructions for Target Subjects (Taken from, and kindly shared by, Haselton & Gildersleeve, 2011).
Supplementary Materials Instructions for Target Subjects (Taken from, and kindly shared by, Haselton & Gildersleeve, 2011). Participant ID: Scent Samples Task Instructions Next Lab Session - Date: Time:
More informationBangor University Laboratory Exercise 1, June 2008
Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground
More informationHEMOCHRON. Whole Blood Coagulation Systems
HEMOCHRON Whole Blood Coagulation Systems Citrated Activated Partial Thromboplastin Time (APTT) Cuvette Correlation Protocol for HEMOCHRON Microcoagulation Instruments MSIG:131 10/06 Dear Medical Professional:
More information1 Version SP.A Investigate patterns of association in bivariate data
Claim 1: Concepts and Procedures Students can explain and apply mathematical concepts and carry out mathematical procedures with precision and fluency. Content Domain: Statistics and Probability Target
More informationA model of parallel time estimation
A model of parallel time estimation Hedderik van Rijn 1 and Niels Taatgen 1,2 1 Department of Artificial Intelligence, University of Groningen Grote Kruisstraat 2/1, 9712 TS Groningen 2 Department of Psychology,
More informationProblem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name
Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda DUE: June 6, 2006 Name 1) Earnings functions, whereby the log of earnings is regressed on years of education, years of on-the-job training, and
More information2012, Greenwood, L.
Critical Review: How Accurate are Voice Accumulators for Measuring Vocal Behaviour? Lauren Greenwood M.Cl.Sc. (SLP) Candidate University of Western Ontario: School of Communication Sciences and Disorders
More informationQuasi-Experimental and Single Case Experimental Designs. Experimental Designs vs. Quasi-Experimental Designs
1 Quasi-Experimental and Single Case Experimental Designs RCS 6740 6/30/04 Experimental Designs vs. Quasi-Experimental Designs Experimental Designs include: Random Assignment of Participants to groups
More informationReport Reference Guide
Report Reference Guide How to use this guide Each type of CareLink report and its components are described in the following sections. Report data used to generate the sample reports was from sample patient
More informationBias Adjustment: Local Control Analysis of Radon and Ozone
Bias Adjustment: Local Control Analysis of Radon and Ozone S. Stanley Young Robert Obenchain Goran Krstic NCSU 19Oct2016 Abstract Bias Adjustment: Local control analysis of Radon and ozone S. Stanley Young,
More informationMethod Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects
22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Method Comparison for Interrater Reliability of an Image Processing Technique
More informationSampling Weights, Model Misspecification and Informative Sampling: A Simulation Study
Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Marianne (Marnie) Bertolet Department of Statistics Carnegie Mellon University Abstract Linear mixed-effects (LME)
More informationLongitudinal and Hierarchical Analytic Strategies for OAI Data
Longitudinal and Hierarchical Analytic Strategies for OAI Data Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF OARSI Montreal September 10, 2009 Outline 1.
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of
More informationCHAPTER TWO REGRESSION
CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect
More information3.2 Least- Squares Regression
3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these
More informationLOGLINK Example #1. SUDAAN Statements and Results Illustrated. Input Data Set(s): EPIL.SAS7bdat ( Thall and Vail (1990)) Example.
LOGLINK Example #1 SUDAAN Statements and Results Illustrated Log-linear regression modeling MODEL TEST SUBPOPN EFFECTS Input Data Set(s): EPIL.SAS7bdat ( Thall and Vail (1990)) Example Use the Epileptic
More informationMidterm Exam ANSWERS Categorical Data Analysis, CHL5407H
Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography
More informationBlood Glucose Monitoring System. Copyright 2016 Ascensia Diabetes Care Holdings AG diabetes.ascensia.com
Viewing test results in My Readings The CONTOUR DIABETES app captures all your blood glucose readings to create personalized patterns and trends, so you can see how your daily activities impact your results.
More information