COMPARING PERSONAL TRAJECTORIES

Size: px
Start display at page:

Download "COMPARING PERSONAL TRAJECTORIES"

Transcription

1 Annu. Rev. Psychol : Copyright c 2001 by Annual Reviews. All rights reserved COMPARING PERSONAL TRAJECTORIES AND DRAWING CAUSAL INFERENCES FROM LONGITUDINAL DATA Stephen W. Raudenbush School of Education and Institute for Social Research, University of Michigan, 610 East University Avenue, Ann Arbor, Michigan 48109; rauden@umich.edu Abstract This review considers statistical analysis of data from studies that obtain repeated measures on each of many participants. Such studies aim to describe the average change in populations and to illuminate individual differences in trajectories of change. A person-specific model for the trajectory of each participant is viewed as the foundation of any analysis having these aims. A second, between-person model describes how persons very in their trajectories. This two-stage modeling framework is common to a variety of popular analytic approaches variously labeled hierarchical models, multilevel models, latent growth models, and random coefficient models. Selected published examples reveal how the approach can be flexibly adapted to represent development in domains as diverse as vocabulary growth in early childhood, academic learning, and antisocial propensity during adolescence. The review then considers the problem of drawing causal inferences from repeated measures data. CONTENTS INTRODUCTION Trajectories of Change Describing and Comparing Growth and Change Curves Unpacking the Terminology A Framework for Analytic Choices Normal Data and Normal Trajectories Expanding the Class of Level-1 Models Expanding the Class of Level-2 Models Causal Inference Causal Inference in Longitudinal Studies: Between-Subject Causes Time-Varying Treatments Summary and Conclusions INTRODUCTION Longitudinal studies serve many purposes in psychology. They trace the course of normal growth, identify risk factors for mental illness, and assess the effects /01/ $

2 502 RAUDENBUSH of interventions. They identify the timing of onset of new abilities, the duration of disorders, and desistance from self-destructive behavior. What these purposes share in common is a focus on individual change and thus a need for repeated assessment of each participant. The time series for a person creates a basis for sketching that person s history. Whereas a person s history can capture many domains of change for example, changes in cognitive skill, emotional self-regulation, mood, and social behavior a trajectory describes a person s development in one welldefined domain. In this article, I view longitudinal analysis as beginning with the development of a model for individual trajectory. Only if the trajectory is clearly defined does it become possible quantitatively to summarize evidence across trajectories. The logical foundation for all longitudinal analysis is thus a statistical model defining parameters of change for the trajectory of a single participant. The task of comparing people then becomes the task of comparing the parameters of these personal trajectories. A model is thus needed for the population distribution of the parameters of personal change. Within this framework, I review current methodological developments that have dramatically expanded the analytic choices available to psychologists. The explosive growth of new methodology enables astonishing flexibility for studying change and comparing trajectories of change in the presence of missing data and varied timing and spacing of time points. This growth in new methods can also cause seemingly terminal confusion. Part of the confusion emanates from a proliferation of jargon often associated with new software packages. So part of my task is to clarify the new vocabulary and connect it to key logical issues in the analysis. A brief article cannot comprehensively review a topic as vast as methods of longitudinal analysis. Therefore, the first section of this article, while outlining a variety of conceptions of trajectory, selects a subset of these for more intensive discussion. By choosing to focus on just a part of the terrain, it becomes possible to portray quite vividly what is meant quantitatively by a trajectory and how such trajectories can be compared. Researchers have often criticized cross-sectional research designs as inadequate for identifying causal effects and have advocated longitudinal data collection as a solution. The second half of this article therefore considers the assumptions required to justify causal inferences in longitudinal studies. Trajectories of Change Longitudinal studies use quite varied conceptions of a trajectory of change. Let us briefly sketch three of these before moving to a fourth conception that provides the basis for the remainder of the paper. Models for Time as an Outcome Many important research questions concern the timing of an event, for example, the age of onset of menarche, time until recovery from depression, and the time until a student receives a degree. The probability that the event of interest will occur at time t, given that it has not already occurred,

3 LONGITUDINAL ANALYSIS 503 is called the hazard of the event. The hazard changes over time. A participant s changing hazard is then that person s trajectory. The trajectory may depend on time-invariant covariates, including gender, social background, and personality characteristics, and also on time-varying covariates, that is, events and experiences that unfold over time. The key methodological challenge arises from the fact that the study will typically end before some members of the sample experience the event. Of those not experiencing the event, many or all will ultimately experience it after the termination of the study. For these people, the time of the event is censored, that is, unobservable. Survival analysis involves an important class of longitudinal models that are tailored to deal with duration time as an outcome when outcomes for some participants are censored. This important class of models, reviewed, for example, by Cox & Oakes (1984), Kalbfleisch & Prentice (1980), and Singer & Willett (1991) will not be considered further in this article. Transitions Between States It is often of interest to develop theories that predict transitions between states, for example, whether a recovering alcoholic will be actively drinking at any time t (yes or no) given that this participant was (or was not) drinking at time t 1. In this kind of study, a person s trajectory is the sequence of such transitions. The probability of a given state (e.g. drinking at time t) given the prior state (e.g. not drinking at time t 1) is called a transition probability. A model for a person s trajectory begins with a theory about what determines a sequence of transition probabilities for a given person. Covariates describing personal background and experience may determine these probabilities and thus strongly influence a person s trajectory. The current article will not consider these models in further detail. We refer the interested reader to Diggle et al (1994, Chapter 10), Collins (2001), and Collins & Wugalter (1992). Sequences of Age-Appropriate Markers Developmental psychologists are interested in whether early impulsivity predicts conduct disorder in elementary school and juvenile crime later (Moffitt 1993). Status attainment researchers consider how parental social status predicts a person s educational attainment and how these, together, predict that person s adult occupation and income (Duncan et al 1972, Sewell & Hauser 1975). In these cases, the focus is on associations among variables, each of which marks a person s status at a given stage of life. A trajectory in such studies is the sequence of values on the variables of interest. Common methods of analysis include structural equation models or path models, as introduced by Blau & Duncan (1967), Blalock (1969), and Duncan (1966) and refined considerably since (cf Joreskog & Sorbom 1979). Although longitudinal data are essential to studying these important relationships, the analytic methods are not different from multivariate methods commonly used in cross-sectional research, so we shall not consider them further. Smooth Curves to Describe Trajectories of Growth or Change Perhaps the most common analytic goal in current longitudinal psychological research is to

4 504 RAUDENBUSH describe and compare growth curves or other smooth functions that describe a person s changing status. For this purpose, analysts have recommended approaches with the labels covariance component models, hierarchical linear models, latent curve analysis, mixed models, multilevel models, random-coefficient models, and random-effect models. Much of the remainder of this article aims to clarify the logic of these models while giving some meaning to these labels. Researchers interested in these approaches generally collect repeated measures of a given outcome variable for each participant with the aim of describing how people grow or change over some fairly broad interval of time. Other Studies of Repeated Measures This review is confined to studies performed over some significant segment of the life course. Such a focus omits other important types of repeated-measure studies. Studies of repeated measures over the short term, for example, can assess sequences of social interaction in detail, model reaction times in a series of cognitive tasks, or predict biophysical changes in response to a series of stimuli. We refer the interested reader to Gottman (1995, Part II), for reviews of methods for such data. Describing and Comparing Growth and Change Curves We begin with a simple and easily understandable example and then extend this logic to more complex cases. Defining the Trajectory for Person i Suppose that each person s status with respect to cognitive outcome, Y, grows with age at a constant rate and that these rates vary randomly over a population of persons. More specifically, we have an outcome Y ti for person i at time t, with T i time points (t = 1,...,T i ) observed for person i. We can formulate a simple linear model such as the following for individual change: Y ti = π 0i + π 1i a ti + e ti, e ti N(0,σ 2 ) (1) where a ti is the age of person i at time t. Here person i s trajectory is the straight line π 0i + π 1i a it. The parameters of individual i s trajectory are two: π 0i, the status of that person at a ti = 0, and π 1i, that person s linear rate of change per unit increase in age. The discrepancy between the actual score Y ti and the trajectory is e ti, assumed for simplicity here to be distributed normally with a mean of zero and constant variance, σ 2. The simple model of Equation 1 has found varied application in psychology. Francis et al (1991) defined π 1i as the rate of recovery of patient i s cognitive functioning after suffering a head injury. For Raudenbush and Chan (1993), π 1i was adolescent i s rate of increase in antisocial thinking. For Seltzer et al 1994, π 1i was the rate of growth of i s reading comprehension. Having defined the parameters of personal change, the next stage of the model expresses a theory about individual

5 LONGITUDINAL ANALYSIS 505 differences: differences, for example, in the rate of recovery from head injury, in the rate of increase of antisocial thinking, or the rate of growth in reading. Describing the Population Distribution of Personal Trajectories Next we consider how the personal trajectories of change are distributed in the population. A very simple model states that person-specific parameters (π 0i, π 1i ) vary around their grand means (β 00, β 01 ), with variance (τ 00, τ 11 ) and covariance (τ 01 ), according to a bivariate normal distribution: π 0i = β 00 + u 0i Unpacking the Terminology π 1i = β 10 + u 1i (2) ( ) [( ( )] u0i 0 τ00 τ with N, 01. u 1i 0) τ 10 τ 11 The model described by Equations 1 and 2 is a hierarchical model because it specifies a model for Y ti given first-level parameters (π 0i, π 1i ), while these parameters, in turn, depend on second-level parameters (the β s and τs). Levels could be added; the second-level parameters could depend on third-level parameters, and so on. Thus, it is the hierarchical dependence among the parameters that is decisive in making the model hierarchical, not necessarily the hierarchical structure of the data, although the two often go together. This idea is central to the current article. All of the models considered in this paper are hierarchical. The essence of the decision framework I propose is to ensure that specification of the model at each level fits the research question, design, and data. The model described by Equations 1 and 2 is a hierarchical linear model (Bryk & Raudenbush 1992) because, at the first stage, Y is a linear function of π 0i, π 1i, whereas at the second stage, each π is a linear function of β 00,β 10. Note that all polynomial models at the first stage are linear. The model is multilevel (Goldstein 1995) because it describes data that vary at two levels: within persons and between persons. The model is a random-coefficients model (Longford 1993) because the level-1 model defines coefficients π 0i, π 1i, that vary randomly over participants at level 2. It is a latent-curve model (Meredith & Tisak 1990) because the trajectory or curve π 0i + π 1i a it is unobservable, depending, that is, on unobserved latent variables π 0i and π 1i. It may also be a latent-growth model, although the use of the term growth implies a monotonic increasing trajectory which may or may not be the case. The model is also a mixed model (Diggle et al 1994). To see this, substitute Equation 1 into Equation 2, yielding the combined model Y ti = β 00 + β 01 a ti + ε ti (3)

6 506 RAUDENBUSH where ε ti = u 0i + u 1i a ti + e ti. (4) Thus, the model has fixed effects (β 00, β 01 ) and random effects (u 0i, u 1i ) as well as the elemental residual e ti. Models that incorporate both fixed and random effects have historically been labeled mixed models. This is a random-effects model (Laird & Ware 1982) because individual differences are characterized by random effects u 0i, u 1i. It is a covariance components model because it allows the random effects to covary. The model is also a structural-equation model (SEM) (Joreskog & Sorbom 1979). The first-stage model can be viewed as a measurement model with observed variable Y, latent variables π, and factor loadings 1 (for π 0i ) and a ti, (for π 1i ). The second-level model is an exceedingly simple structural equation model for the πs. A Framework for Analytic Choices The essence of the framework I propose is as follows. 1. In light of the developmental theory at hand, the design, and the data, choose a model for individual development over time. The model has a structural part that specifies a trajectory in terms of person-specific parameters (the πs). It may be a linear trajectory (including polynomials) as in Equation 1 or nonlinear. The model also includes a probabilistic part that describes the random behavior of Y around the person-specific trajectory. The probability model may be normal, as in Equation 1, or it may not. 2. In light of key research questions and data, define a model for the distribution of trajectories in the population. It will also have a structural part, which describes the expected trajectory given the measured characteristics of persons, and a probabilistic part, which describes the random behavior of the trajectories in the population. 3. From this point, we continue to higher levels as necessary. For example, the model might have a three-level structure with level 1 describing individual change, level 2 describing interindividual variation within clusters such as schools (Bryk & Raudenbush 1992, Chapter 8), and level 3 describing variation between clusters. 4. Assess alternative estimation methods, algorithms, and (hopefully) available software for making inferences about all unknowns. Unfortunately, constraints in estimation theory, algorithms, and software often distort model choices. In particular, the convenience of linear models and normal distribution theory at each level, contrasted with the relative difficulty of constructing efficient computational algorithms for nonlinear models and non-normal assumptions, has encouraged an unhealthy reliance on linearity and normality. Below we consider two classes of models. In the first, both the data Y (and the πs) are normal with a linear structure. In the second class, either Y (given the

7 LONGITUDINAL ANALYSIS 507 πs) or the πs or both are non-normal (and possibly nonlinear). The first class of models is most familiar to psychology, and, although it has many useful applications, it is often chosen more out of convenience than conviction. The second class is enormously broad and offers vast untapped potential for the modeling of developmental phenomena. Normal Data and Normal Trajectories We consider the examples in which assumptions of normality and linearity are benign in this section, while urging analysts to consider the broader range of models described in the subsequent section. Vocabulary Growth During the Second Year of Life Following the tradition of Bryk & Weisberg (1977) and Rogosa et al (1982), this article asserts that the appropriate foundation for longitudinal analysis is a model for the individual trajectory. To illustrate, we review the study by Huttenlocher et al (1991) of expressive vocabulary during the second year of life. These researchers visited the home of each child in their study every 2 months, each time counting the number of unique words uttered by the child. Figure 1 displays the scatter plot of vocabulary (vertical Figure 1 A sample of individual vocabulary growth trajectories. The symbols,, and represent the actual observations. The smooth curves result from fitting a separate quadratic polynomial to each child s vocabulary data.

8 508 RAUDENBUSH axis) as a function of age in months (horizontal axis) for three randomly selected children. An accelerating curve nicely summarizes the growth pattern for each child. Such a quadratic curve can be described by the level-1 model for the growth of individual i: Y ti = π 0i + π 1i a it + π 2i a 2 it + e ti, (5) where age a ti is expressed as a deviation from some meaningful constant. For example, if a ti = age 18, months for child i at time t, π 0i is the size of child i s vocabulary at 18 months, π 1i is the rate of growth of child i at that same age, and π 2i captures that child s rate of acceleration. In Huttenlocher s study, π 2i turned out to be the key growth parameter because acceleration was the single feature of growth that strongly differentiated the growth curves. By using all of the data to strengthen inference about correlates of acceleration, the researchers were able to discover reliable associations between key predictors (e.g. gender and maternal speech) and vocabulary growth. Growth of Reading and Math Achievement During the Elementary Years A key feature of Huttenlocher s study was the construction of an outcome variable (vocabulary) measured on the same scale at every age of interest. Without such an invariant outcome metric, discussion of quantitative change or growth is meaningless. Standardized tests often fail to provide such a metric; different forms of a test are constructed with age-appropriate items for different age groups, and no effort is made to equate the forms. However, by calibrating the items across alternate forms, it is possible to construct a common measure for studies of cognitive growth. The Sustaining Effects Study (Carter 1984) used such a measure to characterize growth in early-elementary reading and math. Two features of individual growth were apparent. First, academic-year growth was essentially linear. Second, growth rates during the summer were much smaller than growth rates during the academic year. Figure 2 plots the average trajectory for reading and math during the elementary years. Notice that assessments twice annually (fall and spring) enable one to distinguish academic-year and summer growth. On average, there is no growth during the summer in math, but the academic-year growth rate is substantially positive. For reading, the summer rate is modestly positive, with faster academic-year growth. To capture key features of growth, a sensible model for the individual trajectory for either math or reading is Y ti + π 0i + π 1i (grade ti 1) + π 2i ( fall ti ) + e ti (6) where grade ti 1 takes a value of 0 for first grade, 1 for second grade, etc, and fall is an indicator for the fall time point ( fall = 1 if the first occurred in the fall, 0 if spring). Under this specification, π 1i is the calendar-year growth rate, and π 1i + π 2i is the summer growth rate. Bryk & Raudenbush (1992, Chapter 8) were able to discern the following features of early elementary growth by using such an approach: (a) much greater academic-year than summer growth in math

9 LONGITUDINAL ANALYSIS 509 Figure 2 Expected growth functions in math and reading based on three-level model analysis of data from the sustaining effects study. and reading on average; (b) modest summer growth in reading with no summer growth on average in math; (c) subject matter differences in school effects such that most of the variation in academic-year rates in math was between schools while most of the variation in academic-year growth in reading lay between children within schools; (d) a stronger dependence of growth rates on social background for reading than for math. Summary The two examples illustrate the centrality of the model of the individual trajectory for studying longitudinal data on growth or change. For the first example, representing the accelerating nature of vocabulary growth during the second year of life was essential not only in describing individual growth and mean growth but also in discovering correlates of growth. In the second example, distinguishing between the summer growth rate and the academic-year growth rate for a

10 510 RAUDENBUSH given child was essential in illuminating school effects and clarifying differences between the growth processes for reading and math. Expanding the Class of Level-1 Models So far we have considered only linear models to characterize a person s trajectory in the level-1 model. The level-1 model also assumes a normally distributed random error e ti. The generalized linear model of McCullagh & Nelder (1989) provides a far more general class of level-1 models. Consider Horney et al s (1995) longitudinal study of high-rate offenders. Their interest focused on how changing life circumstances such as getting a job or moving in with a spouse are related to the propensity to commit crime. They therefore conceived each participant s propensity as having a trajectory over time that could be deflected by such changes in life circumstances. However, the data, collected by means of a life history calendar over 36 months, involved binary Y ti ; that is, Y ti = 1 if person i committed a crime during time t, t = 1,...,36. The binary character of the data strongly suggests that a Bernoulli probability model E(Y ti µ ti ) = Prob(Y ti = 1 µ ti ) = µ ti (7) Var (Y ti µ ti ) = µ ti (1 µ ti ). Note that, unlike the normal model, the conditional variance at level 1 is intrinsically heteroscedastic, depending on µ ti and therefore varying over i at time t. A linear structural model for µ ti would make little sense in this case. A linear model would be inconsistent with the bounds (0 <µ ti < 1) because a linear model could easily generate predicted values less than zero or greater than one. Thus, any effect sizes associated with such a model would be suspect. A more natural model and that used by Horney et al is the logit-linear structural model ( ) µ ti P η ti = log = π 0i + π pti a pti. (8) 1 µ ti Here a pti, p = 1,...,P, are time-varying predictors, measured aspects of life circumstances; and µ pi can be thought of as a deflection in the propensity to commit crime associated with a change in a pti. At level 2, the πs can be viewed as having the same kind of model as before (Equation 2), that is, a linear structural model and a multivariate normal-probability model. In sum, the two-level model of Horney et al (1995) is similar conceptually to Equations 1 and 2, conceiving each participant to have a trajectory of development characterized by certain parameters π that, in turn, vary over a population of persons. However, the nature of the data calls for a nonlinear structural model at level 1 and a non-normal sampling model. In the language of the generalized linear model (McCullagh & Nelder 1989), the logit transformation η ti of the mean µ ti is called the link function for the binomial mean. Other standard link functions are the log-link for count data and the identity p=1

11 LONGITUDINAL ANALYSIS 511 Figure 3 Probability of theft during adolescence. link for normal data. Typically, the link function is set equal to a linear model as in Equation 8; that is, η ti is a linear function of the πs. Diggle et al (1994) review applications of hierarchical generalized linear models. However, it will not always be the case that a linear model for the link function captures the interesting features of development. Consider now the age-crime curve, described by Gottfredson & Hirschi (1990) as one of the brute facts of criminology. Researchers have found that, for many societies and many subgroups, the probability of committing a serious crime tends to be very small during preadolescence. However, this probability tends to increase at an accelerating rate early in adolescence, typically reaching a peak around ages and then diminishing rapidly during late adolescence and early adulthood. This curve thus takes on a bell shape as a function of age, as displayed in Figure 3, based on an analysis of data from all seven cohorts of the National Youth Survey (Raudenbush 2001). These curves describe the fitted values based on a two-level hierarchical model. At level 1, the outcome Y ti takes on a value of 1 if participant i commits serious theft during time interval t, and the value is 0 if not. We thus adopt a Bernoulli probability model as in Horney et al (1995) and, provisionally, a logit linear structural model in η ti = π 0i + π 1i a ti + π 2i a 2 ti, (9) which the log odds of committing serious theft are a quadratic function of age: where a ti is the age of person i at time t minus 16 (the median age of the sample during the 5 years of data collection). Ages range from 11 to 21 across the seven cohorts during the 5 years of the study. At level 2, each of the three πs depends on gender (an indicator for female status), family income, and ethnicity (an indicator

12 512 RAUDENBUSH for minority status, with 1 = African American or Hispanic American; 0 = other): 1 π pi = β p0 + β p1( female) i + β p2(minority) i + u pi, (10) for p = 0, 1, and 2. The level-2 probability model assumes the random effects u pi to be normal in distribution with coefficients for linear and quadratic age fixed. The curves in Figure 3 apply the results to the inverse logit transform 1 ˆµ ti = (11) 1 + exp{ ˆη ti } that is, the predicted probability of committing serious theft for each of four groups (majority males, majority females, minority males, and minority females). While Figure 3 is interesting, it is difficult to interpret the polynomial coefficients. More interesting would be parameters that map onto a developmental theory linked to literature from the age-crime curve. SW Raudenbush (2001) transformed the polynomial model for the individual trajectory into a model with three parameters: (a) the peak age of offending, that is, the age at which the expected probability of offending is at a maximum; (b) the peak probability of offending, that is, the probability of offending at the peak age; and (c) the persistence of offending, that is, the extent to which a person continues to offend rather than to desist from offending during the transition to young adulthood. The graph suggests, for example, that females peak earlier than males and that, when they do, their offending rate is much lower than that of males. It also suggests that minority and majority youth have quite similar curves except that minority males tend to peak later and to persist. Consistent with the results of Horney et al (1995), we might expect that this persistence reflects the fact that minority males are having a harder time getting jobs and are more likely to remain single than are majority males, although we cannot test this hypothesis with the current data. The example shows that the level-1 structural model can be recast such that the parameters capture theoretically interesting properties of development. At the same time, the probabilistic part of the model is consistent with the binary nature of the data. A parallel analysis using the frequency of crime reached similar results. In this case, µ ti was the event rate for person i at time t, the link function was the log link, that is η ti = log(µ ti ), and the structural model for η ti is the same as in Equation 9. Expanding the Class of Level-2 Models Thus far I have emphasized the importance of getting the model for individual development right. It is equally important to make sound choices about the model for individual variation in development. 1 The vast majority of those sampled in the National Youth Survey were either European American or African American. Sample sizes of other subgroups were too small for analysis, especially in light of the low frequency of serious theft.

13 LONGITUDINAL ANALYSIS 513 Simultaneous Equation Models Applications of hierarchical models to date have nearly always specified multivariate regression models at level 2. In terms of our paradigm example of Equations 1 and 2, this means expanding the level-2 model, that is, in an especially simple case, π 0i = β 00 + β 01 X i + u 0i π 1i = β 10 + β 11 X i + u 1i (12) ( ) [ ( ) ( ) ] u0i 0 τ00 τ with N, 01. u 1i 0 τ 10 τ 11 Here the level-1 parameters of individual change, π 0i and π 1i, become correlated outcomes predicted by person characteristic X i. Raudenbush & Sampson (1999) show, however, how to specify and estimate simultaneous equation models in which the random coefficient, say π 0i, becomes a mediating variable. In the model below, X i has both a direct effect on the rate of change π 1i, given the intercept π 0i, and an indirect effect operating through the intercept. The model simple moves π 0i in Equation 13 to the right side of the equation for π 1i : E(π 1i π 0i ) = α 10 + α 11 X i + α 12 π 0i = β 10 α 12 β 00 + (β 11 α 12 β 01 )X i + α 12 u 0i. (13) Here α 11 = β 11 α 12 β 01 is the direct effect of X on π 1 ; α 12 β 01 is the indirect effect of X on π 1 operating through π 0 and α 12 = τ 10 /τ 00. A Multinomial Prior Assumption In many studies of growth, it is reasonable to assume that all participants are growing according to some common function but that the growth parameters vary in magnitude. For example, vocabulary growth curves for a normative sample of children invariably show upward curvature (acceleration) during the second year of life, and the interesting question about individual differences is the rate of acceleration (Huttenlocher et al 1991). For many other repeated-measures studies, however, the situation is quite different. Consider a study of changes in depression. It makes no sense to assume that everyone is increasing (or decreasing) with respect to depression. In a normative sample, many persons will never be high in depression, whereas others will always be high; some persons will be recovering from serious depression, but others will become increasingly depressed, and perhaps another group will oscillate between high and low levels of depression. Such depression curves can certainly be represented by a polynomial of sufficiently high degree, say three (i.e. the cubic degree) in the level-1 model. However, linear models for the polynomial coefficients, π, at level 2 may not capture the qualitatively different types of trajectories found in the population. To model these kinds of data, Nagin (1999) has developed a two-level model in which the first level is similar to those discussed in this paper (see also Muthen

14 514 RAUDENBUSH 2001). However, the second level of the model is reconceptualized such that the population is viewed as (approximately) falling into a fixed number of groups, where each group s development is characterized by a common set of change parameters (πs). The summary of evidence from this model is a set of conditional probabilities for each person: the probability that a person is in group 1, the probability that the person is in group 2, etc. In the example of a study of depression, group 1 might be the always depressed group while group 2 might be those who are becoming depressed. A multinomial regression model then can predict the probabilities of group membership. It may be, for example, that the predictors of being in the always depressed group are quite different from the predictors of being in the becoming depressed group. This model thus seems especially useful when trajectories of change involve sets of parameters that mark qualitatively different kinds of development. Nagin shows how to test the appropriateness of the assumed number of groups and thus to test alternative models for types of change. Alternative Probabilistic Models The vast majority of level-2 models assume that departures of the change parameters (the πs) from their predicted values are multivariate normal. This is certainly a convenient assumption, as mentioned, but it may poorly fit the data, and results may not be robust for departures from it. In particular, outlier values of the growth parameters may be far more influential than one would desire, particularly in small-sample settings. To develop more robust estimation, Seltzer (1993) adopted a multivariate t-prior distribution for the random effects (e.g. u 0i, u 1i in Equation 2). The t-prior anticipates more outliers than does the normal prior, and, as a result, model estimates are more resistant to the influence of such outliers. Seltzer embedded this approach within a Bayesian framework a three-level hierarchical model with the first two levels of the type given by Equations 1 and 2 and a third level specifying a prior for the level-2 parameters. Thum (1997) also adopted a t-prior in his multilevelmultivariate model with estimation via maximum likelihood. This model allows covariance structure analysis at each of two levels based on incomplete data. Causal Inference Let us consider the problem of causal inference in studies in which the outcome is an entire trajectory of change. However, I first review some basic ideas about causation in cross-sectional studies. Causal Effects in Cross-Sectional Studies What is a causal effect? Many discussions of causal inference and research design neglect to confront this issue. However, a theory that has come to dominate modern thinking in statistics about cause begins with this fundamental question. Pioneered by Rubin (1974) and Rosenbaum & Rubin (1983) and elaborated by Holland (1986), this theory has come to be known as the Rubin-Rosenbaum-Holland (RRH). To describe this theory, the simplest case will suffice; given a causal variable ( the treatment ) with two

15 LONGITUDINAL ANALYSIS 515 possible values ( experimental and control ), for clarity, consider a case in which a child receives either the new experimental approach to daycare (E) or the currently available approach (C) and the outcome, Y, is a measure of self-regulation. If the child receives E, we observe Y i (E), the outcome of child i under E. However, if that same child receives C, we observe Y i (C), the outcome of child i under C. The causal effect of the experimental treatment (relative to the control) is defined as the difference between these two potential outcomes, i = Y i (E) Y i (C). (14) Several conclusions follow from this definition. First, the causal effect i is defined uniquely for each child. The impact of the treatment can thus vary from child to child. Modern thinking about cause thus rejects the conventional assumption that a new treatment adds a constant effect for every child. This assumption, never realistic to scientists or practitioners, was imposed historically to simplify statistical analysis. Second, the causal effect cannot be observed. If a given child is assigned to E, we will observe Y i (E), but not Y i (C). On the other hand, if the child is assigned to C, we will observe Y i (C) but not Y i (E). The outcome that cannot be observed is the counter-factual outcome. Third, it must be reasonable at least to imagine a scenario in which that child could have received either E or C. If it is not possible to conceive of each child s response under each treatment, it is not possible to define a causal effect. Therefore, in current thinking about cause in statistical science, a fixed attribute of a child (say sex or ethnic background) cannot typically be a cause. We cannot realistically imagine how a girl would have responded if she had been a boy or how a black child would have responded if that child had been white. Epidemiologists have referred to such attributes as fixed markers (Kraemer et al 1996), unchangeable attributes that may be statistically related to an outcome but cannot be causes. Randomized Studies According to RRH, the problem of causal inference is a problem of missing data. If both potential outcomes Y i (E) and Y i (C) were observed, the causal effect could simply be calculated for each participant. However, one of the potential outcomes (the counterfactual) is always missing. Randomized studies ensure that the missing counterfactual is missing completely at random. In effect, the mechanism of random assignment ensures that the decision about which outcome is observed [Y i (E) ory i (C)] is decided by chance alone. While it is not possible to estimate the causal effect for each participant, a randomized experiment enables unbiased estimation of the average causal effect, that is = E[Y i (E) Y i (C)], (15) the population average difference between potential outcomes. The estimator is simply the difference between sample means of the experimental and control groups.

16 516 RAUDENBUSH Nonrandomized Studies Without random assignment, we cannot assume that the counterfactual is missing at random. First, those selected into the experimental group may be more (or less) disadvantaged than those selected into the control group. If so, the potential outcomes of those assigned to the experimental group would be higher (or lower), on average, than of those assigned to the control group in the absence of a treatment effect. Researchers often attempt to control for confounding variables preexisting variables that predict treatment group membership and are related to the potential outcomes by means of matching or statistical adjustments such as the analysis of covariance (Cook & Campbell 1979). However, one can never be sure that the relevant confounding variables have been identified or that the method of adjustment has been completely effective. Second, participants may have information about the likely benefit of choosing to participate in the treatment, information not available to the researcher. For example, a person choosing not to participate in a job-training program may know that she is about to be offered a good-paying job, better than can be expected to result from participating in the program. Such biases cannot be controlled because, by definition, the information needed to adjust for bias is unknown to the researcher. However, certain natural experiments can be exploited to avoid such bias. For example, a job-training program may be available only in certain states, ensuring that the choice to participate is not entirely under the control of the participants. Instrumental variable methods (cf Angrist et al 1996, Little & Yau 1998) can be used to exploit such natural experiments in order to minimize bias in nonexperimental studies. It has often been argued that collecting longitudinal data can also increase the credibility of causal inferences when randomization is impossible. We consider this issue, among others, in the following sections. Causal Inference in Longitudinal Studies: Between-Subject Causes Consider a case in which participants are assigned to one of two treatments to assess how the treatment affects an entire trajectory. Now we have a trajectory characterized by change parameters π i (E) if person i is assigned to E and π i (C)if person i is assigned to C. Only one of the two trajectories will be observed, leaving the counterfactual trajectory missing. Randomized Experiments Under random assignment, the counterfactual trajectory is missing completely at random, making it easy to calculate an unbiased estimate of the average treatment effect as the mean trajectory in the experimental group minus the mean trajectory in the control group assuming no attrition. If data are missing through attrition, for example, because some participants miss some appointments to be assessed or some drop out of the study, there are three possibilities (Little & Rubin 1987, Schafer 1997).

17 LONGITUDINAL ANALYSIS The attrition may lead to data that are missing completely at random (MCAR). Data that are MCAR result when the missing time points are a random sample of all time points or the dropouts are a random sample of all participants. In this case unbiased estimation follows easily using conventional methods, for example analyzing only those cases with complete data. However, to assume that data are MCAR is generally implausible, impossible to verify, and risky. 2. The data may be missing at random (MAR). MAR occurs when the probability of missing a time point is independent of the missing data, given the observed data. This assumption is reasonable when the observed data capture key confounding influences, for example, variables that predict both attrition and the outcome of interest. Under the MAR case, estimation of the treatment effect will be unbiased if (a) all of the data are used in the analysis and (b) a fully efficient estimation procedure is used. For example, maximum likelihood estimation of hierarchical models (also termed multilevel models of mixed models see earlier discussion) will efficiently use all of the available time points to estimate the model, assuring unbiased estimation of treatment effects under the MAR case. Under these conditions, the mechanism that produces the missingness is ignorable. Use of multiple, model-based imputation (Little & Rubin 1987, Schafer 1997) will also ensure ignorable missingness when data are MAR. 3. Nonignorable missingness arises when the data are neither MCAR or MAR. In this case, the probability of attrition does depend on the missing value, even after controlling for all observed data. Results will be robust to nonignorable missingness to the extent that (a) all of the data are efficiently used and (b) the fraction of missing information is small. 2 Little (1995) and Hedekker & Gibbons (1997) describe methods for nonignorably missing longitudinal data under the rubric of pattern-mixture models. Nonrandomized Studies Without random assignment, estimation of treatment effects will, in general, be biased even if there is no attrition. We therefore consider only the no-attrition case, keeping in mind that the concerns about attrition that apply in randomized experiments will also apply in nonrandomized experiments. Below I consider how longitudinal studies can overcome or reduce bias when it is impossible to randomly assign participants to treatments. I consider studies with two time points, three time points, and more than three time points. 2 The fraction of missing information is not the fraction of missing data points. It is, rather, related to the amount of variation in missing data that is not explained or accounted for by observed data. If the associations between the observed and missing data are strong, the fraction of missing data will be small even if the fraction of missing cases appears quite large.

18 518 RAUDENBUSH Figure 4 Causal inference when children are growing: pre-post design. Causal Comparative Studies with Two Time Points Consider a nonrandomized cross-sectional study. As mentioned above, the mean difference between outcomes of experimental and control groups is not, in general, an unbiased estimate of the treatment effect. One strategy to cope with bias is to add a pretest. Assuming Y is measured on a common scale at each time point, the difference in gain scores is arguably less biased than the difference in post-test scores. Consider, for example, Figure 4. The figure shows the average pre-post trajectory for experimental groups and for controls. The length of the interval in curly braces is presumably the treatment effect. Using Equation 12 as a model, this estimate is ˆβ 11 (T 2 T 1 ), (16) that is, the difference in mean growth rates between the two groups (β 11 ) multiplied by the time elapsed between the pre- and post-test. The problem is that such an estimate assumes that the two groups would have experienced identical growth rates in the absence of the treatment. Given nonrandom assignment, this seems improbable. A design having two time points gives no information about the rate of growth of the experimental group in the absence of a treatment. This fact is illustrated in Figures 5A and 5B. Figure 5A shows parallel growth of the two groups before the treatment, making the interval in curly braces plausible as a measure of treatment impact. However, Figure 5B shows nonparallel growth before the implementation of the treatment, undermining any case for that estimate. Two time points provide no basis for adjudicating

19 LONGITUDINAL ANALYSIS 519 Figure 5A Scenario 1: parallel growth in absence of treatment. between Figures 5A and 5B or any other pretreatment growth configuration. The problem does not follow from the use of gain scores. Bryk & Weisberg (1977) showed that all strategies of linear adjustment (including not only the gain score adjustment but also the analysis of covariance) are subject to biases that cannot be investigated with only two time points. Figure 5B Scenario 2: Fan spread growth in absence of treatment.

20 520 RAUDENBUSH Three Time Points Adding a third time point, in particular, a pre-pre-test, adds substantially to the information available in estimating the treatment effect. When we elaborate Equations 1 and 2 to incorporate this information, our level-1 model becomes Y ti = π 0i + π 1i a ti + δ i Z ti + e ti. (17) Here Z ti is a dummy variable taking on a value of 1 at the third time point and 0 otherwise, so that δ i is the deflection experienced by participant i between times 2 and 3, that is, when the treatment is implemented. The other terms in Equation 4 are defined as in Equation 1. We now elaborate the level-2 model to allow the deflection to depend on group membership: π 0i = β 00 + β 01 X i + u 0i π 1i = β 10 + β 11 X i + u 1i δ i = c + X i. (18) As displayed in Figure 6, this model allows estimation of (a) different mean initial status for C and E;(b) different pretreatment growth rates for C and E; and (c) different nonlinear deflections for C and E at time of treatment. The difference between these nonlinear deflections is the presumptive treatment effect. Clearly, adding a single time point has added a substantial amount of information relevant to estimating the treatment effect. More Than Two Time Points The quasi experimental design with three points, while much stronger than the design with two points, may be criticized for assuming that both groups would have the same nonlinear deflection in the absence of a treatment. Adding a fourth time point can distinguish nonlinear deflections and a treatment effect. Adding a fourth time point (a second post-test) can also enable the estimation of a randomly varying effect of treatment. This brings the model into closer alignment with the notion of participant-specific effects that is central to the potential-outcomes framework. Time-Varying Treatments We have limited the discussion so far to the case in which participants experience one and only one treatment. Yet many interesting causal questions involve time-varying treatments. Every time a child speaks, a caregiver s response can be conceived as a treatment. A time series unfolds in which each speech event triggers such a treatment, which affects the next speech event, and so on. Other examples include the use of medical treatment: a physician will assess a child and prescribe (or not prescribe) a drug; the physician will then follow up later with another assessment and another decision to prescribe or not, etc. In school, children are assigned to special education (or not) based on assessments of academic progress

21 LONGITUDINAL ANALYSIS 521 Figure 6 Estimating the treatment effect based on three time points. and behavior and are then mainstreamed (or not) based on subsequent behavior, which has arisen, in part, as a result of the first placement decision. In all of these examples, the assessment, treatment, later assessment, and later treatment are interconnected. There is a serious risk in these examples that the analyst will estimate the effect of treatment on the outcome with bias. Children who are assigned medication will do worse than those who are not, making the medication look harmful; children assigned to special education will look worse

22 522 RAUDENBUSH than those who are not, making special-education placement look harmful; and the mainstreamed will look better than those who are not, making mainstreaming look helpful even if the medication, placement, and mainstreaming have no effect. A dynamic-treatment regime arises when a time-varying treatment or dose is calibrated to the current status of the participant. It is normally true that the worse the current status, the larger the dose. The regime is the set of rules that assign doses to children by their statuses. Methodological research shows that one can make sound inferences about regimes, for example, by assigning participants at random to the regime (Robins et al 2000). However, if a regime is strictly adhered to, it is not possible, within regimes, to assess the causal effect of a dose. Only if the dose can vary, given current status, is it possible to make a causal inference about the dose. The problem of causal inference for time-varying treatments, assuming no randomization, is extraordinarily challenging. It is a cutting-edge issue on which progress is being made, but methodological workers in this area have not yet achieved a consensus on the best ways to proceed. Summary and Conclusions A statistical model is essential in precisely defining each participant s trajectory and facilitating comparisons of persons by comparing their trajectories. Based on such a model, one can define and estimate the reliability of measures of change. One can also define effect sizes and, with supplementary data at hand, compute the power of alternative designs (Hedeker & Gibbons 1997; Muthen & Curran 1997). Alternative models posit alternative views of confounding variables and alternative ways in which attrition might affect the precision and bias of inferences. An important new line of research considers how to extend modern thinking about causal effects to longitudinal data. According to this modern theory, a causal effect is person specific; it is the difference between how a person would respond under one condition and how that same person would respond under a different condition. The causal effect can thus vary among people. However, it cannot be directly computed, because invariably, a person will experience one condition but not the other at any given time. The set of responses that might occur for a person under the varied conditions possible is called the set of potential outcomes. A confounder is a pretreatment characteristic of a participant that is related to both the propensity to receive the treatments and to the potential outcomes. Valid causal inferences require that the propensity to receive treatments is independent of the potential outcomes, given the potential confounders on which the investigator has collected data. Randomized experiments ensure independence. In nonrandomized experiments, knowledge of predictors of propensity to receive the various treatments is essential in seeking valid causal inferences. In the nonrandomized setting, the burden of proof is on the shoulders of the investigator to show that the relevant confounders have been taken into account, and it will be difficult to avoid lingering disagreement about the success of the attempt to take these into account.

Quantitative Methods. Lonnie Berger. Research Training Policy Practice

Quantitative Methods. Lonnie Berger. Research Training Policy Practice Quantitative Methods Lonnie Berger Research Training Policy Practice Defining Quantitative and Qualitative Research Quantitative methods: systematic empirical investigation of observable phenomena via

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Regression Discontinuity Analysis

Regression Discontinuity Analysis Regression Discontinuity Analysis A researcher wants to determine whether tutoring underachieving middle school students improves their math grades. Another wonders whether providing financial aid to low-income

More information

Module 14: Missing Data Concepts

Module 14: Missing Data Concepts Module 14: Missing Data Concepts Jonathan Bartlett & James Carpenter London School of Hygiene & Tropical Medicine Supported by ESRC grant RES 189-25-0103 and MRC grant G0900724 Pre-requisites Module 3

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data TECHNICAL REPORT Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data CONTENTS Executive Summary...1 Introduction...2 Overview of Data Analysis Concepts...2

More information

Supplement 2. Use of Directed Acyclic Graphs (DAGs)

Supplement 2. Use of Directed Acyclic Graphs (DAGs) Supplement 2. Use of Directed Acyclic Graphs (DAGs) Abstract This supplement describes how counterfactual theory is used to define causal effects and the conditions in which observed data can be used to

More information

Addendum: Multiple Regression Analysis (DRAFT 8/2/07)

Addendum: Multiple Regression Analysis (DRAFT 8/2/07) Addendum: Multiple Regression Analysis (DRAFT 8/2/07) When conducting a rapid ethnographic assessment, program staff may: Want to assess the relative degree to which a number of possible predictive variables

More information

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016 Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP233201500069I 5/2/2016 Overview The goal of the meta-analysis is to assess the effects

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Jee-Seon Kim and Peter M. Steiner Abstract Despite their appeal, randomized experiments cannot

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

PARTIAL IDENTIFICATION OF PROBABILITY DISTRIBUTIONS. Charles F. Manski. Springer-Verlag, 2003

PARTIAL IDENTIFICATION OF PROBABILITY DISTRIBUTIONS. Charles F. Manski. Springer-Verlag, 2003 PARTIAL IDENTIFICATION OF PROBABILITY DISTRIBUTIONS Charles F. Manski Springer-Verlag, 2003 Contents Preface vii Introduction: Partial Identification and Credible Inference 1 1 Missing Outcomes 6 1.1.

More information

Complier Average Causal Effect (CACE)

Complier Average Causal Effect (CACE) Complier Average Causal Effect (CACE) Booil Jo Stanford University Methodological Advancement Meeting Innovative Directions in Estimating Impact Office of Planning, Research & Evaluation Administration

More information

Chapter 02. Basic Research Methodology

Chapter 02. Basic Research Methodology Chapter 02 Basic Research Methodology Definition RESEARCH Research is a quest for knowledge through diligent search or investigation or experimentation aimed at the discovery and interpretation of new

More information

Studying the effect of change on change : a different viewpoint

Studying the effect of change on change : a different viewpoint Studying the effect of change on change : a different viewpoint Eyal Shahar Professor, Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona

More information

Student Performance Q&A:

Student Performance Q&A: Student Performance Q&A: 2009 AP Statistics Free-Response Questions The following comments on the 2009 free-response questions for AP Statistics were written by the Chief Reader, Christine Franklin of

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

Cochrane Pregnancy and Childbirth Group Methodological Guidelines

Cochrane Pregnancy and Childbirth Group Methodological Guidelines Cochrane Pregnancy and Childbirth Group Methodological Guidelines [Prepared by Simon Gates: July 2009, updated July 2012] These guidelines are intended to aid quality and consistency across the reviews

More information

Chapter 13 Estimating the Modified Odds Ratio

Chapter 13 Estimating the Modified Odds Ratio Chapter 13 Estimating the Modified Odds Ratio Modified odds ratio vis-à-vis modified mean difference To a large extent, this chapter replicates the content of Chapter 10 (Estimating the modified mean difference),

More information

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics What is Multilevel Modelling Vs Fixed Effects Will Cook Social Statistics Intro Multilevel models are commonly employed in the social sciences with data that is hierarchically structured Estimated effects

More information

EVALUATING THE INTERACTION OF GROWTH FACTORS IN THE UNIVARIATE LATENT CURVE MODEL. Stephanie T. Lane. Chapel Hill 2014

EVALUATING THE INTERACTION OF GROWTH FACTORS IN THE UNIVARIATE LATENT CURVE MODEL. Stephanie T. Lane. Chapel Hill 2014 EVALUATING THE INTERACTION OF GROWTH FACTORS IN THE UNIVARIATE LATENT CURVE MODEL Stephanie T. Lane A thesis submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment

More information

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha attrition: When data are missing because we are unable to measure the outcomes of some of the

More information

Some interpretational issues connected with observational studies

Some interpretational issues connected with observational studies Some interpretational issues connected with observational studies D.R. Cox Nuffield College, Oxford, UK and Nanny Wermuth Chalmers/Gothenburg University, Gothenburg, Sweden ABSTRACT After some general

More information

Logistic regression: Why we often can do what we think we can do 1.

Logistic regression: Why we often can do what we think we can do 1. Logistic regression: Why we often can do what we think we can do 1. Augst 8 th 2015 Maarten L. Buis, University of Konstanz, Department of History and Sociology maarten.buis@uni.konstanz.de All propositions

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis EFSA/EBTC Colloquium, 25 October 2017 Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis Julian Higgins University of Bristol 1 Introduction to concepts Standard

More information

The Use of Piecewise Growth Models in Evaluations of Interventions. CSE Technical Report 477

The Use of Piecewise Growth Models in Evaluations of Interventions. CSE Technical Report 477 The Use of Piecewise Growth Models in Evaluations of Interventions CSE Technical Report 477 Michael Seltzer CRESST/University of California, Los Angeles Martin Svartberg Norwegian University of Science

More information

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling STATISTICS IN MEDICINE Statist. Med. 2009; 28:3363 3385 Published online 3 September 2009 in Wiley InterScience (www.interscience.wiley.com).3721 Estimating drug effects in the presence of placebo response:

More information

Chapter 6 Topic 6B Test Bias and Other Controversies. The Question of Test Bias

Chapter 6 Topic 6B Test Bias and Other Controversies. The Question of Test Bias Chapter 6 Topic 6B Test Bias and Other Controversies The Question of Test Bias Test bias is an objective, empirical question, not a matter of personal judgment. Test bias is a technical concept of amenable

More information

Propensity scores: what, why and why not?

Propensity scores: what, why and why not? Propensity scores: what, why and why not? Rhian Daniel, Cardiff University @statnav Joint workshop S3RI & Wessex Institute University of Southampton, 22nd March 2018 Rhian Daniel @statnav/propensity scores:

More information

Moving beyond regression toward causality:

Moving beyond regression toward causality: Moving beyond regression toward causality: INTRODUCING ADVANCED STATISTICAL METHODS TO ADVANCE SEXUAL VIOLENCE RESEARCH Regine Haardörfer, Ph.D. Emory University rhaardo@emory.edu OR Regine.Haardoerfer@Emory.edu

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information

Exploring the Impact of Missing Data in Multiple Regression

Exploring the Impact of Missing Data in Multiple Regression Exploring the Impact of Missing Data in Multiple Regression Michael G Kenward London School of Hygiene and Tropical Medicine 28th May 2015 1. Introduction In this note we are concerned with the conduct

More information

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Dean Eckles Department of Communication Stanford University dean@deaneckles.com Abstract

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Causal Mediation Analysis with the CAUSALMED Procedure

Causal Mediation Analysis with the CAUSALMED Procedure Paper SAS1991-2018 Causal Mediation Analysis with the CAUSALMED Procedure Yiu-Fai Yung, Michael Lamm, and Wei Zhang, SAS Institute Inc. Abstract Important policy and health care decisions often depend

More information

Section on Survey Research Methods JSM 2009

Section on Survey Research Methods JSM 2009 Missing Data and Complex Samples: The Impact of Listwise Deletion vs. Subpopulation Analysis on Statistical Bias and Hypothesis Test Results when Data are MCAR and MAR Bethany A. Bell, Jeffrey D. Kromrey

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE ...... EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE TABLE OF CONTENTS 73TKey Vocabulary37T... 1 73TIntroduction37T... 73TUsing the Optimal Design Software37T... 73TEstimating Sample

More information

Regression Discontinuity Design

Regression Discontinuity Design Regression Discontinuity Design Regression Discontinuity Design Units are assigned to conditions based on a cutoff score on a measured covariate, For example, employees who exceed a cutoff for absenteeism

More information

Identifying Mechanisms behind Policy Interventions via Causal Mediation Analysis

Identifying Mechanisms behind Policy Interventions via Causal Mediation Analysis Identifying Mechanisms behind Policy Interventions via Causal Mediation Analysis December 20, 2013 Abstract Causal analysis in program evaluation has largely focused on the assessment of policy effectiveness.

More information

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Impact and adjustment of selection bias. in the assessment of measurement equivalence Impact and adjustment of selection bias in the assessment of measurement equivalence Thomas Klausch, Joop Hox,& Barry Schouten Working Paper, Utrecht, December 2012 Corresponding author: Thomas Klausch,

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Adam C. Carle, M.A., Ph.D. adam.carle@cchmc.org Division of Health Policy and Clinical Effectiveness

More information

Chapter 11. Experimental Design: One-Way Independent Samples Design

Chapter 11. Experimental Design: One-Way Independent Samples Design 11-1 Chapter 11. Experimental Design: One-Way Independent Samples Design Advantages and Limitations Comparing Two Groups Comparing t Test to ANOVA Independent Samples t Test Independent Samples ANOVA Comparing

More information

The Pretest! Pretest! Pretest! Assignment (Example 2)

The Pretest! Pretest! Pretest! Assignment (Example 2) The Pretest! Pretest! Pretest! Assignment (Example 2) May 19, 2003 1 Statement of Purpose and Description of Pretest Procedure When one designs a Math 10 exam one hopes to measure whether a student s ability

More information

C h a p t e r 1 1. Psychologists. John B. Nezlek

C h a p t e r 1 1. Psychologists. John B. Nezlek C h a p t e r 1 1 Multilevel Modeling for Psychologists John B. Nezlek Multilevel analyses have become increasingly common in psychological research, although unfortunately, many researchers understanding

More information

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials EFSPI Comments Page General Priority (H/M/L) Comment The concept to develop

More information

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA BIOSTATISTICAL METHODS AND RESEARCH DESIGNS Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA Keywords: Case-control study, Cohort study, Cross-Sectional Study, Generalized

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

Introduction to Observational Studies. Jane Pinelis

Introduction to Observational Studies. Jane Pinelis Introduction to Observational Studies Jane Pinelis 22 March 2018 Outline Motivating example Observational studies vs. randomized experiments Observational studies: basics Some adjustment strategies Matching

More information

Chapter 1: Explaining Behavior

Chapter 1: Explaining Behavior Chapter 1: Explaining Behavior GOAL OF SCIENCE is to generate explanations for various puzzling natural phenomenon. - Generate general laws of behavior (psychology) RESEARCH: principle method for acquiring

More information

Lecture Notes Module 2

Lecture Notes Module 2 Lecture Notes Module 2 Two-group Experimental Designs The goal of most research is to assess a possible causal relation between the response variable and another variable called the independent variable.

More information

Donna L. Coffman Joint Prevention Methodology Seminar

Donna L. Coffman Joint Prevention Methodology Seminar Donna L. Coffman Joint Prevention Methodology Seminar The purpose of this talk is to illustrate how to obtain propensity scores in multilevel data and use these to strengthen causal inferences about mediation.

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis Advanced Studies in Medical Sciences, Vol. 1, 2013, no. 3, 143-156 HIKARI Ltd, www.m-hikari.com Detection of Unknown Confounders by Bayesian Confirmatory Factor Analysis Emil Kupek Department of Public

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

EXPERIMENTAL RESEARCH DESIGNS

EXPERIMENTAL RESEARCH DESIGNS ARTHUR PSYC 204 (EXPERIMENTAL PSYCHOLOGY) 14A LECTURE NOTES [02/28/14] EXPERIMENTAL RESEARCH DESIGNS PAGE 1 Topic #5 EXPERIMENTAL RESEARCH DESIGNS As a strict technical definition, an experiment is a study

More information

An Introduction to Proportional Navigation

An Introduction to Proportional Navigation Chapter 9 An Introduction to Proportional Navigation odule 8: Lecture 22 PPN; TPN; GTPN; IPN Keywords. Proportional Navigation, PPN, TPN, GTPN, IPN Proportional navigation is, by far, the most important

More information

Ec331: Research in Applied Economics Spring term, Panel Data: brief outlines

Ec331: Research in Applied Economics Spring term, Panel Data: brief outlines Ec331: Research in Applied Economics Spring term, 2014 Panel Data: brief outlines Remaining structure Final Presentations (5%) Fridays, 9-10 in H3.45. 15 mins, 8 slides maximum Wk.6 Labour Supply - Wilfred

More information

Social Determinants and Consequences of Children s Non-Cognitive Skills: An Exploratory Analysis. Amy Hsin Yu Xie

Social Determinants and Consequences of Children s Non-Cognitive Skills: An Exploratory Analysis. Amy Hsin Yu Xie Social Determinants and Consequences of Children s Non-Cognitive Skills: An Exploratory Analysis Amy Hsin Yu Xie Abstract We assess the relative role of cognitive and non-cognitive skills in mediating

More information

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth 1 The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth Madeleine Benjamin, MA Policy Research, Economics and

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Examining Relationships Least-squares regression. Sections 2.3

Examining Relationships Least-squares regression. Sections 2.3 Examining Relationships Least-squares regression Sections 2.3 The regression line A regression line describes a one-way linear relationship between variables. An explanatory variable, x, explains variability

More information

Chapter 2 Interactions Between Socioeconomic Status and Components of Variation in Cognitive Ability

Chapter 2 Interactions Between Socioeconomic Status and Components of Variation in Cognitive Ability Chapter 2 Interactions Between Socioeconomic Status and Components of Variation in Cognitive Ability Eric Turkheimer and Erin E. Horn In 3, our lab published a paper demonstrating that the heritability

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

Sensitivity Analysis in Observational Research: Introducing the E-value

Sensitivity Analysis in Observational Research: Introducing the E-value Sensitivity Analysis in Observational Research: Introducing the E-value Tyler J. VanderWeele Harvard T.H. Chan School of Public Health Departments of Epidemiology and Biostatistics 1 Plan of Presentation

More information

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC Selected Topics in Biostatistics Seminar Series Missing Data Sponsored by: Center For Clinical Investigation and Cleveland CTSC Brian Schmotzer, MS Biostatistician, CCI Statistical Sciences Core brian.schmotzer@case.edu

More information

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews Non-Experimental Evidence in Systematic Reviews OPRE REPORT #2018-63 DEKE, MATHEMATICA POLICY RESEARCH JUNE 2018 OVERVIEW Federally funded systematic reviews of research evidence play a central role in

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

UNIVERSITY OF FLORIDA 2010

UNIVERSITY OF FLORIDA 2010 COMPARISON OF LATENT GROWTH MODELS WITH DIFFERENT TIME CODING STRATEGIES IN THE PRESENCE OF INTER-INDIVIDUALLY VARYING TIME POINTS OF MEASUREMENT By BURAK AYDIN A THESIS PRESENTED TO THE GRADUATE SCHOOL

More information

Cross-Lagged Panel Analysis

Cross-Lagged Panel Analysis Cross-Lagged Panel Analysis Michael W. Kearney Cross-lagged panel analysis is an analytical strategy used to describe reciprocal relationships, or directional influences, between variables over time. Cross-lagged

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models

The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Educational Psychology Papers and Publications Educational Psychology, Department of 7-1-2001 The Relative Performance of

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Causal Methods for Observational Data Amanda Stevenson, University of Texas at Austin Population Research Center, Austin, TX

Causal Methods for Observational Data Amanda Stevenson, University of Texas at Austin Population Research Center, Austin, TX Causal Methods for Observational Data Amanda Stevenson, University of Texas at Austin Population Research Center, Austin, TX ABSTRACT Comparative effectiveness research often uses non-experimental observational

More information

Chapter Three: Sampling Methods

Chapter Three: Sampling Methods Chapter Three: Sampling Methods The idea of this chapter is to make sure that you address sampling issues - even though you may be conducting an action research project and your sample is "defined" by

More information

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

Multiple Imputation For Missing Data: What Is It And How Can I Use It?

Multiple Imputation For Missing Data: What Is It And How Can I Use It? Multiple Imputation For Missing Data: What Is It And How Can I Use It? Jeffrey C. Wayman, Ph.D. Center for Social Organization of Schools Johns Hopkins University jwayman@csos.jhu.edu www.csos.jhu.edu

More information

educational assessment and educational measurement

educational assessment and educational measurement EDUCATIONAL ASSESSMENT AND EDUCATIONAL MEASUREMENT research line 5 educational assessment and educational measurement EDUCATIONAL ASSESSMENT AND EDUCATIONAL MEASUREMENT 98 1 Educational Assessment 100

More information

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study STATISTICAL METHODS Epidemiology Biostatistics and Public Health - 2016, Volume 13, Number 1 Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation

More information

Does Body Mass Index Adequately Capture the Relation of Body Composition and Body Size to Health Outcomes?

Does Body Mass Index Adequately Capture the Relation of Body Composition and Body Size to Health Outcomes? American Journal of Epidemiology Copyright 1998 by The Johns Hopkins University School of Hygiene and Public Health All rights reserved Vol. 147, No. 2 Printed in U.S.A A BRIEF ORIGINAL CONTRIBUTION Does

More information

Structural Approach to Bias in Meta-analyses

Structural Approach to Bias in Meta-analyses Original Article Received 26 July 2011, Revised 22 November 2011, Accepted 12 December 2011 Published online 2 February 2012 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/jrsm.52 Structural

More information

investigate. educate. inform.

investigate. educate. inform. investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN)

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN) UNIT 4 OTHER DESIGNS (CORRELATIONAL DESIGN AND COMPARATIVE DESIGN) Quasi Experimental Design Structure 4.0 Introduction 4.1 Objectives 4.2 Definition of Correlational Research Design 4.3 Types of Correlational

More information

Conducting Strong Quasi-experiments

Conducting Strong Quasi-experiments Analytic Technical Assistance and Development Conducting Strong Quasi-experiments Version 1 May 2015 This report was prepared for the Institute of Education Sciences (IES) by Decision Information Resources,

More information