When to Use Hierarchical Linear Modeling

Size: px
Start display at page:

Download "When to Use Hierarchical Linear Modeling"

Transcription

1 Veronika Huta, a When to Use Hierarchical Linear odeling a School of psychology, University of Ottawa Abstract revious publications on hierarchical linear modeling (HL) have provided guidance on how to perform the analysis, yet there is relatively little information on two questions that arise even before analysis: Does HL apply to one s data and research question? And if it does apply, how does one choose between HL and other methods sometimes used in these circumstances, including multiple regression, repeated-measures or mixed ANOVA, and structural equation modeling or path analysis? The purpose of this tutorial is to briefly introduce HL and then to review some of the considerations that are helpful in answering these questions, including the nature of the data, the model to be tested, and the information desired on the output. Some examples of how the same analysis could be performed in HL, repeated-measures or mixed ANOVA, and structural equation modeling or path analysis are also provided. Keywords hierarchical linear modeling; multilevel modeling; repeated-measures; analysis of variance; structural equation modeling; path analysis vhuta@uottawa.ca Introduction Hierarchical linear modeling (HL) (also referred to as multilevel modeling, mixed modeling, and random coefficient modeling) is a statistical analysis that many researchers are becoming interested in. revious publications on HL have provided detailed information on how to perform the analysis (e.g., Raudenbush, Bryk, Cheong, Congdon, & du Toit, 2011; Woltman, Feldstain, ackay, & Rocchi, 2012). Yet there is relatively little information to help researchers decide whether HL applies to their data and research question, and how to choose between HL and alternative methods of analyzing the data. The purpose of this tutorial is to review some of the considerations that are helpful in answering these questions. I will focus specifically on the analyses that can be carried out by the software called HL7 (Raudenbush, Bryk, & Congdon, 2011). HL applies to randomly selected groups HL applies when the observations in a study form groups in some way and the groups are randomly selected (Raudenbush & Bryk, 2002). There are various ways of having grouped data. For example, there may be multiple time points per person and multiple persons these data are grouped because multiple time points are nested within each person. There may be multiple people per organization and multiple organizations, such that people are nested within organizations. There can even be multiple organizations per higher-order group, such as schools nested within cities. It is possible to have a grouping hierarchy with 2, 3, or 4 levels. An example of a four-level hierarchy is multiple students per school, multiple schools per city, multiple cities per county, and multiple counties here students are the Level 1 units, schools are the Level 2 units, cities are the Level 3 units, and counties are the Level 4 units. In this tutorial, for the sake of simplicity, I will focus primarily on two-level hierarchies. As noted above, HL applies to the situation when the groups are selected at random, i.e., when they represent a random factor rather than a fixed factor. For example, if a study has ten schools (with multiple students in each school), then schools are a random factor if they are randomly selected and the aim is to generalize to the population of all schools; in contrast, schools are a fixed factor if the researcher specifically wanted to draw conclusions about those ten schools, and not about schools in general (and the analysis then becomes an ANOVA). HL is an expanded form of regression HL is essentially an expanded form of regression. In most HL analyses, there is a single dependent variable, though a multivariate option exists as well within the HL7 software; the dependent variable can be quantitative and normally distributed, or it can be qualitative or non-normally distributed. In this tutorial, I will focus on the case of a single dependent variable that is normally distributed. Suppose the data set consists of 100 participants, studied at 50 time points each. Roughly speaking, HL 13

2 Figure 1 Sample two-level Hierarchical Linear odel. obtains what is called a Level 1 (or within-group) regression equation for each participant, based on that individual s 50 time points (for a total of 100 equations); the Level 1 equation may have one or more Level 1 independent variables (i.e., independent variables measured at each time point), or it may have no independent variables (the same set of independent variables must be used in all Level 1 equations); the dependent variable must be measured at each time point. Like any regression, the Level 1 equation for a given individual summarizes their data across 50 time points into just a few coefficients: an intercept (which equals the participants mean score on the dependent variable if the researcher uses what is called group mean centering for each Level 1 independent variable, a common procedure), and a slope for each of the Level 1 independent variables. Each of these coefficients the intercept and possibly some slopes then serves as the dependent variable in a Level 2 (or between-group) regression equation; for example, if there are two independent variables in the Level 1 equations, there will be three regression equations at Level 2, one predicting the Level 1 intercept, one predicting the Level 1 slope for one Level 1 independent variable, and the other predicting the slope for the other Level 1 independent variable. Each Level 2 equation has an intercept (which equals the mean intercept or slope across all participants if the researcher uses what is called grand mean centering for each Level 2 independent variable, a common procedure), and it may have one or more Level 2 independent variables (i.e., independent variables measured just once for each participant). For example, suppose again that there are 100 participants, with 50 time points each, the dependent variable is state well-being (s_wbeing HL truncates variable names to eight characters, so you might as well create short names to begin with), the Level 1 independent variable is state autonomy (s_auton), and the Level 2 independent variable is trait extraversion (t_extrav). Below is what the regression equation looks like at Level 1. (Note that e is the error term, indicating that the observed state well-being score at a given time point may differ from the well-being score predicted for that person based on the regression equation; e is always present in the Level 1 equation.) The intercept (π 0) and the slope (π 1) values will differ from participant to participant. If state autonomy is group mean centered, the π 0 conveniently equals the mean well-being score across all time points for a given participant, and thus provides an estimate of the participant s trait level of well-being. Below is what the regression equations look likes at Level 2. (Note that in HL, you can choose whether or not to include the error term r 0 and/or r 1; if the error term r 0 is included, this implies that the intercept π 0 is assumed to differ from person to person; if the error term r 1 is included, this implies that the slope π 1 is assumed to differ from person to person.) Figure 1 shows a screen shot of how the model would appear in HL. Each equation at Level 2 is a summary across all 100 participants, and each of the four coefficients (those indicated with the letter β) is tested to determine if it differs significantly from zero. If trait extraversion is grand mean centered, β 00 conveniently equals the mean well-being score across all time points and across all participants, called the grand mean, and thus provides an estimate of the average participant s trait level of well-being. The β 10 value provides the average π 1 value across all participants (assuming the Level 2 independent variable(s) is/are grand mean centered). If β 10 is statistically significant, then on average across 14

3 participants, state autonomy significantly relates to state well-being. The β 01 value gives the relationship between trait extraversion and trait well-being (assuming group mean centering was used). Finally, if β 11 is significantly different from zero, this indicates that trait extraversion moderates the strength of the relationship between state autonomy and state wellbeing (this moderation is also called a cross-level interaction, since trait extraversion at Level 2 is interacting with state autonomy at Level 1; it is certainly possible to have an interaction between independent variables at the same level, but these are product terms that must be created in the data set before importation into HL). When HL is superior to regular regression In the past, before HL was developed, people simply used a single regular regression for grouped data either what is called a Level 1 regression or what is called a Level 2 regression. Suppose there are 100 participants and 50 time points per participant, with the variables at each level as discussed before. A Level 1 regression can be used when the researcher is only interested in relationships at Level 1 (e.g.., does state autonomy relate to state well-being), and it involves simply running a regression with a sample size of all 5000 data points as if they came from 5000 independent participants. A Level 2 regression can be used when the researcher is only interested in relationships at Level 2 (e.g., does trait extraversion relate to trait well-being), and it involves running a regression on 100 data points, one per participant, after computing the mean well-being score for each participant. The question is: When are these regular regressions problematic, making HL a preferable choice? When HL is superior to Level 1 regression the problem of inflated Type I error A Level 1 regression treats data from 100 participants as if it were data from 5000 independent participants. Therein lies the problem. This can lead to a large inflation of Type I error, since the statistical significance of a result depends on sample size. HL deals with this problem by basing its sample size for inferential statistics on the number of groups (100 in this example), not on the total number of observations (5000 in this example). The HL approach has a drawback of its own, as the reader might guess. HL tends to be on the conservative side when testing relationships at Level 1, i.e., it has less power than a Level 1 regression would. There usually is some degree of dependence among the observations from a given group, however, and it is usually advisable to apply an analysis for grouped data, such as HL. Only if there is no dependence is it appropriate to conduct a Level 1 regression. It is possible to determine the degree of within-group dependence in HL by testing whether there is variance in the Level 1 intercept across groups if it does not vary, an analysis for grouped data is not necessary and one can use a Level 1 regression, though one may still want to proceed with an analysis for grouped data on theoretical grounds or for consistency with other analyses that are being conducted. Alternatively, an Intraclass Correlation Coefficient (ICC) smaller than 5% suggests that an analysis for grouped data is unnecessary (Bliese, 2000). The ICC is the proportion of the total variance in the dependent variable (which is the sum of the between-group variance and the within-group variance) that exists between groups. When HL is superior to Level 1 regression the value of differentiating Levels 1 and 2 In addition to the reduction of Type I error, there is also a conceptual reason to use HL instead of Level 1 regression whenever there is a significant variance in coefficients across groups. HL allows the researcher to separate within-group effects from between-group effects, whereas a Level-1 regression blends them together into a single coefficient. For example, in one study, I ran an experiencesampling study with about 100 participants and about 50 time points per participant (Huta & Ryan, 2010). At Level 1, I measured eudaimonia (the pursuit of excellence) and hedonia (the pursuit of pleasure). When I analyzed the data properly, using HL, I obtained the following results. At Level 1, so at a given moment in time, a person s degree of state eudaimonia and state hedonia correlated negatively, around -.3. Thus, if a person is momentarily striving for excellence, they are probably not simultaneously striving for pleasure. However, at Level 2, so at the trait level, a person s average degree of eudaimonia over the 50 time points and their average degree of hedonia over the 50 time points actually correlated positively, about.30! Thus, if a person often strives for excellence, they also tend to often strive for pleasure. (The latter analysis was performed by choosing one variable, say 15

4 hedonia Correct HL Level 2 slope i.e., slope between people Correct HL Level 1 slopes i.e., slopes within people eudaimonia Incorrect Level 1 Regression slope mixing between- & within-person slopes Figure 2 How two variables can have a negative relationship at the within-group level but a positive relationship at the between-group level in Hierarchical Linear odeling. hedonia, as the dependent variable for HL, and then at Level 2 using each person s mean eudaimonia score as the independent variable, with the mean eudaimonia scores being computed for each participant before running the HL in other words, HL can compute the mean score on the dependent variable for each person, but the mean score on the independent variable has to be computed person by person prior to running HL). If the data had simply been analyzed using a Level-1 regression, the correlation between eudaimonia and hedonia would be -.10, which is part way between -.3 and +.3 and tells us little about the true correlation at Level 1 or at Level 2. Figure 2 provides an illustration of how the correct slopes obtained through HL at Levels 1 and 2 can be in opposite directions from each other, and how the Level 1 regression slope is a blend of the two slopes obtained through HL. Each dotted ellipse represents data across multiple time points for each participants (only three participants are shown). The thin dotted lines running through the dotted ellipses are the lines of best fit for each participant. The thick dotted line is the mean of all the thin dotted lines across participants, and corresponds to the Level 1 or within-person correlation of -.3 I obtained using HL. The thin solid ellipse encompasses all of the data combined across participants, and the thin solid line is the line of best fit through this data and corresponds to the somewhat uninformative correlation of -.1 obtained through a Level 1 regression. The thick solid line corresponds to the Level 2 or between person correlation of +.3 obtained using HL (or using a Level 2 regression), and is the line of best fit through the center points (called centroids) of the ellipses for each participant, which are indicated by large dots. When HL is superior to Level L 2 regression Let me continue with the example of 100 participants and 50 time points per participant, and both eudaimonia and hedonia being measured at each time point. Recall that a Level-2 regression involves taking the mean dependent variable score and the mean independent variable score across the 50 time points for each participant, which produces just 100 observations in total, and then running a regular regression on those means. The results of a Level 2 regression and of HL will be the same if each participant has the same number of time points and no missing data (and the same variance in the dependent variable). But one lovely feature of HL is that it allows the researcher to have different numbers of observations per group (i.e., per participant in this example) and furthermore gives greater weight to the groups with more observations (and less variance), which produces slightly more accurate estimates of population values. The greater the differences in sample size (and variance) across groups, the greater the advantage of HL relative to Level-2 regression. This benefit is subtler and less crucial than the advantage of HL over Level 1 regression, but it is 16

5 Figure 3 How Functional Data Analysis models the fluctuations a variable undergoes over time. worth being aware of. When it is appropriate to use HL on data from dyads When comparing HL with regular regression, I would like to also make a comment about dyads. Dyads always have two members per group, e.g., husband and wife, caregiver and patient, coach and athlete. It is a common assumption that all research on dyads should be analyzed using HL. This is often true but not always. HL only applies when exactly the same set of variables the dependent variable, or the dependent variable and some Level 1 independent variables is measured in all members of the group, i.e., in both members of the dyad. For example, HL applies when the same marital satisfaction questionnaire is given to both the husband and the wife. However, if different variables have been assessed in the two members of the dyad for example, if the dependent variable assessed in the caregiver is burnout, but the dependent variable assessed in the patient is depression then HL does not apply and one would simply run a regular regression (or some other analysis) to predict caregiver burnout, and a separate analysis to predict patient depression. Different analyses for grouped data HL is not the only method available for dealing with grouped data. The two most common alternatives are structural equation modeling (SE) (or its simpler version, path analysis), and a general linear model (GL) with a repeated-measures variable, which I will simply refer to as repeated-measures. Examples of the latter include: mixed design ANOVA/GL with a repeated-measures/within-subjects variable that is analogous to the dependent variable measured repeatedly at Level 1 in HL, and one or more between-subjects variables that are analogous to the Level 2 independent variables in HL; repeatedmeasures ANOVA/GL with a repeatedmeasures/within-subjects variable; and a pairedsamples t-test, which has only two repeated measures. A less well-known alternative is functional data analysis (FDA), and there are others still, such as growth mixture modeling (G). Let me outline FDA and G only briefly, just enough to make the reader aware of these options. I will then discuss repeated measures and SE (and path analysis) in more detail, describing how the same research question would be addressed with these methods as well as HL, and listing criteria that can help a researcher choose between HL, repeated measures, and SE. Functional data analysis Developed by Ramsay and Silverman (2002, 2005), FDA is used specifically for longitudinal data. It is more flexible than other approaches in that it models the precise pattern of fluctuations that a variable undergoes over time, and every individual/entity can have a different pattern. This makes FDA more flexible than the other approaches I am comparing it with, which assume that a single function (e.g., a straight line, a quadratic curve, a cubic curve, exponential decay) can represent the entire span of data points, and that all individuals can be represented by the same function. For example, Figure 3 shows the pattern of depression scores over the course of therapy for four individuals. A smoothing process can then be applied, to a degree gauged by the researcher, in hopes of eliminating minor fluctuations that are likely to represent random noise, and retaining major fluctuations that are likely to represent a true signal. 17

6 Figure 4 How to set up the Level 1 and Level 2 data sets for odel A in SSS prior to using the HL software for Hierarchical Linear odeling. The entire pattern for each individual can then be used in various analyses. For example, analogous to an independent-samples t-test, it is possible to test whether two groups of individuals (such as those in cognitive therapy versus those in behavioral therapy) differ significantly in terms of their mean score at a given therapy session, or even in terms of their slope or rate of improvement (referred to as the velocity of the curve at that time point). Analogous to a regression, it is possible to test whether fluctuations in one variable over time (such as social support) predict later fluctuations in another variable (such as well-being). A principal components analysis can also be performed on the patterns to see at what time points individuals/entities differ most widely from each other for example, when studying depression scores over the course of therapy, the greatest spread in scores occurs during the last few therapy sessions, since some clients continue to get better while others have trouble with therapy termination and their symptoms get worse. Growth ixture odeling Unlike HL, repeated-measures, SE, and FDA, which are variable-centered approaches, G is a person-centered approach (Jung & Wickrama, 2008). Variable-centered approaches focus on relationships among variables (another example is factor analysis). erson-centered approaches, such as G and cluster analysis, focus on similarities between individuals and aim to classify participants into groups based on their responses across a set of variables. G is used for longitudinal data, and it analyzes the trajectories of different individuals to determine whether there are subgroups within which individuals have similar trajectories (Wang & Bodner, 2007). For example, when analyzing depression scores over the course of therapy, G might indicate that there are two groups of individuals those whose scores progressively improve, and those whose scores remain about the same. Latent class growth analysis is a special case of G which assumes that all individuals within a given group have exactly the same trajectory, rather than allowing for variability within groups the way G does (Jung & Wickrama, 2008). How to set up the same model in HL, Repeatedmeasures, and SE odel A Suppose a two-level data set with three time points per participant has state well-being as the dependent variable at Level 1, and trait extraversion as the independent variable at Level 2 (suppose there are no Level 1 independent variables). Suppose the researcher wishes to test whether there is a significant link between extraversion and well-being. Setting up the model in HL. rior to importing the data into HL (i.e., prior to creating the mdm, the multivariate data matrix), there would be one data set for the Level 1 data (with one line per time point) and a separate data set for the Level 2 data (with one line per participant), as shown in Figure 4 if one is using SSS (IB Corp., 2011a). 18

7 Figure 5 How to set up the analysis for odel A using the HL software for Hierarchical Linear odeling. Figure 6 How to set up the data set for odels A and B in SSS prior to Repeated-measures In HL, the analysis would then be set up as shown in Figure 5. (The error term u 0 is kept in the model, indicating that the intercept of state well-being is assumed to show some variance from person to person even after controlling for the role of trait extraversion, a reasonable assumption to start out with until there is evidence to the contrary.) On the output, to determine whether extraversion relates to well-being, one would see whether the coefficient β 01 is statistically significant. Setting up the model in repeated measures. For repeated measures, there would simply be one data set (with one line per participant), as shown in Figure 6 if one is using SSS (Lacroix& Giguère, 2006). The three variables in the data set that represent state well-being at the three time points would then be used to create the within-subjects factor/variable (which might be called time_point and which one would designate as having 3 levels), and the one variable in the data set that represents trait extraversion would be the between-subjects covariate. In other words, if one were using SSS, the analysis would be run as shown in Figure 7 (for guidelines on how to perform various repeated-measures models, see Tabachnick & Fidell, 2007). On the output, to determine whether extraversion relates to well-being, one would see whether the test of the between-subjects effect for extraversion was statistically significant. Setting up the model in SE. rior to importing the data into an SE software such as AOS (IB Corp., 2011b), the data would again be set up as shown in Figure 6 in other words, there would be one line per participant/group (for guidelines on running SE, see Arbuckle, 2011; Kline, 1998). (I focus here on AOS because of its wide use and availability, especially since it has become associated with SSS, though other software such as LISREL and ES are used often in the multilevel case). The model would then be set up as shown in Figure 8 if one is using AOS. Technically, Figure 8 is a path analysis rather than a structural equation model, given 19

8 Figure 7 How to set up the analysis for odel A using SSS for Repeated-measures analysis. that it examines the relationships between measured variables (indicated with rectangles), and not between latent variables which are factors extracted from multiple measured variables (which would be indicated with ellipses). Notice that all three regression coefficients are constrained to be equal, since they have all been assigned the same name a, so that the analysis parallels HL where a single value is obtained for the link between extraversion and well-being; similarly, all of the error terms have received the same label e1. Alternatively, the researcher may allow the regression coefficients and error terms to vary, to see if the Level 2 independent variable has a different impact on well-being at each time point, a feature not available in HL or repeated-measures. On the output, to determine whether extraversion relates to well-being, one would see whether the regression coefficient a was statistically significant. How to set up the same model in HL, Repeatedmeasures, and SE odel B Now suppose a two-level data set with three time points per participant has state well-being as the dependent variable at Level 1, and suppose the researcher wishes to test whether there is a linear increasing trend in well-being over time. Setting up the model in HL. rior to importing the data into HL, one would need to set up a variable called time in the Level 1 data set, and assign it the values 1, Figure 8 How to set up the analysis for odel A using AOS for ath Analysis. 2, and 3 for time points 1, 2, and 3 (assuming they were equally spaced), as shown in Figure 9. The Level 2 data set is also shown in Figure 9 (and is the same as in Figure 4). In HL, the analysis would then be set up as shown in Figure 10, if one assumes the intercept and slope will vary from person to person (a good assumption to begin with, until there is evidence to the contrary). On the output, to determine whether there is a linear increasing trend over time, one would see whether the coefficient β 10 is positive and statistically significant. In other words, one would see whether there is a relationship between time and state wellbeing, on average across participants. Setting up the model in repeated measures. For repeated measures, the data set in Figure 6 would be used as is. The three variables that represent state well-being would then be used to create the within-subjects variable, as shown in Figure 11. On the output, to determine whether there is a linear trend over time, one would see whether the within-subjects contrast for time_point is statistically significant. One would also need to check the descriptive statistics or a plot, to see if the linear trend was indeed increasing rather than decreasing. Setting up the model in SE. For SE, the data structure shown in Figure 6 would be used as is. When using the AOS software to run the SE, the model would be set up as shown in Figure 12 (which is 20

9 Figure 9 How to set up the Level 1 and Level 2 data sets for odel B in SSS prior to using the HL Software for Hierarchical Linear odeling. called a latent difference score analysis e.g., Hawley, Ho, Zuroff, & Blatt, 2007; cardle, 2001). The terms in ellipses would be created by the researcher, and the two paths from Sn would be constrained to be equal, by being assigned the same coefficient a. (The d values are the latent and error-free variables assumed to produce the measured variables; the delta d values are the differences in the d values from one time point to the next, being equal to the later time point minus the earlier time point, thus the term latent difference score analysis; Sn is the latent slope assumed to underlie the delta d values; and the regression coefficients a are fixed to be equal typically all set to 1 to reflect the expectation that change over time will be linear for each participant.) On the output, to determine whether there is an increasing linear trend over time, one would see whether the coefficient a is positive and statistically significant. Choosing between HL, Repeated-measures, and SE Let me now review a variety of criteria that come into play when choosing a method for analyzing grouped data. The focus will be on HL, repeated-measures, and SE, as these are the most commonly used methods. Sometimes more than one method is possible, but I will emphasize the situations where one method is preferable to another. In reading the criteria below, it will become clear that I was quite selective in creating odels A and B above to compare across HL, repeated measures, and SE. Only certain models and data sets can be analyzed using all three methods. It will also become clear that choosing an acceptable method is a complex process, with many considerations to take into account. Some considerations are rigid and can entirely rule out a method, whereas other considerations are more flexible and the ultimate selection of method will rely on the judgment of the researcher. When the hierarchy has three or more levels Hierarchies with three or more levels are typically analyzed using HL, though if sample sizes are small at all the lower levels of a hierarchy, it may also be feasible to use SE. Repeated-measures only applies to two-level hierarchies. When sample size differs from group to group Of the three analyses, only HL can be used when sample size differs from group to group at any of the lower levels of a hierarchy. When there is missing data at Level 1 issing data at Level 1 can be handled by HL, which can simply work with the data it is given, if the researcher chooses the delete when running analyses option when importing data into HL. (There is also an option to delete an entire Level 2 group if it has any missing data at Level 1, if the researcher chooses the delete when making mdm option). SE requires that 21

10 Figure 10 How to set up the analysis for odel B using the HL Software for Hierarchical Linear odeling. all data be present at all levels of the hierarchy, and can impute missing data as a part of the analysis. Repeatedmeasures deletes the entire Level 2 group if it has any data missing at Level 1. When there is missing data at a higher level of the hierarchy When there is missing data for a group (e.g., for a participant) at a higher level of a hierarchy, HL deletes all information regarding that group at the level of the hierarchy where the data is missing and also at all lower levels of the hierarchy. Thus, one should consider performing data imputation at the higher level where the data is missing before proceeding to HL. As noted above, SE requires that all data be present at all levels of the hierarchy, and can impute missing data as a part of the analysis. Repeated-measures deletes the entire Level 2 group if it has any data missing at Level 2. When group members are distinguishable Observations at a lower level of the hierarchy are either distinguishable or indistinguishable. When observations are distinguishable, they can be ordered in some kind of consistent non-arbitrary way for example, when the first observation in each group represents time 1, the second observation represents time 2, and so on; or when the first observation in each group represents husbands, and the second observation represents wives. When observations are indistinguishable, they do not fall in any natural order for example, when the researcher is simply studying multiple organizations with multiple employees within each organization, and there is no sequence to the employees, so that employee 1 in organization A does not correspond in any way to employee 1 in organization B. HL can be used whether observations at the lower levels are distinguishable or indistinguishable, though if observations at a given level are distinguishable, the researcher typically includes an index variable in the Figure 11 How to set up the analysis for odel B using SSS for Repeated-measures analysis. analysis which represents the sequence of observations (e.g., a variable called time with the values 1, 2, 3, etc.). SE is primarily used when observations are distinguishable, though there is a somewhat complicated procedure that can be used for indistinguishable observations. Repeated-measures is primarily used when observations are distinguishable, though the multivariate alternative to repeated-measures can be used when observations are indistinguishable. When there are independent variables at a lower level of the hierarchy In SE, there is no absolute limit on the number of independent variables that can be incorporated into a lower level of the hierarchy (though an overly complex model can raise a number of difficulties, such as insufficient degrees of freedom, and poor fit of fit indices that penalize model complexity). In repeated-measures, the only Level 1 independent variable that is built into the analysis is time or repeated measure, and additional independent variables cannot be added. In HL, the researcher needs to be careful about the number of independent variables include at a lower level. At a given level, as in any regression where one wishes to carry out inferential statistics, HL requires more degrees of freedom than there are coefficients to 22

11 variable. Researchers may vary in their comfort level with this practice of boosting the number of measurements of the dependent variable, but the practice is more acceptable to the degree that the items or subscales of the dependent variable measure the same concept if their content is diverse, the practice is harder to justify. Naturally, the strategy is only possible if the dependent variable is a multi-item scale. When there are independent variables at the highest level of the hierarchy Figure 12 How to set up the analysis for odel B using AOS for ath Analysis. be estimated (in at least some of the groups, though not necessarily all). Consider the Level 1 equation below: All three analysis HL, repeated-measures, and SE can handle any number of independent variables at the highest level of the hierarchy (within reason the number of independent variables should not take up a large fraction of the degrees of freedom at that level, and with too many independent variables, one starts to run into the problem of over-fitting, i.e., the equation starts modeling a lot of the noise rather than primarily the signal in the data). When one wishes to test cross-level interactions There are three coefficients in this equation the intercept π 0 and the two slopes π 1 and π 2 and therefore a minimum of four observations is required. Another way of summarizing this is as follows: if there are K independent variables, HL requires K + 2 observations. Thus, if a given level of the hierarchy consists of dyads, there are only two observations per group and there is no room for independent variables at that level! When there are too few observations for the number of independent variables at Level 1, some researchers artificially increase the number of observations (e.g., aguire, 1999) by treating individual items or subscales of items on the dependent variable measure as if they were separate measurements of the dependent variable. For example, if a researcher has dyads at Level 1 and the dependent variable is a four-item scale, the researcher could treat each of the four items as if it were a separate observation of the dependent variable, which would artificially boost the sample size at Level 1 to eight observations per dyad. Alternatively, if a researcher has dyads at Level 1, the dependent variable is an 18-item scale, and one wants to boost the sample size to six observations per dyad, one can create three parallel subscales of the dependent variable (as highly intercorrelated as possible), and treat each of them as if it were a separate measurement of the dependent Suppose a researcher wishes to test a cross-level interaction, such that trait extraversion moderates the effect of state autonomy on state well-being. This can easily be tested in HL, and the software itself will create the interaction term, the researcher does not need to create it in the data beforehand. SE can also test a cross-level interaction, but the researcher needs to create all the interaction terms in the data set before analysis (unless the higher level independent variable is a qualitative one, such as gender, in which case the SE model can be run once for each gender, and differing results for the two genders indicate an interaction), and SE quickly begins to consume many degrees of freedom as the number of observations at the lower level increases. Both HL and SE can test interactions between two independent variables in non-adjacent levels of a hierarchy, such as Levels 1 and 3, as well as interactions between independent variables at three or more levels (though SE is likely to consume even more degrees of freedom than for two-way interactions). In Repeated-measures, the only cross-level interaction possible is one between the repeated measure variable at Level 1 (e.g., time_point) and a between-subjects variable at Level 2 (e.g., t_extrav). In repeated-measures, since the only independent variable possible at Level 1 is time or repeated measure, the only cross-level interaction possible is an 23

12 interaction between time/repeated measures and a between-person independent variable, which is tested directly by the software, the researcher does not need to create an interaction term in the data. Figure 13 Examples of models that are more complex than a single regression, and which are therefore frequently best suited for structural equation modeling or path analysis. When one wishes to test same-level interactions Both HL and SE can test interactions at the same level, but the interaction terms must be created in the data before analysis. Repeatedmeasures can only test interactions at Level 2, since it cannot have multiple independent variables at Level 1. When the model does not have a multiple regression structure Both HL and repeated-measures require a multiple regression structure, such that one or more independent variable(s), and possibly some interactions between them, all directly predict a single dependent variable (or a composite of dependent variables). For any model more complex than a multiple regression, SE is most often used. For example, SE is appropriate when a variable in the model predicts more than one other variable, or when a variable serves both as an outcome and a predictor. Figure 13 shows examples of structures that are more complex than a single multiple regression (the first model is a path analysis of what is called an actor-partner interdependence model, which is commonly used in dyad research, and in this case the focus is only on relationships between variables at Level 1 the individual person rather than Level 2 the patientcaregiver dyad; the second model is the structural equation modeling version of the first model, where each concept in an ellipse is a factor extracted from three measured variables; the third model is a path analysis of a mediation where the independent variable is at Level 2 and the mediator and outcome are at Level 1, and where the regression coefficients and error terms could also be allowed to vary). HL can be used to analyze some complex models, but the model must first be broken down into pieces that each have a regression structure, with each piece analyzed separately. Furthermore, in a given piece of the model, the predictors need to be at the same level of the hierarchy as the outcome or at a higher level of the hierarchy, but not at a lower level; this restriction does not apply to 24

13 SE. The set of complex models that repeated-measures could apply to is extremely restricted. It would again require that the model be broken into pieces with a regression structure. Furthermore, the last regression in a causal chain could only have Level 2 variables predicting a Level 1 outcome (unless there is one predictor and it is time point or repeated measure), and all preceding steps in the causal chain would be analyzed using regular regressions with Level 2 variables predicting a Level 2 outcome. When the sample size is too small (or too large) Restrictions in sample size at one or more levels of the hierarchy may also push the researcher away from certain analysis options. Ideally, one would use a power analysis software to determine the sample size required at each level to achieve adequate power. (One software applicable to HL is Optimal Design, which is available for free on the internet, at design_software; one software that is applicable to repeated-measures is G*ower, also available for free on the internet, at power analysis for SE is less well established.) Nevertheless, there are some general guidelines that can help a person determine whether a sample size may be adequate for HL, repeatedmeasures, and/or SE. Sample size required at each level in HL. At a lower level of the hierarchy, HL can work with as few as two observations per group if there are no independent variables at that level, or as few as K+2 observations if there are K independent variables, as noted earlier. any papers have been published with just two, three, or four observations per group, so these sample sizes are quite common. To reach 80% power, however, group sizes often need to be 15 or greater (though 5 or 10 is sometimes sufficient). To be able to reliably report the regression equations of individual groups (which is done only occasionally, e.g., when there are multiple students per school and each school would like to know the results for that particular school), the required sample size is often closer to 50. At the highest level of the hierarchy, having an adequate sample size is more critical, since it is the sample size at this level which influences the power of the analyses. In other words, the number of groups at the highest level needs to be large enough for the results to be generalizable to the population of all groups. Some publications have had Level 2 sample sizes as small as 10 or 20, and in some research areas (e.g., work with animals or brain scans) this is difficult to avoid. To reach 80% power, however, sample sizes upward of 60 are typically required. Sample size required at each level in repeated-measures. At Level 1, repeated-measures can work with any sample size of 2 or greater. At Level 2, to reach 80% power, sample sizes above 60 are often required. Sample size required at each level in SE. At a lower level of the hierarchy, SE can have as few as 2 observations per group, and does not require that there be K+2 observations for K independent variables, as noted earlier. If there are too many observations, the SE model will use too many degrees of freedom unless corresponding regression coefficients are constrained to be equal and corresponding error terms are constrained to be equal (as shown in the third model of Figure 14). At the highest level of the hierarchy, SE typically requires a larger sample size than does HL or repeated-measures. Recommendations vary widely, but most people would agree that 100 is a bare minimum, and the more common recommendation is 200 or more; alternatively, many people use Bentler and Chou s (1987) suggestion that there be at least 5 observations per free parameter. When the researcher seeks certain information in the output Thus far I have discussed considerations that relate to the nature of the data or the nature of the model to be tested, when choosing between HL, repeatedmeasures, and SE. These analyses also differ in the information that appears on the output, so I will now review some of these differences. I will not provide an exhaustive list of the differences, I will only focus on one major difference that is commonly of interest: HL provides information based on how coefficients differ across groups (i.e., how coefficients at one level of the hierarchy differ across units of a higher level of the hierarchy), whereas repeated-measures and SE provide information about how coefficients differ across repeated measures. A separate regression equation for r each group. Only HL provides a separate regression equation for each higher-level group for example, if there are multiple organizations and multiple employees within each 25

14 organization, HL can provide the regression equation across employees within a given organization. (This is an additional option that must be requested when running the analysis, and the coefficients of the equations can be obtained from an SSS file generated with the name resfil. ) HL actually provides several versions of these regression equations the Ordinary Least Squares (OLS) version simply provides the coefficients that would be obtained if a regular regression were run on the data within a given group; an Empirical Bayes version provides coefficients for a given group that are influenced by the average coefficients when summarizing across all groups, to the degree that the data for the given group are unreliable, i.e., based on a small sample size and/or highly variable; and a more fine-tuned Conditional Empirical Bayes version, which provides coefficients for a given group that are influenced by coefficients in groups that have similar characteristics to the group in question. Reliability of a coefficient to determine whether the regression equation for each group can be reported reliably. Only HL provides an estimate of the average reliability, across groups, of each lower level coefficient. Among other things, this reliability informs the researcher whether the regression equations of individual groups can be reported (the reliability ranges from 0 to 1, and values in the neighborhood of.80 suggest that individual regressions can be reported). Variance of a coefficient across groups. Only HL reports the variance of each coefficient across groups, and tests the significance of this variance (e.g., for a three-level hierarchy, the output indicates how much Level 1 coefficients vary across Level 2 units, how much Level 1 coefficients vary across Level 3 units, how much Level 2 coefficients vary across Level 3 units, and how much cross-level interactions between Level 1 and 2 independent variables vary across Level 3 units). Among other things, a significant variance in a coefficient across units of a higher level provides statistical justification for adding independent variables at the higher level in hopes of accounting for some of that variance. Correlations between coefficients across groups. Only HL provides the correlations between lower level coefficients across groups. For example, if a research has a two-level hierarchy where the dependent variable is depression severity and the Level 1 independent variable is time, a negative correlation between the Level 1 intercept and slope indicates that the more severe the depression, the faster/steeper the decrease in depression over time. A separate regression coefficient for each repeated measure. If a researcher wishes to know whether the slope of the relationship between a predictor and outcome differs across repeated measurements of that predictor and outcome, the researcher would require SE. For example, in the third diagram of Figure 14, a researcher may wish to determine whether the link between a feeling of autonomy and well-being differs for the oldest, youngest, and middle student in that case, the researcher would simply remove the letter b from each regression arrow and thereby allow the coefficient to vary. SE can also be used to determine whether a higher-level independent variable relates differently to the dependent variable for each repeated measure. For example, in Figure 14, this would be accomplished by removing the constraint a on the link between teacher autonomy support and each student s feeling of autonomy, thereby allowing the strength of the link to vary. Repeated-measures can similarly show how the link between a Level 2 independent variable and the dependent variable differs across repeated measures, if one requests the parameter estimates option. Variance of the dependent variable across repeated measures. If a researcher wishes to know whether the mean score on the dependent variable varies across repeated measurements, they would use repeatedmeasures (and leave out any Level 2 independent variables). The between-subjects effect for the intercept would indicate whether this variance is significant. Summary In sum, Hierarchical linear modeling (HL) applies to data that are grouped in some way, such as: multiple cities, with multiple schools within each city, and multiple students within each school. In this example, cities would be referred to as Level 3 of the hierarchy, schools would be referred to as Level 2, and students would be referred to as Level 1. In addition, HL applies when the groups (e.g., cities and schools) are randomly selected, such that the researcher s aim is to generalize to the results to the population of all groups. The HL method is an expanded form of regression, whereby a separate regression is obtained within each group, and the dependent variable is always measured at the lowest level of the hierarchy. The coefficients (intercept and slopes) from the within-group 26

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Manuscript Presentation: Writing up APIM Results

Manuscript Presentation: Writing up APIM Results Manuscript Presentation: Writing up APIM Results Example Articles Distinguishable Dyads Chung, M. L., Moser, D. K., Lennie, T. A., & Rayens, M. (2009). The effects of depressive symptoms and anxiety on

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

Structural Equation Modeling (SEM)

Structural Equation Modeling (SEM) Structural Equation Modeling (SEM) Today s topics The Big Picture of SEM What to do (and what NOT to do) when SEM breaks for you Single indicator (ASU) models Parceling indicators Using single factor scores

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

12/31/2016. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

12/31/2016. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

9.0 L '- ---'- ---'- --' X

9.0 L '- ---'- ---'- --' X 352 C hap te r Ten 11.0 10.5 Y 10.0 9.5 9.0 L...- ----'- ---'- ---'- --' 0.0 0.5 1.0 X 1.5 2.0 FIGURE 10.23 Interpreting r = 0 for curvilinear data. Establishing causation requires solid scientific understanding.

More information

Pitfalls in Linear Regression Analysis

Pitfalls in Linear Regression Analysis Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression

More information

Regression Discontinuity Analysis

Regression Discontinuity Analysis Regression Discontinuity Analysis A researcher wants to determine whether tutoring underachieving middle school students improves their math grades. Another wonders whether providing financial aid to low-income

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

MODELING HIERARCHICAL STRUCTURES HIERARCHICAL LINEAR MODELING USING MPLUS

MODELING HIERARCHICAL STRUCTURES HIERARCHICAL LINEAR MODELING USING MPLUS MODELING HIERARCHICAL STRUCTURES HIERARCHICAL LINEAR MODELING USING MPLUS M. Jelonek Institute of Sociology, Jagiellonian University Grodzka 52, 31-044 Kraków, Poland e-mail: magjelonek@wp.pl The aim of

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

TWO-DAY DYADIC DATA ANALYSIS WORKSHOP Randi L. Garcia Smith College UCSF January 9 th and 10 th

TWO-DAY DYADIC DATA ANALYSIS WORKSHOP Randi L. Garcia Smith College UCSF January 9 th and 10 th TWO-DAY DYADIC DATA ANALYSIS WORKSHOP Randi L. Garcia Smith College UCSF January 9 th and 10 th @RandiLGarcia RandiLGarcia Mediation in the APIM Moderation in the APIM Dyadic Growth Curve Modeling Other

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

CHAPTER TWO REGRESSION

CHAPTER TWO REGRESSION CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect

More information

Lesson 9: Two Factor ANOVAS

Lesson 9: Two Factor ANOVAS Published on Agron 513 (https://courses.agron.iastate.edu/agron513) Home > Lesson 9 Lesson 9: Two Factor ANOVAS Developed by: Ron Mowers, Marin Harbur, and Ken Moore Completion Time: 1 week Introduction

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

In this chapter, we discuss the statistical methods used to test the viability

In this chapter, we discuss the statistical methods used to test the viability 5 Strategy for Measuring Constructs and Testing Relationships In this chapter, we discuss the statistical methods used to test the viability of our conceptual models as well as the methods used to test

More information

10. LINEAR REGRESSION AND CORRELATION

10. LINEAR REGRESSION AND CORRELATION 1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

THE USE OF MULTIVARIATE ANALYSIS IN DEVELOPMENT THEORY: A CRITIQUE OF THE APPROACH ADOPTED BY ADELMAN AND MORRIS A. C. RAYNER

THE USE OF MULTIVARIATE ANALYSIS IN DEVELOPMENT THEORY: A CRITIQUE OF THE APPROACH ADOPTED BY ADELMAN AND MORRIS A. C. RAYNER THE USE OF MULTIVARIATE ANALYSIS IN DEVELOPMENT THEORY: A CRITIQUE OF THE APPROACH ADOPTED BY ADELMAN AND MORRIS A. C. RAYNER Introduction, 639. Factor analysis, 639. Discriminant analysis, 644. INTRODUCTION

More information

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special

More information

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data TECHNICAL REPORT Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data CONTENTS Executive Summary...1 Introduction...2 Overview of Data Analysis Concepts...2

More information

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Magdalena M.C. Mok, Macquarie University & Teresa W.C. Ling, City Polytechnic of Hong Kong Paper presented at the

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of

More information

Tutorial #7A: Latent Class Growth Model (# seizures)

Tutorial #7A: Latent Class Growth Model (# seizures) Tutorial #7A: Latent Class Growth Model (# seizures) 2.50 Class 3: Unstable (N = 6) Cluster modal 1 2 3 Mean proportional change from baseline 2.00 1.50 1.00 Class 1: No change (N = 36) 0.50 Class 2: Improved

More information

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

The Pretest! Pretest! Pretest! Assignment (Example 2)

The Pretest! Pretest! Pretest! Assignment (Example 2) The Pretest! Pretest! Pretest! Assignment (Example 2) May 19, 2003 1 Statement of Purpose and Description of Pretest Procedure When one designs a Math 10 exam one hopes to measure whether a student s ability

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Mixed Effect Modeling. Mixed Effects Models. Synonyms. Definition. Description

Mixed Effect Modeling. Mixed Effects Models. Synonyms. Definition. Description ixed Effects odels 4089 ixed Effect odeling Hierarchical Linear odeling ixed Effects odels atthew P. Buman 1 and Eric B. Hekler 2 1 Exercise and Wellness Program, School of Nutrition and Health Promotion

More information

C h a p t e r 1 1. Psychologists. John B. Nezlek

C h a p t e r 1 1. Psychologists. John B. Nezlek C h a p t e r 1 1 Multilevel Modeling for Psychologists John B. Nezlek Multilevel analyses have become increasingly common in psychological research, although unfortunately, many researchers understanding

More information

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Data Analysis in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Multilevel Data Statistical analyses that fail to recognize

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB Bryan Orme, Sawtooth Software, Inc. Copyright 009, Sawtooth Software, Inc. 530 W. Fir St. Sequim,

More information

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status Simultaneous Equation and Instrumental Variable Models for Seiness and Power/Status We would like ideally to determine whether power is indeed sey, or whether seiness is powerful. We here describe the

More information

Minimizing Uncertainty in Property Casualty Loss Reserve Estimates Chris G. Gross, ACAS, MAAA

Minimizing Uncertainty in Property Casualty Loss Reserve Estimates Chris G. Gross, ACAS, MAAA Minimizing Uncertainty in Property Casualty Loss Reserve Estimates Chris G. Gross, ACAS, MAAA The uncertain nature of property casualty loss reserves Property Casualty loss reserves are inherently uncertain.

More information

Multivariable Systems. Lawrence Hubert. July 31, 2011

Multivariable Systems. Lawrence Hubert. July 31, 2011 Multivariable July 31, 2011 Whenever results are presented within a multivariate context, it is important to remember that there is a system present among the variables, and this has a number of implications

More information

CHAPTER 3 METHOD AND PROCEDURE

CHAPTER 3 METHOD AND PROCEDURE CHAPTER 3 METHOD AND PROCEDURE Previous chapter namely Review of the Literature was concerned with the review of the research studies conducted in the field of teacher education, with special reference

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

The Use of Piecewise Growth Models in Evaluations of Interventions. CSE Technical Report 477

The Use of Piecewise Growth Models in Evaluations of Interventions. CSE Technical Report 477 The Use of Piecewise Growth Models in Evaluations of Interventions CSE Technical Report 477 Michael Seltzer CRESST/University of California, Los Angeles Martin Svartberg Norwegian University of Science

More information

Models of Parent-Offspring Conflict Ethology and Behavioral Ecology

Models of Parent-Offspring Conflict Ethology and Behavioral Ecology Models of Parent-Offspring Conflict Ethology and Behavioral Ecology A. In this section we will look the nearly universal conflict that will eventually arise in any species where there is some form of parental

More information

Chapter 11. Experimental Design: One-Way Independent Samples Design

Chapter 11. Experimental Design: One-Way Independent Samples Design 11-1 Chapter 11. Experimental Design: One-Way Independent Samples Design Advantages and Limitations Comparing Two Groups Comparing t Test to ANOVA Independent Samples t Test Independent Samples ANOVA Comparing

More information

Multifactor Confirmatory Factor Analysis

Multifactor Confirmatory Factor Analysis Multifactor Confirmatory Factor Analysis Latent Trait Measurement and Structural Equation Models Lecture #9 March 13, 2013 PSYC 948: Lecture #9 Today s Class Confirmatory Factor Analysis with more than

More information

CONTENTS OF DAY 4 NOTES FOR SUMMER STATISTICS INSTITUTE COURSE COMMON MISTAKES IN STATISTICS SPOTTING THEM AND AVOIDING THEM

CONTENTS OF DAY 4 NOTES FOR SUMMER STATISTICS INSTITUTE COURSE COMMON MISTAKES IN STATISTICS SPOTTING THEM AND AVOIDING THEM 1 2 CONTENTS OF DAY 4 I. Catch-up as needed II. Data Snooping 3 Suggestions for Data Snooping Professionally and Ethically (See Appendix) III. Using an Inappropriate Method of Analysis 5 NOTES FOR SUMMER

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Adam C. Carle, M.A., Ph.D. adam.carle@cchmc.org Division of Health Policy and Clinical Effectiveness

More information

Basic Concepts in Research and DATA Analysis

Basic Concepts in Research and DATA Analysis Basic Concepts in Research and DATA Analysis 1 Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...2 The Research Question...3 The Hypothesis...3 Defining the

More information

One-Way Independent ANOVA

One-Way Independent ANOVA One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.

More information

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer A NON-TECHNICAL INTRODUCTION TO REGRESSIONS David Romer University of California, Berkeley January 2018 Copyright 2018 by David Romer CONTENTS Preface ii I Introduction 1 II Ordinary Least Squares Regression

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL 1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Journal of Political Economy, Vol. 93, No. 2 (Apr., 1985)

Journal of Political Economy, Vol. 93, No. 2 (Apr., 1985) Confirmations and Contradictions Journal of Political Economy, Vol. 93, No. 2 (Apr., 1985) Estimates of the Deterrent Effect of Capital Punishment: The Importance of the Researcher's Prior Beliefs Walter

More information

Use of Structural Equation Modeling in Social Science Research

Use of Structural Equation Modeling in Social Science Research Asian Social Science; Vol. 11, No. 4; 2015 ISSN 1911-2017 E-ISSN 1911-2025 Published by Canadian Center of Science and Education Use of Structural Equation Modeling in Social Science Research Wali Rahman

More information

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick.

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick. Running head: INDIVIDUAL DIFFERENCES 1 Why to treat subjects as fixed effects James S. Adelman University of Warwick Zachary Estes Bocconi University Corresponding Author: James S. Adelman Department of

More information

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research Chapter 11 Nonexperimental Quantitative Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.) Nonexperimental research is needed because

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015 On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses Structural Equation Modeling Lecture #12 April 29, 2015 PRE 906, SEM: On Test Scores #2--The Proper Use of Scores Today s Class:

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

MEASURES OF ASSOCIATION AND REGRESSION

MEASURES OF ASSOCIATION AND REGRESSION DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MEASURES OF ASSOCIATION AND REGRESSION I. AGENDA: A. Measures of association B. Two variable regression C. Reading: 1. Start Agresti

More information

Assignment 4: True or Quasi-Experiment

Assignment 4: True or Quasi-Experiment Assignment 4: True or Quasi-Experiment Objectives: After completing this assignment, you will be able to Evaluate when you must use an experiment to answer a research question Develop statistical hypotheses

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

2 Types of psychological tests and their validity, precision and standards

2 Types of psychological tests and their validity, precision and standards 2 Types of psychological tests and their validity, precision and standards Tests are usually classified in objective or projective, according to Pasquali (2008). In case of projective tests, a person is

More information

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016 The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016 This course does not cover how to perform statistical tests on SPSS or any other computer program. There are several courses

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

An informal analysis of multilevel variance

An informal analysis of multilevel variance APPENDIX 11A An informal analysis of multilevel Imagine we are studying the blood pressure of a number of individuals (level 1) from different neighbourhoods (level 2) in the same city. We start by doing

More information

WELCOME! Lecture 11 Thommy Perlinger

WELCOME! Lecture 11 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression

More information

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete

More information

Sawtooth Software. The Number of Levels Effect in Conjoint: Where Does It Come From and Can It Be Eliminated? RESEARCH PAPER SERIES

Sawtooth Software. The Number of Levels Effect in Conjoint: Where Does It Come From and Can It Be Eliminated? RESEARCH PAPER SERIES Sawtooth Software RESEARCH PAPER SERIES The Number of Levels Effect in Conjoint: Where Does It Come From and Can It Be Eliminated? Dick Wittink, Yale University Joel Huber, Duke University Peter Zandan,

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University.

Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University. Running head: ASSESS MEASUREMENT INVARIANCE Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies Xiaowen Zhu Xi an Jiaotong University Yanjie Bian Xi an Jiaotong

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Introduction No measuring instrument is perfect. The most obvious problems relate to reliability. If we use a thermometer to

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Chapter 14: More Powerful Statistical Methods

Chapter 14: More Powerful Statistical Methods Chapter 14: More Powerful Statistical Methods Most questions will be on correlation and regression analysis, but I would like you to know just basically what cluster analysis, factor analysis, and conjoint

More information

RAG Rating Indicator Values

RAG Rating Indicator Values Technical Guide RAG Rating Indicator Values Introduction This document sets out Public Health England s standard approach to the use of RAG ratings for indicator values in relation to comparator or benchmark

More information

(2) In each graph above, calculate the velocity in feet per second that is represented.

(2) In each graph above, calculate the velocity in feet per second that is represented. Calculus Week 1 CHALLENGE 1-Velocity Exercise 1: Examine the two graphs below and answer the questions. Suppose each graph represents the position of Abby, our art teacher. (1) For both graphs above, come

More information

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS 17 December 2009 Michael Wood University of Portsmouth Business School SBS Department, Richmond Building Portland Street, Portsmouth

More information

Bayesian Tailored Testing and the Influence

Bayesian Tailored Testing and the Influence Bayesian Tailored Testing and the Influence of Item Bank Characteristics Carl J. Jensema Gallaudet College Owen s (1969) Bayesian tailored testing method is introduced along with a brief review of its

More information

Weight Adjustment Methods using Multilevel Propensity Models and Random Forests

Weight Adjustment Methods using Multilevel Propensity Models and Random Forests Weight Adjustment Methods using Multilevel Propensity Models and Random Forests Ronaldo Iachan 1, Maria Prosviryakova 1, Kurt Peters 2, Lauren Restivo 1 1 ICF International, 530 Gaither Road Suite 500,

More information

An Instrumental Variable Consistent Estimation Procedure to Overcome the Problem of Endogenous Variables in Multilevel Models

An Instrumental Variable Consistent Estimation Procedure to Overcome the Problem of Endogenous Variables in Multilevel Models An Instrumental Variable Consistent Estimation Procedure to Overcome the Problem of Endogenous Variables in Multilevel Models Neil H Spencer University of Hertfordshire Antony Fielding University of Birmingham

More information

Simple Linear Regression the model, estimation and testing

Simple Linear Regression the model, estimation and testing Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Published in Education 3-13, 29 (3) pp. 17-21 (2001) Introduction No measuring instrument is perfect. If we use a thermometer

More information

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments

Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Using Analytical and Psychometric Tools in Medium- and High-Stakes Environments Greg Pope, Analytics and Psychometrics Manager 2008 Users Conference San Antonio Introduction and purpose of this session

More information

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE 1. When you assert that it is improbable that the mean intelligence test score of a particular group is 100, you are using. a. descriptive

More information

The Regression-Discontinuity Design

The Regression-Discontinuity Design Page 1 of 10 Home» Design» Quasi-Experimental Design» The Regression-Discontinuity Design The regression-discontinuity design. What a terrible name! In everyday language both parts of the term have connotations

More information

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE ...... EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE TABLE OF CONTENTS 73TKey Vocabulary37T... 1 73TIntroduction37T... 73TUsing the Optimal Design Software37T... 73TEstimating Sample

More information

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results. 10 Learning Objectives Testing Means After reading this chapter, you should be able to: Related-Samples t Test With Confidence Intervals 1. Describe two types of research designs used when we select related

More information

Module 14: Missing Data Concepts

Module 14: Missing Data Concepts Module 14: Missing Data Concepts Jonathan Bartlett & James Carpenter London School of Hygiene & Tropical Medicine Supported by ESRC grant RES 189-25-0103 and MRC grant G0900724 Pre-requisites Module 3

More information

Technical Whitepaper

Technical Whitepaper Technical Whitepaper July, 2001 Prorating Scale Scores Consequential analysis using scales from: BDI (Beck Depression Inventory) NAS (Novaco Anger Scales) STAXI (State-Trait Anxiety Inventory) PIP (Psychotic

More information