How few countries will do? Comparative survey analysis from a Bayesian perspective

Size: px
Start display at page:

Download "How few countries will do? Comparative survey analysis from a Bayesian perspective"

Transcription

1 Survey Research Methods (2012) Vol.6, No.2, pp ISSN European Survey Research Association How few countries will do? Comparative survey analysis from a Bayesian perspective Joop Hox Utrecht University Rens van de Schoot Utrecht University Suzette Matthijsse Utrecht University and Dept of Public Health, University Medical Center, Rotterdam Meuleman and Billiet (2009) have carried out a simulation study aimed at the question how many countries are needed for accurate multilevel SEM estimation in comparative studies. The authors concluded that a sample of 50 to 100 countries is needed for accurate estimation. Recently, Bayesian estimation methods have been introduced in structural equation modeling which should work well with much lower sample sizes. The current study reanalyzes the simulation of Meuleman and Billiet using Bayesian estimation to find the lowest number of countries needed when conducting multilevel SEM. The main result of our simulations is that a sample of about 20 countries is sufficient for accurate Bayesian estimation, which makes multilevel SEM practicable for the number of countries commonly available in large scale comparative surveys. Keywords: Multilevel SEM, sample size, cross-national research, Bayesian estimation 1 Introduction International cross-cultural and other comparative surveys involve a number of analysis issues. Measurement instruments must often be translated into different languages, which raises the issue of measurement equivalence. Can we assume that these instruments measure the same constructs in the same way? We need to assess whether we have measurement equivalence, and if not we need to investigate how we may correct measures in order to achieve measurement equivalence. Next, the analysis focuses on examining relationships within and between countries (or other contexts). That is, relationships can be established at the individual level within each country, but in comparative research the central issue is often the question whether such relationships are the same or different across countries. Finally, if we establish differences between countries, the question is whether country characteristics can explain such differences. The classic approach to deal with these questions is structural equation modeling (SEM) using a multi-group analysis. This analysis method makes it possible to test equivalence of measurement models; special procedures for categorical data enable SEM to be used to estimate and test Item Response (IRT) models. Criteria for measurement equivalence were already formulated by Jöreskog (1971), for a review we refer to Vandenberg and Lance (2000), while for a discussion in the context of comparative surveys we refer to Harkness et al. (2010). If measurement equivalence may be Contact information: Joop Hox, Utrecht University, the Netherlands, j.hox@uu.nl assumed, multigroup SEM can be used to investigate the degree of equivalence of structural (substantive) models across countries. When the number of countries is large, multi-group SEM becomes unwieldy. The setups become complicated, especially if subtle differences in measurement properties must be included. The statistical model for the structural differences also becomes complicated. Multi-group SEM is a fixed effects model, which means that it takes each group or country as given and the set of countries as the complete universe to generalize to. Unless many equality constraints are imposed, SEM estimates a unique set of parameter values for each different country, which results in a large model. Multilevel modeling (MLM) offers a different approach. Multilevel modeling treats the countries as a sample from a larger population. Instead of estimating a different parameter value for each country, it assumes a (normal) distribution of parameter values and estimates its mean and variance. This makes MLM much more parsimonious than SEM when the number of countries increases. In addition, differences between countries can be modeled formally using country-level variables. For a general introduction to multilevel modeling, we refer to Goldstein (2011), Raudenbush and Bryk (2002) and Hox (2010). Multilevel modeling for comparative surveys has been discussed by Hox, de Leeuw and Brinkhuis (2010) and Van de Vijver, van Hemert and Poortinga (2008). We mention in passing that multilevel modeling of comparative survey data not only poses statistical questions, but also methodological questions about the design. The statistical model assumes random sampling at all levels, while the survey design in fact does not use sampling at the country level. We can still use multilevel modeling, but its use is based on 87

2 88 JOOP HOX, RENS VAN DE SCHOOT AND SUZETTE MATTHIJSSE the advantages of a model based approach where we can explicitly include country level explanatory variables and country level residual variation in the model, rather than a sample design based argumentation. We refer to Groves (1989) for a discussion of these two perspectives. When multigroup SEM is used, the number of countries is not a principled issue. Multigroup SEM can be used to compare any number of groups. If the number of groups is huge, there may be practical analysis issues, such as the capacity of the software or the computer (or even the interpretational capacity of the analyst), but there is no formal lower or upper limit on the number of groups. In multilevel analysis, the second level sample size (in comparative surveys generally the number of countries) is an issue. The second level sample size must be large enough to permit accurate parameter estimates and associated standard errors. Simulations have shown that multilevel regression modeling can be used with second-level samples as low as 20, provided that the interpretation focuses on the regression coefficients (Maas and Hox 2005). However, accurate estimation and testing of variances requires much larger sample sizes, Maas and Hox (2005) suggest 50 groups as a lower limit when variances are important. Structural equation modeling with latent variables relies on (co)variances, which suggests that for multilevel SEM even larger samples are needed for accurate estimation. Indeed, a simulation involving a two-level confirmative factor model shows that with fewer than 50 groups, the group level model parameters and their corresponding standard errors are not estimated with acceptable accuracy (Hox, Maas and Brinkhuis 2010). These simulations suggest that for accurate estimation at least 50 groups should be available. Meuleman and Billiet (2009) have carried out a simulation study directly aimed at the question how many countries are needed for accurate multilevel SEM estimation in comparative surveys. They specified within country sample sizes to follow the sample sizes typically achieved in the European Social Survey. The number of countries was varied from 20 to 100. The simulation model at both the individual and the country level is a confirmative one-factor model for four indicator variables, plus a structural effect predicting the factor from an exogenous observed variable. Meuleman and Billiet (2009) conclude that a sample of 20 countries is simply not enough for accurate estimation. They do not suggest a specific lower limit for the country level sample size; instead, they discuss how model complexity and goal of the analysis affect the country level sample size requirements. However, their simulation results indicate that if we require that the 95% confidence interval for country level factor loadings lies in fact between 90 and 99 percent, which corresponds to a bias of about 5%, we require at least 60 countries. For 60 countries, the empirical alpha level for a test that the structural effect equals zero is 0.083, which is acceptable. With 40 countries, the empirical alpha level is 0.103, which is not acceptable (cf. Boomsma and Hoogland 2001). The power for a medium size structural effect at the country level is with 60 countries, well below the value of 0.80 that Cohen (1988) recommends as a worth pursuing. In conclusion, Meuleman and Billiet confirm the suggestion that about 50 countries is the minimum sample size at the second level for accurate estimation in multilevel SEM. The sample size requirements suggested by the simulation studies reviewed above imply that for most comparative surveys the country level sample sizes are problematic. For instance, the European Social Survey round four (2008) includes 30 countries ( the third wave of SHARE ( ) includes 13 countries ( the 2007 wave of the mathematics survey TIMMS includes countries ( and the 2009 large scale educational assessment PISA sponsored by the OECD includes 65 countries ( These country level sample sizes suggest that only the larger collaborative comparative surveys involve enough countries to consider employing multilevel SEM, but the majority appears too small to employ multilevel structural equation modeling. Recently, Bayesian estimation methods have been introduced in structural equation modeling (Lee 2007). Bayesian estimation works well with lower sample sizes, and will not produce inadmissible parameter estimates such as negative variances. Bayesian methods generally imply prior information in the analysis, but when uninformative priors are used this has only a small effect on the resulting parameter estimates. The goal of the current paper is to examine how well Bayesian estimation deals with the problem of estimating parameters in a multilevel SEM model with a small sample size at the country level. The paper starts with an introduction of Bayesian estimation methods and the issues involved in a Bayesian multilevel SEM analysis. Next, it describes the simulation design which is patterned after Meuleman and Billiet (2009). Our simulation design explicitly studies the accuracy of the estimation method with very small numbers of countries. The results and their implications for the analysis of comparative surveys are discussed in detail. We provide a basic introduction of Bayesian statistics, but interested researchers could further refer to Lynch (2007) for an introduction to Bayesian estimation, and for technical details to Gelman, Carlin, Stern, and Rubin (2004). Bayesian structural equation modeling is discussed by Lee (2007) and Bayesian multilevel modeling by Hox (2010). In this paper we use the software Mplus (Muthén and Muthén ) because it is often used by applied researchers. For the technical implementation of Bayesian statistics in Mplus, see Asparouhov and Muthén (2010). 2. Estimation methods in multilevel SEM In this section we describe briefly different estimation methods for multilevel SEM, including Bayesian estimation. For a more elaborate accessible introduction we refer to Hox (2010), and for a statistical treatment we refer to Kaplan (2009). Multilevel SEM assumes sampling at both individual and country levels. The individual data are collected in a

3 HOW FEW COUNTRIES WILL DO? COMPARATIVE SURVEY ANALYSIS FROM A BAYESIAN PERSPECTIVE 89 p-variate vector Y ij (subscript i for individuals, j for groups). The data Y ij are decomposed into a between groups (Group level) component Y B = Y j, and a within groups (individual level) component Y W = Y ij Y j. These two components are orthogonal and additive, thus Y T = Y B + Y W. The population covariance matrices are also orthogonal and additive, thus T = B + W. Multilevel structural equation modeling assumes that the population covariance matrices B and W are described by distinct models for the between groups and within groups structure. Several approaches have been proposed to estimate the parameters of the multilevel SEM. Muthén (1989) suggests to approximate the full maximum likelihood solution by assuming equal group sizes, which leads to a limited information estimation method called MUML (for Muthén s Maximum Likelihood). A more accurate way to estimate a model for B and W is a Weighted Least Squares (WLS) method implemented in Mplus. Full maximum likelihood estimation for multilevel structural equation modeling requires to model the raw data. This minimizes the fit function given by F = N i=1 log N i + log(x i μ i ) 1 i=1 i (x i μ i ), (1) where the subscript i refers to the observed cases, x i to those variables observed for case i, and μ i and i contain the population means and covariances of the variables observed for case i. Mehta and Neale (2005) show that models for multilevel data, with individuals nested within groups, can be expressed as a structural equation model. The fit function (1) applies, with clusters as units of observation, and individuals within clusters as variables. Unbalanced data, here unequal numbers of individuals within clusters, are included the same way as incomplete data in standard SEM. The twostage approaches that model B and W separately (MUML and WLS) include only random intercepts in the between groups model, the full ML representation can incorporate random slopes as well (Mehta and Neale 2005). Maximum likelihood estimation assumes large samples, and relies on numerical methods to integrate out random effects. In comparison, Bayesian methods are reliable in small samples, and are better able to deal with complex models. The Bayesian approach is fundamentally different from classical statistics (Barnett 2008). In classical statistics, the population parameter has one specific value, only we happen to not know it. In Bayesian statistics, we express the uncertainty about the population value of a model parameter by assigning to it a probability distribution of possible values. This probability distribution is called the prior distribution, because it is specified independently from the data. After we have collected our data, this distribution is combined with the Likelihood of the data to produce a posterior distribution, which describes our uncertainty about the population values after observing our data. Typically, the variance of the posterior distribution is smaller than the variance of the prior distribution, which means that observing the data has reduced our uncertainty about the possible population values. More formally, let M be a statistical model with a vector of unknown parameters θ, for example regression parameters and correlations, and let Y be the observed data set with sample size n. In Bayesian estimation, θ is considered to be random and the behavior of θ under Y in such a Bayesian model can be described by p(θ Y, M) p(θ M) p(y θ, M) (2) where p(y θ, M) is the likelihood function, the information about the parameters in the data, p(θ M) is the prior distribution, the information about the parameters before observing the data, and p = (θ Y, M) is the posterior distribution, the information about the parameters after observing the data and taking the prior information into account. For the prior distribution, we have a fundamental choice between using an informative prior or an uninformative prior. An informative prior is a peaked distribution with a small variance, which expresses a strong belief about the unknown population parameter, and has a substantial effect on the posterior distribution. In contrast, an uninformative or diffuse prior serves to produce the posterior, but has very little influence. An example of an uninformative prior is the uniform distribution, which simply states that all possible values for the unknown parameter are equally likely. Another example of an uninformative prior is a very flat normal distribution specified with an enormous variance. Sometimes such a prior is called an ignorance prior, to indicate that we know nothing about the unknown parameter. However, this is not accurate, since total ignorance does not exist. All priors add some information to the data, but diffuse priors add very little information, and therefore do not have much influence on the posterior. For our analyses we used the default prior specifications of Mplus which uses uninformative priors. If the posterior distribution has a mathematically simple form, the known characteristics of the distribution can be used to produce point estimates and confidence intervals. However, in complex models the posterior is generally a complicated multivariate distribution, which is often mathematically intractable. Therefore, simulation techniques are used to generate random draws from the multivariate posterior distribution. These simulation procedures are known as Markov Chain Monte Carlo (MCMC) simulation. MCMC simulation is used to produce a large number of random draws from the posterior distribution, which is then used to compute a point estimate and a confidence interval (for an introduction to Bayesian estimation including MCMC methods see Lynch 2007). Typically, the marginal (univariate) distribution of each parameter is used. Given a set of initial values from a specific multivariate distribution, MCMC procedures generate a new random draw from the same distribution. Suppose that Z (1) is a draw from a target distribution f (Z). Using MCMC methods, we generate a series of new draws: Z (1) Z (2)... Z (t). MCMC methods are attractive because, even if Z (1) is not from the target distribution f (Z), if t is sufficiently large, in the end Z (t) is a draw from the target distribution f (Z). Having good initial values for Z (1) helps, because it speeds up the conver-

4 90 JOOP HOX, RENS VAN DE SCHOOT AND SUZETTE MATTHIJSSE (a) within (individual) level (b) between (country) level Figure 1. Path diagram for within (individual) and between (country) level gence on the target distribution, so the classical maximum likelihood estimates are often used as initial values for Z (1). The number of iterations t needed before the target distribution is reached is referred to as the burn in period of the MCMC algorithm. It is important that the burn in is complete. To check if enough iterations of the algorithm have passed to converge on the target distribution, several diagnostics are used. A useful diagnostic is a graph of the successive values produced by the algorithm. A different procedure is to start the MCMC procedure several times with widely different initial values. If essentially identical distributions are obtained after t iterations, we decide that t has been large enough to converge on the target distribution (Gelman and Rubin 1992). An additional issue in MCMC methods is that successive draws are dependent. Depending on the distribution and the amount of information in the data, they can be strongly correlated. Logically, we would prefer independent draws to use as simulated draws from the posterior distribution. One way to reach independence is to omit a number of successive estimates before a new draw is used for estimation. This process is called thinning. To decide how many iterations must be deleted between two successive draws, it is useful to inspect the autocorrelations between successive draws. If the autocorrelations are high, we must delete many estimates. Alternatively, since each draw still gives some information, we may keep all draws, but use an extremely large number of draws. The mode of the marginal posterior distribution is an attractive point estimate of the unknown parameter, because it is the most likely value, and therefore the Bayesian equivalent of the maximum likelihood estimator. Since the mode is more difficult to determine than the mean, the mean of the posterior distribution is also often used. In skewed posterior distributions, the median is an attractive choice. In Bayesian estimation, the standard deviation of the posterior distribution is comparable to the standard error in classical statistics. However, the confidence interval generally is based on the 1/2 α and 100 1/2 α percentiles around the point estimate. In the Bayesian terminology, this is referred to as the 100 α% credibility interval. Mplus by default uses the median of the posterior distribution for the point estimate, and the percentile-based 95% credibility interval, which we have followed in our simulations. Bayesian methods have some advantages over classical methods. To begin, in contrast to the asymptotic maximum likelihood method, they are valid in small samples. Given the correct probability distribution, the estimates are always proper, which solves the problem of negative variance estimates. Finally, since the random draws are taken from the correct distribution, there is no assumption of normality when variances are estimated. In this study, we examine if Bayesian estimation will help in drawing correct inferences in multilevel SEM if the number of groups (countries) is relatively small. The simulation studies cited in the introduction typically find that at smaller country level sample sizes the parameter estimates themselves are unbiased, but that the standard errors are underestimated, which leads to poor control of the alpha level and undercoverage for the confidence intervals. We expect that the credibility intervals in our Bayesian estimation will perform better at the country level sample sizes usually encountered in comparative survey research.

5 HOW FEW COUNTRIES WILL DO? COMPARATIVE SURVEY ANALYSIS FROM A BAYESIAN PERSPECTIVE Simulation design The simulation design in this study closely follows Meuleman and Billiet (2009). The model at both the individual and the country level is a one-factor model with four indicators. There is one structural effect from an observed exogenous variable on the factor. Figure 1 shows the path diagram with the population parameter values. The simulated data were generated from a population that has the same characteristics as used in Meuleman and Billiet (2009:48): The observed variables have a multivariate distribution. The intraclass correlation of the observed indicators is The within level unstandardized factor loadings are 0.90, 0.90, 0.75 and The between level factor unstandardized loadings are 0.27, 0.27, 0.28 and The within level independent variable has an unstandardized effect of The between level independent variable has an effect that is manipulated. One condition has an effect size of The other effect sizes were manipulated to be 0.10 (small), 0.25 (medium), 0.50 (large) and 0.75 (very large), following Cohen s (1988) suggestions for effect sizes. The within level sample size is Meuleman and Billiet generate data for five different numbers of countries: 20, 40, 60, 80 and 100. We have generated data for 10, 15 and 20 countries, with 1000 replications for each condition in our simulation design. We have used Mplus 6.1 for our simulation. Mplus has a set of commands that can be used to tweak the Bayesian estimation process. Assuming that most users will use the default settings, we have not attempted to modify the default settings. The major issue here is to let Mplus automatically decide how long the burn-in must be. Mplus uses the Gelman-Rubin potential scale reduction (PSR; Gelman and Rubin 1992) to decide when the chain has converged. By default, two independent MCMC chains are produced, and the between and within chain variation is compared. When the between chain variance is smaller than 0.05, convergence is assumed. Lee (2007) discusses this and other Bayesian model checks, we will come back to this issue in the discussion Results The simulation results are summarized in Table 1, which also reports a selection of the results obtained by Meuleman and Billiet (2009). Table 2: Statistical power for detecting the country level structural effect, for various effect sizes and country level sample sizes Number of countries Bayesian estimation ML estimation Effect size None (0.00) Small (0.10) Medium (0.25) Large (0.50) Very large (0.75) Parameters estimated by Meuleman and Billiet Table 1 shows that, compared to ML estimation, Bayesian estimation tends to result in a much larger bias for the country level residual variance estimates, but to less bias for the country level factor loadings and the structural effect. The 95% credibility intervals show a much better coverage in Bayesian estimation than their maximum likelihood based counterparts. For example, with 20 countries the between level factor loadings have a mean absolute bias of 0.03 in Bayesian estimation, and in Maximum Likelihood estimation. The actual coverage of the nominal 95% interval is 0.94 in Bayesian estimation, and 0.84 with Maximum Likelihood estimation, which is woefully inadequate. Table 2 shows the proportion of p-values below 0.05, for various effect sizes. For an effect size of zero, the table shows the operating alpha level, which indicates the prevalence of the type I error. It is clear that ML estimation does not control the alpha level well, with an operating alpha level of 16% with twenty countries. Thus, if the nominal alpha level is set at the common value of 0.05, the prevalence of type I errors is actually The alpha level is much better controlled in Bayesian estimation, where even at 10 countries the operating alpha level is 0.03, which is reasonably close to the nominal alpha level of Table 2 also shows that with a small number of countries the power in both Bayesian and Maximum Likelihood to detect anything but the largest effects is low. When the effect size is not zero, ML estimation does reject the null hypothesis more often than Bayesian estimation. For example, with 20 countries the power to detect a large effect is 0.58 in Bayesian estimation and 0.75 in Maximum Likelihood estimation. As we showed above, this increased power is at the expense of a very poorly controlled alpha level. 5. Discussion The results of the simulation show that Bayesian estimation indeed can get away with far fewer countries than Maximum Likelihood estimation. Both the parameter estimates and the coverage of the 95% interval are surprisingly good. However, the between level residual error variances are estimated very poorly. We come back to this issue later in the 1 One simulation run encountered convergence problems, which were solved by setting this convergence criterion to 0.01.

6 92 JOOP HOX, RENS VAN DE SCHOOT AND SUZETTE MATTHIJSSE Table 1: Mean absolute bias for various country level sample sizes Number of countries Bayesian estimation ML estimation Parameter bias Within factor loadings Within error variances Within structural effect Between factor loadings Between error variances Between structural effect Coverage Within factor loadings Within error variances Within structural effect Between factor loadings Between error variances Between structural effect Parameters estimated by Meuleman and Billiet discussion, when we discuss convergence problems in the Bayesian context. With respect to statistical power, it is clear that Bayesian estimation does not solve the problem of small sample, only very large country level effects can be discovered when the number of countries is small. The results also show that Bayesian estimation is not magic. With ten countries, problems start to show in the summary tables, but they are clearer when the simulation output is studied in more detail. For the condition with ten countries, each simulation run contains some outliers for the estimates of the error variances and corresponding standard errors, with estimates up to twenty times the population values. Such outliers would be recognized as such in a real analysis. The between model contains a total of 10 parameters, so it is not surprising that problems arise when the number of countries approaches the number of parameters in the between model. Simplifying the model, for instance by using the mean of the observed indicators instead of a latent variable would make estimation easier. We briefly mentioned convergence problems and outlying estimates. In MCMC estimation, convergence means convergence of the chain to the correct distribution. In our simulation, we have decided to emulate a relatively nave user and therefore to follow all defaults implemented in the software (Mplus 6.1). We also used an automatic cut-off criterion to decide whether convergence had been reached. In one simulation run, we needed to change the default criterion to a more strict value. Textbooks introducing Bayesian statistics caution users to always use diagnostic tools such as plots of the iteration history (trace plots, c.f. Gelman, Carlin, Stern and Rubin 2004; Lynch 2007), and we completely agree with such recommendations. Obviously, in a simulation, visually inspecting trace plots for 15,000 replications times 20 parameters is not possible. In applied Bayesian analysis, we consider such inspection mandatory. In addition, especially in modeling situations as extreme as having as many parameters as we have countries, we recommend inspection of autocorrelations and setting much stricter criteria for convergence. In fact, if we deviate from the software defaults and set the convergence criteria much stricter, the bias in the residual variances at the country level becomes much smaller, at the cost of a much increased computation time. Softwarewise, we have simply specified a different estimation method. From a principled standpoint, we have chosen adifferent kind of statistics. As a result, the 95% credibility interval now may correctly be interpreted as the interval that contains the population parameter with 95% probability. In our power table, we presented p-values. In the Bayesian case, this is not the normal p-value, but the so-called posterior predictive p-value. This is roughly interpreted as a standard p-value, but it is actually a different entity. Bayesian modeling in general prefers that decisions about parameters are based on credibility intervals, and that decisions about models are based on comparative evidence, such as information criteria or Bayes factors. A discussion of these issues is beyond the scope of this paper, but we believe that applied researchers should be aware that doing a Bayesian analysis is not just choosing a different estimation method. In our analysis, we have chosen the default uninformative priors provided by Mplus. Other choices are possible. One interesting option is using an informative prior. For example, the default prior for a factor loading in Mplus is a normal distribution with a mean of zero and a very large variance (10 10 ). We have more prior knowledge than that. If we model seven-point answer scales with an underlying factor, using standard identifying constraints, we know that the (absolute) factor loadings will not exceed, say, the value ten. Why not use a prior distribution that reflects this knowledge? In doing so, we would become real subjectivist statisticians, a position that is far away from mainstream statistics. If we impose priors that describe only realistic parameter values, the convergence problem discussed above will disappear. But in

7 HOW FEW COUNTRIES WILL DO? COMPARATIVE SURVEY ANALYSIS FROM A BAYESIAN PERSPECTIVE 93 small samples, such prior information could easily dominate the information in the data. In this paper, we have taken the position that this is undesirable, and prefer to work with uninformative priors. Acknowledgements Rens van de Schoot received a grant from the Netherlands Organisation for Scientific Research: NWO-VINI References Asparouhov, T., & Muthén, B. (2010). Bayesian analysis of latent variable models using Mplus. Version 4. Unpublished manuscript, accessed October 13, 2011 on Barnett, V. (2008). Comparative statistical inference. Chicester, UK: Wiley. Boomsma, A., & Hoogland, J. J. (2001). The robustness of LIS- REL modeling revisited. In R. Cudeck, S. du Toit, & D. Sörbom (Eds.), Structural equation modeling: Present and future. A Festschrift in honor of Karl Jöreskog (p ). Chicago: Scientific Software International. Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Mahwah, NJ: Lawrence Erlbaum Associates. Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. (2004). Bayesian data analysis (2nd ed.). Boca Raton, FL: Chapman & Hall/CRC. Gelman, A., & Rubin, D. B. (1992). Inference from iterative simulation using multiple sequences. Statistical Science, 7, Goldstein, H. (2011). Multilevel statistical models. Chicester, UK: Wiley. Groves, R. M. (1989). Survey errors and survey costs. New York: Wiley. Harkness, J. A., Braun, M., Edwards, B., Johnson, T. P., Lyberg, L. E., Mohler, P. P., et al. (2010). Survey methods in multinational, multiregional, and multicultural contexts. Chicester, UK: Wiley. Hox, J. J. (2010). Multilevel analysis. Techniques and applications. NY: Routledge. Jöreskog, K. G. (1971). Simultaneous factor analysis in several populations. Psychometrika, 36, Kaplan, D. (2009). Structural equation modeling (2nd ed.). Thousand Oaks, CA: Sage. Lee, S. (2007). Structural equation modeling: a Bayesian approach. Chicester, UK: Wiley. Lynch, S. M. (2007). Introduction to applied Bayesian statistics and estimation for social scientists. Berlin: Springer. Maas, C. J. M., & Hox, J. J. (2005). Sufficient sample sizes for multilevel modeling. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 1, Mehta, P. D., & Neale, M. C. (2005). People are variables too: multilevel structural equations modeling. Psychological Methods, 10, Meuleman, B., & Billiet, J. (2009). A Monte Carlo sample size study : How many countries are needed for accurate multilevel SEM? Survey Research Methods, 3, Muthén, B. (1989). Latent variable modeling in heterogeneous populations. Psychometrika, 54, Muthén, L. K., & Muthén, B. O. ( ). Mplus user s guide (6th ed.). Los Angeles, CA: Muthén & Muthén. Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models (2nd ed.). Thousand Oaks, CA: Sage. Van de Vijver, F. R., van Hemert, D. A., & Poortinga, Y. H. (Eds.). (2008). Multilevel analysis of individuals and cultures. NY: Taylor & Francis.

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Impact and adjustment of selection bias. in the assessment of measurement equivalence Impact and adjustment of selection bias in the assessment of measurement equivalence Thomas Klausch, Joop Hox,& Barry Schouten Working Paper, Utrecht, December 2012 Corresponding author: Thomas Klausch,

More information

OLS Regression with Clustered Data

OLS Regression with Clustered Data OLS Regression with Clustered Data Analyzing Clustered Data with OLS Regression: The Effect of a Hierarchical Data Structure Daniel M. McNeish University of Maryland, College Park A previous study by Mundfrom

More information

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

Combining Risks from Several Tumors Using Markov Chain Monte Carlo University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln U.S. Environmental Protection Agency Papers U.S. Environmental Protection Agency 2009 Combining Risks from Several Tumors

More information

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research Michael T. Willoughby, B.S. & Patrick J. Curran, Ph.D. Duke University Abstract Structural Equation Modeling

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis Advanced Studies in Medical Sciences, Vol. 1, 2013, no. 3, 143-156 HIKARI Ltd, www.m-hikari.com Detection of Unknown Confounders by Bayesian Confirmatory Factor Analysis Emil Kupek Department of Public

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Analyzing data from educational surveys: a comparison of HLM and Multilevel IRT. Amin Mousavi

Analyzing data from educational surveys: a comparison of HLM and Multilevel IRT. Amin Mousavi Analyzing data from educational surveys: a comparison of HLM and Multilevel IRT Amin Mousavi Centre for Research in Applied Measurement and Evaluation University of Alberta Paper Presented at the 2013

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses

Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses Journal of Modern Applied Statistical Methods Copyright 2005 JMASM, Inc. May, 2005, Vol. 4, No.1, 275-282 1538 9472/05/$95.00 Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement

More information

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Adam C. Carle, M.A., Ph.D. adam.carle@cchmc.org Division of Health Policy and Clinical Effectiveness

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison Group-Level Diagnosis 1 N.B. Please do not cite or distribute. Multilevel IRT for group-level diagnosis Chanho Park Daniel M. Bolt University of Wisconsin-Madison Paper presented at the annual meeting

More information

The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models

The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Educational Psychology Papers and Publications Educational Psychology, Department of 7-1-2001 The Relative Performance of

More information

The matching effect of intra-class correlation (ICC) on the estimation of contextual effect: A Bayesian approach of multilevel modeling

The matching effect of intra-class correlation (ICC) on the estimation of contextual effect: A Bayesian approach of multilevel modeling MODERN MODELING METHODS 2016, 2016/05/23-26 University of Connecticut, Storrs CT, USA The matching effect of intra-class correlation (ICC) on the estimation of contextual effect: A Bayesian approach of

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Advanced Bayesian Models for the Social Sciences

Advanced Bayesian Models for the Social Sciences Advanced Bayesian Models for the Social Sciences Jeff Harden Department of Political Science, University of Colorado Boulder jeffrey.harden@colorado.edu Daniel Stegmueller Department of Government, University

More information

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch.

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch. S05-2008 Imputation of Categorical Missing Data: A comparison of Multivariate Normal and Abstract Multinomial Methods Holmes Finch Matt Margraf Ball State University Procedures for the imputation of missing

More information

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill) Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller

More information

A Bayesian Nonparametric Model Fit statistic of Item Response Models

A Bayesian Nonparametric Model Fit statistic of Item Response Models A Bayesian Nonparametric Model Fit statistic of Item Response Models Purpose As more and more states move to use the computer adaptive test for their assessments, item response theory (IRT) has been widely

More information

Bayesian Mediation Analysis

Bayesian Mediation Analysis Psychological Methods 2009, Vol. 14, No. 4, 301 322 2009 American Psychological Association 1082-989X/09/$12.00 DOI: 10.1037/a0016972 Bayesian Mediation Analysis Ying Yuan The University of Texas M. D.

More information

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Magdalena M.C. Mok, Macquarie University & Teresa W.C. Ling, City Polytechnic of Hong Kong Paper presented at the

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation in CFA

On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation in CFA STRUCTURAL EQUATION MODELING, 13(2), 186 203 Copyright 2006, Lawrence Erlbaum Associates, Inc. On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation

More information

Introduction to Bayesian Analysis 1

Introduction to Bayesian Analysis 1 Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches

More information

UvA-DARE (Digital Academic Repository)

UvA-DARE (Digital Academic Repository) UvA-DARE (Digital Academic Repository) Small-variance priors can prevent detecting important misspecifications in Bayesian confirmatory factor analysis Jorgensen, T.D.; Garnier-Villarreal, M.; Pornprasertmanit,

More information

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Brady T. West Michigan Program in Survey Methodology, Institute for Social Research, 46 Thompson

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

Inclusive Strategy with Confirmatory Factor Analysis, Multiple Imputation, and. All Incomplete Variables. Jin Eun Yoo, Brian French, Susan Maller

Inclusive Strategy with Confirmatory Factor Analysis, Multiple Imputation, and. All Incomplete Variables. Jin Eun Yoo, Brian French, Susan Maller Inclusive strategy with CFA/MI 1 Running head: CFA AND MULTIPLE IMPUTATION Inclusive Strategy with Confirmatory Factor Analysis, Multiple Imputation, and All Incomplete Variables Jin Eun Yoo, Brian French,

More information

Ordinal Data Modeling

Ordinal Data Modeling Valen E. Johnson James H. Albert Ordinal Data Modeling With 73 illustrations I ". Springer Contents Preface v 1 Review of Classical and Bayesian Inference 1 1.1 Learning about a binomial proportion 1 1.1.1

More information

The Effect of Extremes in Small Sample Size on Simple Mixed Models: A Comparison of Level-1 and Level-2 Size

The Effect of Extremes in Small Sample Size on Simple Mixed Models: A Comparison of Level-1 and Level-2 Size INSTITUTE FOR DEFENSE ANALYSES The Effect of Extremes in Small Sample Size on Simple Mixed Models: A Comparison of Level-1 and Level-2 Size Jane Pinelis, Project Leader February 26, 2018 Approved for public

More information

THE INDIRECT EFFECT IN MULTIPLE MEDIATORS MODEL BY STRUCTURAL EQUATION MODELING ABSTRACT

THE INDIRECT EFFECT IN MULTIPLE MEDIATORS MODEL BY STRUCTURAL EQUATION MODELING ABSTRACT European Journal of Business, Economics and Accountancy Vol. 4, No. 3, 016 ISSN 056-6018 THE INDIRECT EFFECT IN MULTIPLE MEDIATORS MODEL BY STRUCTURAL EQUATION MODELING Li-Ju Chen Department of Business

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods (10) Frequentist Properties of Bayesian Methods Calibrated Bayes So far we have discussed Bayesian methods as being separate from the frequentist approach However, in many cases methods with frequentist

More information

Bayesian Inference Bayes Laplace

Bayesian Inference Bayes Laplace Bayesian Inference Bayes Laplace Course objective The aim of this course is to introduce the modern approach to Bayesian statistics, emphasizing the computational aspects and the differences between the

More information

Inference About Magnitudes of Effects

Inference About Magnitudes of Effects invited commentary International Journal of Sports Physiology and Performance, 2008, 3, 547-557 2008 Human Kinetics, Inc. Inference About Magnitudes of Effects Richard J. Barker and Matthew R. Schofield

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions J. Harvey a,b, & A.J. van der Merwe b a Centre for Statistical Consultation Department of Statistics

More information

C h a p t e r 1 1. Psychologists. John B. Nezlek

C h a p t e r 1 1. Psychologists. John B. Nezlek C h a p t e r 1 1 Multilevel Modeling for Psychologists John B. Nezlek Multilevel analyses have become increasingly common in psychological research, although unfortunately, many researchers understanding

More information

A Multilevel Testlet Model for Dual Local Dependence

A Multilevel Testlet Model for Dual Local Dependence Journal of Educational Measurement Spring 2012, Vol. 49, No. 1, pp. 82 100 A Multilevel Testlet Model for Dual Local Dependence Hong Jiao University of Maryland Akihito Kamata University of Oregon Shudong

More information

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics What is Multilevel Modelling Vs Fixed Effects Will Cook Social Statistics Intro Multilevel models are commonly employed in the social sciences with data that is hierarchically structured Estimated effects

More information

A Hierarchical Linear Modeling Approach for Detecting Cheating and Aberrance. William Skorupski. University of Kansas. Karla Egan.

A Hierarchical Linear Modeling Approach for Detecting Cheating and Aberrance. William Skorupski. University of Kansas. Karla Egan. HLM Cheating 1 A Hierarchical Linear Modeling Approach for Detecting Cheating and Aberrance William Skorupski University of Kansas Karla Egan CTB/McGraw-Hill Paper presented at the May, 2012 Conference

More information

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials Practical Bayesian Design and Analysis for Drug and Device Clinical Trials p. 1/2 Practical Bayesian Design and Analysis for Drug and Device Clinical Trials Brian P. Hobbs Plan B Advisor: Bradley P. Carlin

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

Running Head: BAYESIAN MEDIATION WITH MISSING DATA 1. A Bayesian Approach for Estimating Mediation Effects with Missing Data. Craig K.

Running Head: BAYESIAN MEDIATION WITH MISSING DATA 1. A Bayesian Approach for Estimating Mediation Effects with Missing Data. Craig K. Running Head: BAYESIAN MEDIATION WITH MISSING DATA 1 A Bayesian Approach for Estimating Mediation Effects with Missing Data Craig K. Enders Arizona State University Amanda J. Fairchild University of South

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

Response to Comment on Cognitive Science in the field: Does exercising core mathematical concepts improve school readiness?

Response to Comment on Cognitive Science in the field: Does exercising core mathematical concepts improve school readiness? Response to Comment on Cognitive Science in the field: Does exercising core mathematical concepts improve school readiness? Authors: Moira R. Dillon 1 *, Rachael Meager 2, Joshua T. Dean 3, Harini Kannan

More information

Monte Carlo Analysis of Univariate Statistical Outlier Techniques Mark W. Lukens

Monte Carlo Analysis of Univariate Statistical Outlier Techniques Mark W. Lukens Monte Carlo Analysis of Univariate Statistical Outlier Techniques Mark W. Lukens This paper examines three techniques for univariate outlier identification: Extreme Studentized Deviate ESD), the Hampel

More information

JSM Survey Research Methods Section

JSM Survey Research Methods Section Methods and Issues in Trimming Extreme Weights in Sample Surveys Frank Potter and Yuhong Zheng Mathematica Policy Research, P.O. Box 393, Princeton, NJ 08543 Abstract In survey sampling practice, unequal

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

Using Sample Weights in Item Response Data Analysis Under Complex Sample Designs

Using Sample Weights in Item Response Data Analysis Under Complex Sample Designs Using Sample Weights in Item Response Data Analysis Under Complex Sample Designs Xiaying Zheng and Ji Seung Yang Abstract Large-scale assessments are often conducted using complex sampling designs that

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives DOI 10.1186/s12868-015-0228-5 BMC Neuroscience RESEARCH ARTICLE Open Access Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives Emmeke

More information

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1 Nested Factor Analytic Model Comparison as a Means to Detect Aberrant Response Patterns John M. Clark III Pearson Author Note John M. Clark III,

More information

Using a multilevel structural equation modeling approach to explain cross-cultural measurement noninvariance

Using a multilevel structural equation modeling approach to explain cross-cultural measurement noninvariance Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2012 Using a multilevel structural equation modeling approach to explain cross-cultural

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Bayesian Estimation of a Meta-analysis model using Gibbs sampler

Bayesian Estimation of a Meta-analysis model using Gibbs sampler University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2012 Bayesian Estimation of

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

Modelling Double-Moderated-Mediation & Confounder Effects Using Bayesian Statistics

Modelling Double-Moderated-Mediation & Confounder Effects Using Bayesian Statistics Modelling Double-Moderated-Mediation & Confounder Effects Using Bayesian Statistics George Chryssochoidis*, Lars Tummers**, Rens van de Schoot***, * Norwich Business School, University of East Anglia **

More information

Multivariate Regression with Small Samples: A Comparison of Estimation Methods W. Holmes Finch Maria E. Hernández Finch Ball State University

Multivariate Regression with Small Samples: A Comparison of Estimation Methods W. Holmes Finch Maria E. Hernández Finch Ball State University Multivariate Regression with Small Samples: A Comparison of Estimation Methods W. Holmes Finch Maria E. Hernández Finch Ball State University High dimensional multivariate data, where the number of variables

More information

Student Performance Q&A:

Student Performance Q&A: Student Performance Q&A: 2009 AP Statistics Free-Response Questions The following comments on the 2009 free-response questions for AP Statistics were written by the Chief Reader, Christine Franklin of

More information

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén Chapter 12 General and Specific Factors in Selection Modeling Bengt Muthén Abstract This chapter shows how analysis of data on selective subgroups can be used to draw inference to the full, unselected

More information

Section on Survey Research Methods JSM 2009

Section on Survey Research Methods JSM 2009 Missing Data and Complex Samples: The Impact of Listwise Deletion vs. Subpopulation Analysis on Statistical Bias and Hypothesis Test Results when Data are MCAR and MAR Bethany A. Bell, Jeffrey D. Kromrey

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Marianne (Marnie) Bertolet Department of Statistics Carnegie Mellon University Abstract Linear mixed-effects (LME)

More information

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study STATISTICAL METHODS Epidemiology Biostatistics and Public Health - 2016, Volume 13, Number 1 Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL 1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015 On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses Structural Equation Modeling Lecture #12 April 29, 2015 PRE 906, SEM: On Test Scores #2--The Proper Use of Scores Today s Class:

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information

STATISTICAL INFERENCE 1 Richard A. Johnson Professor Emeritus Department of Statistics University of Wisconsin

STATISTICAL INFERENCE 1 Richard A. Johnson Professor Emeritus Department of Statistics University of Wisconsin STATISTICAL INFERENCE 1 Richard A. Johnson Professor Emeritus Department of Statistics University of Wisconsin Key words : Bayesian approach, classical approach, confidence interval, estimation, randomization,

More information

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of

More information

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA Henrik Kure Dina, The Royal Veterinary and Agricuural University Bülowsvej 48 DK 1870 Frederiksberg C. kure@dina.kvl.dk

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University.

Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University. Running head: ASSESS MEASUREMENT INVARIANCE Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies Xiaowen Zhu Xi an Jiaotong University Yanjie Bian Xi an Jiaotong

More information

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods Introduction Patrick Breheny January 10 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/25 Introductory example: Jane s twins Suppose you have a friend named Jane who is pregnant with twins

More information

A MONTE CARLO STUDY OF MODEL SELECTION PROCEDURES FOR THE ANALYSIS OF CATEGORICAL DATA

A MONTE CARLO STUDY OF MODEL SELECTION PROCEDURES FOR THE ANALYSIS OF CATEGORICAL DATA A MONTE CARLO STUDY OF MODEL SELECTION PROCEDURES FOR THE ANALYSIS OF CATEGORICAL DATA Elizabeth Martin Fischer, University of North Carolina Introduction Researchers and social scientists frequently confront

More information

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS 12 June 2012 Michael Wood University of Portsmouth Business School SBS Department, Richmond Building Portland

More information

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Jee-Seon Kim and Peter M. Steiner Abstract Despite their appeal, randomized experiments cannot

More information

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB Bryan Orme, Sawtooth Software, Inc. Copyright 009, Sawtooth Software, Inc. 530 W. Fir St. Sequim,

More information

Sample Sizes for Predictive Regression Models and Their Relationship to Correlation Coefficients

Sample Sizes for Predictive Regression Models and Their Relationship to Correlation Coefficients Sample Sizes for Predictive Regression Models and Their Relationship to Correlation Coefficients Gregory T. Knofczynski Abstract This article provides recommended minimum sample sizes for multiple linear

More information

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Mantel-Haenszel Procedures for Detecting Differential Item Functioning A Comparison of Logistic Regression and Mantel-Haenszel Procedures for Detecting Differential Item Functioning H. Jane Rogers, Teachers College, Columbia University Hariharan Swaminathan, University of

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

How many speakers? How many tokens?:

How many speakers? How many tokens?: 1 NWAV 38- Ottawa, Canada 23/10/09 How many speakers? How many tokens?: A methodological contribution to the study of variation. Jorge Aguilar-Sánchez University of Wisconsin-La Crosse 2 Sample size in

More information

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data TECHNICAL REPORT Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data CONTENTS Executive Summary...1 Introduction...2 Overview of Data Analysis Concepts...2

More information

An Exercise in Bayesian Econometric Analysis Probit and Linear Probability Models

An Exercise in Bayesian Econometric Analysis Probit and Linear Probability Models Utah State University DigitalCommons@USU All Graduate Plan B and other Reports Graduate Studies 5-1-2014 An Exercise in Bayesian Econometric Analysis Probit and Linear Probability Models Brooke Jeneane

More information

UNIVERSITY OF FLORIDA 2010

UNIVERSITY OF FLORIDA 2010 COMPARISON OF LATENT GROWTH MODELS WITH DIFFERENT TIME CODING STRATEGIES IN THE PRESENCE OF INTER-INDIVIDUALLY VARYING TIME POINTS OF MEASUREMENT By BURAK AYDIN A THESIS PRESENTED TO THE GRADUATE SCHOOL

More information

Individual Differences in Attention During Category Learning

Individual Differences in Attention During Category Learning Individual Differences in Attention During Category Learning Michael D. Lee (mdlee@uci.edu) Department of Cognitive Sciences, 35 Social Sciences Plaza A University of California, Irvine, CA 92697-5 USA

More information

MODEL SELECTION STRATEGIES. Tony Panzarella

MODEL SELECTION STRATEGIES. Tony Panzarella MODEL SELECTION STRATEGIES Tony Panzarella Lab Course March 20, 2014 2 Preamble Although focus will be on time-to-event data the same principles apply to other outcome data Lab Course March 20, 2014 3

More information

Missing Data and Institutional Research

Missing Data and Institutional Research A version of this paper appears in Umbach, Paul D. (Ed.) (2005). Survey research. Emerging issues. New directions for institutional research #127. (Chapter 3, pp. 33-50). San Francisco: Jossey-Bass. Missing

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information