Statistical Models for Enhancing Cross-Population Comparability

Size: px
Start display at page:

Download "Statistical Models for Enhancing Cross-Population Comparability"

Transcription

1 Statistical Models for Enhancing Cross-Population Comparability A. Tandon, C.J.L. Murray, J.A. Salomon, and G. King Global Programme on Evidence for Health Policy Discussion Paper No. 42 World Health Organization, Geneva, Switzerland January 23, 22 1 Introduction Measuring the health state of individuals is important for the evaluation of health interventions, monitoring individual health progress, and as a critical step in measuring the health of populations. Self-report responses in household survey data are widely used for assessing the non-fatal health status of populations. These data typically take the form of ordered categorical (ordinal) responses. Over the past three decades, there has been great progress in developing instruments to measure the multiple domains of health that are reliable and demonstrate within population validity [31],[22]. One key analytical issue is that these self-report ordinal responses are not comparable across populations primarily because of response category cut-point shifts. Conceptualizing the observed responses as resulting from a mapping between an underlying unobserved latent variable (e.g., ability on the domain of mobility) and categorical response categories, cut-points are threshold levels on the latent variable that characterize the transition from one observed categorical response to the next. If cut-points differ systematically across populations, or even across sociodemographic groups within a population, then the observed ordinal responses are not cross-population comparable since they will not imply the same level on the underlying latent variable that we are trying to measure (Figure 1). Another way of characterizing this problem is that, for the same level of the latent variable on any given domain, the probability of an individual responding in any given response category is different across populations. This issue of cross-population comparability is not limited to health surveys: it is of equal relevance to self-report surveys on responsiveness of health systems, as well as to numerous other questions that rely on ordinal responses. One example of self-report health data comes from the WHO Multi-Country Household Survey Study on Health and Responsiveness [28]. The main self-report question on the domain of mobility is: Overall in the past 3 days, how much difficulty did you have with moving around? Respondents are asked to classify themselves using one of five response categories: 1=Extreme/Cannot do; 2=Severe difficulty; 3=Moderate difficulty; 4=Mild

2 A B C N N Mi N Mo Mi Mi S Mo Mo S E S Cut-points E E Latent mobility scale N = None, Mi = Mild, Mo = Moderate, S = Severe, E = Extreme Figure 1: Mapping from unobserved latent variable to observed response categories difficulty; 5=No difficulty. We can hypothesize that cut-points may vary between populationsbecauseofdifferent cultural or other expectations for domains of health. Cut-points are also likely to vary within a cultural or sociodemographic group. The cut-points for older individuals may shift as their expectations for a domain diminish with age. Men may be more likely to deny declines in health so that their cut-points may be systematically shifted as compared to women. Contact with health services may influence expectations for a domain and thus shift cut-points [2]. Empirical examples suggesting cross-population cut-point shifts in health surveys abound [23]. For instance, in Australian national health surveys comparing the self-reported health status of Aboriginals with that of the general population, only around 12% of the Aboriginal population characterized their own health status as fair or poor, while more than 2% of the general population rated their health in these low categories. By any other major indicator of mortality and morbidity, the Aboriginal population fares much worse than the general population, which suggests that there may be important differences in the interpretation of categorical responses in the different sub-populations due to shifts in response category cut-points. Residents of the state of Kerala in India which has the lowest rates of infant and child mortality and the highest rates of literacy in India consistently report highest incidences of morbidity in the country [19]. The object of this document is to elaborate on several statistical models used in the analysis of survey data. First, we focus on off-the-shelf models that are widely available as part of any standard statistical software. In particular, we demonstrate the problems of inference that arise from these standard methods when the underlying data are not crosspopulation comparable. In later sections, we introduce methods that modify these standard routines to enhance the cross-population comparability of survey analyses. 2

3 2 Models for Analyzing Ordinal Survey Responses We begin by describing the application of existing statistical models for the analysis of ordinal survey data. These models serve as the building blocks for the methodological innovations introduced in subsequent sections. In particular, the focus is on two off-theshelf methods: (a) the ordered probit model (widely used by econometricians and other social scientists), and (b) the partial credit model (from psychometrics). Both these models are used in the analysis of ordered categorical response data. The partial credit model is a multiple-category generalization of the Rasch model and is part of a large body of literature oftenreferredtoasitemresponsetheory(irt) whichhasitsrootsineducationaltesting using standardized exams. One needs to be careful, though, in using these standard models in the analysis of data that may not be cross-population comparable. In other words, if there are good reasons to believe that respondents saying they are in good health in Ethiopia and in Denmark mean very different things in terms of an underlying latent variable measure, then the use of these methods without correction may lead to very misleading conclusions regarding the actual levels of health in these two populations. In order to better demonstrate this point, and to subsequently introduce some methodological innovations dealing with cross-population comparability, a simulated dataset is utilized. The simulated dataset consists of 1, respondents each from two hypothetical populations (countries A and B) for which the level of health on a domain, say mobility, is to be estimated based on self-report categorical responses to three questions (one core question, and two auxiliary questions). These questions are: 1 Main Question: Overall in the past 3 days, how much difficulty did you have moving around? Auxiliary Question 1: Overall in the past 3 days, how much difficulty did you have standing for long periods such as 3 minutes? Auxiliary Question 2: Overall in the past 3 days, how much difficulty did you have climbing several flights of stairs or walking up a steep hill? Each of the questions asks the respondents to pick one of five responses: 1 = Extreme/Cannot Do 2 = Severe 3 = Moderate 4 = Mild 5= None Since this is simulated data, the true mobility levels are known for each respondent. This enables a comparison of the estimated mobility levels versus truth for the different models. 1 The questions mirror those in the WHO Multi-Country Study. 3

4 The simulated data is generated based on the assumption that true mobility is a function of age, sex, education, and country of residence for each respondent. An individual-level random effect term is also added to represent other individual-specific unobserved factors that might affect mobility. Table 1 reports the mean age, education level, and sex distribution in the simulated sample. Table 1: Descriptive statistics (simulated data) Country Mean Age Mean Education Female N A , B , In addition, the simulation allows cut-points for each question to differ by sociodemographic group. The response category cut-points are generated as functions of age, sex, education, and country of residence. Figure 2 plots the distribution of the simulated observed categorical responses for the three questions for countries A and B. 2 At first glance, the distribution of self-report responses in the two countries does not look very different. Country A Country A Country A Frac tion.4 Frac tion.4 Frac tion Country B Main question Country B Auxiliary question 1 Country B Auxiliary question Fract ion.6.4 Fract ion.6.4 Fract ion Main question Auxiliary question Auxiliary question 2 Figure 2: Distribution of responses for three self-report questions in countries A and B In the next two sub-sections, these data are analyzed using both the ordered probit model and the Rasch-based partial credit model. It is assumed that the data analyst has access to the self-report categorical responses as well as standard demographic variables such as age, sex, education, and country of residence for each of the respondents. The goal is to estimate mobility levels in the two simulated populations using these data. In later 2 In generating the categorical responses, a stochastic error term with a variances ranging from 15 to 25 units was used (assumed different across questions, with auxiliary question 2 being the noisiest question). 4

5 sections, we introduce models that allow response category-cut-points also to be functions of covariates. In such models, the direction of shift for the response category cut-points is also of substantive interest (e.g., to test the hypotheses that more educated respondents have higher cut-points indicative of higher norms, or that older individuals respond based on norms for their age category, and so on). Of course, such models can also be used for testing hypotheses relating to causal inferences and other tests of statistical significance. 2.1 The Ordered Probit Model The ordered probit model assumes there is an unobserved latent variable Yi (mobility) distributed with mean µ i and variance 1, where i refers to the respondent. 3 The mean level of the latent variable is a function of individual-level sociodemographic characteristics such as age, sex, education, and country of residence, Yi N(µ i, 1), i =1,..., N µ i = Ziβ. Let y i be the observed categorical response of individual i to the main self-report question. The ordered probit model stipulates an observation mechanism such that: y i = k if τ k 1 Y i < τ k ; for τ =, τ 5 =, i & k =1,..., 5. Also, it follows from the set-up of the model that τ 1 < τ 2 < τ 3 < τ 4. Given this structure, the probabilities of responding in any given category k =1,..., 5, conditional on a vector of covariates Z i,canbederivedas: Pr(y i = k) = F (τ 1 Zi β), k =1 F (τ 2 Zi β) F (τ 1 Zi β), k =2 F (τ 3 Zi β) F (τ 2 Zi β), k =3 F (τ 4 Zi β) F (τ 3 Zi β), k =4 1 F (τ 4 Zi β), k =5, where F ( ) is the standard normal cumulative distribution function. If the observations are assumed independent across individuals, then the likelihood function is simply the product of the probabilities of observing each value of y i in the dataset. Estimates of the β vector as well as the cut-points τ k may then be obtained using maximum likelihood methods. It is important to note that the standard ordered probit model assumes the same set of cut-points for the entire sample. Table 2 reports the results from a run of the ordered probit model for our simulated data for the main question in both countries. Figure 3 plots the cut-points estimated from the ordered probit model versus true cutpoints for the main question. Because the true cutpoints may vary across individuals but 3 Since the latent variable is unobserved, the variance of the latent variable conditional on determinants is arbitrarily set to 1 in the ordered probit model. In addition, in order to identify the model, the constant term is set to. These conventions produce a scale that is unique up to any positive affine transformation, i.e., the latent scale has so-called interval properties. (1) 5

6 Table 2: Estimation results : ordered probit Variable Coefficient (Std. Err.) Age (.65) Age (.77) Age (.88) Male -.62 (.53) 1 < Educ (.91) 6 < Educ (.96) Educ > (.113) Country B (.56) τ (.12) τ (.1) τ (.98) τ (.96) 5 First cut-point Second cut-point Third cut-point Fourth cut-point True cut-points Predicted cut-points Figure 3: Predicted versus true cut-points: ordered probit for main question the model assumes that they are fixed, each predicted cutpoint is associated with a range of different true values. Figure 4 is a plot of true mobility versus estimated average mobility using the standard ordered probit model. As reported in the graph, the R-squared value is only about.11. Not only does the ordered probit model predict the mean mobility poorly, it also predicts that the average mobility is lower in country B (see coefficient on country B in Table 2) even though the true level of mobility is higher in country B in the simulated data. The basic point of this simulation experiment is simple: if there are significant cutpoint shifts in the underlying data-generating mechanism then using standard procedures such as the ordered probit model to analyze the data can be very misleading. Since the ordered probit model is a probability model, we can also obtain the predicted probabilities of responding in each of the five categories for the main question, given any particular level on the underlying latent variable scale (Figure 5). We have used only the main question for analyzing the data using the ordered probit model. One way to analyze 6

7 R-squared =.11 RMSE = True mobility Predicted mobility Figure 4: Predicted versus true mobility: ordered probit for main question Predicted probabilities: ordered probit 1.8 Pr(k = 1) Pr(k = 5) Pr(k = 4) Pr(k = 3) Pr(k = 2) Latent mobility scale Figure 5: Predicted probabilities: ordered probit for main question multiple questions using this model would be to pool the data and allow for a dummy variable per question (since the cut-points will be assumed to be the same for all questions). However, doing this will yield a different mean value of the latent variable per question for each individual. Running the model in this way is potentially confusing, since we assume that an individual has a single value on the latent variable of interest that informs answers to all three questions, but this procedure would allow estimates of this latent variable to differ by question. 2.2 The Partial Credit Model Asecondmodelthatisoftenusedintheanalysisofordinaldataisthepartialcreditmodel from item response theory. This is basically a polytomous extension of the binary-response 7

8 Rasch model [16],[17],[18]. 4 Suppose there are N respondents, each answering J questions on a given domain. Individual i =1,...,N chooses response category k =1,..., 5 for question j = 1,..., J. The partial credit model conceptualizes the ordinal nature of the categorical data as a series of dichotomies or steps. 5 These dichotomies are modeled such that the probability that a respondent chooses response category k, given the choice between response category k or k 1, is: φ k ij = Pr(y ij = k) Pr(y ij = k 1) + Pr(y ij = k) = exp(β i δ k j ) 1+exp(β i δ k j ) Here, Pr(y ij = k) is the probability that individual i responds in category k for question j, andφ ijk is the corresponding probability of responding in category k conditional on responding either in category k 1ork. β i is the ability of individual i, andδj k is the difficulty associated with the k-thstepinquestionj. In other words, the probability of responding in category k, conditional on responding either in category k 1 ork, ismodeled as a positive function of a person s ability and a negative function of the difficulty for the question category. Making use of the condition that the probabilities of responding in a category must sum to 1 across all five categories for each individual i and question j, i.e., Pr(y ij =1)+Pr(y ij =2)+Pr(y ij =3)+Pr(y ij =4)+Pr(y ij =5)=1, a general expression for the probability of responding in the k-th category (where k = 1,..., 5) can be derived: Pr(y ij = k) = exp[(k 1)β i P k 1 m= δm j ] P 5s=1 exp[(s 1)β i P s 1 m= δm j ], where, for notational convenience, P m= δj. For the case of five categories, the probabilities of responding in each category can be written as: Pr(y ij = k) = where A is the expression 1/A, k =1 exp(β i δj 1 )/A, k =2 exp(2β i δj 1 δ2 j )/A, k =3 exp(3β i δj 1 δ2 j δ3 j )/A, k =4 exp(4β i δj 1 δ2 j δ3 j δ4 j )/A, k =5, A 1+exp(β i δj 1 )+exp(2β i δj 1 δ2 j )+exp(3β i δj 1 δ2 j δ3 j ) +exp(4β i δj 1 δj 2 δj 3 δj 4 ) (2) For a fixed number of questions, the unconditional estimation of the likelihood function yields difficulty parameters that are inconsistent [16],[3]. Consistent estimates of the difficulty parameters can be obtained by conditioning on the raw score (i.e., on the sum of 4 The Rasch model is a fixed-effect logit model and can also be reformulated as a quasi-symmetry loglinear model [27],[8]. 5 In this sense, the partial credit model can be viewed as an adjacent category logit model. 8

9 responses across questions for each individual). So, for example, the conditional probability that a person responds in category 2 for all 3 questions is calculated as the joint probability divided by the probability of getting a raw score r of 6 across the questions: Pr(y i1 =2)Pr(y i2 =2)Pr(y i3 =2) Pr(r =6) The likelihood written in this manner is free of the ability parameter β. Once the difficulty parameters have been estimated using the conditional approach, estimates of β r can be obtained using the unconditional likelihood derived from: 1/A, k =1 exp(β r ˆδ j 1 )/A, k =2 Pr(y ij = k) = exp(2β r ˆδ j 1 ˆδ j 2 )/A, k =3 exp(3β r ˆδ j 1 ˆδ j 2 ˆδ j 3 )/A, k =4 exp(4β r ˆδ j 1 ˆδ j 2 ˆδ j 3 ˆδ j 4 )/A, k =5, The notation changes to β r because this method requires only one estimate of ability for every possible sum score of responses across all questions. In the partial credit model, the difficulty parameters are points on the latent variable scale where the probabilities of responding in one category or the next are equal. Alternatively, the difficulty parameters are points where the probability of responding in category k, conditional on responding in categories k 1ork, is.5. The ability parameters can be thought of as estimates of the individual s underlying latent variable. The estimates of ability levels can be compared to true mobility for the simulated data to assess the performance of this model. This simple version of the partial credit model assumes that the difficulty parameters do not vary by sociodemographic characteristics which in the language of psychometrics is akin to saying that it assumes there is no differential item functioning. Table 3 reports the difficulty parameters for the simulated data obtained by running the conditional likelihood procedure in STATA (for identification, δ 1 is set to zero for the main question). 6 Figure 6 plots the estimated ability parameters versus the true mobility. As with the ordered probit model, Figure 7 reports the predicted probabilities from the model for given values of ability. The predicted probabilities are quite similar to those that are predicted by the ordered probit model (Figure 1). As the value of the latent variable increases, the probability of responding in the lowest category becomes small and the probability of responding in higher categories increases. The partial credit model does better than the ordered probit model in predicting the true level of mobility. The R-squared value is much higher than that of the ordered probit model. However, the comparison between the two models is not entirely fair since we only use one question for the ordered probit model and all three questions in the partial credit model. In the formulation introduced here, the partial credit model uses no extraneous information (i.e., covariates such as sex, age, and education) in the estimation of the abilities. 6 Estimates of the difficulty and ability parameters using STATA were of the same magnitude as those obtained using IRT software such as WINMIRA and RUMM. 9

10 R-squared =.221 RMSE = True mobility Predicted mobility Figure 6: Predicted versus true mobility: two-stage partial credit Predicted probabilities: partial credit 1.8 Pr(k = 1) Pr(k = 5) Pr(k = 2) Pr(k = 3) Pr(k = 4) Latent mobility scale Figure 7: Predicted probabilities: two-stage partial credit for main question 1

11 Table 3: Estimation results : two-stage partial credit Variable Coefficient (Std. Err.) δ 1 Dummy Aux 1.27 (.183) Dummy Aux (.178) δ 2 Dummy Aux (.186) Dummy Aux (.183) Main question (.267) δ 3 Dummy Aux (.154) Dummy Aux (.151) Main question (.187) δ 4 Dummy Aux (.11) Dummy Aux (.131) Main question (.175) In the next subsection, we present an alternative specification of the model that includes covariates. 2.3 The Partial Credit Model with Covariates The partial credit model can be reformulated so that instead of having a dummy variable per individual β i, variables such as age, sex, education, and country of residence can be introduced. Such a modification to the partial credit model is especially useful in the analysis of health survey data given that sociodemographic variables are usually collected in such surveys. Equation (2) with covariates can be written as the probability that individual i responds in category k for each of the questions j, conditional on a vector of covariates Z i : 1/A, k =1 exp(zi β δ1 j )/A, k =2 Pr(y ij = k) = exp(2zi β δ1 j δ2 j )/A, k =3 (3) exp(3zi β δ1 j δ2 j δ3 j )/A, k =4 exp(4zi β δ1 j δ2 j δ3 j δ4 j )/A, k =5, where A is the expression A 1+exp(Zi β δ1 j )+exp(2z i β δ1 j δ2 j )+exp(3z i β δ1 j δ2 j δ3 j ) +exp(4zi β δ1 j δ2 j δ3 j δ4 j ) Assuming independence across observations and questions, estimates can be computed using maximum likelihood. The mean predicted level of mobility versus truth is plotted in Figure 8 and the estimates are in Table 4. 11

12 Table 4: Estimation results : partial credit with covariates Variable Coefficient (Std. Err.) Mean Age (.24) Age (.28) Age (.32) Male -.77 (.19) 1<Educ 6.49 (.33) 1<Educ 6.19 (.34) Educ>11.16 (.41) Country B -.75 (.2) δ 1 Dummy Aux (.185) Dummy Aux (.163) Main question.272 (.144) δ 2 Dummy Aux 1.92 (.185) Dummy Aux (.166) Main question (.14) δ 3 Dummy Aux (.151) Dummy Aux (.126) Main question (.1) δ 4 Dummy Aux (.19) Dummy Aux (.99) Main question (.66) The mean level of the estimated latent variable that is plotted in Figure 8 does not account for the fact that the deterministic variation in the latent variable will be imperfectly captured by the limited set of included covariates. In the absence of a random effect, the model will overestimate the amount of stochastic variability in the data. The next subsection introduces a method for accounting for this by using Bayes theorem to estimate the predicted mobility. 2.4 Random Effects and Latent Variable Estimation using Bayes Theorem If there is an individual-level random effect in the data i.e., when covariates in our model do not capture all the systematic variation in the latent variable then there remains information content in the set of responses across questions for each individual that has not been fully exploited. The partial credit model with covariates and arandomeffect ν i with 12

13 R-squared =.55 RMSE = True mobility Predicted mobility Figure 8: Predicted versus true mobility: partial credit with covariates mean zero and variance σν 2 can be written out as follows: 1/A, k =1 exp[(zi β + ν i) δj 1 ]/A, k =2 Pr(y ij = k) = exp[2(zi β + ν i) δj 1 δ2 j ]/A, k =3 exp[3(zi β + ν i) δj 1 δ2 j δ3 j ]/A, k =4 exp[4(zi β + ν i) δj 1 δ2 j δ3 j δ4 j ]/A, k =5, (4) where A is the expression A 1+exp[(Zi β + ν i) δj 1 ]+exp[2(z i β + ν i) δj 1 δ2 j ]+exp[3(z i β + ν i) δj 1 δ2 j δ3 j ] +exp[4(zi β + ν i) δj 1 δ2 j δ3 j δ4 j ] In order to exploit the information content in the set of responses we can make use of Bayes theorem to obtain estimates of the mean level of mobility conditional of the observed set of responses. That is, we can estimate Pr(µ i y i ) using Bayes formula: Pr(µ i y i )= Pr(y i µ i )Pr(µ i ) R Pr(yi µ i )Pr(µ i )dµ i. (5) where y i represents the vector of categorical responses on all questions for individual i. The way this can be implemented is as follows. First,we use the model with a random effect and estimate all the parameters including the variance of the random effect. This estimate of the variance can be used to simulate 1 different values of µ i around the predicted Z i β of the latent variable for each individual in the sample. Hence, for each simulated value of µ i, Pr(µ i ) can be calculated. Pr(y i µ i ) can be calculated using the probability specifications given in equation (4). Integrating over all simulated values of µ i for each individual gives us the denominator of equation (5). In the absence of a model that estimates the variance of this individual-specific random effect, one can assume that the random effectcapturesabout5%ofthevariation 13

14 R-squared =.334 RMSE = True mobility Predicted mobility Figure 9: Predicted versus true mobility: partial credit with covariates (Bayesian) in estimated variance of the error term. Under this assumption, the Bayesian predication of mobility conditional on the observed pattern of responses is plotted in Figure 9 for the partial credit model with covariates. 7 It is quite remarkable that the Bayesian correction significantly improves the estimation of mobility (Figure 9) when compared with the estimation of abilities using the two-step conditional procedure for the partial credit model (Figure 6), as judged by the R-squared values. In other words, if the goal of the analyst is to estimate the underlying latent variable, then a modification of the partial credit model that allows for covariates and a random effect outperforms the simple version of the partial credit model. 2.5 Ordered Probit versus Partial Credit We have introduced two basic types of models that are widely used in the analysis of categorical data, namely the ordered probit model and the partial credit model (with ability dummies and with covariates). Fundamentally, both models assume some sort of latent variable that gives rise to an observation mechanism governed by probabilities given in equations (1) and (2). Viewed this way, the two models are quite similar differing only with respect to the functional form for the data generating mechanism and their differences in approach to modeling the probabilities: these being derived from differences in the cumulative probability function for the ordered probit model versus the focus on adjacent categories in the partial credit model. Apart from poor predictions of the underlying latent variable, both the ordered probit and the partial credit models suffer from the problem that one cannot allow the response category cut-points (τ s), or the so-called difficulty parameters (δ s), to be functions of the same covariates as the mean value of the latent variable. This is because there will be a clear 7 We have developed working versions of the models with random effects. However, they are very slow to run and we are currently trying to improve the speed of estimation. 14

15 identification problem if one does so: in the absence of additional exogenous information, neither model will be able to detect whether the effects of the covariates are on the mean value of the latent variable or on the cut-points or difficulties. This is easy to see from the equations for the predicted probabilities [equations (1) and (2)]. This is likely to be a serious shortcoming of both models in estimating cross-population comparable differences in the latent variable of interest. In simple terms, these models do not allow for a world in which the Danish not only have a higher health status, but also have different expectations for their health status relative to Ethiopians. In the next section, we introduce an innovation to both the ordered probit and partial credit models that allows for the introduction of exogenous information in the form of vignettes. Analyzing the self-report questions in conjunction with responses to vignettes allows us to identify the model such that the same set of covariates can be used to assess differences in the mean level of the underlying latent variable as well as in cut-points or difficulties. 3 Vignettes Wenowintroducetheuseofvignettesasameansofcorrectionofself-reportresponsesin order to make them cross-population comparable. A vignette is a description of a concrete level of ability on a given domain that respondents are asked to evaluate with relation to the same main question and on the same categorical response scale as the main self-report question [24]. The vignette fixes the level of ability such that variations in categorical responses are attributable to variations in response category cut-points. This introduction of exogenous information in the form of responses to vignettes allows us to identify the effects of a set of sociodemographic covariates (such as age, sex, education, country of residence, etc.) on both the level of the underlying latent variable that is being estimated as well as on the cut-points (in the ordered probit version of the model) and difficulties (in the partial credit version of the model). 8 In the WHO Multi-Country Study, there are six vignettes for the domain of mobility, each designed to capture a different level of ability on this domain. The vignettes are: Vignette 1: [Paul] is an active athlete who runs long distance races of 2 kilometers twice a week and engages in soccer with no problems. Vignette 2: [Mary] has no problems with moving around or using her hands, arms and legs. She jogs 4 kilometers twice a week without any problems. Vignette 3: [Rob] is able to walk distances of up to 2 meters without any problems but feels breathless after walking one kilometer or climbing up more than one flight of stairs. He has no problems with day-to-day physical activities, such as carrying food from the market. 8 An alternative method to set a comparable scale such that response category cut-point differences can be recovered is to use measured tests [26]. 15

16 Vignette 4: [Margaret] feels chest pain and gets breathless after walking distances of up to 2 meters, but is able to do so without assistance. Bending and lifting objects such as groceries produces pain. Vignette 5: [Louis] is able to move his arms and legs, but requires assistance in standing up from a chair or walking around the house. Any bending is painful and lifting is impossible. Vignette 6: [David] is paralyzed from the neck down. He is confined to bed and must be fed and bathed by somebody else. Respondents are asked to classify each of these vignettes on the same five-point response category scale as the main question. So, for each individual, we not only have categorical responses to their self-report main question and several auxiliary questions, but we also have their categorical responses to a set of vignettes (ranging in number from six to eight across the different domains for health and responsiveness in the WHO Multi-Country Study). In order to introduce statistical models designed around the use of vignettes, we have extended the simulated data set to include hypothetical ratings of seven mobility vignettes in countries A and B by assigning true mobility scores to the different vignettes and assuming that individuals will use the categorical response scale the same way in assessing vignettes as they do in assessing their own levels of mobility on the main question. This assumption is critical for the estimation of the models, as discussed below. The simulated vignette ratings for the two countries are summarized in Figures 1 and 11. Each graph shows the distribution of categorical responses for the set of vignettes (lighter colors signifying worse responses). The vignettes are ranked from 1 to 7 in decreasing order of ability: i.e., vignette 1 refers to a higher level of mobility than vignette 2, and vignette 3 is higher than vignette 2, and so on. From these graphs, it is clear that there are important differences in the cutpoints between country A and country B. At lower levels of mobility, respondents in country B are more likely to characterize a vignette unfavorably than respondents in country A. In addition, the compression of the middle categories in country B suggest cut-points that are more narrowly spaced than those in country A. The types of variation in vignette ratings that we have generated in the simulated dataset closely parallel the variation observed in actual data from the WHO Multi-Country Survey Study. In a later section, we show the response distributions for China versus those for India for mobility vignettes. In the following sections we describe how variants of the ordered probit model and partial credit model may be used in conjunction with vignette ratings in order to characterize these systematic cutpoint differences more precisely. Both models are modified such that: (a) information from responses to vignettes are introduced in the likelihood function, and (b) cut-points and difficulties are allowed to be functions of the same covariates as those used in the estimation of the mean value of the latent variable. 3.1 Hierarchical Ordered Probit Model (HOPIT) The hierarchical ordered probit (HOPIT) model is a modification of the standard ordered probit model described earlier. In order to incorporate information on vignette ratings and 16

17 1% 9% 8% 7% 6% 5% 4% 3% 2% 1% % Vignette Figure 1: Distribution of vignette responses for country A 1% 9% 8% 7% 6% 5% 4% 3% 2% 1% % Vignette Figure 11: Distribution of vignette responses for country B 17

18 multiple questions, the expanded model has several components to the likelihood function: the first component refers to estimation of cut-points using responses to vignettes, and the second component utilizes responses on the self-report main question. The remaining components are for auxiliary questions. In formal terms, the first component of the likelihood distributedwithmeanµ v ij and variance 1. Here, i refers to the respondent, j refers to the vignette number, and the v superscript indicates that this refers to the vignette component of the model. In mathematical terms, function assumes there is an unobserved latent variable Y v ij Yij v N(µ v ij, 1), i =1,...,N; j =1,..., V µ v ij = Ji α, where J i is a vector of indicator variables for each of V 1 vignettes. Letting y v ij denote the observed categorical response by individual i to vignette j, the observation mechanism is defined as follows: y v ij = k if τ k 1 i Y v ij < τ k i ; for τ i =, τ 5 i =, i, j & k =1,..., 5. In addition, the cut-points are allowed to be functions of covariates: As before, τ 1 i < τ 2 i < τ 3 i < τ 4 i. τ k i = X i γk, The second component of the likelihood function utilizes information from the respondent s main self-report question (the one that is tied to the vignettes) and assumes there is an unobserved latent variable Yi s distributed with mean µ s i and variance σ2. Here, the s superscript indicates that this component refers to self-report questions. This formulation is slightly different from the standard ordered probit model: since we are allowing the vignettes to drive the cut-point estimation, this second component of the likelihood function hasmoreincommonwithanintervalregressionmodel(i.e.,anorderedprobitmodelwith known cut-points). Since the cut-point estimation is being driven by vignettes and the scale is set by the first estimation component, we are now able to obtain estimates of the variance of the latent variable (i.e., there is no need to set the variance equal to 1 as before). In mathematical terms, the model is: Yi s N(µ s i, σ 2 ), i =1,..., N µ s i = Z i β. Let y s i be the observed categorical responses on the self-report such that: y s i = k if τ k 1 i Y s i < τ k i ; for τ i =, τ 5 i =, i & k =1,..., 5 Similarly for the auxiliary questions, let a j i be the observed categorical responses on the j-th auxiliary question such that: and a j i = k, if τ j,k 1 i Y s i < τ j,k i ; for τ j, i =, τ j,5 i =, i & k =1,..., 5 τ j,k i = X i γj,k 18

19 R-squared =.495 RMSE = True mobility Predicted mobility Figure 12: Predicted versus true mobility: HOPIT It is assumed that Yi s & Yi s are independent i 6= i, conditional on X i. Yij v & Yi s are independent i, j conditional on X i,j i and Z i. The probabilities associated with the observed responses to vignettes, the main question, and the auxiliary questions can be computed as in equation (1) with the adjustment for cut-point shifts being functions of covariates. The likelihood function can be written using these probabilities as three separate components. The three components of the likelihood function are additive in logs and can be jointly maximized to yield the parameter estimates. There is explicit parametric dependence between the different components of the likelihood function. The cut-points to be estimated from the vignettes component are the same as those in the main question component. In addition, µ s i is the same for both the main question and all the auxiliary questions. This ensures that the estimated cut-points for both the main question and the auxiliary questions are on the same scale to enable meaningful comparisons. Tables 5 to 9 report the results of the estimation in the Annex. Figure 12 plots the estimatesofthemeanlevelversustruth. TheR-squared for the prediction has improved when compared with the simple ordered probit model as well as with the partial credit models with and without covariates. Figure 13 reports the true versus estimated cutpoints for the main question. These differ by sociodemographic group in that they are also functions of the same covariates (age, sex, education, and country of residence) as the mean level of the mobility. As can be seen, the model is able to recover the cut-point differences quite well. Figures 14 and 15 report the comparison of estimated cut-points to truth for the two auxiliary questions. The recovery here is not quite as good as that for the main question. This is to be expected since the information in the vignettes are directly driving the main question cut-points, whereas the estimation of the cut-points for the auxiliary questions is more indirect and is not anchored to the cut-points derived from vignette responses. The estimation of the latent variable using Bayes theorem (Figure 16) improves the R-squared quite significantly, yielding estimates of mobility that are quite close to the true mobility levels in the underlying simulated data. 19

20 First cut-point Second cut-point Third cut-point Fourth cut-point 5 True cut-points Predicted cut-points Figure 13: Predicted versus true cut-points: HOPIT main question 5 First cut-point Second cut-point Third cut-point Fourth cut-point True cut-points Predicted cut-points Figure 14: Predicted versus true cut-points: HOPIT auxiliary question 1 2

21 First cut-point Second cut-point Third cut-point Fourth cut-point 5 True cut-points Predicted cut-points Figure 15: Predicted versus true cut-points: HOPIT auxiliary question 2 5 R-squared =.729 RMSE = True mobility Predicted mobility Figure 16: Predicted versus true mobility: HOPIT (Bayesian) 21

22 3.2 Hierarchical Partial Credit Model In analogy to the HOPIT model, we implement the use of vignettes in exactly the same way for the Rasch-based partial credit model. We allow for responses to vignettes to set the difficulty levels and estimate differences across sociodemographic groups in the first component of the likelihood function. In the other components of the likelihood, we utilize information from the main and auxiliary questions. The logic is the same as before: we are using information on difficulty parameters from responses on vignettes to allow us to have covariates that affect both the mean level of the estimated latent variable as well as the difficulty parameters. For all the vignette questions, i.e., for j = 1,...,V: 1/A, k =1 exp(j Pr(yi v i α δ1 i )/A, k =2 = k) = exp(2ji α δ1 i δ2 i )/A, k =3 (6) exp(3ji α δ1 i δ2 i δ3 i )/A, k =4 exp(4ji α δ1 i δ2 i δ3 i δ4 i )/A, k =5, where J i is a vector of indicator variables for each of V 1 vignettes, and A is the expression A 1+exp(Ji α δ1 i )+exp(2j i α δ1 i δ2 i )+exp(3j i α δ1 i δ2 i δ3 i ) +exp(4jiα δi 1 δi 2 δi 3 δi 4 ) and, δ k i = X iβ k Similarly, the probabilities for the main question (the one which is tied to the vignettes): 1/A, k =1 exp(z Pr(yi s i β δ1 i )/A, k =2 = k) = exp(2zi β δ1 i δ2 i )/A, k =3 (7) exp(3zi β δ1 i δ2 i δ3 i )/A, k =4 exp(4zi β δ1 i δ2 i δ3 i δ4 i )/A, k =5, where Z i is a vector of individual-level covariates, and A is the expression A 1+exp(Ziβ δi 1 )+exp(2ziβ δi 1 δi 2 )+exp(3ziβ δi 1 δi 2 δi 3 ) +exp(4ziβ δi 1 δi 2 δi 3 δi 4 ) And for the j-th auxiliary question: 1/A, k =1 exp(z Pr(yij s i β δ1 ij )/A, k =2 = k) = exp(2zi β δ1 ij δ2 ij )/A, k =3 exp(3zi β δ1 ij δ2 ij δ3 ij )/A, k =4 exp(4zi β δ1 ij δ2 ij δ3 ij δ4 ij )/A, k =5, (8) where Z i is a vector of individual-level covariates, and A is the expression A 1+exp(Zi β δ1 ij )+exp(2z i β δ1 ij δ2 ij )+exp(3z i β δ1 ij δ2 ij δ3 ij ) +exp(4zi β δ1 ij δ2 ij δ3 ij δ4 ij ) 22

23 R-squared =.487 RMSE = True mobility Predicted mobility Figure 17: Predicted versus true mobility: partial credit model 5 R-squared =.683 RMSE = True mobility Predicted mobility Figure 18: Predicted versus true mobility: partial credit model (Bayesian) Tables 1 to 14 in the Annex report the results of this estimation. Figures 17 and 18 show the predicted mobility versus the true mobility before and after the Bayesian correction. The R-squared values obtained from the hierarchical ordered probit model for predicted mobility are similar in magnitude for the pre-bayesian estimates obtained using the HOPIT model. The post-bayesian estimation appears to be slightly higher for HOPIT than for the hierarchical partial credit model. This may result from the fact that the hierarchical partial credit model, in the way we have formulated it, does not estimate the variance of the stochastic term. This constraint will inhibit the model from fitting the data as well as it could if the variance were included as a parameter. 23

24 4 Goodness-of-Fit Assessing goodness-of-fit for categorical data is not straightforward. One can compute a simple count-r 2 which is a measure of the proportion of correct responses obtained for a given sample. For ordinal data, the predicted categorical response would be the one associated with the maximum predicted probability. Other options include a pseudo-r 2 measure, which in software such as STATA, is a likelihood-based comparison of the model with all the parameters to one with only the intercept [12]. Rasch-based models use measures of fit suchas outfit and infit : outfit is a chi-square test based on the sum of the standardized deviation of observed versus expected values of a response. Infit is also a chisquare test which utilizes an information-weighted sum by adjusting for extreme responses using weights [32]. In order to assess model fit, a standard likelihood ratio test can be used. These tests compare the log-likelihood value of the full model with a constrained version of the same model (i.e., a model that is nested within the full model) to assess the contribution of the dropped covariates to the likelihood function. Assume L is the log-likelihood value associated with the full model and L 1 is the log-likelihood value of the constrained model. Then 2(L 1 L ) is distributed χ 2 with d d 1 degrees of freedom, where d and d 1 are the model degrees of freedom associated with the full and the constrained models, respectively [12]. 5 Unidimensionality Both the HOPIT model and the Rasch-based models in IRT assume some form of unidimensionality. In formal terms, unidimensionality can be defined as the assumption that any dependence between different questions tapping into a given domain is solely due to the existence of a single underlying latent trait. Tests of unidimensionality are often based on uncovering this assumed factor that underlies observed responses to multiple question. Mathematically, the assumption of unidimensionality can be worked out by assuming responses to all questions on a given domain are tapping this latent trait. In the WHO Multi-Country Study, test-retest data are available from a subsample of respondents who were revisited and administered the survey questionnaire for a second time. This availability of test-retest data can be used to design a test of unidimensionality. Suppose we get latent variable estimates from two separate questions on any given domain, Y1 and Y 2. Each of these estimates of the latent variable represents some measure of truth with error. That is, if truth were denoted by Ytrue, then: and Y 1test = Y true + ² 1test Y 1retest = Y true + ² 1retest Y 2test = Y true + ² 2test Y 2retest = Y true + ² 2retest 24

25 Here, ² 1 and ² 2 are the question-specific error terms for both test and retest questions, ² 1 N(, σ 2 ² 1 ), ² 2 N(, σ 2 ² 2 ). The correlation coefficient ρ between the measured Y s is: Rewriting (9), ρ = cov(y 1,Y 2 ) = σ Y 1 σ Y 2 ρ = cov(y 1,Y 2 ) (9) σ Y 1 σ Y 2 cov(y 1,Y 2 ) q σ 2 Y true + σ2 ² 1 q σ 2 Y true + σ 2 ² 2 (1) Similarly, ρ = cov(y true,y true) (11) σ Y true σ Y true Dividing (11) by (1), q ρ σ 2 ρ = Y + q σ2 ² true 1 σ 2 Y + σ 2 true ² 2, σ Y true σ Y true since cov(y1,y 2 )=cov(y true,y true) if the error terms are assumed to be uncorrelated. Therefore, v v ρ u = t σ2 Ytrue + σ2 ² 1 u t σ2 Ytrue + σ2 ² 2 ρ =1 σ 2 Y true σ 2 Y true where σ² 2 i = var(y itest Y iretest ) 2 for i =1, 2. Given that both σy 2 true =cov(y 1,Y 2 )andρ are observed, the above expression should equal 1. This can form the basis of a test of unidimensionality using information from test-retest data. 6 Discussion One of the key conclusions of this paper is that adjustments are needed to make survey results comparable across populations. In particular, when categorical variables are involved, analyses must account for differences in response category cut-points. There is considerable evidence that suggests that response category cut-points are different across countries. Therefore, until variation in cut-points is addressed, one must start from a presumption that results are not comparable across populations. The problem of cross-population comparability also appears to apply within populations across different socio-economic and demographic groups. This has important implications for the measurement of inequality, which may be greater or smaller than measured before taking into account response category cut-point shifts. It also has critical implications for comparisons over time. Cut-points may systematically shift over time (e.g., due to rising income, education, and health norms) so long-term trends may be difficult to assess without correction. 25

26 7 Annex Table5:Estimationresults:HOPIT Variable Coefficient (Std. Err.) Vignettes Ivignette (.146) Ivignette (.117) Ivignette (.122) Ivignette (.123) Ivignette (.175) Ivignette (.195) Mean Age (.85) Age (.1) Age (.113) Male.174 (.68) 1<Educ (.115) 6<Educ (.122) Educ> (.147) Country B.996 (.74) Intercept (.166) log(s).61 (.43) 26

27 Table 6: Estimation results : HOPIT τ 1 Variable Coefficient (Std. Err.) Main question Age (.46) Age (.54) Age (.62) Male.25 (.37) 1<Educ 6.79 (.61) 6<Educ (.65) Educ> (.79) Country B (.41) Intercept (.134) Auxiliary question 1 Age (.132) Age (.156) Age (.17) Male.227 (.16) 1<Educ 6.56 (.175) 6<Educ (.185) Educ> (.227) Country B.928 (.113) Intercept (.223) Auxiliary question 2 Age (.113) Age (.134) Age (.155) Male.464 (.92) 1<Educ (.157) 6<Educ (.166) Educ> (.197) Country B 1.26 (.99) Intercept (.22) 27

28 Table 7: Estimation results : HOPIT τ 2 Variable Coefficient (Std. Err.) Main question Age (.48) Age (.56) Age (.63) Male.25 (.38) 1<Educ 6.53 (.62) 6<Educ (.66) Educ>11.72 (.81) Country B (.43) Intercept (.134) Auxiliary question 1 Age (.125) Age (.145) Age (.164) Male.271 (.1) 1<Educ (.165) 6<Educ (.175) Educ>11.14 (.212) Country B.839 (.17) Intercept (.21) Auxiliary question 2 Age (.112) Age (.133) Age (.155) Male.46 (.92) 1<Educ (.157) 6<Educ (.165) Educ> (.196) Country B (.98) Intercept (.21) 28

29 Table 8: Estimation results : HOPIT τ 3 Variable Coefficient (Std. Err.) Main question Age (.51) Age (.59) Age (.65) Male.227 (.4) 1<Educ 6.89 (.65) 6<Educ (.7) Educ> (.87) Country B (.46) Intercept (.135) Auxiliary question 1 Age (.118) Age (.138) Age (.158) Male.217 (.95) 1<Educ (.159) 6<Educ (.168) Educ>11.73 (.24) Country B.836 (.11) Intercept (.23) Auxiliary question 2 Age (.113) Age (.136) Age (.162) Male.345 (.93) 1<Educ 6.26 (.16) 6<Educ (.168) Educ> (.2) Country B (.1) Intercept (.23) 29

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure. Rob Cavanagh Len Sparrow Curtin University

Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure. Rob Cavanagh Len Sparrow Curtin University Measuring mathematics anxiety: Paper 2 - Constructing and validating the measure Rob Cavanagh Len Sparrow Curtin University R.Cavanagh@curtin.edu.au Abstract The study sought to measure mathematics anxiety

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such

More information

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Dipak K. Dey, Ulysses Diva and Sudipto Banerjee Department of Statistics University of Connecticut, Storrs. March 16,

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

Psychology, 2010, 1: doi: /psych Published Online August 2010 (

Psychology, 2010, 1: doi: /psych Published Online August 2010 ( Psychology, 2010, 1: 194-198 doi:10.4236/psych.2010.13026 Published Online August 2010 (http://www.scirp.org/journal/psych) Using Generalizability Theory to Evaluate the Applicability of a Serial Bayes

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection Esma Nur Cinicioglu * and Gülseren Büyükuğur Istanbul University, School of Business, Quantitative Methods

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data

Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data Karl Bang Christensen National Institute of Occupational Health, Denmark Helene Feveille National

More information

Remarks on Bayesian Control Charts

Remarks on Bayesian Control Charts Remarks on Bayesian Control Charts Amir Ahmadi-Javid * and Mohsen Ebadi Department of Industrial Engineering, Amirkabir University of Technology, Tehran, Iran * Corresponding author; email address: ahmadi_javid@aut.ac.ir

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 39 Evaluation of Comparability of Scores and Passing Decisions for Different Item Pools of Computerized Adaptive Examinations

More information

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION Timothy Olsen HLM II Dr. Gagne ABSTRACT Recent advances

More information

Systematic measurement error in selfreported health: is anchoring vignettes the way out?

Systematic measurement error in selfreported health: is anchoring vignettes the way out? Dasgupta IZA Journal of Development and Migration (2018) 8:12 DOI 10.1186/s40176-018-0120-z IZA Journal of Development and Migration ORIGINAL ARTICLE Systematic measurement error in selfreported health:

More information

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Bob Obenchain, Risk Benefit Statistics, August 2015 Our motivation for using a Cut-Point

More information

Political Science 15, Winter 2014 Final Review

Political Science 15, Winter 2014 Final Review Political Science 15, Winter 2014 Final Review The major topics covered in class are listed below. You should also take a look at the readings listed on the class website. Studying Politics Scientifically

More information

Basic concepts and principles of classical test theory

Basic concepts and principles of classical test theory Basic concepts and principles of classical test theory Jan-Eric Gustafsson What is measurement? Assignment of numbers to aspects of individuals according to some rule. The aspect which is measured must

More information

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes Content Quantifying association between continuous variables. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General

More information

Assessing the Effectiveness of Anchoring Vignettes in Bias Reduction for Socioeconomic Disparities in Self-Rated Health among Chinese Adults

Assessing the Effectiveness of Anchoring Vignettes in Bias Reduction for Socioeconomic Disparities in Self-Rated Health among Chinese Adults Assessing the Effectiveness of Anchoring Vignettes in Bias Reduction for Socioeconomic Disparities in Self-Rated Health among Chinese Adults Hongwei Xu Survey Research Center University of Michigan 426

More information

Econometric Game 2012: infants birthweight?

Econometric Game 2012: infants birthweight? Econometric Game 2012: How does maternal smoking during pregnancy affect infants birthweight? Case A April 18, 2012 1 Introduction Low birthweight is associated with adverse health related and economic

More information

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1 Nested Factor Analytic Model Comparison as a Means to Detect Aberrant Response Patterns John M. Clark III Pearson Author Note John M. Clark III,

More information

Connexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan

Connexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Connexion of Item Response Theory to Decision Making in Chess Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan Acknowledgement A few Slides have been taken from the following presentation

More information

Selection of Linking Items

Selection of Linking Items Selection of Linking Items Subset of items that maximally reflect the scale information function Denote the scale information as Linear programming solver (in R, lp_solve 5.5) min(y) Subject to θ, θs,

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

Item Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses

Item Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses Item Response Theory Steven P. Reise University of California, U.S.A. Item response theory (IRT), or modern measurement theory, provides alternatives to classical test theory (CTT) methods for the construction,

More information

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Impact and adjustment of selection bias. in the assessment of measurement equivalence Impact and adjustment of selection bias in the assessment of measurement equivalence Thomas Klausch, Joop Hox,& Barry Schouten Working Paper, Utrecht, December 2012 Corresponding author: Thomas Klausch,

More information

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions J. Harvey a,b, & A.J. van der Merwe b a Centre for Statistical Consultation Department of Statistics

More information

Scale Building with Confirmatory Factor Analysis

Scale Building with Confirmatory Factor Analysis Scale Building with Confirmatory Factor Analysis Latent Trait Measurement and Structural Equation Models Lecture #7 February 27, 2013 PSYC 948: Lecture #7 Today s Class Scale building with confirmatory

More information

Introduction to Bayesian Analysis 1

Introduction to Bayesian Analysis 1 Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

Nearest-Integer Response from Normally-Distributed Opinion Model for Likert Scale

Nearest-Integer Response from Normally-Distributed Opinion Model for Likert Scale Nearest-Integer Response from Normally-Distributed Opinion Model for Likert Scale Jonny B. Pornel, Vicente T. Balinas and Giabelle A. Saldaña University of the Philippines Visayas This paper proposes that

More information

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX The Impact of Relative Standards on the Propensity to Disclose Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX 2 Web Appendix A: Panel data estimation approach As noted in the main

More information

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many

More information

Chapter 1: Explaining Behavior

Chapter 1: Explaining Behavior Chapter 1: Explaining Behavior GOAL OF SCIENCE is to generate explanations for various puzzling natural phenomenon. - Generate general laws of behavior (psychology) RESEARCH: principle method for acquiring

More information

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics. Dae-Jin Lee dlee@bcamath.org Basque Center for Applied Mathematics http://idaejin.github.io/bcam-courses/ D.-J. Lee (BCAM) Intro to GLM s with R GitHub: idaejin 1/40 Modeling count data Introduction Response

More information

Measurement and meaningfulness in Decision Modeling

Measurement and meaningfulness in Decision Modeling Measurement and meaningfulness in Decision Modeling Brice Mayag University Paris Dauphine LAMSADE FRANCE Chapter 2 Brice Mayag (LAMSADE) Measurement theory and meaningfulness Chapter 2 1 / 47 Outline 1

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Underweight Children in Ghana: Evidence of Policy Effects. Samuel Kobina Annim

Underweight Children in Ghana: Evidence of Policy Effects. Samuel Kobina Annim Underweight Children in Ghana: Evidence of Policy Effects Samuel Kobina Annim Correspondence: Economics Discipline Area School of Social Sciences University of Manchester Oxford Road, M13 9PL Manchester,

More information

Performance of Median and Least Squares Regression for Slightly Skewed Data

Performance of Median and Least Squares Regression for Slightly Skewed Data World Academy of Science, Engineering and Technology 9 Performance of Median and Least Squares Regression for Slightly Skewed Data Carolina Bancayrin - Baguio Abstract This paper presents the concept of

More information

Estimating Reliability in Primary Research. Michael T. Brannick. University of South Florida

Estimating Reliability in Primary Research. Michael T. Brannick. University of South Florida Estimating Reliability 1 Running Head: ESTIMATING RELIABILITY Estimating Reliability in Primary Research Michael T. Brannick University of South Florida Paper presented in S. Morris (Chair) Advances in

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait.

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait. The Extended Liability-Threshold Model Klaus K. Holst Thomas Scheike, Jacob Hjelmborg 014-05-1 Heritability Twin studies Include both monozygotic (MZ) and dizygotic (DZ) twin

More information

Structural Equation Modeling (SEM)

Structural Equation Modeling (SEM) Structural Equation Modeling (SEM) Today s topics The Big Picture of SEM What to do (and what NOT to do) when SEM breaks for you Single indicator (ASU) models Parceling indicators Using single factor scores

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

SELECTION BIAS IN EDUCATIONAL TRANSITION MODELS: THEORY AND EMPIRICAL EVIDENCE

SELECTION BIAS IN EDUCATIONAL TRANSITION MODELS: THEORY AND EMPIRICAL EVIDENCE :2008 WORKING PAPER Anders Holm Mads Meier Jæger SELECTION BIAS IN EDUCATIONAL TRANSITION MODELS: THEORY AND EMPIRICAL EVIDENCE RESEARCH DEPARTMENT OF SOCIAL POLICY AND WELFARE SERVICES Selection Bias

More information

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan Kosuke Imai Princeton University Joint work with Will Bullock and Jacob

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

On indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state

On indirect measurement of health based on survey data. Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state On indirect measurement of health based on survey data Responses to health related questions (items) Y 1,..,Y k A unidimensional latent health state A scaling model: P(Y 1,..,Y k ;α, ) α = item difficulties

More information

Do Danes and Italians rate life satisfaction in the same way?

Do Danes and Italians rate life satisfaction in the same way? Do Danes and Italians rate life satisfaction in the same way? Using vignettes to correct for individual-specific scale biases Viola Angelini 1 Danilo Cavapozzi 2 Luca Corazzini 2 Omar Paccagnella 2 1 University

More information

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Ruth Penman and Gillian Heller, Department of Statistics, Macquarie University, Sydney, Australia. Correspondence: Ruth Penman, Department

More information

Liability Threshold Models

Liability Threshold Models Liability Threshold Models Frühling Rijsdijk & Kate Morley Twin Workshop, Boulder Tuesday March 4 th 2008 Aims Introduce model fitting to categorical data Define liability and describe assumptions of the

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD Psy 427 Cal State Northridge Andrew Ainsworth, PhD Contents Item Analysis in General Classical Test Theory Item Response Theory Basics Item Response Functions Item Information Functions Invariance IRT

More information

Russian Journal of Agricultural and Socio-Economic Sciences, 3(15)

Russian Journal of Agricultural and Socio-Economic Sciences, 3(15) ON THE COMPARISON OF BAYESIAN INFORMATION CRITERION AND DRAPER S INFORMATION CRITERION IN SELECTION OF AN ASYMMETRIC PRICE RELATIONSHIP: BOOTSTRAP SIMULATION RESULTS Henry de-graft Acquah, Senior Lecturer

More information

Lecturer: Rob van der Willigen 11/9/08

Lecturer: Rob van der Willigen 11/9/08 Auditory Perception - Detection versus Discrimination - Localization versus Discrimination - - Electrophysiological Measurements Psychophysical Measurements Three Approaches to Researching Audition physiology

More information

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE 1. When you assert that it is improbable that the mean intelligence test score of a particular group is 100, you are using. a. descriptive

More information

Research and Evaluation Methodology Program, School of Human Development and Organizational Studies in Education, University of Florida

Research and Evaluation Methodology Program, School of Human Development and Organizational Studies in Education, University of Florida Vol. 2 (1), pp. 22-39, Jan, 2015 http://www.ijate.net e-issn: 2148-7456 IJATE A Comparison of Logistic Regression Models for Dif Detection in Polytomous Items: The Effect of Small Sample Sizes and Non-Normality

More information

Lecturer: Rob van der Willigen 11/9/08

Lecturer: Rob van der Willigen 11/9/08 Auditory Perception - Detection versus Discrimination - Localization versus Discrimination - Electrophysiological Measurements - Psychophysical Measurements 1 Three Approaches to Researching Audition physiology

More information

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to CHAPTER - 6 STATISTICAL ANALYSIS 6.1 Introduction This chapter discusses inferential statistics, which use sample data to make decisions or inferences about population. Populations are group of interest

More information

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

CHAPTER VI RESEARCH METHODOLOGY

CHAPTER VI RESEARCH METHODOLOGY CHAPTER VI RESEARCH METHODOLOGY 6.1 Research Design Research is an organized, systematic, data based, critical, objective, scientific inquiry or investigation into a specific problem, undertaken with the

More information

Statistical reports Regression, 2010

Statistical reports Regression, 2010 Statistical reports Regression, 2010 Niels Richard Hansen June 10, 2010 This document gives some guidelines on how to write a report on a statistical analysis. The document is organized into sections that

More information

Fundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2

Fundamental Concepts for Using Diagnostic Classification Models. Section #2 NCME 2016 Training Session. NCME 2016 Training Session: Section 2 Fundamental Concepts for Using Diagnostic Classification Models Section #2 NCME 2016 Training Session NCME 2016 Training Session: Section 2 Lecture Overview Nature of attributes What s in a name? Grain

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB Bryan Orme, Sawtooth Software, Inc. Copyright 009, Sawtooth Software, Inc. 530 W. Fir St. Sequim,

More information

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests David Shin Pearson Educational Measurement May 007 rr0701 Using assessment and research to promote learning Pearson Educational

More information

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status Simultaneous Equation and Instrumental Variable Models for Seiness and Power/Status We would like ideally to determine whether power is indeed sey, or whether seiness is powerful. We here describe the

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still

More information

Bruno D. Zumbo, Ph.D. University of Northern British Columbia

Bruno D. Zumbo, Ph.D. University of Northern British Columbia Bruno Zumbo 1 The Effect of DIF and Impact on Classical Test Statistics: Undetected DIF and Impact, and the Reliability and Interpretability of Scores from a Language Proficiency Test Bruno D. Zumbo, Ph.D.

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

MBA 605 Business Analytics Don Conant, PhD. GETTING TO THE STANDARD NORMAL DISTRIBUTION

MBA 605 Business Analytics Don Conant, PhD. GETTING TO THE STANDARD NORMAL DISTRIBUTION MBA 605 Business Analytics Don Conant, PhD. GETTING TO THE STANDARD NORMAL DISTRIBUTION Variables In the social sciences data are the observed and/or measured characteristics of individuals and groups

More information

Chapter 02. Basic Research Methodology

Chapter 02. Basic Research Methodology Chapter 02 Basic Research Methodology Definition RESEARCH Research is a quest for knowledge through diligent search or investigation or experimentation aimed at the discovery and interpretation of new

More information

Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model

Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model Delia North Temesgen Zewotir Michael Murray Abstract In South Africa, the Department of Education allocates

More information

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8 Experimental Design Dewayne E Perry ENS 623 Perry@ece.utexas.edu 1 Problems in Experimental Design 2 True Experimental Design Goal: uncover causal mechanisms Primary characteristic: random assignment to

More information

University of Oxford Intermediate Social Statistics: Lecture One

University of Oxford Intermediate Social Statistics: Lecture One University of Oxford Intermediate Social Statistics: Lecture One Raymond M. Duch Nuffield College Oxford www.raymondduch.com @RayDuch January 17, 2012 Course Requirements Course Requirements Eight Lectures

More information

Students' perceived understanding and competency in probability concepts in an e- learning environment: An Australian experience

Students' perceived understanding and competency in probability concepts in an e- learning environment: An Australian experience University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2016 Students' perceived understanding and competency

More information

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational data Improving health worldwide www.lshtm.ac.uk Outline Sensitivity analysis Causal Mediation

More information

Addendum: Multiple Regression Analysis (DRAFT 8/2/07)

Addendum: Multiple Regression Analysis (DRAFT 8/2/07) Addendum: Multiple Regression Analysis (DRAFT 8/2/07) When conducting a rapid ethnographic assessment, program staff may: Want to assess the relative degree to which a number of possible predictive variables

More information

Chapter 1: Introduction to Statistics

Chapter 1: Introduction to Statistics Chapter 1: Introduction to Statistics Variables A variable is a characteristic or condition that can change or take on different values. Most research begins with a general question about the relationship

More information

Item Analysis: Classical and Beyond

Item Analysis: Classical and Beyond Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013 Why is item analysis relevant? Item analysis provides

More information

Issues That Should Not Be Overlooked in the Dominance Versus Ideal Point Controversy

Issues That Should Not Be Overlooked in the Dominance Versus Ideal Point Controversy Industrial and Organizational Psychology, 3 (2010), 489 493. Copyright 2010 Society for Industrial and Organizational Psychology. 1754-9426/10 Issues That Should Not Be Overlooked in the Dominance Versus

More information

CHAPTER 3 RESEARCH METHODOLOGY

CHAPTER 3 RESEARCH METHODOLOGY CHAPTER 3 RESEARCH METHODOLOGY 3.1 Introduction 3.1 Methodology 3.1.1 Research Design 3.1. Research Framework Design 3.1.3 Research Instrument 3.1.4 Validity of Questionnaire 3.1.5 Statistical Measurement

More information

Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University.

Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies. Xiaowen Zhu. Xi an Jiaotong University. Running head: ASSESS MEASUREMENT INVARIANCE Assessing Measurement Invariance in the Attitude to Marriage Scale across East Asian Societies Xiaowen Zhu Xi an Jiaotong University Yanjie Bian Xi an Jiaotong

More information

Empirical Formula for Creating Error Bars for the Method of Paired Comparison

Empirical Formula for Creating Error Bars for the Method of Paired Comparison Empirical Formula for Creating Error Bars for the Method of Paired Comparison Ethan D. Montag Rochester Institute of Technology Munsell Color Science Laboratory Chester F. Carlson Center for Imaging Science

More information