MIXED MODEL ANALYSIS OF ORDINAL RESPONSE DATA WITH TWO NESTED LEVELS OF CLUSTERING: AN APPLICATION INVOLVING A CLINICAL TRIAL

Size: px
Start display at page:

Download "MIXED MODEL ANALYSIS OF ORDINAL RESPONSE DATA WITH TWO NESTED LEVELS OF CLUSTERING: AN APPLICATION INVOLVING A CLINICAL TRIAL"

Transcription

1 Pak. J. Statist Vol. 27(1), MIXED MODEL ANALYSIS OF ORDINAL RESPONSE DATA WITH TWO NESTED LEVELS OF CLUSTERING: AN APPLICATION INVOLVING A CLINICAL TRIAL Sadia Mahmud 1, Ursula Chohan 2, Gauhar Afshan 2 and M. Qamar-ul-Hoda 2 1 Department of Community Health Sciences, The Aga Khan University, Karachi, Pakistan. sadia.mahmud@aku.edu; mahmud_sadia@yahoo.com 2 Department of Anaesthesia, The Aga Khan University, Karachi, Pakistan ABSTRACT In this paper we present analysis of ordinal response data, from a clinical trial, with two nested levels of clustering using the generalized linear mixed model approach. We present a generalization of the continuation ratio model with two nested random effects. The model was fitted using maximum marginal likelihood estimation, assuming the nested random effects are independent and normally distributed. The clinical trial recruited high risk patients coming for unilateral lower limb surgeries. All patients were given unilateral spinal anaesthesia that intends to anesthetize only the leg that has to be operated; motor level was the ordinal response in the trial, measured repeatedly over time for both legs. The objective of the analysis was to evaluate the difference in the severity of motor level paralysis between the operated and the non-operated leg, and also to assess the effect of time on progression of motor level to higher degrees of paralysis. There was a significant interaction between the covariates time and leg; at baseline progression rate to higher degrees of paralysis was more rapid for the operated relative to the nonoperated leg. The progression rate levelled off by the end of the study, at 30 minutes, more prominently for the non-operated leg. The continuation ratio method, that makes a series of comparison of all lower categories on a scale to the next succeeding one, intricately models the progression of motor level paralysis with time to higher degrees, and gives insight into the mechanism of progression. KEY WORDS Continuation ratio model; three-level data; nested random effects; maximum marginal likelihood estimation. 1. INTRODUCTION Ordinal responses are commonly encountered in medical and social science research. An ordinal response variable consists of ordered categories such as level of pain graded as no pain, mild, moderate and severe pain. Different ways of generalizing the logit model to handle ordered categories have been suggested (Allison 1999; Agresti 1990; Lindsey et al. 1997). Moreover, in community based studies, and in clinical trials one 2011 Pakistan Journal of Statistics 31

2 32 Mixed Model Analysis of Ordinal Response Data encounters measurements recorded repeatedly on the same subject over time or some other dimension. In some situations there can be two nested levels of clustering, for example in an ophthalmology trial both eyes of a group of patients can be followed longitudinally. Such a data structure has been referred to as three-level data, the first level corresponds to the longitudinal measurements, the second level to an eye and the third level to the patient. Regression analysis of multilevel data structure for a continuous response is very developed and can easily be conducted using standard software (Littell et al. 1996). Multilevel analysis of nominal and ordinal responses has received increasing attention in recent years (Gibbons and Hedeker 1997; Ten Have et al. 1999; Hartzel et al. 2001; Raman and Hedeker 2005; Liu and Hedeker 2006). For logistic regression models involving three-level data it is important to adjust for both levels of clustering, and ignoring either can lead to bias in estimating regression parameters under certain conditions (Ten Have et al. 1999). Ten Have et al. (1999) adjusted for two nested levels of clustering for binary response data using a mixed effects logistic regression model. Gibbons and Hedeker (Gibbons and Hedeker 1997) suggested mixed effects probit and logistic regression models for three-level binary response data that was extended to an ordinal response by Raman and Hedeker (Raman and Hedeker 2005) and Liu and Hedeker (2006) using cumulative logit link. Hartzel et al. (2001) presented a general approach for logit random effects modeling of clustered ordinal and nominal responses, they used adjacent-categories logit to model the ordinal response and the random effect structure in their model was not nested. In this paper we present regression analysis of ordinal response data with two nested levels of clustering using the generalized linear mixed model approach. We generalized the continuation ratio method that models the ordinal character of the response variable, to adjust for the clustering structure in the data by including two nested random effects in the model. As an example of repeated ordinal responses with two nested levels of clustering, we consider a clinical trial conducted at the Department of Anesthesia, The Aga Khan University. The conventional method to induce spinal anesthesia for lower limb surgeries is bilateral spinal anesthesia that anesthetizes both legs. Cardiovascular system may be profoundly affected by the spinal anesthesia. Hypotension is the most frequent side effect (Carpenter et al. 1992). The unilateral spinal anesthesia, that anesthetizes only the leg that has to be operated, has been suggested as an alternative technique to restrict the undesired effects (Casati et al. 1998). A clinical trial was conducted with the primary objective to evaluate whether unilateral spinal anesthesia can avoid the undesired side effects such as low blood pressure and increased heart-rate accompanying the conventional bilateral spinal anesthesia for high risk patients (Chohan et al. 2002). The subjects comprised of ASA 1 III and IV patients, aged years, undergoing unilateral lower limb surgeries. This group of patients is considered as a high risk group. All patients received unilateral spinal anesthesia. Response variables such as systolic and diastolic blood pressure and heart-rate, measured at baseline and at every 5 minutes for 30 minutes after initiation of spinal anesthesia, are referred to as the haemodynamic variables and were measured on a continuous scale. For a Gaussian response with one level of clustering it is straight 1 American Society of Anesthesiologists

3 Sadia Mahmud et al. 33 forward to perform repeated measures Analysis of Variance available on SAS and other statistical packages. The high risk patients, in the present trial, showed minimal haemodynamic changes following unilateral spinal anesthesia (Chohan et al. 2002).. The motor level measured on the Bromage scale 2 was the ordinal response in the above mentioned clinical trial. The Bromage scale measures progression to higher degrees of motor level paralysis starting from No paralysis. Data for 30 patients were available for the analysis. For each patient motor level was measured both for the operated and the non-operated leg at baseline and at every 5 minutes, for 30 minutes after initiation of spinal anesthesia. A secondary objective of the above mentioned clinical trial was to evaluate the difference in the severity of motor level paralysis between the operated and the non-operated leg, and also to assess the pattern of progression to higher degrees of paralysis with time for each leg of a patient receiving unilateral spinal anesthesia. Ideally unilateral spinal anesthesia should paralyze only the leg to be operated, however there could be some spillover effects on the non-operative leg. In this paper we report regression analysis of the ordinal motor level responses, taking the clustering structure of the data into account. In this three-level data structure, the longitudinal measurements (at 7 time points) are at the first level, legs (operated/nonoperated) are at the second level and patients at the third level. Table 1 gives a visual presentation of the data in a contingency table showing how the proportions of measurements in various categories are changing over time for the two legs. At the baseline ( t 0) all the 30 patients were recorded as having No paralysis (category 1) for both legs. With the passage of time the proportions in categories corresponding to higher degrees of paralysis increase much more rapidly for the operated as compared to the non-operated leg. At t 5 minutes about 23% of the patients were recorded as having No paralysis (category 1) for the non-operated leg whereas only 3% had No paralysis for the operated leg. At t 10 minutes 80% of the patients had complete paralysis (category 5) on the operated whereas only about 7% had complete paralysis on the non-operated leg. At the end of the study ( t 30 minutes) about 97% of the patients had complete paralysis on the operated whereas only about 37% had complete paralysis on the non-operated leg. In section 2.1 we report the analysis of the ordinal motor level responses using the generalized linear model approach that is extended to generalized linear mixed model in section 2.2. Section 3 gives a discussion of the analysis approach, noting its limitations and some discussion of alternative methods for regression analysis of clustered categorical data. 2.1 Generalized Linear Model The models that deal with regression analysis involving an ordinal response variable are generalization of logit model for a binary response (Allison 1999; Agresti 1990). The 2 Bromage scale: 1. No Paralysis 2. Inability to raise extended leg. 3. Inability to raise flex knee 4. Inability to do dorsiflexion of foot but can wiggle toes. 5. Inability to move at all.

4 34 Mixed Model Analysis of Ordinal Response Data proportional odds, adjacent category and continuation ratio models differ with respect to how the ordinal categories are dichotomized into a binary response. The context of the research problem and research questions, and the nature of the ordinal measurements usually motivate the choice of the particular model to be employed for the analysis of ordinal responses. The continuation ratio model is appropriate for situations in which the ordered categories represent a progression through stages, so that individuals pass through each lower stage before they go on to higher stages. This seems to be the appropriate modeling approach with regards to the clinical relevance of the motor level responses measured on the ordinal Bromage scale in our example. Moreover, unlike the proportional odds model, the continuation ratio model possesses the property of conditional independence between the binary partitions that makes it easier to generalize to repeated ordinal responses (Dos Santos et al. 2000). We note that the odds ratio in the continuation ratio model change if continuation odds are set up starting from the top of the scale or the bottom. In our modeling of motor level responses using the continuation ratio method we start from the bottom of the scale, that is, the No Paralysis category. This is clinically relevant as all 30 patients had No Paralysis for both legs at baseline and they progressed to higher degrees of paralysis with passage of time. Consider an ordinal response with c categories denoted C1, C2, Cc and coded as 1,2.c. Suppose the multinomial distribution of the ordinal response has probabilities l 1,2,... c 1. The continuation ratio logits are defined as (Agresti 1990):, l L Let ck 1 or lower; that is k l ck 1 log, k 1,2,... c 1 (2.1) 1... ck denote the conditional probability of response c k 1, given c k 1 = ck 1 ck ck 1, k 1,2,... c 1 (2.2) The continuation ratio logits can then be expressed as ordinary logits of these conditional probabilities: L k ck 1 log, k 1,2,... c 1 (2.3) 1 ck 1 It can be shown that the original multinomial distribution can be expressed as a product of ( c 1) binomial distributions in terms of the conditional probabilities defined above (Agreti, 1990; Linsey et al. 1997). Hence one can apply the conventional logistic regression with the conditionally independent binary partitions of the ordinal response as the dependent, and the potential predictor variables and the partition variable as the independent variables. The partition variable ( k 1, 2,... c 1 ) is the identification variable for each conditionally independent binary partition and can be included in the model by creating dummy variables.

5 Sadia Mahmud et al. 35 In order to write a formal mathematical expression for the likelihood function for the ordinal response in the above mentioned trial, modeled through the continuation ratio method, let us develop the following notation. Consider an ordinal response with c categories denoted C1, C2, Cc and coded as 1,2.c. Assume that the indices i, j, t correspond to the subject, leg and the time point respectively ( i 1,.., 30, j 1, 2, t 0, 5,..30). Denote the ordinal response for the ith subject s jth leg at tth time point as y ijt. Each ordinal response is considered as a series of conditionally independent binary continuation ratio partitions of the ordinal scale. Denote the binary response corresponding to the kth partition Cck 1 : Cck, Cck 1,, C1, k 1,2, c 1, by Y ijt, k defined as and ijt, k Y if y c k 1 ijt, k 1 ijt = 0 if y c k 1 ijt Y is undefined for y c k 1. ijt The contribution to the likelihood by the ordinal measurement yijt is given by: L ijt min( c yijt1, c1) k ijt k 1 Yijt, k exp ' x 1 exp ' x (2.4) k ijt where k is the intercept associated with kth binary partition of the ordinal response; xijt is a vector of regression coefficients. is a vector of explanatory variables and Assuming the measurements on the same subject are independent (we address the nested clustering structure of the data in next section) the full likelihood is L L. The regression coefficients are estimated by maximizing the likelihood. ijt ijt The continuation ratio model can be fitted using any standard statistical software that fits conventional logistic regression model, by creating the extended vectors reducing the ordinal response to binary (Lindsey et al. 1997). Specialized software such as Sabre (Stott et al. 1999) and SAS (Release 9.1, SAS Institute Inc., Cary NC USA) can also be used through straightforward programming. The continuation ratio model for the unilateral spinal anesthesia trial data was fitted on SAS. We had data for 30 patients, with 14 measurements on each patient, thus giving us a total of 420 measurements on the ordinal scale. There were 950 observations in the extended dataset with the ordinal response reduced to binary. Table 2 (Column 1) reports the fit of the Continuation Ratio Logit (CRL) model. Examination of the scale of time through quartile analysis (Hosmer, D.W. and Lemeshow, S. 1989) indicated a logarithmic scale. There was a highly significant interaction between leg and time variable (Wald test Z = 1.94/0.34 = 5.71, p-value < 0.01, likelihood ratio test, , p-value < 0.005). The positive estimated regression

6 36 Mixed Model Analysis of Ordinal Response Data coefficient of the interaction term indicates that the effect of time on progression to higher degrees of paralysis is more rapid for the operated as compared to the nonoperated leg. 2.2 Generalized Linear Mixed Model The CRL analysis discussed in section 2.1 takes no account of the clustering structure of the data, and hence is inadequate. The purpose of reporting these results is to provide a basis for comparison with the model that adjusts for the two nested levels of clustering. As discussed in the introduction section Ten Have et al. (1999) and Gibbons and Hedeker (1997) proposed random effects (or mixed effects) logistic regression models for binary response data with two nested levels of clustering. Raman and Hedeker (2001) and Liu and Hedeker (2006) extended Gibbons and Hedeker s (1997) model for a binary response to an ordinal response, however they used the cumulative logit link. These authors modeled the nested clustering structure by addition of nested random effects in the linear predictor of the conventional logistic function (that is the logit) and implemented maximum marginal likelihood estimation (MMLE) using numerical integration to approximate the intractable integrals over the random effects. For an adequate analysis of data from the unilateral spinal anesthesia trial we propose a random effects (RE) model with two random effects i and j(i), to account for the nested levels of clustering due to the ith subject and the jth leg nested in subject i respectively. We refer to the notation introduced in section 2.1. For the sequence of ordinal outcomes for the ith subject, the marginal (integrated) likelihood is where *; ; ; L g y x f f d d (2.5) i ijt ijt i j( i) i j( i) i j( i) jt g y ijt. min( c yijt 1, c1) k 1 exp Yijt, k k *' xijt subject i leg( subject) j( i) 1 exp *' x k ijt subject i leg( subject) j( i) k is the intercept associated with the kth binary partition of the ordinal response. i f and f ji effects are the probability distribution functions of the two nested random i and j(i) respectively. We assume that i normal variables, with independence also assumed between leg( subject) leg level respectively. and j(i) are independent standard i and j(i). subject denote the standard deviation of the random effects at the subject and at the xijt is a vector of explanatory variables and * cluster-specific log-odds ratio parameters. The full likelihood is L L. i i and is a vector of

7 Sadia Mahmud et al. 37 Ten Have et al. (1999) implemented the estimation procedure for a logit model with a binary response with two nested levels of clustering, using the maximum marginal likelihood approach in a FORTRAN program. They accounted for the nested clustering by two independent standard normal variables and approximated the intractable integrals over the random effects using numerical integration. They approximated the integration of the likelihood over random effects with weighted sums where weights are binomial mass functions. We used the continuation ratio model to reduce the ordinal response in our example to binary on SAS (as discussed in section 2.1) and implemented maximum marginal likelihood estimation through the FORTRAN program developed by Ten Have et al. (1999). For this implementation we specified the initial values for the fixed effects regression coefficients obtained from the CRL fit, and set the initial values for the standard deviation of the random effects at subject subject and at leg level leg( subject) respectively equal to 0.5. Table 2 (2 nd column) reports the results of the analysis when the two nested level of clustering in the data were taken into account through the random effects (RE) model 2 stated above. The likelihood ratio test, 2 [log likelihood of CRL - log likelihood of CRL + RE ] = , gives an assessment of the relative fit of the data provided by the two models (Hedeker and Gibbons 1994) and clearly indicates the significance of including the random effects in the model df Results from the ordinal CRL+ RE analysis give the same inference as the ordinal CRL model, with a highly significant interaction between the leg and time variables (Wald test statistic Z = 3.92/0.64= 6.125, p-value < 0.01). However, the parameter estimates of fixed effects (that are significantly different from zero) are much larger for the ordinal CRL + RE model. Moreover, the standard errors of the parameter estimates for the CRL + RE model are greater than the corresponding standard errors from the CRL fit. The variability attributable to subjects ( ˆ subject = 2.01) is significant (Wald test statistic Z = 2.01/0.80 = 2.51, p-value= 0.012) and that attributable to legs nested within subjects ( ˆ leg ( subject ) = 2.83) is highly significant (Wald test statistic Z = 2.83/0.53 = 5.34, p-value < 0.01). The continuation ratio model for an ordinal response assumes that the effect of an explanatory variable (on the response) does not depend on the level of the partition variable. This implies absence of interaction between the partition and the explanatory variable. There was a significant interaction between the time and the partition variable at 2 the level of CRL + RE model, the likelihood ratio test, (3df) = 2( ) = [ (3df) = 7.815]. The CRL + RE model including the time and partition variable interaction is reported in Table 2 (3 rd column). This is a departure from the usual assumption that the regression coefficient corresponding to an explanatory variable does not depend on the level of the partition variable. However, Agresti (1990) and other authors (Dos Santos and Berridge 2000; Bender and Bender 2000) do allow for different

8 38 Mixed Model Analysis of Ordinal Response Data regression parameters for the effect of a covariate for different binary continuation ratio partitions of the ordinal response. Fitting the model including the interaction between the partition variable and leg (operated/non-operated) at CRL level leads to an unbelievable large estimated standard error for an estimated regression coefficient involving the interaction term. This is an indication of the presence of a zero cell count due to spreading the data over too many cells. The unstable result prevents us from ascertaining if in fact this interaction is significant (Hosmer and Lemeshow 1989). Due to the unstable result at the CRL level this interaction was not further evaluated at CRL + RE level. The estimated effects of the explanatory variables, leg and time (for different binary partitions) calculated from the final CRL + RE model (Table 2, 3 rd column) are shown in Fig. 1. Fig. 1 demonstrates the interaction between time and leg; at baseline, when the spinal anesthesia was initiated, progression rate to higher degrees of motor-level paralysis was more rapid for the operated as compared to the non-operated leg (for all binary partitions). The progression rate seems to level off by the end of the study, at 30 minutes, more so for the non-operated leg. The significant interaction between time and partition variable indicates that the effect of time is different for different binary partitions of the ordinal response. The effect of time on the ordinal response for the four binary partitions is shown in Fig. 2; for each leg the four intersecting curves corresponding to each binary partition illustrate this interaction. We note that for each leg at t 0 (baseline) the progression rate to the higher degree of paralysis was most rapid for partition 4 (category 2 versus 1) as compared to partition 3 (category 3 versus 2 &1), 2 (category 4 versus 3, 2 & 1) and 1 (category 5 versus 4,3,2 & 1), with the progression rate leveling off at later times for all four partitions. As all the patients started out with no paralysis (category 1) on both legs at baseline and progressed to higher degrees of motor level paralysis with passage of time, the most rapid progression rate for partition 4 (category 2 versus 1) at baseline seems plausible. 3. DISCUSSION In this paper we have presented an analysis of repeated ordinal response data with two nested levels of clustering, from a clinical trial involving unilateral spinal anaesthesia, using the generalized linear mixed model (GLMM). The ordinal character of the response is modeled using the continuation ratio method that reduces the ordinal response to conditionally independent binary continuation ratio partitions. The continuation ratio method was used to model the ordinal responses as it seemed clinically relevant with regards to the context of the problem, and the nature of the ordinal responses measured on the Bromage scale, assessing the degree of motor level paralysis. All patients initially had no paralysis on both legs and progressed to higher degrees of motor level paralysis with time, the progression rate being more rapid for the operated leg. For both legs the progression rate leveled off by the end of follow up at 30 minutes. After the initiation of spinal anaesthesia the severity of paralysis on the operated leg was greater than that on the non-operative leg, with the difference between the severity of paralysis increasing with time.

9 Sadia Mahmud et al. 39 To account for the clustering structure of the data we use a generalization of the continuation ratio model with two nested random effects. We fitted the model using maximum marginal likelihood estimation (MMLE) as developed by Ten Have et al. (1999) for a binary response with two nested levels of clustering. We note a limitation of the dataset used in this paper to exemplify the GLMM with regards to its size; 30 subjects at 3 rd level, each with 2 legs (2 nd level) and longitudinal measurements at 7 time points on each leg. Validity of statistical inferences constructed from logistic-normal models requires large number of clusters. However, keeping in mind that one cannot always have perfect dataset to exemplify methodologies, noting the limitation of moderate number of 3 rd level clusters in our dataset, we have demonstrated the generalization of the continuation ratio model with two nested random effects, fitting the model using MMLE. A referee noted that limited sample size limits the number of covariates that can be allowed into the model. However, in our example there were only two covariates of interest; operated versus non-operated leg and time. In addition a partition variable had to be introduced in the regression model to classify the four binary continuation ratio partitions. Due to only three covariates in the model, the above mentioned limitation of small sample size was not serious. Our final model including the three covariates as main effects and two interactions, as discussed in the results section, was numerically stable. However, we note that the interaction between the variable leg and the partition variable could not be analyzed due to the zero cell count problem that occurs when the data is spread over too many cells in case of limited sample size. In the GLMM analysis we have presented in this paper we assumed that the two nested random effects, at the subject and at the leg level respectively, are distributed as independent standard normal variables. The distribution of random effects is routinely assumed to be normal mainly for mathematical convenience (Song P, 2007). Recent research has focused on the impact of misspecification of random effects distribution on the maximum likelihood estimates for generalized linear mixed models (GLMM). For the case of linear mixed models (that corresponds to the identity link function for GLMM) the parameter estimates are rather robust with respect to deviation from normality of random effects. However, for random-intercept logistic models the estimates of the mean structure parameters can have substantial bias upon misspecification of random effects distribution in case of large random effects variance (Alonso A. et al. 2008). In our present analysis the maximum likelihood estimates of the variances of the two random intercepts are moderate (of the order of magnitude of 2 at the subject level and 3 at the leg level respectively) thereby suggesting that the potential bias in the estimates of fixed effects in case of misspecification of random effects distribution could be small. However, there can be considerable bias in the estimate of variance components in case of misspecification of the random effects distribution, thereby making it difficult to distinguish between the small or large variance scenarios (Alonso A. et al. 2008). This suggests the need for further research to investigate the impact of misspecification of random effects distribution on the estimates of fixed and random effects in a GLMM with two nested levels of clustering. Investigating the impact of the distribution of random effects on estimation of parameters in a GLMM with two nested levels of clustering will be taken up as future research.

10 40 Mixed Model Analysis of Ordinal Response Data An alternative approach to the solution of generalized linear mixed models (GLMM) is to turn to approximate quasi-likelihood solutions (Rodriguez and Goldman 1995, Breslow and Clayton 1993, Wolfinger and O Connell 1993). A SAS macro called GLIMMIX (Littell et al. 1996) uses a quasi-likelihood approach - restricted pseudolikelihood (REPL) suggested by Wolfinger and O Connell (1993) to find parameter estimates for the generalized linear mixed models and allows for multiple nested random effects. Other commercially available software such as MLwin (Goldstein et al. 1998) also use quasi-likelihood solutions of GLMM allowing for multiple random effects. However, the approximate quasi-likelihood solution can lead to downward bias in the estimates of fixed and random effects (variance of the distribution of random effects) especially in case of small cluster size (Rodriguez and Goldman 1995). Sabre (Stott et al. 1999) fits GLMM for 2- level binary, ordinal and count data using MMLE but allows only for a single case-specific random effect and does not allow for multiple nested random effects. PROC NLMIXED that is a recent update on SAS performs MMLE for random effects or mixed models, however it does not allow for nested random effects. Few authors Ten Have et al. (1999), Raman and Hedeker (2005) and Liu and Hedeker (2006) have presented analysis of binary or ordinal responses with two nested levels of clustering using MMLE, however as mentioned earlier the latter two references used a cumulative logit model for the ordinal responses. In this paper we have presented an analysis of ordinal response data with two nested levels of clustering using the continuation ratio method, and fitted the model using MMLE. Moreover, we have also explored interaction between an explanatory variable of interest and the partition variable in the continuation ratio model. The results of our GLMM analysis of the unilateral spinal anesthesia trial data indicate a significant interaction between factors leg and time. The operated leg progressed to higher degrees of paralysis more rapidly than the non-operated leg immediately after initiation of spinal anesthesia, with both legs leveling off at later times. The GLMM approach for the analysis reported in this paper intricately models the progression with time to higher degrees of motor level paralysis for both legs through the continuation ratio method, starting with no paralysis at baseline, and gives insight into the mechanism of progression through the interaction of time with different binary partitions of the ordinal response. As an alternative to GLMM for accounting the clustering structure of the data, one may fit a population-averaged model, such as Generalized Estimating Equations (GEE) (Mancl and DeRouen 2001, Qaqish and Liang 1992).The focus of a population averaged model is the marginal expectation of the response and the parameters are interpreted regardless of cluster membership. The mixed model is a subject-specific model that accounts for the heterogeneity across subjects (and across legs in our example) by addition of random effects in the logit. The parameter estimates from a subject-specific model are interpreted in terms of effect of covariates on an average (or a typical) subject (Omar and Thompson 2000, Vonesh and Chinchilli 1997, Zeger et al. 1988). A subjectspecific interpretation seems relevant with regards to the objective of our analysis to assess the effect of operated versus non-operated leg and of time on the motor level of an average (or a typical) patient. A subject specific model stratifies subjects according to

11 Sadia Mahmud et al. 41 their individual characteristics and is closer to physiological mechanisms than population averaged models that average over subject characteristics (Lindsey et al. 1998). 4. ACKNOWLEDGEMENT I would like to thank Professor Thomas Ten Have at University of Pennsylvania, USA for assisting me in implementing MMLE for the analysis reported in the paper through the FORTRAN program (reference: Ten Have et al. (1999)). I would also like to thank Mr. Iqbal Azam, Assistant Professor, The Aga Khan University, Pakistan for helpful discussions relevant to the analysis presented in this paper. I also acknowledge the computing facilities at the Aga Khan University, Pakistan that were used for conducting the statistical analysis for this study. 5. REFERENCES 1. Agresti, A. (1990). Categorical Data Analysis. John Wiley & Sons, New York. 2. Allison, P. (1999). Logistic Regression Using the SAS System. SAS Institute Inc.: Cary. 3. Alonso A., Litière S. and Molenberghs G. (2008). A family of tests to detect misspecifications in the random-effects structure of generalized linear mixed models. Computational Statistics and Data Analysis, 52, Bender, R. and Bender, A. (2000). Calculating ordinal regression models in SAS and S-Plus. Biometrical Journal, 42, Breslow, N.E. and Clayton, D.G. (1993). Approximate Inference in Generalized Linear Mixed Models. J. Amer. Statist. Assoc., 88(421), Carpenter, R.L., Caplan, R.A., Brown, D.L., Stephanson, C. and Wu, R. (1992). Incidence and risk factors for side effects of spinal anesthesia. Anesthesiology, 76, Casati, A., Fanelli, G., Beccaria, P., Aldegheri, G. Berti, M. ; Senatore, R.; and Torri, G. (1998). Block distribution and cardiovascular effects of unilateral spinal anesthesia by 0.5% hyperbaric bupivacain. A clinical comparison with bilateral spinal block. Minerva- Anesthesiology 64, Chohan, U., Afshan, G., Hoda, M.Q. and Mahmud, S. (2002). Haemodynamic effects of uni-lateral spinal anesthesia in high risk patients. J. Pak. Med. Assoc., 52, Dos Santos, D.M. and Berridge, D.M. (2000). A continuation ratio random effects model for repeated ordinal responses. Statistics in Medicine, 19, Gibbons, R.D. and Hedeker, D. (1997). Random effects probit and logistic regression models for three-level data. Biometrics, 53, Goldstein, H., Rasbash, J., Plewis, I., Draper, D., Browne, W., Yang, M., Woodhouse, G. and Healy, M. (1998). A User s Guide to MLwiN. Version 1.0. Multilevel Models Project: Institute of Education, London. 12. Hartzel, J., Agresti, A. and Caffo, B. (2001). Multinomial logit random effects models. Statistical Modeling, 1, Hedeker, D. and Gibbons, R.D. (1994). A random effects ordinal regression model for multilevel analysis. Biometrics, 50, Hosmer, D.W. and Lemeshow, S. (1989). Applied Logistic Regression. Wiley, New York.

12 42 Mixed Model Analysis of Ordinal Response Data 15. Lindsey, J.K., Jones, B. and Ebbutt, A.F. (1997). Simple models for repeated ordinal responses with an application to a seasonal rhinitis clinical trial. Statistics in Medicine, 16, Lindsey, J.K. and Lambart, P. (1998). Marginal models for repeated measurements. Statistics in Medicine, 17, Littell, R.C., Milliken, G.A., Stroup, W.W. and Wolfinger, R.D. (1996). SAS System for Mixed Models. SAS Institute Inc.: Cary. 18. Liu, L.C. and Hedeker, D. (2006). A mixed-effects regression model for longitudinal multivariate ordinal data. Biometrics, 62, Mancl, L.A. and DeRouen, T.A. (2001). A Covariance Estimator for GEE with improved small-sample properties. Biometrics, 57, Omar, R.Z. and Thompson, S.G. (2000). Analysis of cluster randomized trial with binary outcome data using a multi-level model. Statistics in Medicine, 19, Qaqish, B.F. and Liang, K.Y.(1992). Marginal models for correlated binary responses with multiple classes and multiple level of nesting. Biometrics, 48, Raman, R. and Hedeker, D. (2005). A mixed-effects regression model for three-level ordinal response data. Statistics in Medicine, 24, Rodriguez, G. and Goldman, N. (1995). An assessment of estimation procedures for multilevel models with binary responses. J.R. Statist. Soc. A., 158, Part I, Song, P. (2007). Correlated Data Analysis. Springer, New York. 25. Stott, D., Francis, B. and Davies, R. (1999). Sabre. Software for the Analysis of Binary Recurrent Events, (version 3.1) Center for Applied Statistics, Lancaster University. 26. Ten Have, T.R., Kunselman, A.R., and Tran, L. (1999). A Comparison of mixed effects logistic regression models for binary response data with two nested levels of clustering. Statistics in Medicine, 18, Vonesh, E.F. and Chinchilli, V.M. (1997). Linear and Nonlinear models for the Analysis of Repeated Measurements. Marcel Dekker, Inc., New York. 28. Wolfinger, R. and O Connell, M. (1993). Generalized Linear Mixed Models: A Pseudo-Likelihood Approach. J. Statist. Compu. and Simula., 48, Zeger, S.L., Liang, K.Y. and Albert, P.S. (1988). Models for longitudinal data: A generalized estimating equation approach. Biometrics, 44,

13 Sadia Mahmud et al. 43 Table 1: Contingency table of ordinal Bromage scale categories crossed with Leg for each of 7 time points: cell counts (and percentages) TIME (minutes) 0 LEG 5 LEG 10 LEG 15 LEG 20 LEG 25 LEG 30 LEG MOTOR LEVEL non-operated 30 (100%) 0(0%) 0(0%) 0(0%) 0(0%) operated 30(100%) 0(0%) 0(0%) 0(0%) 0(0%) non-operated 7(23.3%) 10(33.3%) 11(36.7%) 2(6.7%) 0(0%) operated 1(3.3%) 0(0%) 2(6.7%) 10(33.3%) 17(56.7%) non-operated 6(20.0%) 6(20.0%) 11(36.7%) 5(16.7%) 2(6.7%) operated 0(0%) 0(0%) 1(3.3%) 5(16.7%) 24(80%) non-operated 5(16.7%) 4(13.3%) 9(30.0%) 6(20.0%) 6(20.0%) operated 0(0%) 0(0%) 0(0%) 1(3.3%) 29(96.7%) non-operated 5(16.7%) 3(10.0%) 9(30.0%) 5(16.7%) 8(26.7%) operated 0(0%) 0(0%) 0(0%) 1(3.3%) 29(96.7%) non-operated 5(16.7%) 3(10.0%) 8(26.7%) 3(10.0%) 11(36.7%) operated 0(0%) 0(0%) 0(0%) 1(3.3%) 29(96.7%) non-operated 5(16.7%) 3(10.0%) 8(26.7%) 3(10.0%) 11(36.7%) operated 0(0%) 0(0%) 0(0%) 1(3.3%) 29(96.7%)

14 44 Mixed Model Analysis of Ordinal Response Data Table 2: Parameter (and standard Error) Estimates for continuation ratio logit (CRL), CRL + random effects (RE) CRL CRL + RE CRL+RE (with partition*log e (time+1)) Intercept (0.43) (1.46) (2.10) opleg a (0.75) (1.35) 0.61 (1.56) log e (time+1) 0.96 (0.14) ** 3.01 (0.33) ** 3.24 (0.58) ** opleg* log e (time+1) 1.94 (0.34) ** 3.92 (0.64) ** 4.02 (0.73) ** partition2 b (0.26) 1.78 (0.41) ** 5.09 (1.78) ** partition3 b 1.35 (0.26) ** 5.15 (0.60) ** 5.62 (1.86) ** partition4 b 1.26 (0.31) ** 7.15 (0.86) ** 6.17 (1.84) ** partition2 * log e (time+1) (0.62) * partition3 *log e (time+1) (0.65) partition4 *log e (time+1) 0.98 (0.71) subject 2.01 (0.80) * 2.21 (0.69) ** legsubject 2.83 (0.53) ** 3.42 (0.60) ** Log Likelihood *p-value < 0.05 **p-value < 0.01 a. opleg = 1 for operated leg; 0 otherwise, b. dummy variables for the partition variable, partition level 1 (k=1) is taken as reference.

15 Sadia Mahmud et al. 45 Fig. 1: The interaction between time and leg; continuation ratio logit versus time 1 Continuation ratio logit versus time partition leg non-operated operated operated non-operated logit time

16 46 Mixed Model Analysis of Ordinal Response Data Fig. 2: The interaction between time and partition variable; continuation ratio logit versus time. Continuation ratio logit versus time non-operated leg operated partition logit time

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

Appropriate Statistical Methods to Account for Similarities in Binary Outcomes Between Fellow Eyes

Appropriate Statistical Methods to Account for Similarities in Binary Outcomes Between Fellow Eyes Appropriate Statistical Methods to Account for Similarities in Binary Outcomes Between Fellow Eyes Joanne Katz,* Scott Zeger,-\ and Kung-Yee Liangf Purpose. Many ocular measurements are more alike between

More information

Generalized Estimating Equations for Depression Dose Regimes

Generalized Estimating Equations for Depression Dose Regimes Generalized Estimating Equations for Depression Dose Regimes Karen Walker, Walker Consulting LLC, Menifee CA Generalized Estimating Equations on the average produce consistent estimates of the regression

More information

CONDITIONAL REGRESSION MODELS TRANSIENT STATE SURVIVAL ANALYSIS

CONDITIONAL REGRESSION MODELS TRANSIENT STATE SURVIVAL ANALYSIS CONDITIONAL REGRESSION MODELS FOR TRANSIENT STATE SURVIVAL ANALYSIS Robert D. Abbott Field Studies Branch National Heart, Lung and Blood Institute National Institutes of Health Raymond J. Carroll Department

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA BIOSTATISTICAL METHODS AND RESEARCH DESIGNS Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA Keywords: Case-control study, Cohort study, Cross-Sectional Study, Generalized

More information

ACHIEVING GOOD POWER WITH CLUSTERED AND MULTILEVEL DATA Keith E. Muller Department of Health Outcomes and Policy University of Florida,

ACHIEVING GOOD POWER WITH CLUSTERED AND MULTILEVEL DATA Keith E. Muller Department of Health Outcomes and Policy University of Florida, ACHIEVING GOOD POWER WITH CLUSTERED AND MULTILEVEL DATA Keith E. Muller Department of Health Outcomes and Policy University of Florida, KMuller@ufl.edu www.health-outcomes-policy.ufl.edu/muller Supported

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

Use of GEEs in STATA

Use of GEEs in STATA Use of GEEs in STATA 1. When generalised estimating equations are used and example 2. Stata commands and options for GEEs 3. Results from Stata (and SAS!) 4. Another use of GEEs Use of GEEs GEEs are one

More information

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Data Analysis in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Multilevel Data Statistical analyses that fail to recognize

More information

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION Timothy Olsen HLM II Dr. Gagne ABSTRACT Recent advances

More information

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences p. 1/3 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences Lecture 14: Adjusting

More information

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015 Introduction to diagnostic accuracy meta-analysis Yemisi Takwoingi October 2015 Learning objectives To appreciate the concept underlying DTA meta-analytic approaches To know the Moses-Littenberg SROC method

More information

112 Statistics I OR I Econometrics A SAS macro to test the significance of differences between parameter estimates In PROC CATMOD

112 Statistics I OR I Econometrics A SAS macro to test the significance of differences between parameter estimates In PROC CATMOD 112 Statistics I OR I Econometrics A SAS macro to test the significance of differences between parameter estimates In PROC CATMOD Unda R. Ferguson, Office of Academic Computing Mel Widawski, Office of

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Quasicomplete Separation in Logistic Regression: A Medical Example

Quasicomplete Separation in Logistic Regression: A Medical Example Quasicomplete Separation in Logistic Regression: A Medical Example Madeline J Boyle, Carolinas Medical Center, Charlotte, NC ABSTRACT Logistic regression can be used to model the relationship between a

More information

OLANIRAN, Y.O.A Department of Marketing Federal Polytechnic, Ilaro Nigeria

OLANIRAN, Y.O.A Department of Marketing Federal Polytechnic, Ilaro Nigeria The Application of Logistic Regression Analysis to the Cummulative Grade Point Average of Graduating Students: A Case Study of Students of Applied Science, Federal Polytechnic, Ilaro FAGOYINBO, I.S. BSc

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX

Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX Paper 1766-2014 Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX ABSTRACT Chunhua Cao, Yan Wang, Yi-Hsin Chen, Isaac Y. Li University

More information

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION Fong Wang, Genentech Inc. Mary Lange, Immunex Corp. Abstract Many longitudinal studies and clinical trials are

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Many studies conducted in practice settings collect patient-level. Multilevel Modeling and Practice-Based Research

Many studies conducted in practice settings collect patient-level. Multilevel Modeling and Practice-Based Research Multilevel Modeling and Practice-Based Research L. Miriam Dickinson, PhD 1 Anirban Basu, PhD 2 1 Department of Family Medicine, University of Colorado Health Sciences Center, Aurora, Colo 2 Section of

More information

OLS Regression with Clustered Data

OLS Regression with Clustered Data OLS Regression with Clustered Data Analyzing Clustered Data with OLS Regression: The Effect of a Hierarchical Data Structure Daniel M. McNeish University of Maryland, College Park A previous study by Mundfrom

More information

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics What is Multilevel Modelling Vs Fixed Effects Will Cook Social Statistics Intro Multilevel models are commonly employed in the social sciences with data that is hierarchically structured Estimated effects

More information

Statistical Primer for Cardiovascular Research

Statistical Primer for Cardiovascular Research Statistical Primer for Cardiovascular Research Repeated Measures Lisa M. Sullivan, PhD A repeated-measures design is one in which multiple, or repeated, measurements are made on each experimental unit.

More information

Selection and Combination of Markers for Prediction

Selection and Combination of Markers for Prediction Selection and Combination of Markers for Prediction NACC Data and Methods Meeting September, 2010 Baojiang Chen, PhD Sarah Monsell, MS Xiao-Hua Andrew Zhou, PhD Overview 1. Research motivation 2. Describe

More information

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX The Impact of Relative Standards on the Propensity to Disclose Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX 2 Web Appendix A: Panel data estimation approach As noted in the main

More information

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Ruth Penman and Gillian Heller, Department of Statistics, Macquarie University, Sydney, Australia. Correspondence: Ruth Penman, Department

More information

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC Selected Topics in Biostatistics Seminar Series Missing Data Sponsored by: Center For Clinical Investigation and Cleveland CTSC Brian Schmotzer, MS Biostatistician, CCI Statistical Sciences Core brian.schmotzer@case.edu

More information

Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model

Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model Delia North Temesgen Zewotir Michael Murray Abstract In South Africa, the Department of Education allocates

More information

Chapter 13 Estimating the Modified Odds Ratio

Chapter 13 Estimating the Modified Odds Ratio Chapter 13 Estimating the Modified Odds Ratio Modified odds ratio vis-à-vis modified mean difference To a large extent, this chapter replicates the content of Chapter 10 (Estimating the modified mean difference),

More information

Correlation and regression

Correlation and regression PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

In this module I provide a few illustrations of options within lavaan for handling various situations.

In this module I provide a few illustrations of options within lavaan for handling various situations. In this module I provide a few illustrations of options within lavaan for handling various situations. An appropriate citation for this material is Yves Rosseel (2012). lavaan: An R Package for Structural

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision ISPUB.COM The Internet Journal of Epidemiology Volume 7 Number 2 Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision Z Wang Abstract There is an increasing

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

How to describe bivariate data

How to describe bivariate data Statistics Corner How to describe bivariate data Alessandro Bertani 1, Gioacchino Di Paola 2, Emanuele Russo 1, Fabio Tuzzolino 2 1 Department for the Treatment and Study of Cardiothoracic Diseases and

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016 Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP233201500069I 5/2/2016 Overview The goal of the meta-analysis is to assess the effects

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Research and Evaluation Methodology Program, School of Human Development and Organizational Studies in Education, University of Florida

Research and Evaluation Methodology Program, School of Human Development and Organizational Studies in Education, University of Florida Vol. 2 (1), pp. 22-39, Jan, 2015 http://www.ijate.net e-issn: 2148-7456 IJATE A Comparison of Logistic Regression Models for Dif Detection in Polytomous Items: The Effect of Small Sample Sizes and Non-Normality

More information

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Original Article Downloaded from jhs.mazums.ac.ir at 22: on Friday October 5th 2018 [ DOI: /acadpub.jhs ]

Original Article Downloaded from jhs.mazums.ac.ir at 22: on Friday October 5th 2018 [ DOI: /acadpub.jhs ] Iranian journal of health sciences 213;1(3):58-7 http://jhs.mazums.ac.ir Original Article Downloaded from jhs.mazums.ac.ir at 22:2 +33 on Friday October 5th 218 [ DOI: 1.18869/acadpub.jhs.1.3.58 ] A New

More information

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA 1 Example: Clinical Trial of an Anti-Epileptic Drug 59 epileptic patients randomized to progabide or placebo (Leppik et al., 1987) (Described in Fitzmaurice

More information

CHAPTER 3 RESEARCH METHODOLOGY

CHAPTER 3 RESEARCH METHODOLOGY CHAPTER 3 RESEARCH METHODOLOGY 3.1 Introduction 3.1 Methodology 3.1.1 Research Design 3.1. Research Framework Design 3.1.3 Research Instrument 3.1.4 Validity of Questionnaire 3.1.5 Statistical Measurement

More information

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan Media, Discussion and Attitudes Technical Appendix 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan 1 Contents 1 BBC Media Action Programming and Conflict-Related Attitudes (Part 5a: Media and

More information

Modeling Binary outcome

Modeling Binary outcome Statistics April 4, 2013 Debdeep Pati Modeling Binary outcome Test of hypothesis 1. Is the effect observed statistically significant or attributable to chance? 2. Three types of hypothesis: a) tests of

More information

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

jorapain Original Article ABSTRACT INTRODUCTION MATERIALS AND METHODS

jorapain Original Article ABSTRACT INTRODUCTION MATERIALS AND METHODS Comparison between Hemodynamic Changes and Time to First Postoperative 10.5005/jp-journals-10046-0037 Analgesic Requirement in Patients Original Article Comparison between Hemodynamic Changes and Time

More information

An Instrumental Variable Consistent Estimation Procedure to Overcome the Problem of Endogenous Variables in Multilevel Models

An Instrumental Variable Consistent Estimation Procedure to Overcome the Problem of Endogenous Variables in Multilevel Models An Instrumental Variable Consistent Estimation Procedure to Overcome the Problem of Endogenous Variables in Multilevel Models Neil H Spencer University of Hertfordshire Antony Fielding University of Birmingham

More information

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health

Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Understanding and Applying Multilevel Models in Maternal and Child Health Epidemiology and Public Health Adam C. Carle, M.A., Ph.D. adam.carle@cchmc.org Division of Health Policy and Clinical Effectiveness

More information

4. THE CUMULATIVE (PROPORTIONAL) ODDS MODEL FOR ORDINAL OUTCOMES. Overview of the Cumulative Odds Model

4. THE CUMULATIVE (PROPORTIONAL) ODDS MODEL FOR ORDINAL OUTCOMES. Overview of the Cumulative Odds Model 26 27 The results for the second model, shown in Column 3 of Table 3.2, using SAS with the ascending option, simply model the probability that a child has a response in proficiency category 0 or, rather

More information

Centering Predictors

Centering Predictors Centering Predictors Longitudinal Data Analysis Workshop Section 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 3: Centering Covered this Section

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

Jinhui Ma 1,2,3, Parminder Raina 1,2, Joseph Beyene 1 and Lehana Thabane 1,3,4,5*

Jinhui Ma 1,2,3, Parminder Raina 1,2, Joseph Beyene 1 and Lehana Thabane 1,3,4,5* Ma et al. BMC Medical Research Methodology 2013, 13:9 RESEARCH ARTICLE Open Access Comparison of population-averaged and cluster-specific models for the analysis of cluster randomized trials with missing

More information

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Magdalena M.C. Mok, Macquarie University & Teresa W.C. Ling, City Polytechnic of Hong Kong Paper presented at the

More information

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI / Index A Adjusted Heterogeneity without Overdispersion, 63 Agenda-driven bias, 40 Agenda-Driven Meta-Analyses, 306 307 Alternative Methods for diagnostic meta-analyses, 133 Antihypertensive effect of potassium,

More information

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision

More information

What s New in SUDAAN 11

What s New in SUDAAN 11 What s New in SUDAAN 11 Angela Pitts 1, Michael Witt 1, Gayle Bieler 1 1 RTI International, 3040 Cornwallis Rd, RTP, NC 27709 Abstract SUDAAN 11 is due to be released in 2012. SUDAAN is a statistical software

More information

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_ Journal of Evaluation in Clinical Practice ISSN 1356-1294 Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_1361 180..185 Ariel

More information

Today Retrospective analysis of binomial response across two levels of a single factor.

Today Retrospective analysis of binomial response across two levels of a single factor. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.3 Single Factor. Retrospective Analysis ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9,

More information

Statistical analysis of randomized trials in tobacco treatment: longitudinal designs with dichotomous outcome

Statistical analysis of randomized trials in tobacco treatment: longitudinal designs with dichotomous outcome Nicotine & Tobacco Research (2001) 3, 193 202 Statistical analysis of randomized trials in tobacco treatment: longitudinal designs with dichotomous outcome Sharon M. Hall, Kevin L. Delucchi, Wayne F. Velicer,

More information

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Brady T. West Michigan Program in Survey Methodology, Institute for Social Research, 46 Thompson

More information

The Use of Propensity Scores for Nonrandomized Designs With Clustered Data

The Use of Propensity Scores for Nonrandomized Designs With Clustered Data This article was downloaded by: [Cornell University Library] On: 23 February 2015, At: 11:03 Publisher: Routledge Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office:

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Department of Statistics, Biostatistics & Informatics, University of Dhaka, Dhaka-1000, Bangladesh. Abstract

Department of Statistics, Biostatistics & Informatics, University of Dhaka, Dhaka-1000, Bangladesh. Abstract Bangladesh J. Sci. Res. 29(1): 1-9, 2016 (June) Introduction GENERALIZED QUASI-LIKELIHOOD APPROACH FOR ANALYZING LONGITUDINAL COUNT DATA OF NUMBER OF VISITS TO A DIABETES HOSPITAL IN BANGLADESH Kanchan

More information

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers Tutorial in Biostatistics Received 21 November 2012, Accepted 17 July 2013 Published online 23 August 2013 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.5941 Graphical assessment of

More information

The University of North Carolina at Chapel Hill School of Social Work

The University of North Carolina at Chapel Hill School of Social Work The University of North Carolina at Chapel Hill School of Social Work SOWO 918: Applied Regression Analysis and Generalized Linear Models Spring Semester, 2014 Instructor Shenyang Guo, Ph.D., Room 524j,

More information

Logistic regression: Why we often can do what we think we can do 1.

Logistic regression: Why we often can do what we think we can do 1. Logistic regression: Why we often can do what we think we can do 1. Augst 8 th 2015 Maarten L. Buis, University of Konstanz, Department of History and Sociology maarten.buis@uni.konstanz.de All propositions

More information

Exploring the Impact of Missing Data in Multiple Regression

Exploring the Impact of Missing Data in Multiple Regression Exploring the Impact of Missing Data in Multiple Regression Michael G Kenward London School of Hygiene and Tropical Medicine 28th May 2015 1. Introduction In this note we are concerned with the conduct

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors

Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE 0, SEPTEMBER 01 ISSN 81 Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors Nabila Al Balushi

More information

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives DOI 10.1186/s12868-015-0228-5 BMC Neuroscience RESEARCH ARTICLE Open Access Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives Emmeke

More information

Bias reduction with an adjustment for participants intent to dropout of a randomized controlled clinical trial

Bias reduction with an adjustment for participants intent to dropout of a randomized controlled clinical trial ARTICLE Clinical Trials 2007; 4: 540 547 Bias reduction with an adjustment for participants intent to dropout of a randomized controlled clinical trial Andrew C Leon a, Hakan Demirtas b, and Donald Hedeker

More information

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012 STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements

More information

Studying the effect of change on change : a different viewpoint

Studying the effect of change on change : a different viewpoint Studying the effect of change on change : a different viewpoint Eyal Shahar Professor, Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona

More information

Bias in randomised factorial trials

Bias in randomised factorial trials Research Article Received 17 September 2012, Accepted 9 May 2013 Published online 4 June 2013 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.5869 Bias in randomised factorial trials

More information

Application of Cox Regression in Modeling Survival Rate of Drug Abuse

Application of Cox Regression in Modeling Survival Rate of Drug Abuse American Journal of Theoretical and Applied Statistics 2018; 7(1): 1-7 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20180701.11 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

DEPARTMENT OF PUBLIC HEALTH SCIENCES COURSES OFFERED BY QUARTER. Summer 2017

DEPARTMENT OF PUBLIC HEALTH SCIENCES COURSES OFFERED BY QUARTER. Summer 2017 DEPARTMENT OF PUBLIC HEALTH SCIENCES COURSES OFFERED BY QUARTER Summer 2017 Clinical Epidemiology PBHS 30700 Course Instructors: Brian Chiu & Diane Lauderdale T/Th 9:00-11:00am 7/11-8/24 PQ: Introductory

More information

Analytic Strategies for the OAI Data

Analytic Strategies for the OAI Data Analytic Strategies for the OAI Data Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF ACR October 2008 Outline 1. Introduction and examples. 2. General analysis

More information

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan Sequential nonparametric regression multiple imputations Irina Bondarenko and Trivellore Raghunathan Department of Biostatistics, University of Michigan Ann Arbor, MI 48105 Abstract Multiple imputation,

More information

Original Article Unilateral Spinal Anesthesia For Lower Limb Surgery Pak Armed Forces Med J 2016; 66(1):62-67

Original Article Unilateral Spinal Anesthesia For Lower Limb Surgery Pak Armed Forces Med J 2016; 66(1):62-67 Original Article Unilateral Spinal Anesthesia For Lower Limb Surgery Pak Armed Forces Med J 2016; 66(1):62-67 UNILATERAL SPINAL ANESTHESIA FOR MAJOR LOWER LIMB SURGERY IN PATIENTS WITH CARDIAC FAILURE

More information

Clincial Biostatistics. Regression

Clincial Biostatistics. Regression Regression analyses Clincial Biostatistics Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a

More information

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Jee-Seon Kim and Peter M. Steiner Abstract Despite their appeal, randomized experiments cannot

More information

AP Statistics. Semester One Review Part 1 Chapters 1-5

AP Statistics. Semester One Review Part 1 Chapters 1-5 AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically

More information