A MODEL FOR MIXED CONTINUOUS AND DISCRETE RESPONSES WITH POSSIBILITY OF MISSING RESPONSES

Size: px
Start display at page:

Download "A MODEL FOR MIXED CONTINUOUS AND DISCRETE RESPONSES WITH POSSIBILITY OF MISSING RESPONSES"

Transcription

1 Journal o Sciences Islamic epublic o Iran : 5-6 National Center For Scientiic esearch ISSN 6- A MODE FO MIED CONTINUOUS AND DISCETE ESPONSES WITH POSSIBIIT OF MISSING ESPONSES M. Ganjali Department o Statistics Facult o Mathematical Sciences Shahid Beheshti Universit Evin Tehran 988 Islamic epublic o Iran Abstract A model or missing data in mixed binar and continuous responses which can be used on cross-sectional data is presented. In this model response indicator or the binar response can be dependent on the continuous response. A closed orm or the likelihood is ound. For data with a complicated pattern o missing responses some new residuals are also proposed. The model o multiplicative heteroscedasticit is used to consider the problem o heteroscedasticit o the continuous response. The model is illustrated on the data o an observational stud where the eect o pschological disorder o parents on both the verbal comprehension score and the presence o adverse smptoms in their children are modeled in the presence o missing responses. Kewords: Mixed data Missing responses Multiplicative heteroscedasticit Pearson residuals Pschological disorder. Introduction Some biomedical pschological and health sciences data include both discrete and continuous outcomes. One example is the analsis o development toxicit endpoints when the relationship between etal weight and malormation in live etus is an important statistical issue []. Another example is in the stud o the maternal smoking eect on respirator illness in children where we have a continuous measure o pulmonar unction and a binar measure o chronic smptoms in children. For the irst example separate analses o the categorical or the continuous responses cannot properl assess the eect o dose on etal weight and malormations simultaneousl. For the second example separate analses cannot assess the eect o maternal smoking on all the responses simultaneousl. Furthermore separate analses give biased estimates or the parameters and misleading inerence [9]. Consequentl we need to consider a method in which these variables can be modeled jointl. For joint modeling o responses one method is to use the general location model o Olkin and Tate [] where the joint distribution o the continuous and categorical variables is decomposed into a marginal multinomial distribution or the categorical variables and a conditional Multivariate normal distribution or the m-ganjali@cc.sbu.ac.ir 5

2 Vol. No. Winter Ganjali J. Sci. I.. Iran continuous variables given the categorical variables. Another method is to consider the conditional distribution o discrete variables given the continuous variables and a marginal distribution or continuous variables. Cox and Wermuth [] empiricall examined the choice between these two methods. The third method which is developed here is to use simultaneous modeling o categorical and continuous variables to take into account the association and dependence between the responses b the correlation between the errors in the models or responses. For more details o this approach see or example Heckman [6] in which a general model or simultaneousl analing two mixed correlated responses is introduced and Catalano and an [] who extend and used the model or a cluster o discrete and continuous outcomes. ittle and ubin [] made an important distinction between the various tpes o missing mechanism. The deined the missing mechanism as missing completel at random MCA i missingness is dependent neither on the observed responses nor on the missing responses and as random MA i it is dependent on the observed responses but not on the missing responses. Missingness is deined as non-random i it depends on the unobserved responses. From a likelihood point o view MCA and MA are ignorable but missing not at random MNA is non-ignorable. For mixed data with missing outcomes ittle and Schluchter [] and Fitmaurice and aird [] used the general location model o Olkin and Tate [] with the assumption o missingness at random MA to justi ignoring the missing data mechanism []. This means that the used all available responses without a model or missing mechanism to obtain parameter estimates using the EM Expectation Maximiation algorithm. For a good discussion o mixed normal and non-normal data with missing responses under the assumption o MA see [] and [6]. In this paper a general latent variable model or simultaneousl handling response and non-response in mixed discrete and continuous data with potentiall nonrandom missing values in both tpes o responses is presented. With this model the dependence between responses can be taken into account b the correlation between errors o the response models. The aim is to use the general model o Heckman [6] or the joint modeling o the discrete and continuous responses. It is shown how we can incorporate a model or a complicated pattern o missing responses. In Section the model is described and some new residuals are introduced. In Section this model is used on a subset o data available in ittle and Schluchter [] about the eects o parental pschological disorders on various aspects o the development o their children. The results o itting this model are also presented. In Section the paper concludes with some remarks.. Model and esiduals or Mixed Data with Missing esponses.. The Model Suppose is a binar discrete variable and is a continuous variable. Variables and are correlated and must be modeled simultaneousl. et and denote the underling latent variables o the binar response the non-response mechanism or the binar variable and the non-response or the continuous variable respectivel and deine i as the response indicator or i as the response indicator or and i as the binar response. The model takes the orm: ε a ε b ε c ε d where j or j are vectors o explanator variables. It is assumed that E ε j or j... The unstructured us variance covariance matrix o the vector o errors us ε ε ε ε The vector parameters and the correlation parameters or j < j j and j j is 5

3 J. Sci. I.. Iran Ganjali Vol. No. Winter j... should be estimated. In this model an multivariate distribution can be assumed or the errors in the model. Here a multivariate normal distribution is assumed. To consider the problem o heteroscedasticit o the continuous response we can use multiplicative model o heteroscedasticit [5] or which is exp γ γ e where γ and the vector o parameter γ should be estimated and is the vector o explanator variables. The non-response o in this model depends on both responses the non-response or also depends on both responses and depends on. I one o the correlation parameters j j or j and j is ound to be signiicant then we have a nonrandom missing process. Parameter can tell us whether or not non-response in the continuous variable binar variable is dependent on response in the discrete variable continuous variable. Parameter can tell us whether or not non-response in the continuous variable is dependent on the non-response in the discrete variable. The likelihood or this model which has been given in appendix shows the simpliication obtained b using the assumption o normalit in the models or nonresponse. The simpliication arises rom the act that instead o resorting to a numerical approximation o the integral needed to integrate out the missing values o the continuous response we just need to invoke a point on the cumulative three-variate normal distribution which although approximate can be computed in man standard statistical packages with suicientl high precision. The NAG [] routine EOUCF is used in this paper to minimie the mines logarithm o the likelihood given in the appendix. EOUCF is a Fortran routine to minimie a smooth unction subject to constraints or example simple bounds on the variables using a sequential quadratic programming SQP [] method. In this routine all unspeciied derivatives are approximated b inite dierences... esiduals The missing values o responses create problems or the usual residual diagnostics see e.g. Hirano et al. [8]. For example usual Pearson residuals o the variable are o the orm r E var Z i.e. the assume MCA. These unconditional residuals will be misleading i missingness is MA or MNA as in these cases the 's are rom a conditional or truncated conditional distribution. We can examine the residuals o the responses conditional on being observed. et start with using theoretical orm involving E Z E and var Z rather than their predicted values. The Pearson residuals or continuous response can take the orm E Z r a var Z and or the discrete response can take the orm E E r b E where EZ Z M [ M M ] varz Z where M is the Mill's ratio [7] and E P r where and are respectivel the cumulative univariate and bivariate normal distributions. The estimated Pearson residuals r ˆ r ˆ can be ound b using the maximum likelihood estimates o the parameters in Sstem. As residuals in a and b are deined b conditioning on observing the responses the dier rom those o Ten Have et al. [5]. Ten Have et al. [5] ound the expectation and variances o responses and consequentl the residuals unconditional on the act that responses should be observed. Then to calculate the residuals the assumed no link between 55

4 Vol. No. Winter Ganjali J. Sci. I.. Iran response and nonresponse. This gives biased estimates o the means and variances o responses i missingness is not at random [].. Application: The Eect o Pschological Disorders on the Verbal Comprehension Score.. Pschological Disorders Data The data set extracted rom ittle and Schluchter [] has been used also b Fitmaurice and aird []. These authors assumed a mechanism b which observations were MA and then used the general model o Olkin and Tate [] to decompose the joint distribution o the continuous and discrete variables. In these data schoolage children are classiied according to the risk o their parents to pschological disorders normal moderate risk and high risk. ittle and Schluchter [] and Fitmaurice and aird [] used two continuous responses a standardied reading score and a standardied verbal comprehension score and a single binar response indicating the presence o adverse smptoms which was obtained or each child. Here onl the standardied verbal comprehension score and the binar response or the irst child in amil are used. The data include 69 amilies with a general pattern o missing responses. Table gives the dierent patterns o missing response or each child. For example 5 irst-born children responded or both variables and the observed value o their binar response is. Since a preliminar analsis indicated that there was no signiicant dierence between moderate and high risk groups o pschological disorders [] in this paper the moderate and high risk groups are combined into a single group. Table. Dierent patterns o missing responses or pschological disorders data No Table. Descriptive statistics or pschological disorders observed data ow isk Score Ad. Sm No mean S.D. Total Total Calculated ignoring low risk Count% Table presents summar statistics or presence o the adverse smptoms Ad. Sm. in Table and the standardied verbal comprehension score using onl observed data. In this table the value o or ow isk means that we have an individual rom moderate or high risk group and the value o means that individual is rom low risk group. Table shows that the mean standard deviation score o children rom a amil with a low risk o pschological disorders ow isk in Table is more lower than amilies with moderate or high risk o pschological disorders. It also shows that the occurrence o adverse smptoms is less in amil with a low risk o pschological disorders. As the variance o scores is less or a amil with a low risk o pschological disorders the problem o heteroscedasticit should be taken into account. A Kolmogorov Smirnov test o the assumption o normalit or scores o the children o a low amil o pschological disorders is not rejected p-value.. The same test does not reject the assumption o the normalit or scores o the children rom a moderate or high risk o pschological disorders p-value.. The Pearson correlation between two responses is r.9. This emphasies that two responses should be modeled simultaneousl... Models or Pschological Disorders Data For comparative purposes three models are considered. The irst model model I uses onl complete cases and does not consider the correlation between two responses. This model is ε 5a 5b ε exp γ γ 5c where there is no correlation between ε and ε. The second model model II uses model I and takes into account the correlation between two responses. The third model model III considers the ull model model with missing mechanism with the ollowing orm ε 6a 6b ε 56

5 J. Sci. I.. Iran Ganjali Vol. No. Winter and M ε 6c M ε 6d exp γ γ. 6e In these models M i child is rom a low risk group i child is rom a Moderate risk group In model III non-response models and let have one more explanator variable [6]... esults or Pschological Disorders Data For the standardied verbal comprehension score and the binar response or the irst born child in amil model III inds no connection between missing mechanism and responses with a change o deviance o.6 on d.. p-value.66. Consequentl using onl the complete data without considering the missing mechanism can give unbiased estimates o the parameters. esults o using three models are given in table. For model III as the correlation parameters or j and j are not signiicant as j j mentioned above results with removing these parameters are given. These results show that there is a negative correlation a change in deviance o.7 with d between two responses the more the score the less likel the presence o adverse smptoms and separate analsis o the responses gives biased estimates o the standard errors o the parameters. For example under model I the standard errors o the estimates o and in comparison with the results o the model II are overestimated and the standard errors o the estimates o and are underestimated. esults also show that children rom a low risk amil can obtain a better standardied verbal comprehension score and the variance o their scores is less than that o the amil with a high or moderate risk γ.786 or model II. The estimate o the parameter b model III shows a positive correlation between two underling variables or missing mechanisms. This means that children who do not respond their score are more likel to not to respond their discrete response. Table shows values o residuals calculated using b and model II or the discrete response. esiduals or continuous response calculated using a do not include an value larger than in absolute value and with the residual values in Table model II can be considered as a good it or the pschological disorders data. Table. esults using three models or the pschological disorders data Parameter ModelIII ModelII ModelI Estimate S.E. Estimate S.E. Estimate S.E γ γ loglik Table. esiduals or discrete response using model II or pschological disorders data Discussion In this paper a latent variable model is presented or simultaneousl modelling o binar and continuous correlated responses. The model is ormulated in such a wa that can handle the possibilit o missing responses. The latent variable model in sstem can also be used or a mixture o ordinal and continuous data with usual changes or ordinalit o the discrete response. It can also be used or more than two variables with missing responses. In this case the more variables with missing values we have the more 57

6 Vol. No. Winter Ganjali J. Sci. I.. Iran response indicators should be deined and the more parameters coming to the estimation procedure. The selection models in sstems can be sensitive to the assumption o normalit or error distributions []. However the residuals introduced in this paper can be used to practicall examine this assumption. Acknowledgments The author is grateul to Shahid Beheshti Universit or the awarded research grant. eerences. Catalano P. and an.m. Bivariate latent variable models or clustered discrete and continuous outcomes. Journal o the American Statistical Association 879: Cox D.. and Wermuth N. esponse models or mixed binar and quantitative variables. Biometrika 79: Fitmaurice G.M. and aird N.M. egression models or mixed discrete and continuous responses with potentiall missing values. Biometrics 5: Fletcher. Practical Methods o Optimiation. John Wile. 5. Harve A. Estimating regression models with multiplicative heteroscedasticit. Econometrica : Heckman J.J. Dumm endogenous variables in a simultaneous equation sstem. Econometrica 66: Heckman J.J. Sample selection bias as a speciication error. Ibid. 7: Hirano K. Imbens G. idder G. and ubin D. Combining panel data sets with attrition and rereshment samples. Technical working paper National Bureau o Economic esearch. Cambridge Massachusetts eon A.. and Carriere K.C. On the one sample location hpothesis or mixed bivariate data. Commun. statist. Theor and Meth. 9: ittle.j and ubin D. Statistical Analsis with Missing Data. New ork Wile ittle.j. and Schluchter M. Maximum likelihood estimation or mixed continuous and categorical data with missing values. Biometrika 7: Maddala G.S. imited dependent and qualitative variables in econometrics. Cambridge 98.. NAG. Numerical Algorithms Group Manual. Mark 6. Oxord U.K Olkin I. and Tate.F. Multivariate correlation models with mixed discrete and continuous variables. Annals o Mathematical Statistics : Ten Have T..T. Kunselman A.. Pulkstenis E.P. and andis J.. Mixed eects logistic regression models or longitudinal binar response data with inormative dropout. Biometrics 5: Schaer J.. Analsis o Incomplete Multivariate Data Chapman & Hall 997. Appendix: ikelihood or Mixed Data with Missing Mechanism For individuals who observe neither nor the likelihood is Pr 7 where is the cumulative bivariate normal distribution. For individuals who observe onl and the value o is the likelihood is where Pr Pr Pr 8 and is cumulative three-variate normal distribution. For individuals who observe onl and the value o is the likelihood is Pr Pr Pr Pr Pr. 9 Σ For individuals who observe onl the likelihood is Pr [Pr Pr ] 58

7 J. Sci. I.. Iran Ganjali Vol. No. Winter [ ] For individuals with both and observed and the value o equal to the likelihood is Pr Pr [Pr Pr ] Pr [ ] where For individuals with both and observed and the value o equal to the likelihood is Pr Pr Pr [ Pr Pr Pr Pr ] Pr [ 59

8 Vol. No. Winter Ganjali J. Sci. I.. Iran ]. 6

Complier Average Causal Effect (CACE)

Complier Average Causal Effect (CACE) Complier Average Causal Effect (CACE) Booil Jo Stanford University Methodological Advancement Meeting Innovative Directions in Estimating Impact Office of Planning, Research & Evaluation Administration

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

Image Analysis. Edge Detection

Image Analysis. Edge Detection Image Analsis Image Analsis Edge Detection K. Buza Lars Schmidt-Thieme Inormation Sstems and Machine Learning Lab ISMLL Institute o Economics and Inormation Sstems & Institute o Computer Science Universit

More information

EC352 Econometric Methods: Week 07

EC352 Econometric Methods: Week 07 EC352 Econometric Methods: Week 07 Gordon Kemp Department of Economics, University of Essex 1 / 25 Outline Panel Data (continued) Random Eects Estimation and Clustering Dynamic Models Validity & Threats

More information

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints p.1/40 Data Collected in Phase IIb/III Vaccine Trial Longitudinal

More information

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve

More information

Multiple Trait Random Regression Test Day Model for Production Traits

Multiple Trait Random Regression Test Day Model for Production Traits Multiple Trait Random Regression Test Da Model for Production Traits J. Jamrozik 1,2, L.R. Schaeffer 1, Z. Liu 2 and G. Jansen 2 1 Centre for Genetic Improvement of Livestock, Department of Animal and

More information

Machine Learning Statistical Learning. Prof. Matteo Matteucci

Machine Learning Statistical Learning. Prof. Matteo Matteucci Machine Learning Statistical Learning Pro. Matteo Matteucci Statistical Learning Outline o What Is Statistical Learning? Why estimate? How do we estimate? The trade-o between prediction accuracy & model

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data

Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data Karl Bang Christensen National Institute of Occupational Health, Denmark Helene Feveille National

More information

Practical propensity score matching: a reply to Smith and Todd

Practical propensity score matching: a reply to Smith and Todd Journal of Econometrics 125 (2005) 355 364 www.elsevier.com/locate/econbase Practical propensity score matching: a reply to Smith and Todd Rajeev Dehejia a,b, * a Department of Economics and SIPA, Columbia

More information

Department of Statistics, Biostatistics & Informatics, University of Dhaka, Dhaka-1000, Bangladesh. Abstract

Department of Statistics, Biostatistics & Informatics, University of Dhaka, Dhaka-1000, Bangladesh. Abstract Bangladesh J. Sci. Res. 29(1): 1-9, 2016 (June) Introduction GENERALIZED QUASI-LIKELIHOOD APPROACH FOR ANALYZING LONGITUDINAL COUNT DATA OF NUMBER OF VISITS TO A DIABETES HOSPITAL IN BANGLADESH Kanchan

More information

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values Sutthipong Meeyai School of Transportation Engineering, Suranaree University of Technology,

More information

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS EMPIRICAL STRATEGIES IN LABOUR ECONOMICS University of Minho J. Angrist NIPE Summer School June 2009 This course covers core econometric ideas and widely used empirical modeling strategies. The main theoretical

More information

LINEAR REGRESSION FOR BIVARIATE CENSORED DATA VIA MULTIPLE IMPUTATION

LINEAR REGRESSION FOR BIVARIATE CENSORED DATA VIA MULTIPLE IMPUTATION STATISTICS IN MEDICINE Statist. Med. 18, 3111} 3121 (1999) LINEAR REGRESSION FOR BIVARIATE CENSORED DATA VIA MULTIPLE IMPUTATION WEI PAN * AND CHARLES KOOPERBERG Division of Biostatistics, School of Public

More information

Study of measurement error due to interviewer. A Dissertation Prospectus Presented. August, Ruitao Zhang

Study of measurement error due to interviewer. A Dissertation Prospectus Presented. August, Ruitao Zhang Stud of measurement error due to interviewer A Dissertation Prospectus Presented August, 2007 b Ruitao Zhang Approved as to stle and content b: Edward J. Stanek III, Chair Graduate Program Director William

More information

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

Statistical data preparation: management of missing values and outliers

Statistical data preparation: management of missing values and outliers KJA Korean Journal of Anesthesiology Statistical Round pissn 2005-6419 eissn 2005-7563 Statistical data preparation: management of missing values and outliers Sang Kyu Kwak 1 and Jong Hae Kim 2 Departments

More information

IAS 3.9 Bivariate Data

IAS 3.9 Bivariate Data Year 13 Mathematics IAS 3.9 Bivariate Data Robert Lakeland & Carl Nugent Contents Achievement Standard.................................................. 2 Bivariate Data..........................................................

More information

Missing Data in Longitudinal Studies: Strategies for Bayesian Modeling, Sensitivity Analysis, and Causal Inference

Missing Data in Longitudinal Studies: Strategies for Bayesian Modeling, Sensitivity Analysis, and Causal Inference COURSE: Missing Data in Longitudinal Studies: Strategies for Bayesian Modeling, Sensitivity Analysis, and Causal Inference Mike Daniels (Department of Statistics, University of Florida) 20-21 October 2011

More information

Comparing Two ROC Curves Independent Groups Design

Comparing Two ROC Curves Independent Groups Design Chapter 548 Comparing Two ROC Curves Independent Groups Design Introduction This procedure is used to compare two ROC curves generated from data from two independent groups. In addition to producing a

More information

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION Fong Wang, Genentech Inc. Mary Lange, Immunex Corp. Abstract Many longitudinal studies and clinical trials are

More information

Methods for Computing Missing Item Response in Psychometric Scale Construction

Methods for Computing Missing Item Response in Psychometric Scale Construction American Journal of Biostatistics Original Research Paper Methods for Computing Missing Item Response in Psychometric Scale Construction Ohidul Islam Siddiqui Institute of Statistical Research and Training

More information

Logistic regression: Why we often can do what we think we can do 1.

Logistic regression: Why we often can do what we think we can do 1. Logistic regression: Why we often can do what we think we can do 1. Augst 8 th 2015 Maarten L. Buis, University of Konstanz, Department of History and Sociology maarten.buis@uni.konstanz.de All propositions

More information

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Dipak K. Dey, Ulysses Diva and Sudipto Banerjee Department of Statistics University of Connecticut, Storrs. March 16,

More information

Application of Cox Regression in Modeling Survival Rate of Drug Abuse

Application of Cox Regression in Modeling Survival Rate of Drug Abuse American Journal of Theoretical and Applied Statistics 2018; 7(1): 1-7 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20180701.11 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA BIOSTATISTICAL METHODS AND RESEARCH DESIGNS Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA Keywords: Case-control study, Cohort study, Cross-Sectional Study, Generalized

More information

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_ Journal of Evaluation in Clinical Practice ISSN 1356-1294 Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_1361 180..185 Ariel

More information

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS - CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS SECOND EDITION Raymond H. Myers Virginia Polytechnic Institute and State university 1 ~l~~l~l~~~~~~~l!~ ~~~~~l~/ll~~ Donated by Duxbury o Thomson Learning,,

More information

On the Use of Local Assessments for Monitoring Centrally Reviewed Endpoints with Missing Data in Clinical Trials*

On the Use of Local Assessments for Monitoring Centrally Reviewed Endpoints with Missing Data in Clinical Trials* On the Use of Local Assessments for Monitoring Centrally Reviewed Endpoints with Missing Data in Clinical Trials* The Harvard community has made this article openly available. Please share how this access

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

Comparison of imputation and modelling methods in the analysis of a physical activity trial with missing outcomes

Comparison of imputation and modelling methods in the analysis of a physical activity trial with missing outcomes IJE vol.34 no.1 International Epidemiological Association 2004; all rights reserved. International Journal of Epidemiology 2005;34:89 99 Advance Access publication 27 August 2004 doi:10.1093/ije/dyh297

More information

Exploring the Impact of Missing Data in Multiple Regression

Exploring the Impact of Missing Data in Multiple Regression Exploring the Impact of Missing Data in Multiple Regression Michael G Kenward London School of Hygiene and Tropical Medicine 28th May 2015 1. Introduction In this note we are concerned with the conduct

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Help! Statistics! Missing data. An introduction

Help! Statistics! Missing data. An introduction Help! Statistics! Missing data. An introduction Sacha la Bastide-van Gemert Medical Statistics and Decision Making Department of Epidemiology UMCG Help! Statistics! Lunch time lectures What? Frequently

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch.

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch. S05-2008 Imputation of Categorical Missing Data: A comparison of Multivariate Normal and Abstract Multinomial Methods Holmes Finch Matt Margraf Ball State University Procedures for the imputation of missing

More information

Missing Data and Imputation

Missing Data and Imputation Missing Data and Imputation Barnali Das NAACCR Webinar May 2016 Outline Basic concepts Missing data mechanisms Methods used to handle missing data 1 What are missing data? General term: data we intended

More information

LINEAR REGRESSION FOR BIVARIATE CENSORED DATA VIA MULTIPLE IMPUTATION WEI PAN. School of Public Health. 420 Delaware Street SE

LINEAR REGRESSION FOR BIVARIATE CENSORED DATA VIA MULTIPLE IMPUTATION WEI PAN. School of Public Health. 420 Delaware Street SE LINEAR REGRESSION FOR BIVARIATE CENSORED DATA VIA MULTIPLE IMPUTATION Research Report 99-002 WEI PAN Division of Biostatistics School of Public Health University of Minnesota A460 Mayo Building, Box 303

More information

Advanced Handling of Missing Data

Advanced Handling of Missing Data Advanced Handling of Missing Data One-day Workshop Nicole Janz ssrmcta@hermes.cam.ac.uk 2 Goals Discuss types of missingness Know advantages & disadvantages of missing data methods Learn multiple imputation

More information

EPI 200C Final, June 4 th, 2009 This exam includes 24 questions.

EPI 200C Final, June 4 th, 2009 This exam includes 24 questions. Greenland/Arah, Epi 200C Sp 2000 1 of 6 EPI 200C Final, June 4 th, 2009 This exam includes 24 questions. INSTRUCTIONS: Write all answers on the answer sheets supplied; PRINT YOUR NAME and STUDENT ID NUMBER

More information

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many

More information

Selecting the Right Data Analysis Technique

Selecting the Right Data Analysis Technique Selecting the Right Data Analysis Technique Levels of Measurement Nominal Ordinal Interval Ratio Discrete Continuous Continuous Variable Borgatta and Bohrnstedt state that "the most of central constructs

More information

Remarks on Bayesian Control Charts

Remarks on Bayesian Control Charts Remarks on Bayesian Control Charts Amir Ahmadi-Javid * and Mohsen Ebadi Department of Industrial Engineering, Amirkabir University of Technology, Tehran, Iran * Corresponding author; email address: ahmadi_javid@aut.ac.ir

More information

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan Sequential nonparametric regression multiple imputations Irina Bondarenko and Trivellore Raghunathan Department of Biostatistics, University of Michigan Ann Arbor, MI 48105 Abstract Multiple imputation,

More information

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Dean Eckles Department of Communication Stanford University dean@deaneckles.com Abstract

More information

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX

The Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX The Impact of Relative Standards on the Propensity to Disclose Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX 2 Web Appendix A: Panel data estimation approach As noted in the main

More information

Effects of propensity score overlap on the estimates of treatment effects. Yating Zheng & Laura Stapleton

Effects of propensity score overlap on the estimates of treatment effects. Yating Zheng & Laura Stapleton Effects of propensity score overlap on the estimates of treatment effects Yating Zheng & Laura Stapleton Introduction Recent years have seen remarkable development in estimating average treatment effects

More information

Some interpretational issues connected with observational studies

Some interpretational issues connected with observational studies Some interpretational issues connected with observational studies D.R. Cox Nuffield College, Oxford, UK and Nanny Wermuth Chalmers/Gothenburg University, Gothenburg, Sweden ABSTRACT After some general

More information

Person centered treatment (PeT) effects: Individualized treatment effects using instrumental variables

Person centered treatment (PeT) effects: Individualized treatment effects using instrumental variables Person centered treatment (PeT) effects: Individualized treatment effects using instrumental variables Anirban Basu Departments of Health Services, Pharmacy, and Economics University of Washington, Seattle

More information

Quantitative Trait Analysis in Sibling Pairs. Biostatistics 666

Quantitative Trait Analysis in Sibling Pairs. Biostatistics 666 Quantitative Trait Analsis in Sibling Pairs Biostatistics 666 Outline Likelihood function for bivariate data Incorporate genetic kinship coefficients Incorporate IBD probabilities The data Pairs of measurements

More information

Basic Biostatistics. Chapter 1. Content

Basic Biostatistics. Chapter 1. Content Chapter 1 Basic Biostatistics Jamalludin Ab Rahman MD MPH Department of Community Medicine Kulliyyah of Medicine Content 2 Basic premises variables, level of measurements, probability distribution Descriptive

More information

An Introduction to Multiple Imputation for Missing Items in Complex Surveys

An Introduction to Multiple Imputation for Missing Items in Complex Surveys An Introduction to Multiple Imputation for Missing Items in Complex Surveys October 17, 2014 Joe Schafer Center for Statistical Research and Methodology (CSRM) United States Census Bureau Views expressed

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Impact and adjustment of selection bias. in the assessment of measurement equivalence Impact and adjustment of selection bias in the assessment of measurement equivalence Thomas Klausch, Joop Hox,& Barry Schouten Working Paper, Utrecht, December 2012 Corresponding author: Thomas Klausch,

More information

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine Confounding by indication developments in matching, and instrumental variable methods Richard Grieve London School of Hygiene and Tropical Medicine 1 Outline 1. Causal inference and confounding 2. Genetic

More information

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Arun Advani and Tymon Sªoczy«ski 13 November 2013 Background When interested in small-sample properties of estimators,

More information

Strategies for handling missing data in randomised trials

Strategies for handling missing data in randomised trials Strategies for handling missing data in randomised trials NIHR statistical meeting London, 13th February 2012 Ian White MRC Biostatistics Unit, Cambridge, UK Plan 1. Why do missing data matter? 2. Popular

More information

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang At the end of this session,

More information

Data harmonization tutorial:teaser for FH2019

Data harmonization tutorial:teaser for FH2019 Data harmonization tutorial:teaser for FH2019 Alden Gross, Johns Hopkins Rich Jones, Brown University Friday Harbor Tahoe 22 Aug. 2018 1 / 50 Outline Outline What is harmonization? Approach Prestatistical

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Statistical methods for marginal inference from multivariate ordinal data Nooraee, Nazanin

Statistical methods for marginal inference from multivariate ordinal data Nooraee, Nazanin University of Groningen Statistical methods for marginal inference from multivariate ordinal data Nooraee, Nazanin DOI: 10.1016/j.csda.2014.03.009 IMPORTANT NOTE: You are advised to consult the publisher's

More information

Confidence Intervals On Subsets May Be Misleading

Confidence Intervals On Subsets May Be Misleading Journal of Modern Applied Statistical Methods Volume 3 Issue 2 Article 2 11-1-2004 Confidence Intervals On Subsets May Be Misleading Juliet Popper Shaffer University of California, Berkeley, shaffer@stat.berkeley.edu

More information

Statistical Techniques for Analyzing Data From Prevention Trials: Treatment of No-Shows Using Rubin's Causal Model

Statistical Techniques for Analyzing Data From Prevention Trials: Treatment of No-Shows Using Rubin's Causal Model Psychological Methods 1998, Vol. 3, No. 2,147-159 Copyright 1998 by the American Psychological Association, Inc. 1082-989X/98/$3.00 Statistical Techniques for Analyzing Data From Prevention Trials: Treatment

More information

Estimating average treatment effects from observational data using teffects

Estimating average treatment effects from observational data using teffects Estimating average treatment effects from observational data using teffects David M. Drukker Director of Econometrics Stata 2013 Nordic and Baltic Stata Users Group meeting Karolinska Institutet September

More information

Numerical Integration of Bivariate Gaussian Distribution

Numerical Integration of Bivariate Gaussian Distribution Numerical Integration of Bivariate Gaussian Distribution S. H. Derakhshan and C. V. Deutsch The bivariate normal distribution arises in many geostatistical applications as most geostatistical techniques

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA The European Agency for the Evaluation of Medicinal Products Evaluation of Medicines for Human Use London, 15 November 2001 CPMP/EWP/1776/99 COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO

More information

BRAIN TUMOR CLASSIFICATION USING GUSTAFSON KESSEL (G-K) FUZZY CLUSTERING ALGORITHM

BRAIN TUMOR CLASSIFICATION USING GUSTAFSON KESSEL (G-K) FUZZY CLUSTERING ALGORITHM BRAIN TUMOR CLASSIFICATION USING GUSTAFSON KESSEL (G-K) FUZZY CLUSTERING ALGORITHM A. S.Shankar, Dr.A.Asokan and Dr.D.Sivakumar Department o Electronics & Instrumentation Engineering, Annamalai Universit,

More information

Estimands, Missing Data and Sensitivity Analysis: some overview remarks. Roderick Little

Estimands, Missing Data and Sensitivity Analysis: some overview remarks. Roderick Little Estimands, Missing Data and Sensitivity Analysis: some overview remarks Roderick Little NRC Panel s Charge To prepare a report with recommendations that would be useful for USFDA's development of guidance

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

Practical Multivariate Analysis

Practical Multivariate Analysis Texts in Statistical Science Practical Multivariate Analysis Fifth Edition Abdelmonem Afifi Susanne May Virginia A. Clark CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Statistical techniques to evaluate the agreement degree of medicine measurements

Statistical techniques to evaluate the agreement degree of medicine measurements Statistical techniques to evaluate the agreement degree of medicine measurements Luís M. Grilo 1, Helena L. Grilo 2, António de Oliveira 3 1 lgrilo@ipt.pt, Mathematics Department, Polytechnic Institute

More information

Bivariate Probability Distributions

Bivariate Probability Distributions bivariate 0.2.2 Bivariate Probabilit Distributions Abb Spurdle September 18, 2018 Contains convenience functions for constructing and plotting bivariate probabilit distributions (probabilit mass functions,

More information

Two-stage Methods to Implement and Analyze the Biomarker-guided Clinical Trail Designs in the Presence of Biomarker Misclassification

Two-stage Methods to Implement and Analyze the Biomarker-guided Clinical Trail Designs in the Presence of Biomarker Misclassification RESEARCH HIGHLIGHT Two-stage Methods to Implement and Analyze the Biomarker-guided Clinical Trail Designs in the Presence of Biomarker Misclassification Yong Zang 1, Beibei Guo 2 1 Department of Mathematical

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

NORTH SOUTH UNIVERSITY TUTORIAL 1

NORTH SOUTH UNIVERSITY TUTORIAL 1 NORTH SOUTH UNIVERSITY TUTORIAL 1 REVIEW FROM BIOSTATISTICS I AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 DATA TYPES/ MEASUREMENT SCALES Categorical:

More information

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering Meta-Analysis Zifei Liu What is a meta-analysis; why perform a metaanalysis? How a meta-analysis work some basic concepts and principles Steps of Meta-analysis Cautions on meta-analysis 2 What is Meta-analysis

More information

Limited dependent variable regression models

Limited dependent variable regression models 181 11 Limited dependent variable regression models In the logit and probit models we discussed previously the dependent variable assumed values of 0 and 1, 0 representing the absence of an attribute and

More information

Memorial Sloan-Kettering Cancer Center

Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2007 Paper 14 On Comparing the Clustering of Regression Models

More information

Adjusting the Oral Health Related Quality of Life Measure (Using Ohip-14) for Floor and Ceiling Effects

Adjusting the Oral Health Related Quality of Life Measure (Using Ohip-14) for Floor and Ceiling Effects Journal of Oral Health & Community Dentistry original article Adjusting the Oral Health Related Quality of Life Measure (Using Ohip-14) for Floor and Ceiling Effects Andiappan M 1, Hughes FJ 2, Dunne S

More information

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill) Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller

More information

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén Chapter 12 General and Specific Factors in Selection Modeling Bengt Muthén Abstract This chapter shows how analysis of data on selective subgroups can be used to draw inference to the full, unselected

More information

Bias in Correlations from Selected Samples of Relatives: The Effects of Soft Selection

Bias in Correlations from Selected Samples of Relatives: The Effects of Soft Selection Behavior Genetics, Vol. 19, No. 2, 1989 Bias in Correlations from Selected Samples of Relatives: The Effects of Soft Selection M. C. Neale, ~ L. J. Eaves, 1 K. S. Kendler, 2 and J. K. Hewitt 1 Received

More information

A Comparison of Methods for Determining HIV Viral Set Point

A Comparison of Methods for Determining HIV Viral Set Point STATISTICS IN MEDICINE Statist. Med. 2006; 00:1 6 [Version: 2002/09/18 v1.11] A Comparison of Methods for Determining HIV Viral Set Point Y. Mei 1, L. Wang 2, S. E. Holte 2 1 School of Industrial and Systems

More information

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study STATISTICAL METHODS Epidemiology Biostatistics and Public Health - 2016, Volume 13, Number 1 Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

Clinical trials with incomplete daily diary data

Clinical trials with incomplete daily diary data Clinical trials with incomplete daily diary data N. Thomas 1, O. Harel 2, and R. Little 3 1 Pfizer Inc 2 University of Connecticut 3 University of Michigan BASS, 2015 Thomas, Harel, Little (Pfizer) Clinical

More information

Estimating heritability for cause specific mortality based on twin studies

Estimating heritability for cause specific mortality based on twin studies Estimating heritability for cause specific mortality based on twin studies Thomas H. Scheike, Klaus K. Holst Department of Biostatistics, University of Copenhagen Øster Farimagsgade 5, DK-1014 Copenhagen

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences p. 1/3 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences Lecture 14: Adjusting

More information