Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Size: px
Start display at page:

Download "Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers"

Transcription

1 Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Dipak K. Dey, Ulysses Diva and Sudipto Banerjee Department of Statistics University of Connecticut, Storrs. March 16, Modeling Multivariate Survival Data

2 Introduction Motivation and Research Objectives Spatial data is used in a variety of disciplines current focus is on spatially referenced survival data. 2 Modeling Multivariate Survival Data

3 Introduction Motivation and Research Objectives Spatial data is used in a variety of disciplines current focus is on spatially referenced survival data. Accounting for spatial correlation provides additional insights Failure to account for such association could potentially lead to spurious and sometimes misleading results 2 Modeling Multivariate Survival Data

4 Introduction Motivation and Research Objectives Spatial data is used in a variety of disciplines current focus is on spatially referenced survival data. Accounting for spatial correlation provides additional insights Failure to account for such association could potentially lead to spurious and sometimes misleading results Spatial variation in survival patterns often reveal underlying lurking factors, which help in better understanding of the disease patterns. 2 Modeling Multivariate Survival Data

5 Introduction Motivation and Research Objectives Spatial data is used in a variety of disciplines current focus is on spatially referenced survival data. Accounting for spatial correlation provides additional insights Failure to account for such association could potentially lead to spurious and sometimes misleading results Spatial variation in survival patterns often reveal underlying lurking factors, which help in better understanding of the disease patterns. Spatial-survival models attempt to map spatial frailties 2 Modeling Multivariate Survival Data

6 Introduction Motivation and Research Objectives Spatial data is used in a variety of disciplines current focus is on spatially referenced survival data. Accounting for spatial correlation provides additional insights Failure to account for such association could potentially lead to spurious and sometimes misleading results Spatial variation in survival patterns often reveal underlying lurking factors, which help in better understanding of the disease patterns. Spatial-survival models attempt to map spatial frailties Data available on patients with several cancers, where each cancer can potentially have its own spatial distribution. 2 Modeling Multivariate Survival Data

7 Introduction Motivation and Research Objectives Spatial data is used in a variety of disciplines current focus is on spatially referenced survival data. Accounting for spatial correlation provides additional insights Failure to account for such association could potentially lead to spurious and sometimes misleading results Spatial variation in survival patterns often reveal underlying lurking factors, which help in better understanding of the disease patterns. Spatial-survival models attempt to map spatial frailties Data available on patients with several cancers, where each cancer can potentially have its own spatial distribution. How should we model these multiple spatial frailties? 2 Modeling Multivariate Survival Data

8 The SEER database Introduction SEER (Surveillance Epidemiology and End Results) database of the National Cancer Institute records primary cancer types First primary is followed up with subsequent cancers. GIS-link: Each patient s county of residence is available 3 Modeling Multivariate Survival Data

9 The SEER database Introduction SEER (Surveillance Epidemiology and End Results) database of the National Cancer Institute records primary cancer types First primary is followed up with subsequent cancers. GIS-link: Each patient s county of residence is available Survival models for multiple cancers are sought for practical reasons: To assess survival from multiple primary cancers simultaneously To adjust survival rates from a specific primary cancer in the presence of other primary cancers 3 Modeling Multivariate Survival Data

10 The SEER database Multiple response data Date of Date of Survival Patient ID diagnosis Cancer type end-point Status Time (months) 1234 March, 1991 Colorectal May, 1994 Dead August, 1992 Stomach May, 1994 Dead February, 1994 Pancreas May, 1994 Dead 3 The SEER (Surveillance Epidemiology and End Results) database lists cases of multiple cancers by cancer types diagnosed. Together with medicare data one can extract an individual patient s complete medical history. Simple example: The patient in the above table Patient specific information available in the registry includes gender, race, marital status, and their county of residence (provided the state belongs to the SEER registry). 4 Modeling Multivariate Survival Data

11 The SEER database Multiple response data Assume there are I counties. In the i th county we observe n i patients. Total no. of patients: N = I i=1 n i Ordered pair (i, j): j th individual from the i th county. Each patient has been diagnosed with at least one of K pre-fixed cancer types. t ijk is the survival time for the (i, j) th individual from his diagnosis with the k th type of cancer. 5 Modeling Multivariate Survival Data

12 The SEER database Multiple response data Assume there are I counties. In the i th county we observe n i patients. Total no. of patients: N = I i=1 n i Ordered pair (i, j): j th individual from the i th county. Each patient has been diagnosed with at least one of K pre-fixed cancer types. t ijk is the survival time for the (i, j) th individual from his diagnosis with the k th type of cancer. Some special care is needed: Not all patients develop all the K types of cancer, List possible cancers {1, 2,..., K} and form patient-specific index set C (i,j) {1, 2,..., K}. Example: if patient 1234 in Section is the 10th individual from the 8th county and if colorectal, stomach and pancreas are cancer types 1, 3 and 5, in {1, 2,..., K} then C (8,10) = {1, 3, 5}. 5 Modeling Multivariate Survival Data

13 Survival modelling framework Multiple responses Survival times will be correlated for similar county-patient-cancer combinations. We capture these correlations by introducing appropriate frailties. 6 Modeling Multivariate Survival Data

14 Survival modelling framework Multiple responses Survival times will be correlated for similar county-patient-cancer combinations. We capture these correlations by introducing appropriate frailties. Multiple response frailty modelling h (t ijk ) = h 0 (t ijk ) exp ( x T ijk β + u (i,j) + v k + φ ik ) 6 Modeling Multivariate Survival Data

15 Survival modelling framework Multiple responses Survival times will be correlated for similar county-patient-cancer combinations. We capture these correlations by introducing appropriate frailties. Multiple response frailty modelling h (t ijk ) = h 0 (t ijk ) exp ( x T ijk β + u (i,j) + v k + φ ik ) u (i,j) : frailty for the (i, j) th patient (these may be looked upon as patient effects nested within counties). v k : frailty for the k th cancer type. φ ik be the frailty for the k th cancer type nested within the i th county. x ijk denotes the patient-cancer specific covariates. 6 Modeling Multivariate Survival Data

16 Bayesian hierarchical modelling The data likelihood Hierarchical models combine information from the data likelihood and priors on parameters. The data likelihood I n i i=1 j=1 k C (i,j) [S (t ijk x ijk, β, η)] [ h 0 (t ijk ; η) exp ( x T ijk β + u (i,j) + v k + φ ik )] δij S (t ijk x ijk, β, η) is the survival function evaluated at t ijk conditional on the parameters and observed covariates. 7 Modeling Multivariate Survival Data

17 Bayesian hierarchical modelling Modelling the frailties Patient effects nested within counties: Priors on u ij indep u (i,j) N ( 0, σi 2 ) Cancer effects: Priors on v k s v = (v 1,..., v K ) T N (0, Λ), where Λ is a K K (unknown) covariance matrix, which is assigned an Inverse-Wishart prior. 8 Modeling Multivariate Survival Data

18 Bayesian hierarchical modelling Spatial Frailties Spatial Effects: Form Φ = ( φ T 1,...φ T ) T I, where φ i = (φ i1,..., φ ik ) T is the collection of the spatial frailties for the K cancers within the i th county. Priors on φ ik s Φ = ( φ T 1,...φ T ) T I N (0, ΣW (α) Λ). Here Σ W (α) = (Diag (m i ) αw ) 1. m i is the number of neighbors of county i W is the adjacency matrix of the graph representing our region. Only α is random in Σ W (α). Note: α (0, 1) ensures a proper M CAR distribution. 9 Modeling Multivariate Survival Data

19 Bayesian hierarchical modelling Remarks on likelihood modelling h 0 (t) denotes baseline hazard function modelled semiparametrically as a mixture of Beta functions (see, e.g., Gelfand and Mallick, 1995 and Ibrahim et al., 2001) as cancer-specific (in which case we write h 0k (t)) or county-specific (in which case we write h 0i (t)). In principle we can also include subject-cancer interaction terms of the form w (ij)k, which may reveal cancer-specific variation in patient frailties. However, in practice reliably estimating these higher order interaction terms becomes difficult, especially with data such as ours, where many subjects die before contracting fewer than three cancers. 10 Modeling Multivariate Survival Data

20 Bayesian hierarchical modelling Spatial modelling extensions We are using the same Λ to model the MCAR and the cancer main effects: we are modelling the dispersion matrix of the space-cancer interactions, Φ, as a Kronecker product of spatial dispersion and cancer dispersion. 11 Modeling Multivariate Survival Data

21 Bayesian hierarchical modelling Spatial modelling extensions We are using the same Λ to model the MCAR and the cancer main effects: we are modelling the dispersion matrix of the space-cancer interactions, Φ, as a Kronecker product of spatial dispersion and cancer dispersion. We may generalize the MCAR (α, Λ) to MCAR ((α 1,..., α K ), Λ), enriching our model by assigning a different spatial smoothness parameter to each cancer type. (Carlin and Banerjee, 2003) 11 Modeling Multivariate Survival Data

22 Bayesian hierarchical modelling Spatial modelling extensions We are using the same Λ to model the MCAR and the cancer main effects: we are modelling the dispersion matrix of the space-cancer interactions, Φ, as a Kronecker product of spatial dispersion and cancer dispersion. We may generalize the MCAR (α, Λ) to MCAR ((α 1,..., α K ), Λ), enriching our model by assigning a different spatial smoothness parameter to each cancer type. (Carlin and Banerjee, 2003) Yet another modification considers v i N (0, Λ i ) (not identically distributed), whereupon each county has its own cancer-covariance pattern. 11 Modeling Multivariate Survival Data

23 Bayesian hierarchical modelling Spatial modelling extensions We are using the same Λ to model the MCAR and the cancer main effects: we are modelling the dispersion matrix of the space-cancer interactions, Φ, as a Kronecker product of spatial dispersion and cancer dispersion. We may generalize the MCAR (α, Λ) to MCAR ((α 1,..., α K ), Λ), enriching our model by assigning a different spatial smoothness parameter to each cancer type. (Carlin and Banerjee, 2003) Yet another modification considers v i N (0, Λ i ) (not identically distributed), whereupon each county has its own cancer-covariance pattern. In addition, one may compare MCAR (α, Λ) to MCAR (α, (Λ 1,..., Λ I )), retaining a common smoothness parameter for the different cancers (even put α = 1) but allow different covariance functions for different counties. 11 Modeling Multivariate Survival Data

24 Bayesian hierarchical modelling Spatial modelling extensions However, the above generalizations of the MCAR (α, Λ) will often require substantial information on multiple cancer patients. Unfortunately, such information is rare in practice. For instance, in the SEER database less than one percent of the patients have moderately large monitoring times with more than two different types of cancers. 12 Modeling Multivariate Survival Data

25 Bayesian hierarchical modelling Spatial modelling extensions However, the above generalizations of the MCAR (α, Λ) will often require substantial information on multiple cancer patients. Unfortunately, such information is rare in practice. For instance, in the SEER database less than one percent of the patients have moderately large monitoring times with more than two different types of cancers. Hence general models are identifiable only via priors. 12 Modeling Multivariate Survival Data

26 Bayesian hierarchical modelling Spatial modelling extensions However, the above generalizations of the MCAR (α, Λ) will often require substantial information on multiple cancer patients. Unfortunately, such information is rare in practice. For instance, in the SEER database less than one percent of the patients have moderately large monitoring times with more than two different types of cancers. Hence general models are identifiable only via priors. Therefore, we restrict our subsequent attention to the M CAR(α, Λ) specification and compare a spatially independent model with basically the same structure by replacing the Σ W (α) with a diagonal matrix which does not account for the spatial structure of the region. 12 Modeling Multivariate Survival Data

27 Bayesian hierarchical modelling Spatial modelling extensions However, the above generalizations of the MCAR (α, Λ) will often require substantial information on multiple cancer patients. Unfortunately, such information is rare in practice. For instance, in the SEER database less than one percent of the patients have moderately large monitoring times with more than two different types of cancers. Hence general models are identifiable only via priors. Therefore, we restrict our subsequent attention to the M CAR(α, Λ) specification and compare a spatially independent model with basically the same structure by replacing the Σ W (α) with a diagonal matrix which does not account for the spatial structure of the region. Now the independent frailties for the K cancers within the i th county, denoted by Φ, has a N ( 0, τ 2 I I I Λ ) distribution where τ 2 is a scale parameter. 12 Modeling Multivariate Survival Data

28 Numerical computations Models Model Description Model Model Description Model I Full MCAR (α, Λ) Model II MCAR (α, Λ) Without Patient Frailties {u (i,j) } Model III Full Spatial Independence Model Model IV Spatial Independence Model Without Patient Frailties {u (i,j) } 13 Modeling Multivariate Survival Data

29 Numerical computations MCAR model: MCAR (α, Λ) We design an appropriate Gibbs sampler for the above hierarchical model. β have full conditional distributions that are log-concave (with either flat or vague Gaussian priors) and can be updated using the Adaptive Rejection Sampling (ARS) algorithm (Gilks and Wild, 1992) or a Metropolis step. The same is true for the patient frailties { u (i,j) }, the cancer frailties {v k } and the spatial frailties {φ ik }. The mixture weights, η, in the baseline hazard modelling needs to be updated using a Metropolis step. The full conditionals for each of the σi 2 inverted-gamma distributions are conjugate Λ has a conjugate inverted-wishart distribution, which follows from the following Lemma: 14 Modeling Multivariate Survival Data

30 Numerical computations The Lemma Lemma: Lemma: Let A and B be I I and K K matrices and let x be any vector of length IK. Then, there exists a K K matrix C and an I I matrix D such that x T (A B) x = tr (CB) = tr (DA). 15 Modeling Multivariate Survival Data

31 Illustrations SEER data description Source: SEER Public Use Incidence Database on patients from the state of Iowa diagnosed with colorectal cancer. In the dataset, 28% of the patients survival times were censored, 44% of patients were females, 69% of patients had at most two primary cancers, 47% had colon cancer before rectal cancer, majority of cases (78% for colon and 75% for rectal cancer) were in the local or regional stage, and most of the cases were diagnosed when patients were 65 years or older (76% for colon and 74% for rectal cancer). 16 Modeling Multivariate Survival Data

32 Illustrations SEER data description Colorectal cancer involves two gastrointestinal related organs. Good motivation for using this cancer type to illustrate the methodology. Diagnosis: two cancer types (colon and rectum) with survival times recorded in months from diagnosis up to study termination (censored) or death (failure) patients; represent 96 of the 99 counties in Iowa. Age at diagnosis for each type of cancer categorized: < 55, 55 65, 65 75, and > 75. Stage of each type of cancer: in situ or unstaged, local, regional, or distant. Others: total number of primary cancers, dichotomized as 0 (nprimes 2) and 1 (nprimes > 2), and gender. An indicator variable, colon first, is used to capture the effect of the order of occurrence. 17 Modeling Multivariate Survival Data

33 Illustrations Parameter Estimates: Regressors Parameter Model I Model II Model III Model IV Patient-level Gender=Female (-0.36, -0.36) (-0.36, -0.36) (-0.36, -0.36) (-0.36, -0.36) Number of primaries (-0.30, -0.30) (-0.30, -0.30) (-0.30, -0.30) (-0.30, -0.30) Colon first 0.04 (0.04, 0.04) 0.04 (0.04, 0.04) 0.04 (0.04, 0.04) 0.04 (0.04, 0.04) Colon cancer Frailty, ν (-2.90, -1.98) (-2.92, -2.01) (-1.86, -1.56) (-2.05, -1.57) Stage In situ or Unstaged Local (-0.33, -0.18) (-0.33, -0.18) (-0.33, -0.18) (-0.32, -0.18) Regional 0.03 (-0.05, 0.10) 0.03 (-0.04, 0.10) 0.03 (-0.05, 0.10) 0.03 (-0.04, 0.11) Distant 1.12 (1.02, 1.21) 1.12 (1.02, 1.21) 1.12 (1.02, 1.21) 1.12 (1.02, 1.20) Age < < (0.36, 0.58) 0.47 (0.36, 0.58) 0.47 (0.36, 0.57) 0.47 (0.35, 0.57) 65 < (0.64, 0.84) 0.74 (0.64, 0.84) 0.74 (0.64, 0.84) 0.74 (0.64, 0.84) (1.07, 1.27) 1.16 (1.06, 1.26) 1.16 (1.06, 1.26) 1.16 (1.06, 1.26) Rectal cancer Frailty, ν (-2.63, -1.72) (-2.73, -1.80) (-2.07, -1.69) (-2.37, -1.75) Stage In situ or Unstaged Local (-0.81, 0.02) (-0.81, 0.04) (-0.80, 0.02) (-0.82, 0.02) Regional 0.07 (-0.36, 0.49) 0.07 (-0.35, 0.47) 0.07 (-0.35, 0.49) 0.07 (-0.35, 0.48) Distant 1.44 (1.16, 1.72) 1.44 (1.16, 1.72) 1.44 (1.16, 1.71) 1.44 (1.17, 1.73) Age at diagnosis < < (0.05, 0.66) 0.35 (0.05, 0.65) 0.36 (0.05, 0.66) 0.35 (0.04, 0.64) 65 < (0.36, 1.17) 0.76 (0.35, 1.16) 0.76 (0.36, 1.18) 0.76 (0.36, 1.17) (0.86, 1.70) 1.28 (0.87, 1.71) 1.28 (0.87, 1.72) 1.28 (0.86, 1.70) Reference categories. 18 Modeling Multivariate Survival Data

34 Illustrations Parameter Estimates: Other parameters Parameter Model I Model II Model III Model IV α 0.95 (0.81, 1.00) 0.95 (0.81, 1.00) η (0.88, 0.92) 0.90 (0.88, 0.92) 0.90 (0.88, 0.92) 0.90 (0.88, 0.92) η (0.01, 0.04) 0.03 (0.01, 0.04) 0.02 (0.01, 0.04) 0.02 (0.01, 0.04) η (0.01, 0.04) 0.03 (0.01, 0.04) 0.03 (0.01, 0.04) 0.03 (0.01, 0.04) η (0.01, 0.04) 0.03 (0.01, 0.04) 0.02 (0.01, 0.04) 0.02 (0.01, 0.04) η (0.01, 0.04) 0.03 (0.01, 0.04) 0.03 (0.01, 0.04) 0.02 (0.01, 0.04) Λ (0.59, 2.11) 1.30 (0.59, 2.12) 0.41 (0.20, 0.65) 0.41 (0.19, 0.65) Λ (0.20, 0.91) 0.57 (0.23, 0.96) 0.27 (0.13, 0.42) 0.29 (0.13, 0.46) Λ (0.90, 2.74) 1.84 (0.95, 2.83) 0.87 (0.48, 1.30) 0.98 (0.54, 1.47) ρ 0.36 (0.20, 0.53) 0.37 (0.21, 0.54) 0.45 (0.31, 0.60) 0.46 (0.31, 0.62) Precision is Modeling Multivariate Survival Data

35 Model Comparisons CPO and ALMPL score The conditional predictive ordinate (CPO) and the average log-marginal pseudo-likelihood (ALMPL) were used. (Gelfand and Dey, 1994, and Chen et al., 2000) CPO statistic CP O ij = f ( D ij x ij, D ( ij)) ( = ) 1 1 π (Θ D) Θ f (D ij x ij, Θ) D ij = (i, j) th observation, D is the complete data and Θ is the collection of parameters. ALMPL ALMP L = 1 N log CP O ij. ij Larger values of ALMPL indicate better model fit. 20 Modeling Multivariate Survival Data

36 Model Comparisons DIC score The modified Deviance Information Criterion (DIC) proposed by Celeux et al. (2006) was also used for model comparison. The DIC for a particular model with data y, random effects Z and parameters Ω and likelihood f ( y Ω ) is given by DIC score DIC = 4E Ω,Z [ log f ( y, Z Ω ) y ] +2E Z [ log p ( y, Z EΩ [ Ω y, Z ])] This essentially integrates out the random effects. Lower values of DIC indicate better models. Measure of complexity [ ( p D = DIC + 2E Ω,Z log f y, Z Ω ) ] y Smaller values pointing to models with lesser complexity. 21 Modeling Multivariate Survival Data

37 Model Comparisons Model Comparison Table Model ALMPL DIC p D Model I Model II Model III Model IV DIC difference from Model I (smaller indicates better model). 22 Modeling Multivariate Survival Data

38 Figures Index plots of CPO s Index plots of the log CP Oij A log CP OB ij for each model pair. Positive values indicate support for model A over model B while negative values indicate the opposite. From left to right, top to bottom: I-II, I-III, I-IV, II-III, II-IV, III-IV. Y-axes range is from -1.0 to 7.0 (comparisons involving Model IV were truncated at 7.0 to facilitate presentation: 23 Modeling Multivariate Survival Data

39 Figures Cancer frailties Distribution of cancer-specific frailties (y-axes) under the different models. Positive frailties indicate higher relative risks while negative values indicate the opposite: 24 Modeling Multivariate Survival Data

40 Figures Patient-specific frailties Distribution of average patient-specific frailties (y-axes) from different counties (x-axes) in Iowa under Model I (top) and Model III (bottom). Positive frailties indicate higher relative risks while negative values indicate the opposite: 25 Modeling Multivariate Survival Data

41 Figures Spatial frailties Distribution of spatial frailties (y-axes) for colon and rectal cancers from different counties (x-axes) in Iowa under best model. Positive frailties indicate higher relative risks while negative values indicate the opposite: 26 Modeling Multivariate Survival Data

42 Figures Spatial frailties: choropleth map Spatial frailties for colon and rectal cancers in Iowa under the different models. Color change from blue to red indicates an increase from negative to positive spatial frailties. Cutoff values are one standard deviation units from the mean of each cancer type in each model. Locations of urban areas are also indicated to potentially account for clustering: 27 Modeling Multivariate Survival Data

43 Concluding Remarks Remarks We presented a methodology for modelling spatially correlated multivariate cancer survival data, illustrated using colon and rectal cancer data for the state of Iowa obtained from the SEER database. 28 Modeling Multivariate Survival Data

44 Concluding Remarks Remarks We presented a methodology for modelling spatially correlated multivariate cancer survival data, illustrated using colon and rectal cancer data for the state of Iowa obtained from the SEER database. The results suggest that there is positive correlation between the hazard rates of colon and rectal cancers. This information may not be available or interpretable when modelling the cancers independently. 28 Modeling Multivariate Survival Data

45 Concluding Remarks Remarks We presented a methodology for modelling spatially correlated multivariate cancer survival data, illustrated using colon and rectal cancer data for the state of Iowa obtained from the SEER database. The results suggest that there is positive correlation between the hazard rates of colon and rectal cancers. This information may not be available or interpretable when modelling the cancers independently. Other mechanisms for handling spatial covariation, such as the shared component approach presented by Held and Best (2001) applied to disease mapping of oral and esophageal cancers, are also possible. The basic modelling framework may be extended to different hazard structures if the need arises. 28 Modeling Multivariate Survival Data

46 Concluding Remarks Remarks We presented a methodology for modelling spatially correlated multivariate cancer survival data, illustrated using colon and rectal cancer data for the state of Iowa obtained from the SEER database. The results suggest that there is positive correlation between the hazard rates of colon and rectal cancers. This information may not be available or interpretable when modelling the cancers independently. Other mechanisms for handling spatial covariation, such as the shared component approach presented by Held and Best (2001) applied to disease mapping of oral and esophageal cancers, are also possible. The basic modelling framework may be extended to different hazard structures if the need arises. 28 Modeling Multivariate Survival Data

47 Concluding Remarks Thank you! 29 Modeling Multivariate Survival Data

Joint Spatio-Temporal Modeling of Low Incidence Cancers Sharing Common Risk Factors

Joint Spatio-Temporal Modeling of Low Incidence Cancers Sharing Common Risk Factors Journal of Data Science 6(2008), 105-123 Joint Spatio-Temporal Modeling of Low Incidence Cancers Sharing Common Risk Factors Jacob J. Oleson 1,BrianJ.Smith 1 and Hoon Kim 2 1 The University of Iowa and

More information

Application of EM Algorithm to Mixture Cure Model for Grouped Relative Survival Data

Application of EM Algorithm to Mixture Cure Model for Grouped Relative Survival Data Journal of Data Science 5(2007), 41-51 Application of EM Algorithm to Mixture Cure Model for Grouped Relative Survival Data Binbing Yu 1 and Ram C. Tiwari 2 1 Information Management Services, Inc. and

More information

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan Kosuke Imai Princeton University Joint work with Will Bullock and Jacob

More information

Prediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer

Prediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer Prediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer Ronghui (Lily) Xu Division of Biostatistics and Bioinformatics Department of Family Medicine

More information

14. Linear Mixed-Effects Models for Data from Split-Plot Experiments

14. Linear Mixed-Effects Models for Data from Split-Plot Experiments 14. Linear Mixed-Effects Models for Data from Split-Plot Experiments opyright c 219 Dan Nettleton (Iowa State University) 14. Statistics 51 1 / 3 Start with a Field Field opyright c 219 Dan Nettleton (Iowa

More information

BAYESIAN ANALYSIS OF CANCER MORTALITY RATES FROM DIFFERENT TYPES AND THEIR RELATIVE OCCURENCES

BAYESIAN ANALYSIS OF CANCER MORTALITY RATES FROM DIFFERENT TYPES AND THEIR RELATIVE OCCURENCES BAYESIAN ANALYSIS OF CANCER MORTALITY RATES FROM DIFFERENT TYPES AND THEIR RELATIVE OCCURENCES by Sophie M. Delcroix A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE In partial

More information

Modelling prognostic capabilities of tumor size: application to colorectal cancer

Modelling prognostic capabilities of tumor size: application to colorectal cancer Session 3: Epidemiology and public health Modelling prognostic capabilities of tumor size: application to colorectal cancer Virginie Rondeau, INSERM Modelling prognostic capabilities of tumor size : application

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

arxiv: v3 [stat.ap] 31 Jul 2017

arxiv: v3 [stat.ap] 31 Jul 2017 Bayesian semiparametric modelling of contraceptive behavior in India via sequential logistic regressions Tommaso Rigon, Bocconi University, Department of Decision Sciences, Via Roentgen 1, Milan, Italy

More information

Bayesian hierarchical joint modeling of repeatedly measured continuous and ordinal markers of disease severity: Application to Ugandan diabetes data

Bayesian hierarchical joint modeling of repeatedly measured continuous and ordinal markers of disease severity: Application to Ugandan diabetes data Received: 14 June 2016 Revised: 27 June 2017 Accepted: 27 July 2017 DOI: 10.1002/sim.7444 RESEARCH ARTICLE Bayesian hierarchical joint modeling of repeatedly measured continuous and ordinal markers of

More information

Outlier Analysis. Lijun Zhang

Outlier Analysis. Lijun Zhang Outlier Analysis Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Extreme Value Analysis Probabilistic Models Clustering for Outlier Detection Distance-Based Outlier Detection Density-Based

More information

SUPPLEMENTARY MATERIAL. Impact of Vaccination on 14 High-Risk HPV type infections: A Mathematical Modelling Approach

SUPPLEMENTARY MATERIAL. Impact of Vaccination on 14 High-Risk HPV type infections: A Mathematical Modelling Approach SUPPLEMENTARY MATERIAL Impact of Vaccination on 14 High-Risk HPV type infections: A Mathematical Modelling Approach Simopekka Vänskä, Kari Auranen, Tuija Leino, Heini Salo, Pekka Nieminen, Terhi Kilpi,

More information

SCHOOL OF MATHEMATICS AND STATISTICS

SCHOOL OF MATHEMATICS AND STATISTICS Data provided: Tables of distributions MAS603 SCHOOL OF MATHEMATICS AND STATISTICS Further Clinical Trials Spring Semester 014 015 hours Candidates may bring to the examination a calculator which conforms

More information

Bayesian Modeling of Multivariate Spatial Binary Data with Application to Dental Caries

Bayesian Modeling of Multivariate Spatial Binary Data with Application to Dental Caries Bayesian Modeling of Multivariate Spatial Binary Data with Application to Dental Caries Dipankar Bandyopadhyay dbandyop@umn.edu Division of Biostatistics, School of Public Health University of Minnesota

More information

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia American Journal of Theoretical and Applied Statistics 2017; 6(4): 182-190 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20170604.13 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

Combining Risks from Several Tumors Using Markov Chain Monte Carlo University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln U.S. Environmental Protection Agency Papers U.S. Environmental Protection Agency 2009 Combining Risks from Several Tumors

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions J. Harvey a,b, & A.J. van der Merwe b a Centre for Statistical Consultation Department of Statistics

More information

Cancer survivorship and labor market attachments: Evidence from MEPS data

Cancer survivorship and labor market attachments: Evidence from MEPS data Cancer survivorship and labor market attachments: Evidence from 2008-2014 MEPS data University of Memphis, Department of Economics January 7, 2018 Presentation outline Motivation and previous literature

More information

Bayesian Estimation of a Meta-analysis model using Gibbs sampler

Bayesian Estimation of a Meta-analysis model using Gibbs sampler University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2012 Bayesian Estimation of

More information

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Timothy N. Rubin (trubin@uci.edu) Michael D. Lee (mdlee@uci.edu) Charles F. Chubb (cchubb@uci.edu) Department of Cognitive

More information

Bayesian Nonparametric Methods for Precision Medicine

Bayesian Nonparametric Methods for Precision Medicine Bayesian Nonparametric Methods for Precision Medicine Brian Reich, NC State Collaborators: Qian Guan (NCSU), Eric Laber (NCSU) and Dipankar Bandyopadhyay (VCU) University of Illinois at Urbana-Champaign

More information

CDRI Cancer Disparities Geocoding Project. November 29, 2006 Chris Johnson, CDRI

CDRI Cancer Disparities Geocoding Project. November 29, 2006 Chris Johnson, CDRI CDRI Cancer Disparities Geocoding Project November 29, 2006 Chris Johnson, CDRI cjohnson@teamiha.org CDRI Cancer Disparities Geocoding Project Purpose: To describe and understand variations in cancer incidence,

More information

A bivariate parametric long-range dependent time series model with general phase

A bivariate parametric long-range dependent time series model with general phase A bivariate parametric long-range dependent time series model with general phase Vladas Pipiras (joint work with Stefanos Kechagias, SAS) University of North Carolina April 1, 2017 AMS Sectional Meeting,

More information

A Bayesian approach to sample size determination for studies designed to evaluate continuous medical tests

A Bayesian approach to sample size determination for studies designed to evaluate continuous medical tests Baylor Health Care System From the SelectedWorks of unlei Cheng 1 A Bayesian approach to sample size determination for studies designed to evaluate continuous medical tests unlei Cheng, Baylor Health Care

More information

with Stata Bayesian Analysis John Thompson University of Leicester A Stata Press Publication StataCorp LP College Station, Texas

with Stata Bayesian Analysis John Thompson University of Leicester A Stata Press Publication StataCorp LP College Station, Texas Bayesian Analysis with Stata John Thompson University of Leicester A Stata Press Publication StataCorp LP College Station, Texas Contents List of figures List of tables Preface Acknowledgments xiii xvii

More information

2014 Modern Modeling Methods (M 3 ) Conference, UCONN

2014 Modern Modeling Methods (M 3 ) Conference, UCONN 2014 Modern Modeling (M 3 ) Conference, UCONN Comparative study of two calibration methods for micro-simulation models Department of Biostatistics Center for Statistical Sciences Brown University School

More information

Bayesian Latent Subgroup Design for Basket Trials

Bayesian Latent Subgroup Design for Basket Trials Bayesian Latent Subgroup Design for Basket Trials Yiyi Chu Department of Biostatistics The University of Texas School of Public Health July 30, 2017 Outline Introduction Bayesian latent subgroup (BLAST)

More information

Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System

Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System Yangxin Huang, Dacheng Liu and Hulin Wu Department of Biostatistics & Computational Biology University of

More information

Design for Targeted Therapies: Statistical Considerations

Design for Targeted Therapies: Statistical Considerations Design for Targeted Therapies: Statistical Considerations J. Jack Lee, Ph.D. Department of Biostatistics University of Texas M. D. Anderson Cancer Center Outline Premise General Review of Statistical Designs

More information

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials Practical Bayesian Design and Analysis for Drug and Device Clinical Trials p. 1/2 Practical Bayesian Design and Analysis for Drug and Device Clinical Trials Brian P. Hobbs Plan B Advisor: Bradley P. Carlin

More information

Knowledge Discovery and Data Mining I

Knowledge Discovery and Data Mining I Ludwig-Maximilians-Universität München Lehrstuhl für Datenbanksysteme und Data Mining Prof. Dr. Thomas Seidl Knowledge Discovery and Data Mining I Winter Semester 2018/19 Introduction What is an outlier?

More information

Bayesian Modelling of the Consistency of Symptoms Reported During Hypoglycaemia for Individual Patients

Bayesian Modelling of the Consistency of Symptoms Reported During Hypoglycaemia for Individual Patients Malaysian Journal of Mathematical Sciences 10(S) March : 27-39 (2016) Special Issue: The 10th IMT-GT International Conference on Mathematics, Statistics and its Applications 2014 (ICMSA 2014) MALAYSIAN

More information

Jonathan D. Sugimoto, PhD Lecture Website:

Jonathan D. Sugimoto, PhD Lecture Website: Jonathan D. Sugimoto, PhD jons@fredhutch.org Lecture Website: http://www.cidid.org/transtat/ 1 Introduction to TranStat Lecture 6: Outline Case study: Pandemic influenza A(H1N1) 2009 outbreak in Western

More information

Bayesian Bi-Cluster Change-Point Model for Exploring Functional Brain Dynamics

Bayesian Bi-Cluster Change-Point Model for Exploring Functional Brain Dynamics Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'18 85 Bayesian Bi-Cluster Change-Point Model for Exploring Functional Brain Dynamics Bing Liu 1*, Xuan Guo 2, and Jing Zhang 1** 1 Department

More information

Faculty of Sciences School for Information Technology Master of Statistics

Faculty of Sciences School for Information Technology Master of Statistics Faculty of Sciences School for Information Technology Master of Statistics Masterthesis The bivariate spatial modelling of breast and ovary cancer in Limburg Leyla Kodalci Thesis presented in fulfillment

More information

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data Appl. Statist. (2018) 67, Part 1, pp. 145 163 Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data Qiuju Li and Li Su Medical Research Council

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives

Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives DOI 10.1186/s12868-015-0228-5 BMC Neuroscience RESEARCH ARTICLE Open Access Multilevel analysis quantifies variation in the experimental effect while optimizing power and preventing false positives Emmeke

More information

Spatio-temporal modelling and probabilistic forecasting of infectious disease counts

Spatio-temporal modelling and probabilistic forecasting of infectious disease counts Spatio-temporal modelling and probabilistic forecasting of infectious disease counts Sebastian Meyer Institute of Medical Informatics, Biometry, and Epidemiology Friedrich-Alexander-Universität Erlangen-Nürnberg,

More information

TRIPODS Workshop: Models & Machine Learning for Causal I. & Decision Making

TRIPODS Workshop: Models & Machine Learning for Causal I. & Decision Making TRIPODS Workshop: Models & Machine Learning for Causal Inference & Decision Making in Medical Decision Making : and Predictive Accuracy text Stavroula Chrysanthopoulou, PhD Department of Biostatistics

More information

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes. Final review Based in part on slides from textbook, slides of Susan Holmes December 5, 2012 1 / 1 Final review Overview Before Midterm General goals of data mining. Datatypes. Preprocessing & dimension

More information

Using mixture priors for robust inference: application in Bayesian dose escalation trials

Using mixture priors for robust inference: application in Bayesian dose escalation trials Using mixture priors for robust inference: application in Bayesian dose escalation trials Astrid Jullion, Beat Neuenschwander, Daniel Lorand BAYES2014, London, 11 June 2014 Agenda Dose escalation in oncology

More information

HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA

HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA By Kenneth Shirley, Dylan Small, Kevin Lynch, Stephen Maisto, and David Oslin Columbia University, University of Pennsylvania and Syracuse University

More information

arxiv: v2 [stat.ap] 11 Apr 2018

arxiv: v2 [stat.ap] 11 Apr 2018 A Novel Bayesian Multiple Testing Approach to Deregulated mirna Discovery Harnessing Positional Clustering Noirrit Kiran Chandra 1, Richa Singh 2 and Sourabh Bhattacharya 1 1 Interdisciplinary Statistical

More information

Cancer in Rural Illinois, Incidence, Mortality, Staging, and Access to Care. April 2014

Cancer in Rural Illinois, Incidence, Mortality, Staging, and Access to Care. April 2014 Cancer in Rural Illinois, 1990-2010 Incidence, Mortality, Staging, and Access to Care April 2014 Prepared by Whitney E. Zahnd, MS Research Development Coordinator Center for Clinical Research Southern

More information

Comparison of discrimination methods for the classification of tumors using gene expression data

Comparison of discrimination methods for the classification of tumors using gene expression data Comparison of discrimination methods for the classification of tumors using gene expression data Sandrine Dudoit, Jane Fridlyand 2 and Terry Speed 2,. Mathematical Sciences Research Institute, Berkeley

More information

Spatiotemporal models for disease incidence data: a case study

Spatiotemporal models for disease incidence data: a case study Spatiotemporal models for disease incidence data: a case study Erik A. Sauleau 1,2, Monica Musio 3, Nicole Augustin 4 1 Medicine Faculty, University of Strasbourg, France 2 Haut-Rhin Cancer Registry 3

More information

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Ruth Penman and Gillian Heller, Department of Statistics, Macquarie University, Sydney, Australia. Correspondence: Ruth Penman, Department

More information

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Selection of Linking Items

Selection of Linking Items Selection of Linking Items Subset of items that maximally reflect the scale information function Denote the scale information as Linear programming solver (in R, lp_solve 5.5) min(y) Subject to θ, θs,

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

Multivariate meta-analysis using individual participant data

Multivariate meta-analysis using individual participant data Original Article Received 19 February 2014, Revised 10 October 2014, Accepted 17 October 2014 Published online 21 November 2014 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/jrsm.1129 Multivariate

More information

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis Thesis Proposal Indrayana Rustandi April 3, 2007 Outline Motivation and Thesis Preliminary results: Hierarchical

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

Gianluca Baio. University College London Department of Statistical Science.

Gianluca Baio. University College London Department of Statistical Science. Bayesian hierarchical models and recent computational development using Integrated Nested Laplace Approximation, with applications to pre-implantation genetic screening in IVF Gianluca Baio University

More information

Multivariate Multilevel Models

Multivariate Multilevel Models Multivariate Multilevel Models Getachew A. Dagne George W. Howe C. Hendricks Brown Funded by NIMH/NIDA 11/20/2014 (ISSG Seminar) 1 Outline What is Behavioral Social Interaction? Importance of studying

More information

BayesOpt: Extensions and applications

BayesOpt: Extensions and applications BayesOpt: Extensions and applications Javier González Masterclass, 7-February, 2107 @Lancaster University Agenda of the day 9:00-11:00, Introduction to Bayesian Optimization: What is BayesOpt and why it

More information

Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document.

Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document. Multivariate meta-analysis using individual participant data Riley, Richard; Price, Malcolm; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R. DOI: 10.1002/jrsm.1129 License:

More information

Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models

Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models Florian M. Hollenbach Department of Political Science Texas A&M University Jacob M. Montgomery

More information

Bivariate Models for the Analysis of Internal Nitrogen Use Efficiency: Mixture Models as an Exploratory Tool

Bivariate Models for the Analysis of Internal Nitrogen Use Efficiency: Mixture Models as an Exploratory Tool Bivariate Models for the Analysis of Internal Nitrogen Use Efficiency: Mixture Models as an Exploratory Tool Isabel Munoz Santa (Masters of Applied Science in Biometrics) A thesis submitted for the degree

More information

Age-Adjusted US Cancer Death Rate Predictions

Age-Adjusted US Cancer Death Rate Predictions Georgia State University ScholarWorks @ Georgia State University Public Health Faculty Publications School of Public Health 2010 Age-Adjusted US Cancer Death Rate Predictions Matt Hayat Georgia State University,

More information

An application of a pattern-mixture model with multiple imputation for the analysis of longitudinal trials with protocol deviations

An application of a pattern-mixture model with multiple imputation for the analysis of longitudinal trials with protocol deviations Iddrisu and Gumedze BMC Medical Research Methodology (2019) 19:10 https://doi.org/10.1186/s12874-018-0639-y RESEARCH ARTICLE Open Access An application of a pattern-mixture model with multiple imputation

More information

Bayesian Dose Escalation Study Design with Consideration of Late Onset Toxicity. Li Liu, Glen Laird, Lei Gao Biostatistics Sanofi

Bayesian Dose Escalation Study Design with Consideration of Late Onset Toxicity. Li Liu, Glen Laird, Lei Gao Biostatistics Sanofi Bayesian Dose Escalation Study Design with Consideration of Late Onset Toxicity Li Liu, Glen Laird, Lei Gao Biostatistics Sanofi 1 Outline Introduction Methods EWOC EWOC-PH Modifications to account for

More information

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ Tutorial in Biostatistics Received: 11 March 2016, Accepted: 13 September 2016 Published online 16 October 2016 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.7141 Meta-analysis using

More information

Bayesian Inference. Thomas Nichols. With thanks Lee Harrison

Bayesian Inference. Thomas Nichols. With thanks Lee Harrison Bayesian Inference Thomas Nichols With thanks Lee Harrison Attention to Motion Paradigm Results Attention No attention Büchel & Friston 1997, Cereb. Cortex Büchel et al. 1998, Brain - fixation only -

More information

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison Group-Level Diagnosis 1 N.B. Please do not cite or distribute. Multilevel IRT for group-level diagnosis Chanho Park Daniel M. Bolt University of Wisconsin-Madison Paper presented at the annual meeting

More information

Modern Regression Methods

Modern Regression Methods Modern Regression Methods Second Edition THOMAS P. RYAN Acworth, Georgia WILEY A JOHN WILEY & SONS, INC. PUBLICATION Contents Preface 1. Introduction 1.1 Simple Linear Regression Model, 3 1.2 Uses of Regression

More information

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait.

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait. The Extended Liability-Threshold Model Klaus K. Holst Thomas Scheike, Jacob Hjelmborg 014-05-1 Heritability Twin studies Include both monozygotic (MZ) and dizygotic (DZ) twin

More information

Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data

Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data Sandro Gsteiger, Beat Neuenschwander, and Heinz Schmidli Novartis Pharma AG Bayes Pharma,

More information

Estimation of Area under the ROC Curve Using Exponential and Weibull Distributions

Estimation of Area under the ROC Curve Using Exponential and Weibull Distributions XI Biennial Conference of the International Biometric Society (Indian Region) on Computational Statistics and Bio-Sciences, March 8-9, 22 43 Estimation of Area under the ROC Curve Using Exponential and

More information

Bayesian Semiparametric Estimation of Cancer-specific Age-at-onset Penetrance with Application to. Li-Fraumeni Syndrome

Bayesian Semiparametric Estimation of Cancer-specific Age-at-onset Penetrance with Application to. Li-Fraumeni Syndrome Bayesian Semiparametric Estimation of Cancer-specific Age-at-onset Penetrance with Application to arxiv:1701.01558v4 [stat.ap] 3 May 2018 Li-Fraumeni Syndrome Seung Jun Shin 1, Ying Yuan 2,, Louise C.

More information

Emerging Themes in Epidemiology. Open Access RESEARCH ARTICLE. Dan Li 1, Ruth Keogh 2, John P. Clancy 3 and Rhonda D.

Emerging Themes in Epidemiology. Open Access RESEARCH ARTICLE. Dan Li 1, Ruth Keogh 2, John P. Clancy 3 and Rhonda D. DOI 10.1186/s12982-017-0067-1 Emerging Themes in Epidemiology RESEARCH ARTICLE Open Access Flexible semiparametric joint modeling: an application to estimate individual lung function decline and risk of

More information

Advanced IPD meta-analysis methods for observational studies

Advanced IPD meta-analysis methods for observational studies Advanced IPD meta-analysis methods for observational studies Simon Thompson University of Cambridge, UK Part 4 IBC Victoria, July 2016 1 Outline of talk Usual measures of association (e.g. hazard ratios)

More information

How to interpret scientific & statistical graphs

How to interpret scientific & statistical graphs How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott 1 A brief introduction Graphics:

More information

Some alternatives for Inhomogeneous Poisson Point Processes for presence only data

Some alternatives for Inhomogeneous Poisson Point Processes for presence only data Some alternatives for Inhomogeneous Poisson Point Processes for presence only data Hassan Doosti Macquarie University hassan.doosti@mq.edu.au July 6, 2017 Hassan Doosti (MQU) Inhomogeneous Spatial Point

More information

A Comparison of Methods for Determining HIV Viral Set Point

A Comparison of Methods for Determining HIV Viral Set Point STATISTICS IN MEDICINE Statist. Med. 2006; 00:1 6 [Version: 2002/09/18 v1.11] A Comparison of Methods for Determining HIV Viral Set Point Y. Mei 1, L. Wang 2, S. E. Holte 2 1 School of Industrial and Systems

More information

Information Systems Mini-Monograph

Information Systems Mini-Monograph Information Systems Mini-Monograph Interpreting Posterior Relative Risk Estimates in Disease-Mapping Studies Sylvia Richardson, Andrew Thomson, Nicky Best, and Paul Elliott Small Area Health Statistics

More information

Bayesian approaches to handling missing data: Practical Exercises

Bayesian approaches to handling missing data: Practical Exercises Bayesian approaches to handling missing data: Practical Exercises 1 Practical A Thanks to James Carpenter and Jonathan Bartlett who developed the exercise on which this practical is based (funded by ESRC).

More information

The North Carolina Health Data Explorer

The North Carolina Health Data Explorer The North Carolina Health Data Explorer The Health Data Explorer provides access to health data for North Carolina counties in an interactive, user-friendly atlas of maps, tables, and charts. It allows

More information

Reveal Relationships in Categorical Data

Reveal Relationships in Categorical Data SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction

More information

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:

More information

A Diabetes minimal model for Oral Glucose Tolerance Tests

A Diabetes minimal model for Oral Glucose Tolerance Tests arxiv:1601.04753v1 [stat.ap] 18 Jan 2016 A Diabetes minimal model for Oral Glucose Tolerance Tests J. Andrés Christen a, Marcos Capistrán a, Adriana Monroy b, Silvestre Alavez c, Silvia Quintana Vargas

More information

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA 1 Example: Clinical Trial of an Anti-Epileptic Drug 59 epileptic patients randomized to progabide or placebo (Leppik et al., 1987) (Described in Fitzmaurice

More information

Package xmeta. August 16, 2017

Package xmeta. August 16, 2017 Type Package Title A Toolbox for Multivariate Meta-Analysis Version 1.1-4 Date 2017-5-12 Package xmeta August 16, 2017 Imports aod, glmmml, numderiv, metafor, mvmeta, stats A toolbox for meta-analysis.

More information

Appendix 1: Systematic Review Protocol [posted as supplied by author]

Appendix 1: Systematic Review Protocol [posted as supplied by author] Appendix 1: Systematic Review Protocol [posted as supplied by author] Title Physical Activity and Risks of Breast Cancer, Colon Cancer,, Ischemic Heart Disease and Ischemic Stroke Events: A Systematic

More information

Memorial Sloan-Kettering Cancer Center

Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2007 Paper 14 On Comparing the Clustering of Regression Models

More information

Live WebEx meeting agenda

Live WebEx meeting agenda 10:00am 10:30am Using OpenMeta[Analyst] to extract quantitative data from published literature Live WebEx meeting agenda August 25, 10:00am-12:00pm ET 10:30am 11:20am Lecture (this will be recorded) 11:20am

More information

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints p.1/40 Data Collected in Phase IIb/III Vaccine Trial Longitudinal

More information

An application of topological relations of fuzzy regions with holes

An application of topological relations of fuzzy regions with holes Chapter 5 An application of topological relations of fuzzy regions with holes 5.1 Introduction In the last two chapters, we have provided theoretical frameworks for fuzzy regions with holes in terms of

More information

Statistical Tolerance Regions: Theory, Applications and Computation

Statistical Tolerance Regions: Theory, Applications and Computation Statistical Tolerance Regions: Theory, Applications and Computation K. KRISHNAMOORTHY University of Louisiana at Lafayette THOMAS MATHEW University of Maryland Baltimore County Contents List of Tables

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Econometric Game 2012: infants birthweight?

Econometric Game 2012: infants birthweight? Econometric Game 2012: How does maternal smoking during pregnancy affect infants birthweight? Case A April 18, 2012 1 Introduction Low birthweight is associated with adverse health related and economic

More information

Statistical Models for Enhancing Cross-Population Comparability

Statistical Models for Enhancing Cross-Population Comparability Statistical Models for Enhancing Cross-Population Comparability A. Tandon, C.J.L. Murray, J.A. Salomon, and G. King Global Programme on Evidence for Health Policy Discussion Paper No. 42 World Health Organization,

More information

Linking Errors in Trend Estimation in Large-Scale Surveys: A Case Study

Linking Errors in Trend Estimation in Large-Scale Surveys: A Case Study Research Report Linking Errors in Trend Estimation in Large-Scale Surveys: A Case Study Xueli Xu Matthias von Davier April 2010 ETS RR-10-10 Listening. Learning. Leading. Linking Errors in Trend Estimation

More information

County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS)

County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS) County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS) Van L. Parsons, Nathaniel Schenker Office of Research and

More information

Supplementary Material Estimating the Distribution of Sensorimotor Synchronization Data: A Bayesian Hierarchical Modeling Approach Rasmus Bååth

Supplementary Material Estimating the Distribution of Sensorimotor Synchronization Data: A Bayesian Hierarchical Modeling Approach Rasmus Bååth Supplementary Material Estimating the Distribution of Sensorimotor Synchronization Data: A Bayesian Hierarchical Modeling Approach Rasmus Bååth One advantage of working within a Bayesian statistical framework

More information

Metropolitan and Micropolitan Statistical Area Cancer Incidence: Late Stage Diagnoses for Cancers Amenable to Screening, Idaho

Metropolitan and Micropolitan Statistical Area Cancer Incidence: Late Stage Diagnoses for Cancers Amenable to Screening, Idaho Metropolitan and Micropolitan Statistical Area Cancer Incidence: Late Stage Diagnoses for Cancers Amenable to Screening, Idaho 26-29 December 2 A Publication of the CANCER DATA REGISTRY OF IDAHO P.O. Box

More information

Incidence of Cancers Associated with Modifiable Risk Factors and Late Stage Diagnoses for Cancers Amenable to Screening Idaho

Incidence of Cancers Associated with Modifiable Risk Factors and Late Stage Diagnoses for Cancers Amenable to Screening Idaho Incidence of Cancers Associated with Modifiable Risk Factors and Late Stage Diagnoses for Cancers Amenable to Screening Idaho 2008-2011 August 2013 A Publication of the Cancer Data Registry of Idaho PO

More information