EPI 200C Final, June 4 th, 2009 This exam includes 24 questions.

Size: px
Start display at page:

Download "EPI 200C Final, June 4 th, 2009 This exam includes 24 questions."

Transcription

1 Greenland/Arah, Epi 200C Sp of 6 EPI 200C Final, June 4 th, 2009 This exam includes 24 questions. INSTRUCTIONS: Write all answers on the answer sheets supplied; PRINT YOUR NAME and STUDENT ID NUMBER AT THE TOP OF THAT SHEET. Keep your exam questions for the review session, Thursday, June 12 th, The questions are multiple choice. There may be more than one correct answer for each question. List all the answers you agree with on the answer sheet. USE BLOCK CAPITAL LETTERS. Legibility is your responsibility. Each question is 2 points maximum credit, 2 points minimum, with 2/(# correct letters) for each correct letter and 2/(# incorrect letters) for each incorrect letter. THIS MEANS YOU ARE PENALIZED FOR GUESSING WRONG! 1. Random error A. is always chance variation. B. leads to incomplete predictability in epidemiology studies only when there is sampling error. C. can be reduced by increasing the size of the study. D. is still considered present even if the entire population of interest is included in the study. E. is low when study power is high. F. None of the above. 2. Good data analysis will always entail the following A. Data editing to identify incomplete records and check for consistency and accuracy of entries. B. Data summarization for inference using smoothing, modeling and hypothesis testing techniques. C. An inferential stage aimed at concise description of observations. D. Use of special methods to handle missing values 1

2 3. Suppose the following table were obtained from a cohort study of the effect of binary treatment X on binary outcome Y, with a measured binary covariate Z that is not affected by either X or Y: Z = 1 Z = 0 Summary table (ignoring covariate Z) X = 1 X = 0 X = 1 X = 0 X = 1 X = 0 Outcome Y = Outcome Y = A. The crude odds ratio is an average of the stratum-specific odds ratios. B. The crude risk difference is an average of the stratum-specific risk differences. C. The crude risk difference is a valid estimate of the absolute effect of X on Y. D. If there are no other potential confounders in this study, the covariate Z is not a confounder. 4. Which of the following is/are true? A. Loss to follow-up can lead to selection bias in cohort studies. B. Selection bias can arise from conditioning on the common effect of the exposure and an uncontrolled independent risk factor for the outcome. C. Case-control studies are no more prone to selection bias than are cohort studies. D. Selection bias cannot arise from conditioning on a common effect of an outcome and an unobserved independent predictor for the exposure. 5. In assessing interactions in epidemiology the following is/are true A. The definition of interaction response types under the potential-outcomes model is not specific to the outcome of interest. B. Additivity implies absence of interaction types. C. Presence of different interaction types is not compatible with an observation of additivity of risks. D. Superadditivity refers to an interaction contrast equal to or greater than 0. 2

3 6. Suppose in the DAG below all the variables are dichotomous and all the arrows represent positive but not perfect associations. Which of the following must be true? C X (Y) R Y* A. The marginal X-Y* association will be attenuated relative to the marginal association of X with Y. B. Controlling for C will make the X-Y* association less biased for the causal effect of X on Y than if C were not controlled. C. If C is controlled, the X-Y* association will be attenuated relative to the causal effect of X on Y. D. C is a valid instrumental variable for the effect of Y on Y*. 7. Regarding categorical analysis methods: A. Methods that stratify on follow-up time are needed for person-count data from studies with substantial loss to follow-up. B. Sparse-data methods are advisable when both expected and observed numbers of cases for each exposure category in some analysis strata are less than four. C. Categorical methods are needed to avoid assumptions about the shape of the XY doseresponse D. The method of choosing category boundaries is unimportant as long as the number of categories is at least five. 3

4 8. Regarding stratified analysis of epidemiologic data: A. Controlling for more variables can lead to the data becoming sparse across strata, which may lead to association changes being misinterpreted as evidence of confounding. B. One should use percentile categories of potential confounder variables to avoid unequal sizes of the analysis strata. C. Bias produced by confounder selection using statistical testing can be reduced by raising the α level. D. In order to reduce distortions caused by using the data to select variables for adjustment, one should base selection on changes in the point estimate. 9. Bayesian statistical analysis: A. requires the use of specialized software, as most software packages are not equipped to incorporate priors. B. requires posterior sampling. C. requires data augmentation. D. should compare priors with likelihoods before combining them. E. requires complete prior specification for all unknown parameters. F. None of the above. 10. Regarding bias analysis: A. A formal bias analysis is an objective assessment of the degree of bias that could have occurred in a study. B. Bias analysis should be done in all epidemiologic study reports. C. Bias analysis is more important for large studies because random error tends to be smaller in large studies. D. A bias analysis that uses prior distributions rather than specific values for the bias parameters could be interpreted as a semi-bayesian bias analysis. 11. A study collected data on the use (X) of selective serotonin reuptake inhibitors (SSRI: X=1 if used, 0 if not) and a binary measure (Y) of subsequent depression improvement (Y=1 if depression improved, 0 otherwise) among a cohort of middle-aged men suffering from 4

5 depression. It also had a measure of income Z as a potential confounder. Suppose you fit this logistic regression model to the data: g[e(y X=x, Z=z)] = β Y + β YX X + β YZ Z + β YXZ XZ. Which of the following statements is/are true? A. The link function g[.] is the logit link. B. The model implies that the odds of depression improvement among men with income of Z=2 using an SSRI is exp(β Y +β YX +2β YZ +2β YXZ ). C. The model implies that the log odds of depression improvement among men with an income of Z=1 who were not using any SSRI is β YZ. D. The inverse of the link function for the model is the antilog or exponential function ( exp ). E. The model is saturated. F. None of the above. 5

6 12. A closed cohort of 15,000 retired teachers aged at least 65 years was followed for 6 years to study the effect of statins on the occurrence of stroke. At the inception of the cohort, no teacher had ever used statins or had a history of heart disease or stroke. At baseline and every two years the teachers were evaluated for diagnosis of coronary heart disease (CHD), use of statins, and diagnosis of stroke since last evaluation. Suppose the causal diagram for this study could be drawn as follows where the evaluation times 0, 1 and 2 represent years 2, 4, and 6 after baseline: CHD 0 CHD 1 CHD 2 Statins 0 Statins 1 Statins 2 Stroke Not censored 0 Not censored 1 Not censored 2 Which of the following statements is/are true in the analysis of the longitudinal data from this cohort? A. Selection bias from the loss to follow-up (censoring) could be accounted for in the analysis provided there are no unmeasured confounders of the censoring process. B. To estimate the cumulative effect of statins on stroke incidence without bias, the diagnosis of CHD at each time point must be adjusted for in a regression model. C. Propensity score matching can be used to estimate the cumulative effect of statins use on stroke incidence. D. Conventional regression analysis will yield biased results because post-baseline CHD is affected by statins and confounds the effect of subsequent statins use. E. Standardization techniques can be used for the analysis of time-varying statins use. F. None of the above. 6

7 13. For a case-control study with an unmeasured binary confounder U, you were given the following general equation for the filling in the unobserved U-stratified 2 x 2 tables for the association between a binary exposure X and a binary outcome Y: For a cell count with Y=y and X=x in the crude (marginal) X-Y table, the proportion that should go into the U=1 stratum is expit(β U + β UY y + β UX x + β UYX yx). Crude X-Y table X=1 X=0 Y=1 A 1+ A 0+ Y=0 B 1+ B 0+ U=1 U=0 X=1 X=0 X=1 X=0 Y=1 A 11 A 01 Y=1 A 10 A 00 Y=0 B 11 B 01 Y=0 B 10 B 00 Which of the statements is/are always true? A. The cell count A 10 is given by A 1+ expit(β UY + β UX + β UYX ). B. The expression expit(β U + β UY + β UX + β UYX ) is the log odds of U when Y=1 and X=1. C. exp(β UYX ) quantifies how much the UX odds ratio changes when moving from Y=0 to Y=1. D. β UYX = 0 implies we are assuming there is no biologic interaction. 14. Which of the following is/are true regarding analysis of selection bias? A. There tend to be plenty of relevant data about bias parameters in analysis of selection bias. B. Because the concepts of selection bias and confounding overlap, the same bias correction factor is used to address them. C. Sensitivity analysis of selection bias sometimes simplifies to consideration of one bias factor. D. There will be no selection bias if the probability of selection in cases and noncases at every exposure level is If X, Z are binary exposures, Y is a binary outcome and odds(z=1 X=x, Y=y) = exp(β 0 + β X X + β Y Y + β XY XY), which of the following is/are true? 7

8 A. β Y = 0 implies that there is no association between Z and Y among those with X=1. B. β X = 0 and β XY = 0 together imply that X and Z are not associated given Y. C. β X = 0 implies no statistical interaction between X, Y and Z on the odds-ratio scale. D. β XY = 0 imply no biologic interaction between X and Z in producing the outcome Y. 16. In a study of the association between height (X) measured in centimeters and developing hypertrophic cardiomyopathy or chronic enlargement of the heart (Y), A. the model ln[r(x=x)] = α Y + α YX x is a logistic risk model for developing hypertrophic cardiomyopathy when X=x. B. ln[r(x=0)] = α Y can be interpreted as the background risk of hypertrophic cardiomyopathy. C. ln[odds(x=x)] = α Y + α YX x is a log-linear odds model for developing hypertrophic cardiomyopathy when X=x. D. it is advisable to recenter X around its mean in the study population. E. it is advisable to rescale X by transforming it into a Z-score. F. logit[r(x=x)] ln[r(x=x)] when the risk of developing hypertrophic cardiomyopathy is very small when X=x. G. None of the above. 17. Regarding regression models for a target population, A. In the two logistic regression models Y = β Y + β YX X + β YZ Z and X = γx + γ XY Y + γ XZ Z relating the same three binary variables X, Y and Z, the parameters β YX and γ XY are equal. B. The rate model E(Y X=x) = exp(α + βx) can never give negative rates. C. Pr(Y=1 set[sex=male]) Pr(Y=1 set[sex=female]) can be estimated when all confounding, selection bias, and misclassification is eliminated. D. Model specification is a form of model fitting. E. The model E(Y X=x) = exp( β Y β YX x) differs from the model E(Y -1 X=x) = exp(β Y +β YX x). F. None of the above. 18. Monte-Carlo Sensitivity Analysis: A) Should correct biases in the reverse order that they occurred. B) Will give similar results to a semi-bayesian analysis when no identified parameter is given a prior. C) Can incorporate random error in the corrections. 8

9 D) Treats every possible value of the bias parameters as equally probable. E) None of the above. 19. For a binary (0,1) outcome Y with antecedent variables X and Z, the expression Pr(Y=1 X=x,Z=z) represents A) the probability that having X=x will cause Y=1 if Z=z B) the probability that having Z=z will cause Y=1 if X=x C) the probability of observing Y=1 if X=x and Z=z. D) the mean of Y when X=x and Z=z E) None of the above 20. Bootstrapping requires A) drawing smaller subsamples from your study sample to see how your estimate changes. B) a large sample size. C) taking percentiles of the resampling distribution of estimates as confidence limits. D) specification of a prior distribution. E) None of the above 21. If the true exposure X and its measurement X* are positively associated, nondifferential error with respect to the outcome Y A) Always results in bias towards the null B) Can be reasonably assumed if the exposure measure was recorded before the outcome occurred C) Can be reasonably assumed if the exposure assignment is X* and is randomized, and intention to treat analysis is performed D) Absent other biases, allows a valid test of the null hypothesis that X does not affect Y. E) None of the above 22. Regarding model selection strategies, which of the following is/are true? A) For a regressand together with a given set of regressor variables, there is a unique minimal model and a unique maximal model that are not conflicting with background information about relationships among the variables. B) An expanding search process starts with a model form that is highly flexible. 9

10 C) A limitation of a purely contracting search process is that it may encounter sparse-data problems. D) A combination of expanding and contracting processes, such as the stepwise automated selection algorithm, is the best strategy in model searching. E) None of the above 23. Which of the following is/are true about model checking? A) A good fitting model must be a correct or approximately correct model. B) Model diagnostics not only helps to detect discrepancies between the data and the model but also indicates whether or not the model holds beyond the range of observed data. C) The usefulness of model diagnostic statistics is not affected by sample size. D) Comparing regression-model-based results with corresponding basic categorical-analysis results is helpful in understanding the extent to which the model-based results possibly do not reflect the data. E) None of the above. 24. Which of the following is/are described by Gilovich as example(s) of the influence of people s expectation and prior beliefs on their evaluation of evidence? A) Parents expect a child who excels in school one year to do as well or better the following year. B) Clergymen doubted Galileo s claim that the earth was not the center of the solar system C) Football and hockey sport teams that wear black uniforms have been penalized more often than average. D) Scientists are more likely to run additional experiments if the results of an initial study appear to refute a favored hypothesis. E) None of the above. 10

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to

More information

Supplement 2. Use of Directed Acyclic Graphs (DAGs)

Supplement 2. Use of Directed Acyclic Graphs (DAGs) Supplement 2. Use of Directed Acyclic Graphs (DAGs) Abstract This supplement describes how counterfactual theory is used to define causal effects and the conditions in which observed data can be used to

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

PubH 7405: REGRESSION ANALYSIS. Propensity Score

PubH 7405: REGRESSION ANALYSIS. Propensity Score PubH 7405: REGRESSION ANALYSIS Propensity Score INTRODUCTION: There is a growing interest in using observational (or nonrandomized) studies to estimate the effects of treatments on outcomes. In observational

More information

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 DETAILED COURSE OUTLINE Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 Hal Morgenstern, Ph.D. Department of Epidemiology UCLA School of Public Health Page 1 I. THE NATURE OF EPIDEMIOLOGIC

More information

A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases

A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases Lawrence McCandless lmccandl@sfu.ca Faculty of Health Sciences, Simon Fraser University, Vancouver Canada Summer 2014

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Biases in clinical research Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Learning objectives Describe the threats to causal inferences in clinical studies Understand the role of

More information

Measurement Error in Nonlinear Models

Measurement Error in Nonlinear Models Measurement Error in Nonlinear Models R.J. CARROLL Professor of Statistics Texas A&M University, USA D. RUPPERT Professor of Operations Research and Industrial Engineering Cornell University, USA and L.A.

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

In this module I provide a few illustrations of options within lavaan for handling various situations.

In this module I provide a few illustrations of options within lavaan for handling various situations. In this module I provide a few illustrations of options within lavaan for handling various situations. An appropriate citation for this material is Yves Rosseel (2012). lavaan: An R Package for Structural

More information

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational data Improving health worldwide www.lshtm.ac.uk Outline Sensitivity analysis Causal Mediation

More information

Improving ecological inference using individual-level data

Improving ecological inference using individual-level data Improving ecological inference using individual-level data Christopher Jackson, Nicky Best and Sylvia Richardson Department of Epidemiology and Public Health, Imperial College School of Medicine, London,

More information

Chapter 13 Estimating the Modified Odds Ratio

Chapter 13 Estimating the Modified Odds Ratio Chapter 13 Estimating the Modified Odds Ratio Modified odds ratio vis-à-vis modified mean difference To a large extent, this chapter replicates the content of Chapter 10 (Estimating the modified mean difference),

More information

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of neighborhood deprivation and preterm birth. Key Points:

More information

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome Stephen Burgess July 10, 2013 Abstract Background: Sample size calculations are an

More information

cloglog link function to transform the (population) hazard probability into a continuous

cloglog link function to transform the (population) hazard probability into a continuous Supplementary material. Discrete time event history analysis Hazard model details. In our discrete time event history analysis, we used the asymmetric cloglog link function to transform the (population)

More information

Commentary SANDER GREENLAND, MS, DRPH

Commentary SANDER GREENLAND, MS, DRPH Commentary Modeling and Variable Selection in Epidemiologic Analysis SANDER GREENLAND, MS, DRPH Abstract: This paper provides an overview of problems in multivariate modeling of epidemiologic data, and

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover). STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical methods 2 Course code: EC2402 Examiner: Per Pettersson-Lidbom Number of credits: 7,5 credits Date of exam: Sunday 21 February 2010 Examination

More information

Bias and confounding. Mads Kamper-Jørgensen, associate professor, Section of Social Medicine

Bias and confounding. Mads Kamper-Jørgensen, associate professor, Section of Social Medicine Bias and confounding Mads Kamper-Jørgensen, associate professor, maka@sund.ku.dk PhD-course in Epidemiology l 7 February 2017 l Slide number 1 The world according to an epidemiologist Exposure Outcome

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH PROPENSITY SCORE Confounding Definition: A situation in which the effect or association between an exposure (a predictor or risk factor) and

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Measures of Association

Measures of Association Measures of Association Lakkana Thaikruea M.D., M.S., Ph.D. Community Medicine Department, Faculty of Medicine, Chiang Mai University, Thailand Introduction One of epidemiological studies goal is to determine

More information

School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019

School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019 School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019 Time: Tuesday, 1330 1630 Location: School of Population and Public Health, UBC Course description Students

More information

Confounding and Bias

Confounding and Bias 28 th International Conference on Pharmacoepidemiology and Therapeutic Risk Management Barcelona, Spain August 22, 2012 Confounding and Bias Tobias Gerhard, PhD Assistant Professor, Ernest Mario School

More information

Lecture Outline Biost 517 Applied Biostatistics I

Lecture Outline Biost 517 Applied Biostatistics I Lecture Outline Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 2: Statistical Classification of Scientific Questions Types of

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

investigate. educate. inform.

investigate. educate. inform. investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced

More information

Chapter 2 A Guide to Implementing Quantitative Bias Analysis

Chapter 2 A Guide to Implementing Quantitative Bias Analysis Chapter 2 A Guide to Implementing Quantitative Bias Analysis Introduction Estimates of association from nonrandomized epidemiologic studies are susceptible to two types of error: random error and systematic

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5 PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science Homework 5 Due: 21 Dec 2016 (late homeworks penalized 10% per day) See the course web site for submission details.

More information

8/10/2012. Education level and diabetes risk: The EPIC-InterAct study AIM. Background. Case-cohort design. Int J Epidemiol 2012 (in press)

8/10/2012. Education level and diabetes risk: The EPIC-InterAct study AIM. Background. Case-cohort design. Int J Epidemiol 2012 (in press) Education level and diabetes risk: The EPIC-InterAct study 50 authors from European countries Int J Epidemiol 2012 (in press) Background Type 2 diabetes mellitus (T2DM) is one of the most common chronic

More information

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

Introducing a SAS macro for doubly robust estimation

Introducing a SAS macro for doubly robust estimation Introducing a SAS macro for doubly robust estimation 1Michele Jonsson Funk, PhD, 1Daniel Westreich MSPH, 2Marie Davidian PhD, 3Chris Weisen PhD 1Department of Epidemiology and 3Odum Institute for Research

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

Propensity scores: what, why and why not?

Propensity scores: what, why and why not? Propensity scores: what, why and why not? Rhian Daniel, Cardiff University @statnav Joint workshop S3RI & Wessex Institute University of Southampton, 22nd March 2018 Rhian Daniel @statnav/propensity scores:

More information

Bayesian approaches to handling missing data: Practical Exercises

Bayesian approaches to handling missing data: Practical Exercises Bayesian approaches to handling missing data: Practical Exercises 1 Practical A Thanks to James Carpenter and Jonathan Bartlett who developed the exercise on which this practical is based (funded by ESRC).

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Time-varying confounding and marginal structural model

Time-varying confounding and marginal structural model Time-varying confounding and marginal structural model By David Chyou Overview An overview of time-varying confounding. Marginal structural model. Weighing regression by propensity score. Future prospective.

More information

A Bayesian Nonparametric Model Fit statistic of Item Response Models

A Bayesian Nonparametric Model Fit statistic of Item Response Models A Bayesian Nonparametric Model Fit statistic of Item Response Models Purpose As more and more states move to use the computer adaptive test for their assessments, item response theory (IRT) has been widely

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Comparison And Application Of Methods To Address Confounding By Indication In Non- Randomized Clinical Studies

Comparison And Application Of Methods To Address Confounding By Indication In Non- Randomized Clinical Studies University of Massachusetts Amherst ScholarWorks@UMass Amherst Masters Theses 1911 - February 2014 Dissertations and Theses 2013 Comparison And Application Of Methods To Address Confounding By Indication

More information

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_ Journal of Evaluation in Clinical Practice ISSN 1356-1294 Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_1361 180..185 Ariel

More information

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California Computer Age Statistical Inference Algorithms, Evidence, and Data Science BRADLEY EFRON Stanford University, California TREVOR HASTIE Stanford University, California ggf CAMBRIDGE UNIVERSITY PRESS Preface

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA PharmaSUG 2014 - Paper SP08 Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA ABSTRACT Randomized clinical trials serve as the

More information

Observational Study Designs. Review. Today. Measures of disease occurrence. Cohort Studies

Observational Study Designs. Review. Today. Measures of disease occurrence. Cohort Studies Observational Study Designs Denise Boudreau, PhD Center for Health Studies Group Health Cooperative Today Review cohort studies Case-control studies Design Identifying cases and controls Measuring exposure

More information

Analysis of TB prevalence surveys

Analysis of TB prevalence surveys Workshop and training course on TB prevalence surveys with a focus on field operations Analysis of TB prevalence surveys Day 8 Thursday, 4 August 2011 Phnom Penh Babis Sismanidis with acknowledgements

More information

Controlling Bias & Confounding

Controlling Bias & Confounding Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of

More information

Lecture 9 Internal Validity

Lecture 9 Internal Validity Lecture 9 Internal Validity Objectives Internal Validity Threats to Internal Validity Causality Bayesian Networks Internal validity The extent to which the hypothesized relationship between 2 or more variables

More information

Supplementary Appendix

Supplementary Appendix Supplementary Appendix This appendix has been provided by the authors to give readers additional information about their work. Supplement to: Rawshani Aidin, Rawshani Araz, Franzén S, et al. Risk factors,

More information

Chapter 17 Sensitivity Analysis and Model Validation

Chapter 17 Sensitivity Analysis and Model Validation Chapter 17 Sensitivity Analysis and Model Validation Justin D. Salciccioli, Yves Crutain, Matthieu Komorowski and Dominic C. Marshall Learning Objectives Appreciate that all models possess inherent limitations

More information

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision ISPUB.COM The Internet Journal of Epidemiology Volume 7 Number 2 Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision Z Wang Abstract There is an increasing

More information

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................

More information

Bayesian methods for combining multiple Individual and Aggregate data Sources in observational studies

Bayesian methods for combining multiple Individual and Aggregate data Sources in observational studies Bayesian methods for combining multiple Individual and Aggregate data Sources in observational studies Sara Geneletti Department of Epidemiology and Public Health Imperial College, London s.geneletti@imperial.ac.uk

More information

Estimating indirect and direct effects of a Cancer of Unknown Primary (CUP) diagnosis on survival for a 6 month-period after diagnosis.

Estimating indirect and direct effects of a Cancer of Unknown Primary (CUP) diagnosis on survival for a 6 month-period after diagnosis. Estimating indirect and direct effects of a Cancer of Unknown Primary (CUP) diagnosis on survival for a 6 month-period after diagnosis. A Manuscript prepared in Fulfillment of a B.S Honors Thesis in Statistics

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Recent Advances in Methods for Quantiles. Matteo Bottai, Sc.D.

Recent Advances in Methods for Quantiles. Matteo Bottai, Sc.D. Recent Advances in Methods for Quantiles Matteo Bottai, Sc.D. Many Thanks to Advisees Andrew Ortaglia Huiling Zhen Joe Holbrook Junlong Wu Li Zhou Marco Geraci Nicola Orsini Paolo Frumento Yuan Liu Collaborators

More information

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013 Evidence-Based Medicine Journal Club A Primer in Statistics, Study Design, and Epidemiology August, 2013 Rationale for EBM Conscientious, explicit, and judicious use Beyond clinical experience and physiologic

More information

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC Selected Topics in Biostatistics Seminar Series Missing Data Sponsored by: Center For Clinical Investigation and Cleveland CTSC Brian Schmotzer, MS Biostatistician, CCI Statistical Sciences Core brian.schmotzer@case.edu

More information

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates JOINT EU/OECD WORKSHOP ON RECENT DEVELOPMENTS IN BUSINESS AND CONSUMER SURVEYS Methodological session II: Task Force & UN Handbook on conduct of surveys response rates, weighting and accuracy UN Handbook

More information

AP STATISTICS 2013 SCORING GUIDELINES

AP STATISTICS 2013 SCORING GUIDELINES AP STATISTICS 2013 SCORING GUIDELINES Question 5 Intent of Question The primary goals of this question were to assess a student s ability to (1) recognize the limited conclusions that can be drawn from

More information

Blood Pressure and Complications in Individuals with Type 2 Diabetes and No Previous Cardiovascular Disease. ID BMJ

Blood Pressure and Complications in Individuals with Type 2 Diabetes and No Previous Cardiovascular Disease. ID BMJ 1 Blood Pressure and Complications in Individuals with Type 2 Diabetes and No Previous Cardiovascular Disease. ID BMJ 2016.033440 Dear Editor, Editorial Committee and Reviewers Thank you for your appreciation

More information

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank) Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank) Attribution The extent to which the observed change in outcome is the result of the intervention, having allowed

More information

Role of respondents education as a mediator and moderator in the association between childhood socio-economic status and later health and wellbeing

Role of respondents education as a mediator and moderator in the association between childhood socio-economic status and later health and wellbeing Sheikh et al. BMC Public Health 2014, 14:1172 RESEARCH ARTICLE Open Access Role of respondents education as a mediator and moderator in the association between childhood socio-economic status and later

More information

Estimating Direct Effects of New HIV Prevention Methods. Focus: the MIRA Trial

Estimating Direct Effects of New HIV Prevention Methods. Focus: the MIRA Trial Estimating Direct Effects of New HIV Prevention Methods. Focus: the MIRA Trial Michael Rosenblum UCSF: Helen Cheng, Nancy Padian, Steve Shiboski, Ariane van der Straten Berkeley: Nicholas P. Jewell, Mark

More information

Challenges of Observational and Retrospective Studies

Challenges of Observational and Retrospective Studies Challenges of Observational and Retrospective Studies Kyoungmi Kim, Ph.D. March 8, 2017 This seminar is jointly supported by the following NIH-funded centers: Background There are several methods in which

More information

Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods

Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods John W Stevens Reader in Decision Science University of Sheffield EFPSI European Statistical Meeting

More information

Using principal stratification to address post-randomization events: A case study. Baldur Magnusson, Advanced Exploratory Analytics PSI Webinar

Using principal stratification to address post-randomization events: A case study. Baldur Magnusson, Advanced Exploratory Analytics PSI Webinar Using principal stratification to address post-randomization events: A case study Baldur Magnusson, Advanced Exploratory Analytics PSI Webinar November 2, 2017 Outline Context Principal stratification

More information

Design of Experiments & Introduction to Research

Design of Experiments & Introduction to Research Design of Experiments & Introduction to Research 1 Design of Experiments Introduction to Research Definition and Purpose Scientific Method Research Project Paradigm Structure of a Research Project Types

More information

Missing Data and Imputation

Missing Data and Imputation Missing Data and Imputation Barnali Das NAACCR Webinar May 2016 Outline Basic concepts Missing data mechanisms Methods used to handle missing data 1 What are missing data? General term: data we intended

More information

The following are questions that students had difficulty with on the first three exams.

The following are questions that students had difficulty with on the first three exams. The following are questions that students had difficulty with on the first three exams. Exam 1 1. A measure has construct validity if it: a) really measures what it is supposed to measure b) appears, on

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias.

Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias. Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias. In the second module, we will focus on selection bias and in

More information

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan Sequential nonparametric regression multiple imputations Irina Bondarenko and Trivellore Raghunathan Department of Biostatistics, University of Michigan Ann Arbor, MI 48105 Abstract Multiple imputation,

More information

Advanced IPD meta-analysis methods for observational studies

Advanced IPD meta-analysis methods for observational studies Advanced IPD meta-analysis methods for observational studies Simon Thompson University of Cambridge, UK Part 4 IBC Victoria, July 2016 1 Outline of talk Usual measures of association (e.g. hazard ratios)

More information

Statistical Hocus Pocus? Assessing the Accuracy of a Diagnostic Screening Test When You Don t Even Know Who Has the Disease

Statistical Hocus Pocus? Assessing the Accuracy of a Diagnostic Screening Test When You Don t Even Know Who Has the Disease Statistical Hocus Pocus? Assessing the Accuracy of a Diagnostic Screening Test When You Don t Even Know Who Has the Disease Michelle Norris Dept. of Mathematics and Statistics California State University,

More information

Module Overview. What is a Marker? Part 1 Overview

Module Overview. What is a Marker? Part 1 Overview SISCR Module 7 Part I: Introduction Basic Concepts for Binary Classification Tools and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics. Dae-Jin Lee dlee@bcamath.org Basque Center for Applied Mathematics http://idaejin.github.io/bcam-courses/ D.-J. Lee (BCAM) Intro to GLM s with R GitHub: idaejin 1/40 Modeling count data Introduction Response

More information

Lesson: A Ten Minute Course in Epidemiology

Lesson: A Ten Minute Course in Epidemiology Lesson: A Ten Minute Course in Epidemiology This lesson investigates whether childhood circumcision reduces the risk of acquiring genital herpes in men. 1. To open the data we click on File>Example Data

More information

Assessing the impact of unmeasured confounding: confounding functions for causal inference

Assessing the impact of unmeasured confounding: confounding functions for causal inference Assessing the impact of unmeasured confounding: confounding functions for causal inference Jessica Kasza jessica.kasza@monash.edu Department of Epidemiology and Preventive Medicine, Monash University Victorian

More information

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012 STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis Advanced Studies in Medical Sciences, Vol. 1, 2013, no. 3, 143-156 HIKARI Ltd, www.m-hikari.com Detection of Unknown Confounders by Bayesian Confirmatory Factor Analysis Emil Kupek Department of Public

More information

Estimating Heterogeneous Choice Models with Stata

Estimating Heterogeneous Choice Models with Stata Estimating Heterogeneous Choice Models with Stata Richard Williams Notre Dame Sociology rwilliam@nd.edu West Coast Stata Users Group Meetings October 25, 2007 Overview When a binary or ordinal regression

More information

Measuring cancer survival in populations: relative survival vs cancer-specific survival

Measuring cancer survival in populations: relative survival vs cancer-specific survival Int. J. Epidemiol. Advance Access published February 8, 2010 Published by Oxford University Press on behalf of the International Epidemiological Association ß The Author 2010; all rights reserved. International

More information

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Arun Advani and Tymon Sªoczy«ski 13 November 2013 Background When interested in small-sample properties of estimators,

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Bias. A systematic error (caused by the investigator or the subjects) that causes an incorrect (overor under-) estimate of an association.

Bias. A systematic error (caused by the investigator or the subjects) that causes an incorrect (overor under-) estimate of an association. Bias A systematic error (caused by the investigator or the subjects) that causes an incorrect (overor under-) estimate of an association. Here, random error is small, but systematic errors have led to

More information