Friday, September 9, :00-11:00 am Warwick Evans Conference Room, Building D Refreshments will be provided at 9:45am

Size: px
Start display at page:

Download "Friday, September 9, :00-11:00 am Warwick Evans Conference Room, Building D Refreshments will be provided at 9:45am"

Transcription

1 The Role of the Biostatistician in Cancer Research Edmund A. Gehan, PhD Professor Emeritus, Department of Biostatistics, Bioinformatics and Biomathematics Lombardi Comprehensive Cancer Center Georgetown University Medical Center in Washington, DC This paper considers triumphs and challenges for biostatisticians working in oncology at the beginning of the 21st century. The impact of three major articles in biostatistics in the 20 th century is considered. Cornfield s 1951 paper on estimating comparative rates from clinical data; Mantel and Haenszel s 1959 paper on obtaining summary measures of relative risk, adjusting for stratification factors in epidemiological studies; and D.R.Cox s 1972 paper, which developed the proportional hazards model for evaluating the effect of covariates on survival time outcomes. Biostatistical challenges for the 21 st century are considered for the areas of clinical trials, survival analysis, and statistical genetics. Friday, September 9, :00-11:00 am

2 Statistical Methods for Surrogate Marker Validation from Randomized Clinical Trials Abel T. Eshete, PhD Research Associate Harvard School of Public Health Harvard University Surrogate endpoints come into play in a number of contexts in place of the endpoint of interest, referred commonly to as the true or main endpoint. The use of surrogate endpoints is potentially beneficial, when these endpoints can be measured earlier, leading to a rapid approval of experimental drugs, or can be administered conveniently, which can be equated to less burden on the side of both the experimenter and the patients. In recent times, the evaluation exercise is framed within a meta-analytic setting, thereby overcoming difficulties that necessarily surround evaluation efforts based on a single trial. This approach however is computationally intensive and also requires different hierarchical models for different types of outcomes. To circumvent this problem, an information-theoretic approach which is simple to apply and can be used for a variety of outcome combinations has been introduced. A particular but important instance is where the true endpoint is the ultimate assessment in a sequence of repeated measures. It is then appealing to consider earlier measures, either in isolation or several combined, as a potential surrogate endpoint. The length and cost reducing potential has to be weighed carefully against loss in precision and the risks of an inappropriate decision regarding a new compound s fate. In this talk, we will discuss statistical methods used to validate surrogate endpoints of various types, within the meta-analytic framework. Friday, September 23, :00-11:00 am

3 Flexible Random Effects Copula Models for Clustered Mixed Bivariate Outcomes: Application in Developmental Toxicology Alexander de Leon, PhD Associate Professor Department of Mathematics & Statistics University of Calgary The talk concerns the analysis of clustered data with mixed bivariate responses, i.e., where each member of the cluster has a discrete and a continuous outcome. A copulabased random effects model is proposed that accounts for associations between discrete and/or continuous outcomes within and between clusters, including the intrinsic association between the mixed outcomes for the same subject. The approach yields regression parameters in models for both outcomes that are marginally meaningful; in addition, by assuming a latent variable framework to describe discrete outcomes, complications that arise from direct applications of copulas to discrete variables are avoided. Maximum likelihood estimation of the model parameters is implemented using readily available software (e.g., PROC NLMIXED in SAS), and results of simulations concerning the bias and efficiency of the estimates are reported. The proposed methodology is motivated by and illustrated using a developmental toxicity study of ethylene glycol (EG) in mice. Friday, October 7, :15-11:15 am Martin Marietta Conference Room Lombardi Comprehensive Cancer Center Refreshments will be provided at 10:00 am

4 Potential Impact of Population Structure on Population-Based Case-Control Association Studies Zhaohai Li, PhD Professor of Statistics and Biostatistics Department of Statistics The George Washington University Case-control association studies using unrelated cases and controls may suffer from potential confounding due to population stratification. Bias and variance distortion caused by population stratification in the commonly used allele-based tests can considerably inflate the Type I error rate. It is shown that the bias vanishes in the absence of disease rate heterogeneity. If only population stratification exists, a proper estimate of the variance of the allele based test statistic is developed. Using this estimated variance yields a valid Type I error. However, when the frequencies of the allele under study and the disease rates differ among the sub-populations, it is difficult to correct for this bias. Explicit expressions for the excess false positive rate (EFPR) of the test due to bias and variance distortion are derived. It turns out that the bias created when both population stratification and disease rate heterogeneity are present usually has a greater effect on the EFPR than variance distortion. Comprehensive simulation studies strongly support these results. Friday, October 14, :00-11:00 am

5 Likelihood-Based Methods for Regression Analysis with Binary Exposure Status Assessed by Pooling Robert H. Lyles, PhD Associate Professor Department of Biostatistics and Bioinformatics Rollins School of Public Health, Emory University The need for resource-intensive laboratory assays to assess exposures in many epidemiologic studies provides ample motivation to consider study designs that incorporate pooled samples. In this talk, we consider the case in which specimens are combined for the purpose of determining the presence or absence of a pool-wise exposure, in lieu of assessing the actual binary exposure status for each member of the pool. We presume a primary logistic regression model for an observed binary outcome, together with a secondary regression model for exposure. We facilitate maximum likelihood analysis by complete enumeration of the possible implications of a positive pool, and we discuss the applicability of this approach under both cross-sectional and case-control sampling. We also provide a maximum likelihood approach for longitudinal or repeated measures studies where the binary outcome and exposure are assessed on multiple occasions and within-subject pooling is conducted for exposure assessment. Simulation studies illustrate the performance of the proposed approaches along with their computational feasibility using widely available software. We apply the methods to investigate gene-disease association in a population-based case-control study of colorectal cancer. Friday, October 28, :00-11:00 am

6 Statistical Analysis of Copy Number Variations in Genome-Wide Association Studies Jianxin Shi, Ph.D. Biostatistics Branch Division of Cancer Epidemiology and Genetics National Cancer Institute Copy number variations (CNVs) are one of the major genetic resources in human populations. CNVs were reported to be associated with the risks of many complex diseases in both case-control and family-based genome-wide association studies (GWAS) based on single nucleotide polymorphism (SNP) genotyping arrays. In this talk, I will first review the statistical and computational challenges in CNV discovery and CNV association testing in GWAS, which are two fundamentally different problems. Both successful and controversial stories will be discussed. I will then present our methods (implemented in SegCNV) for CNV discovery and CNV association testing in case-control GWAS. Our methods are computationally efficient and statistical powerful for detecting both overlapping and non-overlapping CNVs in GWAS. Finally, I will present a novel method (cnvfam) for simultaneously inferring CNVs in nuclear families of any sizes. We demonstrated that cnvfam substantially improved the sensitivity of detecting shared CNVs as well as the boundary inference, compared to methods ignoring family information. In addition, by simultaneously modeling all family members, cnvfam made a biologically more plausible inference of de novo CNVs v.s. inherited CNVs. Application of the methods to cancer and autism GWAS will be discussed. Friday, November 11, :00-11:00 am Refreshments will be provided at 9:45 am

7 High breakdown generalized S-estimators for unbalanced linear mixed effects models Speaker: Inna Cherovneva, PhD Associate Professor Division of Biostatistics Thomas Jefferson University Linear mixed effects (LME) models provide the most important statistical tool for analysis of a wide range of clustered and correlated continuous measurements data in biomedical research. Real biomedical data are often contaminated with outliers, and it is well known that in the presence of outliers, traditional maximum likelihood based estimation methods may yield severely biased results misrepresenting characteristics of the majority of observations. High breakdown and re-descending M-estimators are currently the robust methods of choice for multivariate linear regression, but such estimators so far have been developed only for the limited class of completely balanced LME models. For a general unbalanced linear mixed effects model, we propose a class of re-descending M-estimators that reduce to the high breakdown S-estimators in a fully balanced linear mixed effects model. Therefore, they are referred to as generalized S-estimators. The asymptotic properties, influence function, and breakdown of the proposed estimators are investigated. A small simulation study is conducted to compare performance of the generalized S-estimates and M-estimates for unbalanced LME models. Finally, the proposed generalized S-estimators are used for robust analysis of age-related changes in hemoglobin levels of sickle cell disease patients. Friday, January 13, :00-11:00 am

8 Clinical Trials for Predictive Medicine New Challenges and Paradigms Speaker: Richard Simon, D. Sc. Biometric Research Branch, National Cancer Institute Development of treatments with companion diagnostics requires major changes in the standard paradigms for the design and analysis of clinical trials. Some of the key assumptions upon which current methods are based are no longer valid. The standard clinical trial paradigm of employing broad eligibility criteria and focusing design and analysis on testing the null hypothesis of no overall average effect relegating no longer has an adequate scientific basis in many oncology settings. It has lead to large clinical trials that identified small average treatment effects and resulted in approval of drugs that do not benefit most patients to whom they are administered. This problem has become exacerbated with the development of expensive molecularly targeted therapeutics, which is the vast majority of drugs being currently developed. The established molecular heterogeneity of human cancer and the availability of genomic technologies for characterizing the molecular basis of individual tumors requires the development of new paradigms for the design and analysis of randomized clinical trials as a reliable basis for predictive personalized oncology. I will review new designs for phase III clinical trials of new therapeutics and candidate companion diagnostics (predictive biomarkers). I will also outline a prediction based approach to the analysis of randomized clinical trials that both preserves the type I error and provides a reliable internally validated basis for predicting which patients are most likely or unlikely to benefit from the new treatment. This is a very structured approach whose use requires careful prospective planning. It may serve as a basis for a new generation of predictive clinical trials which provide the kinds of reliable individualized information which physicians and patients have long sought, but which have not been available from the past use of post-hoc subset analysis. Developing new treatments with predictive biomarkers for identifying the patients who are most likely or least likely to benefit makes drug development more complex. But for many new oncology drugs it is the only science based approach and should increase the chance of success. It may also lead to more consistency in results among trials and has obvious benefits for reducing the number of patients who ultimately receive expensive drugs which expose them risks of adverse events but no benefit. Friday, January 27, :00-11:00 am

9 Designing an Adaptive Multi-Arm Time-to-Event Clinical Trial Using Weibull Models Speaker: Alex Sverdlov, PhD Senior Research Biostatician Bristol-Myers Squibb We consider a design problem for a clinical trial with multiple treatment arms and time-toevent primary outcomes that are modeled using the Weibull family of distributions. We obtain the locally D-optimal design, along with compound optimal designs that deliver targeted efficiencies for different estimation problems and ethical considerations as well. The proposed designs are studied theoretically and are implemented using the doubly adaptive biased coin (DBCD) randomization procedure of Hu & Zhang (2004) for a clinical trial with censored Weibull outcomes. We compare the merits of our multiple-objective adaptive designs with popular and competing designs and show that our designs are more flexible, realistic, generally more ethical and frequently provide higher efficiencies for estimating different sets of parameters. Some Keywords: Clinical trial; D-optimal design; dose response study; doubly adaptive biased coin; ethical concern; randomization design. Friday, February 10, :00-11:00 am

10 Mountains out of molehills? The role of experiment design, quality assurance, and machine learning in effective clinical metabolomics biomarker discovery Speaker: David Broadhurst, Ph.D. University of Alberta Assistant Professor of Biostatistics Division of General Internal Medicine Department of Medicine Many clinical metabolomics, and other high-content or high-throughput, experiments are set up such that the primary aim is the discovery of biomarker metabolites that can discriminate, with a certain level of certainty, between nominally matched case and control samples. However, it is unfortunately very easy to find markers that are apparently persuasive but that are in fact entirely spurious. The main types of danger are not entirely independent of each other, but include: bias in patient selection and biobanking; poor choice in clinical endpoint; inadequate sample size and excessive false discovery rate due to multiple hypothesis testing; inappropriate choice of machine learning methods; poor model validation, and poor lab based quality assurance protocols. Many studies fail to take these issues into account, and thereby fail to discover anything of true significance (despite their claims). Here I summarise these problems, and provide pointers to assist in the improved design and evaluation of metabolomics experiments, thereby allowing robust scientific conclusions to be drawn from the available data. Friday, February 24, :00-11:00 am

11 Differential principal component analysis of ChIP-seq Speaker: Hongkai Ji, PhD Assistant Professor, Department of Biostatistics Bloomberg School of Public Health, Johns Hopkins University We propose Differential Principal Component Analysis (dpca) for analyzing multiple ChIP-seq datasets to identify differential protein-dna interactions between two biological conditions. dpca integrates unsupervised pattern discovery, dimension reduction, and statistical inference into a single statistical framework. It uses a small number of principal components to concisely summarize the major multi-protein differential patterns between the two conditions. For each pattern, it detects and prioritizes differential genomic loci by comparing the between-condition differences with the within-condition variation among replicate samples. dpca provides a new tool for efficiently analyzing large amounts of ChIP-seq data to study dynamic changes of gene regulation across different biological conditions. We demonstrate this approach through analyses of differential histone modifications at transcription factor binding sites and promoters. Friday, March 23, :00-11:00 am

12 Road to Bias Paved by Sensible Intuition: How We Sometimes Assess Exposures in Epidemiology Speaker: Igor Burstyn, Ph.D. Associate Professor and Director of Research Department of Environmental and Occupational Health School of Public Health, Drexel University The aim of the presentation is to raise awareness of unanticipated statistical properties of procedures employed in by epidemiologists in constructing exposure variables. These ad hoc procedures are typically based on sensible intuition and arguments by analogy, but in absence of theoretical exploration of their properties often lead to violations of assumptions in standard exposure-disease analyses. The net result is a biased body of literature that most likely distorts what can and should be learned from epidemiologic data. Two ad hoc procedures with such properties will be illustrated. The first is job-exposure matrix that aggregates exposure level and prevalence within a job into a single metric. I will demonstrate theoretically that this commonly adopted method leads to systematic and differential measurement error even under idealized conditions of no uncertainty about level and prevalence of exposure. Impact of this bias in a particular study of cancer risk will be illustrated via simulation. The second procedure is the habit of dichotomizing mismeasured continuous covariates for the sake of convenience and parsimony. This is known to create differential exposure misclassification and produces publication bias in situ (i.e. selective reporting of most significant findings). Newly proposed alternative procedure for presenting all dichotomizations will be presented (PMID: ) and the extent of bias from differential error illustrated. I conclude that there will always a need bold intuition to make advances in knowledge but urge all such insights must be examined critically and when they concern statistical properties with full engagement of statisticians who can explore the limits and pitfalls of methods that appear to be intuitively sensible. Friday, April 13, :00-11:00 am

Chapter 1. Introduction

Chapter 1. Introduction Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Use of Archived Tissues in the Development and Validation of Prognostic & Predictive Biomarkers

Use of Archived Tissues in the Development and Validation of Prognostic & Predictive Biomarkers Use of Archived Tissues in the Development and Validation of Prognostic & Predictive Biomarkers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute http://brb.nci.nih.gov Different

More information

GUIDELINE COMPARATORS & COMPARISONS:

GUIDELINE COMPARATORS & COMPARISONS: GUIDELINE COMPARATORS & COMPARISONS: Direct and indirect comparisons Adapted version (2015) based on COMPARATORS & COMPARISONS: Direct and indirect comparisons - February 2013 The primary objective of

More information

16:35 17:20 Alexander Luedtke (Fred Hutchinson Cancer Research Center)

16:35 17:20 Alexander Luedtke (Fred Hutchinson Cancer Research Center) Conference on Causal Inference in Longitudinal Studies September 21-23, 2017 Columbia University Thursday, September 21, 2017: tutorial 14:15 15:00 Miguel Hernan (Harvard University) 15:00 15:45 Miguel

More information

Live WebEx meeting agenda

Live WebEx meeting agenda 10:00am 10:30am Using OpenMeta[Analyst] to extract quantitative data from published literature Live WebEx meeting agenda August 25, 10:00am-12:00pm ET 10:30am 11:20am Lecture (this will be recorded) 11:20am

More information

Two-stage Methods to Implement and Analyze the Biomarker-guided Clinical Trail Designs in the Presence of Biomarker Misclassification

Two-stage Methods to Implement and Analyze the Biomarker-guided Clinical Trail Designs in the Presence of Biomarker Misclassification RESEARCH HIGHLIGHT Two-stage Methods to Implement and Analyze the Biomarker-guided Clinical Trail Designs in the Presence of Biomarker Misclassification Yong Zang 1, Beibei Guo 2 1 Department of Mathematical

More information

Experimental Design for Immunologists

Experimental Design for Immunologists Experimental Design for Immunologists Hulin Wu, Ph.D., Dean s Professor Department of Biostatistics & Computational Biology Co-Director: Center for Biodefense Immune Modeling School of Medicine and Dentistry

More information

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection Author's response to reviews Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection Authors: Jestinah M Mahachie John

More information

On Missing Data and Genotyping Errors in Association Studies

On Missing Data and Genotyping Errors in Association Studies On Missing Data and Genotyping Errors in Association Studies Department of Biostatistics Johns Hopkins Bloomberg School of Public Health May 16, 2008 Specific Aims of our R01 1 Develop and evaluate new

More information

Biostatistics for Med Students. Lecture 1

Biostatistics for Med Students. Lecture 1 Biostatistics for Med Students Lecture 1 John J. Chen, Ph.D. Professor & Director of Biostatistics Core UH JABSOM JABSOM MD7 February 14, 2018 Lecture note: http://biostat.jabsom.hawaii.edu/education/training.html

More information

New Enhancements: GWAS Workflows with SVS

New Enhancements: GWAS Workflows with SVS New Enhancements: GWAS Workflows with SVS August 9 th, 2017 Gabe Rudy VP Product & Engineering 20 most promising Biotech Technology Providers Top 10 Analytics Solution Providers Hype Cycle for Life sciences

More information

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 DETAILED COURSE OUTLINE Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 Hal Morgenstern, Ph.D. Department of Epidemiology UCLA School of Public Health Page 1 I. THE NATURE OF EPIDEMIOLOGIC

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

CS2220 Introduction to Computational Biology

CS2220 Introduction to Computational Biology CS2220 Introduction to Computational Biology WEEK 8: GENOME-WIDE ASSOCIATION STUDIES (GWAS) 1 Dr. Mengling FENG Institute for Infocomm Research Massachusetts Institute of Technology mfeng@mit.edu PLANS

More information

Discussion Meeting for MCP-Mod Qualification Opinion Request. Novartis 10 July 2013 EMA, London, UK

Discussion Meeting for MCP-Mod Qualification Opinion Request. Novartis 10 July 2013 EMA, London, UK Discussion Meeting for MCP-Mod Qualification Opinion Request Novartis 10 July 2013 EMA, London, UK Attendees Face to face: Dr. Frank Bretz Global Statistical Methodology Head, Novartis Dr. Björn Bornkamp

More information

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies Individual Participant Data (IPD) Meta-analysis of prediction modelling studies Thomas Debray, PhD Julius Center for Health Sciences and Primary Care Utrecht, The Netherlands March 7, 2016 Prediction

More information

Indirect Treatment Comparison Without Network Meta-Analysis: Overview of Novel Techniques

Indirect Treatment Comparison Without Network Meta-Analysis: Overview of Novel Techniques Indirect Treatment Comparison Without Network Meta-Analysis: Overview of Novel Techniques K. Jack Ishak, PhD, Executive Director, Statistics and Senior Research Scientist BACKGROUND There is generally

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Certificate Courses in Biostatistics

Certificate Courses in Biostatistics Certificate Courses in Biostatistics Term I : September December 2015 Term II : Term III : January March 2016 April June 2016 Course Code Module Unit Term BIOS5001 Introduction to Biostatistics 3 I BIOS5005

More information

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to

More information

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC Selected Topics in Biostatistics Seminar Series Missing Data Sponsored by: Center For Clinical Investigation and Cleveland CTSC Brian Schmotzer, MS Biostatistician, CCI Statistical Sciences Core brian.schmotzer@case.edu

More information

CHAPTER 6. Conclusions and Perspectives

CHAPTER 6. Conclusions and Perspectives CHAPTER 6 Conclusions and Perspectives In Chapter 2 of this thesis, similarities and differences among members of (mainly MZ) twin families in their blood plasma lipidomics profiles were investigated.

More information

Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods

Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods Comparing treatments evaluated in studies forming disconnected networks of evidence: A review of methods John W Stevens Reader in Decision Science University of Sheffield EFPSI European Statistical Meeting

More information

Epidemiologic Methods and Counting Infections: The Basics of Surveillance

Epidemiologic Methods and Counting Infections: The Basics of Surveillance Epidemiologic Methods and Counting Infections: The Basics of Surveillance Ebbing Lautenbach, MD, MPH, MSCE University of Pennsylvania School of Medicine Nothing to disclose PENN Outline Definitions / Historical

More information

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis EFSA/EBTC Colloquium, 25 October 2017 Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis Julian Higgins University of Bristol 1 Introduction to concepts Standard

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA BIOSTATISTICAL METHODS AND RESEARCH DESIGNS Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA Keywords: Case-control study, Cohort study, Cross-Sectional Study, Generalized

More information

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_ Journal of Evaluation in Clinical Practice ISSN 1356-1294 Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_1361 180..185 Ariel

More information

Evaluation Models STUDIES OF DIAGNOSTIC EFFICIENCY

Evaluation Models STUDIES OF DIAGNOSTIC EFFICIENCY 2. Evaluation Model 2 Evaluation Models To understand the strengths and weaknesses of evaluation, one must keep in mind its fundamental purpose: to inform those who make decisions. The inferences drawn

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Identification of Tissue Independent Cancer Driver Genes

Identification of Tissue Independent Cancer Driver Genes Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important

More information

Module 14: Missing Data Concepts

Module 14: Missing Data Concepts Module 14: Missing Data Concepts Jonathan Bartlett & James Carpenter London School of Hygiene & Tropical Medicine Supported by ESRC grant RES 189-25-0103 and MRC grant G0900724 Pre-requisites Module 3

More information

Analysis of TB prevalence surveys

Analysis of TB prevalence surveys Workshop and training course on TB prevalence surveys with a focus on field operations Analysis of TB prevalence surveys Day 8 Thursday, 4 August 2011 Phnom Penh Babis Sismanidis with acknowledgements

More information

Data harmonization tutorial:teaser for FH2019

Data harmonization tutorial:teaser for FH2019 Data harmonization tutorial:teaser for FH2019 Alden Gross, Johns Hopkins Rich Jones, Brown University Friday Harbor Tahoe 22 Aug. 2018 1 / 50 Outline Outline What is harmonization? Approach Prestatistical

More information

Bayesian Joint Modelling of Benefit and Risk in Drug Development

Bayesian Joint Modelling of Benefit and Risk in Drug Development Bayesian Joint Modelling of Benefit and Risk in Drug Development EFSPI/PSDM Safety Statistics Meeting Leiden 2017 Disclosure is an employee and shareholder of GSK Data presented is based on human research

More information

Downloaded from:

Downloaded from: Hemingway, H; Croft, P; Perel, P; Hayden, JA; Abrams, K; Timmis, A; Briggs, A; Udumyan, R; Moons, KG; Steyerberg, EW; Roberts, I; Schroter, S; Altman, DG; Riley, RD; PROGRESS Group (2013) Prognosis research

More information

Your submission states that the adjusted indirect comparison methodology was based largely on the publication by Glenny et al

Your submission states that the adjusted indirect comparison methodology was based largely on the publication by Glenny et al Clarification of the indirect comparison Your submission states that the adjusted indirect comparison methodology was based largely on the publication by Glenny et al Your methodology is as follows: An

More information

Incorporating the direct assignment option into broader design frameworks

Incorporating the direct assignment option into broader design frameworks Incorporating the direct assignment option into broader design frameworks Ming-Wen An Sumithra J. Mandrekar, Daniel J. Sargent Lucy Lu Martin J. Edelman Vassar College, Poughkeepsie NY Mayo Clinic, Rochester

More information

RANDOMIZATION. Outline of Talk

RANDOMIZATION. Outline of Talk RANDOMIZATION Basic Ideas and Insights Marvin Zelen Harvard University Graybill Conference Ft.Collins, Colorado June 11, 2008 Outline of Talk Introduction Randomized vs. Observational Studies Conditioning

More information

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials EFSPI Comments Page General Priority (H/M/L) Comment The concept to develop

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

STATISTICAL CONCLUSION VALIDITY

STATISTICAL CONCLUSION VALIDITY Validity 1 The attached checklist can help when one is evaluating the threats to validity of a study. VALIDITY CHECKLIST Recall that these types are only illustrative. There are many more. INTERNAL VALIDITY

More information

Introduction to the Genetics of Complex Disease

Introduction to the Genetics of Complex Disease Introduction to the Genetics of Complex Disease Jeremiah M. Scharf, MD, PhD Departments of Neurology, Psychiatry and Center for Human Genetic Research Massachusetts General Hospital Breakthroughs in Genome

More information

Cochrane Pregnancy and Childbirth Group Methodological Guidelines

Cochrane Pregnancy and Childbirth Group Methodological Guidelines Cochrane Pregnancy and Childbirth Group Methodological Guidelines [Prepared by Simon Gates: July 2009, updated July 2012] These guidelines are intended to aid quality and consistency across the reviews

More information

Design of Experiments & Introduction to Research

Design of Experiments & Introduction to Research Design of Experiments & Introduction to Research 1 Design of Experiments Introduction to Research Definition and Purpose Scientific Method Research Project Paradigm Structure of a Research Project Types

More information

Extended Abstract prepared for the Integrating Genetics in the Social Sciences Meeting 2014

Extended Abstract prepared for the Integrating Genetics in the Social Sciences Meeting 2014 Understanding the role of social and economic factors in GCTA heritability estimates David H Rehkopf, Stanford University School of Medicine, Division of General Medical Disciplines 1265 Welch Road, MSOB

More information

Chapter 17 Sensitivity Analysis and Model Validation

Chapter 17 Sensitivity Analysis and Model Validation Chapter 17 Sensitivity Analysis and Model Validation Justin D. Salciccioli, Yves Crutain, Matthieu Komorowski and Dominic C. Marshall Learning Objectives Appreciate that all models possess inherent limitations

More information

For general queries, contact

For general queries, contact Much of the work in Bayesian econometrics has focused on showing the value of Bayesian methods for parametric models (see, for example, Geweke (2005), Koop (2003), Li and Tobias (2011), and Rossi, Allenby,

More information

A Method for Utilizing Bivariate Efficacy Outcome Measures to Screen Agents for Activity in 2-Stage Phase II Clinical Trials

A Method for Utilizing Bivariate Efficacy Outcome Measures to Screen Agents for Activity in 2-Stage Phase II Clinical Trials A Method for Utilizing Bivariate Efficacy Outcome Measures to Screen Agents for Activity in 2-Stage Phase II Clinical Trials Michael W. Sill 1 and Greg Yothers 2 1 Adjunct Instructor; Research Assistant

More information

On the Targets of Latent Variable Model Estimation

On the Targets of Latent Variable Model Estimation On the Targets of Latent Variable Model Estimation Karen Bandeen-Roche Department of Biostatistics Johns Hopkins University Department of Mathematics and Statistics Miami University December 8, 2005 With

More information

Accelerating Innovation in Statistical Design

Accelerating Innovation in Statistical Design Implementing a National Cancer Clinical Trials System for the 21 st Century, Workshop #2 Session #5: Accelerating Innovation Through Effective Partnerships Accelerating Innovation in Statistical Design

More information

How many speakers? How many tokens?:

How many speakers? How many tokens?: 1 NWAV 38- Ottawa, Canada 23/10/09 How many speakers? How many tokens?: A methodological contribution to the study of variation. Jorge Aguilar-Sánchez University of Wisconsin-La Crosse 2 Sample size in

More information

Selection and estimation in exploratory subgroup analyses a proposal

Selection and estimation in exploratory subgroup analyses a proposal Selection and estimation in exploratory subgroup analyses a proposal Gerd Rosenkranz, Novartis Pharma AG, Basel, Switzerland EMA Workshop, London, 07-Nov-2014 Purpose of this presentation Proposal for

More information

investigate. educate. inform.

investigate. educate. inform. investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced

More information

Contents. Part 1 Introduction. Part 2 Cross-Sectional Selection Bias Adjustment

Contents. Part 1 Introduction. Part 2 Cross-Sectional Selection Bias Adjustment From Analysis of Observational Health Care Data Using SAS. Full book available for purchase here. Contents Preface ix Part 1 Introduction Chapter 1 Introduction to Observational Studies... 3 1.1 Observational

More information

Recent advances in non-experimental comparison group designs

Recent advances in non-experimental comparison group designs Recent advances in non-experimental comparison group designs Elizabeth Stuart Johns Hopkins Bloomberg School of Public Health Department of Mental Health Department of Biostatistics Department of Health

More information

Bayesian Prediction Tree Models

Bayesian Prediction Tree Models Bayesian Prediction Tree Models Statistical Prediction Tree Modelling for Clinico-Genomics Clinical gene expression data - expression signatures, profiling Tree models for predictive sub-typing Combining

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016

Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP I 5/2/2016 Design and Analysis Plan Quantitative Synthesis of Federally-Funded Teen Pregnancy Prevention Programs HHS Contract #HHSP233201500069I 5/2/2016 Overview The goal of the meta-analysis is to assess the effects

More information

Challenges of Observational and Retrospective Studies

Challenges of Observational and Retrospective Studies Challenges of Observational and Retrospective Studies Kyoungmi Kim, Ph.D. March 8, 2017 This seminar is jointly supported by the following NIH-funded centers: Background There are several methods in which

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

Interpretation of Epidemiologic Studies

Interpretation of Epidemiologic Studies Interpretation of Epidemiologic Studies Paolo Boffetta Mount Sinai School of Medicine, New York, USA International Prevention Research Institute, Lyon, France Outline Introduction to epidemiology Issues

More information

Version No. 7 Date: July Please send comments or suggestions on this glossary to

Version No. 7 Date: July Please send comments or suggestions on this glossary to Impact Evaluation Glossary Version No. 7 Date: July 2012 Please send comments or suggestions on this glossary to 3ie@3ieimpact.org. Recommended citation: 3ie (2012) 3ie impact evaluation glossary. International

More information

Screening for novel oncology biomarker panels using both DNA and protein microarrays. John Anson, PhD VP Biomarker Discovery

Screening for novel oncology biomarker panels using both DNA and protein microarrays. John Anson, PhD VP Biomarker Discovery Screening for novel oncology biomarker panels using both DNA and protein microarrays John Anson, PhD VP Biomarker Discovery Outline of presentation Introduction to OGT and our approach to biomarker studies

More information

AVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB

AVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits Next-generation performance in liquid biopsies 2 Accelerating clinical research From liquid biopsy to next-generation

More information

Sample Size Estimation for Microarray Experiments

Sample Size Estimation for Microarray Experiments Sample Size Estimation for Microarray Experiments Gregory R. Warnes Department of Biostatistics and Computational Biology Univeristy of Rochester Rochester, NY 14620 and Peng Liu Department of Biological

More information

Summary of main challenges and future directions

Summary of main challenges and future directions Summary of main challenges and future directions Martin Schumacher Institute of Medical Biometry and Medical Informatics, University Medical Center, Freiburg Workshop October 2008 - F1 Outline Some historical

More information

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

Numerous hypothesis tests were performed in this study. To reduce the false positive due to

Numerous hypothesis tests were performed in this study. To reduce the false positive due to Two alternative data-splitting Numerous hypothesis tests were performed in this study. To reduce the false positive due to multiple testing, we are not only seeking the results with extremely small p values

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Summary. 20 May 2014 EMA/CHMP/SAWP/298348/2014 Procedure No.: EMEA/H/SAB/037/1/Q/2013/SME Product Development Scientific Support Department

Summary. 20 May 2014 EMA/CHMP/SAWP/298348/2014 Procedure No.: EMEA/H/SAB/037/1/Q/2013/SME Product Development Scientific Support Department 20 May 2014 EMA/CHMP/SAWP/298348/2014 Procedure No.: EMEA/H/SAB/037/1/Q/2013/SME Product Development Scientific Support Department evaluating patients with Autosomal Dominant Polycystic Kidney Disease

More information

Penn State MPH Program Competencies 2013

Penn State MPH Program Competencies 2013 Core Competencies Five Core Areas of Public Health: Biostatistics 1. Explain the fundamental concepts of biostatistics. 2. Utilize common statistical methods (i.e., calculate, analyze, interpret, report)

More information

Dan Koller, Ph.D. Medical and Molecular Genetics

Dan Koller, Ph.D. Medical and Molecular Genetics Design of Genetic Studies Dan Koller, Ph.D. Research Assistant Professor Medical and Molecular Genetics Genetics and Medicine Over the past decade, advances from genetics have permeated medicine Identification

More information

ICH E9(R1) Technical Document. Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS

ICH E9(R1) Technical Document. Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS ICH E9(R1) Technical Document Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS A.1. Purpose and Scope A.2. A Framework to Align Planning, Design, Conduct,

More information

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................

More information

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2 UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2 DATE: 3 May 2017 MARKS: 75 ASSESSOR: Prof PJ Blignaut MODERATOR: Prof C de Villiers (UP) TIME: 2 hours

More information

2) Cases and controls were genotyped on different platforms. The comparability of the platforms should be discussed.

2) Cases and controls were genotyped on different platforms. The comparability of the platforms should be discussed. Reviewers' Comments: Reviewer #1 (Remarks to the Author) The manuscript titled 'Association of variations in HLA-class II and other loci with susceptibility to lung adenocarcinoma with EGFR mutation' evaluated

More information

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture Magdalena M.C. Mok, Macquarie University & Teresa W.C. Ling, City Polytechnic of Hong Kong Paper presented at the

More information

How to craft the Approach section of an R grant application

How to craft the Approach section of an R grant application How to craft the Approach section of an R grant application David Elashoff, PhD Professor of Medicine and Biostatistics Director, Department of Medicine Statistics Core Leader, CTSI Biostatistics Program

More information

Use of blinded independent central review of PFS in definitive trials

Use of blinded independent central review of PFS in definitive trials Use of blinded independent central review of PFS in definitive trials Lori E. Dodd, PhD Biostatistics Research Branch National Institute of Allergy and Infectious Diseases 1 Progression-free survival Use

More information

Bayesian hierarchical modelling

Bayesian hierarchical modelling Bayesian hierarchical modelling Matthew Schofield Department of Mathematics and Statistics, University of Otago Bayesian hierarchical modelling Slide 1 What is a statistical model? A statistical model:

More information

Estimands for time to event endpoints in oncology and beyond

Estimands for time to event endpoints in oncology and beyond Estimands for time to event endpoints in oncology and beyond Kaspar Rufibach, Hans Ulrich Burger Methods, Collaboration & Outreach Group (MCO) Department of Biostatistics, Roche Basel Context and problem

More information

INTERNAL VALIDITY, BIAS AND CONFOUNDING

INTERNAL VALIDITY, BIAS AND CONFOUNDING OCW Epidemiology and Biostatistics, 2010 J. Forrester, PhD Tufts University School of Medicine October 6, 2010 INTERNAL VALIDITY, BIAS AND CONFOUNDING Learning objectives for this session: 1) Understand

More information

SINGLE-CASE RESEARCH. Relevant History. Relevant History 1/9/2018

SINGLE-CASE RESEARCH. Relevant History. Relevant History 1/9/2018 SINGLE-CASE RESEARCH And Small N Designs Relevant History In last half of nineteenth century, researchers more often looked at individual behavior (idiographic approach) Founders of psychological research

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Checking the counterarguments confirms that publication bias contaminated studies relating social class and unethical behavior

Checking the counterarguments confirms that publication bias contaminated studies relating social class and unethical behavior 1 Checking the counterarguments confirms that publication bias contaminated studies relating social class and unethical behavior Gregory Francis Department of Psychological Sciences Purdue University gfrancis@purdue.edu

More information

Effective Implementation of Bayesian Adaptive Randomization in Early Phase Clinical Development. Pantelis Vlachos.

Effective Implementation of Bayesian Adaptive Randomization in Early Phase Clinical Development. Pantelis Vlachos. Effective Implementation of Bayesian Adaptive Randomization in Early Phase Clinical Development Pantelis Vlachos Cytel Inc, Geneva Acknowledgement Joint work with Giacomo Mordenti, Grünenthal Virginie

More information

Exploring and Validating Surrogate Endpoints in Colorectal Cancer. Center, Pittsburgh, USA

Exploring and Validating Surrogate Endpoints in Colorectal Cancer. Center, Pittsburgh, USA Page 1 Exploring and Validating Surrogate Endpoints in Colorectal Cancer Tomasz Burzykowski, PhD 1,2, Marc Buyse, ScD 1,3, Greg Yothers PhD 4, Junichi Sakamoto, PhD 5, Dan Sargent, PhD 6 1 Center for Statistics,

More information

Statistics for Clinical Trials: Basics of Phase III Trial Design

Statistics for Clinical Trials: Basics of Phase III Trial Design Statistics for Clinical Trials: Basics of Phase III Trial Design Gary M. Clark, Ph.D. Vice President Biostatistics & Data Management Array BioPharma Inc. Boulder, Colorado USA NCIC Clinical Trials Group

More information

IS IT GENETIC? How do genes, environment and chance interact to specify a complex trait such as intelligence?

IS IT GENETIC? How do genes, environment and chance interact to specify a complex trait such as intelligence? 1 IS IT GENETIC? How do genes, environment and chance interact to specify a complex trait such as intelligence? Single-gene (monogenic) traits Phenotypic variation is typically discrete (often comparing

More information

Section on Survey Research Methods JSM 2009

Section on Survey Research Methods JSM 2009 Missing Data and Complex Samples: The Impact of Listwise Deletion vs. Subpopulation Analysis on Statistical Bias and Hypothesis Test Results when Data are MCAR and MAR Bethany A. Bell, Jeffrey D. Kromrey

More information

Data Mining CMMSs: How to Convert Data into Knowledge

Data Mining CMMSs: How to Convert Data into Knowledge FEATURE Copyright AAMI 2018. Single user license only. Copying, networking, and distribution prohibited. Data Mining CMMSs: How to Convert Data into Knowledge Larry Fennigkoh and D. Courtney Nanney Larry

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

Confidence Intervals On Subsets May Be Misleading

Confidence Intervals On Subsets May Be Misleading Journal of Modern Applied Statistical Methods Volume 3 Issue 2 Article 2 11-1-2004 Confidence Intervals On Subsets May Be Misleading Juliet Popper Shaffer University of California, Berkeley, shaffer@stat.berkeley.edu

More information

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference Lecture Outline Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Statistical Inference Role of Statistical Inference Hierarchy of Experimental

More information

Strategies for handling missing data in randomised trials

Strategies for handling missing data in randomised trials Strategies for handling missing data in randomised trials NIHR statistical meeting London, 13th February 2012 Ian White MRC Biostatistics Unit, Cambridge, UK Plan 1. Why do missing data matter? 2. Popular

More information