Hierarchy of Statistical Goals

Similar documents
Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

MS&E 226: Small Data

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Bayesian and Frequentist Approaches

Fundamental Clinical Trial Design

UW Biostatistics Working Paper Series

A Case Study: Two-sample categorical data

Biost 524: Design of Medical Studies

Lecture Outline Biost 517 Applied Biostatistics I

Beyond Subjective and Objective in Statistics

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods

How to use the Lafayette ESS Report to obtain a probability of deception or truth-telling

Biost 590: Statistical Consulting

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

investigate. educate. inform.

For general queries, contact

Understanding Statistics for Research Staff!

Practical and ethical advantages of Bayesian approaches in adaptive clinical trial designs. Kristian Thorlund

A Brief Introduction to Bayesian Statistics

Bayesians methods in system identification: equivalences, differences, and misunderstandings

Learning from data when all models are wrong

Examining differences between two sets of scores

Research Design. Beyond Randomized Control Trials. Jody Worley, Ph.D. College of Arts & Sciences Human Relations

Ecological Statistics

STATISTICS AND RESEARCH DESIGN

Regression Discontinuity Designs: An Approach to Causal Inference Using Observational Data

Decision Making in Confirmatory Multipopulation Tailoring Trials

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s

Information Systems Mini-Monograph

Measurement Error in Nonlinear Models

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

Connexion of Item Response Theory to Decision Making in Chess. Presented by Tamal Biswas Research Advised by Dr. Kenneth Regan

2 Critical thinking guidelines

What is a probability? Two schools in statistics: frequentists and Bayesians.

Three Attempts to Replicate the Behavioral Sunk-Cost Effect: A Note on Cunha and Caldieraro (2009)

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

Bayesian vs Frequentist

Retrospective power analysis using external information 1. Andrew Gelman and John Carlin May 2011

TEACHING BAYESIAN METHODS FOR EXPERIMENTAL DATA ANALYSIS

School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019

Business Research: Principles and Processes MGMT6791 Workshop 1C: The Nature of Research & Scientific Method

Supplementary notes for lecture 8: Computational modeling of cognitive development

MTAT Bayesian Networks. Introductory Lecture. Sven Laur University of Tartu

Chapter 1: Introduction to Statistics

What is probability. A way of quantifying uncertainty. Mathematical theory originally developed to model outcomes in games of chance.

Risk Assessment Terminology, Expression of Nature and level of Risk and Uncertainties

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

DECISION ANALYSIS WITH BAYESIAN NETWORKS

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger

Psychology Research Process

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Choosing the correct statistical test in research

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic

Statistical Hocus Pocus? Assessing the Accuracy of a Diagnostic Screening Test When You Don t Even Know Who Has the Disease

Part 1. Online Session: Math Review and Math Preparation for Course 5 minutes Introduction 45 minutes Reading and Practice Problem Assignment

EXPERIMENTAL RESEARCH DESIGNS

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Statistical Techniques. Masoud Mansoury and Anas Abulfaraj

Psychology Research Process

Biostatistics for Med Students. Lecture 1

Problem Set and Review Questions 2

Hypothesis Testing. Richard S. Balkin, Ph.D., LPC-S, NCC

Individualized Treatment Effects Using a Non-parametric Bayesian Approach

Standard Scores. Richard S. Balkin, Ph.D., LPC-S, NCC

An Introduction to Bayesian Statistics

Common Statistical Issues in Biomedical Research

Large simple randomized trials and therapeutic decisions

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS

Bayesian performance

Neurophysiology and Information: Theory of Brain Function

Sensory Cue Integration

Lessons in biostatistics

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

3. Fixed-sample Clinical Trial Design

Bayesian Inference Bayes Laplace

Estimands. EFPIA webinar Rob Hemmings, Frank Bretz October 2017

3. Fixed-sample Clinical Trial Design

Evaluation Models STUDIES OF DIAGNOSTIC EFFICIENCY

Performance of Median and Least Squares Regression for Slightly Skewed Data

Statistical Decision Theory and Bayesian Analysis

Bayesian (Belief) Network Models,

STATISTICAL INFERENCE 1 Richard A. Johnson Professor Emeritus Department of Statistics University of Wisconsin

Placebo and Belief Effects: Optimal Design for Randomized Trials

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

Insight Assessment Measuring Thinking Worldwide

Biost 517: Applied Biostatistics I Emerson, Fall Homework #2 Key October 10, 2012

A simulation study of person-fit in the Rasch model

Bayesian Inference. Review. Breast Cancer Screening. Breast Cancer Screening. Breast Cancer Screening

arxiv: v1 [math.st] 8 Jun 2012

How do People Really Think about Climate Change?

Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN

Understandable Statistics

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

Statistical Issues in Road Safety Part I: Uncertainty, Variability, Sampling

On the diversity principle and local falsifiability

EXTRAPOLATION CHALLENGES IN PEDIATRIC PAH

Transcription:

Hierarchy of Statistical Goals Ideal goal of scientific study: Deterministic results Determine the exact value of a ment or population parameter Prediction: What will the value of a future observation be? Comparing groups: What is the difference between response across two populations? Problem: In the real world, few patterns are deterministic, so we do not observe the same outcome for all individuals Hidden (und) variables Inherent randomness : 1 : 2 Second choice: Probability model for response Determine the tendency for the response Prediction: What is the probability that a future observation will be some value? Within groups: What is the average response within the group? Comparing groups: What is the difference in average response between groups Second choice: Probability model for response (cont.) Consider the distribution of outcomes for individuals receiving intervention Use a probability model to describe distribution of response Usually choose a summary of the distribution e.g.. mean, median, etc. Scientific questions then expressed for values of summary : 3 : 4

Often we have many choices for the summary to be compared across treatment groups Example: Treatment of high blood pressure with a primary outcome of systolic blood pressure at end of treatment Statistical analysis might for example compare Average Median Percent above 160 mm Hg Mean or median time until blood pressure below 140 mm Hg Summary for comparison should most often be driven by scientific issues Thresholds may be most important clinically Means allow estimates of total costs/benefits Medians less sensitive to outliers Sometimes clinical importance is not proportional to magnitude of ments However, sometimes effect of intervention is only on outliers : 5 : 6 Sometimes choice of summary is more arbitrary Types of scientific questions Existence of an effect on the distribution Direction of effect on the distribution Linear approximations to effect on summary Quantifying dose-response on summary Only last two need dictate a choice of summary In any case, in choosing the summary used to define treatment effect, we should consider (in order of importance) Current state of knowledge about treatment effect Scientific (clinical) relevance of summary Plausibility that treatment would affect the summary Statistical precision of inference about the summary : 7 : 8

(Semi)Parametric vs Nonparametric Models In addition to the summary, we must also choose a model for the distribution of the data Parametric models assume a known shape for the distribution of the data Semiparametric models assume that the shape is similar in some way across groups, but do not otherwise make any assumptions about the exact shape of the distribution Nonparametric models make no assumption about how the shape of the distribution might be similar (or different) across groups As a general rule, it is rare that there is any advantage in assuming a parametric model in real life IF we do not even know whether an intervention affects the mean (or median, etc.) of a distribution (characteristics related to first moments), THEN why would we ever be willing to base our conclusions on assumptions about how the intervention might affect the shape of the distribution (characteristics that depend on 2nd, 3rd,, moments)? : 9 : 10 Luckily, there is rarely a need to assume a parametric model E.g., methods derived from normal theory are usually distribution free tests in large samples Problem: The distribution (or summary ) for the outcome is not directly observable Use a sample to estimate the distribution (or summary ) of outcomes It should also be noted that many semiparametric tests are quite sensitive to an unrealistic assumption E.g., proportional hazards models for survival data over long periods of time Qualification: Distribution free Bayesian methods are not as well established (but we re working on it: Coarsened Bayes) Such an estimate is thus subject to sampling error In presence of sampling error, we need an infinite sample size to discriminate between contiguous hypotheses (see later discussion on statistical criteria for evidence) : 11 : 12

Bayesian Inference Third choice: Bayesian methods Use the sample to estimate the probability that the hypotheses are true Probability of hypotheses given the observed data Such a Bayesian approach is analogous to the problem of diagnosing disease in patients using a diagnostic procedure Statistical analysis is used to diagnose a beneficial treatment Using a sample, we compute an estimate of treatment effect The estimate takes on the role of the diagnostic test result Using the probability model, we can compute the probability of observing results under various hypotheses The hypothesis of a beneficial treatment might be like the disease : 13 : 14 Bayesian Inference Bayesian Inference The probability that the hypothesis is true is then like the predictive value of a positive test result In order to use Bayes rule, we must have some of the prevalence of a beneficial treatment Such a is termed the prior distribution, because it is our estimate of the probability of a beneficial treatment prior to observing any data The probability of the hypotheses based on the data is then called the posterior distribution The actual implementation of Bayesian inference is a generalization of the diagnostic testing situation The estimate of treatment effect is continuous, rather than just positive or negative The parameter measuring a beneficial treatment is continuous, rather than just healthy or diseased The quantification of the prior distribution is thus an entire distribution (a probability for every possible value of the treatment effect) rather than a single prevalence. : 15 : 16

Bayesian Inference The criticism of Bayesian inference is that we usually do not know the prior probability of a beneficial treatment As we have seen, the predictive values are very sensitive to the choice of prior distribution Possible remedies: Use data from previous experiments Use subjective opinion or consensus of experts Do a sensitivity analysis over many different choices for the prior distribution Use frequentist approaches Fourth choice: Frequentist methods Calculate the probability of observing data such was obtained in the experiment under the hypotheses Not affected by subjective choice of prior distributions But not really answering the most important question : 17 : 18 Fourth choice: Frequentist methods (cont.) Frequentist methods consider the sampling distribution of statistics over (conceptual) replications of the same study If we were to repeat the study a large number of times (under the exact same conditions) what would be the distribution of the statistics computed from the samples obtained Fourth choice: Frequentist methods (cont.) We do not usually have enough data to know what would happen if we repeated the study under the true setting, but we can often guess what would happen under specific hypotheses Hence, frequentists characterize the sampling distribution under specific hypotheses and compare the observed data to what might reasonably have been obtained if that hypothesis were true : 19 : 20

Example: When playing poker, I get 4 full houses in a row Bayesian: Knows the probability that I might be a cheater based on information derived prior to observing me play Knows the probability that I would get 4 full houses for every level of cheating that I might engage in Computes the posterior probability that I was cheating (probability after observing me play) If that probability is low, calls me a cheater Example: When playing poker, I get 4 full houses in a row (cont.) Frequentist: Hypothetically assumes I am not a cheater Knows the probability that I would get 4 full houses if I were not a cheater If that probability is sufficiently low, calls me a cheater : 21 : 22