Introduction; Study design

Similar documents
Study Design. Study design. Patrick Breheny. January 23. Patrick Breheny Introduction to Biostatistics (171:161) 1/34

Sampling Controlled experiments Summary. Study design. Patrick Breheny. January 22. Patrick Breheny Introduction to Biostatistics (BIOS 4120) 1/34

Introduction to Statistics Design of Experiments

Experimental design. Basic principles

Elementary Statistics and Inference. Elementary Statistics and Inference. 1.) Introduction. 22S:025 or 7P:025. Lecture 1.

AMS 5 EXPERIMENTAL DESIGN

STATISTICS: METHOD TO GET INSIGHT INTO VARIATION IN A POPULATIONS If every unit in the population had the same value,say

MATH-134. Experimental Design

Patrick Breheny. January 28

Confidence Intervals and Sampling Design. Lecture Notes VI

Math 140 Introductory Statistics

Study Methodology: Tricks and Traps

MATH& 146 Lesson 4. Section 1.3 Study Beginnings

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

What is Statistics? (*) Collection of data Experiments and Observational studies. (*) Summarizing data Descriptive statistics.

Math 124: Module 3 and Module 4

Unit 5. Thinking Statistically

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

Principle underlying all of statistics

Math 1680 Class Notes. Chapters: 1, 2, 3, 4, 5, 6

Bias in Sampling. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

REVIEW FOR THE PREVIOUS LECTURE

Chapter 1 Review Questions

Introduction to Statistics

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Math 124: Modules 3 and 4. Sampling. Designing. Studies. Studies. Experimental Studies Surveys. Math 124: Modules 3 and 4. Sampling.

Two-sample Categorical data: Measuring association

20. Experiments. November 7,

The Logic of Causal Order Richard Williams, University of Notre Dame, Last revised February 15, 2015

How is ethics like logistic regression? Ethics decisions, like statistical inferences, are informative only if they re not too easy or too hard 1

Vocabulary. Bias. Blinding. Block. Cluster sample

Applied Analysis of Variance and Experimental Design. Lukas Meier, Seminar für Statistik

Creative Commons Attribution-NonCommercial-Share Alike License

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

Chapter 13 Summary Experiments and Observational Studies

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Sampling and Data Collection

Math 2a: Lecture 1. Agenda. Course organization. History of the subject. Why is the subject relevant today? First examples

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

Subliminal Messages: How Do They Work?

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

STA 291 Lecture 4 Jan 26, 2010

Observational study is a poor way to gauge the effect of an intervention. When looking for cause effect relationships you MUST have an experiment.

CHAPTER 2 SOCIOLOGICAL RESEARCH METHODS

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time.

Higher Psychology RESEARCH REVISION

BIAS: The design of a statistical study shows bias if it systematically favors certain outcomes.

Making decisions about therapy

2. METHODS OF DATA COLLECTION

This is an edited transcript of a telephone interview recorded in March 2010.

Political Science 15, Winter 2014 Final Review

Collecting Data Example: Does aspirin prevent heart attacks?

Problems for Chapter 8: Producing Data: Sampling. STAT Fall 2015.

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics

Controlled experiments

Villarreal Rm. 170 Handout (4.3)/(4.4) - 1 Designing Experiments I

Choosing Life: Empowerment, Action, Results! CLEAR Menu Sessions. Substance Use Risk 2: What Are My External Drug and Alcohol Triggers?

(WG Whitfield Growden, MD; DR Diane Redington, CRNP)

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology

Experiments in the Real World

Confidence in Sampling: Why Every Lawyer Needs to Know the Number 384. By John G. McCabe, M.A. and Justin C. Mary

What Constitutes a Good Contribution to the Literature (Body of Knowledge)?

Evidence Informed Practice Online Learning Module Glossary

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

Reduce Tension by Making the Desired Choice Easier

Human intuition is remarkably accurate and free from error.

Probability, Statistics, Error Analysis and Risk Assessment. 30-Oct-2014 PHYS 192 Lecture 8 1

Just What the Doctor Ordered

Moore, IPS 6e Chapter 03

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Reasoning about probabilities (cont.); Correlational studies of differences between means

Daily Agenda. Honors Statistics. 1. Check homework C4#9. 4. Discuss 4.3 concepts. Finish 4.2 concepts. March 28, 2017

How to CRITICALLY APPRAISE

How are polls conducted? by Frank Newport, Lydia Saad, David Moore from Where America Stands, 1997 John Wiley & Sons, Inc.

Psychology: The Science

What is Statistics? Ronghui (Lily) Xu (UCSD) An Introduction to Statistics 1 / 23

How to Work with the Patterns That Sustain Depression

CHAPTER 6. Experiments in the Real World

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

Section 4 Decision-making

STA Module 1 Introduction to Statistics and Data

Public and patient values about informed choice and mammography screening: Results from 4 Ontario deliberations

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

If you could interview anyone in the world, who. Polling. Do you think that grades in your school are inflated? would it be?

Review: Conditional Probability. Using tests to improve decisions: Cutting scores & base rates

STAT 110: Introduction to Descriptive Statistics

How Math (and Vaccines) Keep You Safe From the Flu

Sam: Annette, can we just start out with a brief simple explanation of what neuro-immune diseases are, including ME and CFS?

Next, we ll discuss some terminology that is typically used when discussing randomized experiments.

Chapter 7: Descriptive Statistics

Research Process. the Research Loop. Now might be a good time to review the decisions made when conducting any research project.

The HPV Data Is In What Do the Newest Updates in Screening Mean For Your Patients?

Observational studies; descriptive statistics

CHAPTER 5: PRODUCING DATA

Introduction to Bioethics

THE PUBLIC AND GENETIC EDITING, TESTING, AND THERAPY

Transcription:

; Study design Patrick Breheny January 12 Patrick Breheny STA 580: Biostatistics I 1/43

What is statistics? What is biostatistics, and why is it important? The statistical framework Statistics is the science of learning from experience In principle, people do this every day of their lives, and should be really good at it...... but we re not. Patrick Breheny STA 580: Biostatistics I 2/43

Limitations of human reasoning What is biostatistics, and why is it important? The statistical framework Human beings are not natural statisticians We are not good at picking out patterns from a sea of noisy data On the flip side, we are too good at picking out non-existent patterns from small numbers of observations We also find it difficult to sort out the effects of multiple factors occurring simultaneously Finally, we are subject to all sorts of biases depending on our personalities, emotions, and past experiences Patrick Breheny STA 580: Biostatistics I 3/43

Biostatistics What is biostatistics, and why is it important? The statistical framework In medicine, public health, and the biological sciences, we must often make decisions in the presence of uncertainty: Which drug should a doctor prescribe to treat an illness? An individual has a certain genetic mutation; what are the chances that she will develop breast cancer? Does a certain pesticide cause cancer? Should it be banned? These questions are too important to be left to opinion, superstition, and conjecture, which is why there has been a tremendous push for objective, evidence-based decision making in medicine and public health in the past several decades Patrick Breheny STA 580: Biostatistics I 4/43

Why do we need biostatistics? What is biostatistics, and why is it important? The statistical framework Statistics is the science which allows us to make these decisions Statistics is particularly important in research concerning humans, for several reasons: Humans are incredibly diverse and variable Humans are expensive to perform research upon There is a moral imperative to make decisions on potentially life-saving therapies as fast as possible For these reasons, Biostatistics has emerged as an important field within statistics, and has become an important set of skills in fields such as medicine, biology, and public health Patrick Breheny STA 580: Biostatistics I 5/43

Terms What is biostatistics, and why is it important? The statistical framework Scientists want to make generalizations about classes of people on the basis of their findings The class of people that they are trying to make generalizations about is called the population It is impractical to study the entire population, so people study only a small portion of it called the sample The researchers then make generalizations about the entire population based on studying the sample; this process is called inference Patrick Breheny STA 580: Biostatistics I 6/43

What is biostatistics, and why is it important? The statistical framework Population Study design Inference Sample Patrick Breheny STA 580: Biostatistics I 7/43

The three basic questions in statistics What is biostatistics, and why is it important? The statistical framework How should I collect my data (sampling)? How should I describe and summarize the data that I ve collected? What does my data tell me about the way that the world works (inference)? Patrick Breheny STA 580: Biostatistics I 8/43

Parameters What is biostatistics, and why is it important? The statistical framework Specifically, there is some numerical fact about the population that the investigator is interested in, such as the percent of children who are obese or the average reduction in cholesterol upon taking a certain drug These numbers that describe the population are called parameters Parameters cannot be determined exactly; they can only be estimated from a sample An estimate, or statistic, is a number that can be computed from a sample Patrick Breheny STA 580: Biostatistics I 9/43

How good are our estimates? What is biostatistics, and why is it important? The statistical framework So, Estimates are what investigators know Parameters are what investigators want to know We would like to know whether or not our estimate is measuring the parameter of interest well There are two major issues: On average, does our estimate tend to be centered around the right answer, or is it biased? How much variability is there likely to be in our estimate? Patrick Breheny STA 580: Biostatistics I 10/43

in the ideal world In an ideal world, We have a list of everyone in the population of interest We randomly sample these people It is equally costly to sample one person as it is another No one ever refuses to be sampled Patrick Breheny STA 580: Biostatistics I 11/43

in the real world In the real world, all of these assumptions may fail: We have to get access to people somehow in order to sample them The way in which we do that may create bias It is often more cost effective to sample people in one location than throughout a large area Not all people are equally likely to participate in your study Patrick Breheny STA 580: Biostatistics I 12/43

Example: The 1936 Presidential Election In 1936, Franklin Delano Roosevelt was completing his first term of office as president of the United States, and was up for re-election The Republican candidate that year was Alfred Landon The country was still in the Great Depression, with 9 million unemployed and real income only two-thirds of what it was in 1929 Roosevelt and Landon had very different views about how much the government should be spending on policies to bring the country out of the depression Patrick Breheny STA 580: Biostatistics I 13/43

Example: The 1936 Presidential Election (cont d) The Literary Digest magazine had predicted the winner in every presidential election since 1916 For the 1936 election, the Digest sampled 2.4 million people and predicted a landslide victory for Landon: 57% to 43% In the actual election, Roosevelt won 62% to 38% Patrick Breheny STA 580: Biostatistics I 14/43

What went wrong? How could this poll have been so incredibly far off? The poll had an enormous sample size; variability of the estimate is not the issue Instead, the flaw was bias Patrick Breheny STA 580: Biostatistics I 15/43

Where the bias came from, Part I The Digest mailed 10 million questionnaires to addresses gathered from telephone books and club membership lists This tended to screen out the poor, who tended not to belong to clubs or to own telephones (at the time, only one out of every four households owned a telephone) This is called selection bias: instead of random sampling, certain subgroups of the population were more likely to be included than others Patrick Breheny STA 580: Biostatistics I 16/43

Where the bias came from, Part II The Digest s poll contained another flaw: only 2.4 million people replied, out of the 10 million who got the questionnaire Nonresponders can differ from responders in many important ways This type of bias is called nonresponse bias Thus, the 2.4 million respondents in the Digest s poll do not even represent the 10 million people who were polled, let alone the entire population of voters Patrick Breheny STA 580: Biostatistics I 17/43

Survey design The study of sampling surveys is an important aspect of experimental design, and is further explored in CPH 631 For this course (STA 580), the important things to keep in mind are: Samples are subject to selection bias and nonresponse bias A researcher can only generalize her findings to the population that they sampled from Patrick Breheny STA 580: Biostatistics I 18/43

Pharmaceutical trials and children The issue of pharmaceutical trials and children is a biostatistical example of this last point For practical, ethical, and economic reasons, clinical trials usually involve only adults children are excluded (only about 25% of drugs are subjected to pediatric studies Physicians, however, are allowed to use any FDA-approved drug in any way that they think is beneficial, and aren t required to inform parents if the therapy hasn t been tested on children Patrick Breheny STA 580: Biostatistics I 19/43

Propofol For example, propofol is a sedative that has consistently proved safe in adults In 1992, after several children who received propofol died, the British government recommended against using it on patients under 16 In the U.S., however, propofol continued to be widely used In 2001, the manufacturers of propofol conducted a randomized controlled, blinded trial and found that 9.5% of children on propofol died, compared with 3.8% of children on a different sedative The FDA has now added a warning indicating this, although it is still legal to administer propofol to children Patrick Breheny STA 580: Biostatistics I 20/43

The method of comparison Even under ideal sampling, however, there are still questions of study design Suppose a new therapy is developed; how should we design an experiment to test whether or not it is effective? The only meaningful way to do this is to compare it with something else Thus, we are going to have to obtain two samples (or one sample, then split it into two groups): The therapy is given to subjects in the treatment group The other subjects are not treated and are used as controls Patrick Breheny STA 580: Biostatistics I 21/43

Assigning treatment Subjects should be assigned to treatment or control at random Furthermore, the experiment should be double blinded: The subjects should not know which group they are in The doctors should not know which group the subjects are in Patrick Breheny STA 580: Biostatistics I 22/43

The U.S. polio epidemic A polio epidemic hit the United States in 1916 During the next forty years, hundreds of thousands of people, especially children, fell victim to the disease By the 1950s, several vaccines had been developed One in particular, developed by Jonas Salk, seemed very promising based on laboratory studies Patrick Breheny STA 580: Biostatistics I 23/43

Field trials In 1954, the Public Health Service organized an experiment to see whether the Salk vaccine would protect children from polio outside the laboratory The subjects were children in the most vulnerable age groups: grades 1, 2, and 3 Two million children were involved: some were vaccinated, some refused treatment, and some were deliberately left unvaccinated Patrick Breheny STA 580: Biostatistics I 24/43

Ethical issues This raises issues of medical ethics, which are always a consideration in medical studies Is it ethical to leave some children deliberately unvaccinated? Maybe a more ethical design would be to offer the vaccine to all children, and the children whose parents refused vaccination would serve as the controls Patrick Breheny STA 580: Biostatistics I 25/43

Confounding There is a big problem with the ethical design Higher-income parents are more likely to consent to treatment, and their children are more likely to suffer from polio The reason for this is that children from poorer backgrounds are more likely to contract mild cases of polio early in childhood, while still protected by antibodies from their mothers Thus, differences between treatment and control groups could be due to parental income, not the treatment Family background here is said to be a confounding factor; confounding is a major source of bias Patrick Breheny STA 580: Biostatistics I 26/43

Ethical issues, Part II So, the ethical design isn t really all that ethical, in the sense that it won t correctly determine whether the vaccine works or not Although this is always an issue in medical studies, it is important to remember that when a new therapy first emerges, no one really knows whether or not its benefits will outweigh its risks This uncertainty is what justifies withholding treatment from the control group Patrick Breheny STA 580: Biostatistics I 27/43

Making treatment and control groups similar To avoid bias and confounding, it is important that the treatment and control groups be as similar as possible except for the treatment It is important, then, that both treatment and control groups be chosen from the same population: children whose parents consented to treatment But how should we decide which children go in which group? One approach would be to use human judgment to try to make the treatment and control group as similar as possible with respect to all the relevant variables Patrick Breheny STA 580: Biostatistics I 28/43

Randomized controlled trials Experience shows, however, that this is a bad idea Human judgments often result in substantial bias It is much better to use a carefully designed random procedure For the Salk vaccine trial, this is equivalent to flipping a coin: heads, the child gets the vaccine; tails, the child does not Experiments in which an impartial chance procedure determines whether a subject is placed into the treatment or control group are called randomized controlled experiments Patrick Breheny STA 580: Biostatistics I 29/43

Placebos Another basic precaution is the use of a placebo In the Salk vaccine trial, children in the control group were given an injection of salt dissolved in water Therefore, the children did not know whether they had received the treatment; this ensures that their response is due to the vaccine itself, not the idea of treatment This may not seem important, but the placebo effect can be surprisingly strong Patrick Breheny STA 580: Biostatistics I 30/43

Double-blinding To see whether or not polio was being prevented, physicians would have to determine whether or not the children had contracted polio Many forms of polio are hard to diagnose, and borderline cases could be influenced by a physician s knowledge of whether the child was vaccinated So, another precaution taken in the Salk vaccine trial is that the doctors were not told which group the child belonged to Thus, neither the subjects nor the doctors knew who was in the treatment group and who was in the control group The Salk vaccine trial was therefore a randomized controlled double-blind experiment This is pretty much the best design there is Patrick Breheny STA 580: Biostatistics I 31/43

The results of the trial Polio cases per Size of group 100,000 children Treatment 200,000 28 Control 200,000 71 No consent 350,000 46 Patrick Breheny STA 580: Biostatistics I 32/43

Another design Randomized controlled double-blind experiments are now recognized to be the gold standard for experiments, but this was not the case in the 1950s There was a lot of disagreement over the best way to design this study In addition to the design we just talked about, a second design proposed by the National Foundation for Infantile Paralysis (NFIP) was carried out In the NFIP design, all second graders would be offered the vaccine, and children in grades 1 and 3 would serve as controls Patrick Breheny STA 580: Biostatistics I 33/43

Results of both trials Randomized controlled double-blind experiment Size Rate Treatment 200,000 28 Control 200,000 71 No consent 350,000 46 NFIP study Size Rate Grade 2 (vaccine) 225,000 25 Grades 1 & 3 (control) 725,000 54 Grade 2 (no consent) 125,000 44 Patrick Breheny STA 580: Biostatistics I 34/43

Conclusions We observed lower incidence of Polio in the vaccine group than the placebo group; what could be causing it? Confounding? No Perception bias? No Chance? The incidence of Polio could be lower in the vaccine group simply by random chance Next week, we will investigate whether or not this is a plausible explanation Patrick Breheny STA 580: Biostatistics I 35/43

Conclusions (cont d) Poor study design can bias results Randomized controlled double-blind designs reduce bias to a minimum, and that is why they should be used whenever possible We will now look at a few other examples, in less detail, that further illustrate the hidden biases and confounding factors that are possible when trying to study cause and effect in a human population Patrick Breheny STA 580: Biostatistics I 36/43

Clofibrate The Coronary Drug Project Research group published an article in the New England Journal of Medicine (1980) describing a randomized controlled double-blind experiment involving the drug clofibrate, which reduces the level of cholesterol in the blood Clofibrate Number Deaths Adherers 708 15% Nonadherers 357 25% Total 1,103 20% Subjects who took more than 80% of their prescribed medicine were called adherers Patrick Breheny STA 580: Biostatistics I 37/43

Interpreting the clofibrate results This looks like strong evidence that clofibrate is effective, but caution is in order Subjects were randomized with respect to whether they received the drug; they were not randomized with respect to their adherence Thus, confounding is possible Patrick Breheny STA 580: Biostatistics I 38/43

Clofibrate and placebo results Clofibrate Placebo Number Deaths Number Deaths Adherers 708 15% 1,813 15% Nonadherers 357 25% 882 28% Total 1,103 20% 2,789 21% Taking into account the placebo results as well, clofibrate no longer looks effective One possibility is that adherers are more concerned with their health, and take better care of themselves in general Take-home message: comparing subjects as they were randomized is the only completely valid way of carrying out a controlled experiment; all other comparisons are subject to confounding and bias Patrick Breheny STA 580: Biostatistics I 39/43

Portacaval shunts Patients with cirrhosis of the liver may start to hemorrhage and bleed to death One treatment involves surgery to redirect the flow of blood through what is called a portacaval shunt, a long and hazardous operation A bunch of studies were done in the 1950s and 1960s trying to determine whether the benefits of this surgery outweighed its risks Patrick Breheny STA 580: Biostatistics I 40/43

Portacaval shunt studies Degree of enthusiasm Design Marked Moderate None No controls 24 7 1 Controls, but not randomized 10 3 2 Randomized controlled 0 1 3 Patrick Breheny STA 580: Biostatistics I 41/43

Conclusions The poorly designed studies greatly exaggerated the value of the surgery One possible explanation is that in an experiment without randomized controls, many physicians have a natural tendency to treat only the patients who are in relatively good shape This biases the study in favor of the treatment Patrick Breheny STA 580: Biostatistics I 42/43

Diethylstibesterol DES (Diethylstibesterol) is an artificial hormone used to prevent miscarriage in pregnant women Five studies of DES were carried out using historical controls (outcome rates of patients from the past); all had favorable conclusions regarding the value of the therapy Three randomized controlled designs were carried out, and all were negative about the value of DES Doctors paid attention to the positive studies and ignored the randomized controlled studies, giving the drug to 50,000 women each year throughout the 1960s This turned out to be a medical tragedy DES has the disastrous side effect of causing cancer in female offspring; DES was banned in 1971 Patrick Breheny STA 580: Biostatistics I 43/43