Basic Biostatistics. Dr. Kiran Chaudhary Dr. Mina Chandra

Similar documents
Samples, Sample Size And Sample Error. Research Methodology. How Big Is Big? Estimating Sample Size. Variables. Variables 2/25/2018

9 research designs likely for PSYC 2100

Introduction to Statistical Data Analysis I

Clinical research in AKI Timing of initiation of dialysis in AKI

Theory. = an explanation using an integrated set of principles that organizes observations and predicts behaviors or events.

NORTH SOUTH UNIVERSITY TUTORIAL 1

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Evidence-based practice tutorial Critical Appraisal Skills

Evidence Informed Practice Online Learning Module Glossary

Never P alone: The value of estimates and confidence intervals

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Understandable Statistics

Critical Appraisal Series

Unit 1 Exploring and Understanding Data

Introduction to Statistics

Psychology Research Process

Observational studies; descriptive statistics

Interpretation of Data and Statistical Fallacies

Appendix B Statistical Methods

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego

Psychology Research Process

Statistics for Psychology

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

The comparison or control group may be allocated a placebo intervention, an alternative real intervention or no intervention at all.

Statistical Techniques. Masoud Mansoury and Anas Abulfaraj

Survival Skills for Researchers. Study Design

STATISTICS AND RESEARCH DESIGN

Module 28 - Estimating a Population Mean (1 of 3)

POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY

The degree to which a measure is free from error. (See page 65) Accuracy

2 Critical thinking guidelines

BIOSTATISTICAL METHODS

STA Module 9 Confidence Intervals for One Population Mean

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

Example The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3

Psychologist use statistics for 2 things

Chapter 1: Introduction to Statistics

Essential Skills for Evidence-based Practice Understanding and Using Systematic Reviews

Lesson 9 Presentation and Display of Quantitative Data

AP Psychology -- Chapter 02 Review Research Methods in Psychology

10 Intraclass Correlations under the Mixed Factorial Design

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

the standard deviation (SD) is a measure of how much dispersion exists from the mean SD = square root (variance)

Power & Sample Size. Dr. Andrea Benedetti

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

The normal curve and standardisation. Percentiles, z-scores

Chapter 1: Exploring Data

APPENDIX N. Summary Statistics: The "Big 5" Statistical Tools for School Counselors

AP Stats Review for Midterm

Student Performance Q&A:

UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4

METHOD VALIDATION: WHY, HOW AND WHEN?

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Applied Statistical Analysis EDUC 6050 Week 4

Global Clinical Trials Innovation Summit Berlin October 2016

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

Review Statistics review 2: Samples and populations Elise Whitley* and Jonathan Ball

BIOSTATISTICS. Dr. Hamza Aduraidi

Homework Exercises for PSYC 3330: Statistics for the Behavioral Sciences

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology

Determining the size of a vaccine trial

University of Wollongong. Research Online. Australian Health Services Research Institute

Chapter 20: Test Administration and Interpretation

Part III Taking Chances for Fun and Profit

Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50

Types of Statistics. Censored data. Files for today (June 27) Lecture and Homework INTRODUCTION TO BIOSTATISTICS. Today s Outline

ADMS Sampling Technique and Survey Studies

Introduction & Basics

Clinical Research Scientific Writing. K. A. Koram NMIMR

Human-Computer Interaction IS4300. I6 Swing Layout Managers due now

Quantitative Literacy: Thinking Between the Lines

Collecting & Making Sense of

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Common Statistical Issues in Biomedical Research

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Descriptive Statistics Lecture

Sta 309 (Statistics And Probability for Engineers)

Learning Objectives 9/9/2013. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency

12.1 Inference for Linear Regression. Introduction

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

Normal Distribution. Many variables are nearly normal, but none are exactly normal Not perfect, but still useful for a variety of problems.

Introduction to Statistics. Mahmoud Alhussami, DSc., PhD.

4 Diagnostic Tests and Measures of Agreement

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

Understanding Statistics for Research Staff!

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

PubHlth Introductory Biostatistics Practice Test I (Without Unit 3 Questions)

Chapter 2 Psychological Research, Methods and Statistics

Lecture Notes Module 2

Objectives. Distinguish between primary and secondary studies. Discuss ways to assess methodological quality. Review limitations of all studies.

GLOSSARY OF GENERAL TERMS

An Introduction to Research Statistics

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

Sanjay P. Zodpey Clinical Epidemiology Unit, Department of Preventive and Social Medicine, Government Medical College, Nagpur, Maharashtra, India.

Types of Biomedical Research

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Collecting & Making Sense of

Transcription:

Basic Biostatistics Dr. Kiran Chaudhary Dr. Mina Chandra

Overview 1.Importance of Biostatistics 2.Biological Variations, Uncertainties and Sources of uncertainties 3.Terms- Population/Sample, Validity/ Reliability, Precision/Accuracy, Gaussian distribution, Confidence intervals. 4. Sampling Techniques 5.Summarisation of Medical Data-Averages, Tables, Graphs, Plots etc.

Introduction

What is Statistics? A science that deals with Collection, Organization/Classification, Tabulation, Analysis, Interpretation and Presentation of information, that can be presented numerically and/or graphically, to summarize results in a meaningful way, to help us answer a question of interest.

Why Study Biostatistics? Clinician-One drug superior to another with P<0.05 To assess contribution of risk factors (hereditary, biological, environmental),in development of disease-e.g. Association between lifestyle and heart disease To decide between two or three alternative strategies treatment /diagnostic e.g. ELISA, NAT, Both Policy makers-situational analysis for carrying out a control program e.g. AIDS control program, sex workers and IV drug users Administrators-e.g. Bed occupancy rates, projecting requirement, needs assessment

Why Study Biostatistics Biological variations and uncertainties bound to be present in Nature Biostatistics helps to control the impact of uncertainties, to measure its magnitude And to take valid decisions, that is least affected by such uncertainties. GOAL of Research-to minimize errors that threaten conclusions based on inferences/ to keep errors at an acceptable level

Classification

Types of Statistics Descriptive Statistics- Description of data in a study Includes techniques for organizing, summarizing and presenting data using tables, graphs, plots etc. E.g. Mean, SD of height of children E.g. Proportion of overweight children is 20%

Types of Statistics Inferential Statistics- consists of statistical methods for making inferences about a population based on information obtained from a sample Estimation-Using results from a sample, to estimate levels in a larger population Testing Hypothesis-Comparing two groups and ascertaining whether there is true difference between the groups

BIOSTATISTICS Descriptive Inferential Continuous Discrete Estimation Hypothesis Testing Proportions% P Values Central Dispersion Confidence Intervals tendency

Biological Variations & UNCERTAINTIES DEALING WITH IMPERFECT INFORMATION

Sources of Medical Uncertainties 1.Genuine Variability-Biological, Environmental, Sampling Fluctuations, Chance variability 2.Variability due to un standardized methods- Observer variability, Instrument variability, Laboratory variability 3.Variability due to partial or erroneous information-unavoidable incomplete information, Avoidable incomplete information, Partial compliance, Errors

Sources-Genuine V Biological Variability: from person to person, from time to time, within healthy groups and within sick groups e.g. BP, blood sugar levels Environmental Variability: Nutrition, Smoking, Pollution e.g. water pollution, sanitation Sampling Variability: Cannot study the whole population- sampling variability or sampling error e.g.-two different samples show different results e.g. Different mean HB. Chance Variability: This is Unknown cannot be explained

V due to Un Standardized Methods Observer V-Different opinions-e.g., X Ray findings in TB Instrument V- e.g. B.P. on different instruments, Pain Intensity-on visual analogue scale and visual rating scale. Un standardized Questionnaire-inconsistent results Laboratory V- Improper Quality Control

V due to Partial or Erroneous Information Unavoidable Incomplete Info-e.g. comatose Avoidable Incomplete Info-e.g. particular test not done for economic reasons, unavailability of facility, suppression of facts in history taking Partial Compliance- leads to varied response of treatment e.g.- Effect of exercise, Drug Errors-Occur due to carelessness and lack of knowledge-wrong calibration, wrong reagents, wrong recording, misinterpretation e.g. Blood Group-Negative for positive

Concept of Population and Sample

Population/Sample Population-larger group of units that is the target of investigation-i.e., the group to which the findings are generalized E.g.-A drug for treatment of hyper emesis, Target population- All pregnant females with hyper emesis But cannot study the whole population- Why?? -Values for population are unknown Sample-i.e., a fraction or part of a population is actually studied.

Population/Sample SAMPLING VARIATIONS ARE BOUND TO OCCUR In view of this- Can samples be really used to estimate the characteristic of a population? Can legitimate inference be drawn about population from the sample results? If somebody else has done so can you believe these inferences? ANSWER IS--??

Population/Sample YES!!!!! PROVIDED- The sample is chosen with due precaution A. Random selection B. Inclusion of sufficient number of individuals so that the sample is able to represent the entire spectrum of features of the population-representative Sample Then, sample mean,(median, proportion) is a good estimate of the target population.

Parameter vs Statistic DEF: A measurable characteristic of a population, such as a mean or standard deviation, is called a parameter, but a measurable characteristic of a sample is called a statistic. Mean of a population is denoted by the symbol μ Mean of a sample is denoted by symbol- x

Population (Greek) Parameter Sample (Roman) Statistic Mean μ x Standard Deviation σ s Proportion π p

Increasing the Sample Size CLOSER TO TRUTH RQ: What is the Sero positivity for HIV among Blood donors in RMLH? By this study sample we want to know the TRUTH among all Blood donors. By increasing Sample size we can get closer to the truth Sample Size No. Positive % 10 x 0% 100 3 3% 1000 15 1.5% 10000 80 0.8%

Planning the Measurements Precision and Accuracy

Precision Repeatability/Reproducibility/Reliability /Consistency Precision-The degree to which the value of a measurement has nearly the same value when measured several times Precision has a very important influence on the Power of the study The more precise the measurement the greater the statistical power at a given sample size to test mean values and to test hypothesis

Precision is Affected by Random Error Error: The difference between the measurement and the truth Random error-a chance event Sources of error: observer, instrument, subject variability

Reliability The extent to which repeated measurements of a stable phenomenon - by different people and instruments, at different times and places - get similar results Reproducibility and Reliability are similar Random error affects Reliability

Reliability Intra-Rater reliability Comparison of measures by same person (test-retest; Repeatability) Inter-Rater Reliability Comparison of measures by different people

Accuracy Accuracy-The degree to which the measurement actually represents what it was intended to represent How close is the measurement to the truth? Compare result to a GOLD STANDARD Eg, serum sodium can be measured on an instrument recently calibrated against solutions made up with known concentrations of sodium

Accuracy Important influence on the Validity of the study- The degree to which the observed findings lead to correct Inference about phenomenon taking place in the study sample and in the universe

Accuracy is Affected by Systematic Error S.E.-The difference between the measurement and the truth Systematic error/bias Design flaws Protocol flaws (instrument or observer bias) Use of a rectal thermometer for oral temperatures Mis calibration of a scale Subject limitations (subject bias) Memory Mistakes in Data entry and Analysis Sources of error: observer, instrument, subject

Precision/Accuracy DEFINITION Best way to assess Value to study PRECISION The degree to which a variable has nearly same value when measured several times Comparison among repeated measures Increase power to detect effects Threatened by Random error Chance Observer Subject Instrument ACCURACY The degree to which a variable actually represents what it is supposed to represent Comparison with a reference standard Increase validity of conclusions Systematic error- Bias Observer Subject Instrument

Strategies to Reduce Random Error- Increasing Precision Standardizing the measurement method Training and certifying the observer Refining the Instrument Automating the Instrument Repeating the measurement

Strategies to Reduce Systematic Error- Increasing Accuracy Standardizing the measurement method Training and certifying the observer Refining the instrument Automating the instrument Making unobtrusive measurements Calibrating the instrument Blinding

Strategies to Reduce R Error Standardizing methods-operamanual Training observer Refining the Instrument Automating the instrument Increasing Precision Source Error Prevention Observer Subject Observer Instrument+ observer Variation in BP due to variable rate of cuff deflation Variable length of quiet sitting before BP check Variable observer technique Rounding off of BP Specify how cuff will be deflated 2mm/sec Specify Training Conceals the reading Observer Technique Variable Automatic device Subject Emotional Reaction Automatic Device Repeating the measurement O+S+I All sources of measurements Use mean of two or more measurements

Strategies to Reduce Systematic Making unobtrusive measurements Calibrating the Instrument Error-Increase Accuracy Source Error Prevention Subject Instrument Tendency to overestimate compliance with drug High reading adjustment wrong Blinding Observer Reads lower BP in t/t group Subject Tendency to over report side effects Measure drug level in urine Periodic calibration Double blind placebo Assignment concealment Same

Internal /External Validity

Validity of the Study DEF: Validity is the degree of correctness of the results. Did you measure what you were intended to measure? (Accuracy) Internal validity External validity

Internal Validity Internal validity: DID the study actually measure what it set out to? Are the findings true for the study subjects? Depends on- Selection bias, information bias, measurement bias and confounding are threats to internal validity If the study has been undertaken using wrong methods E.g, use of outdated kits, untrained technician, hemolysed samples This is a problem of Internal validity The study has to be rejected, Achieving internal validity takes a priority

External Validity External validity: To what extent are the findings generalizable to the population? Depends on- Inclusion & Exclusion criteria E.g,HIV Sero Positivity rates among those attending STD clinics? Sample is Non Representative- Systematically different from the General population Loss of generalizability if this sample is used to predict the prevalence rate in the population Problem of Representativeness arises

Normal Distribution

Properties Of Normal Curve Normal curves are symmetrical, uni modal, have a bell-shaped form. Mean, median, and mode all have the same value, are in the centre How do we know if data is normally distributed? Visual appearance of histogram Skewed: If SD > half of mean

Percent of Values Within One Standard Deviations 68.26% of Cases 42

Percent of Values Within Two Standard Deviations 95.44% of Cases 43

Percent of Values Within Three Standard Deviations 99.72% of Cases 44

Confidence Intervals

Confidence Interval CI provides a Range that is likely to contain the population mean when so obtained for repeated samples. Conventionally 95% is fixed as a limit and is called Confidence level Eg. A range from 10mg/dl-16mg/dl has 95 % chances of containing the actual population mean for HB, such a range is called 95% CI. The limits 10 and 16 are called confidence limits. There is only 5 % chance that the actual population mean is outside the range.

Confidence Interval Example-: In a study of a sample of 100 subjects it was found that the mean systolic blood pressure was 120 mm Hg with a standard deviation of 10 mm Hg. Find out 95% confidence limits for the population mean of systolic blood pressure. SE = SD / ( n ) = 10/ ( 100 ) = 10/10 =1 LL :--- mean - 1.96*1 :--- 120-1.96 = 118.04 UL :--- mean +1.96*1 :--- 120 + 1.96 = 121.96 i.e. the population mean value of systolic blood pressure will lie between 118.04 and 121.96 and we can have a confidence of 95% for making this statement.

Confidence Interval CI can be calculated for the following: Is our question that of estimation-estimation of mean, proportion Is our question that of Hypothesis testing- Difference between two means Difference between two proportion Relative Risk,Odds Ratio Smaller the CI- more Precise the measurements Wider CI- Increase the sample size

In Summary 1.All biological phenomenon-differ in characteristics from each other Variability 2.We cannot study the entire population so we have to study SAMPLES- Representative Ensures external validity/ generalisability #Role of Biostatistics

Summary contd 3.Sample should be of Adequate Size to obviate the play of chance 4.Select correct tools and techniques for correct measurements-to measure what we really intend to measure, To ensure Internal validity

Summary contd 5.This data should be collected, presented and analyzed in a scientific way, so that meaningful inferences could be drawn 6.Analyse data appropriately using appropriate statistical tests the extent of chance, random or sampling variation should be kept at a minimum acceptable level

Summary 7.Interpret results according to their clinical or public health significance and not simply go by statistical significance- Hurray the p value is <0.05,lets celebrate!!!! 8.Draw correct inferences and Generate new research Hypothesis

TAKE HOME MESSAGE Μ, x,σ, s, π, p ~!!!!!