STATS8: Introduction to Biostatistics. Overview. Babak Shahbaba Department of Statistics, UCI

Similar documents
Analysis of TB prevalence surveys

Chapter 13. Experiments and Observational Studies

Quantitative Methods. Lonnie Berger. Research Training Policy Practice

Chapter 5: Producing Data

Chapter 13 Summary Experiments and Observational Studies

Statistics and Probability

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Psych 1Chapter 2 Overview

Biostatistics for Med Students. Lecture 1

UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4

Copyright 2014, 2011, and 2008 Pearson Education, Inc. 1-1

Villarreal Rm. 170 Handout (4.3)/(4.4) - 1 Designing Experiments I

Political Science 15, Winter 2014 Final Review

Introduction to Research Methods

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

Chapter 1 Data Types and Data Collection. Brian Habing Department of Statistics University of South Carolina. Outline

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

Basic Concepts in Research and DATA Analysis

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Lecture 4: Research Approaches

Inferences: What inferences about the hypotheses and questions can be made based on the results?

Design of Experiments & Introduction to Research

Chapter 3. Producing Data

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Vocabulary. Bias. Blinding. Block. Cluster sample

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Chapter 02. Basic Research Methodology

Chapter 11: Experiments and Observational Studies p 318

Chapter 1 Introduction to Educational Research

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

CHAPTER 4 Designing Studies

aps/stone U0 d14 review d2 teacher notes 9/14/17 obj: review Opener: I have- who has

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

MAT 155. Chapter 1 Introduction to Statistics. Key Concept. Basics of Collecting Data. August 20, S1.5_3 Collecting Sample Data

What is Psychology? chapter 1

Chapter 1: Data Collection Pearson Prentice Hall. All rights reserved

Psychology of Dysfunctional Behaviour RESEARCH METHODS

Chapter 1 - Sampling and Experimental Design

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4)

Unit 1 Exploring and Understanding Data

BIOSTATISTICAL METHODS

Carrying out an Empirical Project

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

Introduction to Applied Research in Economics

Why Does Sampling Matter? Answers From the Gender Norms and Labour Supply Project

10.1 Estimating with Confidence. Chapter 10 Introduction to Inference

9. Interpret a Confidence level: "To say that we are 95% confident is shorthand for..

Raya Abu-Tawileh. Dana Alrafaiah. Hamza Alduraidi. 1 P a g e

Statistical Significance, Effect Size, and Practical Significance Eva Lawrence Guilford College October, 2017

REVIEW FOR THE PREVIOUS LECTURE

Handout 1: Introduction to the Research Process and Study Design STAT 335 Fall 2016

Design, Sampling, and Probability

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Analysis A step in the research process that involves describing and then making inferences based on a set of data.

Clinical Research Scientific Writing. K. A. Koram NMIMR

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology

Randomization as a Tool for Development Economists. Esther Duflo Sendhil Mullainathan BREAD-BIRS Summer school

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

2 Critical thinking guidelines

Mary Emmett, PhD, FACHE Center for Health Services and Outcomes Research

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

Lecturer: Dr. Emmanuel Adjei Department of Information Studies Contact Information:

A conversation with Professor David Chalmers, May 20, 2016 Participants

Regression Discontinuity Analysis

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

STA Module 1 The Nature of Statistics. Rev.F07 1

STA Rev. F Module 1 The Nature of Statistics. Learning Objectives. Learning Objectives (cont.

Student Performance Q&A:

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

observational studies Descriptive studies

Lecture Outline Biost 517 Applied Biostatistics I

Chapter 8: Estimating with Confidence

Technical Specifications

Statistical Issues in Road Safety Part I: Uncertainty, Variability, Sampling

MAT Mathematics in Today's World

Designed Experiments have developed their own terminology. The individuals in an experiment are often called subjects.

Slide 1 - Introduction to Statistics Tutorial: An Overview Slide notes

Research Questions, Variables, and Hypotheses: Part 1. Overview. Research Questions RCS /2/04

QUASI EXPERIMENTAL DESIGN

Data collection, summarizing data (organization and analysis of data) The drawing of inferences about a population from a sample taken from

AP Statistics. Semester One Review Part 1 Chapters 1-5

Introduction, Evidence, and Sampling

BASIC CONCEPTS IN RESEARCH AND DATA ANALYSIS

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

challenges we face in studying interviewing, and make some comments about future research about interviewing.

The Practice of Statistics 1 Week 2: Relationships and Data Collection

Quantitative and qualitative research

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

Causal Research Design- Experimentation

How Faithful is the Old Faithful? The Practice of Statistics, 5 th Edition 1

IAPT: Regression. Regression analyses

Sta 309 (Statistics And Probability for Engineers)

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Chapter 1 A Cultural Approach to Child Development

Undertaking statistical analysis of

Transcription:

STATS8: Introduction to Biostatistics Overview Babak Shahbaba Department of Statistics, UCI

The role of statistical analysis in science This course discusses some biostatistical methods, which involve applying statistical methods to biological problems. We use empirical evidence to study populations and make informed decisions. To study a population, we measure a set of characteristics, which we refer to as variables. The objective of many scientific studies is to learn about the variation of a specific characteristic (e.g., BMI, disease status) in the population of interest.

The role of statistical analysis in science In many studies, we are interested in possible relationships among different variables. We refer to the variables that are the main focus of our study as the response (or target) variables. In contrast, we call variables that explain or predict the variation in the response variable as explanatory variables or predictors depending on the role of these variables. Statistical analysis begins with a scientific problem usually presented in the form of a hypothesis testing or a prediction problem.

Sampling To answer our scientific questions, we would, ideally, observe or perform an experiment on all members of the population of interest. However, this is usually impossible either physically, ethically, or economically. Instead, we select a sample of representative members from the population. Then with the methods of statistical inference, the conclusions based on the sample can cautiously be attributed to the whole population.

Sampling The samples are selected randomly (i.e., with some probability) from the population. Unless stated otherwise, these randomly selected members of populations are assumed to be independent. The selected members (e.g., people, households, cells) are called sampling units. The individual entities from which we collect information are called observation units, or simply observations. Our sample must be representative of the population, and their environments should be comparable to that of the whole population.

Some sampling schemes: Simple random sampling Stratified sampling Cluster sampling Sampling

Observational studies and experiments After obtaining the sample, the next step is gathering the relevant information from the selected members. In observational studies, researchers are passive examiners, trying to have the least impact on the data collection process. Observational studies are quite helpful in detecting relationships among characteristics. When studying the relationships between characteristics, it is important to distinguish between association and causality. It is usually easier to establish causality by using experiments. In experiments, researchers attempt to control the process as much as possible.

Observational studies and experiments Retrospective and prospective observational studies Case-control studies Randomization, replication, and blocking in experiments Cross-Sectional, Longitudinal, and Time Series data

Data exploration After collecting data, the next step towards statistical inference and decision making is to perform data exploration, which involves visualizing and summarizing the data. The objective of data visualization is to obtain a high level understanding of the sample and their observed (measured) characteristics. To make the data more manageable, we need to further reduce the amount of information in some meaningful ways so that we can focus on the key aspects of the data. Summary statistics are used for this purpose.

Data exploration Using data exploration techniques, we can learn about the distribution of a variable. Informally, the distribution of a variable tells us the possible values it can take, the chance of observing those values, and how often we expect to see them in a random sample from the population. Through data exploration, we might detect previously unknown patterns and relationships that are worth further investigation. We can also identify possible data issues, such as unexpected or unusual measurements, known as outliers.

Statistical inference We collect data on a sample from the population in order to learn about the whole population. For example, Mackowiak, et al. (1992) measure the normal body temperature for 148 people to learn about the normal body temperature for the entire population. In this case, we say we are estimating the unknown population average. However, the characteristics and relationships in the whole population remain unknown. Therefore, there is always some uncertainty associated with our estimations.

Statistical inference In Statistics, the mathematical tool to address uncertainty is probability. The process of using the data to draw conclusions about the whole population, while acknowledging the extent of our uncertainty about our findings, is called statistical inference. The knowledge we acquire from data through statistical inference allows us to make decisions with respect to the scientific problem that motivated our study and our data analysis.

Computation We usually use computer programs to perform most of our statistical analysis and inference. The computer programs commonly used for this purpose are SAS, STATA, SPSS, MINITAB, MATLAB, and R. R is free and arguably the most common software among statisticians. For the purpose of this course, we use R-Commander, which allows us to do basic statistical analysis without necessarily learning the programing language of R. You are however encouraged to learn R for additional flexibility in your data analysis.