Knowledge is Power: The Basics of SAS Proc Power

Size: px
Start display at page:

Download "Knowledge is Power: The Basics of SAS Proc Power"

Transcription

1 ABSTRACT Knowledge is Power: The Basics of SAS Proc Power Elaina Gates, California Polytechnic State University, San Luis Obispo There are many statistics applications where it is important to understand how the power function of a specific distribution behaves. Coding a power function from scratch can be an arduous process and can become complicated when investigating effect size and sample size. This presentation will cover the basic uses of proc power in regards to testing proportions and include examples of power analyses. It will also demonstrate the simplicity of using proc power to generate plots of power curves and obtain other valuable information. INTRODUCTION In hypothesis testing, there is a null and alternate hypothesis. When something is found to be statistically significant, we reject the null in favor of the alternative. Power in regards to hypothesis testing is defined as the probability of correctly rejecting the null hypothesis. In a more general definition, power is defined as the probability of rejecting the null hypothesis (without any assumptions). Many factors play a part in calculating power, one of which is sample size. This is perhaps the most common use of studying power curves prior to a study. In order to save time and money in statistical studies, researchers use power analysis to determine what their optimal sample size should be in order to show statistical significance. This paper will outline how to use proc power, specifically with proportions, to determine a suitable sample size, how to calculate power after a sample size is chosen, and how to interpret the plot of the power curve. DETERMINING SAMPLE SIZE Power analysis is useful in determining the number of subjects needed in a study or a clinical trial. One of these applications may be deciding how many subjects are needed in a control group versus a treatment group to achieve a specific level of power. For example, a new drug is being developed to treat migraine headaches. The current treatment reduces symptoms in 40% of patients; this new drug will be put into production only if its effectiveness is at least 15% higher than the current treatment. For this experiment, we need two groups of patients. One will be given the current drug and the other group will be given the new drug. The results of the groups will be compared to determine the effectiveness of the drug and if it is at least 15% more effective. Before conducting this experiment, the researchers need to know how many subjects will be needed in each group to achieve power of at least 0.8. Using proc power, we will conduct a power analysis for this experiment. The code is shown in below. We will use the twosamplefreq option. The test we will be using to compare the two groups is a Pearson Chi-Square test and this is specified in the test= option. The default of proc power is a two-sided test. In this study we will change it to a one sided test because we are interested in the improvement in symptoms. Finally, we include what level of power we want to achieve after power= and include ntotal=. so SAS calculates the sample size minimum. power = 0.8 ntotal=.; 1

2 Figure 1. Proc Power Results for Migraine Example Part I After running the power analysis, the output shows us that in order to achieve a level of power of at least 0.8 we must have a sample size of 272 subjects. Since we haven t changed any options regarding the weight of the two groups, the default is equal sample sizes. This can be altered with the groupweights= option shown below. This will give us the sample size minimum for two groups where one has twice as many subjects as the other. groupweights = (1 2) power = 0.8 ntotal=.; DETERMINING POWER FOR A GIVEN SAMPLE SIZE When conducting an experiment where the sample size has already been selected, you can use proc power to calculate the power as well as provide plots showing how the sample size will affect the power. Suppose that there are only 160 subjects who are qualified to participate in the study involving the migraine treatment described above. The researchers are curious as to how powerful the test will be with this sample size. Now we switch the code to include the total number of subjects and change power= to missing. ntotal= 160 power =.; 2

3 Figure 2. Proc Power Results for Migraine Example Part II Effect size can also have an impact on the power of a test. It is tougher to detect a small difference between the null and alternative hypothesis than a larger difference. In the migraine example, the effect size is relatively small. Using the plot statement with proc power, we can look at how different effect sizes and sample sizes change the power of this test in Figure 3. Now instead of only including 0.4 and 0.55, representing the proportion of subjects who s symptoms are improved with the current and new medication respectively, we can include many pairs. We have included a smaller effect size, represented by the pair 0.4 and 0.5, as well as two larger effect sizes. Now the output will calculate the power for all of these pairs of proportions. After the twosamplefreq options, we have included a plot option. This will generate a plot in the output. By including x=n, we are plotting sample size on the x-axis. I have also twosamplefreq test = pchi groupproportions = (0.4, 0.5)(0.4, 0.55) (0.4, 0.6) (0.4, 0.65) power =. ntotal= 300; plot x = n min = 100 max = 500; included the limits of the x-axis after min= and max=. 3

4 Figure 3. Plot and Results Generated by Proc Power The plot in Figure 3 illustrates the effects of increasing sample size and increasing effect size. We can see that as the sample size increases, so does the power. The power also increases as the effect size gets larger. This application of proc power saves a lot of time. If we were to code this plot from scratch we would need many loops and iterations. This output also calculates the power for each of the effect sizes. CONCLUSION The POWER procedure saves the user time coding and provides all of the relevant output and plots needed for a power analysis. As a student, I have found proc power to be extremely beneficial because there are many assignments in which I have needed to conduct a power analysis. It is also beneficial for SAS users who design experiments. The plot option is extremely flexible and more user-friendly than coding from scratch. Some of the other options for tests are two-sample tests involving means as well as one sample tests for both means and proportions. This procedure also has options for tests involving linear regression, survival, ANOVA, and logistic regression. Proc power is a great tool for calculating power, sample sizes, and creating plots. REFERENCES SAS Institute Inc., SAS 9.2 User Guide. The Power Procedure, Cary, NC: SAS Institute Inc., 2016 SAS Data Analysis Examples. UCLA. Statistical Consulting Group. from ACKNOWLEDGMENTS I would like to acknowledge Professor Matthew Carlton for his phenomenal instruction on power curves and hypothesis testing. I would like to thank Rebecca Ottesen for her advice on my presentation as well as her neverending help and instruction in my SAS endeavors. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the author at: Name: Elaina Gates elainagates@gmail.com Web: 4

5 SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 5

Statistical questions for statistical methods

Statistical questions for statistical methods Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

Inferential Statistics

Inferential Statistics Inferential Statistics and t - tests ScWk 242 Session 9 Slides Inferential Statistics Ø Inferential statistics are used to test hypotheses about the relationship between the independent and the dependent

More information

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA PharmaSUG 2014 - Paper SP08 Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA ABSTRACT Randomized clinical trials serve as the

More information

Sheila Barron Statistics Outreach Center 2/8/2011

Sheila Barron Statistics Outreach Center 2/8/2011 Sheila Barron Statistics Outreach Center 2/8/2011 What is Power? When conducting a research study using a statistical hypothesis test, power is the probability of getting statistical significance when

More information

TEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A.

TEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A. Proceedings of the 004 Winter Simulation Conference R G Ingalls, M D Rossetti, J S Smith, and B A Peters, eds TEACHING REGRESSION WITH SIMULATION John H Walker Statistics Department California Polytechnic

More information

Lessons in biostatistics

Lessons in biostatistics Lessons in biostatistics The test of independence Mary L. McHugh Department of Nursing, School of Health and Human Services, National University, Aero Court, San Diego, California, USA Corresponding author:

More information

Biostatistics & SAS programming

Biostatistics & SAS programming Biostatistics & SAS programming Kevin Zhang April 18, 2017 Determine Sample Size and Power 1 Errors April 18, 2017 Biostat 2 In practice When you design the study, you need to first tell how many units,

More information

Quasicomplete Separation in Logistic Regression: A Medical Example

Quasicomplete Separation in Logistic Regression: A Medical Example Quasicomplete Separation in Logistic Regression: A Medical Example Madeline J Boyle, Carolinas Medical Center, Charlotte, NC ABSTRACT Logistic regression can be used to model the relationship between a

More information

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE Spring 20 11, Volume 1, Issue 1 THE STATSWHISPERER The StatsWhisperer Newsletter is published by staff at StatsWhisperer. Visit us at: www.statswhisperer.com Introduction to this Issue The current issue

More information

Linear Regression in SAS

Linear Regression in SAS 1 Suppose we wish to examine factors that predict patient s hemoglobin levels. Simulated data for six patients is used throughout this tutorial. data hgb_data; input id age race $ bmi hgb; cards; 21 25

More information

Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX

Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX Paper 1766-2014 Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX ABSTRACT Chunhua Cao, Yan Wang, Yi-Hsin Chen, Isaac Y. Li University

More information

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process Research Methods in Forest Sciences: Learning Diary Yoko Lu 285122 9 December 2016 1. Research process It is important to pursue and apply knowledge and understand the world under both natural and social

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

appstats26.notebook April 17, 2015

appstats26.notebook April 17, 2015 Chapter 26 Comparing Counts Objective: Students will interpret chi square as a test of goodness of fit, homogeneity, and independence. Goodness of Fit A test of whether the distribution of counts in one

More information

Chapter 25. Paired Samples and Blocks. Copyright 2010 Pearson Education, Inc.

Chapter 25. Paired Samples and Blocks. Copyright 2010 Pearson Education, Inc. Chapter 25 Paired Samples and Blocks Copyright 2010 Pearson Education, Inc. Paired Data Data are paired when the observations are collected in pairs or the observations in one group are naturally related

More information

1.4 - Linear Regression and MS Excel

1.4 - Linear Regression and MS Excel 1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear

More information

Generalized Estimating Equations for Depression Dose Regimes

Generalized Estimating Equations for Depression Dose Regimes Generalized Estimating Equations for Depression Dose Regimes Karen Walker, Walker Consulting LLC, Menifee CA Generalized Estimating Equations on the average produce consistent estimates of the regression

More information

Treatment Adaptive Biased Coin Randomization: Generating Randomization Sequences in SAS

Treatment Adaptive Biased Coin Randomization: Generating Randomization Sequences in SAS Adaptive Biased Coin Randomization: OBJECTIVES use SAS code to generate randomization s based on the adaptive biased coin design (ABCD) must have approximate balance in treatment groups can be used to

More information

Advanced ANOVA Procedures

Advanced ANOVA Procedures Advanced ANOVA Procedures Session Lecture Outline:. An example. An example. Two-way ANOVA. An example. Two-way Repeated Measures ANOVA. MANOVA. ANalysis of Co-Variance (): an ANOVA procedure whereby the

More information

AP STATISTICS 2008 SCORING GUIDELINES (Form B)

AP STATISTICS 2008 SCORING GUIDELINES (Form B) AP STATISTICS 2008 SCORING GUIDELINES (Form B) Question 4 Intent of Question The primary goals of this question were to assess a student s ability to (1) design an experiment to compare two treatments

More information

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data TECHNICAL REPORT Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data CONTENTS Executive Summary...1 Introduction...2 Overview of Data Analysis Concepts...2

More information

Before we get started:

Before we get started: Before we get started: http://arievaluation.org/projects-3/ AEA 2018 R-Commander 1 Antonio Olmos Kai Schramm Priyalathta Govindasamy Antonio.Olmos@du.edu AntonioOlmos@aumhc.org AEA 2018 R-Commander 2 Plan

More information

112 Statistics I OR I Econometrics A SAS macro to test the significance of differences between parameter estimates In PROC CATMOD

112 Statistics I OR I Econometrics A SAS macro to test the significance of differences between parameter estimates In PROC CATMOD 112 Statistics I OR I Econometrics A SAS macro to test the significance of differences between parameter estimates In PROC CATMOD Unda R. Ferguson, Office of Academic Computing Mel Widawski, Office of

More information

Psychology Research Process

Psychology Research Process Psychology Research Process Logical Processes Induction Observation/Association/Using Correlation Trying to assess, through observation of a large group/sample, what is associated with what? Examples:

More information

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1 From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...

More information

An Interactive SAS/AF System For Sample Size Calculation

An Interactive SAS/AF System For Sample Size Calculation An Interactive SAS/AF System For Sample Size Calculation Deborah J. Matos CIBA-GEIGY Corporation, Drug Development Department, Summit, NJ ABSTRACT In the design of a clinical trial, the number of patients

More information

Pitfalls in Linear Regression Analysis

Pitfalls in Linear Regression Analysis Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID # STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D) Exam Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D) Decide whether or not the conditions and assumptions for inference with

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

ANOVA in SPSS (Practical)

ANOVA in SPSS (Practical) ANOVA in SPSS (Practical) Analysis of Variance practical In this practical we will investigate how we model the influence of a categorical predictor on a continuous response. Centre for Multilevel Modelling

More information

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Day 11: Measures of Association and ANOVA

Day 11: Measures of Association and ANOVA Day 11: Measures of Association and ANOVA Daniel J. Mallinson School of Public Affairs Penn State Harrisburg mallinson@psu.edu PADM-HADM 503 Mallinson Day 11 November 2, 2017 1 / 45 Road map Measures of

More information

4 Diagnostic Tests and Measures of Agreement

4 Diagnostic Tests and Measures of Agreement 4 Diagnostic Tests and Measures of Agreement Diagnostic tests may be used for diagnosis of disease or for screening purposes. Some tests are more effective than others, so we need to be able to measure

More information

A Comparison of Linear Mixed Models to Generalized Linear Mixed Models: A Look at the Benefits of Physical Rehabilitation in Cardiopulmonary Patients

A Comparison of Linear Mixed Models to Generalized Linear Mixed Models: A Look at the Benefits of Physical Rehabilitation in Cardiopulmonary Patients Paper PH400 A Comparison of Linear Mixed Models to Generalized Linear Mixed Models: A Look at the Benefits of Physical Rehabilitation in Cardiopulmonary Patients Jennifer Ferrell, University of Louisville,

More information

How to analyze correlated and longitudinal data?

How to analyze correlated and longitudinal data? How to analyze correlated and longitudinal data? Niloofar Ramezani, University of Northern Colorado, Greeley, Colorado ABSTRACT Longitudinal and correlated data are extensively used across disciplines

More information

Improved Transparency in Key Operational Decisions in Real World Evidence

Improved Transparency in Key Operational Decisions in Real World Evidence PharmaSUG 2018 - Paper RW-06 Improved Transparency in Key Operational Decisions in Real World Evidence Rebecca Levin, Irene Cosmatos, Jamie Reifsnyder United BioSource Corp. ABSTRACT The joint International

More information

Lab 8: Multiple Linear Regression

Lab 8: Multiple Linear Regression Lab 8: Multiple Linear Regression 1 Grading the Professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

A SAS Macro for Adaptive Regression Modeling

A SAS Macro for Adaptive Regression Modeling A SAS Macro for Adaptive Regression Modeling George J. Knafl, PhD Professor University of North Carolina at Chapel Hill School of Nursing Supported in part by NIH Grants R01 AI57043 and R03 MH086132 Overview

More information

Basic Features of Statistical Analysis and the General Linear Model

Basic Features of Statistical Analysis and the General Linear Model 01-Foster-3327(ch-01).qxd 9/5/2005 5:48 PM Page 1 1 Basic Features of Statistical Analysis and the General Linear Model INTRODUCTION The aim of this book is to describe some of the statistical techniques

More information

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,

More information

Dan Byrd UC Office of the President

Dan Byrd UC Office of the President Dan Byrd UC Office of the President 1. OLS regression assumes that residuals (observed value- predicted value) are normally distributed and that each observation is independent from others and that the

More information

MAKING THE NSQIP PARTICIPANT USE DATA FILE (PUF) WORK FOR YOU

MAKING THE NSQIP PARTICIPANT USE DATA FILE (PUF) WORK FOR YOU MAKING THE NSQIP PARTICIPANT USE DATA FILE (PUF) WORK FOR YOU Hani Tamim, PhD Clinical Research Institute Department of Internal Medicine American University of Beirut Medical Center Beirut - Lebanon Participant

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

Zheng Yao Sr. Statistical Programmer

Zheng Yao Sr. Statistical Programmer ROC CURVE ANALYSIS USING SAS Zheng Yao Sr. Statistical Programmer Outline Background Examples: Accuracy assessment Compare ROC curves Cut-off point selection Summary 2 Outline Background Examples: Accuracy

More information

Research Example Aliza Ben-Zacharia DrNP, ANP

Research Example Aliza Ben-Zacharia DrNP, ANP Research Example Aliza Ben-Zacharia DrNP, ANP Example - Outline Introduction Depression & MS Research Question? Hypotheses Study Design Tools/Variables Beck Depression Inventory (BDI) BDI-II BDI-Fast-Screen

More information

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

Data Analysis with SPSS

Data Analysis with SPSS Data Analysis with SPSS A First Course in Applied Statistics Fourth Edition Stephen Sweet Ithaca College Karen Grace-Martin The Analysis Factor Allyn & Bacon Boston Columbus Indianapolis New York San Francisco

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

Psychology Research Process

Psychology Research Process Psychology Research Process Logical Processes Induction Observation/Association/Using Correlation Trying to assess, through observation of a large group/sample, what is associated with what? Examples:

More information

Introduction to Survival Analysis Procedures (Chapter)

Introduction to Survival Analysis Procedures (Chapter) SAS/STAT 9.3 User s Guide Introduction to Survival Analysis Procedures (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation

More information

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies. Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam May 28 th, 2015: 9am to 1pm Instructions: 1. There are seven questions and 12 pages. 2. Read each question carefully. Answer

More information

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS 17 December 2009 Michael Wood University of Portsmouth Business School SBS Department, Richmond Building Portland Street, Portsmouth

More information

Statistics Assignment 11 - Solutions

Statistics Assignment 11 - Solutions Statistics 44.3 Assignment 11 - Solutions 1. Samples were taken of individuals with each blood type to see if the average white blood cell count differed among types. Eleven individuals in each group were

More information

ABSTRACT THE INDEPENDENT MEANS T-TEST AND ALTERNATIVES SESUG Paper PO-10

ABSTRACT THE INDEPENDENT MEANS T-TEST AND ALTERNATIVES SESUG Paper PO-10 SESUG 01 Paper PO-10 PROC TTEST (Old Friend), What Are You Trying to Tell Us? Diep Nguyen, University of South Florida, Tampa, FL Patricia Rodríguez de Gil, University of South Florida, Tampa, FL Eun Sook

More information

ANALYZING ALCOHOL BEHAVIOR IN SAN LUIS OBISPO COUNTY

ANALYZING ALCOHOL BEHAVIOR IN SAN LUIS OBISPO COUNTY ANALYZING ALCOHOL BEHAVIOR IN SAN LUIS OBISPO COUNTY Ariana Montes In Partial Fulfillment of the Requirements for the Degree Bachelor of Science, Statistics December 2014 TABLE OF CONTENTS Methods 2 Part

More information

CHAPTER 2 TAGUCHI OPTIMISATION TECHNIQUE

CHAPTER 2 TAGUCHI OPTIMISATION TECHNIQUE 8 CHAPTER 2 TAGUCHI OPTIMISATION TECHNIQUE 2.1 OVERVIEW OF TAGUCHI METHOD The Design of Experiments (DOE) is a powerful statistical technique introduced by Fisher R. in England in the 1920s (Ross 1996),

More information

STP 231 Example FINAL

STP 231 Example FINAL STP 231 Example FINAL Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.

More information

ABSTRACT INTRODUCTION

ABSTRACT INTRODUCTION Adaptive Randomization: Institutional Balancing Using SAS Macro Rita Tsang, Aptiv Solutions, Southborough, Massachusetts Katherine Kacena, Aptiv Solutions, Southborough, Massachusetts ABSTRACT Adaptive

More information

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided.

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided. Problem Set 3 ECN 140 - Econometrics Professor Oscar Jorda Name ESSAY. Write your answer in the space provided. 1) Sir Francis Galton, a cousin of James Darwin, examined the relationship between the height

More information

This module illustrates SEM via a contrast with multiple regression. The module on Mediation describes a study of post-fire vegetation recovery in

This module illustrates SEM via a contrast with multiple regression. The module on Mediation describes a study of post-fire vegetation recovery in This module illustrates SEM via a contrast with multiple regression. The module on Mediation describes a study of post-fire vegetation recovery in southern California woodlands. Here I borrow that study

More information

Quantitative Evaluation

Quantitative Evaluation Quantitative Evaluation Research Questions Quantitative Data Controlled Studies Experimental Methods Role of Statistics Quantitative Evaluation What is experimental design? What is an experimental hypothesis?

More information

Exam 4 Review Exercises

Exam 4 Review Exercises Math 160: Statistics Spring, 2014 Toews Exam 4 Review Exercises Instructions: Working in groups of 2-4, first review the goals and objectives for this exam (listed below) and then work the following problems.

More information

A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value

A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value SPORTSCIENCE Perspectives / Research Resources A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value Will G Hopkins sportsci.org Sportscience 11,

More information

Learning with Rare Cases and Small Disjuncts

Learning with Rare Cases and Small Disjuncts Appears in Proceedings of the 12 th International Conference on Machine Learning, Morgan Kaufmann, 1995, 558-565. Learning with Rare Cases and Small Disjuncts Gary M. Weiss Rutgers University/AT&T Bell

More information

Bayes Factors for t tests and one way Analysis of Variance; in R

Bayes Factors for t tests and one way Analysis of Variance; in R Bayes Factors for t tests and one way Analysis of Variance; in R Dr. Jon Starkweather It may seem like small potatoes, but the Bayesian approach offers advantages even when the analysis to be run is not

More information

T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design

T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design James A. Bolognese, Cytel Nitin Patel, Cytel Yevgen Tymofyeyef, Merck Inna Perevozskaya, Wyeth

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2 UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2 DATE: 3 May 2017 MARKS: 75 ASSESSOR: Prof PJ Blignaut MODERATOR: Prof C de Villiers (UP) TIME: 2 hours

More information

Bangor University Laboratory Exercise 1, June 2008

Bangor University Laboratory Exercise 1, June 2008 Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground

More information

Choosing a Significance Test. Student Resource Sheet

Choosing a Significance Test. Student Resource Sheet Choosing a Significance Test Student Resource Sheet Choosing Your Test Choosing an appropriate type of significance test is a very important consideration in analyzing data. If an inappropriate test is

More information

The Geography of Viral Hepatitis C in Texas,

The Geography of Viral Hepatitis C in Texas, The Geography of Viral Hepatitis C in Texas, 1992 1999 Author: Mara Hedrich Faculty Mentor: Joseph Oppong, Department of Geography, College of Arts and Sciences & School of Public Health, UNT Health Sciences

More information

Chapter 17 Sensitivity Analysis and Model Validation

Chapter 17 Sensitivity Analysis and Model Validation Chapter 17 Sensitivity Analysis and Model Validation Justin D. Salciccioli, Yves Crutain, Matthieu Komorowski and Dominic C. Marshall Learning Objectives Appreciate that all models possess inherent limitations

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Ashwini S Erande MPH, Shaista Malik MD University of California Irvine, Orange, California

Ashwini S Erande MPH, Shaista Malik MD University of California Irvine, Orange, California The Association of Morbid Obesity with Mortality and Coronary Revascularization among Patients with Acute Myocardial Infarction using ARRAYS, PROC FREQ and PROC LOGISTIC ABSTRACT Ashwini S Erande MPH,

More information

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:

More information

A SAS sy Study of ediary Data

A SAS sy Study of ediary Data A SAS sy Study of ediary Data ABSTRACT PharmaSUG 2017 - Paper BB14 A SAS sy Study of ediary Data Amie Bissonett, inventiv Health Clinical, Minneapolis, MN Many sponsors are using electronic diaries (ediaries)

More information

Predicting New Customer Retention for Online Dieting & Fitness Programs

Predicting New Customer Retention for Online Dieting & Fitness Programs Predicting New Customer Retention for Online Dieting & Fitness Programs December 11, 2007 BUDT733 DC01 Team Four Amy Brunner Harin Sandhoo Lilah Pomerance Paola Nasser Srinath Bala Executive Summary GymAmerica.com

More information

A SAS Macro to Investigate Statistical Power in Meta-analysis Jin Liu, Fan Pan University of South Carolina Columbia

A SAS Macro to Investigate Statistical Power in Meta-analysis Jin Liu, Fan Pan University of South Carolina Columbia Paper 109 A SAS Macro to Investigate Statistical Power in Meta-analysis Jin Liu, Fan Pan University of South Carolina Columbia ABSTRACT Meta-analysis is a quantitative review method, which synthesizes

More information

"Homegrown" Exercises around M&M Chapter 6-1- Help a journalist to be "statistically correct" age-related prevalence, and conflicting evidence exists in favor of the mortality hypothesis. We compared mortality

More information

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized) Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions

More information

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug? MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference

More information

Definition 1: A fixed point iteration scheme to approximate the fixed point, p, of a function g, = for all n 1 given a starting approximation, p.

Definition 1: A fixed point iteration scheme to approximate the fixed point, p, of a function g, = for all n 1 given a starting approximation, p. Supplemental Material: A. Proof of Convergence In this Appendix, we provide a computational proof that the circadian adjustment method (CAM) belongs to the class of fixed-point iteration schemes (FPIS)

More information

The Association of Morbid Obesity with Mortality and Coronary Revascularization among Patients with Acute Myocardial Infarction

The Association of Morbid Obesity with Mortality and Coronary Revascularization among Patients with Acute Myocardial Infarction PharmaSUG 2014 - Paper HA06 The Association of Morbid Obesity with Mortality and Coronary Revascularization among Patients with Acute Myocardial Infarction ABSTRACT Ashwini S Erande MPH University Of California

More information

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials.

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials. Paper SP02 Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials. José Luis Jiménez-Moro (PharmaMar, Madrid, Spain) Javier Gómez (PharmaMar, Madrid, Spain) ABSTRACT

More information

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special

More information

Choosing the Correct Statistical Test

Choosing the Correct Statistical Test Choosing the Correct Statistical Test T racie O. Afifi, PhD Departments of Community Health Sciences & Psychiatry University of Manitoba Department of Community Health Sciences COLLEGE OF MEDICINE, FACULTY

More information

a) Is it reasonable to compute the relative risk for endometrial cancer? Explain. b) Can we estimate relative risk for heart attacks? Explain.

a) Is it reasonable to compute the relative risk for endometrial cancer? Explain. b) Can we estimate relative risk for heart attacks? Explain. 1. A 1980 study investigated the relationship between the use of oral contraceptives and the development of endometrial cancer. It was found that of 117 endometrial cancer patients, 6 had used oral contraceptives

More information

Stat 13, Lab 11-12, Correlation and Regression Analysis

Stat 13, Lab 11-12, Correlation and Regression Analysis Stat 13, Lab 11-12, Correlation and Regression Analysis Part I: Before Class Objective: This lab will give you practice exploring the relationship between two variables by using correlation, linear regression

More information

Reverse Engineering a Regression Table. Achim Kemmerling Willy Brandt School Erfurt University

Reverse Engineering a Regression Table. Achim Kemmerling Willy Brandt School Erfurt University Reverse Engineering a Regression Table Achim Kemmerling Willy Brandt School Erfurt University Folie 2 Reverse Engineering a Regression Table Many students of social sciences/ researchers react to statistics

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information