Biostatistics 513 Spring Homework 1 Key solution. See the Appendix below for Stata code and output related to this assignment.

Size: px
Start display at page:

Download "Biostatistics 513 Spring Homework 1 Key solution. See the Appendix below for Stata code and output related to this assignment."

Transcription

1 Biostatistics 513 Spring 2010 Homework 1 Key solution * Question will be graded for points, the remaining question is credit/no credit. See the Appendix below for Stata code and output related to this assignment. 1. *(Hosmer & Lemeshow (1989)). Infant mortality and birth defect rates are elevated for low birth weight infants (defined as birth weight below 2500gms). A study undertaken in Massachusetts investigated the impact of various maternal characteristics on the risk of a low birth weight delivery. Information and data on this study are available in lowbwt description.pdf and lowbwt.txt respectively on the class website. Input these data into STATA and recode mothers pre pregnancy weight into five categories , , , and >150 lbs. [20 points] a) Construct a contingency table showing the frequencies and percentages of low birth weight infants according to these categories of maternal weight. [3] Table of observed counts and percentages of low birth weight infants across the five categories of maternal weight (1: , 2: , 3: , 4: , 5: >150 lbs): Maternal weight (categorized) Birth weight Normal (%) Low (%) Total Total b) Calculate and plot the estimated probability of low birthweight for these maternal weight categories. [3] Estimated probabilities of low birth weight across the five categories of maternal weight: Maternal weight Category Estimated probability of low bw Note the plot below is included for your interest. Stata code for a plot without the confidence intervals is provided in the Stata appendix.

2 Estimated probabilities of low birth weight infant across maternal weight categories Probabilities Maternal weight category c) Calculate the estimated odds ratios for low birthweight for maternal weight categories , , , lbs versus weight category >150 lbs. [2] First, we calculate the odds of low birth weight for each category of maternal weight: Mat. Weight Category Odds of low bw The odds ratios of categories 1-4 versus category 5 are then easily computed as following ratios: OR1 = 0.893/0.241 = 3.7 (OR of cat.1 versus 5, etc.) OR2 = 0.3/0.241 = 1.24 OR3 = 0.55/0.241 = 2.28 OR4 = 0.304/0.241 = 1.26 d) We are interested in assessing whether maternal pre pregnancy weight is associated with low birth weight. State the null hypothesis and the alternative hypothesis corresponding to i) an unordered test; ii) an ordered test. [4] The null hypothesis is H 0 : p 1 =p 2 = =p 5, where p i is the probability of low birth weight infant born to a mother with pre-pregnancy weight in the category i. i) The unordered alternative is Ha: at least one p i is not equal to the others ii) The ordered alternative is Ha: p1>=p2>= >=p5 (or <=), with strict inequality for at least one pair. e) Calculate the expected cell counts under the null hypotheses. [2] The expected cell counts under the null hypothesis are equal to

3 E ij = m i * n j / N, where m i is the marginal count for i-th row and n j is the marginal count for the j-th column. Maternal weight (cat.) Birth weight Total Normal Low Total f) Carry out an appropriate test for the unordered hypothesis in d). Provide the value of your test statistic and the appropriate reference distribution. State your conclusion. [3] The appropriate test for the unordered alternative hypothesis is the test of independence as we have a random sample of subjects (189 mother-baby pairs). The test statistic has a χ 2 (4) distribution under H 0 and in our case its value is With p-value = , we reject the hypothesis and conclude that there is evidence of an association between maternal pre-pregnancy weight and low birth weight of the infant. g) Carry out an appropriate test for the ordered hypothesis in d). Provide the value of your test statistic and the appropriate reference distribution. State your conclusion. [3] The appropriate test for the ordered alternative hypothesis is the score test for trend. The test statistic has a χ 2 (1) distribution under H 0 and in our case, with scaling of the weight categories 1,2,3,4,5, its value is With p- value = , we reject the hypothesis that the maternal pre-pregnancy weight is not associated with low birth weight and conclude that there statistically significant trend in the probability of low birth weight with increasing maternal weight. Note #1: This example shows that one can achieve greater power by designing a statistical procedure to test for a specific alternative (ordered probabilities, in this case) compared to a general alternative. Note #2: If we were to carry out the test with another maternal weight scaling (instead of the above values of 1,2,3,4,5), we find that the test statistic will depend on the choice of the values, or level of categories (lecture p.30). For example, if use actual weight values 100, 115, 125, 140 and 200 we find that we obtain a smaller test statistic =5.59 with p-value=0.0181, but our conclusion is unchanged. The choice of scaling should be made a priori based on what is meaningful for the particular context. 2. In a study of HIV infection among women entering the NYS prison system, 475 inmates were cross-classified with respect to HIV seropositivity and their history of intravenous drug use (Smith et al., 1991, AJPH Supplement). Use either direct calculations or STATA to answer the following questions (Note: If using Stata, compare the cci and csi commands. Which is more appropriate?) HIV+ HIV- Total IVDU Yes IVDU No Total a) Test the null hypothesis that there is no association between IVDU and HIV serostatus. State the hypotheses, and write a sentence (or two) that explains the results to a general medical audience. Ho: P(HIV IVDU) = P(HIV not IVDU) or Ho: HIV and IVDU are independent,

4 H a : P(HIV IVDU) P(HIV not IVDU). This is a cross-sectional sample so we are talking about disease prevalence, not incidence, and use the chi-square test of independence (lecture p.60). We see that the chi-square statistic for testing this hypothesis is 78.0 and the p-value for this test is very small (p < 0.001). Therefore, we conclude that the risk of HIV infection in this population of women is associated with a history of IVDU use. b) Give a point estimate and 95% confidence interval for the difference in HIV risk in the two groups and write a sentence (or two) that explains this interval to a general medical audience. The risk of HIV infection in women in this sample who do not report a history of IVDU use is 29/339 = 0.08 while the risk of HIV infection among women in this sample who report a history of IVDU use is 59/136 = Therefore, the estimated difference in risk (the increased risk among women reporting a history of IVDU use compared to those who do not report a history of IVDU use) is A 95% confidence interval for the true difference in prevalence between these populations is (0.26, 0.44) (lecture p.61, 64). A summary might be: Among incarcerated women who report a history of IV drug use, we find that 43% are HIV seropositive while among women who do not report a history of IV drug use, we find that 8% are HIV seropositive. The increased prevalence associated with IV drug use as estimated by the risk difference is 35% (95% CI: 26%, 44%). These results suggest a relatively high prevalence of HIV among incarcerated women that do not report IV drug use (8%), and a statistically significant increase in seropositivity among the women that do report IV drug use. c) Give a point estimate and 95% confidence interval for the relative risk of HIV, comparing IV drug using women to women that are not IV drug users, and write a sentence (or two) that explains this interval to a general medical audience. As discussed above, the prevalence of HIV infection in women in this sample who do not report a history of IVDU use is 29/339 = 0.08 while the risk of HIV infection among women in this sample who report a history of IVDU use is 59/136 = Therefore, the estimated relative risk is 0.43/0.08 = 5.1 with a 95% confidence interval of (3.4, 7.5). Thus, the prevalence of HIV infection in women reporting a history of IVDU is 5.1 times higher than the prevalence of HIV infection in women who do not report a history of IVDU use (lecture p.61, 64). d) Give a point estimate and 95% confidence interval for the odds ratio, comparing IV drug using women to women that are not IV drug users, and write a sentence (or two) that explains this interval to a general medical audience. The estimated odds of HIV seropositivity among women who report IV drug use was (59/136)/(1-59/136))=0.766 while among women who did not report IV drug use, the odds was (29/339)/(1-29/339))= The association between HIV seropositivity and IV drug use can be summarized by the odds ratio of 8.2 (95% confidence interval 4.9, 13.6) that compares the odds of seropositivity among IDVU women relative to the odds of seropositivity among non-ivdu women. These results suggest a more than an 8-fold increase in the odds of being seropositive among women that report IV drug use relative to those who do not. e) Based on these summaries, what do you conclude? These results suggest a relatively high prevalence of HIV seropositivity among incarcerated women that do not report IV drug use (8%), but an additional 35% are seropositive among the women that do report IV drug use. We further conclude that the risk of HIV seropositivity is 5-fold greater among incarcerated women using IV drugs as

5 compared to those not using IV drugs and that odds of HIV seropositivity is 8-fold greater among incarcerated women exposed to IV drug use as compared to those not exposed to IV drug use. Note: Since HIV seropositivity has a prevalence of 8% among the unexposed, the OR overestimates the RR if used as an approximation of the RR (lecture p.54). Appendix Problem 1. infile id lbwt age lwt race smoke ptl hyper urirr pvft bwt using.../lowbwt.txt (maternal weight categorized): gen wtcat=1 if lwt<111 replace wtcat=2 if lwt>110 & lwt<=120 replace wtcat=3 if lwt>120 & lwt<=130 replace wtcat=4 if lwt>130 & lwt<=150 replace wtcat=5 if lwt>150 (a) tabulate lbwt wtcat, col wtcat lbwt Total Total (b) mean(lbwt), over(wtcat) Over Mean Std. Err. [95% Conf. Interval] lbwt graph dot (mean) lbwt, over(wtcat) (c) tabodds lbwt wtcat wtcat cases controls odds [95% Conf. Interval]

6 (e) tabulate lbwt wtcat, expected wtcat lbwt Total Total (f) + (g) tabodds lbwt wtcat Test of homogeneity (equal odds): chi2(4) = Pr>chi2 = Score test for trend of odds: chi2(1) = 6.74 Pr>chi2 = * with an alternative scaling for weight: gen wtcat2=100 if lwt<111 replace wtcat2=115 if lwt>110 & lwt<=120 replace wtcat2=125 if lwt>120 & lwt<=130 replace wtcat2=140 if lwt>130 & lwt<=150 replace wtcat2=200 if lwt>150 tabodds lbwt wtcat2 Test of homogeneity (equal odds): chi2(4) = Pr>chi2 = Score test for trend of odds: chi2(1) = 5.59 Pr>chi2 = Problem 2. Note: The csi command with the or option provided the information necessary to answer the question. The csi command is appropriate for cohort or cross-sectional studies. The cci command is for case-control studies and only gives odds ratios.. csi , or Exposed Unexposed Total Cases Noncases Total Risk Point estimate [95% Conf. Interval] Risk difference Risk ratio Attr. frac. ex Attr. frac. pop Odds ratio (Cornfield) chi2(1) = Pr>chi2 =

Contingency Tables Summer 2017 Summer Institutes 187

Contingency Tables Summer 2017 Summer Institutes 187 Contingency Tables 87 Overview ) Types of Variables ) Comparing () Categorical Variables Contingency (two-way) tables Tests 3) x Tables Sampling designs Testing for association Estimation of effects Paired

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Stratified Tables. Example: Effect of seat belt use on accident fatality

Stratified Tables. Example: Effect of seat belt use on accident fatality Stratified Tables Often, a third measure influences the relationship between the two primary measures (i.e. disease and exposure). How do we remove or control for the effect of the third measure? Issues

More information

Section F. Measures of Association: Risk Difference, Relative Risk, and the Odds Ratio

Section F. Measures of Association: Risk Difference, Relative Risk, and the Odds Ratio This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Statistics Assignment 11 - Solutions

Statistics Assignment 11 - Solutions Statistics 44.3 Assignment 11 - Solutions 1. Samples were taken of individuals with each blood type to see if the average white blood cell count differed among types. Eleven individuals in each group were

More information

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Section D. Another Non-Randomized Study Design: The Case-Control Design

Section D. Another Non-Randomized Study Design: The Case-Control Design This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Basic statistics for public health and policy

Basic statistics for public health and policy PHM102 Basic statistics for public health and policy FORMATIVE ASSIGNMENT Write (preferably type*) answers to BOTH parts of this assignment. Show your working and give reasons for your answers. *See Hint

More information

BMI 541/699 Lecture 16

BMI 541/699 Lecture 16 BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables -

More information

Statistical Reasoning in Public Health Biostatistics 612, 2009, HW#3

Statistical Reasoning in Public Health Biostatistics 612, 2009, HW#3 Statistical Reasoning in Public Health Biostatistics 612, 2009, HW#3 1. A random sample of 200 patients admitted to an adult intensive care unit (ICU) was collected to examine factors associated with death

More information

Risk Ratio and Odds Ratio

Risk Ratio and Odds Ratio Risk Ratio and Odds Ratio Risk and Odds Risk is a probability as calculated from one outcome probability = ALL possible outcomes Odds is opposed to probability, and is calculated from one outcome Odds

More information

Lecture 5. Contingency /incidence tables Sensibility, specificity Relative Risk Odds Ratio CHI SQUARE test

Lecture 5. Contingency /incidence tables Sensibility, specificity Relative Risk Odds Ratio CHI SQUARE test Lecture 5 Contingency /incidence tables Sensibility, specificity Relative Risk Odds Ratio CHI SQUARE test Contingency tables - example Factor 2 Present + Absent - Total Factor 1 Present + a b a+b Absent

More information

4. STATA output of the analysis

4. STATA output of the analysis Biostatistics(1.55) 1. Objective: analyzing epileptic seizures data using GEE marginal model in STATA.. Scientific question: Determine whether the treatment reduces the rate of epileptic seizures. 3. Dataset:

More information

AP STATISTICS 2009 SCORING GUIDELINES

AP STATISTICS 2009 SCORING GUIDELINES AP STATISTICS 2009 SCORING GUIDELINES Question 1 Intent of Question The primary goals of this question were to assess a student s ability to (1) construct an appropriate graphical display for comparing

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Analysis of Categorical Data from the Ashe Center Student Wellness Survey

Analysis of Categorical Data from the Ashe Center Student Wellness Survey Lab 6 Analysis of Categorical Data from the Ashe Center Student Wellness Survey Before starting this lab, you should be familiar with: the difference between categorical and quantitative variables, and

More information

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months? Medical Statistics 1 Basic Concepts Farhad Pishgar Defining the data Population and samples Except when a full census is taken, we collect data on a sample from a much larger group called the population.

More information

C2 Training: August 2010

C2 Training: August 2010 C2 Training: August 2010 Introduction to meta-analysis The Campbell Collaboration www.campbellcollaboration.org Pooled effect sizes Average across studies Calculated using inverse variance weights Studies

More information

STP 231 Example FINAL

STP 231 Example FINAL STP 231 Example FINAL Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.

More information

Controlling Bias & Confounding

Controlling Bias & Confounding Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of

More information

University of New Mexico Hypothesis Testing-4 (Fall 2015) PH 538: Public Health Biostatistical Methods I (by Fares Qeadan)

University of New Mexico Hypothesis Testing-4 (Fall 2015) PH 538: Public Health Biostatistical Methods I (by Fares Qeadan) University of New Mexico Hypothesis Testing-4 (Fall 2015) PH 538: Public Health Biostatistical Methods I (by Fares Qeadan) hypothesis testing for association between two categorical variables (Chi-square

More information

Strategies for Data Analysis: Cohort and Case-control Studies

Strategies for Data Analysis: Cohort and Case-control Studies Strategies for Data Analysis: Cohort and Case-control Studies Post-Graduate Course, Training in Research in Sexual Health, 24 Feb 05 Isaac M. Malonza, MD, MPH Department of Reproductive Health and Research

More information

Figure S1. Flowchart of sample included in the analysis.

Figure S1. Flowchart of sample included in the analysis. Figure S1. Flowchart of sample included in the analysis. 3098 mother/infant pairs with EMR records of well-child and specialty visits 418 cases with any ADHD diagnosis 94 cases with any ASD diagnosis while

More information

Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2

Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2 Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2 1. Suppose it is the year 1985 and you are doing research on the differences in wages earned by men and women in the U.S. workforce.

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is

More information

Multivariate dose-response meta-analysis: an update on glst

Multivariate dose-response meta-analysis: an update on glst Multivariate dose-response meta-analysis: an update on glst Nicola Orsini Unit of Biostatistics Unit of Nutritional Epidemiology Institute of Environmental Medicine Karolinska Institutet http://www.imm.ki.se/biostatistics/

More information

Gender Analysis. Week 3

Gender Analysis. Week 3 Gender Analysis Week 3 1 Objectives 1. Understand how to practically conduct gender analysis 2. Learn and apply measures of frequency and association 2 100 75 50 25 0 Measures of Disease Frequency and

More information

Final Exam - section 2. Thursday, December hours, 30 minutes

Final Exam - section 2. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2011 Final Exam - section 2 Thursday, December 15 2 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

Lesson: A Ten Minute Course in Epidemiology

Lesson: A Ten Minute Course in Epidemiology Lesson: A Ten Minute Course in Epidemiology This lesson investigates whether childhood circumcision reduces the risk of acquiring genital herpes in men. 1. To open the data we click on File>Example Data

More information

Lecture 20: Chi Square

Lecture 20: Chi Square Statistics 20_chi.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 20: Chi Square Introduction Up until now, we done statistical test using means, but the assumptions for means have eliminated

More information

Day 11: Measures of Association and ANOVA

Day 11: Measures of Association and ANOVA Day 11: Measures of Association and ANOVA Daniel J. Mallinson School of Public Affairs Penn State Harrisburg mallinson@psu.edu PADM-HADM 503 Mallinson Day 11 November 2, 2017 1 / 45 Road map Measures of

More information

Reflection Questions for Math 58B

Reflection Questions for Math 58B Reflection Questions for Math 58B Johanna Hardin Spring 2017 Chapter 1, Section 1 binomial probabilities 1. What is a p-value? 2. What is the difference between a one- and two-sided hypothesis? 3. What

More information

Why nursing students should understand statistics. Objectives of lecture. Why Statistics? Not to put students off statistics!

Why nursing students should understand statistics. Objectives of lecture. Why Statistics? Not to put students off statistics! Why nursing students should understand statistics Objectives of lecture Not to put students off statistics! Review methods of displaying and summarizing data. Review basic statistical concepts used in

More information

Hepatitis C Seroprevalence Among HIV-Infected Childbearing Women in New York State in 2006

Hepatitis C Seroprevalence Among HIV-Infected Childbearing Women in New York State in 2006 DOI 10.1007/s10995-015-1853-4 Hepatitis C Seroprevalence Among HIV-Infected Childbearing Women in New York State in 2006 L. Ghazaryan 1 L. Smith 2 M. Parker 3 C. Flanigan 4 W. Pulver 2 T. Sullivan 5 A.

More information

Do the sample size assumptions for a trial. addressing the following question: Among couples with unexplained infertility does

Do the sample size assumptions for a trial. addressing the following question: Among couples with unexplained infertility does Exercise 4 Do the sample size assumptions for a trial addressing the following question: Among couples with unexplained infertility does a program of up to three IVF cycles compared with up to three FSH

More information

One-Way Independent ANOVA

One-Way Independent ANOVA One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.

More information

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

115 remained abstinent. 140 remained abstinent. Relapsed Remained abstinent Total

115 remained abstinent. 140 remained abstinent. Relapsed Remained abstinent Total Chapter 10 Exercises 1. Intent-to-treat analysis: Example 1 In a randomized controlled trial to determine whether the nicotine patch reduces the risk of relapse among smokers who have committed to quit,

More information

Low birthweight and respiratory disease in adulthood: A population-based casecontrol

Low birthweight and respiratory disease in adulthood: A population-based casecontrol Page 26 of 36 Online Data Supplement Low birthweight and respiratory disease in adulthood: A population-based casecontrol study Eric C. Walter, MD; William J. Ehlenbach, MD; David L. Hotchkin, MD, Jason

More information

Binary Diagnostic Tests Paired Samples

Binary Diagnostic Tests Paired Samples Chapter 536 Binary Diagnostic Tests Paired Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary measures

More information

Data Analysis with SPSS

Data Analysis with SPSS Data Analysis with SPSS A First Course in Applied Statistics Fourth Edition Stephen Sweet Ithaca College Karen Grace-Martin The Analysis Factor Allyn & Bacon Boston Columbus Indianapolis New York San Francisco

More information

Biostatistics and Epidemiology Step 1 Sample Questions Set 1

Biostatistics and Epidemiology Step 1 Sample Questions Set 1 Biostatistics and Epidemiology Step 1 Sample Questions Set 1 1. A study wishes to assess birth characteristics in a population. Which of the following variables describes the appropriate measurement scale

More information

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies. Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam May 28 th, 2015: 9am to 1pm Instructions: 1. There are seven questions and 12 pages. 2. Read each question carefully. Answer

More information

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /peds.

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /peds. Gustavson, K., Ystrom, E., Stoltenberg, C., Susser, E., Suren, P., Magnus, P.,... Reichborn-Kjennerud, T. (2017). Smoking in pregnancy and child ADHD. Pediatrics, 139(2), [e20162509]. DOI: 10.1542/peds.2016-2509

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

STAT362 Homework Assignment 5

STAT362 Homework Assignment 5 STAT362 Homework Assignment 5 Sharon O Boyle Problem 1, Problem 3.6, p. 117 SAS Program * Sharon O'Boyle; * Stat 362; * Homework Assignment 5; * Problem 3.6, p. 117; * Program to compute Odds Ratio and

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

Analyzing Discrete Data

Analyzing Discrete Data Analyzing Discrete Data Kwang Woo Ahn, PhD Sponsored by the Clinical and Translational Science Institute (CTSI) and the Department of Population Health / Division of Biostatistics Speaker disclosure In

More information

Measure of Association Examples of measure of association

Measure of Association Examples of measure of association Measure of Association Examples of measure of association Epidemiologists usually use relative differences to assess causal association (Table 3-1). Table 3 1 Types of Measures of Association Used in Analytic

More information

PSY 216: Elementary Statistics Exam 4

PSY 216: Elementary Statistics Exam 4 Name: PSY 16: Elementary Statistics Exam 4 This exam consists of multiple-choice questions and essay / problem questions. For each multiple-choice question, circle the one letter that corresponds to the

More information

University of New Mexico Hypothesis Testing-4 (Fall 2017) BIOM 505: Biostatistical Methods I (by Fares Qeadan)

University of New Mexico Hypothesis Testing-4 (Fall 2017) BIOM 505: Biostatistical Methods I (by Fares Qeadan) University of New Mexico Hypothesis Testing-4 (Fall 2017) BIOM 505: Biostatistical Methods I (by Fares Qeadan) hypothesis testing for association between two categorical variables (Chi-square Test of Independence):

More information

What Are Your Odds? : An Interactive Web Application to Visualize Health Outcomes

What Are Your Odds? : An Interactive Web Application to Visualize Health Outcomes What Are Your Odds? : An Interactive Web Application to Visualize Health Outcomes Abstract Spreading health knowledge and promoting healthy behavior can impact the lives of many people. Our project aims

More information

Does Postpartum Depression Affect Employment? Maria Elena Komodromou

Does Postpartum Depression Affect Employment? Maria Elena Komodromou Does Postpartum Depression Affect Employment? Maria Elena Komodromou Outline Postpartum Depression-definition Research question/ what this study adds Data Descriptives KHB method Results Further steps

More information

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Measures of Association

Measures of Association Measures of Association Lakkana Thaikruea M.D., M.S., Ph.D. Community Medicine Department, Faculty of Medicine, Chiang Mai University, Thailand Introduction One of epidemiological studies goal is to determine

More information

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized) Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions

More information

Title: Prevalence and epidemiological correlates and treatment outcome of HCV infection in an Italian prison setting

Title: Prevalence and epidemiological correlates and treatment outcome of HCV infection in an Italian prison setting Author's response to reviews Title: Prevalence and epidemiological correlates and treatment outcome of HCV infection in an Italian prison setting Authors: Micaela Brandolini (m.brandolini@smatteo.pv.it)

More information

Measuring association in contingency tables

Measuring association in contingency tables Measuring association in contingency tables Patrick Breheny April 3 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 28 Hypothesis tests and confidence intervals Fisher

More information

Title: The role of cognitive stimulation at home in low-income preschoolers' nutrition, physical activity and Body Mass Index

Title: The role of cognitive stimulation at home in low-income preschoolers' nutrition, physical activity and Body Mass Index Reviewer s report Title: The role of cognitive stimulation at home in low-income preschoolers' nutrition, physical activity and Body Mass Index Version: 1 Date: 23 Feb 2017 Reviewer: Pattanee Winichagoon

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started

More information

The Relationship of Adverse Childhood Experiences to Adult Health Status

The Relationship of Adverse Childhood Experiences to Adult Health Status The Relationship of Adverse Childhood Experiences to Adult Health Status Presentation to MCAH Committee December2, 21 Edwin Ferran Director of Learning and Innovation Community Action Partnership of Sonoma

More information

I. Identifying the question Define Research Hypothesis and Questions

I. Identifying the question Define Research Hypothesis and Questions Term Paper I. Identifying the question What is the question? (What are my hypotheses?) Is it possible to answer the question with statistics? Is the data obtainable? (birth weight, socio economic, drugs,

More information

Advanced ANOVA Procedures

Advanced ANOVA Procedures Advanced ANOVA Procedures Session Lecture Outline:. An example. An example. Two-way ANOVA. An example. Two-way Repeated Measures ANOVA. MANOVA. ANalysis of Co-Variance (): an ANOVA procedure whereby the

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

Constructing a Bivariate Table:

Constructing a Bivariate Table: Introduction Bivariate Analysis: A statistical method designed to detect and describe the relationship between two nominal or ordinal variables (typically independent and dependent variables). Cross-Tabulation:

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

appstats26.notebook April 17, 2015

appstats26.notebook April 17, 2015 Chapter 26 Comparing Counts Objective: Students will interpret chi square as a test of goodness of fit, homogeneity, and independence. Goodness of Fit A test of whether the distribution of counts in one

More information

t-test Tutorial Aliza McConnahey & Josh Petkash

t-test Tutorial Aliza McConnahey & Josh Petkash t-test Tutorial Aliza McConnahey & Josh Petkash t-testing? A t-test is used to test against the mean of a population when the population standard deviation is not known. In order to conduct a t-test, a

More information

Before we get started:

Before we get started: Before we get started: http://arievaluation.org/projects-3/ AEA 2018 R-Commander 1 Antonio Olmos Kai Schramm Priyalathta Govindasamy Antonio.Olmos@du.edu AntonioOlmos@aumhc.org AEA 2018 R-Commander 2 Plan

More information

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID # STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus

More information

Common Statistical Issues in Biomedical Research

Common Statistical Issues in Biomedical Research Common Statistical Issues in Biomedical Research Howard Cabral, Ph.D., M.P.H. Boston University CTSI Boston University School of Public Health Department of Biostatistics May 15, 2013 1 Overview of Basic

More information

Introduction. Lecture 1. What is Statistics?

Introduction. Lecture 1. What is Statistics? Lecture 1 Introduction What is Statistics? Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain information and understanding from data. A statistic

More information

Midterm Exam MMI 409 Spring 2009 Gordon Bleil

Midterm Exam MMI 409 Spring 2009 Gordon Bleil Midterm Exam MMI 409 Spring 2009 Gordon Bleil Table of contents: (Hyperlinked to problem sections) Problem 1 Hypothesis Tests Results Inferences Problem 2 Hypothesis Tests Results Inferences Problem 3

More information

8/6/15. Homework. Homework. Lecture 8: Systematic Reviews and Meta-analysis

8/6/15. Homework. Homework. Lecture 8: Systematic Reviews and Meta-analysis Lecture 8: Systematic Reviews and Meta-analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Homework Stata Example: stset surv_days, failure(died) sts test age_cat

More information

ANALYSIS OF SURVEYS WITH EPI INFO AND STATA

ANALYSIS OF SURVEYS WITH EPI INFO AND STATA Department of Epidemiology Course EPI 418 School of Public Health University of California, Los Angeles Session 11 ANALYSIS OF SURVEYS WITH EPI INFO AND STATA Note: prepared with Epi Info (Windows) and

More information

NORTH SOUTH UNIVERSITY TUTORIAL 1

NORTH SOUTH UNIVERSITY TUTORIAL 1 NORTH SOUTH UNIVERSITY TUTORIAL 1 REVIEW FROM BIOSTATISTICS I AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 DATA TYPES/ MEASUREMENT SCALES Categorical:

More information

Unit 4 Probabilities in Epidemiology

Unit 4 Probabilities in Epidemiology BIOSTATS 540 Fall 2018 4. Probabilities in Epidemiology Page 1 of 23 Unit 4 Probabilities in Epidemiology All knowledge degenerates into probability - David Hume With just a little bit of thinking, we

More information

MEASURES OF ASSOCIATION AND REGRESSION

MEASURES OF ASSOCIATION AND REGRESSION DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MEASURES OF ASSOCIATION AND REGRESSION I. AGENDA: A. Measures of association B. Two variable regression C. Reading: 1. Start Agresti

More information

Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum

Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum Instrumental variables and their sensitivity to unobserved biases Dylan Small Department of Statistics, Wharton School, University of Pennsylvania Based on joint work with Paul Rosenbaum Overview Instrumental

More information

EPS 625 INTERMEDIATE STATISTICS TWO-WAY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY)

EPS 625 INTERMEDIATE STATISTICS TWO-WAY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY) EPS 625 INTERMEDIATE STATISTICS TO-AY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY) A researcher conducts a study to evaluate the effects of the length of an exercise program on the flexibility of female and male

More information

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992 Exam #2, Spring 1992 Question 1 A group of researchers from a neurobehavioral institute are interested in the relationships that have been found between the amount of cerebral blood flow (CB FLOW) to the

More information

Objectives. Types of HIV Tests. Age Appropriateness of Tests. Breastfeeding and HIV Testing. Why are there different tests for different ages?

Objectives. Types of HIV Tests. Age Appropriateness of Tests. Breastfeeding and HIV Testing. Why are there different tests for different ages? Objectives At the end of the lesson participants will be able to: Identify the types of HIV tests available in Botswana State who should be tested Identify which tests are used for infants and children

More information

Stata: Merge and append Topics: Merging datasets, appending datasets - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 1. Terms There are several situations when working with

More information

Lesson 1: Distributions and Their Shapes

Lesson 1: Distributions and Their Shapes Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that

More information

ANOVA in SPSS (Practical)

ANOVA in SPSS (Practical) ANOVA in SPSS (Practical) Analysis of Variance practical In this practical we will investigate how we model the influence of a categorical predictor on a continuous response. Centre for Multilevel Modelling

More information

Lessons in biostatistics

Lessons in biostatistics Lessons in biostatistics The test of independence Mary L. McHugh Department of Nursing, School of Health and Human Services, National University, Aero Court, San Diego, California, USA Corresponding author:

More information

Lab 7 (100 pts.): One-Way ANOVA Objectives: Analyze data via the One-Way ANOVA

Lab 7 (100 pts.): One-Way ANOVA Objectives: Analyze data via the One-Way ANOVA STAT 350 (Spring 2015) Lab 7: SAS Solution 1 Lab 7 (100 pts.): One-Way ANOVA Objectives: Analyze data via the One-Way ANOVA A. (50 pts.) Do isoflavones increase bone mineral density? (ex12-45bmd.txt) Kudzu

More information

Statistical reports Regression, 2010

Statistical reports Regression, 2010 Statistical reports Regression, 2010 Niels Richard Hansen June 10, 2010 This document gives some guidelines on how to write a report on a statistical analysis. The document is organized into sections that

More information

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu What you should know before you collect data BAE 815 (Fall 2017) Dr. Zifei Liu Zifeiliu@ksu.edu Types and levels of study Descriptive statistics Inferential statistics How to choose a statistical test

More information

Writing about multivariate analysis

Writing about multivariate analysis Writing about multivariate analysis Jane E. Miller, Ph.D. Overview Structure of an academic paper Creating a logical sequence of tables Describing multivariate results Designing effective statistical tables

More information

Name Student Response sheet Period. Cell Division Mitosis Lab

Name Student Response sheet Period. Cell Division Mitosis Lab ame Student Response sheet Period Cell Division Mitosis Lab Modeling Mitosis (Refer to your summary sheets from Mitosis & Meiosis on the Table ) Part 1: Effects of Environment on Mitosis Questions: 1.

More information

Lecture 15 Chapters 12&13 Relationships between Two Categorical Variables

Lecture 15 Chapters 12&13 Relationships between Two Categorical Variables Lecture 15 Chapters 12&13 Relationships between wo Categorical ariables abulating and Summarizing able of Expected Counts Statistical Significance for wo-way ables Constructing & Assessing a wo-way able

More information

Epidemiology: Overview of Key Concepts and Study Design. Polly Marchbanks

Epidemiology: Overview of Key Concepts and Study Design. Polly Marchbanks Epidemiology: Overview of Key Concepts and Study Design Polly Marchbanks Lecture Outline (1) Key epidemiologic concepts - Definition - What epi is not - What epi is - Process of epi research Lecture Outline

More information

Exam 4 Review Exercises

Exam 4 Review Exercises Math 160: Statistics Spring, 2014 Toews Exam 4 Review Exercises Instructions: Working in groups of 2-4, first review the goals and objectives for this exam (listed below) and then work the following problems.

More information

Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact.

Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact. Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact. 1 This three-part series focuses on the estimation of the association between exposures

More information

Please evaluate this material by clicking here:

Please evaluate this material by clicking here: EPI Case Study 3: Cross-Sectional, Case-Control, and Cohort Studies Identification of TB Risk Time to Complete Exercise: 60 minutes LEARNING OBJECTIVES At the completion of this module, participants should

More information

7. Bivariate Graphing

7. Bivariate Graphing 1 7. Bivariate Graphing Video Link: https://www.youtube.com/watch?v=shzvkwwyguk&index=7&list=pl2fqhgedk7yyl1w9tgio8w pyftdumgc_j Section 7.1: Converting a Quantitative Explanatory Variable to Categorical

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information