Biostatistics 2 nd year Comprehensive Examination. Due: May 31 st, 2013 by 5pm. Instructions:

Size: px
Start display at page:

Download "Biostatistics 2 nd year Comprehensive Examination. Due: May 31 st, 2013 by 5pm. Instructions:"

Transcription

1 Biostatistics 2 nd year Comprehensive Examination Due: May 31 st, 2013 by 5pm. Instructions: 1. The exam is divided into two parts. There are 6 questions in section I and 2 questions in section II. 2. Answer each question to the best of your ability. 3. Be as specific as possible and write as clearly as possible. 4. This is a take- home examination. You may consult books, notes, papers, and you may use the Internet. However, you may not consult or discuss this exam with another human being or statistical oracle, nor may you seek help from another individual on the internet (e.g., no posting questions to chat rooms or message boards). 5. If you have any questions, please contact Professor Blume by or by phone (cell: ). Do not worry about being polite; Professor Blume as needed and call for emergencies. 6. Turn in your exam by ing it to Professor Blume at j.blume@vanderbilt.edu AND Linda Wilson at linda.l.wilson@vanderbilt.edu. You must get confirmation from Professor Blume or Ms. Wilson that your exam was received before 5pm on Friday 5/31. Alternatively, you may turn in a hard copy to either person by the deadline. 7. Vanderbilt s academic honor code applies; be sure to adhere to the spirit of this code. Question Points Score Comments Section II pts. Part I; 60 pts. Part II Total 240

2 Section I 1. Let Ω be a finite set and let F = PP Ω be the σσ- algebra of all subsets of Ω. For AA Ω, let AA be the number of elements of AA. a. Prove that the formula PP AA = AA / Ω defines a probability on Ω, F. b. Prove that PP BB is the uniform distribution on BB, where BB Ω. c. Suppose now that Ω is a countably infinite set. Prove that a probability PP on Ω, F cannot be the uniform distribution. 2. Let XX, XX, XX, be iiiiii random variables each with zero expectation and common finite variance σσ. Define SS = XX. a. What is the limiting distribution (convergence in distribution) of the sequence (justify your answer): SS σσ nn b. Show that there cannot exist a limiting random variable SS such that SS.. SS σσ nn 3. Let h be an absolutely continuous function on 0,1 with 0 h xx 1 for all xx. We wish to estimate the integral II h h xx dddd. a. Let UU, UU,, UU be an iiiiii sample from a Uniform 0,1 distribution. Using this sample, argue that II h = 1 nn is the standard Monte Carlo estimator. h UU 1 of 7

3 b. The method of antithetic variables leads to an alternative estimator, namely II h = 1 2nn h UU + h 1 UU i. Show that both II and II are consistent. Specifically, that II.. II h and.. II II h as nn. ii. Show there exist positive constants σσ h and σσ h such that nn II h II h NN 0, σσ h nn II h II h NN 0, σσ h iii. For a given h, determine if one method is to be preferred over the other in terms of minimizing σσ h. 4. Consider the squared- error loss function LL yy, ff xx = yy ff xx. Given data TT = xx, yy : ii = 1,, nn and a fitted model yy = ff xx, we define the in- sample error to be EEEErr = 1 nn EE LL YY, ff xx TT where the expectation is over a future observation YY drawn from the conditional distribution YY XX = xx. The training error is defined as eeeeee = 1 nn LL yy, ff xx and model optimism is defined as oooo = EEEErr eeeeee. Note that when xx : ii = 1,, nn are viewed as fixed, oooo is a function of the random variables YY = yy,, yy. Under these conditions, show that EE oooo = cov yy, yy. 2 of 7

4 5. A lasso regression model is fit to pp predictors using the following pre- specified constraint for the coefficients: ββ tt. The fitted coefficient for one predictor, say XX, is ββ = aa. Consider what happens when one more predictor - a duplicate of XX called XX - is added to the model. The lasso model is refit with pp + 1 predictors using the constraint: ββ + ββ tt, where ββ is the coefficient for XX = XX. a. Describe the set of solutions for the two coefficients ββ and ββ. b. Now suppose a ridge regression model with a pre- specified penalty λλ is used. Show that including the duplicated predictor XX results in equal coefficients for both XX and XX. c. Comment on the implications of these results. 6. Consider the typical linear regression model, yy = ββββ + εε with εε~nn 0, σσ II, where the residual sum of squares RRRRRR ββ = yy ββ xx is minimized to obtain parameter estimates ββ nn, where the subscript is added to emphasize that the parameter estimates are functions of nn data points xx, yy ~FF for ii = 1,, nn. The observed residual sum of squares, say RR = RRRRRR ββ nn, is a random variable. Now suppose that additional data are collected from the same joint distribution, say xx, yy ~FF for jj = 1,, mm (i.e., collect an additional mm data points). The observed fit of the previously established model to these new data is given by the residual fit of the additional data using the previously established parameter estimates, say TT = yy ββ xx. Show that EE RR /nn EE TT /mm and comment on the implication of this result. End Part I. 3 of 7

5 Section II Instructions Prepare a 3 to 6 page statistical analysis report on the Vanderbilt Inpatient Cohort Study (VICS) that addresses the two scientific questions described on page 7. A detailed description of the data and scientific aims are described below. Tables and figures should be included in an appendix; they are not counted against the page total. Be sure to include the following sections in your report: Introduction, Methods, Results, and Conclusions. You may cite this description as a source document for background information. Your Methods section should be comprised of your analytical approach and may be proportionately longer than a typical scientific article. Results and interpretations should be well written, formatted and presented in an accessible manner for a scientific investigator. However, they should also include enough statistical details for a statistical reviewer. Check model assumptions and/or other diagnostics as appropriate. Introduction Following discharge from the hospital 19-23% of patients experience an adverse event, and the majority are due to adverse drug events (ADEs) from medications. ADEs, or harm due to medications, may result from unexplained differences in medication regimens across sites of care and unexplained differences between what medications the patient thinks they should be taking and what is ordered. Some estimates report that 30-50% of patients have a discrepancy between the medications listed at hospital discharge and the patient- reported regimen that persists after discharge. Patients who take cardiovascular medications are at particularly high risk for discrepancies after discharge, due to the multiple types (i.e., classes) of medications prescribed. As a step towards reducing adverse event and rehospitalization rates VUH is interested in identifying social risk factors associated with both medication discrepancies and with overall patient health. There are two parts to this analysis: 1) characterizing the relationship between medication discrepancies (outcome) and educational attainment (exposure), and 2) examining the relationship between self- reported global health status (outcome) and marital status (exposure). The Vanderbilt Inpatient Cohort Study (VICS) Sample The Vanderbilt Inpatient Cohort Study (VICS) is a prospective cohort study of participants admitted to Vanderbilt University Hospital in Nashville, TN with cardiovascular disease. The broad goal of VICS is to determine the impact of social determinants on health outcomes such as medication management, unplanned hospital utilization, and mortality after discharge. The rationale and design of VICS will be detailed in a future publication (so don t look for this). Eligibility screening 4 of 7

6 occurred shortly after admission and sought to identify participants who were at least 18 years of age admitted with an intermediate or high likelihood of acute coronary syndromes (ACS) or acute decompensated heart failure (HF) per a physician s review of the clinical record. Exclusion criteria included: inability to communicate in English or Spanish, severe unstable psychiatric illness, inability to contact after discharge, on hospice, or otherwise too ill. Participants were approached while still hospitalized for written informed consent. The Data The data set VICS.TXT has 14 variables and 473 observations. There are 6 missing values (coded as NA s) and they are noted in the variable descriptions. Primary outcomes and exposures: Medication Discrepancies: Two to eight days following hospital discharge, participants were contacted by phone. They were questioned about their medication regimen and since patients were potentially prescribed multiple medications from multiple classes of medications, they were asked about one randomly selected medication within each class of cardiac medications prescribed. If the patient was on eight cardiac medications from five classes of medications, he/she was asked about five cardiac medications, each from a different cardiac medication class. If the patient did not fully understand the medication indication, the dose he/she was prescribed, or frequency with which he/she was to take the medication, it was considered a cardiac medication discrepancy. Data regarding medication discrepancies are stored as the number of discrepancies (ndiscrep) and the number of cardiac medications tested (ncartest). Global health status: Global health status (promismean) was assessed using a 5- item measure from the NIH Participant Reported Outcomes Measurement Information System (PROMIS). The 5 items ask participants about overall health, quality of life, physical health, mental health, and satisfaction with social activities and relationships with each question on a 5- point Likert scale (1, 2, 3, 4, or 5). Responses for the 5 questions are then averaged with higher scores implying better global health. Education: Education (edu) is stored as the number of years of educational attainment. Twelve years of education should be thought of as obtaining a high school degree (or equivalent) but without college. Sixteen years of education should be thought of as obtaining a college degree but without graduate work. The low end of the education distribution (e.g., less than twelve years) is of particular interest to researchers since such patients are believed to be at highest risk for a discrepancy. 5 of 7

7 Marital Status: Marital status (maritalcat) is stored as a categorical variable with four distinct values: Married/Living with partner, Separated/Divorced, Widowed, Single/Never married. Other Variables in the dataset: studyid: participant identifier diagnosis: All patients in this study had symptoms of ACS or HF. Some had symptoms of both. This is a three category variable with levels: ACS only, HF only, and ACS and HF. age: patient age at admission to the hospital (years). gender: Male or Female racecat: White, Black/AA, or Other. There is 1 missing value. essi6sum: Social support was assessed using 6 of the 7 items from the ENRICHD Social Support Inventory (ESSI). Participants were asked the 6 questions regarding emotional and instrumental support, each of which had a 5- item Likert response scale. The ESSI is ranges from 5 to 30, where higher scores indicate a greater level of social support. stofhlasum: Participants completed the short form of the Test of Functional Health Literacy in Adults (s- TOFHLA), a timed test administered in a maximum of 7 minutes. Scores can range from 0 to 36; there are 5 missing values. This variable is a measure of health literacy or how well patients understand their health and the health care system. snsmean: Numeracy is the degree to which individuals have the capacity to access, process, interpret, communicate, and act on numerical, quantitative, graphical, biostatistical, and probabilistic health information needed to make effective health decisions. We employed a shortened 3- item version of the Subjective Numeracy Scale (SNS), which quantifies the participants perceived quantitative abilities through items about math skills and preferences for numerical information. The SNS is reported as the mean of the 3 items on a Likert scale of 1 to 6, with higher scores reflecting greater subjective numeracy. phqsum: We assessed depression during the 2 weeks prior to the interview using the Participant Health Questionnaire- 8 (PHQ- 8). The 8- item measure is scored on a scale of 0 to 24 with higher scores reflecting more severe depression. ncartest: Number of cardiac medication tested for the follow- up phone interview. 6 of 7

8 The Scientific Questions: Part 1 Researchers would like to characterize the relationship between educational attainment and the risk of medication discrepancies. They are particularly interested in those patients at the low end of the educational attainment distribution since they believe that such subjects are likely to be at greatest risk of a discrepancy. Note that journal referees and readers expect age, gender, race, and marital status to be included in the regression model. Consider the goals of the analysis to determine the other variables that should be included, and provide justification of your decisions. Part 2 Researchers hypothesize that patients who are married or have a partner are more likely to have improved health overall than those who are not married and not living with a partner. They believe that being married or having a partner leads to decreased stress levels and increased security, which in turn leads to improved health. Specifically, researchers want to know the probability that patients who are married or living with a partner have a greater mean PROMIS score than other types of patients. They want to know this probability for comparing married/living with partner vs. each of the following groups of patients: 1) separated/divorced 2) widowed 3) single/never married 4) separated/divorced, widowed, or single/never married (all combined) Use a single linear regression model that can be interpreted appropriately for this question. Consider the goals of the analysis to determine whether other variables should be included in the model, and provide a brief justification of your decisions. End Section II. 7 of 7

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies. Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam May 28 th, 2015: 9am to 1pm Instructions: 1. There are seven questions and 12 pages. 2. Read each question carefully. Answer

More information

Name: Biostatistics 1 st year Comprehensive Examination: Applied Take Home exam. Due May 29 th, 2015 by 5pm. Late exams will not be accepted.

Name: Biostatistics 1 st year Comprehensive Examination: Applied Take Home exam. Due May 29 th, 2015 by 5pm. Late exams will not be accepted. Name: Biostatistics 1 st year Comprehensive Examination: Applied Take Home exam Due May 29 th, 2015 by 5pm. Late exams will not be accepted. Instructions: 1. There are 2 questions and 4 pages. Answer each

More information

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

Division of Biostatistics College of Public Health Qualifying Exam II Part I. 1-5 pm, June 7, 2013 Closed Book

Division of Biostatistics College of Public Health Qualifying Exam II Part I. 1-5 pm, June 7, 2013 Closed Book Division of Biostatistics College of Public Health Qualifying Exam II Part I -5 pm, June 7, 03 Closed Book. Write the question number in the upper left-hand corner and your exam ID code in the right-hand

More information

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About 7 Statistical Issues that Researchers Shouldn t Worry (So Much) About By Karen Grace-Martin Founder & President About the Author Karen Grace-Martin is the founder and president of The Analysis Factor.

More information

Measurement. 500 Research Methods Mike Kroelinger

Measurement. 500 Research Methods Mike Kroelinger Measurement 500 Research Methods Mike Kroelinger Levels of Measurement Nominal Lowest level -- used to classify variables into two or more categories. Cases placed in the same category must be equivalent.

More information

Katarina R. Mansir, Psy.D. Licensed Psychologist PSY25417 (858) Name: Date: Presenting Concerns

Katarina R. Mansir, Psy.D. Licensed Psychologist PSY25417 (858) Name: Date: Presenting Concerns Name: Date: Presenting Concerns Briefly describe what brings you to therapy. Approximately how long has this concern been bothering you? Day Week Month Several months Year Several years Most of my life

More information

Simple Linear Regression One Categorical Independent Variable with Several Categories

Simple Linear Regression One Categorical Independent Variable with Several Categories Simple Linear Regression One Categorical Independent Variable with Several Categories Does ethnicity influence total GCSE score? We ve learned that variables with just two categories are called binary

More information

Study Guide #2: MULTIPLE REGRESSION in education

Study Guide #2: MULTIPLE REGRESSION in education Study Guide #2: MULTIPLE REGRESSION in education What is Multiple Regression? When using Multiple Regression in education, researchers use the term independent variables to identify those variables that

More information

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided.

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided. Problem Set 3 ECN 140 - Econometrics Professor Oscar Jorda Name ESSAY. Write your answer in the space provided. 1) Sir Francis Galton, a cousin of James Darwin, examined the relationship between the height

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D) Exam Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D) Decide whether or not the conditions and assumptions for inference with

More information

DAZED AND CONFUSED: THE CHARACTERISTICS AND BEHAVIOROF TITLE CONFUSED READERS

DAZED AND CONFUSED: THE CHARACTERISTICS AND BEHAVIOROF TITLE CONFUSED READERS Worldwide Readership Research Symposium 2005 Session 5.6 DAZED AND CONFUSED: THE CHARACTERISTICS AND BEHAVIOROF TITLE CONFUSED READERS Martin Frankel, Risa Becker, Julian Baim and Michal Galin, Mediamark

More information

Study on perceptually-based fitting line-segments

Study on perceptually-based fitting line-segments Regeo. Geometric Reconstruction Group www.regeo.uji.es Technical Reports. Ref. 08/2014 Study on perceptually-based fitting line-segments Raquel Plumed, Pedro Company, Peter A.C. Varley Department of Mechanical

More information

Linear Regression in SAS

Linear Regression in SAS 1 Suppose we wish to examine factors that predict patient s hemoglobin levels. Simulated data for six patients is used throughout this tutorial. data hgb_data; input id age race $ bmi hgb; cards; 21 25

More information

Testing Statistical Models to Improve Screening of Lung Cancer

Testing Statistical Models to Improve Screening of Lung Cancer Testing Statistical Models to Improve Screening of Lung Cancer 1 Elliot Burghardt: University of Iowa Daren Kuwaye: University of Hawai i at Mānoa Iowa Summer Institute in Biostatistics - University of

More information

Identifying Susceptibility in Epidemiology Studies: Implications for Risk Assessment. Joel Schwartz Harvard TH Chan School of Public Health

Identifying Susceptibility in Epidemiology Studies: Implications for Risk Assessment. Joel Schwartz Harvard TH Chan School of Public Health Identifying Susceptibility in Epidemiology Studies: Implications for Risk Assessment Joel Schwartz Harvard TH Chan School of Public Health Risk Assessment and Susceptibility Typically we do risk assessments

More information

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ OBJECTIVES Definitions Stages of Scientific Knowledge Quantification and Accuracy Types of Medical Data Population and sample Sampling methods DEFINITIONS

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

RNA-seq. Design of experiments

RNA-seq. Design of experiments RNA-seq Design of experiments Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in which researchers

More information

Collecting & Making Sense of

Collecting & Making Sense of Collecting & Making Sense of Quantitative Data Deborah Eldredge, PhD, RN Director, Quality, Research & Magnet Recognition i Oregon Health & Science University Margo A. Halm, RN, PhD, ACNS-BC, FAHA Director,

More information

Midterm project due next Wednesday at 2 PM

Midterm project due next Wednesday at 2 PM Course Business Midterm project due next Wednesday at 2 PM Please submit on CourseWeb Next week s class: Discuss current use of mixed-effects models in the literature Short lecture on effect size & statistical

More information

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels; 1 One-Way ANOVAs We have already discussed the t-test. The t-test is used for comparing the means of two groups to determine if there is a statistically significant difference between them. The t-test

More information

This exam consists of three parts. Provide answers to ALL THREE sections.

This exam consists of three parts. Provide answers to ALL THREE sections. Empirical Analysis and Research Methodology Examination Yale University Department of Political Science January 2008 This exam consists of three parts. Provide answers to ALL THREE sections. Your answers

More information

Reliability Theory for Total Test Scores. Measurement Methods Lecture 7 2/27/2007

Reliability Theory for Total Test Scores. Measurement Methods Lecture 7 2/27/2007 Reliability Theory for Total Test Scores Measurement Methods Lecture 7 2/27/2007 Today s Class Reliability theory True score model Applications of the model Lecture 7 Psych 892 2 Great Moments in Measurement

More information

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015

Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Introduction to statistics Dr Alvin Vista, ACER Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Structure

More information

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH Instructor: Chap T. Le, Ph.D. Distinguished Professor of Biostatistics Basic Issues: COURSE INTRODUCTION BIOSTATISTICS BIOSTATISTICS is the Biomedical

More information

Investigator Initiated Study Proposal Form

Investigator Initiated Study Proposal Form Please submit completed form to IISReview@KCI1.com Date Submitted Name & Title Institution Address Phone Number Email Address Principal Investigator / Institution YES NO Multi Center Study Acelity Product(s)

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

A framework for predicting item difficulty in reading tests

A framework for predicting item difficulty in reading tests Australian Council for Educational Research ACEReSearch OECD Programme for International Student Assessment (PISA) National and International Surveys 4-2012 A framework for predicting item difficulty in

More information

Exploring the Relationship Between Substance Abuse and Dependence Disorders and Discharge Status: Results and Implications

Exploring the Relationship Between Substance Abuse and Dependence Disorders and Discharge Status: Results and Implications MWSUG 2017 - Paper DG02 Exploring the Relationship Between Substance Abuse and Dependence Disorders and Discharge Status: Results and Implications ABSTRACT Deanna Naomi Schreiber-Gregory, Henry M Jackson

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

25. Two-way ANOVA. 25. Two-way ANOVA 371

25. Two-way ANOVA. 25. Two-way ANOVA 371 25. Two-way ANOVA The Analysis of Variance seeks to identify sources of variability in data with when the data is partitioned into differentiated groups. In the prior section, we considered two sources

More information

HOW TO WRITE A STUDY PROTOCOL

HOW TO WRITE A STUDY PROTOCOL HOW TO WRITE A STUDY PROTOCOL Manar Mohamed Moneer Assistant Professor Epidemiology & Biostatistics Department 2007 Reasoning behind Structure of a research project. What? Every Step of a Study A document

More information

RNA-seq. Differential analysis

RNA-seq. Differential analysis RNA-seq Differential analysis Data transformations Count data transformations In order to test for differential expression, we operate on raw counts and use discrete distributions differential expression.

More information

CHAPTER TWO REGRESSION

CHAPTER TWO REGRESSION CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

CONTENT ANALYSIS OF COGNITIVE BIAS: DEVELOPMENT OF A STANDARDIZED MEASURE Heather M. Hartman-Hall David A. F. Haaga

CONTENT ANALYSIS OF COGNITIVE BIAS: DEVELOPMENT OF A STANDARDIZED MEASURE Heather M. Hartman-Hall David A. F. Haaga Journal of Rational-Emotive & Cognitive-Behavior Therapy Volume 17, Number 2, Summer 1999 CONTENT ANALYSIS OF COGNITIVE BIAS: DEVELOPMENT OF A STANDARDIZED MEASURE Heather M. Hartman-Hall David A. F. Haaga

More information

Weekly Paper Topics psy230 / Bizer / Fall xxarticles are available at idol.union.edu/bizerg/readings230xx

Weekly Paper Topics psy230 / Bizer / Fall xxarticles are available at idol.union.edu/bizerg/readings230xx Weekly Paper Topics psy230 / Bizer / Fall 2018 xxarticles are available at idol.union.edu/bizerg/readings230xx One question is listed below for each of the eight articles assigned for the term. You ll

More information

CHAPTER III RESEARCH METHODOLOGY

CHAPTER III RESEARCH METHODOLOGY CHAPTER III RESEARCH METHODOLOGY In this chapter, the researcher will elaborate the methodology of the measurements. This chapter emphasize about the research methodology, data source, population and sampling,

More information

RISK PREDICTION MODEL: PENALIZED REGRESSIONS

RISK PREDICTION MODEL: PENALIZED REGRESSIONS RISK PREDICTION MODEL: PENALIZED REGRESSIONS Inspired from: How to develop a more accurate risk prediction model when there are few events Menelaos Pavlou, Gareth Ambler, Shaun R Seaman, Oliver Guttmann,

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

PubH 7405: REGRESSION ANALYSIS. Propensity Score

PubH 7405: REGRESSION ANALYSIS. Propensity Score PubH 7405: REGRESSION ANALYSIS Propensity Score INTRODUCTION: There is a growing interest in using observational (or nonrandomized) studies to estimate the effects of treatments on outcomes. In observational

More information

CHAPTER ELEVEN. NON-EXPERIMENTAL RESEARCH of GROUP DIFFERENCES

CHAPTER ELEVEN. NON-EXPERIMENTAL RESEARCH of GROUP DIFFERENCES CHAPTER ELEVEN NON-EXPERIMENTAL RESEARCH of GROUP DIFFERENCES Chapter Objectives: Understand how non-experimental studies of group differences compare to experimental studies of groups differences. Understand

More information

Exam 4 Review Exercises

Exam 4 Review Exercises Math 160: Statistics Spring, 2014 Toews Exam 4 Review Exercises Instructions: Working in groups of 2-4, first review the goals and objectives for this exam (listed below) and then work the following problems.

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

Alan S. Gerber Donald P. Green Yale University. January 4, 2003

Alan S. Gerber Donald P. Green Yale University. January 4, 2003 Technical Note on the Conditions Under Which It is Efficient to Discard Observations Assigned to Multiple Treatments in an Experiment Using a Factorial Design Alan S. Gerber Donald P. Green Yale University

More information

1) What is the independent variable? What is our Dependent Variable?

1) What is the independent variable? What is our Dependent Variable? 1) What is the independent variable? What is our Dependent Variable? Independent Variable: Whether the font color and word name are the same or different. (Congruency) Dependent Variable: The amount of

More information

SPARRA Mental Disorder: Scottish Patients at Risk of Readmission and Admission (to psychiatric hospitals or units)

SPARRA Mental Disorder: Scottish Patients at Risk of Readmission and Admission (to psychiatric hospitals or units) SPARRA Mental Disorder: Scottish Patients at Risk of Readmission and Admission (to psychiatric hospitals or units) A report on the work to identify patients at greatest risk of readmission and admission

More information

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do. Midterm STAT-UB.0003 Regression and Forecasting Models The exam is closed book and notes, with the following exception: you are allowed to bring one letter-sized page of notes into the exam (front and

More information

Detecting Anomalous Patterns of Care Using Health Insurance Claims

Detecting Anomalous Patterns of Care Using Health Insurance Claims Partially funded by National Science Foundation grants IIS-0916345, IIS-0911032, and IIS-0953330, and funding from Disruptive Health Technology Institute. We are also grateful to Highmark Health for providing

More information

WELCOME! Lecture 11 Thommy Perlinger

WELCOME! Lecture 11 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression

More information

BACKGROUND HISTORY QUESTIONNAIRE

BACKGROUND HISTORY QUESTIONNAIRE BACKGROUND HISTORY QUESTIONNAIRE Name: Sex M F Address: Home Number: Work Number: Cell Number: Email: SSN: Name and Address of Employer: Date of Birth: Age: Ethnicity: Referred By: Referral Question or

More information

Secukinumab for treating moderate to severe plaque psoriasis. Erratum

Secukinumab for treating moderate to severe plaque psoriasis. Erratum Aberdeen HTA Group Secukinumab for treating moderate to severe plaque psoriasis Erratum Completed 2 March 2015 This report was commissioned by the NIHR HTA Programme as project number 13/129/01 Contains

More information

Lecture 20: Chi Square

Lecture 20: Chi Square Statistics 20_chi.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 20: Chi Square Introduction Up until now, we done statistical test using means, but the assumptions for means have eliminated

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression

More information

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results. 10 Learning Objectives Testing Means After reading this chapter, you should be able to: Related-Samples t Test With Confidence Intervals 1. Describe two types of research designs used when we select related

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

STEP II Conceptualising a Research Design

STEP II Conceptualising a Research Design STEP II Conceptualising a Research Design This operational step includes two chapters: Chapter 7: The research design Chapter 8: Selecting a study design CHAPTER 7 The Research Design In this chapter you

More information

CBT Intake Form. Patient Name: Preferred Name: Last. First. Best contact phone number: address: Address:

CBT Intake Form. Patient Name: Preferred Name: Last. First. Best contact phone number:  address: Address: Patient Information CBT Intake Form Patient Name: Preferred Name: Last Date of Birth: _// Age: _ First MM DD YYYY Gender: Best contact phone number: Email address: _ Address: _ Primary Care Physician:

More information

A Monte Carlo Simulation Study for Comparing Power of the Most Powerful and Regular Bivariate Normality Tests

A Monte Carlo Simulation Study for Comparing Power of the Most Powerful and Regular Bivariate Normality Tests International Journal of Statistics and Applications 014, 4(1): 40-45 DOI: 10.593/j.statistics.0140401.04 A Monte Carlo Simulation Study for Comparing Power of the Most Powerful and Regular Bivariate Normality

More information

EFFECTIVE MEDICAL WRITING Michelle Biros, MS, MD Editor-in -Chief Academic Emergency Medicine

EFFECTIVE MEDICAL WRITING Michelle Biros, MS, MD Editor-in -Chief Academic Emergency Medicine EFFECTIVE MEDICAL WRITING Michelle Biros, MS, MD Editor-in -Chief Academic Emergency Medicine Why We Write To disseminate information To share ideas, discoveries, and perspectives to a broader audience

More information

Designing Systems for Effective Heart Failure Care

Designing Systems for Effective Heart Failure Care Designing Systems for Effective Heart Failure Care Ileana L. Piña, MD, MPH Professor of Medicine, Epidemiology and Population Health Albert Einstein College of Medicine Associate Chief of Cardiology for

More information

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of

More information

Bayesian Tailored Testing and the Influence

Bayesian Tailored Testing and the Influence Bayesian Tailored Testing and the Influence of Item Bank Characteristics Carl J. Jensema Gallaudet College Owen s (1969) Bayesian tailored testing method is introduced along with a brief review of its

More information

Questionnaire on Anticipated Discrimination (QUAD)(1): is a self-complete measure comprising 14 items

Questionnaire on Anticipated Discrimination (QUAD)(1): is a self-complete measure comprising 14 items Online Supplement Data Supplement for Clement et al. (10.1176/appi.ps.201300448) Details of additional measures included in the analysis Questionnaire on Anticipated Discrimination (QUAD)(1): is a self-complete

More information

Review and Wrap-up! ESP 178 Applied Research Methods Calvin Thigpen 3/14/17 Adapted from presentation by Prof. Susan Handy

Review and Wrap-up! ESP 178 Applied Research Methods Calvin Thigpen 3/14/17 Adapted from presentation by Prof. Susan Handy Review and Wrap-up! ESP 178 Applied Research Methods Calvin Thigpen 3/14/17 Adapted from presentation by Prof. Susan Handy Final Proposals Read instructions carefully! Check Canvas for our comments on

More information

ANALYZING ALCOHOL BEHAVIOR IN SAN LUIS OBISPO COUNTY

ANALYZING ALCOHOL BEHAVIOR IN SAN LUIS OBISPO COUNTY ANALYZING ALCOHOL BEHAVIOR IN SAN LUIS OBISPO COUNTY Ariana Montes In Partial Fulfillment of the Requirements for the Degree Bachelor of Science, Statistics December 2014 TABLE OF CONTENTS Methods 2 Part

More information

NIH NEW CLINICAL TRIAL REQUIREMENTS AND FORMS-E

NIH NEW CLINICAL TRIAL REQUIREMENTS AND FORMS-E NIH NEW CLINICAL TRIAL REQUIREMENTS AND FORMS-E HOW DOES THE NIH DEFINE CLINICAL TRIAL AND HOW DO I KNOW IF MY STUDY IS ONE? 01-18-18 A clinical trial is defined by the NIH as a research study in which

More information

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such

More information

Partitioning genetic variance of metabolizable energy efficiency in dairy cows

Partitioning genetic variance of metabolizable energy efficiency in dairy cows Partitioning genetic variance of metabolizable energy efficiency in dairy cows Mehtiö, T., Negussie, E., Mäntysaari, P., Mäntysaari, E.A., Lidauer, M.H.* Natural Resources Institute Finland (Luke) EAAP

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

UCLA Social Support Inventory * (UCLA-SSI) Christine Dunkel-Schetter. Lawrence Feinstein. Jyllian Call. University of California, Los Angeles

UCLA Social Support Inventory * (UCLA-SSI) Christine Dunkel-Schetter. Lawrence Feinstein. Jyllian Call. University of California, Los Angeles UCLA Social Support Inventory * (UCLA-SSI) Christine Dunkel-Schetter Lawrence Feinstein Jyllian Call University of California, Los Angeles December, 1986 Further information on the instrument can be obtained

More information

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS

BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT QUADRATIC (U-SHAPED) REGRESSION MODELS 12 June 2012 Michael Wood University of Portsmouth Business School SBS Department, Richmond Building Portland

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

Simple Linear Regression the model, estimation and testing

Simple Linear Regression the model, estimation and testing Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.

More information

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN)

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN) UNIT 4 OTHER DESIGNS (CORRELATIONAL DESIGN AND COMPARATIVE DESIGN) Quasi Experimental Design Structure 4.0 Introduction 4.1 Objectives 4.2 Definition of Correlational Research Design 4.3 Types of Correlational

More information

Validity and Reliability. PDF Created with deskpdf PDF Writer - Trial ::

Validity and Reliability. PDF Created with deskpdf PDF Writer - Trial :: Validity and Reliability PDF Created with deskpdf PDF Writer - Trial :: http://www.docudesk.com Validity Is the translation from concept to operationalization accurately representing the underlying concept.

More information

Comparing Vertical and Horizontal Scoring of Open-Ended Questionnaires

Comparing Vertical and Horizontal Scoring of Open-Ended Questionnaires A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

ABOUT SUBSTANCE USE INTRODUCTION TO ASSESSMENT OPTIONS SUBSTANCE USE. 2/26/2018 PROMIS Substance Use Page 1

ABOUT SUBSTANCE USE INTRODUCTION TO ASSESSMENT OPTIONS SUBSTANCE USE. 2/26/2018 PROMIS Substance Use Page 1 SUBSTANCE USE A brief guide to the PROMIS Substance Use instruments: ADULT PROMIS Item Bank v1.0 Appeal of Substance Use (Past 3 Months) PROMIS Item Bank v1.0 Appeal of Substance Use (Past 30 days) PROMIS

More information

Threats and Analysis. Shawn Cole. Harvard Business School

Threats and Analysis. Shawn Cole. Harvard Business School Threats and Analysis Shawn Cole Harvard Business School Course Overview 1. What is Evaluation? 2. Outcomes, Impact, and Indicators 3. Why Randomize? 4. How to Randomize? 5. Sampling and Sample Size 6.

More information

Selecting the right patient for medication reviews

Selecting the right patient for medication reviews Selecting the right patient for medication reviews Prof dr Petra Denig, Clinical Pharmacy & Pharmacology, University Medical Center Groningen, the Netherlands 2 Who is in need of medication review: can

More information

Collecting & Making Sense of

Collecting & Making Sense of Collecting & Making Sense of Quantitative Data Deborah Eldredge, PhD, RN Director, Quality, Research & Magnet Recognition i Oregon Health & Science University Margo A. Halm, RN, PhD, ACNS-BC, FAHA Director,

More information

The University of North Carolina at Chapel Hill School of Social Work

The University of North Carolina at Chapel Hill School of Social Work The University of North Carolina at Chapel Hill School of Social Work SOWO 918: Applied Regression Analysis and Generalized Linear Models Spring Semester, 2014 Instructor Shenyang Guo, Ph.D., Room 524j,

More information

MULTIPLE REGRESSION OF CPS DATA

MULTIPLE REGRESSION OF CPS DATA MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear

More information

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK SUMMER 011 RE-EXAM PSYF11STAT - STATISTIK Full Name: Årskortnummer: Date: This exam is made up of three parts: Part 1 includes 30 multiple choice questions; Part includes 10 matching questions; and Part

More information

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick.

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick. Running head: INDIVIDUAL DIFFERENCES 1 Why to treat subjects as fixed effects James S. Adelman University of Warwick Zachary Estes Bocconi University Corresponding Author: James S. Adelman Department of

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression March 2013 Nancy Burns (nburns@isr.umich.edu) - University of Michigan From description to cause Group Sample Size Mean Health Status Standard Error Hospital 7,774 3.21.014

More information

Technical appendix Strengthening accountability through media in Bangladesh: final evaluation

Technical appendix Strengthening accountability through media in Bangladesh: final evaluation Technical appendix Strengthening accountability through media in Bangladesh: final evaluation July 2017 Research and Learning Contents Introduction... 3 1. Survey sampling methodology... 4 2. Regression

More information

1. The Role of Sample Survey Design

1. The Role of Sample Survey Design Vista's Approach to Sample Survey Design 1978, 1988, 2006, 2007, 2009 Joseph George Caldwell. All Rights Reserved. Posted at Internet website http://www.foundationwebsite.org. Updated 20 March 2009 (two

More information

CHAPTER 4: FINDINGS 4.1 Introduction This chapter includes five major sections. The first section reports descriptive statistics and discusses the

CHAPTER 4: FINDINGS 4.1 Introduction This chapter includes five major sections. The first section reports descriptive statistics and discusses the CHAPTER 4: FINDINGS 4.1 Introduction This chapter includes five major sections. The first section reports descriptive statistics and discusses the respondent s representativeness of the overall Earthwatch

More information

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2 UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2 DATE: 3 May 2017 MARKS: 75 ASSESSOR: Prof PJ Blignaut MODERATOR: Prof C de Villiers (UP) TIME: 2 hours

More information

Lesson: A Ten Minute Course in Epidemiology

Lesson: A Ten Minute Course in Epidemiology Lesson: A Ten Minute Course in Epidemiology This lesson investigates whether childhood circumcision reduces the risk of acquiring genital herpes in men. 1. To open the data we click on File>Example Data

More information