Final Exam Version A

Open Book and Notes
Your 4-digit code:
Staple the question sheets to your answers. Write your name only once on the back of this sheet.

Problem 1: (10 points) A popular method to isolate single cells for further characterization is the limiting dilution method. A cell suspension is diluted to an appropriately low concentration and 0.05 ml of this suspension is dispensed into each well of a 96-well plate. What should the cell number concentration (number of cells per ml) in the dilute suspension be such that not a single well out of the 96 contains two cells? At this concentration, how many wells are expected to have a single cell?

Problem 2: (10 points) Potassium mass fraction (% by weight) in fertilizer samples is measured in a commercial product as shown below:

21.9 23.3 22.1 22.3 24.7 24.5 24.0 24.1 24.2 26.5
23.8 25.3 24.8 24.5 27.8 24.9 27.2 25.1 25.5 23.7
26.5 22.0 26.7 25.2 23.1 22.8 25.2 23.7 24.6

a. Construct a stem-and-leaf diagram for these data and comment on the apparent distribution.
b. Find the 99% confidence intervals for the sample variance and standard deviation.

Problem 3: (5 points) The following data were obtained on total nitrogen concentration (in ppm) of water drawn from a lake being considered for use as a source of drinking water for a town:

0.045 0.055 0.049 0.028 0.025 0.039 0.023 0.045 0.038 0.035 0.026 0.059

Find a 95% one-sided confidence interval on the largest possible value for the mean nitrogen concentration. To be acceptable as a source of drinking water, the mean nitrogen concentration must lie below 0.05 ppm. Does this lake appear to meet this criterion?

Problem 4: (20 points) Another environmental monitoring company measures the total nitrogen concentration (ppm) in the same lake (as in the problem above) and obtains the following data:

0.042 0.023 0.049 0.036 0.045 0.025 0.048 0.035 0.048 0.043 0.044 0.052

(a) Can the variances for these two sets of data be considered equal or unequal?
(b) Are the two mean nitrogen concentrations the same or different at the 95% confidence level?

Problem 5: (15 points) The blood glucose concentration measured before breakfast in the normal human population is normally distributed with a mean of 100 mg/dl and a known standard deviation of 10 mg/dl. At 95% confidence, what are the acceptance limits of blood glucose concentration for a normal person? What is the P value of a diabetic patient's blood glucose measurement being 135 mg/dl? If this patient's blood glucose concentration is normally distributed with a mean of 140 mg/dl and a standard deviation of 20 mg/dl, what is the probability that this patient will be misdiagnosed as normal?
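The quantities asked for in Problem 5 can be set up numerically. The following is a minimal sketch, not an official solution: it uses the Version A numbers (mean 100, standard deviation 10 for the normal population; mean 140, standard deviation 20 for the patient), and it assumes that the acceptance limits are two-sided, that the P value is the one-sided upper-tail probability, and that "misdiagnosed as normal" means a reading that falls inside the acceptance limits.

```python
from statistics import NormalDist

# Values taken from Problem 5 (Version A), in the concentration units of the problem.
normal_population = NormalDist(mu=100, sigma=10)
patient = NormalDist(mu=140, sigma=20)

# Assumption: two-sided 95% acceptance limits (central 95% of the normal population).
lower_limit = normal_population.inv_cdf(0.025)
upper_limit = normal_population.inv_cdf(0.975)

# Assumption: one-sided P value, i.e. the probability that a normal person
# shows a reading of 135 or higher.
p_value_one_sided = 1.0 - normal_population.cdf(135)

# Assumption: the patient is misdiagnosed when a single reading lands
# inside the acceptance limits of the normal population.
p_misdiagnosed = patient.cdf(upper_limit) - patient.cdf(lower_limit)

print(f"acceptance limits: {lower_limit:.1f} to {upper_limit:.1f}")
print(f"one-sided P value for 135: {p_value_one_sided:.6f}")
print(f"probability of misdiagnosis: {p_misdiagnosed:.4f}")
```

A two-sided P value, if that reading of the question is preferred, is twice the one-sided value. For Versions B and C only the population standard deviation changes.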

Problem 6: (15 points) For the experimental data in the table below:

Y:   8  11   8  16  16  15  18
X:   7  10  11  15  18  25  30

The sums are calculated: $\sum x = 116$, $\sum y = 92$, $\sum x^2 = 2344$, $\sum y^2 = 1310$, $\sum xy = 1697$. Find the slope and the intercept of a linear regression model. Test for the significance of regression and calculate the adjusted $R^2$.

Problem 7: (15 points) For the experimental data in the table below, the multiple linear regression model $y = \beta_0 + \beta_1 x_1 + \beta_2 x_2$ is considered.

Y:   17.9  16.5  16.4  16.8  18.8  15.5  17.5  16.4  15.9  18.3
X1:  1.35  1.90  1.70  1.80  1.30  2.05  1.60  1.80  1.85  1.40
X2:  90    30    80    40    35    45    50    60    65    30

The covariance matrix is calculated and given below:

$$C = (X^{t}X)^{-1} = \begin{bmatrix} 6.071 & 3.026 & 0.017 \\ 3.026 & 1.739 & 0.0022 \\ 0.017 & 0.0022 & 0.00026 \end{bmatrix}$$

and $s^2 = 0.02$. Determine the coefficients of the above model and 95% confidence intervals for the parameter $\beta_2$.

Problem 8: (10 points) Construct a fractional factorial design table for six factors that requires only 8 experiments. Write down the complete defining relationship and the aliases from this design. Present the design table with a column showing the treatment conditions.

Happy Holidays!
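The arithmetic behind Problem 6 follows directly from the quoted sums. The sketch below is one way to set it up, using the Version A sums and the usual least-squares formulas for a straight line with an intercept. The variable names are illustrative, and the final comparison of the F statistic against a tabulated critical value (1 and n - 2 degrees of freedom) is left to the reader.

```python
# Sums quoted in Problem 6 (Version A); n is the number of (x, y) pairs.
n = 7
sum_x, sum_y = 116, 92
sum_x2, sum_y2, sum_xy = 2344, 1310, 1697

# Corrected sums of squares and cross-products.
s_xx = sum_x2 - sum_x**2 / n
s_yy = sum_y2 - sum_y**2 / n
s_xy = sum_xy - sum_x * sum_y / n

# Least-squares estimates for y = intercept + slope * x.
slope = s_xy / s_xx
intercept = sum_y / n - slope * sum_x / n

# Analysis of variance for the regression.
ss_regression = slope * s_xy
ss_error = s_yy - ss_regression
f_statistic = ss_regression / (ss_error / (n - 2))

# Coefficient of determination and its adjusted form for one predictor.
r_squared = ss_regression / s_yy
adjusted_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - 2)

print(f"slope = {slope:.4f}, intercept = {intercept:.4f}")
print(f"F = {f_statistic:.2f} on 1 and {n - 2} degrees of freedom")
print(f"R^2 = {r_squared:.4f}, adjusted R^2 = {adjusted_r_squared:.4f}")
```

Substituting the sums from Version B or Version C reuses the same steps unchanged.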

Final Exam Version B
Open Book and Notes
Your 4-digit code:
Staple the question sheets to your answers. Write your name only once on the back of this sheet.

Problem 1: (10 points) A popular method to isolate single cells for further characterization is the limiting dilution method. A cell suspension is diluted to an appropriately low concentration and 0.10 ml of this suspension is dispensed into each well of a 96-well plate. What should the cell number concentration (number of cells per ml) in the dilute suspension be such that not a single well out of the 96 contains two cells? At this concentration, how many wells are expected to have a single cell?

Problem 2: (10 points) Potassium mass fraction (% by weight) in fertilizer samples is measured in a commercial product as shown below:

24.8 24.5 27.8 24.9 27.2 25.1 25.5 23.7 23.8 25.3
22.3 24.7 24.5 24.0 24.1 24.2 26.5 21.9 23.3 22.1
26.5 22.0 26.7 25.2 23.1 22.8 25.2 23.7 24.6 25.0

a. Construct a stem-and-leaf diagram for these data and comment on the apparent distribution.
b. Find the 95% confidence intervals for the sample variance and standard deviation.

Problem 3: (5 points) The following data were obtained on total nitrogen concentration (in ppm) of water drawn from a lake being considered for use as a source of drinking water for a town:

0.042 0.023 0.049 0.036 0.045 0.025 0.048 0.035 0.048 0.043 0.044 0.052

Find a 95% one-sided confidence interval on the largest possible value for the mean nitrogen concentration. To be acceptable as a source of drinking water, the mean nitrogen concentration must lie below 0.05 ppm. Does this lake appear to meet this criterion?

Problem 4: (20 points) Another environmental monitoring company measures the total nitrogen concentration (ppm) in the same lake (as in the problem above) and obtains the following data:

0.045 0.055 0.049 0.028 0.025 0.039 0.023 0.045 0.038 0.035 0.026 0.059

(a) Can the variances for these two sets of data be considered equal or unequal?
(b) Are the two mean nitrogen concentrations the same or different at the 95% confidence level?

Problem 5: (15 points) The blood glucose concentration measured before breakfast in the normal human population is normally distributed with a mean of 100 mg/dl and a known standard deviation of 12 mg/dl. At 95% confidence, what are the acceptance limits of blood glucose concentration for a normal person? What is the P value of a diabetic patient's blood glucose measurement being 135 mg/dl? If this patient's blood glucose concentration is normally distributed with a mean of 140 mg/dl and a standard deviation of 20 mg/dl, what is the probability that this patient will be misdiagnosed as normal?
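The one-sided bound in Problem 3 can be sketched as follows, using the Version B data. This is an illustration under stated assumptions rather than a model answer: it assumes the measurements are approximately normal with unknown variance, so a Student t interval applies, and the critical value 1.796 is the standard tabulated one-sided 5% point for 11 degrees of freedom, hard-coded here because the Python standard library has no t distribution.

```python
from math import sqrt
from statistics import mean, stdev

# Total nitrogen concentrations (ppm) from Problem 3, Version B.
nitrogen_ppm = [0.042, 0.023, 0.049, 0.036, 0.045, 0.025,
                0.048, 0.035, 0.048, 0.043, 0.044, 0.052]

n = len(nitrogen_ppm)
sample_mean = mean(nitrogen_ppm)
sample_sd = stdev(nitrogen_ppm)  # sample standard deviation (n - 1 divisor)

# Assumption: tabulated one-sided critical value t(0.05, df = 11).
T_CRIT_ONE_SIDED_95_DF11 = 1.796

# Upper 95% confidence bound on the mean concentration.
upper_bound = sample_mean + T_CRIT_ONE_SIDED_95_DF11 * sample_sd / sqrt(n)

# The acceptance criterion from the problem: mean must lie below 0.05 ppm.
meets_criterion = upper_bound < 0.05

print(f"mean = {sample_mean:.5f} ppm, s = {sample_sd:.5f} ppm")
print(f"upper 95% bound = {upper_bound:.5f} ppm, meets criterion: {meets_criterion}")
```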

Problem 6: (15 points) For the experimental data in the table below:

Y:  10  11   8  16  16  15  18
X:   9  10  11  15  18  25  30

The sums are calculated: $\sum x = 118$, $\sum y = 94$, $\sum x^2 = 2376$, $\sum y^2 = 1346$, $\sum xy = 1731$. Find the slope and the intercept of a linear regression model. Test for the significance of regression and calculate the adjusted $R^2$.

Problem 7: (15 points) For the experimental data in the table below, the multiple linear regression model $y = \beta_0 + \beta_1 x_1 + \beta_2 x_2$ is considered.

Y:   16.5  16.5  16.4  16.8  18.8  15.5  17.5  16.4  15.9  18.3
X1:  1.35  1.90  1.70  1.80  1.30  2.05  1.60  1.80  1.85  1.40
X2:  80    30    80    40    35    45    50    60    65    30

The covariance matrix is calculated and given below:

$$C = (X^{t}X)^{-1} = \begin{bmatrix} 5.899 & 2.93 & 0.0173 \\ 2.93 & 1.723 & 0.00086 \\ 0.0173 & 0.00086 & 0.000308 \end{bmatrix}$$

and $s^2 = 0.135$. Determine the coefficients of the above model and 95% confidence intervals for the parameter $\beta_2$.

Problem 8: (10 points) Construct a fractional factorial design table for six factors that requires only 8 experiments. Write down the complete defining relationship and the aliases from this design. Present the design table with a column showing the treatment conditions.

Happy Holidays!
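Six factors in 8 runs corresponds to a 2^(6-3) fractional factorial, and the bookkeeping for Problem 8 can be checked mechanically. The sketch below assumes one common set of generators, D = AB, E = AC, F = BC; the exam does not prescribe these, and other generator choices give equally valid designs. The function names are illustrative only.

```python
from itertools import combinations, product

BASE_FACTORS = "ABC"
# Assumed generators (a common choice, not specified in the exam).
GENERATORS = {"D": "AB", "E": "AC", "F": "BC"}


def multiply_words(*words):
    """Multiply effect words modulo 2: letters appearing twice cancel."""
    letters = set()
    for word in words:
        letters ^= set(word)
    return "".join(sorted(letters))


def design_table():
    """Return (treatment label, factor settings) for each of the 8 runs."""
    rows = []
    for levels in product((-1, 1), repeat=len(BASE_FACTORS)):
        setting = dict(zip(BASE_FACTORS, levels))
        for factor, parents in GENERATORS.items():
            value = 1
            for parent in parents:
                value *= setting[parent]
            setting[factor] = value
        # Label: lowercase letters of the factors at the high level, "(1)" if none.
        label = "".join(f.lower() for f in sorted(setting) if setting[f] == 1) or "(1)"
        rows.append((label, setting))
    return rows


def defining_relation():
    """All words equal to the identity I: products of the generator words."""
    base_words = [multiply_words(f, parents) for f, parents in GENERATORS.items()]
    words = set()
    for size in range(1, len(base_words) + 1):
        for subset in combinations(base_words, size):
            words.add(multiply_words(*subset))
    return sorted(words, key=lambda w: (len(w), w))


def aliases(effect):
    """Effects confounded with the given effect under the defining relation."""
    return sorted(multiply_words(effect, word) for word in defining_relation())


rows = design_table()
relation = defining_relation()

print("I = " + " = ".join(relation))
for label, setting in rows:
    signs = " ".join("+" if setting[f] == 1 else "-" for f in "ABCDEF")
    print(f"{label:>7}  {signs}")
for effect in "ABCDEF":
    print(f"{effect} = " + " = ".join(aliases(effect)))
```

With these generators every main effect is aliased with two-factor interactions, so the design is resolution III.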

Final Exam Version C
Open Book and Notes
Your 4-digit code:
Staple the question sheets to your answers. Write your name only once on the back of this sheet.

Problem 1: (10 points) A popular method to isolate single cells for further characterization is the limiting dilution method. A cell suspension is diluted to an appropriately low concentration and 0.15 ml of this suspension is dispensed into each well of a 96-well plate. What should the cell number concentration (number of cells per ml) in the dilute suspension be such that not a single well out of the 96 contains two cells? At this concentration, how many wells are expected to have a single cell?

Problem 2: (10 points) Potassium mass fraction (% by weight) in fertilizer samples is measured in a commercial product as shown below:

24.5 24.0 24.1 24.2 26.5 21.9 23.3 22.1 22.3 24.7
25.1 25.5 23.7 23.8 25.3 24.8 24.5 27.8 24.9 27.2
26.7 25.2 23.1 22.8 25.2 23.7 24.6 26.5 22.0 25.0

a. Construct a stem-and-leaf diagram for these data and comment on the apparent distribution.
b. Find the 90% confidence intervals for the sample variance and standard deviation.

Problem 3: (5 points) The following data were obtained on total nitrogen concentration (in ppm) of water drawn from a lake being considered for use as a source of drinking water for a town:

0.039 0.023 0.045 0.038 0.035 0.026 0.059 0.045 0.055 0.049 0.028 0.025

Find a 95% one-sided confidence interval on the largest possible value for the mean nitrogen concentration. To be acceptable as a source of drinking water, the mean nitrogen concentration must lie below 0.05 ppm. Does this lake appear to meet this criterion?

Problem 4: (20 points) Another environmental monitoring company measures the total nitrogen concentration (ppm) in the same lake (as in the problem above) and obtains the following data:

0.025 0.048 0.035 0.048 0.043 0.044 0.052 0.042 0.023 0.049 0.036 0.045

(a) Can the variances for these two sets of data be considered equal or unequal?
(b) Are the two mean nitrogen concentrations the same or different at the 95% confidence level?

Problem 5: (15 points) The blood glucose concentration measured before breakfast in the normal human population is normally distributed with a mean of 100 mg/dl and a known standard deviation of 14 mg/dl. At 95% confidence, what are the acceptance limits of blood glucose concentration for a normal person? What is the P value of a diabetic patient's blood glucose measurement being 135 mg/dl? If this patient's blood glucose concentration is normally distributed with a mean of 140 mg/dl and a standard deviation of 20 mg/dl, what is the probability that this patient will be misdiagnosed as normal?

Problem 6: (15 points) For the experimental data in the table below:

Y:   8  11   8  16  16  15  23
X:   7   8  11  15  18  25  30

The sums are calculated: $\sum x = 114$, $\sum y = 97$, $\sum x^2 = 2308$, $\sum y^2 = 1515$, $\sum xy = 1825$. Find the slope and the intercept of a linear regression model. Test for the significance of regression and calculate the adjusted $R^2$.

Problem 7: (15 points) For the experimental data in the table below, the multiple linear regression model $y = \beta_0 + \beta_1 x_1 + \beta_2 x_2$ is considered.

Y:   17.9  17    16.4  16.8  18.8  15.5  17.5  16.4  15.9  18.3
X1:  1.8   1.9   1.7   1.8   1.3   2.05  1.6   1.8   1.85  1.4
X2:  90    30    80    40    35    45    50    60    65    30

The matrix calculations are mostly done and the results are provided below:

$$C = (X^{t}X)^{-1} = \begin{bmatrix} 6.402 & 3.59 & 0.0024 \\ 3.59 & 2.303 & 0.007 \\ 0.0024 & 0.007 & 0.000277 \end{bmatrix}$$

and $s^2 = 0.417$. Determine the coefficients of the above model and 95% confidence intervals for the parameter $\beta_2$.

Problem 8: (10 points) Construct a fractional factorial design table for six factors that requires only 8 experiments. Write down the complete defining relationship and the aliases from this design. Present the design table with a column showing the treatment conditions.

Happy Holidays!