On Regression Analysis Using Bivariate Extreme Ranked Set Sampling


Atsu S. S. Dorvlo, Walid Abu-Dayyeh, Obaid Alsaidy

Abstract- Many forms of ranked set samples have been introduced recently for estimating the population mean and other parameters. Ranked set samples have been shown to be more efficient than simple random samples in many situations. For the estimation of multiple characteristics, the bivariate ranked set sample was shown to be more efficient than the bivariate simple random sample and the usual univariate ranked set sample for estimating population means. In this paper, the bivariate extreme ranked set sample is introduced and its effect on regression analysis, when the regressors are assumed to be random, is investigated. A simulation study is used to compare the efficiency of the estimators and to study the impact of these sampling plans on regression analysis in general. It is shown that the bivariate extreme ranked set sample gives unbiased and more efficient estimators of the regression model parameters than those obtained using the bivariate simple random sample, the univariate ranked set sample and the bivariate ranked set sample, for the same number of quantified observations.

Keywords- Bivariate Extreme Ranked Set Sampling, Bivariate Ranked Set Sampling, Bivariate Simple Random Sampling, Random Regressors, Ranked Set Sampling, Regression Analysis.

I. INTRODUCTION

A linear regression model attempts to explain the relationship between two or more variables using a straight line. For example, in a chemical process where the yield is regressed on the temperature, the temperature cannot be assumed to be constant.
When both $X$ and $Y$ are considered random variables, the usual results on estimation, testing and prediction obtained for the constant-$X$ case still hold if (i) the conditional distributions of the $Y_i$ given $X_i$ are normal and independent, with conditional means $\alpha + \beta X_i$ and conditional variance $\sigma^2$, and (ii) the $X_i$ are independent random variables whose probability distribution does not involve the parameters $\alpha$, $\beta$ and $\sigma^2$ [6]. The major modification occurs in the interpretation of confidence coefficients and specified risks of error. When $X$ is random, these refer to repeated sampling of pairs of $(X_i, Y_i)$ values, where the $X_i$ values as well as the $Y_i$ values change from sample to sample. In any case, the observed data are used to obtain estimates of the unknown parameters. The method of estimation used in this paper is the ordinary least squares method.

In practice, data on these variables are collected on each of a number of units or cases using the simple random sampling (SRS) technique. As an alternative to SRS, the ranked set sampling (RSS) procedure, introduced by McIntyre [3], has been shown to improve the efficiency of estimating a population mean. The RSS procedure can be described as follows. Identify a group of sampling units from the target population. Then randomly partition the group into disjoint subsets, each of a pre-assigned size $m$. In most practical situations, $m$ will be 2, 3 or 4. Then rank each subset by a suitable method, such as prior information, visual inspection or the experimenter's own judgment. The $i$th judgment order statistic from the $i$th subset, $i = 1, \ldots, m$, is then quantified, say $X_{i(i)}$. Therefore, $X_{1(1)}, X_{2(2)}, \ldots, X_{m(m)}$ denotes an RSS. This represents one cycle. The whole procedure can be repeated $c$ times to obtain an RSS of size $n = cm$ [2, 6, 7].

For the estimation of multiple characteristics, the bivariate ranked set sample (BVRSS) was introduced by Al-Saleh and Zheng [1]. They indicated that the BVRSS procedure could easily be extended to multivariate RSS. Based on their procedure, a BVRSS can be obtained as follows. Suppose $(X, Y)$ is a bivariate random vector with joint probability density function (jpdf) $f(x, y)$.

1. A random sample of size $m^4$ is identified from the population and randomly allocated into $m^2$ pools of size $m^2$ each, where each pool is a square matrix with $m$ rows and $m$ columns.
2. In the first pool, identify by judgment the minimum value with respect to the first characteristic for each of the $m$ rows.
3. For the $m$ minima obtained in Step 2, choose the pair that corresponds to the minimum value of the second characteristic, identified by judgment, for actual quantification. This pair, which carries the label (1, 1), is the first element of the BVRSS.
4. Repeat Steps 2 and 3 for the second pool, but in Step 3 choose for actual quantification the pair that corresponds to the second minimum value with respect to the second characteristic. This pair carries the label (1, 2).
5. The process continues until the pair with label $(m, m)$ is obtained from the $(m^2)$th (last) pool. The above procedure produces a BVRSS of size $m^2$, with observations denoted by $(X_{[i](j)}, Y_{(i)[j]})$, $i = 1, 2, \ldots, m$ and $j = 1, 2, \ldots, m$.
6. The procedure can be repeated $r$ times to obtain a sample of size $n = m^2 r$, denoted by $(X_{[i](j)k}, Y_{(i)[j]k})$, $i = 1, 2, \ldots, m$, $j = 1, 2, \ldots, m$ and $k = 1, 2, \ldots, r$.

We now introduce the bivariate extreme ranked set sample. For any $m$, obtain the four pairs $(X_{[1](1)}, Y_{(1)[1]})$, $(X_{[1](m)}, Y_{(1)[m]})$, $(X_{[m](1)}, Y_{(m)[1]})$ and $(X_{[m](m)}, Y_{(m)[m]})$, and repeat $q$ times so that $n = 4q$. Denote this sample by $(X_{[i](j)k}, Y_{(i)[j]k})$, $i, j \in \{1, m\}$, $k = 1, 2, \ldots, q$. This is our bivariate extreme ranked set sample (BVERSS).

Muttlak [4] used RSS to estimate the parameters of the simple regression model, assuming that the $X$ values are known constants. However, under his model assumptions there is no improvement in estimating the model parameters. To achieve any improvement in the estimated model parameters, both $Y$ and $X$ must be assumed to be random variables. Moreover, he assumed that ranking is done on one variable (say $X$) only. Therefore, $X$ can be perfectly ranked, but the ranking of $Y$ will be subject to error. Samawi and Ababneh [8] used RSS, ranking only on the variable $X$, to investigate the effect of RSS on regression analysis in general. They found that using RSS for model parameter estimation is only slightly more efficient than using SRS. However, Samawi and Abu-Dayyeh [10] showed that the extreme ranked set sampling (ERSS) scheme [9] improves the estimation of the model parameters more than RSS does. Also, Samawi and Al-Saleh [11] used BVRSS to investigate its effect on regression analysis in general. They found that using BVRSS for model parameter estimation is only slightly more efficient than using SRS and RSS.

In this paper, the effect of BVERSS on regression analysis, assuming that both $Y$ and $X$ are random variables, is investigated. Note that in BVRSS and BVERSS, ranking is done on both variables $X$ and $Y$ simultaneously. In Section 2, the general setup of the problem, the notation and simple linear regression using BVERSS are discussed. Inference for the simple regression model parameters ($\alpha$ and $\beta$) using asymptotic results is discussed in Section 3. In Section 4, the effect of using BVERSS on residual analysis and model diagnosis is illustrated using real data.

Special Issue of the International Journal of the Computer, the Internet and Management, Vol. 19 No. SP1, June,

II. LINEAR REGRESSION

A. Some Notations

The following notations will be used throughout this paper: $E(X) = \mu_X$, ...

B. Simple Linear Regression Using BVERSS

Let $(X_{[i](j)k}, Y_{(i)[j]k})$, $i, j \in \{1, m\}$, $k = 1, 2, \ldots, q$, be a BVERSS from the population with size $n = 4q$. The simple regression model of the two variables $Y$ and $X$ is defined by

$$Y_{(i)[j]k} = \alpha + \beta X_{[i](j)k} + \epsilon_{ijk},$$

where $\alpha$ is the model intercept, $\beta$ is the model slope and $\epsilon_{ijk}$ is the random error. The assumptions needed here for the purpose of parameter estimation are that the errors have mean zero and finite variance and are uncorrelated. Also, $X_{[i](j)k}$ and $\epsilon_{ijk}$ are independent. Then the least squares estimators of $\alpha$ and $\beta$ are given by

$$\hat{\beta} = \frac{\sum_{i}\sum_{j}\sum_{k=1}^{q} (X_{[i](j)k} - \bar{X})(Y_{(i)[j]k} - \bar{Y})}{\sum_{i}\sum_{j}\sum_{k=1}^{q} (X_{[i](j)k} - \bar{X})^2}, \qquad \hat{\alpha} = \bar{Y} - \hat{\beta}\,\bar{X},$$

where

$$\bar{X} = \frac{1}{n}\sum_{i}\sum_{j}\sum_{k=1}^{q} X_{[i](j)k}, \qquad \bar{Y} = \frac{1}{n}\sum_{i}\sum_{j}\sum_{k=1}^{q} Y_{(i)[j]k},$$

and $i$ and $j$ each range over $\{1, m\}$. Then the fitted model is

$$\hat{Y} = \hat{\alpha} + \hat{\beta} X.$$

Also, a consistent unbiased estimator for ...
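The BVERSS selection described in the Introduction and the least squares fit of this section can be illustrated with a minimal Python sketch. It is not the authors' code: it assumes perfect ranking on both variables, a standard bivariate normal population with correlation `rho`, and the standard closed-form least squares expressions. The function names (`draw_pairs`, `select_labelled_pair`, `bverss_sample`, `ols_fit`) are hypothetical.

```python
import numpy as np


def draw_pairs(rng, size, rho):
    """Draw (X, Y) pairs from a standard bivariate normal (an assumption)."""
    cov = [[1.0, rho], [rho, 1.0]]
    return rng.multivariate_normal([0.0, 0.0], cov, size=size)


def select_labelled_pair(pool, i, j):
    """Return the pair with label (i, j), 1-based, from an (m, m, 2) pool.

    Each row contributes the pair with the i-th smallest X; among those m
    pairs, the one with the j-th smallest Y is kept (perfect ranking).
    """
    m = pool.shape[0]
    cols = np.argsort(pool[:, :, 0], axis=1)[:, i - 1]  # rank X within each row
    row_picks = pool[np.arange(m), cols]                # m candidate pairs
    return row_picks[np.argsort(row_picks[:, 1])[j - 1]]


def bverss_sample(rng, m, q, rho):
    """Quantify the four extreme labels from fresh pools, repeated q times."""
    labels = [(1, 1), (1, m), (m, 1), (m, m)]
    sample = []
    for _ in range(q):
        for i, j in labels:
            pool = draw_pairs(rng, m * m, rho).reshape(m, m, 2)
            sample.append(select_labelled_pair(pool, i, j))
    return np.array(sample)  # shape (4q, 2)


def ols_fit(sample):
    """Closed-form least squares intercept and slope of Y on X."""
    x, y = sample[:, 0], sample[:, 1]
    beta = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
    alpha = y.mean() - beta * x.mean()
    return alpha, beta


if __name__ == "__main__":
    rng = np.random.default_rng(2024)
    sample = bverss_sample(rng, m=3, q=5, rho=0.8)
    print(ols_fit(sample))
```

Under the assumed population the true intercept is 0 and the true slope equals `rho`, so the fitted values can be compared against those targets.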

Using the basic properties of conditional moments and the results of Samawi and Al-Saleh [11], we have the following theorem. The proof is straightforward and is not presented here.

Theorem 1: Assuming the conditions of the regression model above, then ...

III. NUMERICAL COMPUTATIONS

From Theorem 1 and the above results, we can derive the efficiencies of the estimators of $\alpha$ and $\beta$ using BVERSS relative to the estimators using BVSRS as follows: ...

The efficiencies of the estimators of $\alpha$ and $\beta$ using RSS, BVRSS and BVERSS relative to the estimators using BVSRS are defined as above. Assuming that $(X, Y)$ follows a bivariate normal distribution, the performance of the simple regression parameter estimators using BVSRS, RSS, BVRSS and BVERSS is investigated for $m = 2, 3, 4$; $r = 1$ to $4$; and $\rho = 0.2, 0.5, 0.6, 0.8, 0.9$, based on an extensive simulation study that evaluates these efficiencies. In this case, the total sample size is $n = m^2 r$ for all sampling schemes used. When using RSS, ranking is done on the concomitant variable $X$ only. The simulation results are given in Tables I and II.

IV. RESULTS AND CONCLUSION

From the simulation results we may conclude the following:

(1) RSS, BVRSS and BVERSS are all more efficient than BVSRS.
(2) BVERSS is the most efficient, followed by BVRSS, which is followed by RSS.
(3) For fixed $r$ and fixed $\rho$, the efficiencies of BVRSS and RSS decrease while that of BVERSS increases as $m$ increases.
(4) For fixed $m$ and fixed $\rho$, the efficiencies decrease as $r$ increases.

TABLE I. THE RELATIVE PRECISION OF THE INTERCEPT AND SLOPE ESTIMATORS

TABLE II. THE RELATIVE PRECISION OF THE INTERCEPT AND SLOPE ESTIMATORS

REFERENCES

[1] Al-Saleh, M. F. and Zheng, G. (2002). Estimation of bivariate characteristics using ranked set sampling. Australian and New Zealand J. of Statistics, 44.
[2] Kaur, A., Patil, G., Sinha, A., and Taillie, C. (1995). Ranked set sampling: an annotated bibliography. Environmental and Ecological Statistics, 2.
[3] McIntyre, G. A. (1952). A method of unbiased selective sampling, using ranked sets. Australian J. Agricultural Research, 3.
[4] Muttlak, H. A. (1995). Parameters estimation in a simple linear regression using rank set sampling. Biometrical J., 37, 7.
[5] Muttlak, H. A. and Al-Saleh, M. F. (2000). Recent developments on ranked set sampling. PJS, Vol. 16, Issue 3.
[6] Neter, J., Wasserman, W., and Kutner, M. (1985). Applied Linear Statistical Models. Richard D. Irwin, Inc., Homewood, Illinois.
[7] Patil, G. P., Sinha, A. K., and Taillie, C. (1999). Ranked set sampling: A bibliography. Environ. Ecolog. Statist., 6.
[8] Samawi, H. M. and Ababneh, F. (2001). On regression analysis using ranked set sample. Journal of Statistical Research (JSR), 35 (2).
[9] Samawi, H. M., Ahmed, M. S., and Abu-Dayyeh, W. (1996). Estimating the population mean using extreme ranked set sampling. Biometrical J., 38, 5.
[10] Samawi, H. M. and Abu-Dayyeh, W. (2002). On regression analysis using extreme ranked set samples. International J. of Information and Management Sciences, Vol. 13, Number 3.
[11] Samawi, H. M. and Al-Saleh, M. F. (2002). On regression analysis using bivariate ranked set samples. Metron, Vol. LX, Issue 3.
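The simulation design of Section III can be sketched in Python as follows. This is an illustrative sketch, not the authors' program. It assumes perfect ranking, a standard bivariate normal population (so the true slope equals `rho`), and it takes the relative efficiency of the slope estimator to be the ratio of Monte Carlo mean squared errors, BVSRS over BVERSS, at equal sample size $n = 4q$. That definition is an assumption made for the sketch. All function names are hypothetical.

```python
import numpy as np


def ols_slope(x, y):
    """Closed-form least squares slope of y on x."""
    xc = x - x.mean()
    return np.sum(xc * (y - y.mean())) / np.sum(xc ** 2)


def bvsrs(rng, n, rho):
    """Bivariate simple random sample of n pairs, shape (n, 2)."""
    return rng.multivariate_normal([0.0, 0.0], [[1.0, rho], [rho, 1.0]], size=n)


def bverss(rng, m, q, rho):
    """BVERSS of size 4q under perfect ranking, shape (4q, 2)."""
    corners = [(0, 0), (0, m - 1), (m - 1, 0), (m - 1, m - 1)]  # 0-based labels
    rows = np.arange(m)
    picked = []
    for _ in range(q):
        for i, j in corners:
            pool = bvsrs(rng, m * m, rho).reshape(m, m, 2)  # fresh pool per label
            cols = np.argsort(pool[:, :, 0], axis=1)[:, i]  # rank X within rows
            cand = pool[rows, cols]                         # m candidate pairs
            picked.append(cand[np.argsort(cand[:, 1])[j]])  # rank Y among them
    return np.array(picked)


def slope_efficiency(m, q, rho, reps=2000, seed=0):
    """Monte Carlo MSE ratio (BVSRS / BVERSS) for the slope estimator."""
    rng = np.random.default_rng(seed)
    n = 4 * q  # same number of quantified pairs in both schemes
    err_srs = np.empty(reps)
    err_ers = np.empty(reps)
    for t in range(reps):
        s = bvsrs(rng, n, rho)
        e = bverss(rng, m, q, rho)
        # True slope is rho for standard normal margins (assumed population).
        err_srs[t] = ols_slope(s[:, 0], s[:, 1]) - rho
        err_ers[t] = ols_slope(e[:, 0], e[:, 1]) - rho
    return np.mean(err_srs ** 2) / np.mean(err_ers ** 2)


if __name__ == "__main__":
    for rho in (0.2, 0.5, 0.8):
        print(rho, slope_efficiency(m=3, q=3, rho=rho))
```

A ratio above 1 favours BVERSS under this definition. The intercept can be treated the same way by replacing the slope error with the error of the fitted intercept around 0.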

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple

More information

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes Content Quantifying association between continuous variables. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General

More information

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS - CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS SECOND EDITION Raymond H. Myers Virginia Polytechnic Institute and State university 1 ~l~~l~l~~~~~~~l!~ ~~~~~l~/ll~~ Donated by Duxbury o Thomson Learning,,

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

12.1 Inference for Linear Regression. Introduction

12.1 Inference for Linear Regression. Introduction 12.1 Inference for Linear Regression vocab examples Introduction Many people believe that students learn better if they sit closer to the front of the classroom. Does sitting closer cause higher achievement,

More information

Simple Linear Regression the model, estimation and testing

Simple Linear Regression the model, estimation and testing Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.

More information

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model Nevitt & Tam A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model Jonathan Nevitt, University of Maryland, College Park Hak P. Tam, National Taiwan Normal University

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis Basic Concept: Extend the simple regression model to include additional explanatory variables: Y = β 0 + β1x1 + β2x2 +... + βp-1xp + ε p = (number of independent variables

More information

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences p. 1/3 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences Lecture 14: Adjusting

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

An informal analysis of multilevel variance

An informal analysis of multilevel variance APPENDIX 11A An informal analysis of multilevel Imagine we are studying the blood pressure of a number of individuals (level 1) from different neighbourhoods (level 2) in the same city. We start by doing

More information

14. Linear Mixed-Effects Models for Data from Split-Plot Experiments

14. Linear Mixed-Effects Models for Data from Split-Plot Experiments 14. Linear Mixed-Effects Models for Data from Split-Plot Experiments opyright c 219 Dan Nettleton (Iowa State University) 14. Statistics 51 1 / 3 Start with a Field Field opyright c 219 Dan Nettleton (Iowa

More information

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Marianne (Marnie) Bertolet Department of Statistics Carnegie Mellon University Abstract Linear mixed-effects (LME)

More information

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

SCATTER PLOTS AND TREND LINES

SCATTER PLOTS AND TREND LINES 1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

EC352 Econometric Methods: Week 07

EC352 Econometric Methods: Week 07 EC352 Econometric Methods: Week 07 Gordon Kemp Department of Economics, University of Essex 1 / 25 Outline Panel Data (continued) Random Eects Estimation and Clustering Dynamic Models Validity & Threats

More information

Chapter 1: Explaining Behavior

Chapter 1: Explaining Behavior Chapter 1: Explaining Behavior GOAL OF SCIENCE is to generate explanations for various puzzling natural phenomenon. - Generate general laws of behavior (psychology) RESEARCH: principle method for acquiring

More information

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests

Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests Empirical assessment of univariate and bivariate meta-analyses for comparing the accuracy of diagnostic tests Yemisi Takwoingi, Richard Riley and Jon Deeks Outline Rationale Methods Findings Summary Motivating

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK

REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK Agatha POPESCU University of Agricultural Sciences and Veterinary Medicine Bucharest, 59 Marasti, District 1, 11464,

More information

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints p.1/40 Data Collected in Phase IIb/III Vaccine Trial Longitudinal

More information

The Statistical Analysis of Failure Time Data

The Statistical Analysis of Failure Time Data The Statistical Analysis of Failure Time Data Second Edition JOHN D. KALBFLEISCH ROSS L. PRENTICE iwiley- 'INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents Preface xi 1. Introduction 1 1.1

More information

10. LINEAR REGRESSION AND CORRELATION

10. LINEAR REGRESSION AND CORRELATION 1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have

More information

Comparison of Some Almost Unbiased Ratio Estimators

Comparison of Some Almost Unbiased Ratio Estimators Comparison of Some Almost Unbiased Ratio Estimators Priyaranjan Dash Department of Statistics, Tripura University, India. ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

Example 7.2. Autocorrelation. Pilar González and Susan Orbe. Dpt. Applied Economics III (Econometrics and Statistics)

Example 7.2. Autocorrelation. Pilar González and Susan Orbe. Dpt. Applied Economics III (Econometrics and Statistics) Example 7.2 Autocorrelation Pilar González and Susan Orbe Dpt. Applied Economics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Example 7.2. Autocorrelation 1 / 17 Questions.

More information

Understanding the cluster randomised crossover design: a graphical illustration of the components of variation and a sample size tutorial

Understanding the cluster randomised crossover design: a graphical illustration of the components of variation and a sample size tutorial Arnup et al. Trials (2017) 18:381 DOI 10.1186/s13063-017-2113-2 METHODOLOGY Open Access Understanding the cluster randomised crossover design: a graphical illustration of the components of variation and

More information

A re-randomisation design for clinical trials

A re-randomisation design for clinical trials Kahan et al. BMC Medical Research Methodology (2015) 15:96 DOI 10.1186/s12874-015-0082-2 RESEARCH ARTICLE Open Access A re-randomisation design for clinical trials Brennan C Kahan 1*, Andrew B Forbes 2,

More information

Statistical Tests of Agreement Based on Non-Standard Data

Statistical Tests of Agreement Based on Non-Standard Data Statistical Tests of Agreement Based on Non-Standard Data Elizabeth Stanwyck Bimal Sinha Department of Mathematics and Statistics University of Maryland, Baltimore County Barry Nussbaum Office of Environmental

More information

Decomposition of the Genotypic Value

Decomposition of the Genotypic Value Decomposition of the Genotypic Value 1 / 17 Partitioning of Phenotypic Values We introduced the general model of Y = G + E in the first lecture, where Y is the phenotypic value, G is the genotypic value,

More information

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug? MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference

More information

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

Previously, when making inferences about the population mean,, we were assuming the following simple conditions: Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean,, we were assuming the following simple conditions: (1) Our data (observations)

More information

Class 7 Everything is Related

Class 7 Everything is Related Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational

More information

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Bob Obenchain, Risk Benefit Statistics, August 2015 Our motivation for using a Cut-Point

More information

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences. SPRING GROVE AREA SCHOOL DISTRICT PLANNED COURSE OVERVIEW Course Title: Basic Introductory Statistics Grade Level(s): 11-12 Units of Credit: 1 Classification: Elective Length of Course: 30 cycles Periods

More information
