The Analysis of 2 K Contingency Tables with Different Statistical Approaches

Size: px
Start display at page:

Download "The Analysis of 2 K Contingency Tables with Different Statistical Approaches"

Transcription

1 The Analysis of 2 K Contingency Tables with Different tatistical Approaches Hassan alah M. Thebes Higher Institute for Management and Information Technology drhassn_242@yahoo.com Abstract The main objective of this paper is to analyze the 2 K contingency tables with three statistical approaches (regression analysis, multinomial logistic regression analysis and linguistic fuzzy model). We compare these methods for evaluating the association between a risk factor and a disease. These statistical methods measure the association between the numeric levels of a risk factor and a disease in different ways. They have been applied to a set of data of childhood cancer risk from prenatal x-ray exposure. Regression and multinomial logistic regression analyses show similar results for a data set of children whereas the fuzzy analysis yields a different result. Keywords Contingency table, Multinomial logistic regression, Linguistic fuzzy model, Data of childhood cancer, X-ray exposure. 1. Introduction The 2 K contingency table is an important extension of 2 2 table which is a basic tool for epidemiology investigation. In 2 K contingency table, the presence or absence of a disease is recorded at K levels of a risk factor. The 2 K contingency table can be viewed from the perspective of a K - level variable (risk factor) or from the perspective of a binary variable (disease) [4]. In this paper, we use three different statistical approaches for analyzing the 2 K contingency table; regression analysis, multinomial logistic regression analysis and linguistic fuzzy model. Data on malignancies in children under 10 years of age and information on the mother's exposure to x-ray provide an example for the discussion and analysis of a 2 K table [2] and [3]. Table 1 shows the numbers of prenatal x-rays received by mothers of children with a malignant disease, and a series of controls (healthy children of the same age, sex, and similar areas of residence)

2 Table 1 Observed numbers of cases and controls by recorded number of maternal x-ray films during pregnancy * for simplicity, the values greater than five were coded as Regression Analysis A 2 K contingency table can be viewed as a set of K pairs of values. An ) estimated probability is generated for each value of X producing K pairs x j, p ) ( j where p j is the estimated probability that Y = 0 associated with each level represented by x j. In order to analyze the K pairs of values, a straight line which summarizes the relationship between X and Y is estimated and the slope of the estimated line is used as a summary of the relationship between X and Y. For a simple linear regression, three quantities are necessary to derive the basic statistical measures: the sum of squares for X ), the sum of squares for Y ), and the sum of cross-products for X and Y ( Films Total Cases Y = Controls Y = Total Proportion ( yy ( xy ). These expressions calculated from a 2 K contingency table are [7]: = k j = 1 n. j ( x j v x) 2, where v x = n x n. j j / yy = n n / n (2) k v v v xy = ( x1 x2 ) yy where xi = n Now, the regression coefficient can be estimated as b y / x = xy / (4) and the variance of the estimated regression coefficient can be estimated as ) var( b y / x ) = yy / ( n 1) (5) On the other hand, a correlation coefficient measuring the degree of linear association between X and Y calculated in the usual way is xy r xy = (6) yy j= 1 ij x j / n i. * (1) (3) 2

3 For the data in Table 1, these quantities for the malignant disease are: = , yy = , xy = Using (4) and (5), the estimated coefficient of regression and its variance are and respectively. The correlation between the case/control status and the x-ray exposure is A 95 % confidence interval of the association coefficient is (0.0577, ). Moreover, the expected numbers of cases and controls by recorded number of maternal x-ray films during pregnancy are estimated using an estimated linear response p = x as shown in Table 2 below. i Table 2 Expected numbers of cases and controls by recorded number of maternal x-ray films during pregnancy Films Total Cases Y = Controls Y = Total Proportion * for simplicity, the values greater than five were coded as 5. The observed and expected proportions of cases shown in Table 1 and Table 2 are plotted in Figure 1 below. 0.8 P Obs. P Exp. P X-ray Figure 1: Proportion of cases childhood cancer for exposure to maternal x-ray during pregnancy 3

4 Figure 1 indicates that the distribution of number of the cases is better fitted and the estimated line is good. An additional assessment of the dose-response relationship is accomplished by partitioning the total chi-square value. The chi-square statistic that measures homogeneity (H 0 : the proportion of cases is the same regardless of the degree of maternal x-ray exposure) is χ 2 = A chi-square value of this magnitude indicates the presence of some sort of nonhomogeneous pattern of response ( ρ value =0. 001) [7]. 3. Multinomial Logistic Regression Analysis Multinomial logistic regression analysis is useful for situations in which we want to be able to classify subjects based on values of a set of predictor variables. This type of regression is similar to logistic regression, but it is more general. In regression analysis, we use the numeric levels of a risk factor (the number of x-ray exposures) as an independent variable and the corresponding proportion of cases as dependent variable, but in multinomial logistic regression there is need to consider a large number of records (frequency) to establish an association between risk factor and a disease [5]. In order to analyze a 2 K contingency table using multinomial logistic regression analysis, the data in Table 1 were processed using PWIN and the numeric results were similar as those obtained by regression analysis [1]. That is the association coefficient between risk factor and disease is with standard error of A 95 % confidence interval of the association coefficient is (0.0481, ). 4. Fuzzy analysis In bioscience there are several levels of uncertainty, vagueness and imprecision, particularly in the medical and epidemiological areas, where the best and most useful description of disease entities often comprise linguistic terms that are inevitably vague. The theory of fuzzy logic has been developed to deal with the concept of partial truth values, ranging from completely true to completely false, and has become a powerful tool for dealing with imprecision and uncertainty aiming at tractability, robustness and low-cost solutions for real-world problems. These features and the ability to deal with linguistic terms could explain the increasing number of works applying fuzzy logic in biomedicine problems. In fact, the theory of fuzzy sets has become an important mathematical approach in diagnosis system, treatment of medical images and, more recently in epidemiology and public health [5] and [6]. For more knowledge about fuzzy logic theory the book by Yen and Langari [8] is recommended. A linguistic fuzzy model consists of a set of fuzzy rules and an inference method. The most common inference method is the Minimum of Mamdani, whose output is a fuzzy set. The fuzzy linguistic model to evaluate a childhood cancer risk 4

5 from prenatal x-ray exposure has two antecedents: malignancies in children under 10 years of age and information on the mother's exposure to x-ray. The model elaborated five fuzzy sets to the variable number of x-ray films that exposure to the mothers (very low, low, medium, high and very high) and two fuzzy sets for the variable number of children with a malignant disease and a series of controls ( healthy children of the same age) (cases and controls). The consequence of the model is the association between x-ray films and the malignancies in children under 10 years of age. We considered three fuzzy sets for this linguistic variable; weak, medium and strong. The base rules consist of the following ones: 1. If x-ray is very low and case then association is weak. 2. If x-ray is low and case then association is weak. 3. If x-ray is medium and case then association is weak. 4. If x-ray is high and case then association is medium. 5. If x-ray is very high and case then association is strong The association between the childrens' malignancies and x-ray films is determined by inference of the fuzzy rule set, and defuzzifiction of the fuzzy output. The system was run in a C++ language. Fuzzy sets to input variable number of x-ray and to output variable of association between malignancies children and x-ray are displayed in Figure 2 and Figure 3 below. Membership function 1 VLOW LOW MEDIUM HIGH VHIGH X Ray Figure 2: Fuzzy sets to input variable number of X-ray 5

6 Membership function WEAK MEDIUM TRONG Figure 3: Fuzzy sets to output variable of Association between malignancies children and X-Ray We notice that by combining all possible inputs it is possible to build 10 rules but, it only 5 rules were considered because some situations that can not occur. For example, it is impossible, for the mothers who were not exposed to x-ray, the children have a disease (if they have; this occurs for another reason). Although this is mathematically possible, it was subtracted from the rule bases, reducing the number of rules. The fuzzy set related to linguistic variables is presented in Figure 2. The membership fucntion represents the degree of compatibility of some input to all categpries. In fact, the membership degree represents the possibility that the input belongs to the set. Figure 3 shows the memebership function of the output. It is clear that the association increases monotonically when the number of x-ray films increases. It was 16 % for weak, 17 % for medium and 18 % for strong associations respectively. Also the weighted mean of the association between X-ray and the disease was and the standard error was A 95 % confidence interval of the association coefficient is ( , ). 6

7 Discussion In regression analysis, we use the numeric levels of a risk factor (the number of x-ray exposures) as an independent variable and the corresponding proportion of cases as a dependent variable. Furthermore, in multinomial logistic regression there is need for a considerable number of records (frequency) to establish an association between risk factor and a disease. In a fuzzy linguistic model, there is not such need. ( b y / x The point biserial correlation coefficient ( r xy ), the regression coefficient ) are interrelated when calculated from a 2 K table. For example, each has an expected value of zero when the variables X and Y are unrelated. The two statistics measure the association between the numeric levels of a risk factor and a disease in different ways but, in terms of probability, lead to the same inference. A measure of association assesses the strength of a relationship, while a statistical test gives an idea of the likelihood that such an association occurs by chance where both regression and multinomial logistic regression give similar results, the fuzzy model gives rather different results for evaluating the association between the risk factor and the disease (ee: Table 3). Table 3 Comparison between the results of the three methods Regression Multinomial logistic regression Fuzzy model Association coefficient tandard error % CI (.0577,.0683) (.0481,.0579) (.1178, 1322) ρ value We notice from Table 3 that the three statistical methods (regression, multinomial logistic regression and fuzzy model) for evaluating the association between risk factor and a disease show similar results for a data set of children, but the results from fuzzy model are rather different. References [1] Ashour,. K. and alem,. A. (2005). tatistical Presentation and Analysis using PWIN, Part two: Advanced Applied tatistics. Cairo University: IR. 7

8 [2] Bithell, J. F., and teward, M. A. (1975). Prenatal Irradiation and childhood Malignancy: A Review of British Data from the Oxford tudy. Brit. J. of Cancer (31): [3] Breslow, N. E., and Day, N. E. (1987). tatistical Methods in Cancer Research, Volume II. Oxford University Press. Oxford, UK. [4] Hardeo ahai and Anwer Khurshid (1996). tatistics in Epidemiology, Methods, Techniques and Applications. CRC Press, New York. [5] Luiz Fernando C. Nascimento and Neli Regina Ortega (2002). Fuzzy Linguistic Model for Evaluating the Risk of Neonatal Death. Rev aude Publica, 36 (6): [6] chwarzer G., Nagata T., Mattern D., chmelzeisen R. and chumacher (2003). Comparison of Fuzzy Inference, Logistic Regression, and Classification Trees (CART). Methods Inf Med; 42: [7] teve,. (1996). tatistical Analysis of Epidemiologic Data, 2 nd ed. Oxford University Press, Oxford. [8] Yen J. and Langari R. (1999). Fuzzy Logic: Intelligence, Control an information. Upper addle River (NJ), Prentic-hall. 8

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Artificial Intelligence For Homeopathic Remedy Selection

Artificial Intelligence For Homeopathic Remedy Selection Artificial Intelligence For Homeopathic Remedy Selection A. R. Pawar, amrut.pawar@yahoo.co.in, S. N. Kini, snkini@gmail.com, M. R. More mangeshmore88@gmail.com Department of Computer Science and Engineering,

More information

Regression Including the Interaction Between Quantitative Variables

Regression Including the Interaction Between Quantitative Variables Regression Including the Interaction Between Quantitative Variables The purpose of the study was to examine the inter-relationships among social skills, the complexity of the social situation, and performance

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework

Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework Thomas E. Rothenfluh 1, Karl Bögl 2, and Klaus-Peter Adlassnig 2 1 Department of Psychology University of Zurich, Zürichbergstraße

More information

Artificially Intelligent Primary Medical Aid for Patients Residing in Remote areas using Fuzzy Logic

Artificially Intelligent Primary Medical Aid for Patients Residing in Remote areas using Fuzzy Logic Artificially Intelligent Primary Medical Aid for Patients Residing in Remote areas using Fuzzy Logic Ravinkal Kaur 1, Virat Rehani 2 1M.tech Student, Dept. of CSE, CT Institute of Technology & Research,

More information

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:

More information

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012 STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special

More information

Statistical questions for statistical methods

Statistical questions for statistical methods Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the

More information

Fuzzy Expert System Design for Medical Diagnosis

Fuzzy Expert System Design for Medical Diagnosis Second International Conference Modelling and Development of Intelligent Systems Sibiu - Romania, September 29 - October 02, 2011 Man Diana Ofelia Abstract In recent years, the methods of artificial intelligence

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe

More information

isc ove ring i Statistics sing SPSS

isc ove ring i Statistics sing SPSS isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements

More information

Overview of Non-Parametric Statistics

Overview of Non-Parametric Statistics Overview of Non-Parametric Statistics LISA Short Course Series Mark Seiss, Dept. of Statistics April 7, 2009 Presentation Outline 1. Homework 2. Review of Parametric Statistics 3. Overview Non-Parametric

More information

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis

More information

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome Stephen Burgess July 10, 2013 Abstract Background: Sample size calculations are an

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

Week 17 and 21 Comparing two assays and Measurement of Uncertainty Explain tools used to compare the performance of two assays, including

Week 17 and 21 Comparing two assays and Measurement of Uncertainty Explain tools used to compare the performance of two assays, including Week 17 and 21 Comparing two assays and Measurement of Uncertainty 2.4.1.4. Explain tools used to compare the performance of two assays, including 2.4.1.4.1. Linear regression 2.4.1.4.2. Bland-Altman plots

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

Human Immunodeficiency Virus (HIV) Diagnosis Using Neuro-Fuzzy Expert System

Human Immunodeficiency Virus (HIV) Diagnosis Using Neuro-Fuzzy Expert System ORIENTAL JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY An International Open Free Access, Peer Reviewed Research Journal Published By: Oriental Scientific Publishing Co., India. www.computerscijournal.org ISSN:

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

HOW STATISTICS IMPACT PHARMACY PRACTICE?

HOW STATISTICS IMPACT PHARMACY PRACTICE? HOW STATISTICS IMPACT PHARMACY PRACTICE? CPPD at NCCR 13 th June, 2013 Mohamed Izham M.I., PhD Professor in Social & Administrative Pharmacy Learning objective.. At the end of the presentation pharmacists

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Modeling Health Related Quality of Life among Cancer Patients Using an Integrated Inference System and Linear Regression

Modeling Health Related Quality of Life among Cancer Patients Using an Integrated Inference System and Linear Regression International Journal of Pharma Medicine and Biological Sciences Vol. 4, No. 1, January 2015 Modeling Health Related Quality of Life among Cancer Patients Using an Integrated Inference System and Linear

More information

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering Meta-Analysis Zifei Liu What is a meta-analysis; why perform a metaanalysis? How a meta-analysis work some basic concepts and principles Steps of Meta-analysis Cautions on meta-analysis 2 What is Meta-analysis

More information

Centering Predictors

Centering Predictors Centering Predictors Longitudinal Data Analysis Workshop Section 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 3: Centering Covered this Section

More information

BMI 541/699 Lecture 16

BMI 541/699 Lecture 16 BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables -

More information

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID # STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus

More information

From Bivariate Through Multivariate Techniques

From Bivariate Through Multivariate Techniques A p p l i e d S T A T I S T I C S From Bivariate Through Multivariate Techniques R e b e c c a M. W a r n e r University of New Hampshire DAI HOC THAI NGUYEN TRUNG TAM HOC LIEU *)SAGE Publications '55'

More information

Correlation and regression

Correlation and regression PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength

More information

On the purpose of testing:

On the purpose of testing: Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013 Evidence-Based Medicine Journal Club A Primer in Statistics, Study Design, and Epidemiology August, 2013 Rationale for EBM Conscientious, explicit, and judicious use Beyond clinical experience and physiologic

More information

Developing a fuzzy Likert scale for measuring xenophobia in Greece

Developing a fuzzy Likert scale for measuring xenophobia in Greece Developing a fuzzy Likert scale for measuring xenophobia in Greece Maria Symeonaki 1, and Aggeliki Kazani 2 1 Panteion University of Political and Social Sciences Department of Social Policy 136 Syggrou

More information

Non Linear Control of Glycaemia in Type 1 Diabetic Patients

Non Linear Control of Glycaemia in Type 1 Diabetic Patients Non Linear Control of Glycaemia in Type 1 Diabetic Patients Mosè Galluzzo*, Bartolomeo Cosenza Dipartimento di Ingegneria Chimica dei Processi e dei Materiali, Università degli Studi di Palermo Viale delle

More information

Confounding, Effect modification, and Stratification

Confounding, Effect modification, and Stratification Confounding, Effect modification, and Stratification Tunisia, 30th October 2014 Acknowledgment: Kostas Danis Takis Panagiotopoulos National Schoool of Public Health, Athens, Greece takis.panagiotopoulos@gmail.com

More information

Diagnostic screening. Department of Statistics, University of South Carolina. Stat 506: Introduction to Experimental Design

Diagnostic screening. Department of Statistics, University of South Carolina. Stat 506: Introduction to Experimental Design Diagnostic screening Department of Statistics, University of South Carolina Stat 506: Introduction to Experimental Design 1 / 27 Ties together several things we ve discussed already... The consideration

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes Content Quantifying association between continuous variables. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General

More information

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal

More information

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:

More information

Choosing a Significance Test. Student Resource Sheet

Choosing a Significance Test. Student Resource Sheet Choosing a Significance Test Student Resource Sheet Choosing Your Test Choosing an appropriate type of significance test is a very important consideration in analyzing data. If an inappropriate test is

More information

Diagnosis Of the Diabetes Mellitus disease with Fuzzy Inference System Mamdani

Diagnosis Of the Diabetes Mellitus disease with Fuzzy Inference System Mamdani Diagnosis Of the Diabetes Mellitus disease with Fuzzy Inference System Mamdani Za imatun Niswati, Aulia Paramita and Fanisya Alva Mustika Technical Information, Indraprasta PGRI University E-mail : zaimatunnis@gmail.com,

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

Prediction of Malignant and Benign Tumor using Machine Learning

Prediction of Malignant and Benign Tumor using Machine Learning Prediction of Malignant and Benign Tumor using Machine Learning Ashish Shah Department of Computer Science and Engineering Manipal Institute of Technology, Manipal University, Manipal, Karnataka, India

More information

Controlling Bias & Confounding

Controlling Bias & Confounding Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of

More information

MEASURES OF ASSOCIATION AND REGRESSION

MEASURES OF ASSOCIATION AND REGRESSION DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MEASURES OF ASSOCIATION AND REGRESSION I. AGENDA: A. Measures of association B. Two variable regression C. Reading: 1. Start Agresti

More information

Uncertain Rule-Based Fuzzy Logic Systems:

Uncertain Rule-Based Fuzzy Logic Systems: Uncertain Rule-Based Fuzzy Logic Systems: Introduction and New Directions Jerry M. Mendel University of Southern California Los Angeles, CA PH PTR Prentice Hall PTR Upper Saddle River, NJ 07458 www.phptr.com

More information

Basic Statistics and Data Analysis in Work psychology: Statistical Examples

Basic Statistics and Data Analysis in Work psychology: Statistical Examples Basic Statistics and Data Analysis in Work psychology: Statistical Examples WORK PSYCHOLOGY INTRODUCTION In this chapter we examine a topic which is given too little coverage in most texts of this kind,

More information

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many

More information

ExperimentalPhysiology

ExperimentalPhysiology Exp Physiol 97.5 (2012) pp 557 561 557 Editorial ExperimentalPhysiology Categorized or continuous? Strength of an association and linear regression Gordon B. Drummond 1 and Sarah L. Vowler 2 1 Department

More information

Six Sigma Glossary Lean 6 Society

Six Sigma Glossary Lean 6 Society Six Sigma Glossary Lean 6 Society ABSCISSA ACCEPTANCE REGION ALPHA RISK ALTERNATIVE HYPOTHESIS ASSIGNABLE CAUSE ASSIGNABLE VARIATIONS The horizontal axis of a graph The region of values for which the null

More information

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to CHAPTER - 6 STATISTICAL ANALYSIS 6.1 Introduction This chapter discusses inferential statistics, which use sample data to make decisions or inferences about population. Populations are group of interest

More information

Statistical Methods and Reasoning for the Clinical Sciences

Statistical Methods and Reasoning for the Clinical Sciences Statistical Methods and Reasoning for the Clinical Sciences Evidence-Based Practice Eiki B. Satake, PhD Contents Preface Introduction to Evidence-Based Statistics: Philosophical Foundation and Preliminaries

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

POL 242Y Final Test (Take Home) Name

POL 242Y Final Test (Take Home) Name POL 242Y Final Test (Take Home) Name_ Due August 6, 2008 The take-home final test should be returned in the classroom (FE 36) by the end of the class on August 6. Students who fail to submit the final

More information

A prediction model for type 2 diabetes using adaptive neuro-fuzzy interface system.

A prediction model for type 2 diabetes using adaptive neuro-fuzzy interface system. Biomedical Research 208; Special Issue: S69-S74 ISSN 0970-938X www.biomedres.info A prediction model for type 2 diabetes using adaptive neuro-fuzzy interface system. S Alby *, BL Shivakumar 2 Research

More information

appstats26.notebook April 17, 2015

appstats26.notebook April 17, 2015 Chapter 26 Comparing Counts Objective: Students will interpret chi square as a test of goodness of fit, homogeneity, and independence. Goodness of Fit A test of whether the distribution of counts in one

More information

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

Comparison of Mamdani and Sugeno Fuzzy Interference Systems for the Breast Cancer Risk

Comparison of Mamdani and Sugeno Fuzzy Interference Systems for the Breast Cancer Risk Comparison of Mamdani and Sugeno Fuzzy Interference Systems for the Breast Cancer Risk Alshalaa A. Shleeg, Issmail M. Ellabib Abstract Breast cancer is a major health burden worldwide being a major cause

More information

CHAPTER 4 ANFIS BASED TOTAL DEMAND DISTORTION FACTOR

CHAPTER 4 ANFIS BASED TOTAL DEMAND DISTORTION FACTOR 47 CHAPTER 4 ANFIS BASED TOTAL DEMAND DISTORTION FACTOR In distribution systems, the current harmonic distortion should be limited to an acceptable limit to avoid heating, losses and malfunctioning of

More information

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized) Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions

More information

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug? MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................

More information

Adaptive Type-2 Fuzzy Logic Control of Non-Linear Processes

Adaptive Type-2 Fuzzy Logic Control of Non-Linear Processes Adaptive Type-2 Fuzzy Logic Control of Non-Linear Processes Bartolomeo Cosenza, Mosè Galluzzo* Dipartimento di Ingegneria Chimica dei Processi e dei Materiali, Università degli Studi di Palermo Viale delle

More information

Fuzzy Logic Based Expert System for Detecting Colorectal Cancer

Fuzzy Logic Based Expert System for Detecting Colorectal Cancer Fuzzy Logic Based Expert System for Detecting Colorectal Cancer Tanjia Chowdhury Lecturer, Dept. of Computer Science and Engineering, Southern University Bangladesh, Chittagong, Bangladesh ---------------------------------------------------------------------***----------------------------------------------------------------------

More information

Fever Diagnosis Rule-Based Expert Systems

Fever Diagnosis Rule-Based Expert Systems Fever Diagnosis Rule-Based Expert Systems S. Govinda Rao M. Eswara Rao D. Siva Prasad Dept. of CSE Dept. of CSE Dept. of CSE TP inst. Of Science & Tech., TP inst. Of Science & Tech., Rajah RSRKRR College

More information

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK SUMMER 011 RE-EXAM PSYF11STAT - STATISTIK Full Name: Årskortnummer: Date: This exam is made up of three parts: Part 1 includes 30 multiple choice questions; Part includes 10 matching questions; and Part

More information

Study Guide #2: MULTIPLE REGRESSION in education

Study Guide #2: MULTIPLE REGRESSION in education Study Guide #2: MULTIPLE REGRESSION in education What is Multiple Regression? When using Multiple Regression in education, researchers use the term independent variables to identify those variables that

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Research Methods and Ethics in Psychology Week 4 Analysis of Variance (ANOVA) One Way Independent Groups ANOVA Brief revision of some important concepts To introduce the concept of familywise error rate.

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

Results. NeuRA Family relationships May 2017

Results. NeuRA Family relationships May 2017 Introduction Familial expressed emotion involving hostility, emotional over-involvement, and critical comments has been associated with increased psychotic relapse in people with schizophrenia, so these

More information

SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Introduzione. Ruggero Donida Labati

SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Introduzione. Ruggero Donida Labati SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA Sistemi di Elaborazione dell Informazione Introduzione Ruggero Donida Labati Dipartimento di Informatica via Bramante 65, 26013 Crema (CR), Italy http://homes.di.unimi.it/donida

More information

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017 Essential Statistics for Nursing Research Kristen Carlin, MPH Seattle Nursing Research Workshop January 30, 2017 Table of Contents Plots Descriptive statistics Sample size/power Correlations Hypothesis

More information

Multi Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 *

Multi Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 * Multi Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 * Department of CSE, Kurukshetra University, India 1 upasana_jdkps@yahoo.com Abstract : The aim of this

More information

Multiple Bivariate Gaussian Plotting and Checking

Multiple Bivariate Gaussian Plotting and Checking Multiple Bivariate Gaussian Plotting and Checking Jared L. Deutsch and Clayton V. Deutsch The geostatistical modeling of continuous variables relies heavily on the multivariate Gaussian distribution. It

More information

SPECIAL ISSUE FOR INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE & TECHNOLOGY: OPPORTUNITIES & CHALLENGES"

SPECIAL ISSUE FOR INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE & TECHNOLOGY: OPPORTUNITIES & CHALLENGES INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK SPECIAL ISSUE FOR INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE & TECHNOLOGY:

More information

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve

More information

CHAPTER 3 RESEARCH METHODOLOGY

CHAPTER 3 RESEARCH METHODOLOGY CHAPTER 3 RESEARCH METHODOLOGY 3.1 Introduction 3.1 Methodology 3.1.1 Research Design 3.1. Research Framework Design 3.1.3 Research Instrument 3.1.4 Validity of Questionnaire 3.1.5 Statistical Measurement

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests

A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests David Shin Pearson Educational Measurement May 007 rr0701 Using assessment and research to promote learning Pearson Educational

More information

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits Master Syllabus I. COURSE DOMAIN AND BOUNDARIES This is the second course in the research methods sequence for WSU doctoral students.

More information