The Analysis of 2 K Contingency Tables with Different Statistical Approaches
|
|
- Francis Lawson
- 6 years ago
- Views:
Transcription
1 The Analysis of 2 K Contingency Tables with Different tatistical Approaches Hassan alah M. Thebes Higher Institute for Management and Information Technology drhassn_242@yahoo.com Abstract The main objective of this paper is to analyze the 2 K contingency tables with three statistical approaches (regression analysis, multinomial logistic regression analysis and linguistic fuzzy model). We compare these methods for evaluating the association between a risk factor and a disease. These statistical methods measure the association between the numeric levels of a risk factor and a disease in different ways. They have been applied to a set of data of childhood cancer risk from prenatal x-ray exposure. Regression and multinomial logistic regression analyses show similar results for a data set of children whereas the fuzzy analysis yields a different result. Keywords Contingency table, Multinomial logistic regression, Linguistic fuzzy model, Data of childhood cancer, X-ray exposure. 1. Introduction The 2 K contingency table is an important extension of 2 2 table which is a basic tool for epidemiology investigation. In 2 K contingency table, the presence or absence of a disease is recorded at K levels of a risk factor. The 2 K contingency table can be viewed from the perspective of a K - level variable (risk factor) or from the perspective of a binary variable (disease) [4]. In this paper, we use three different statistical approaches for analyzing the 2 K contingency table; regression analysis, multinomial logistic regression analysis and linguistic fuzzy model. Data on malignancies in children under 10 years of age and information on the mother's exposure to x-ray provide an example for the discussion and analysis of a 2 K table [2] and [3]. Table 1 shows the numbers of prenatal x-rays received by mothers of children with a malignant disease, and a series of controls (healthy children of the same age, sex, and similar areas of residence)
2 Table 1 Observed numbers of cases and controls by recorded number of maternal x-ray films during pregnancy * for simplicity, the values greater than five were coded as Regression Analysis A 2 K contingency table can be viewed as a set of K pairs of values. An ) estimated probability is generated for each value of X producing K pairs x j, p ) ( j where p j is the estimated probability that Y = 0 associated with each level represented by x j. In order to analyze the K pairs of values, a straight line which summarizes the relationship between X and Y is estimated and the slope of the estimated line is used as a summary of the relationship between X and Y. For a simple linear regression, three quantities are necessary to derive the basic statistical measures: the sum of squares for X ), the sum of squares for Y ), and the sum of cross-products for X and Y ( Films Total Cases Y = Controls Y = Total Proportion ( yy ( xy ). These expressions calculated from a 2 K contingency table are [7]: = k j = 1 n. j ( x j v x) 2, where v x = n x n. j j / yy = n n / n (2) k v v v xy = ( x1 x2 ) yy where xi = n Now, the regression coefficient can be estimated as b y / x = xy / (4) and the variance of the estimated regression coefficient can be estimated as ) var( b y / x ) = yy / ( n 1) (5) On the other hand, a correlation coefficient measuring the degree of linear association between X and Y calculated in the usual way is xy r xy = (6) yy j= 1 ij x j / n i. * (1) (3) 2
3 For the data in Table 1, these quantities for the malignant disease are: = , yy = , xy = Using (4) and (5), the estimated coefficient of regression and its variance are and respectively. The correlation between the case/control status and the x-ray exposure is A 95 % confidence interval of the association coefficient is (0.0577, ). Moreover, the expected numbers of cases and controls by recorded number of maternal x-ray films during pregnancy are estimated using an estimated linear response p = x as shown in Table 2 below. i Table 2 Expected numbers of cases and controls by recorded number of maternal x-ray films during pregnancy Films Total Cases Y = Controls Y = Total Proportion * for simplicity, the values greater than five were coded as 5. The observed and expected proportions of cases shown in Table 1 and Table 2 are plotted in Figure 1 below. 0.8 P Obs. P Exp. P X-ray Figure 1: Proportion of cases childhood cancer for exposure to maternal x-ray during pregnancy 3
4 Figure 1 indicates that the distribution of number of the cases is better fitted and the estimated line is good. An additional assessment of the dose-response relationship is accomplished by partitioning the total chi-square value. The chi-square statistic that measures homogeneity (H 0 : the proportion of cases is the same regardless of the degree of maternal x-ray exposure) is χ 2 = A chi-square value of this magnitude indicates the presence of some sort of nonhomogeneous pattern of response ( ρ value =0. 001) [7]. 3. Multinomial Logistic Regression Analysis Multinomial logistic regression analysis is useful for situations in which we want to be able to classify subjects based on values of a set of predictor variables. This type of regression is similar to logistic regression, but it is more general. In regression analysis, we use the numeric levels of a risk factor (the number of x-ray exposures) as an independent variable and the corresponding proportion of cases as dependent variable, but in multinomial logistic regression there is need to consider a large number of records (frequency) to establish an association between risk factor and a disease [5]. In order to analyze a 2 K contingency table using multinomial logistic regression analysis, the data in Table 1 were processed using PWIN and the numeric results were similar as those obtained by regression analysis [1]. That is the association coefficient between risk factor and disease is with standard error of A 95 % confidence interval of the association coefficient is (0.0481, ). 4. Fuzzy analysis In bioscience there are several levels of uncertainty, vagueness and imprecision, particularly in the medical and epidemiological areas, where the best and most useful description of disease entities often comprise linguistic terms that are inevitably vague. The theory of fuzzy logic has been developed to deal with the concept of partial truth values, ranging from completely true to completely false, and has become a powerful tool for dealing with imprecision and uncertainty aiming at tractability, robustness and low-cost solutions for real-world problems. These features and the ability to deal with linguistic terms could explain the increasing number of works applying fuzzy logic in biomedicine problems. In fact, the theory of fuzzy sets has become an important mathematical approach in diagnosis system, treatment of medical images and, more recently in epidemiology and public health [5] and [6]. For more knowledge about fuzzy logic theory the book by Yen and Langari [8] is recommended. A linguistic fuzzy model consists of a set of fuzzy rules and an inference method. The most common inference method is the Minimum of Mamdani, whose output is a fuzzy set. The fuzzy linguistic model to evaluate a childhood cancer risk 4
5 from prenatal x-ray exposure has two antecedents: malignancies in children under 10 years of age and information on the mother's exposure to x-ray. The model elaborated five fuzzy sets to the variable number of x-ray films that exposure to the mothers (very low, low, medium, high and very high) and two fuzzy sets for the variable number of children with a malignant disease and a series of controls ( healthy children of the same age) (cases and controls). The consequence of the model is the association between x-ray films and the malignancies in children under 10 years of age. We considered three fuzzy sets for this linguistic variable; weak, medium and strong. The base rules consist of the following ones: 1. If x-ray is very low and case then association is weak. 2. If x-ray is low and case then association is weak. 3. If x-ray is medium and case then association is weak. 4. If x-ray is high and case then association is medium. 5. If x-ray is very high and case then association is strong The association between the childrens' malignancies and x-ray films is determined by inference of the fuzzy rule set, and defuzzifiction of the fuzzy output. The system was run in a C++ language. Fuzzy sets to input variable number of x-ray and to output variable of association between malignancies children and x-ray are displayed in Figure 2 and Figure 3 below. Membership function 1 VLOW LOW MEDIUM HIGH VHIGH X Ray Figure 2: Fuzzy sets to input variable number of X-ray 5
6 Membership function WEAK MEDIUM TRONG Figure 3: Fuzzy sets to output variable of Association between malignancies children and X-Ray We notice that by combining all possible inputs it is possible to build 10 rules but, it only 5 rules were considered because some situations that can not occur. For example, it is impossible, for the mothers who were not exposed to x-ray, the children have a disease (if they have; this occurs for another reason). Although this is mathematically possible, it was subtracted from the rule bases, reducing the number of rules. The fuzzy set related to linguistic variables is presented in Figure 2. The membership fucntion represents the degree of compatibility of some input to all categpries. In fact, the membership degree represents the possibility that the input belongs to the set. Figure 3 shows the memebership function of the output. It is clear that the association increases monotonically when the number of x-ray films increases. It was 16 % for weak, 17 % for medium and 18 % for strong associations respectively. Also the weighted mean of the association between X-ray and the disease was and the standard error was A 95 % confidence interval of the association coefficient is ( , ). 6
7 Discussion In regression analysis, we use the numeric levels of a risk factor (the number of x-ray exposures) as an independent variable and the corresponding proportion of cases as a dependent variable. Furthermore, in multinomial logistic regression there is need for a considerable number of records (frequency) to establish an association between risk factor and a disease. In a fuzzy linguistic model, there is not such need. ( b y / x The point biserial correlation coefficient ( r xy ), the regression coefficient ) are interrelated when calculated from a 2 K table. For example, each has an expected value of zero when the variables X and Y are unrelated. The two statistics measure the association between the numeric levels of a risk factor and a disease in different ways but, in terms of probability, lead to the same inference. A measure of association assesses the strength of a relationship, while a statistical test gives an idea of the likelihood that such an association occurs by chance where both regression and multinomial logistic regression give similar results, the fuzzy model gives rather different results for evaluating the association between the risk factor and the disease (ee: Table 3). Table 3 Comparison between the results of the three methods Regression Multinomial logistic regression Fuzzy model Association coefficient tandard error % CI (.0577,.0683) (.0481,.0579) (.1178, 1322) ρ value We notice from Table 3 that the three statistical methods (regression, multinomial logistic regression and fuzzy model) for evaluating the association between risk factor and a disease show similar results for a data set of children, but the results from fuzzy model are rather different. References [1] Ashour,. K. and alem,. A. (2005). tatistical Presentation and Analysis using PWIN, Part two: Advanced Applied tatistics. Cairo University: IR. 7
8 [2] Bithell, J. F., and teward, M. A. (1975). Prenatal Irradiation and childhood Malignancy: A Review of British Data from the Oxford tudy. Brit. J. of Cancer (31): [3] Breslow, N. E., and Day, N. E. (1987). tatistical Methods in Cancer Research, Volume II. Oxford University Press. Oxford, UK. [4] Hardeo ahai and Anwer Khurshid (1996). tatistics in Epidemiology, Methods, Techniques and Applications. CRC Press, New York. [5] Luiz Fernando C. Nascimento and Neli Regina Ortega (2002). Fuzzy Linguistic Model for Evaluating the Risk of Neonatal Death. Rev aude Publica, 36 (6): [6] chwarzer G., Nagata T., Mattern D., chmelzeisen R. and chumacher (2003). Comparison of Fuzzy Inference, Logistic Regression, and Classification Trees (CART). Methods Inf Med; 42: [7] teve,. (1996). tatistical Analysis of Epidemiologic Data, 2 nd ed. Oxford University Press, Oxford. [8] Yen J. and Langari R. (1999). Fuzzy Logic: Intelligence, Control an information. Upper addle River (NJ), Prentic-hall. 8
11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES
Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are
More informationArtificial Intelligence For Homeopathic Remedy Selection
Artificial Intelligence For Homeopathic Remedy Selection A. R. Pawar, amrut.pawar@yahoo.co.in, S. N. Kini, snkini@gmail.com, M. R. More mangeshmore88@gmail.com Department of Computer Science and Engineering,
More informationRegression Including the Interaction Between Quantitative Variables
Regression Including the Interaction Between Quantitative Variables The purpose of the study was to examine the inter-relationships among social skills, the complexity of the social situation, and performance
More information11/24/2017. Do not imply a cause-and-effect relationship
Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection
More informationStepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework
Stepwise Knowledge Acquisition in a Fuzzy Knowledge Representation Framework Thomas E. Rothenfluh 1, Karl Bögl 2, and Klaus-Peter Adlassnig 2 1 Department of Psychology University of Zurich, Zürichbergstraße
More informationArtificially Intelligent Primary Medical Aid for Patients Residing in Remote areas using Fuzzy Logic
Artificially Intelligent Primary Medical Aid for Patients Residing in Remote areas using Fuzzy Logic Ravinkal Kaur 1, Virat Rehani 2 1M.tech Student, Dept. of CSE, CT Institute of Technology & Research,
More informationApplications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis
DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:
More informationSTATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012
STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements
More informationIAPT: Regression. Regression analyses
Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationUnderstandable Statistics
Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement
More information12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2
PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special
More informationStatistical questions for statistical methods
Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the
More informationFuzzy Expert System Design for Medical Diagnosis
Second International Conference Modelling and Development of Intelligent Systems Sibiu - Romania, September 29 - October 02, 2011 Man Diana Ofelia Abstract In recent years, the methods of artificial intelligence
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationApplied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business
Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A
More informationStatistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.
Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe
More informationisc ove ring i Statistics sing SPSS
isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements
More informationOverview of Non-Parametric Statistics
Overview of Non-Parametric Statistics LISA Short Course Series Mark Seiss, Dept. of Statistics April 7, 2009 Presentation Outline 1. Homework 2. Review of Parametric Statistics 3. Overview Non-Parametric
More informationThe SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance
The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis
More informationSample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome
Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome Stephen Burgess July 10, 2013 Abstract Background: Sample size calculations are an
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting
More informationWeek 17 and 21 Comparing two assays and Measurement of Uncertainty Explain tools used to compare the performance of two assays, including
Week 17 and 21 Comparing two assays and Measurement of Uncertainty 2.4.1.4. Explain tools used to compare the performance of two assays, including 2.4.1.4.1. Linear regression 2.4.1.4.2. Bland-Altman plots
More informationChapter 3: Examining Relationships
Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression
More informationResults & Statistics: Description and Correlation. I. Scales of Measurement A Review
Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize
More informationHuman Immunodeficiency Virus (HIV) Diagnosis Using Neuro-Fuzzy Expert System
ORIENTAL JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY An International Open Free Access, Peer Reviewed Research Journal Published By: Oriental Scientific Publishing Co., India. www.computerscijournal.org ISSN:
More information3 CONCEPTUAL FOUNDATIONS OF STATISTICS
3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical
More informationHOW STATISTICS IMPACT PHARMACY PRACTICE?
HOW STATISTICS IMPACT PHARMACY PRACTICE? CPPD at NCCR 13 th June, 2013 Mohamed Izham M.I., PhD Professor in Social & Administrative Pharmacy Learning objective.. At the end of the presentation pharmacists
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationModeling Health Related Quality of Life among Cancer Patients Using an Integrated Inference System and Linear Regression
International Journal of Pharma Medicine and Biological Sciences Vol. 4, No. 1, January 2015 Modeling Health Related Quality of Life among Cancer Patients Using an Integrated Inference System and Linear
More informationMeta-Analysis. Zifei Liu. Biological and Agricultural Engineering
Meta-Analysis Zifei Liu What is a meta-analysis; why perform a metaanalysis? How a meta-analysis work some basic concepts and principles Steps of Meta-analysis Cautions on meta-analysis 2 What is Meta-analysis
More informationCentering Predictors
Centering Predictors Longitudinal Data Analysis Workshop Section 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 3: Centering Covered this Section
More informationBMI 541/699 Lecture 16
BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables -
More informationSTA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #
STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus
More informationFrom Bivariate Through Multivariate Techniques
A p p l i e d S T A T I S T I C S From Bivariate Through Multivariate Techniques R e b e c c a M. W a r n e r University of New Hampshire DAI HOC THAI NGUYEN TRUNG TAM HOC LIEU *)SAGE Publications '55'
More informationCorrelation and regression
PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength
More informationOn the purpose of testing:
Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationEvidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013
Evidence-Based Medicine Journal Club A Primer in Statistics, Study Design, and Epidemiology August, 2013 Rationale for EBM Conscientious, explicit, and judicious use Beyond clinical experience and physiologic
More informationDeveloping a fuzzy Likert scale for measuring xenophobia in Greece
Developing a fuzzy Likert scale for measuring xenophobia in Greece Maria Symeonaki 1, and Aggeliki Kazani 2 1 Panteion University of Political and Social Sciences Department of Social Policy 136 Syggrou
More informationNon Linear Control of Glycaemia in Type 1 Diabetic Patients
Non Linear Control of Glycaemia in Type 1 Diabetic Patients Mosè Galluzzo*, Bartolomeo Cosenza Dipartimento di Ingegneria Chimica dei Processi e dei Materiali, Università degli Studi di Palermo Viale delle
More informationConfounding, Effect modification, and Stratification
Confounding, Effect modification, and Stratification Tunisia, 30th October 2014 Acknowledgment: Kostas Danis Takis Panagiotopoulos National Schoool of Public Health, Athens, Greece takis.panagiotopoulos@gmail.com
More informationDiagnostic screening. Department of Statistics, University of South Carolina. Stat 506: Introduction to Experimental Design
Diagnostic screening Department of Statistics, University of South Carolina Stat 506: Introduction to Experimental Design 1 / 27 Ties together several things we ve discussed already... The consideration
More informationMidterm Exam ANSWERS Categorical Data Analysis, CHL5407H
Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography
More informationBayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm
Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University
More informationChapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)
Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it
More informationAnalysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach
University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen
More informationData Analysis Using Regression and Multilevel/Hierarchical Models
Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface
More informationContent. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes
Content Quantifying association between continuous variables. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General
More informationConditional Distributions and the Bivariate Normal Distribution. James H. Steiger
Conditional Distributions and the Bivariate Normal Distribution James H. Steiger Overview In this module, we have several goals: Introduce several technical terms Bivariate frequency distribution Marginal
More informationm 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers
SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:
More informationChoosing a Significance Test. Student Resource Sheet
Choosing a Significance Test Student Resource Sheet Choosing Your Test Choosing an appropriate type of significance test is a very important consideration in analyzing data. If an inappropriate test is
More informationDiagnosis Of the Diabetes Mellitus disease with Fuzzy Inference System Mamdani
Diagnosis Of the Diabetes Mellitus disease with Fuzzy Inference System Mamdani Za imatun Niswati, Aulia Paramita and Fanisya Alva Mustika Technical Information, Indraprasta PGRI University E-mail : zaimatunnis@gmail.com,
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More informationPrediction of Malignant and Benign Tumor using Machine Learning
Prediction of Malignant and Benign Tumor using Machine Learning Ashish Shah Department of Computer Science and Engineering Manipal Institute of Technology, Manipal University, Manipal, Karnataka, India
More informationControlling Bias & Confounding
Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of
More informationMEASURES OF ASSOCIATION AND REGRESSION
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MEASURES OF ASSOCIATION AND REGRESSION I. AGENDA: A. Measures of association B. Two variable regression C. Reading: 1. Start Agresti
More informationUncertain Rule-Based Fuzzy Logic Systems:
Uncertain Rule-Based Fuzzy Logic Systems: Introduction and New Directions Jerry M. Mendel University of Southern California Los Angeles, CA PH PTR Prentice Hall PTR Upper Saddle River, NJ 07458 www.phptr.com
More informationBasic Statistics and Data Analysis in Work psychology: Statistical Examples
Basic Statistics and Data Analysis in Work psychology: Statistical Examples WORK PSYCHOLOGY INTRODUCTION In this chapter we examine a topic which is given too little coverage in most texts of this kind,
More informationA review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *
A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many
More informationExperimentalPhysiology
Exp Physiol 97.5 (2012) pp 557 561 557 Editorial ExperimentalPhysiology Categorized or continuous? Strength of an association and linear regression Gordon B. Drummond 1 and Sarah L. Vowler 2 1 Department
More informationSix Sigma Glossary Lean 6 Society
Six Sigma Glossary Lean 6 Society ABSCISSA ACCEPTANCE REGION ALPHA RISK ALTERNATIVE HYPOTHESIS ASSIGNABLE CAUSE ASSIGNABLE VARIATIONS The horizontal axis of a graph The region of values for which the null
More informationCHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to
CHAPTER - 6 STATISTICAL ANALYSIS 6.1 Introduction This chapter discusses inferential statistics, which use sample data to make decisions or inferences about population. Populations are group of interest
More informationStatistical Methods and Reasoning for the Clinical Sciences
Statistical Methods and Reasoning for the Clinical Sciences Evidence-Based Practice Eiki B. Satake, PhD Contents Preface Introduction to Evidence-Based Statistics: Philosophical Foundation and Preliminaries
More information2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%
Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of
More informationPOL 242Y Final Test (Take Home) Name
POL 242Y Final Test (Take Home) Name_ Due August 6, 2008 The take-home final test should be returned in the classroom (FE 36) by the end of the class on August 6. Students who fail to submit the final
More informationA prediction model for type 2 diabetes using adaptive neuro-fuzzy interface system.
Biomedical Research 208; Special Issue: S69-S74 ISSN 0970-938X www.biomedres.info A prediction model for type 2 diabetes using adaptive neuro-fuzzy interface system. S Alby *, BL Shivakumar 2 Research
More informationappstats26.notebook April 17, 2015
Chapter 26 Comparing Counts Objective: Students will interpret chi square as a test of goodness of fit, homogeneity, and independence. Goodness of Fit A test of whether the distribution of counts in one
More informationLecture 21. RNA-seq: Advanced analysis
Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in
More informationIntroduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018
Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this
More informationLec 02: Estimation & Hypothesis Testing in Animal Ecology
Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then
More informationComparison of Mamdani and Sugeno Fuzzy Interference Systems for the Breast Cancer Risk
Comparison of Mamdani and Sugeno Fuzzy Interference Systems for the Breast Cancer Risk Alshalaa A. Shleeg, Issmail M. Ellabib Abstract Breast cancer is a major health burden worldwide being a major cause
More informationCHAPTER 4 ANFIS BASED TOTAL DEMAND DISTORTION FACTOR
47 CHAPTER 4 ANFIS BASED TOTAL DEMAND DISTORTION FACTOR In distribution systems, the current harmonic distortion should be limited to an acceptable limit to avoid heating, losses and malfunctioning of
More informationAge (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)
Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions
More informationMMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?
MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference
More informationSimple Linear Regression
Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis
More informationAn Introduction to Bayesian Statistics
An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics
More informationAnalysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data
Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................
More informationAdaptive Type-2 Fuzzy Logic Control of Non-Linear Processes
Adaptive Type-2 Fuzzy Logic Control of Non-Linear Processes Bartolomeo Cosenza, Mosè Galluzzo* Dipartimento di Ingegneria Chimica dei Processi e dei Materiali, Università degli Studi di Palermo Viale delle
More informationFuzzy Logic Based Expert System for Detecting Colorectal Cancer
Fuzzy Logic Based Expert System for Detecting Colorectal Cancer Tanjia Chowdhury Lecturer, Dept. of Computer Science and Engineering, Southern University Bangladesh, Chittagong, Bangladesh ---------------------------------------------------------------------***----------------------------------------------------------------------
More informationFever Diagnosis Rule-Based Expert Systems
Fever Diagnosis Rule-Based Expert Systems S. Govinda Rao M. Eswara Rao D. Siva Prasad Dept. of CSE Dept. of CSE Dept. of CSE TP inst. Of Science & Tech., TP inst. Of Science & Tech., Rajah RSRKRR College
More informationSUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK
SUMMER 011 RE-EXAM PSYF11STAT - STATISTIK Full Name: Årskortnummer: Date: This exam is made up of three parts: Part 1 includes 30 multiple choice questions; Part includes 10 matching questions; and Part
More informationStudy Guide #2: MULTIPLE REGRESSION in education
Study Guide #2: MULTIPLE REGRESSION in education What is Multiple Regression? When using Multiple Regression in education, researchers use the term independent variables to identify those variables that
More informationAnalysis of Variance (ANOVA)
Research Methods and Ethics in Psychology Week 4 Analysis of Variance (ANOVA) One Way Independent Groups ANOVA Brief revision of some important concepts To introduce the concept of familywise error rate.
More informationWhite Paper Estimating Complex Phenotype Prevalence Using Predictive Models
White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015
More informationResults. NeuRA Family relationships May 2017
Introduction Familial expressed emotion involving hostility, emotional over-involvement, and critical comments has been associated with increased psychotic relapse in people with schizophrenia, so these
More informationSCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Introduzione. Ruggero Donida Labati
SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA Sistemi di Elaborazione dell Informazione Introduzione Ruggero Donida Labati Dipartimento di Informatica via Bramante 65, 26013 Crema (CR), Italy http://homes.di.unimi.it/donida
More informationTable of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017
Essential Statistics for Nursing Research Kristen Carlin, MPH Seattle Nursing Research Workshop January 30, 2017 Table of Contents Plots Descriptive statistics Sample size/power Correlations Hypothesis
More informationMulti Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 *
Multi Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 * Department of CSE, Kurukshetra University, India 1 upasana_jdkps@yahoo.com Abstract : The aim of this
More informationMultiple Bivariate Gaussian Plotting and Checking
Multiple Bivariate Gaussian Plotting and Checking Jared L. Deutsch and Clayton V. Deutsch The geostatistical modeling of continuous variables relies heavily on the multivariate Gaussian distribution. It
More informationSPECIAL ISSUE FOR INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE & TECHNOLOGY: OPPORTUNITIES & CHALLENGES"
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK SPECIAL ISSUE FOR INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE & TECHNOLOGY:
More information1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp
The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve
More informationCHAPTER 3 RESEARCH METHODOLOGY
CHAPTER 3 RESEARCH METHODOLOGY 3.1 Introduction 3.1 Methodology 3.1.1 Research Design 3.1. Research Framework Design 3.1.3 Research Instrument 3.1.4 Validity of Questionnaire 3.1.5 Statistical Measurement
More informationCorrelational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
More informationA Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests
A Comparison of Methods of Estimating Subscale Scores for Mixed-Format Tests David Shin Pearson Educational Measurement May 007 rr0701 Using assessment and research to promote learning Pearson Educational
More informationSW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus
SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits Master Syllabus I. COURSE DOMAIN AND BOUNDARIES This is the second course in the research methods sequence for WSU doctoral students.
More information