Parental Smoking Behavior, Ethnicity, Gender, and the Cigarette Smoking Behavior of High School Students

Size: px
Start display at page:

Download "Parental Smoking Behavior, Ethnicity, Gender, and the Cigarette Smoking Behavior of High School Students"

Transcription

1 Parental Smoking Behavior, Ethnicity, Gender, and the Cigarette Smoking Behavior of High School Students Paul R. Yarnold, Ph.D. Optimal Data Analysis, LLC Novometric analysis is used to predict cigarette smoking (class variable) of 3,577 Anglo-American, 1,001 Mexican-American, and 797 Indian- American (multicategorical attribute) high-school students. Additional categorical attributes used were gender, and dummy-variable indicators of whether the student s mother, and whether the student s father, did or did not smoke. The globally optimal model in this application used only gender as an attribute: moderate ESS=27.1; D=5.4; p< The next more-complex model in the descendant family (moderate ESS=29.7; D= 11.8; p s<0.001) identified five student strata: female Indian-American students had the highest smoking rate (45.6% of 390 students), and male students for whom neither parent smoked had the lowest smoking rate (14.9% of 564 students). Data 1 (Table 1) were originally analyzed using disintegrated chi-square analysis 2 (more modern legacy analyses recommended for this design include the log-linear model 3-10 or logistic regression analysis 3,4,9-17 ), and the findings were reported using undocumented chi-square analysis. 1,18 It was concluded: smoking behavior of children of both sexes for all three ethnic groups reflects that of their parents. For Anglo-American boys the difference between the proportions of smokers when one parent smokes and when neither parent smokes is a significant one (p<0.001). For Anglo-American girls there is a significant difference (p<0.01) in percentages when one parent smokes and when both parents smoke. Mexican-American responses follow the same general pattern: the proportion of smokers is directly related to whether neither, one, or both parents smoke. The same general trend is also found for the Indian-Americans, although no significant differences were noted (pp ). Novometric Statistical Analysis Theoretical and analytic distinctions between novometric vs. alternative methods are detailed elsewhere 10 so the present focus (to the extent it is possible) is on the empirical findings. Table 2 is a summary of the descendant family of optimal models for this application: model four has the lowest D statistic and thus is the globally-optimal (GO) model presently. 136

2 Table 1: Study Data Smoker Ethnicity Mother Father Student Gender N Anglo Yes Yes Yes Male 192 Anglo Yes Yes No Male 503 Anglo Yes No Yes Male 31 Anglo Yes No No Male 85 Anglo No Yes Yes Male 136 Anglo No Yes No Male 413 Anglo No No Yes Male 60 Anglo No No No Male 387 Anglo Yes Yes Yes Female 97 Anglo Yes Yes No Female 520 Anglo Yes No Yes Female 11 Anglo Yes No No Female 91 Anglo No Yes Yes Female 63 Anglo No Yes No Female 563 Anglo No No Yes Female 30 Anglo No No No Female 395 Mexican Yes Yes Yes Male 49 Mexican Yes Yes No Male 106 Mexican Yes No Yes Male 10 Mexican Yes No No Male 22 Mexican No Yes Yes Male 40 Mexican No Yes No Male 131 Mexican No No Yes Male 24 Mexican No No No Male 93 Mexican Yes Yes Yes Female 16 Mexican Yes Yes No Female 153 Mexican Yes No Yes Female 3 Mexican Yes No No Female 28 Mexican No Yes Yes Female 13 Mexican No Yes No Female 197 Mexican No No Yes Female 8 Mexican No No No Female 108 Indian Yes Yes Yes Male 34 Indian Yes Yes No Male 34 Indian Yes No Yes Male 3 Indian Yes No No Male 9 Indian No Yes Yes Male 84 Indian No Yes No Male 97 Indian No No Yes Male 57 Indian No No No Male 72 Indian Yes Yes Yes Female 12 Indian Yes Yes No Female 64 Indian Yes No Yes Female 3 Indian Yes No No Female 10 Indian No Yes Yes Female 19 Indian No Yes No Female 177 Indian No No Yes Female 9 Indian No No No Female 113 Table 2: Descendant Family of Optimal Models Predicting Student Smoking Step ESS Strata Efficiency D Minimum Endpoint N ,672 All of the optimal models had identical performance in training (total sample) and jackknife validity analysis. 9,10 Compared to the GO model the first three models identified in novometric analysis are complex, untenable representations of the antecedents of student smoking. For model 3, for example, increasing the model complexity (number of strata or model endpoints) by 250% (from two to five) yields only an 8.8% [( )/29.7] gain in normed accuracy. The D statistic norms model predictive accuracy for parsimony, and here D indicates that the five-strata model was 2.2-times more distant from a theoretically ideal model having perfect accuracy and maximum possible parsimony for the application. 10,19 GO (Two-Strata) Model The two-strata GO model was: if gender is female, predict non-smoker; otherwise predict smoker (10.5% of females vs. 27.0% of males smoked). Table 3 is the confusion matrix for the model (ESS=27.1, p<0.001), which accurately predicted approximately 1 in 2 of the actual non-smokers (50% accuracy is expected by chance for each class category in two-category applications 9,10 ) compared to 7 in 10 of the actual smokers. Table 3: Confusion Matrix for GO Model Predicted Smoking No Yes Actual No 2,419 1, % Smoking Yes % 137

3 Complex (Five-Strata) Model The five-strata identified are presented below in order from greatest to lowest likelihood of a student being an actual smoker. Greatest likelihood of being a smoker (178/390=45.6%) occurred for female students (gender was the CTA model root attribute) who were of Indian-American descent. Here p<0.001 for all model nodes. 9,10,20-22 The second-greatest likelihood of being a smoker (282/998=28.3%) occurred for males with mothers who smoked. Third-greatest likelihood of a student being a smoker (176/720=24.4%) occurred for males with non-smoking mothers, but with fathers who smoked. The fourth-greatest likelihood of a student being a smoker (542/2282=23.75%) occurred for Anglo-American and Mexican- American females. Finally, lowest likelihood of a student being a smoker (84/564=14.9%) occurred for males for whom neither parent smoked. Table 4 is the confusion matrix for the model (ESS=29.7, all p<0.001), which accurately predicted 2 in 3 of the actual nonsmokers, and 5 in 8 of the actual smokers. Table 4: Confusion Matrix for the Complex Five-Strata Optimal Model Predicted Smoking No Yes Actual No 2,899 1, % Smoking Yes % The set of five student strata identified by the third optimal model in the descendant family yields approximately the same overall level of predictive accuracy normed against chance as does the two-strata GO model. The specific Indian-American female student subtype identified by the more complex model offers more opportunity for intervention than the all males subtype identified by the GO model: 45.6% vs. 27.0% were smokers, respectively. In applications involving risk of adverse events such as adverse drug reactions 23, bioterror 24, criminal recidivism 25 or research confounding 26 for example, complex models while statistically dubious, nevertheless may identify theoretically important and/or pragmatically useful, statistically reliable sample strata: for example, the strata having greatest vs. least need of immediate ( triaged ) intervention. References 1 Zagona SV (1967). Psycho-social correlates of smoking behavior and attitudes for a sample of Anglo-American, Mexican-American, and Indian-American high school students. In: Zagona SV (Ed.), Studies and issues in smoking behavior. Tucson, AZ: University of Arizona Press (pp ). 2 Yarnold PR (2016). CTA vs. disintegrated chisquare: Integrated vs. piecemeal analysis. Optimal Data Analysis, 5, Grimm LG, Yarnold PR (1995). Reading and Understanding Multivariate Statistics. Washington, DC: APA Books. 4 Grimm LG, Yarnold PR (2000). Reading and Understanding More Multivariate Statistics. Washington, DC: APA Books. 5 Yarnold PR (2015). UniODA-based structural decomposition vs. log-linear model: Statics and dynamics of intergenerational class mobility. Optimal Data Analysis, 4, Yarnold PR (2015). Modeling religious mobility by UniODA-based structural decomposition. Optimal Data Analysis, 4, Yarnold PR (2015). UniODA-based structural decomposition vs. legacy linear models: Statics and dynamics of intergenerational occupational mobility. Optimal Data Analysis, 4,

4 8 Yarnold PR (2015). UniODA-based structural decomposition vs. legacy linear models: Statics and dynamics of intergenerational occupational mobility. Optimal Data Analysis, 4, Yarnold PR, Soltysik RC (2005). Optimal data analysis: A guidebook with software for Windows. Washington, DC, APA Books. 10 Yarnold PR, Soltysik RC (2016). Maximizing predictive accuracy. Chicago, IL: ODA Books. DOI: /RG Yarnold PR, Soltysik RC (1991). Refining two-group multivariable classification models using univariate optimal discriminant analysis. Decision Sciences, 22, Yarnold PR, Hart LA, Soltysik RC (1994). Optimizing the classification performance of logistic regression and Fisher s discriminant analyses. Educational and Psychological Measurement, 54, Yarnold PR, Soltysik RC, McCormick WC, Burns R, Lin EHB, Bush T, Martin GJ (1995). Application of multivariable optimal discriminant analysis in general internal medicine. Journal of General Internal Medicine, 10, Yarnold PR, Soltysik RC, Lefevre F, Martin GJ (1998). Predicting in-hospital mortality of patients receiving cardiopulmonary resuscitation: Unit-weighted MultiODA for binary data. Statistics in Medicine, 17, Yarnold PR (2013). Univariate and multivariate analysis of categorical attributes with many response categories. Optimal Data Analysis, 2, Yarnold PR (2014). UniODA vs. logistic regression analysis: Serum cholesterol and coronary heart disease and mortality among middle aged diabetic men. Optimal Data Analysis, 3, Yarnold PR (2015). UniODA vs. logistic regression and Fisher s linear discriminant analysis: Modeling 10-year population change. Optimal Data Analysis, 4, Yarnold PR (2016). ODA vs. undocumented chi-square: Clarity vs. confusion. Optimal Data Analysis, 5, Yarnold PR (2015). Distance from a theoretically ideal statistical classification model defined as the number of additional equivalent effects needed to obtain perfect classification for the sample. Optimal Data Analysis, 4, Yarnold PR (1996). Discriminating geriatric and non-geriatric patients using functional status information: An example of classification tree analysis via UniODA. Educational and Psychological Measurement, 56, Yarnold PR, Soltysik RC, Bennett CL (1997). Predicting in-hospital mortality of patients with AIDS-related Pneumocystis carinii pneumonia: An example of hierarchically optimal classification tree analysis. Statistics in Medicine, 16, Soltysik RC, Yarnold PR (2010). Automated CTA software: Fundamental concepts and control commands. Optimal Data Analysis, 1, Belknap SM, Moore H, Lanzotti SA, Yarnold PR, Getz M, Deitrick DL, Peterson A, Akeson J, Maurer T, Soltysik RC, Storm J (2008). Application of software design principles and debugging methods to an analgesia prescription reduces risk of severe injury from medical use of opioids. Clinical Pharmacology and Therapeutics, 84, Kyriacou DN, Stein AC, Yarnold PR, Courtney DM, Nelson RR, Noskin GA, Handler JA, Frerichs RR (2004). Clinical predictors of bioterrorism-related inhalational anthrax. The Lancet, 364,

5 25 Stalans LJ, Yarnold PR, Seng M, Olson DE, Repp M (2004). Identifying three types of violent offenders and predicting violent recidivism while on probation: A classification tree analysis. Law & Human Behavior, 28, Linden A, Yarnold PR (2016). Using data mining techniques to characterize participation in observational studies. Journal of Evaluation in Clinical Practice. DOI: /jep Author Notes The study analyzed de-individuated data and was exempt from Institutional Review Board review. No conflict of interest was reported. Mail: Optimal Data Analysis, LLC 6348 N. Milwaukee Ave., #163 Chicago, IL

Restricted vs. Unrestricted Optimal Analysis: Smoking Behavior of College Undergraduates

Restricted vs. Unrestricted Optimal Analysis: Smoking Behavior of College Undergraduates Restricted vs. Unrestricted Optimal Analysis: Smoking Behavior of College Undergraduates Paul R. Yarnold, Ph.D. Optimal Data Analysis, LLC Maximum-accuracy statistical methods involving (un)restricted

More information

Novometric Analysis with Ordered Class Variables: The Optimal Alternative to Linear Regression Analysis

Novometric Analysis with Ordered Class Variables: The Optimal Alternative to Linear Regression Analysis Novometric Analysis with Ordered Class Variables: The Optimal Alternative to Linear Regression Analysis Paul R. Yarnold, Ph.D., and Ariel Linden, Dr.P.H. Optimal Data Analysis, LLC Linden Consulting Group,

More information

Comparative Accuracy of a Diagnostic Index Modeled Using (Optimized) Regression vs. Novometrics

Comparative Accuracy of a Diagnostic Index Modeled Using (Optimized) Regression vs. Novometrics Comparative Accuracy of a Diagnostic Index Modeled Using (Optimized) Regression vs. Novometrics Ariel Linden, Dr.P.H. and Paul R. Yarnold, Ph.D. Linden Consulting Group, LLC Optimal Data Analysis LLC Diagnostic

More information

Maximizing the Accuracy of Multiple Regression Models using UniODA: Regression Away From the Mean

Maximizing the Accuracy of Multiple Regression Models using UniODA: Regression Away From the Mean Maximizing the Accuracy of Multiple Regression Models using UniODA: Regression Away From the Mean Paul R. Yarnold, Ph.D., Fred B. Bryant, Ph.D., and Robert C. Soltysik, M.S. Optimal Data Analysis, LLC

More information

Optimal Data Analysis

Optimal Data Analysis Volume 3 2014 Published in real time by Optimal Data Analysis, LLC Optimal Data Analysis A Multidisciplinary Journal Specializing in Maximum-Accuracy Methods Editor Paul R. Yarnold, Ph.D. Co-Editors Fred

More information

Comparing Attributes Measured with Identical Likert-Type Scales in Single-Case Designs with UniODA

Comparing Attributes Measured with Identical Likert-Type Scales in Single-Case Designs with UniODA Comparing Attributes Measured with Identical Likert-Type Scales in Single-Case Designs with UniODA Paul R. Yarnold, Ph.D. Optimal Data Analysis, LLC Often at the advice of their physician, patients managing

More information

Combining machine learning and matching techniques to improve causal inference in program evaluation

Combining machine learning and matching techniques to improve causal inference in program evaluation bs_bs_banner Journal of Evaluation in Clinical Practice ISSN1365-2753 Combining machine learning and matching techniques to improve causal inference in program evaluation Ariel Linden DrPH 1,2 and Paul

More information

Using Classification Tree Analysis to Generate Propensity Score Weights

Using Classification Tree Analysis to Generate Propensity Score Weights Using Classification Tree Analysis to Generate Propensity Score Weights Ariel Linden, DrPH 1,2, Paul R. Yarnold, PhD 3 1 President, Linden Consulting Group, LLC, Ann Arbor, Michigan, USA 2 Research Scientist,

More information

Using machine learning to assess covariate balance in matching studies

Using machine learning to assess covariate balance in matching studies bs_bs_banner Journal of Evaluation in Clinical Practice ISSN1365-2753 Using machine learning to assess covariate balance in matching studies Ariel Linden, DrPH 1,2 and Paul R. Yarnold, PhD 3 1 President,

More information

Globally Optimal Statistical Classification Models, I: Binary Class Variable, One Ordered Attribute

Globally Optimal Statistical Classification Models, I: Binary Class Variable, One Ordered Attribute Globally Optimal Statistical Classification Models, I: Binary Class Variable, One Ordered Attribute Paul R. Yarnold, Ph.D. and Robert C. Soltysik, M.S. Optimal Data Analysis, LLC Imagine a random sample

More information

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:

More information

Ascertaining an Individual Patient s Symptom Dominance Hierarchy: Analysis of Raw Longitudinal Data Induces Simpson s Paradox

Ascertaining an Individual Patient s Symptom Dominance Hierarchy: Analysis of Raw Longitudinal Data Induces Simpson s Paradox Ascertaining an Individual Patient s Symptom Dominance Hierarchy: Analysis of Raw Longitudinal Data Induces Simpson s Paradox Paul R. Yarnold, Ph.D. Optimal Data Analysis, LLC Consider an application with

More information

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:

More information

Modeling Binary outcome

Modeling Binary outcome Statistics April 4, 2013 Debdeep Pati Modeling Binary outcome Test of hypothesis 1. Is the effect observed statistically significant or attributable to chance? 2. Three types of hypothesis: a) tests of

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is

More information

Optimal Data Analysis

Optimal Data Analysis Volume 2 2013 Published in real time by Optimal Data Analysis, LLC Optimal Data Analysis A Multidisciplinary Journal Specializing in Maximum-Accuracy Methods Editor Paul R. Yarnold, Ph.D. Co-Editors Fred

More information

Basic Biostatistics. Chapter 1. Content

Basic Biostatistics. Chapter 1. Content Chapter 1 Basic Biostatistics Jamalludin Ab Rahman MD MPH Department of Community Medicine Kulliyyah of Medicine Content 2 Basic premises variables, level of measurements, probability distribution Descriptive

More information

Part 8 Logistic Regression

Part 8 Logistic Regression 1 Quantitative Methods for Health Research A Practical Interactive Guide to Epidemiology and Statistics Practical Course in Quantitative Data Handling SPSS (Statistical Package for the Social Sciences)

More information

Statistical questions for statistical methods

Statistical questions for statistical methods Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the

More information

BRIEF REPORT OPTIMISTIC BIAS IN ADOLESCENT AND ADULT SMOKERS AND NONSMOKERS

BRIEF REPORT OPTIMISTIC BIAS IN ADOLESCENT AND ADULT SMOKERS AND NONSMOKERS Pergamon Addictive Behaviors, Vol. 25, No. 4, pp. 625 632, 2000 Copyright 2000 Elsevier Science Ltd. Printed in the USA. All rights reserved 0306-4603/00/$ see front matter PII S0306-4603(99)00072-6 BRIEF

More information

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to

More information

Substance Use Disorder in Hospitalized Severely Mentally 111 Psychiatric Patients: Prevalence, Correlates, and Subgroups

Substance Use Disorder in Hospitalized Severely Mentally 111 Psychiatric Patients: Prevalence, Correlates, and Subgroups Substance Use Disorder in Hospitalized Severely Mentally 111 Psychiatric Patients: Prevalence, Correlates, and Subgroups Abstract by Kim T. Mueser, Paul R. Yarnold, Stanley D. Rosenberg, Chester Swett,

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

BMI 541/699 Lecture 16

BMI 541/699 Lecture 16 BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables -

More information

Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, Andrew Bryant

Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, Andrew Bryant Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, 2010 Andrew Bryant Newcastle University Institute of Health and Society Background Background Young people s drinking behaviour

More information

Logistic Regression Predicting the Chances of Coronary Heart Disease. Multivariate Solutions

Logistic Regression Predicting the Chances of Coronary Heart Disease. Multivariate Solutions Logistic Regression Predicting the Chances of Coronary Heart Disease Multivariate Solutions What is Logistic Regression? Logistic regression in a nutshell: Logistic regression is used for prediction of

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Baker TB, Piper ME, Stein JH, et al. Effects of nicotine patch vs varenicline vs combination nicotine replacement therapy on smoking cessation at 26 weeks: a randomized clinical

More information

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi

More information

Classical Psychophysical Methods (cont.)

Classical Psychophysical Methods (cont.) Classical Psychophysical Methods (cont.) 1 Outline Method of Adjustment Method of Limits Method of Constant Stimuli Probit Analysis 2 Method of Constant Stimuli A set of equally spaced levels of the stimulus

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

PhD Course in Biostatistics

PhD Course in Biostatistics PhD Course in Biostatistics Univ.-Prof. DI Dr. Andrea Berghold Institute for Medical Informatics, Statistics and Documentation Medical University of Graz andrea.berghold@medunigraz.at Content Introduction

More information

Trends in Second-Hand Tobacco Smoke Exposure Levels at Home among Viet Nam School Children Aged and Associated Factors

Trends in Second-Hand Tobacco Smoke Exposure Levels at Home among Viet Nam School Children Aged and Associated Factors RESEARCH ARTICLE Trends in Second-Hand Tobacco Smoke Exposure Levels at Home among Viet Nam School Children Aged 13-15 and Associated Factors Nguyen Tuan Lam 1, Pham Thi Quynh Nga 1 *, Hoang Van Minh 2,

More information

10CS664: PATTERN RECOGNITION QUESTION BANK

10CS664: PATTERN RECOGNITION QUESTION BANK 10CS664: PATTERN RECOGNITION QUESTION BANK Assignments would be handed out in class as well as posted on the class blog for the course. Please solve the problems in the exercises of the prescribed text

More information

Predicting Offspring Conduct Disorder Using Parental Alcohol and Drug Dependence

Predicting Offspring Conduct Disorder Using Parental Alcohol and Drug Dependence Predicting Offspring Conduct Disorder Using Parental Alcohol and Drug Dependence Paul T. Korte, B.A. J. Randolph Haber, Ph.D. Abstract Introduction: Previous research has shown that the offspring of parents

More information

Joseph W Hogan Brown University & AMPATH February 16, 2010

Joseph W Hogan Brown University & AMPATH February 16, 2010 Joseph W Hogan Brown University & AMPATH February 16, 2010 Drinking and lung cancer Gender bias and graduate admissions AMPATH nutrition study Stratification and regression drinking and lung cancer graduate

More information

The Loyola Experience ( ): Optimal Data Analysis in the Department of Psychology

The Loyola Experience ( ): Optimal Data Analysis in the Department of Psychology The Loyola Experience (1993-2009): Optimal Data Analysis in the Department of Psychology Fred B. Bryant, Ph.D. Loyola University Chicago This article traces the origins and development of the use of optimal

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Modelling the impact of poverty on contraceptive choices in. Indian states

Modelling the impact of poverty on contraceptive choices in. Indian states Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2, Dublin (Session STS67) p.3649 Modelling the impact of poverty on contraceptive choices in Indian states Oliveira, Isabel Tiago ISCTE Lisbon

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

CLASSIFICATION TREE ANALYSIS:

CLASSIFICATION TREE ANALYSIS: CLASSIFICATION TREE ANALYSIS: A USEFUL STATISTICAL TOOL FOR PROGRAM EVALUATORS Meredith L. Philyaw, MS Jennifer Lyons, MSW(c) Why This Session? Stand up if you... Consider yourself to be a data analyst,

More information

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012 STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements

More information

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI / Index A Adjusted Heterogeneity without Overdispersion, 63 Agenda-driven bias, 40 Agenda-Driven Meta-Analyses, 306 307 Alternative Methods for diagnostic meta-analyses, 133 Antihypertensive effect of potassium,

More information

Is there an association between waist circumference and type 2 diabetes or impaired fasting glucose in US adolescents?

Is there an association between waist circumference and type 2 diabetes or impaired fasting glucose in US adolescents? Is there an association between waist circumference and type 2 diabetes or impaired fasting glucose in US adolescents? Meghann M. Moore, RD, CD Masters Thesis Maternal & Child Health Track School of Public

More information

Ward Headstrom Institutional Research Humboldt State University CAIR

Ward Headstrom Institutional Research Humboldt State University CAIR Using a Random Forest model to predict enrollment Ward Headstrom Institutional Research Humboldt State University CAIR 2013 1 Overview Forecasting enrollment to assist University planning The R language

More information

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe

More information

STATISTICS IN CLINICAL AND TRANSLATIONAL RESEARCH

STATISTICS IN CLINICAL AND TRANSLATIONAL RESEARCH 09/07/11 1 Overview and Descriptive Statistics a. Application of statistics in biomedical research b. Type of data c. Graphic representation of data d. Summary statistics: central tendency and dispersion

More information

Heavy Smokers', Light Smokers', and Nonsmokers' Beliefs About Cigarette Smoking

Heavy Smokers', Light Smokers', and Nonsmokers' Beliefs About Cigarette Smoking Journal of Applied Psychology 1982, Vol. 67, No. 5, 616-622 Copyright 1982 by the American Psychological Association, Inc. 002I-9010/82/6705-0616S00.75 ', ', and Nonsmokers' Beliefs About Cigarette Smoking

More information

CHAPTER VI RESEARCH METHODOLOGY

CHAPTER VI RESEARCH METHODOLOGY CHAPTER VI RESEARCH METHODOLOGY 6.1 Research Design Research is an organized, systematic, data based, critical, objective, scientific inquiry or investigation into a specific problem, undertaken with the

More information

Assessing Violence Risk in Stalking Cases: A Regression Tree Approach

Assessing Violence Risk in Stalking Cases: A Regression Tree Approach Law and Human Behavior, Vol. 29, No. 3, June 2005 ( C 2005) DOI: 10.1007/s10979-005-3318-6 Assessing Violence Risk in Stalking Cases: A Regression Tree Approach Barry Rosenfeld 1,2 and Charles Lewis 1

More information

Practical Multivariate Analysis

Practical Multivariate Analysis Texts in Statistical Science Practical Multivariate Analysis Fifth Edition Abdelmonem Afifi Susanne May Virginia A. Clark CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Understanding Statistics for Research Staff!

Understanding Statistics for Research Staff! Statistics for Dummies? Understanding Statistics for Research Staff! Those of us who DO the research, but not the statistics. Rachel Enriquez, RN PhD Epidemiologist Why do we do Clinical Research? Epidemiology

More information

Recognition of Complications After Pancreaticoduodenectomy for Cancer Determines Inpatient Mortality

Recognition of Complications After Pancreaticoduodenectomy for Cancer Determines Inpatient Mortality ORIGINAL ARTICLE Recognition of Complications After Pancreaticoduodenectomy for Cancer Determines Inpatient Mortality Evan S Glazer 1, Albert Amini 1, Tun Jie 1, Rainer WG Gruessner 1, Robert S Krouse

More information

MULTIPLE REGRESSION OF CPS DATA

MULTIPLE REGRESSION OF CPS DATA MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear

More information

Analyzing diastolic and systolic blood pressure individually or jointly?

Analyzing diastolic and systolic blood pressure individually or jointly? Analyzing diastolic and systolic blood pressure individually or jointly? Chenglin Ye a, Gary Foster a, Lisa Dolovich b, Lehana Thabane a,c a. Department of Clinical Epidemiology and Biostatistics, McMaster

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Chapter 3. Psychometric Properties

Chapter 3. Psychometric Properties Chapter 3 Psychometric Properties Reliability The reliability of an assessment tool like the DECA-C is defined as, the consistency of scores obtained by the same person when reexamined with the same test

More information

Table S1. Characteristics associated with frequency of nut consumption (full entire sample; Nn=4,416).

Table S1. Characteristics associated with frequency of nut consumption (full entire sample; Nn=4,416). Table S1. Characteristics associated with frequency of nut (full entire sample; Nn=4,416). Daily nut Nn= 212 Weekly nut Nn= 487 Monthly nut Nn= 1,276 Infrequent or never nut Nn= 2,441 Sex; n (%) men 52

More information

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits Master Syllabus I. COURSE DOMAIN AND BOUNDARIES This is the second course in the research methods sequence for WSU doctoral students.

More information

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized) Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions

More information

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Biases in clinical research Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Learning objectives Describe the threats to causal inferences in clinical studies Understand the role of

More information

Current Directions in Mediation Analysis David P. MacKinnon 1 and Amanda J. Fairchild 2

Current Directions in Mediation Analysis David P. MacKinnon 1 and Amanda J. Fairchild 2 CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE Current Directions in Mediation Analysis David P. MacKinnon 1 and Amanda J. Fairchild 2 1 Arizona State University and 2 University of South Carolina ABSTRACT

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

Controlling Bias & Confounding

Controlling Bias & Confounding Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of

More information

Template 1 for summarising studies addressing prognostic questions

Template 1 for summarising studies addressing prognostic questions Template 1 for summarising studies addressing prognostic questions Instructions to fill the table: When no element can be added under one or more heading, include the mention: O Not applicable when an

More information

ORIGINAL INVESTIGATION. C-Reactive Protein Concentration and Incident Hypertension in Young Adults

ORIGINAL INVESTIGATION. C-Reactive Protein Concentration and Incident Hypertension in Young Adults ORIGINAL INVESTIGATION C-Reactive Protein Concentration and Incident Hypertension in Young Adults The CARDIA Study Susan G. Lakoski, MD, MS; David M. Herrington, MD, MHS; David M. Siscovick, MD, MPH; Stephen

More information

LEPTIN AS A NOVEL PREDICTOR OF DEPRESSION IN PATIENTS WITH THE METABOLIC SYNDROME

LEPTIN AS A NOVEL PREDICTOR OF DEPRESSION IN PATIENTS WITH THE METABOLIC SYNDROME LEPTIN AS A NOVEL PREDICTOR OF DEPRESSION IN PATIENTS WITH THE METABOLIC SYNDROME Diana A. Chirinos, Ronald Goldberg, Elias Querales-Mago, Miriam Gutt, Judith R. McCalla, Marc Gellman and Neil Schneiderman

More information

Exploring Cohort- and Country-Variation in the Link Between Social Background and Marital Attitudes: A Study Using ESS Data

Exploring Cohort- and Country-Variation in the Link Between Social Background and Marital Attitudes: A Study Using ESS Data Exploring Cohort- and Country-Variation in the Link Between Social Background and Marital Attitudes: A Study Using ESS Data Paul Sellies Aart Liefbroer Den Haag 16 March 2018 Bij ons leer je de wereld

More information

THE EFFECTS OF SELF AND PROXY RESPONSE STATUS ON THE REPORTING OF RACE AND ETHNICITY l

THE EFFECTS OF SELF AND PROXY RESPONSE STATUS ON THE REPORTING OF RACE AND ETHNICITY l THE EFFECTS OF SELF AND PROXY RESPONSE STATUS ON THE REPORTING OF RACE AND ETHNICITY l Brian A. Harris-Kojetin, Arbitron, and Nancy A. Mathiowetz, University of Maryland Brian Harris-Kojetin, The Arbitron

More information

A Comparison of First and Second Generation Multivariate Analyses: Canonical Correlation Analysis and Structural Equation Modeling 1

A Comparison of First and Second Generation Multivariate Analyses: Canonical Correlation Analysis and Structural Equation Modeling 1 Florida Journal of Educational Research, 2004, Vol. 42, pp. 22-40 A Comparison of First and Second Generation Multivariate Analyses: Canonical Correlation Analysis and Structural Equation Modeling 1 A.

More information

STATIC RISK AND OFFENDER NEEDS GUIDE-REVISED FOR SEX OFFENDERS (STRONG-S)

STATIC RISK AND OFFENDER NEEDS GUIDE-REVISED FOR SEX OFFENDERS (STRONG-S) STATIC RISK AND OFFENDER NEEDS GUIDE-REVISED FOR SEX OFFENDERS (STRONG-S) Ming-Li Hsieh & Zachary K. Hamilton Background In 1990, the passage of the Community Protection Act (CPA) 1 required the Washington

More information

Predictors of Cigarette Smoking Behavior Among Military University Students in Taiwan. Wang, Kwua-Yun; Yang, Chia-Chen

Predictors of Cigarette Smoking Behavior Among Military University Students in Taiwan. Wang, Kwua-Yun; Yang, Chia-Chen The Henderson Repository is a free resource of the Honor Society of Nursing, Sigma Theta Tau International. It is dedicated to the dissemination of nursing research, researchrelated, and evidence-based

More information

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California Computer Age Statistical Inference Algorithms, Evidence, and Data Science BRADLEY EFRON Stanford University, California TREVOR HASTIE Stanford University, California ggf CAMBRIDGE UNIVERSITY PRESS Preface

More information

Research Brief Convergent and Discriminate Validity of the STRONG-Rof the Static Risk Offender Need Guide for Recidivism (STRONG-R)

Research Brief Convergent and Discriminate Validity of the STRONG-Rof the Static Risk Offender Need Guide for Recidivism (STRONG-R) Research Brief Convergent and Discriminate Validity of the STRONG-Rof the Static Risk Offender Need Guide for Recidivism (STRONG-R) Xiaohan Mei, M.A. Douglas Routh, M.A. Zachary Hamilton, Ph.D. Washington

More information

SGRQ Questionnaire assessing respiratory disease-specific quality of life. Questionnaire assessing general quality of life

SGRQ Questionnaire assessing respiratory disease-specific quality of life. Questionnaire assessing general quality of life SUPPLEMENTARY MATERIAL e-table 1: Outcomes studied in present analysis. Outcome Abbreviation Definition Nature of data, direction indicating adverse effect (continuous only) Clinical outcomes- subjective

More information

Fitting discrete-data regression models in social science

Fitting discrete-data regression models in social science Fitting discrete-data regression models in social science Andrew Gelman Dept. of Statistics and Dept. of Political Science Columbia University For Greg Wawro sclass, 7 Oct 2010 Today s class Example: wells

More information

VCU Scholars Compass. Virginia Commonwealth University. Hassan Zakaria Virginia Commonwealth University

VCU Scholars Compass. Virginia Commonwealth University. Hassan Zakaria Virginia Commonwealth University Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2007 Analgesic Use in U.S. Emergency Departments for Patients Reporting Moderate to Severe Pain: Diagnosis

More information

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID # STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus

More information

SAQ-Short Form Reliability and Validity Study in a Large Sample of Offenders

SAQ-Short Form Reliability and Validity Study in a Large Sample of Offenders SAQ-Short Form Reliability and Validity Study in a Large Sample of Offenders Donald D Davignon, Ph.D. 10-21-02 ABSTRACT The SAQ-Short Form (SAQ-SF) is an adult offender assessment test that accurately

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

Addendum: Multiple Regression Analysis (DRAFT 8/2/07)

Addendum: Multiple Regression Analysis (DRAFT 8/2/07) Addendum: Multiple Regression Analysis (DRAFT 8/2/07) When conducting a rapid ethnographic assessment, program staff may: Want to assess the relative degree to which a number of possible predictive variables

More information

Heterogeneity of Symptom Presentation in Sexually Abused Youth: Complex Profiles of a Complex Problem

Heterogeneity of Symptom Presentation in Sexually Abused Youth: Complex Profiles of a Complex Problem Heterogeneity of Symptom Presentation in Sexually Abused Youth: Complex Profiles of a Complex Problem Genelle K. Sawyer, Poonam Tavkar, C. Thresa Yancey, David J. Hansen, and Mary Fran Flood University

More information

CLINICAL BIOSTATISTICS

CLINICAL BIOSTATISTICS 09/06/17 1 Overview and Descriptive Statistics a. Application of statistics in biomedical research b. Type of data c. Graphic representation of data d. Summary statistics: central tendency and dispersion

More information

Assessing Risk. October 22, Tammy Meredith, Ph.D. Applied Research Services, Inc.

Assessing Risk. October 22, Tammy Meredith, Ph.D. Applied Research Services, Inc. Assessing Risk October 22, 2009 Tammy Meredith, Ph.D. Applied Research Services, Inc. Project 1: Develop automated parole supervision risk assessment instruments. Original Risk Study 6,327 Georgia parolees

More information

A SAS Macro for Adaptive Regression Modeling

A SAS Macro for Adaptive Regression Modeling A SAS Macro for Adaptive Regression Modeling George J. Knafl, PhD Professor University of North Carolina at Chapel Hill School of Nursing Supported in part by NIH Grants R01 AI57043 and R03 MH086132 Overview

More information

Different Roles, Same Goals: Preventing Sexual Abuse 2016 ATSA Conference Friday November 4 1:30 PM - 3:00 PM F-19

Different Roles, Same Goals: Preventing Sexual Abuse 2016 ATSA Conference Friday November 4 1:30 PM - 3:00 PM F-19 F-19 Cross-Cultural Validity of Static-99R and Stable-2007 Scores Symposium Chair: Kelly M. Babchishin, Ph.D. The Royal's Institute of Mental Health Research / University of Ottawa In this symposium, we

More information

Lesson: A Ten Minute Course in Epidemiology

Lesson: A Ten Minute Course in Epidemiology Lesson: A Ten Minute Course in Epidemiology This lesson investigates whether childhood circumcision reduces the risk of acquiring genital herpes in men. 1. To open the data we click on File>Example Data

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Today Retrospective analysis of binomial response across two levels of a single factor.

Today Retrospective analysis of binomial response across two levels of a single factor. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.3 Single Factor. Retrospective Analysis ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9,

More information

Non-Randomized Trials

Non-Randomized Trials Non-Randomized Trials ADA Research Toolkit ADA Research Committee 2011 American Dietetic Association. This presentation may be used for educational purposes Learning Objectives At the end of this presentation

More information

Risk Assessment Update: ARREST SCALES February 28, 2018 DRAFT

Risk Assessment Update: ARREST SCALES February 28, 2018 DRAFT SUMMARY: In December 2017 the Commission voted to replace number of prior convictions with number of prior arrests as a predictor in the risk assessment scales. Over the past months staff has prepared

More information

Supplementary Material. other ethnic backgrounds. All but six of the yoked pairs were matched on ethnicity. Results

Supplementary Material. other ethnic backgrounds. All but six of the yoked pairs were matched on ethnicity. Results Supplementary Material S1 Methodological Details Participants The sample was 80% Caucasian, 16.7% Asian or Asian American, and 3.3% from other ethnic backgrounds. All but six of the yoked pairs were matched

More information

Research Brief Concurrent Validity of the Static Risk Offender Need Guide for Recidivism (STRONG-R)

Research Brief Concurrent Validity of the Static Risk Offender Need Guide for Recidivism (STRONG-R) Research Brief Concurrent Validity of the Static Risk Offender Need Guide for Recidivism (STRONG-R) Xiaohan Mei, M.A. Zachary Hamilton, Ph.D. Washington State University 1 Concurrent Validity of STRONG-R

More information

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of neighborhood deprivation and preterm birth. Key Points:

More information

isc ove ring i Statistics sing SPSS

isc ove ring i Statistics sing SPSS isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements

More information

Exploring the Benefits of Data Mining on Juvenile Justice Data

Exploring the Benefits of Data Mining on Juvenile Justice Data Exploring the Benefits of Data Mining on Juvenile Justice Data Author Gray, Brett, Birks, Daniel, Allard, Troy, Ogilvie, James, Stewart, Anna, Lewis, Andrew Published 2008 Copyright Statement The Author(s)

More information

Hellinger Optimal Criterion and HPA- Optimum Designs for Model Discrimination and Probability-Based Optimality

Hellinger Optimal Criterion and HPA- Optimum Designs for Model Discrimination and Probability-Based Optimality International Journal of Mathematics and Statistics Invention (IJMSI) E-ISSN: 2321 4767 P-ISSN: 2321-4759 Volume 5 Issue 2 February. 2017 PP-71-77 Hellinger Optimal Criterion and HPA- Optimum Designs for

More information