Practical Multivariate Analysis

Similar documents
Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

PRACTICAL STATISTICS FOR MEDICAL RESEARCH

A Handbook of Statistical Analyses using SAS

isc ove ring i Statistics sing SPSS

The Statistical Analysis of Failure Time Data

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

DECISION ANALYSIS WITH BAYESIAN NETWORKS

Research Methods. for Business. A Skill'Building Approach SEVENTH EDITION. Uma Sekaran. and. Roger Bougie

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

HANDOUTS FOR BST 660 ARE AVAILABLE in ACROBAT PDF FORMAT AT:

Statistical Tolerance Regions: Theory, Applications and Computation

Data Analysis Using Regression and Multilevel/Hierarchical Models

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Applications of Regression Models in Epidemiology

Modern Regression Methods

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology

Bureaucracy and Teachers' Sense of Power. Cemil Yücel

An Introduction to Modern Econometrics Using Stata

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Linear Regression Analysis

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

Contents. Part 1 Introduction. Part 2 Cross-Sectional Selection Bias Adjustment

AN EXPLORATORY STUDY OF LEADER-MEMBER EXCHANGE IN CHINA, AND THE ROLE OF GUANXI IN THE LMX PROCESS

Indirect Questioning in Sample Surveys

Multicultural Health

Ordinal Data Modeling

Ecological Statistics

National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2008 Formula Grant

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Data Analysis with SPSS

Basic Biostatistics. Chapter 1. Content

Certificate Courses in Biostatistics

A Guide to Algorithm Design: Paradigms, Methods, and Complexity Analysis

PSYC3010 Advanced Statistics for Psychology

Analysis of Waiting-Time Data in Health Services Research

The National TB Prevalence Survey Pakistan

Statistical Analysis with Missing Data. Second Edition

III. C. Research Subject and Population III. D. Sample Size III. E. Variable and Operational Definition III. E. 1.

Graduate Survey - May 2014 (Human Services)

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

Help-seeking behaviour for emotional or behavioural problems. among Australian adolescents: the role of socio-demographic

Understanding. Regression Analysis

BASIC PHARMACOKINETICS


THE RELATIONSHIP BETWEEN PERSONALITY VARIABLES AND WORK PERFORMANCE OF CREDIT CONTROLLERS IN A BANK OLGA COETZEE

RESEARCH METHODS. A Process of Inquiry. tm HarperCollinsPublishers ANTHONY M. GRAZIANO MICHAEL L RAULIN

PSYC3010 Advanced Statistics for Psychology

~ SAGE PUBLICATIONS. MULTIPLE REGRESSION: Testing and Interpreting Interactions. $ The International Professional Publishers

Industrial Flow Measurement 3rd Edition. David W. Spitzer

Statistics for Biology and Health

Marine Polysaccharides

For general queries, contact

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

Use of GEEs in STATA

Social Psychology of Self-Referent Behavior

Contents. Acknowledgments

Part 8 Logistic Regression

Lunchtime Seminar. Risper Awuor, Ph.D. Department of Graduate Educational and Leadership. January 30, 2013

HIV Development Assistance and Adult Mortality in Africa: A replication study of Bendavid et al. (2012)

An Introduction to Multiple Imputation for Missing Items in Complex Surveys

Dr. SANDHEEP S. (MBBS MD DPH) Dr. BENNY PV (MBBS MD DPH) (DATA ANALYSIS USING SPSS ILLUSTRATED WITH STEP-BY-STEP SCREENSHOTS)

Chapter 14: More Powerful Statistical Methods

Measurement Error in Nonlinear Models

Biostatistics II

fifth edition Assessment in Counseling A Guide to the Use of Psychological Assessment Procedures Danica G. Hays

MEASURES OF ASSOCIATION AND REGRESSION

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

AGE-RELATED DIFFERENCES IN VERBAL, VISUAL. AND SPATIAL MEMORY: The Same or Different?

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan

Design and Analysis of a Cancer Prevention Trial: Plans and Results. Matthew Somerville 09 November 2009

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Scientific and Regulatory Basis for Development, Quality Assurance and Marketing Authorisation

Chapter 17 Sensitivity Analysis and Model Validation

Biostatistics for Med Students. Lecture 1

Regression Analysis II

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

Bioavailability and Analysis of Vitamins in Foods

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

The Food Consumption Analysis

Unbalanced Analysis of Variance, Design, and Regression: Applied Statistical Methods

Social. Research SECOND EDITION

How to describe bivariate data

From Bivariate Through Multivariate Techniques

Palgrave Advances in Behavioral Economics. Series Editor John F. Tomer Co-Editor, Jl of Socio-Economics Manhattan College Riverdale, USA

Package frailtyhl. February 19, 2015

Daniel Boduszek University of Huddersfield

A MONTE CARLO STUDY OF MODEL SELECTION PROCEDURES FOR THE ANALYSIS OF CATEGORICAL DATA

Statistical Methods in Food and Consumer Research. Second Edition

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus

! Club!Council!Meeting!Minutes!

Course Outline Introduction to ICD-10 Coding Course

ANALYSIS OF SURVEYS WITH EPI INFO AND STATA

Lisa Yelland. BMa&CompSc (Hons)

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Experimental Political Science and the Study of Causality

Transcription:

Texts in Statistical Science Practical Multivariate Analysis Fifth Edition Abdelmonem Afifi Susanne May Virginia A. Clark CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Group an informa business A CHAPMAN & HALL BOOK

Contents Preface xv Authors' Biographies xix I Preparation for Analysis 1 1 What is multivariate analysis? 3 1.1 Defining multivariate analysis 3 1.2 Examples of multivariate analyses 3 1.3 Multivariate analyses discussed in this book 7 1.4 Organization and content of the book 12 2 Characterizing data for analysis 15 2.1 Variables: their definition, classification, and use 15 2.2 Defining statistical variables 15 2.3 Stevens's classification of variables 16 2.4 How variables are used in data analysis 19 2.5 Examples of classifying variables 20 2.6 Other characteristics of data 21 2.7 Summary 21 2.8 Problems 22 3 Preparing for data analysis 23 3.1 Processing data so they can be analyzed 23 3.2 Choice of a statistical package 24 3.3 Techniques for data entry 26 3.4 Organizing the data 33 3.5 Example: depression study 40 3.6 Summary 43 3.7 Problems 45 vii

viii CONTENTS 4 Data screening and transformations 47 4.1 Transformations, assessing normality and independence 47 4.2 Common transformations 47 4.3 Selecting appropriate transformations 51 4.4 Assessing independence 61 4.5 Summary 62 4.6 Problems 64 5 Selecting appropriate analyses 67 5.1 Which analyses to perform? 67 5.2 Why selection is often difficult 67 5.3 Appropriate statistical measures 68 5.4 Selecting appropriate multivariate analyses 72 5.5 Summary 74 5.6 Problems 74 II Applied Regression Analysis 79 6 Simple regression and correlation 81 6.1 Chapter outline 81 6.2 When are regression and correlation used? 81 6.3 Data example 82 6.4 Regression methods: fixed-x case 84 6.5 Regression and correlation: variable-x case 89 6.6 Interpretation: 6.7 Interpretation: fixed-x case 90 variable-x case 91 6.8 Other available computer output 95 6.9 Robustness and transformations for regression 103 6.10 Other types of regression 106 6.11 Special applications of regression 108 6.12 Discussion of computer programs 112 6.13 What to watch out for 113 6.14 Summary 115 6.15 Problems 115 7 Multiple regression and correlation 119 7.1 Chapter outline 119 7.2 When are regression and correlation used? 120 7.3 Data example 120 7.4 Regression methods: fixed-.y case 123 7.5 Regression and correlation: variable-x case 125

CONTENTS ix 7.6 Interpretation: fixed-x case 131 7.7 Interpretation: variable-x case 134 7.8 Regression diagnostics and transformations 137 7.9 Other options in computer programs 143 7.10 Discussion of computer programs 148 7.11 What to watch out for 151 7.12 Summary 154 7.13 Problems 154 8 Variable selection in regression 159 8.1 Chapter outline 159 8.2 When are variable selection methods used? 159 8.3 Data example 160 8.4 Criteria for variable selection 163 8.5 A general F test 166 8.6 Stepwise regression 168 8.7 Subset regression 173 8.8 Discussion of computer programs 177 8.9 Discussion of strategies 180 8.10 What to watch out for 183 8.11 Summary 186 8.12 Problems 186 9 Special regression topics 189 9.1 Chapter outline 189 9.2 Missing values in regression analysis 189 9.3 Dummy variables 197 9.4 Constraints on parameters 206 9.5 Regression analysis with multicollinearity 209 9.6 Ridge regression 211 9.7 Summary 215 9.8 Problems 215 III Multivariate Analysis 221 10 Canonical correlation analysis 223 10.1 Chapter outline 223 10.2 When is canonical correlation analysis used? 223 10.3 Data example 224 10.4 Basic concepts of canonical correlation 225 10.5 Other topics in canonical correlation 230

X CONTENTS 10.6 Discussion of computer programs 233 10.7 What to watch out for 235 10.8 Summary 236 10.9 Problems 236 11 Discriminant analysis 239 11.1 Chapter outline 239 11.2 When is discriminant analysis used? 240 11.3 Data example 241 11.4 Basic concepts of classification 242 11.5 Theoretical background 249 11.6 Interpretation 251 11.7 Adjusting the dividing point 255 11.8 How good is the discrimination? 258 11.9 Testing variable contributions 260 11.10 Variable selection 262 11.11 Discussion of computer programs 262 11.12 What to watch out for 263 11.13 Summary 266 11.14 Problems 266 12 Logistic regression 269 12.1 Chapter outline 269 12.2 When is logistic regression used? 269 12.3 Data example 270 12.4 Basic concepts of logistic regression 272 12.5 Interpretation: categorical variables 273 12.6 Interpretation: continuous variables 275 12.7 Interpretation: interactions 277 12.8 Refining and evaluating logistic regression 285 12.9 Nominal and ordinal logistic regression 298 12.10 Applications of logistic regression 305 12.11 Poisson regression 309 12.12 Discussion of computer programs 313 12.13 What to watch out for 314 12.14 Summary 317 12.15 Problems 317 13 Regression analysis with survival data 323 13.1 Chapter outline 323 13.2 When is survival analysis used? 324 13.3 Data examples 324

CONTENTS xi 13.4 Survival functions 325 13.5 Common survival distributions 332 13.6 Comparing survival among groups 334 13.7 The log-linear regression model 335 13.8 The Cox regression model 338 13.9 Comparing regression models 349 13.10 Discussion of computer programs 352 13.11 What to watch out for 354 13.12 Summary 355 13.13 Problems 355 14 Principal components analysis 357 14.1 Chapter outline 357 14.2 When is principal components analysis used? 357 14.3 Data example 358 14.4 Basic concepts 359 14.5 Interpretation 363 14.6 Other uses 371 14.7 Discussion of computer programs 373 14.8 What to watch out for 374 14.9 Summary 376 14.10 Problems 376 15 Factor analysis 379 15.1 Chapter outline 379 15.2 When is factor analysis used? 379 15.3 Data example 380 15.4 Basic concepts 381 15.5 Initial extraction: principal components 383 15.6 Initial extraction: iterated components 386 15.7 Factor rotations 390 15.8 Assigning factor scores 395 15.9 Application of factor analysis 396 15.10 Discussion of computer programs 397 15.11 What to watch out for 400 15.12 Summary 401 15.13 Problems 402 16 Cluster analysis 405 16.1 Chapter outline 405 16.2 When is cluster analysis used? 405 16.3 Data example 407

16.4 Basic concepts: initial analysis 16.5 Analytical clustering techniques 16.6 Cluster analysis for financial data set 16.7 Discussion of computer programs 16.8 What to watch out for 16.9 Summary 16.10 Problems 17 Log-linear analysis 17.1 Chapter outline 17.2 When is log-linear analysis used? 17.3 Data example 17.4 Notation and sample considerations 17.5 Tests and models for two-way tables 17.6 Example of a two-way table 17.7 Models for multiway tables 17.8 Exploratory model building 17.9 Assessing specific models 17.10 Sample size issues 17.11 The logit model 17.12 Discussion of computer programs 17.13 What to watch out for 17.14 Summary 17.15 Problems 18 Correlated outcomes regression 18..1 Chapter outline 18.,2 When is correlated outcomes regression used? 18,.3 Data examples 18..4 Basic concepts 18.5 Regression of clustered data 18.6 Regression of longitudinal data 18..7 Other analyses of correlated outcomes 18.8 Discussion of computer programs 18.9 What to watch out for 18.10 Summary 18.11 Problems Appendix A A. 1 Data sets and how to obtain them A.2 Chemical companies financial data A.3 Depression study data

CONTENTS xiii A.4 Financial performance cluster analysis data 492 A.5 Lung cancer survival data 492 A.6 Lung function data 492 A.7 Parental HIV data 493 A. 8 Northridge earthquake data 493 A.9 School data 494 A. 10 Mice data 494 References 495 Index 509