Ecological Statistics

A Primer of Ecological Statistics
Second Edition

Nicholas J. Gotelli, University of Vermont
Aaron M. Ellison, Harvard Forest

Sinauer Associates, Inc. Publishers
Sunderland, Massachusetts U.S.A.

Brief Contents

PART I FUNDAMENTALS OF PROBABILITY AND STATISTICAL THINKING
    1 An Introduction to Probability
    2 Random Variables and Probability Distributions
    3 Summary Statistics: Measures of Location and Spread
    4 Framing and Testing Hypotheses
    5 Three Frameworks for Statistical Analysis

PART II DESIGNING EXPERIMENTS
    6 Designing Successful Field Studies
    7 A Bestiary of Experimental and Sampling Designs
    8 Managing and Curating Data

PART III DATA ANALYSIS
    9 Regression
    10 The Analysis of Variance
    11 The Analysis of Categorical Data
    12 The Analysis of Multivariate Data

PART IV ESTIMATION
    13 The Measurement of Biodiversity
    14 Detecting Populations and Estimating their Size

Appendix Matrix Algebra for Ecologists

Contents

PART I Fundamentals of Probability and Statistical Thinking

CHAPTER 1 An Introduction to Probability
    What Is Probability?
    Measuring Probability
    The Probability of a Single Event: Prey Capture by Carnivorous Plants
    Estimating Probabilities by Sampling
    Problems in the Definition of Probability
    The Mathematics of Probability
    Defining the Sample Space
    Complex and Shared Events: Combining Simple Probabilities
    Probability Calculations: Milkweeds and Caterpillars
    Complex and Shared Events: Rules for Combining Sets
    Conditional Probabilities
    Bayes' Theorem
    Summary

CHAPTER 2 Random Variables and Probability Distributions
    Discrete Random Variables
    Bernoulli Random Variables
    An Example of a Bernoulli Trial
    Many Bernoulli Trials = A Binomial Random Variable
    The Binomial Distribution
    Poisson Random Variables
    An Example of a Poisson Random Variable: Distribution of a Rare Plant
    The Expected Value of a Discrete Random Variable
    The Variance of a Discrete Random Variable
    Continuous Random Variables
    Uniform Random Variables
    The Expected Value of a Continuous Random Variable
    Normal Random Variables
    Useful Properties of the Normal Distribution
    Other Continuous Random Variables
    The Central Limit Theorem
    Summary

CHAPTER 3 Summary Statistics: Measures of Location and Spread
    Measures of Location
    The Arithmetic Mean
    Other Means
    Other Measures of Location: The Median and the Mode
    When to Use Each Measure of Location
    Measures of Spread
    The Variance and the Standard Deviation
    The Standard Error of the Mean
    Skewness, Kurtosis, and Central Moments
    Quantiles
    Using Measures of Spread
    Some Philosophical Issues Surrounding Summary Statistics
    Confidence Intervals
    Generalized Confidence Intervals
    Summary

CHAPTER 4 Framing and Testing Hypotheses
    Scientific Methods
    Deduction and Induction
    Modern-Day Induction: Bayesian Inference
    The Hypothetico-Deductive Method
    Testing Statistical Hypotheses
    Statistical Hypotheses versus Scientific Hypotheses
    Statistical Significance and P-Values
    Errors in Hypothesis Testing
    Parameter Estimation and Prediction
    Summary

CHAPTER 5 Three Frameworks for Statistical Analysis
    Sample Problem
    Monte Carlo Analysis
    Step 1: Specifying the Test Statistic
    Step 2: Creating the Null Distribution
    Step 3: Deciding on a One- or Two-Tailed Test
    Step 4: Calculating the Tail Probability
    Assumptions of the Monte Carlo Method
    Advantages and Disadvantages of the Monte Carlo Method
    Parametric Analysis
    Step 1: Specifying the Test Statistic
    Step 2: Specifying the Null Distribution
    Step 3: Calculating the Tail Probability
    Assumptions of the Parametric Method
    Advantages and Disadvantages of the Parametric Method
    Non-Parametric Analysis: A Special Case of Monte Carlo Analysis
    Bayesian Analysis
    Step 1: Specifying the Hypothesis
    Step 2: Specifying Parameters as Random Variables
    Step 3: Specifying the Prior Probability Distribution
    Step 4: Calculating the Likelihood
    Step 5: Calculating the Posterior Probability Distribution
    Step 6: Interpreting the Results
    Assumptions of Bayesian Analysis
    Advantages and Disadvantages of Bayesian Analysis
    Summary

PART II Designing Experiments

CHAPTER 6 Designing Successful Field Studies
    What Is the Point of the Study?
    Are There Spatial or Temporal Differences in Variable Y?
    What Is the Effect of Factor X on Variable Y?
    Are the Measurements of Variable Y Consistent with the Predictions of Hypothesis H?
    Using the Measurements of Variable Y, What Is the Best Estimate of Parameter θ in Model Z?
    Manipulative Experiments
    Natural Experiments
    Snapshot versus Trajectory Experiments
    The Problem of Temporal Dependence
    Press versus Pulse Experiments
    Replication
    How Much Replication?
    How Many Total Replicates Are Affordable?
    The Rule of 10
    Large-Scale Studies and Environmental Impacts
    Ensuring Independence
    Avoiding Confounding Factors
    Replication and Randomization
    Designing Effective Field Experiments and Sampling Studies
    Are the Plots or Enclosures Large Enough to Ensure Realistic Results?
    What Is the Grain and Extent of the Study?
    Does the Range of Treatments or Census Categories Bracket or Span the Range of Possible Environmental Conditions?
    Have Appropriate Controls Been Established to Ensure that Results Reflect Variation Only in the Factor of Interest?
    Have All Replicates Been Manipulated in the Same Way Except for the Intended Treatment Application?
    Have Appropriate Covariates Been Measured in Each Replicate?
    Summary

CHAPTER 7 A Bestiary of Experimental and Sampling Designs
    Categorical versus Continuous Variables
    Dependent and Independent Variables
    Four Classes of Experimental Design
    Regression Designs
    ANOVA Designs
    Alternatives to ANOVA: Experimental Regression
    Tabular Designs
    Alternatives to Tabular Designs: Proportional Designs
    Summary

CHAPTER 8 Managing and Curating Data
    The First Step: Managing Raw Data
    Spreadsheets
    Metadata
    The Second Step: Storing and Curating the Data
    Storage: Temporary and Archival
    Curating the Data
    The Third Step: Checking the Data
    The Importance of Outliers
    Errors
    Missing Data
    Detecting Outliers and Errors
    Creating an Audit Trail
    The Final Step: Transforming the Data
    Data Transformations as a Cognitive Tool
    Data Transformations because the Statistics Demand It
    Reporting Results: Transformed or Not?
    The Audit Trail Redux
    Summary: The Data Management Flow Chart

PART III Data Analysis

CHAPTER 9 Regression
    Defining the Straight Line and Its Two Parameters
    Fitting Data to a Linear Model
    Variances and Covariances
    Least-Squares Parameter Estimates
    Variance Components and the Coefficient of Determination
    Hypothesis Tests with Regression
    The Anatomy of an ANOVA Table
    Other Tests and Confidence Intervals
    Assumptions of Regression
    Diagnostic Tests For Regression
    Plotting Residuals
    Other Diagnostic Plots
    The Influence Function
    Monte Carlo and Bayesian Analyses
    Linear Regression Using Monte Carlo Methods
    Linear Regression Using Bayesian Methods
    Other Kinds of Regression Analyses
    Robust Regression
    Quantile Regression
    Logistic Regression
    Non-Linear Regression
    Multiple Regression
    Path Analysis
    Model Selection Criteria
    Model Selection Methods for Multiple Regression
    Model Selection Methods in Path Analysis
    Bayesian Model Selection
    Summary

CHAPTER 10 The Analysis of Variance
    Symbols and Labels in ANOVA
    ANOVA and Partitioning of the Sum of Squares
    The Assumptions of ANOVA
    Hypothesis Tests with ANOVA
    Constructing F-Ratios
    A Bestiary of ANOVA Tables
    Randomized Block
    Nested ANOVA
    Two-Way ANOVA
    ANOVA for Three-Way and n-way Designs
    Split-Plot ANOVA
    Repeated Measures ANOVA
    ANCOVA
    Random versus Fixed Factors in ANOVA
    Partitioning the Variance in ANOVA
    After ANOVA: Plotting and Understanding Interaction Terms
    Plotting Results from One-Way ANOVAs
    Plotting Results from Two-Way ANOVAs
    Understanding the Interaction Term
    Plotting Results from ANCOVAs
    Comparing Means
    A Posteriori Comparisons
    A Priori Contrasts
    Bonferroni Corrections and the Problem of Multiple Tests
    Summary

CHAPTER 11 The Analysis of Categorical Data
    Two-Way Contingency Tables
    Organizing the Data
    Are the Variables Independent?
    Testing the Hypothesis: Pearson's Chi-square Test
    An Alternative to Pearson's Chi-Square: The G-Test
    The Chi-square Test and the G-Test for R × C Tables
    Which Test To Choose?
    Multi-Way Contingency Tables
    Organizing the Data
    On to Multi-Way Tables!
    Bayesian Approaches to Contingency Tables
    Tests for Goodness-of-Fit
    Goodness-of-Fit Tests for Discrete Distributions
    Testing Goodness-of-Fit for Continuous Distributions: The Kolmogorov-Smirnov Test
    Summary

CHAPTER 12 The Analysis of Multivariate Data
    Approaching Multivariate Data
    The Need for Matrix Algebra
    Comparing Multivariate Means
    Comparing Multivariate Means of Two Samples: Hotelling's T² Test
    Comparing Multivariate Means of More Than Two Samples: A Simple MANOVA
    The Multivariate Normal Distribution
    Testing for Multivariate Normality
    Measurements of Multivariate Distance
    Measuring Distances between Two Individuals
    Measuring Distances between Two Groups
    Other Measurements of Distance
    Ordination
    Principal Component Analysis
    Factor Analysis
    Principal Coordinates Analysis
    Correspondence Analysis
    Non-Metric Multidimensional Scaling
    Advantages and Disadvantages of Ordination
    Classification
    Cluster Analysis
    Choosing a Clustering Method
    Discriminant Analysis
    Advantages and Disadvantages of Classification
    Multivariate Multiple Regression
    Redundancy Analysis
    Summary

PART IV Estimation

CHAPTER 13 The Measurement of Biodiversity
    Estimating Species Richness
    Standardizing Diversity Comparisons through Random Subsampling
    Rarefaction Curves: Interpolating Species Richness
    The Expectation of the Individual-Based Rarefaction Curve
    Sample-Based Rarefaction Curves: Massachusetts Ants
    Species Richness versus Species Density
    The Statistical Comparison of Rarefaction Curves
    Assumptions of Rarefaction
    Asymptotic Estimators: Extrapolating Species Richness
    Rarefaction Curves Redux: Extrapolation and Interpolation
    Estimating Species Diversity and Evenness
    Hill Numbers
    Software for Estimation of Species Diversity
    Summary

CHAPTER 14 Detecting Populations and Estimating their Size
    Occupancy
    The Basic Model: One Species, One Season, Two Samples at a Range of Sites
    Occupancy of More than One Species
    A Hierarchical Model for Parameter Estimation and Modeling
    Occupancy Models for Open Populations
    Dynamic Occupancy of the Adelgid in Massachusetts
    Estimating Population Size
    Mark-Recapture: The Basic Model
    Mark-Recapture Models for Open Populations
    Occupancy Modeling and Mark-Recapture: Yet More Models
    Sampling for Occupancy and Abundance
    Software for Estimating Occupancy and Abundance
    Summary

APPENDIX Matrix Algebra for Ecologists
Glossary
Literature Cited
Index