Immunological Data Processing & Analysis
|
|
- Ethel Jackson
- 5 years ago
- Views:
Transcription
1 Immunological Data Processing & Analysis Hongmei Yang Center for Biodefence Immune Modeling Department of Biostatistics and Computational Biology University of Rochester June 12, 2012 Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
2 Outline 1 Immunological Data Analysis Examples Exploratory Data Analysis Graphic Methods Description of Data: Summary Statistics Basic Statistical Methods Univariate & Bivariate Analysis Multivariate Analysis 2 Immunological Data Processing Elispot Data Processing Elisa & Luminex Data Processing Hemagglutination Data Processing 3 Appendix Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
3 Examples Example I: H1N1 vaccine trial The H1N1 vaccine trial to study the correlations among immune responses to vaccination. It is an open-label, prospective evaluation in healthy adults in three age groups: 32, and 70+, representing a spectrum of prior exposure to H1N1 viruses. Immune outcomes include: HAI titers against 2009 ph1n1 (day 0, day 28, fold increase) HAI titers against H1N1 Brisbane (day 0, day 28, fold increase) IgA and IgG ELISPOTs against ph1n1 on day 7 Percentages from FACS analysis of B cell subsets Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
4 Examples Example II: B Cell Response of DR1 and B10 Mice to NC and PR8 Infection Scientific interests include: Do DR1 and B10 mice respond differently to NC infection? Do DR1 and B10 mice respond differently to PR8 infection? About experiments: 2 mice groups with strains of DR1 and B10 respectively 3 types of infection imposed on each mice group: MOCK, NC and PR8 Measured 30 types of Cytokines or chemokines concentration representing B cell response after infection 5 replicates for each combination, but only 28 observations available Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
5 Exploratory Data Analysis Graphic Methods Bar charts Histograms Box plots Violin plots: a combination of a box plot and a kernel density plot Scatter plots & scatter plot matrices Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
6 Exploratory Data Analysis Bar Plot HAI at D0 and Fold Increase at D28 by Age Group d0 F.d <= Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
7 Exploratory Data Analysis Histogram plot Histogram of H1N1.IgG Histogram of log10(h1n1.igg) Density 0e+00 1e 05 2e 05 3e 05 4e H1N1.IgG(million) Density log10(h1n1.igg) Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
8 Exploratory Data Analysis Box plot with mean, median and geometric mean Boxplot of IgG H1N1.IgG(million) Arithmetic Mean Median Geometric Mean <= Age Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
9 Exploratory Data Analysis Violin Plot A combination of a box plot and a kernel density plot: Arithmetic Mean Median Geometric Mean <= Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
10 Scatter Plot Matrices Exploratory Data Analysis Scatterplot Matrix of H1N1 Vaccine Study log2.a.d0 r = 0.42 p < 0.01 r = 0.18 p = 0.17 r = 0 p = 0.99 r = 0.12 p = 0.37 r = 0.25 p = 0.08 r = 0.22 p = log2.f.a r = 0.04 p = 0.76 r = 0.2 p = 0.15 r = 0.64 p < 0.01 r = 0.53 p < 0.01 r = 0.56 p < 0.01 log2.b.d0 r = 0.14 p = 0.31 r = 0.2 p = 0.14 r = 0.01 p = 0.92 r = 0.11 p = log10.iga r = 0.53 p < 0.01 r = 0.49 p < 0.01 r = 0.52 p < 0.01 log10.igg r = 0.73 p < 0.01 r = 0.8 p < x27hi38hi r = 0.96 p < x38hi Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
11 Exploratory Data Analysis Data Assumptions & Transformation Scale of measurement: categorical, ordinal or continuous? If continuous, is normally distributed? If continuous but not normally distributed, could be transformed to be approximately normal. log transformation of Y = log(y + 1): most popular in immunological data analysis Square root transformation of Y = Y + 0.5: good to Poisson distributed data but not so popular Arcsine square root transformation of p = arcsin p: good to outcomes in percentages Inverse hyperbolic sine transformation Y = arcsinh(y ) = log (Y + Y 2 + 1): good for data with a lot of zeroes, e.g., microbiome data which often contains many zeroes for absent bacterial species By inverse hyperbolic sine transformation, dependent variable may be positive, negative and zero. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
12 Exploratory Data Analysis Measure of Central Location Typically Mean, Median and Geometric Mean: n G = n x 1,..., x n ln G = 1 n i=1 ln x i Sample mean is very sensitive to outliers, but median and geometric mean are robust. When distribution of data is not symmetrical, mean and median will differ. Geometric mean is not more than mean. Presented measure of central location should be compatible with the test used for statistical analysis. For example: t test: mean Wilcoxon rank sum test: median Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
13 Exploratory Data Analysis IgG Plots Histogram of H1N1.IgG Histogram of log10(h1n1.igg) Density 0e+00 1e 05 2e 05 3e 05 4e H1N1.IgG(million) Density log10(h1n1.igg) Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
14 Exploratory Data Analysis IgG Homogeneity among 3 Age Groups: 32, and 70+ Kruskal-Wallis One-Way ANOVA IgG log 10 IgG skewness kurtosis p value 1.99e e 05 Figure : IgG Homogeneity among 3 Age Groups Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
15 Exploratory Data Analysis Plasmablast Plots Plasmablast Arcsine Square Root Density Plasmablast Log Density Inverse Hyperbolic Sine Density Density Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
16 Exploratory Data Analysis Plasmablast Homogeneity among 3 Age Groups: 32, and 70+ Kruskal-Wallis One-Way ANOVA f * log f * arcsin p arcsinh(p) skewness kurtosis p value * f = 100p Figure : Plasmablast Homogeneity among 3 Age Groups Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
17 Exploratory Data Analysis Measure of Spread Measure of spread uses Sample standard error of mean, if mean is used as central location; Sample interquartile range, if median is used as central location Sample standard error of log transformed geometric mean, if geometric mean is used as central location Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
18 Basic Statistical Methods Correlation Analysis Pearson correlation coefficients: indicating the strength of a linear relationship between two variables, and influenced by outliers unequal variances non-normality nonlinearity Spearman s rank correlation coefficients: a non-parametric measure of statistical dependence between two variables and it assesses how well the relationship between two variables can be described using a monotonic function Kendall s Tau: a measure of correlation between two ordinal-level variables Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
19 Basic Statistical Methods Standard Statistical Testing Methods: independent samples vs. paired samples Independent samples: two separate sets of i.i.d samples Paired samples: a sample of matched pairs of similar units, or one group of units that has been tested twice. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
20 Basic Statistical Methods Standard Statistical Testing Methods: different types of data Type of data No. of samples being compared Relationship between samples Underlying distribution of all samples Potential statistical test Binary 1 Not applicable Binary One sample binomial test Binary 2 Independent Binary Chi-square test, Fisher s exact test Binary >2 Independent Binary Chi-square test Binary 2 Paired Binary McNemar s test Binary >2 Related Binary Cochran s Q test (an extension to McNemar s test) Nominal 1 Not applicable Normal One-sample t-test for means, one-sample chi-square test for variance Nominal 1 Not applicable Nonnormal One-sample Wilcoxon signed-rank test, one-sample sign test Nominal 2 Independent Normal Two-sample t test for means, two-sample F test for variance Nominal 2 Independent Nonnormal Wilcoxon rank sum test (also called Mann-Whitney U test, requires identical spread) for medians, Ansari-Bradley test for spread Nominal 2 Independent Nonnormal Kolmogorov-Smirnov test for overall difference. In case of same shape and spread, Wilcoxon rank sum test is more powerful; in case of same median and shape, Ansari-Bradley test is more powerful. Nominal 2 Paired Normal Paired t test Nominal 2 Paired Nonnormal Wilcoxon signed-rank test, sign test (Sign-rank test is more robust to outliers or data from a heavy-tailed distribution, but less powerful in case of outliers) Nominal >2 Independent Normal One-way ANOVA for means, Bartlett s test for homogeneity for variances Nominal >2 Independent Nonnormal Kruskal-Wallis test (an extension of Wilcoxon rank sum test) Nominal >2 Related Nonnormal Friedman rank sum test: an extension to sign test, and the non-parametric analogy of repeated ANOVA Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
21 Basic Statistical Methods Multivariate Analysis Multiple regression (linear, non-linear, logistic,...) MANOVA: simultaneous comparisons of more endpoints (cytokines) instead of repeated application of ANOVA MANCOVA: an extension of MANOVA that additionally allows to control for the effect of an other continuous variable to be controlled (confounder), f.g., simultaneously compare cytokines across groups and adjusted for age. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
22 Multiple Regression: Example Basic Statistical Methods Example: Is age confounding between HAI at day 0 and HAI at day 28 in the H1N1 vaccine trial study? A confounding variable is an extraneous variable in a statistical model that correlates with both the dependent variable and the independent variable. HAI(log of F.I) D ρ = 0.42 p value= HAI(log) D p value=0.011 HAI at D0 p value< HAI F at D28 <= Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
23 Multiple Regression: Example Basic Statistical Methods Model: log 2 (HAI.F.D28) = β 1 log 2 (HAI.D0) + β 2 Age, Table : Estimators from linear model fitting Estimates Std Pr(> t ) β β 21 : < 2e 16 β 22 : e 05 β 23 : Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
24 Basic Statistical Methods Data Reduction Principal component analysis (PCA): finding summary variables called principal components that contain most of the information of the original data Cluster analysis: group individuals so that subjects in the same cluster have similar profiles of the parameter under study Linear discriminant analysis: a method that derives linear combinations of the independent variable that best discriminate between the two outcome groups Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
25 Basic Statistical Methods MANOVA on Example II Possible statistical approaches: ANOVA on each cytokines: separate ANOVA is generally less powerful MANOVA: utilizing information from all variables simultaneously. In consideration of large number of dependent variables but smaller number of observation, following cytokines are selected to demonstrate:il7, IL12p70, VEGF, IL10. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
26 Basic Statistical Methods MANOVA on Example II Table : Hypothesis testing by ANOVA and MANOVA p value NC: B10-DR1 PR8: B10-DR1 IL IL12p VEGF IL MANOVA: IL7 IL12p70 VEGF IL Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
27 Examples Elispot Data Processing id= id= 25 IgG id= id= Dilution Factor Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
28 Elispot Data Processing Problems How to determine cell numbers from spot counts yielded by Elispot assays? How to deal with outliers? Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
29 Methods we developed Elispot Data Processing Least Squares: βls = K k=1 X ky k K k=1 X 2 k Robust Least Squares: βrls = min β K k=1 ρ( Y k X k β Mean Approach: βme = K Y k k=1 X k K σ ) Median Approach: βmd = median{ Y k X k }, k = 1,..., K Poisson Approach: βpoi = K k=1 Y k K k=1 X k Robust Poisson Approach: βrpoi = min β K k=1 ρ( Y k/x k β σ ) Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
30 Elispot Data Processing Simulations: Without Staining Error on Top Dilution Levels IgG: ARE IgG: PSAE HA: ARE HA: PSAE MD ME LS RLS POI RPOI Figure : ARE, average relative error; PSAE, proportion of yielding estimates with smallest absolute error among 500 runs Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
31 Elispot Data Processing Simulations: With Staining Error on Top Dilution Levels IgG: ARE IgG: PSAE HA: ARE HA: PSAE MD * ME * LS * RLS * POI * RPOI * Figure : ARE, average relative error; PSAE, proportion of yielding estimates with smallest absolute error among 500 runs Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
32 Recommendation Elispot Data Processing Simulation study shows: Poisson approach has optimal performance regardless of probable staining errors associated with the less diluted samples. In practice, we recommend: Remove probable staining errors associated with less diluted samples Use the Poisson approach to estimate cell counts from ELISPOT assays Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
33 Elisa & Luminex Data Processing Standard Approach Two-step approach for statistical processing: Fit four-parameter logistic curve using standard data y = a d 1 + (x/c) b + d + ɛ y =optical density, x =concentration, a =maximum response, d =minimum response, c =ED50, b =slope-like parameter. Unknown concentrations are calibrated from the standard curve. Estimates of diluted samples are scaled back to the original scale. Scaled values are averaged to obtain an estimated concentration for the unknown sample. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
34 Numerical Problems Elisa & Luminex Data Processing Nonlinear maximization Nonlinear maximization requires specification of initial parameter values The choice of initial values may influence on convergence of estimation algorithm In the worst case yielding no convergence Initial values can be obtained by linearization Initial value for a: maximum y Initial value for d: minimum y Making transform as log a y y d = b{logx logc} Then initial values for b and c can be obtained by linear regression. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
35 Examples: Fitted Curves Elisa & Luminex Data Processing Plate14 T13 IgG Total 500K 1M Plate17 T13 IgG Total 50K 100K Optical Density Plate20 T13 IgG Total 10M 50M standard test: Outlier test: Normal standard test: Outlier test: Normal Plate21 T13 IgG Total 2.5M 5M Below lower limit standard test: Outlier test: Normal standard test: Outlier test: Normal Concentration Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
36 Examples: Estimated Concentrations Elisa & Luminex Data Processing IgG Total T1307C 03 Concentration * *1e+05 14*5e+05 14*1e+06 21* *5e+06 20*1e+07 IgG Total T1307C 03 in Original Scale Concentration in Original Scale 0e+00 4e+06 8e+06 17* *1e+05 14*5e+05 14*1e+06 21* *5e+06 20*1e+07 Dilution Factor Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
37 Examples: Raw Data Hemagglutination Data Processing 10053L 10072L Positive Wells (%) Positive Wells (%) L L Positive Wells (%) Positive Wells (%) Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
38 Hemagglutination Data Processing Classical Approach Reed-Muench method: a linear interpolation with formula I = (% infected at dilution immediately above 50%)-50% (% infected at dilution immediately above 50%)-(% infected at dilution immediately below 50%) ED 50 = 10log of total dilution immediately above 50% I d I = interpolated value of the 50% endpoint d = log of the dilution factor (i.e. the difference between the log dilution intervals) Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
39 Classical Approach: Drawbacks Hemagglutination Data Processing Using information from only two points around the potential titer: loss of information Assuming a linear dose-response relationship: subject to question Inefficient in both precision and accuracy Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
40 Hemagglutination Data Processing Better Approach Four parameter logistic (4PL) regression y = a d 1 + (x/c) b + d + ɛ y = response, x = concentration, a = maximum response, d = minimum response, c =ED50, b = slope-like parameter. Response could be: % positive responses Arcsine square root transformation of % positive responses Numerical Problems: can be dealt with as previously. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
41 Examples: Fitted Curves Hemagglutination Data Processing 10053L 10072L Positive Wells (%) RM 4PL 4PL(AST) Positive Wells (%) RM 4PL 4PL(AST) L L Positive Wells (%) RM 4PL 4PL(AST) Positive Wells (%) RM 4PL 4PL(AST) Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
42 BLIS: a Platform Combining Data Management and Statistical Processing & Analysis Bio-Lab Informatics Server Standardize R codes for statistical processing of data from Elispot, Elisa and Hemmagglutination assays. Incorporate the R codes into automated routines within our customized BLIS application. Immunologists can use the system for data management, visual exploration and statistical processing and generate up-to-date reporting. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
43 Reference Yang, H., Topham, D.J., Holden-Wiltse, J. andwu, H. (2012) Statistical Estimation & Inference of Cell Counts from ELISPOT Limiting Dilution Assays. Journal of Biopharmaceutical Statistics Accepted. Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
44 Acknowledgement NIAID: UR-CBIM (HHSN C) NIAID: UR-NYICE (HHSN C) David Topham s Lab Martin Zand s Lab Tim Mosmann s Lab Andrea Sant s Lab Hongmei Yang (CBIM at URMC) Immunological Data Processing & Analysis June 12, / 44
What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu
What you should know before you collect data BAE 815 (Fall 2017) Dr. Zifei Liu Zifeiliu@ksu.edu Types and levels of study Descriptive statistics Inferential statistics How to choose a statistical test
More informationUnderstandable Statistics
Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement
More informationAnalysis and Interpretation of Data Part 1
Analysis and Interpretation of Data Part 1 DATA ANALYSIS: PRELIMINARY STEPS 1. Editing Field Edit Completeness Legibility Comprehensibility Consistency Uniformity Central Office Edit 2. Coding Specifying
More informationisc ove ring i Statistics sing SPSS
isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements
More informationList of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition
List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing
More informationSelecting the Right Data Analysis Technique
Selecting the Right Data Analysis Technique Levels of Measurement Nominal Ordinal Interval Ratio Discrete Continuous Continuous Variable Borgatta and Bohrnstedt state that "the most of central constructs
More informationTable of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017
Essential Statistics for Nursing Research Kristen Carlin, MPH Seattle Nursing Research Workshop January 30, 2017 Table of Contents Plots Descriptive statistics Sample size/power Correlations Hypothesis
More informationSTATISTICS AND RESEARCH DESIGN
Statistics 1 STATISTICS AND RESEARCH DESIGN These are subjects that are frequently confused. Both subjects often evoke student anxiety and avoidance. To further complicate matters, both areas appear have
More informationBusiness Research Methods. Introduction to Data Analysis
Business Research Methods Introduction to Data Analysis Data Analysis Process STAGES OF DATA ANALYSIS EDITING CODING DATA ENTRY ERROR CHECKING AND VERIFICATION DATA ANALYSIS Introduction Preparation of
More informationBasic Biostatistics. Chapter 1. Content
Chapter 1 Basic Biostatistics Jamalludin Ab Rahman MD MPH Department of Community Medicine Kulliyyah of Medicine Content 2 Basic premises variables, level of measurements, probability distribution Descriptive
More informationExperimental Design for Immunologists
Experimental Design for Immunologists Hulin Wu, Ph.D., Dean s Professor Department of Biostatistics & Computational Biology Co-Director: Center for Biodefense Immune Modeling School of Medicine and Dentistry
More informationEcological Statistics
A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents
More informationHOW STATISTICS IMPACT PHARMACY PRACTICE?
HOW STATISTICS IMPACT PHARMACY PRACTICE? CPPD at NCCR 13 th June, 2013 Mohamed Izham M.I., PhD Professor in Social & Administrative Pharmacy Learning objective.. At the end of the presentation pharmacists
More informationChoosing the Correct Statistical Test
Choosing the Correct Statistical Test T racie O. Afifi, PhD Departments of Community Health Sciences & Psychiatry University of Manitoba Department of Community Health Sciences COLLEGE OF MEDICINE, FACULTY
More informationTypes of Statistics. Censored data. Files for today (June 27) Lecture and Homework INTRODUCTION TO BIOSTATISTICS. Today s Outline
INTRODUCTION TO BIOSTATISTICS FOR GRADUATE AND MEDICAL STUDENTS Files for today (June 27) Lecture and Homework Descriptive Statistics and Graphically Visualizing Data Lecture #2 (1 file) PPT presentation
More informationOverview of Non-Parametric Statistics
Overview of Non-Parametric Statistics LISA Short Course Series Mark Seiss, Dept. of Statistics April 7, 2009 Presentation Outline 1. Homework 2. Review of Parametric Statistics 3. Overview Non-Parametric
More informationQuantitative Methods in Computing Education Research (A brief overview tips and techniques)
Quantitative Methods in Computing Education Research (A brief overview tips and techniques) Dr Judy Sheard Senior Lecturer Co-Director, Computing Education Research Group Monash University judy.sheard@monash.edu
More informationBiostatistics II
Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,
More informationLecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics
Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationStatistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.
Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe
More informationREVIEW ARTICLE. A Review of Inferential Statistical Methods Commonly Used in Medicine
A Review of Inferential Statistical Methods Commonly Used in Medicine JCD REVIEW ARTICLE A Review of Inferential Statistical Methods Commonly Used in Medicine Kingshuk Bhattacharjee a a Assistant Manager,
More information11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES
Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationFrom Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1
From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!
More informationINTRODUCTION TO MEDICAL RESEARCH: ESSENTIAL SKILLS
INTRODUCTION TO MEDICAL RESEARCH: ESSENTIAL SKILLS SCALES OF MEASUREMENT AND WAYS OF SUMMARIZING DATA Alecsandra IRIMIE-ANA 1 1. Psychiatry Hospital Prof. Dr. Alexandru Obregia ABSTRACT Regardless the
More information10. LINEAR REGRESSION AND CORRELATION
1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have
More informationStatistics Guide. Prepared by: Amanda J. Rockinson- Szapkiw, Ed.D.
This guide contains a summary of the statistical terms and procedures. This guide can be used as a reference for course work and the dissertation process. However, it is recommended that you refer to statistical
More informationChapter 1: Explaining Behavior
Chapter 1: Explaining Behavior GOAL OF SCIENCE is to generate explanations for various puzzling natural phenomenon. - Generate general laws of behavior (psychology) RESEARCH: principle method for acquiring
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationSPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.
SPRING GROVE AREA SCHOOL DISTRICT PLANNED COURSE OVERVIEW Course Title: Basic Introductory Statistics Grade Level(s): 11-12 Units of Credit: 1 Classification: Elective Length of Course: 30 cycles Periods
More informationNEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003
NEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003 Obs GROUP I DOPA LNDOPA 1 neurblst 1 48.000 1.68124 2 neurblst 1 133.000 2.12385 3 neurblst 1 34.000 1.53148
More informationMMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?
MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference
More informationStill important ideas
Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement
More informationSUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK
SUMMER 011 RE-EXAM PSYF11STAT - STATISTIK Full Name: Årskortnummer: Date: This exam is made up of three parts: Part 1 includes 30 multiple choice questions; Part includes 10 matching questions; and Part
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationStill important ideas
Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still
More informationAP Statistics. Semester One Review Part 1 Chapters 1-5
AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically
More informationSTATISTICS & PROBABILITY
STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness
More information2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%
Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of
More informationPrepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies
Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang At the end of this session,
More informationReadings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
More informationBasic Steps in Planning Research. Dr. P.J. Brink and Dr. M.J. Wood
Basic Steps in Planning Research Dr. P.J. Brink and Dr. M.J. Wood Research Levels Level 1: There is little or no literature available on the topic or on the population. The purpose is to describe what
More informationStatistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions
Readings: OpenStax Textbook - Chapters 1 5 (online) Appendix D & E (online) Plous - Chapters 1, 5, 6, 13 (online) Introductory comments Describe how familiarity with statistical methods can - be associated
More informationThe SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance
The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis
More informationWDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?
WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters
More informationBiostatistics for Med Students. Lecture 1
Biostatistics for Med Students Lecture 1 John J. Chen, Ph.D. Professor & Director of Biostatistics Core UH JABSOM JABSOM MD7 February 14, 2018 Lecture note: http://biostat.jabsom.hawaii.edu/education/training.html
More informationOverview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate
Overview Critical Literature Evaluation and Biostatistics Ahl Ashley N. Lewis, PharmD, BCPS Clinical Specialist, Drug Information UNC Hospitals Background Review of basic statistics Statistical tests Clinical
More informationFigure: Presentation slides:
Joni Lakin David Shannon Margaret Ross Abbot Packard Auburn University Auburn University Auburn University University of West Georgia Figure: http://www.auburn.edu/~jml0035/eera_chart.pdf Presentation
More informationContent. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes
Content Quantifying association between continuous variables. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General
More informationBEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS
BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS Nan Shao, Ph.D. Director, Biostatistics Premier Research Group, Limited and Mark Jaros, Ph.D. Senior
More informationinvestigate. educate. inform.
investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced
More informationLearning Objectives 9/9/2013. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency
Conflicts of Interest I have no conflict of interest to disclose Biostatistics Kevin M. Sowinski, Pharm.D., FCCP Last-Chance Ambulatory Care Webinar Thursday, September 5, 2013 Learning Objectives For
More information9/4/2013. Decision Errors. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency
Conflicts of Interest I have no conflict of interest to disclose Biostatistics Kevin M. Sowinski, Pharm.D., FCCP Pharmacotherapy Webinar Review Course Tuesday, September 3, 2013 Descriptive statistics:
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 5, 6, 7, 8, 9 10 & 11)
More informationSurvey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.
Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.0 Overview 1. Survey research 2. Survey design 3. Descriptives & graphing 4. Correlation
More informationSurvey research (Lecture 1)
Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.0 Overview 1. Survey research 2. Survey design 3. Descriptives & graphing 4. Correlation
More information12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2
PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special
More informationApplications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis
DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:
More informationUsing a Likert-type Scale DR. MIKE MARRAPODI
Using a Likert-type Scale DR. MIKE MARRAPODI Topics Definition/Description Types of Scales Data Collection with Likert-type scales Analyzing Likert-type Scales Definition/Description A Likert-type Scale
More informationChapter 14: More Powerful Statistical Methods
Chapter 14: More Powerful Statistical Methods Most questions will be on correlation and regression analysis, but I would like you to know just basically what cluster analysis, factor analysis, and conjoint
More informationApplied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business
Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A
More informationLecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method
Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to
More informationExamining differences between two sets of scores
6 Examining differences between two sets of scores In this chapter you will learn about tests which tell us if there is a statistically significant difference between two sets of scores. In so doing you
More informationReadings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14
Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14 Still important ideas Contrast the measurement of observable actions (and/or characteristics)
More informationStatistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions
Readings: OpenStax Textbook - Chapters 1 5 (online) Appendix D & E (online) Plous - Chapters 1, 5, 6, 13 (online) Introductory comments Describe how familiarity with statistical methods can - be associated
More informationInvestigating the robustness of the nonparametric Levene test with more than two groups
Psicológica (2014), 35, 361-383. Investigating the robustness of the nonparametric Levene test with more than two groups David W. Nordstokke * and S. Mitchell Colp University of Calgary, Canada Testing
More informationOutline. Practice. Confounding Variables. Discuss. Observational Studies vs Experiments. Observational Studies vs Experiments
1 2 Outline Finish sampling slides from Tuesday. Study design what do you do with the subjects/units once you select them? (OI Sections 1.4-1.5) Observational studies vs. experiments Descriptive statistics
More information11/24/2017. Do not imply a cause-and-effect relationship
Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection
More informationNORTH SOUTH UNIVERSITY TUTORIAL 1
NORTH SOUTH UNIVERSITY TUTORIAL 1 REVIEW FROM BIOSTATISTICS I AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 DATA TYPES/ MEASUREMENT SCALES Categorical:
More informationStatistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN
Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN Vs. 2 Background 3 There are different types of research methods to study behaviour: Descriptive: observations,
More informationProfile Analysis. Intro and Assumptions Psy 524 Andrew Ainsworth
Profile Analysis Intro and Assumptions Psy 524 Andrew Ainsworth Profile Analysis Profile analysis is the repeated measures extension of MANOVA where a set of DVs are commensurate (on the same scale). Profile
More informationApplication of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties
Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Bob Obenchain, Risk Benefit Statistics, August 2015 Our motivation for using a Cut-Point
More informationModern Regression Methods
Modern Regression Methods Second Edition THOMAS P. RYAN Acworth, Georgia WILEY A JOHN WILEY & SONS, INC. PUBLICATION Contents Preface 1. Introduction 1.1 Simple Linear Regression Model, 3 1.2 Uses of Regression
More informationPerformance of Median and Least Squares Regression for Slightly Skewed Data
World Academy of Science, Engineering and Technology 9 Performance of Median and Least Squares Regression for Slightly Skewed Data Carolina Bancayrin - Baguio Abstract This paper presents the concept of
More informationSummary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0
Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0 Overview 1. Survey research and design 1. Survey research 2. Survey design 2. Univariate
More informationBiology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction
Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction In this exercise, we will gain experience assessing scatterplots in regression and
More informationOn the purpose of testing:
Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase
More informationHow to describe bivariate data
Statistics Corner How to describe bivariate data Alessandro Bertani 1, Gioacchino Di Paola 2, Emanuele Russo 1, Fabio Tuzzolino 2 1 Department for the Treatment and Study of Cardiothoracic Diseases and
More informationPOST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY
No. of Printed Pages : 12 MHS-014 POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY Time : 2 hours Maximum Marks : 70 PART A Attempt all questions.
More informationDr. Kelly Bradley Final Exam Summer {2 points} Name
{2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.
More informationReadings: Textbook readings: OpenStax - Chapters 1 4 Online readings: Appendix D, E & F Online readings: Plous - Chapters 1, 5, 6, 13
Readings: Textbook readings: OpenStax - Chapters 1 4 Online readings: Appendix D, E & F Online readings: Plous - Chapters 1, 5, 6, 13 Introductory comments Describe how familiarity with statistical methods
More informationAssignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box
Assignment #6 Chapter 10: 14, 15 Chapter 11: 14, 18 Due tomorrow Nov. 6 th by 2pm in your TA s homework box Assignment #7 Chapter 12: 18, 24 Chapter 13: 28 Due next Friday Nov. 13 th by 2pm in your TA
More informationResearch Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process
Research Methods in Forest Sciences: Learning Diary Yoko Lu 285122 9 December 2016 1. Research process It is important to pursue and apply knowledge and understand the world under both natural and social
More informationStatistical questions for statistical methods
Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the
More informationKidane Tesfu Habtemariam, MASTAT, Principle of Stat Data Analysis Project work
1 1. INTRODUCTION Food label tells the extent of calories contained in the food package. The number tells you the amount of energy in the food. People pay attention to calories because if you eat more
More informationIndustrial and Manufacturing Engineering 786. Applied Biostatistics in Ergonomics Spring 2012 Kurt Beschorner
Industrial and Manufacturing Engineering 786 Applied Biostatistics in Ergonomics Spring 2012 Kurt Beschorner Note: This syllabus is not finalized and is subject to change up until the start of the class.
More informationLAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*
LAB ASSIGNMENT 4 1 INFERENCES FOR NUMERICAL DATA In this lab assignment, you will analyze the data from a study to compare survival times of patients of both genders with different primary cancers. First,
More informationIntroduction to Statistical Data Analysis I
Introduction to Statistical Data Analysis I JULY 2011 Afsaneh Yazdani Preface What is Statistics? Preface What is Statistics? Science of: designing studies or experiments, collecting data Summarizing/modeling/analyzing
More informationDay 11: Measures of Association and ANOVA
Day 11: Measures of Association and ANOVA Daniel J. Mallinson School of Public Affairs Penn State Harrisburg mallinson@psu.edu PADM-HADM 503 Mallinson Day 11 November 2, 2017 1 / 45 Road map Measures of
More informationReveal Relationships in Categorical Data
SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction
More informationAMSc Research Methods Research approach IV: Experimental [2]
AMSc Research Methods Research approach IV: Experimental [2] Marie-Luce Bourguet mlb@dcs.qmul.ac.uk Statistical Analysis 1 Statistical Analysis Descriptive Statistics : A set of statistical procedures
More informationEvidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013
Evidence-Based Medicine Journal Club A Primer in Statistics, Study Design, and Epidemiology August, 2013 Rationale for EBM Conscientious, explicit, and judicious use Beyond clinical experience and physiologic
More informationPTHP 7101 Research 1 Chapter Assignments
PTHP 7101 Research 1 Chapter Assignments INSTRUCTIONS: Go over the questions/pointers pertaining to the chapters and turn in a hard copy of your answers at the beginning of class (on the day that it is
More informationDoctoral Dissertation Boot Camp Quantitative Methods Kamiar Kouzekanani, PhD January 27, The Scientific Method of Problem Solving
Doctoral Dissertation Boot Camp Quantitative Methods Kamiar Kouzekanani, PhD January 27, 2018 The Scientific Method of Problem Solving The conceptual phase Reviewing the literature, stating the problem,
More information(C) Jamalludin Ab Rahman
SPSS Note The GLM Multivariate procedure is based on the General Linear Model procedure, in which factors and covariates are assumed to have a linear relationship to the dependent variable. Factors. Categorical
More informationStatistical Methods and Reasoning for the Clinical Sciences
Statistical Methods and Reasoning for the Clinical Sciences Evidence-Based Practice Eiki B. Satake, PhD Contents Preface Introduction to Evidence-Based Statistics: Philosophical Foundation and Preliminaries
More information