Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection
Author's response to reviews
Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection
Authors: Jestinah M Mahachie John (jmahachie@ulg.ac.be), François Van Lishout (f.vanlishout@ulg.ac.be), Elena S Gusareva (gusareva.elena@gmail.com), Kristel Van Steen (kristel.vansteen@ulg.ac.be)
Version: 2
Date: 4 March 2013
Author's response to reviews: see over
03 March 2013

Dear Editor (BioData Mining),

We hereby submit a revised version of the manuscript entitled "A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection", and would be grateful if you would reconsider this manuscript for publication in BioData Mining.

We thank the editor for giving us the chance to resubmit this paper and to formulate responses to the comments given by the reviewers. We thank all reviewers for taking the time to read our manuscript and for their constructive comments and suggestions to further improve it. We have addressed all of the concerns raised. Should our point-by-point response (provided below) not provide satisfactory replies, we remain available to provide additional information or more details.

Please direct all correspondence concerning this manuscript to jmahachie@ulg.ac.be. Thank you for taking an interest in our work. The authors declare no conflict of interest.

Yours sincerely,
On behalf of the authors,
Jestinah M. Mahachie John, PhD
Phone: jmahachie@ulg.ac.be
Point-by-point replies to comments from the editor and reviewers

EDITOR

During the preparations for my PhD thesis defense, which involved setting up and interpreting additional simulations, we realized that the qq-plots provided in Figure 4 could be improved to better convey the intended messages. Therefore, as was done in my PhD thesis, we have replaced Figure 4 of the originally submitted version with new qq-plots of MB-MDR step 2 test values. As a result, we have altered some statements (not linked to reviewers' comments) in the main submission as follows. Text in red is newly inserted text. Deleted text has double strikethrough lines. Black text is as it was in the originally submitted manuscript.

In the Results section

Figure 4 shows the qq-plots related to the SNP pairs and their MB-MDR step 2 test statistics (i.e., the maximum of two association tests: one involving H-cells versus {L,O}-cells, and one involving L-cells versus {H,O}-cells). shows the qq-plots related to the SNP pairs and their final MB-MDR test results (squared Student's t). However, recreating Figure 3, now for cell (2,2) instead of (0,0) (hence, the multilocus genotype cell which has the smallest number of individuals contributing to it), also highlights hard-to-ignore deviations from the theoretical F(1,498) distribution at the multilocus genotype cell labeling stage (see Supplementary Figure S2). This suggests that the largely deviating results observed in Figure 4 are a cumulative effect and the result of subsequently building on invalid test results. The outlying upper right dots in Figure 4 refer to test results corresponding to the causal epistatic SNP pair. Other scenarios show similar trends (results not shown).
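For orientation, the F(1,498) reference used for a single squared Student's t test follows from a standard identity. The line below is a sketch that assumes a pooled-variance two-group t test on n = 500 individuals, which is what the 498 degrees of freedom suggest; the sample size itself is not stated in this response.

```latex
T \sim t_{\nu} \;\Longrightarrow\; T^{2} \sim F(1,\nu), \qquad \nu = n - 2 = 500 - 2 = 498 .
```

This identity covers one two-group test only. A statistic built as the maximum of two such tests on data-driven groups need not follow F(1,498), which is consistent with the deviations described above.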
Conclusion section

However, the overall performance of MB-MDR, which includes a permutation-based correction for multiple testing, is not affected in terms of type I error control. Improved power can be obtained by pre-analysis data transformations. MB-MDR permutation-based maxT correction for multiple testing keeps type I error and false positive rates under control, since in all considered simulation scenarios, the assumption of subset pivotality of the maxT permutation strategy was plausible.

Figure 4: Qq-plots of MB-MDR step 2 test values (squared Student's t), for normal and chi-squared trait distributions, and non-transformed or rank-transformed to normal data. For each setting, one replicate with epistatic variance 10% is considered and F-statistics are pooled for all SNP pairs over the 999 permutations. A theoretical F-distribution according to F(1,498) is taken as the reference. Qq-plots of observed final MB-MDR test values, for normal and chi-squared trait distributions, and non-transformed or rank-transformed to normal data. For each setting, one replicate with genetic variance 10% is considered. A generated F-distribution according to F(1,498) is taken as the reference.

REVIEWERS SECTION
We have responded to the 3 minor comments above. We have added the following paragraph to the background section to address the reviewer's concern.

One of the pioneering methods used in the context of dimensionality reduction and gene-gene interaction detection is the Multifactor Dimensionality Reduction (MDR) method, initially developed by Ritchie et al. [2]. MDR offers an alternative to traditional regression-based approaches. The method is model-free and non-parametric in the sense that it does not assume any particular genetic model. In particular, MDR for binary traits [2] enforces a dimensionality reduction by pooling multilocus genotype classes into two groups of risk based on some threshold value, and by evaluating the epistasis model via cross-validation principles. One concern related to the initial implementations of the MDR method was that some important interactions could be missed due to pooling too many multilocus genotype classes together. Another concern was that the MDR method did not facilitate making adjustments for lower-order genetic effects or confounding factors. Lastly, it was somewhat disappointing that after computationally intensive cross-validation and permutation-based significance assessment procedures only a single best epistasis model was proposed. Over the years, several attempts have been made to further improve the MDR ideas of Ritchie et al. [2]; see for instance [3]. However, an MDR-based method was needed that could tackle all of the aforementioned issues within a unified framework and would flexibly accommodate different study designs of related and unrelated individuals.

Model-Based Multifactor Dimensionality Reduction (MB-MDR) originated as such a unified dimensionality reduction approach. Like MDR, MB-MDR is an intrinsically non-parametric method, and thus avoids making hard-to-verify assumptions about genetic modes of inheritance. The original MB-MDR implementation in R by Calle et al. [4] suffered from its own drawbacks, the major one being the significance assessment of epistasis models, which was based on the derivation of MAF-dependent null distributions. These drawbacks were handled in subsequent C++ versions of the MB-MDR software, adhering to the key principles of the MB-MDR strategy [5]. In summary, these key features are 1) dimensionality reduction via multilocus genotype cell labeling using appropriate association tests, 2) prioritization of multiple epistasis models (on reduced constructs / lower-dimensional features) via appropriate association tests and adequate multiple testing corrections to control false positives, 3) possible adjustment for lower-order effects or confounders in relevant steps of the epistasis detection process.
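The pooling step of the original MDR for binary traits, described above, can be illustrated with a minimal sketch. It is not the authors' software: the function name `mdr_pool`, the genotype coding, and the default threshold of 1.0 on the case:control ratio are assumptions made for illustration, and the cross-validation part of MDR is left out.

```python
from collections import Counter


def mdr_pool(genotypes_a, genotypes_b, is_case, threshold=1.0):
    """Label each observed two-locus genotype cell 'high' or 'low' risk.

    A cell is called high risk when its case:control ratio reaches
    `threshold` (hypothetical default: 1.0, i.e. at least as many cases
    as controls). Cells with cases but no controls count as high risk.
    """
    cases, controls = Counter(), Counter()
    for ga, gb, case in zip(genotypes_a, genotypes_b, is_case):
        if case:
            cases[(ga, gb)] += 1
        else:
            controls[(ga, gb)] += 1

    labels = {}
    for cell in set(cases) | set(controls):
        # Comparing counts instead of dividing avoids a zero denominator.
        high = cases[cell] >= threshold * controls[cell]
        labels[cell] = "high" if high else "low"
    return labels
```

Pooling nine two-locus cells into only two risk groups is what makes the reduction one-dimensional, and it is also the source of the first concern listed above: cells with weak or no evidence are forced into one of the two groups.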
Scale transformations are quite common as remedial strategies to meet statistical testing assumptions. However, since the optimal scale transformation is often based on theoretical motivations or statistical convenience, it often leads to new constructs that are hard to interpret or are biologically meaningless. Another concern related to implementing scale transformations is that non-additive signals may be removed as a direct consequence of such transformations prior to analysis [44]. Our results confirmed that rank-based transformations are generally most powerful when quantitative traits are non-normally distributed. Rank transformations serve as a bridge between non-parametrics and parametrics [45]. They naturally eliminate any problem of skewness (e.g. chi-squared distribution). By ranking, the impact of outliers is minimized: regardless of how extreme the most extreme observation is, the same rank is given to it. A particular type of rank transformation uses percentile ranks and is referred to as rank transformation to normality. In this context, a percentile rank is defined as the proportion of quantitative trait outcomes in a distribution that a specific trait value is greater than or equal to. When the number of ties is negligible, this transformation leads to a near-perfect normal distribution, irrespective of the original trait's distribution, which usually is a highly desirable property.

We remark that MB-MDR's dimensionality reduction step involves performing multiple ANOVA tests, one for each multilocus genotype cell, each comparing two groups: one group consisting of a single multilocus genotype class, and another group consisting of the pooled remaining multilocus genotype cells at the considered loci. We agree that the statistical properties of two-group comparison tests have been studied at length elsewhere, in particular in the presence of highly unbalanced groups. These imbalances naturally arise in MB-MDR association testing during the risk cell labeling, since for two-way interactions one cell is contrasted against the 8 remaining cells. Hence, we agree that the generally known results for two-group comparison tests will coincide with our findings for each of these 9 tests separately. However, the 9 test results are used to construct a lower-dimensional feature with 3 possible factor levels (H, L or O), and it was unknown how the aggregation of different levels of model violation with respect to the 9 aforementioned tests would affect the final MB-MDR association test (on aggregated H (L) cells).

The key question that initiated this research (as was explained in the discussion section) was whether the large number of epistasis findings we usually found for quantitative traits (some of these simply had to be false positives) was due to an aggregation of model violations during internal association testing or due to another aspect of the MB-MDR method. Since not correcting for main effects was shown in earlier work to lead to increased false positives, we addressed the key question of interest by setting up simulations for pure epistasis models. No LD between markers was assumed. Interestingly, type I error was kept under control with the classical implementation of MB-MDR, even for non-normally distributed data.
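Two ingredients discussed above, the rank transformation to normality and the one-cell-versus-rest tests behind the H/L/O labeling, can be sketched together. This is an illustrative sketch rather than the MB-MDR implementation: the function names, the 0.5 offset that keeps the normal quantiles finite, the minimum group size of 2, and the 0/1/2 genotype coding are assumptions; the 0.10 threshold is the default mentioned elsewhere in this response.

```python
import numpy as np
from scipy import stats


def rank_to_normal(trait):
    """Map a quantitative trait to normal scores via percentile ranks.

    Assumption: percentiles are computed as (rank - 0.5) / n so that the
    largest value does not map to an infinite quantile.
    """
    trait = np.asarray(trait, dtype=float)
    ranks = stats.rankdata(trait)  # ties receive their average rank
    return stats.norm.ppf((ranks - 0.5) / trait.size)


def label_cells(snp_a, snp_b, trait, alpha=0.10):
    """Label each two-locus genotype cell H, L or O.

    Each cell is compared with the pooled remaining cells using a
    pooled-variance Student's t test (equivalent to a two-group ANOVA).
    """
    snp_a, snp_b = np.asarray(snp_a), np.asarray(snp_b)
    trait = np.asarray(trait, dtype=float)
    labels = {}
    for a in np.unique(snp_a):
        for b in np.unique(snp_b):
            cell = (int(a), int(b))
            in_cell = (snp_a == a) & (snp_b == b)
            n_in = int(in_cell.sum())
            # Assumption: cells too small to test get "no evidence" (O).
            if n_in < 2 or trait.size - n_in < 2:
                labels[cell] = "O"
                continue
            t, p = stats.ttest_ind(trait[in_cell], trait[~in_cell], equal_var=True)
            if p < alpha:
                labels[cell] = "H" if t > 0 else "L"
            else:
                labels[cell] = "O"
    return labels
```

A typical call would be `label_cells(snp_a, snp_b, rank_to_normal(trait))`. With three genotypes per SNP this runs the 9 highly unbalanced tests described above, and the resulting labels define the 3-level feature used in the subsequent association test.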
Our work, in particular inspecting the effects of MAFs on the 1-9 test statistic distributions as well as the final MB-MDR test statistic distribution, showed that there may indeed be an accumulation of model violation problems operating (one that is MAF dependent and may lead to different marginal final MB-MDR test distributions, possibly deviating strongly from the theoretical final MB-MDR test distribution), and that our choice of 0.10 as default value for each of the 9 tests within MB-MDR as well as our choice of multiple testing correction (in particular the step-down maxT procedure as implemented in MB-MDR) was able to adequately control FWER, regardless. The fact that more hits than expected are observed on real-life data as compared to synthetic data is attributed to the fact that for real-life data the assumption of subset pivotality is often violated. When this assumption is violated, the step-down maxT algorithm no longer guarantees strong control of the type I error. Our work also highlighted the fact that p-values for SNP pairs should not be derived from the theoretical distribution but from resampling-based null distributions. The maxT procedure derives such null distributions and at the same time corrects for multiple testing over all possible pairs considered in the epistasis screening.

By saying that MB-MDR is non-parametric, we mean that MB-MDR does not make any parametric assumptions about the mode of epistatic inheritance. "Parametric" as in "parametric association tests" always refers to distributional assumptions that may or may not be violated. Throughout our manuscript, we have adhered to these original definitions. Our two-group comparison tests within MB-MDR involve Student's t tests, which are thus parametric tests. However, note that for testing whether two loci are globally associated with the trait of interest (i.e., no correction for main effects is performed), the MB-MDR method does not rely on any parametric regression modeling and is also in this sense non-parametric. For the latter, we rather use the term "not model-based". As soon as a correction is made for confounders (whether capturing an environmental measurement or evidence about population stratification), a shift towards parametric (regression-based) modeling is required. This explains the "MB" in MB-MDR. "Model-Based" allows you to integrate whatever is needed from the parametric regression frameworks in order to make the relevant adjustments. We agree that the whole idea behind (MB-)MDR was to avoid the model misspecifications that are so typical for high-dimensional (regression-based) modeling.
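The idea of deriving p-values from a resampling-based null distribution while correcting for multiple testing can be sketched as follows. Note that this is the simpler single-step maxT variant, not the step-down procedure implemented in MB-MDR, and that the function names, the generic `statistic` callback, and the toy squared-t statistic are assumptions made for illustration.

```python
import numpy as np


def squared_t(in_group, trait):
    """Squared pooled-variance Student's t for one two-group comparison."""
    in_group = np.asarray(in_group, dtype=bool)
    trait = np.asarray(trait, dtype=float)
    x, y = trait[in_group], trait[~in_group]
    pooled = ((x.size - 1) * x.var(ddof=1) + (y.size - 1) * y.var(ddof=1)) / (
        trait.size - 2
    )
    t = (x.mean() - y.mean()) / np.sqrt(pooled * (1 / x.size + 1 / y.size))
    return t**2


def max_t_adjusted_pvalues(statistic, features, trait, n_perm=999, seed=0):
    """Single-step maxT adjusted p-values, one per feature.

    `statistic(feature, trait)` must return a number where larger values
    mean stronger evidence. The trait is permuted as a whole, so the
    dependence between features is preserved in every permutation.
    """
    rng = np.random.default_rng(seed)
    trait = np.asarray(trait, dtype=float)
    observed = np.array([statistic(f, trait) for f in features])

    max_null = np.empty(n_perm)
    for i in range(n_perm):
        permuted = rng.permutation(trait)
        # Only the maximum over all features enters the null distribution.
        max_null[i] = max(statistic(f, permuted) for f in features)

    # Count permutations whose maximum reaches each observed statistic;
    # the +1 terms count the observed data as one of the permutations.
    exceed = (max_null[None, :] >= observed[:, None]).sum(axis=1)
    return (exceed + 1) / (n_perm + 1)
```

Because each observed statistic is compared with the permutation distribution of the maximum, no theoretical reference distribution is needed, which is the point made above about not relying on the theoretical distribution for SNP-pair p-values.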
However, we have shown that the most severe correction possible (maximum number of degrees of freedom) will ensure adequate type I error control, although the method may become somewhat conservative while doing so.
We believe that the paragraph added to the discussion section (cf. Moore's second point) also covers these 2 points raised by Motsinger-Reif. We have corrected the typos the reviewer observed.
More informationReadings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
More informationEstimands and Sensitivity Analysis in Clinical Trials E9(R1)
INTERNATIONAL CONCIL FOR HARMONISATION OF TECHNICAL REQUIREMENTS FOR PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED GUIDELINE Estimands and Sensitivity Analysis in Clinical Trials E9(R1) Current Step 2 version
More informationAuthor s response to reviews
Author s response to reviews Title: The validity of a professional competence tool for physiotherapy students in simulationbased clinical education: a Rasch analysis Authors: Belinda Judd (belinda.judd@sydney.edu.au)
More informationSingle SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)
High-Throughput Sequencing Course Gene-Set Analysis Biostatistics and Bioinformatics Summer 28 Section Introduction What is Gene Set Analysis? Many names for gene set analysis: Pathway analysis Gene set
More informationSimultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status
Simultaneous Equation and Instrumental Variable Models for Seiness and Power/Status We would like ideally to determine whether power is indeed sey, or whether seiness is powerful. We here describe the
More informationTitle:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery
Author's response to reviews Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery Authors: Eugen Lungu (eugen.lungu@umontreal.ca) François Desmeules
More informationIssues That Should Not Be Overlooked in the Dominance Versus Ideal Point Controversy
Industrial and Organizational Psychology, 3 (2010), 489 493. Copyright 2010 Society for Industrial and Organizational Psychology. 1754-9426/10 Issues That Should Not Be Overlooked in the Dominance Versus
More informationApplication of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties
Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Bob Obenchain, Risk Benefit Statistics, August 2015 Our motivation for using a Cut-Point
More informationRare Variant Burden Tests. Biostatistics 666
Rare Variant Burden Tests Biostatistics 666 Last Lecture Analysis of Short Read Sequence Data Low pass sequencing approaches Modeling haplotype sharing between individuals allows accurate variant calls
More informationMultilevel modelling of PMETB data on trainee satisfaction and supervision
Multilevel modelling of PMETB data on trainee satisfaction and supervision Chris McManus March 2007. This final report on the PMETB trainee survey of 2006 is based on a finalised version of the SPSS data
More informationBayesian and Frequentist Approaches
Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law
More information# BMJ entitled " Complete the antibiotic course to avoid resistance ; non-evidence-based dogma which has run its course?
Dear Dr. Llewelyn # BMJ.2017.037542 entitled " Complete the antibiotic course to avoid resistance ; non-evidence-based dogma which has run its course?" Thank you for sending us this paper and giving us
More informationPooling Subjective Confidence Intervals
Spring, 1999 1 Administrative Things Pooling Subjective Confidence Intervals Assignment 7 due Friday You should consider only two indices, the S&P and the Nikkei. Sorry for causing the confusion. Reading
More informationFurther Properties of the Priority Rule
Further Properties of the Priority Rule Michael Strevens Draft of July 2003 Abstract In Strevens (2003), I showed that science s priority system for distributing credit promotes an allocation of labor
More informationChapter 11. Experimental Design: One-Way Independent Samples Design
11-1 Chapter 11. Experimental Design: One-Way Independent Samples Design Advantages and Limitations Comparing Two Groups Comparing t Test to ANOVA Independent Samples t Test Independent Samples ANOVA Comparing
More informationRAG Rating Indicator Values
Technical Guide RAG Rating Indicator Values Introduction This document sets out Public Health England s standard approach to the use of RAG ratings for indicator values in relation to comparator or benchmark
More informationThe Impact of Relative Standards on the Propensity to Disclose. Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX
The Impact of Relative Standards on the Propensity to Disclose Alessandro Acquisti, Leslie K. John, George Loewenstein WEB APPENDIX 2 Web Appendix A: Panel data estimation approach As noted in the main
More informationChapter 1. Introduction
Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a
More informationStatistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN
Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN Vs. 2 Background 3 There are different types of research methods to study behaviour: Descriptive: observations,
More informationFrom Bivariate Through Multivariate Techniques
A p p l i e d S T A T I S T I C S From Bivariate Through Multivariate Techniques R e b e c c a M. W a r n e r University of New Hampshire DAI HOC THAI NGUYEN TRUNG TAM HOC LIEU *)SAGE Publications '55'
More informationSPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.
SPRING GROVE AREA SCHOOL DISTRICT PLANNED COURSE OVERVIEW Course Title: Basic Introductory Statistics Grade Level(s): 11-12 Units of Credit: 1 Classification: Elective Length of Course: 30 cycles Periods
More informationThe Impact of Continuity Violation on ANOVA and Alternative Methods
Journal of Modern Applied Statistical Methods Volume 12 Issue 2 Article 6 11-1-2013 The Impact of Continuity Violation on ANOVA and Alternative Methods Björn Lantz Chalmers University of Technology, Gothenburg,
More informationContent. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes
Content Quantifying association between continuous variables. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 5, 6, 7, 8, 9 10 & 11)
More informationBIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA
BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group
More informationTitle:Mixed-strain Housing for Female C57BL/6, DBA/2, and BALB/c Mice: Validating a Split-plot Design that promotes Refinement and Reduction
Author's response to reviews Title:Mixed-strain Housing for Female C57BL/6, DBA/2, and BALB/c Mice: Validating a Split-plot Design that promotes Refinement and Reduction Authors: Michael Walker Mr (mwalk04@uoguelph.ca)
More informationTiago Villanueva MD Associate Editor, The BMJ. 9 January Dear Dr. Villanueva,
Tiago Villanueva MD Associate Editor, The BMJ 9 January 2018 Dear Dr. Villanueva, Thank you for your thoughtful re-review of our Manuscript (BMJ.2017.041528) entitled "Immune-related Toxicities in PD-1
More informationBasic Statistics and Data Analysis in Work psychology: Statistical Examples
Basic Statistics and Data Analysis in Work psychology: Statistical Examples WORK PSYCHOLOGY INTRODUCTION In this chapter we examine a topic which is given too little coverage in most texts of this kind,
More informationThe Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0
The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 Introduction Loss of erozygosity (LOH) represents the loss of allelic differences. The SNP markers on the SNP Array 6.0 can be used
More informationMidterm Exam MMI 409 Spring 2009 Gordon Bleil
Midterm Exam MMI 409 Spring 2009 Gordon Bleil Table of contents: (Hyperlinked to problem sections) Problem 1 Hypothesis Tests Results Inferences Problem 2 Hypothesis Tests Results Inferences Problem 3
More informationChapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.
Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able
More information