Paper presentation: Preliminary Guidelines for Empirical Research in Software Engineering (Kitchenham et al. 2002)

Similar documents
2 Critical thinking guidelines

Psych 1Chapter 2 Overview

Inferences: What inferences about the hypotheses and questions can be made based on the results?

Chapter-2 RESEARCH DESIGN

The degree to which a measure is free from error. (See page 65) Accuracy

Systematic Mapping Studies

Experimental Psychology

Investigative Biology (Advanced Higher)

UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4

Independent Variables Variables (factors) that are manipulated to measure their effect Typically select specific levels of each variable to test

STATISTICS & PROBABILITY

CRITICAL EVALUATION OF BIOMEDICAL LITERATURE

The Thesis Writing Process and Literature Review

Blinded Central Review and Local Review for Progression Free Survival: The Cost of Central Audits as a Cost Saving Alternative

CONSORT 2010 checklist of information to include when reporting a randomised trial*

Generalization and Theory-Building in Software Engineering Research

Vocabulary. Bias. Blinding. Block. Cluster sample

9 research designs likely for PSYC 2100

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

Sociological Research Methods and Techniques Alan S.Berger 1

Randomized Controlled Trials Shortcomings & Alternatives icbt Leiden Twitter: CAAL 1

MRS Advanced Certificate in Market & Social Research Practice. Preparing for the Exam: Section 2 Q5- Answer Guide & Sample Answers

Clinical Research Scientific Writing. K. A. Koram NMIMR

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

Research Designs. Inferential Statistics. Two Samples from Two Distinct Populations. Sampling Error (Figure 11.2) Sampling Error

baseline comparisons in RCTs

Informal Aggregation Technique for Software Engineering Experiments

In this chapter we discuss validity issues for quantitative research and for qualitative research.

Measurement. 500 Research Methods Mike Kroelinger

Understandable Statistics

Chapter 1. Research : A way of thinking

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2

Chapter 1. Research : A way of thinking

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Applying the Experimental Paradigm to Software Engineering

Definition of Scientific Research RESEARCH METHODOLOGY CHAPTER 2 SCIENTIFIC INVESTIGATION. The Hallmarks of Scientific Research

It is crucial to follow specific steps when conducting a research.

9699 SOCIOLOGY. Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for Teachers.

What Is Science? Lesson Overview. Lesson Overview. 1.1 What Is Science?

Statistical Considerations for Research Design. Analytic Goal. Analytic Process

Chapter 5: Field experimental designs in agriculture

Chapter 3. Research Methodology. This chapter mentions the outline of research methodology and gives

Garbage in - garbage out? Impact of poor reporting on the development of systematic reviews

Examining Relationships Least-squares regression. Sections 2.3

Meta Analysis. David R Urbach MD MSc Outcomes Research Course December 4, 2014

Dolly or Shaun? A Survey to Verify Code Clones Detected using Similar Sequences of Method Calls

Measuring and Assessing Study Quality

Hypothesis-Driven Research

HOW TO WRITE A STUDY PROTOCOL

Basic Biostatistics. Dr. Kiran Chaudhary Dr. Mina Chandra

Controlled Trials. Spyros Kitsiou, PhD

Writing Reaction Papers Using the QuALMRI Framework

THE RESEARCH ENTERPRISE IN PSYCHOLOGY

CHAPTER NINE DATA ANALYSIS / EVALUATING QUALITY (VALIDITY) OF BETWEEN GROUP EXPERIMENTS

Lesson 1 Understanding Science

Introduction. 1.1 Facets of Measurement

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

TRANSCRIPT: This is Dr. Chumney with a very brief overview of the most common purposes served by program evaluation research.

August 26, Dept. Of Statistics, University of Illinois at Urbana-Champaign. Introductory lecture: A brief survey of what to

Chapter 2 Methodology: How Social Psychologists Do Research

Trial Designs. Professor Peter Cameron

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

USE AND MISUSE OF MIXED MODEL ANALYSIS VARIANCE IN ECOLOGICAL STUDIES1

Scientific Method in Biology

Lecture 2. Key Concepts in Clinical Research

Survey Errors and Survey Costs

Parts of a STEM Fair Project

Variables Research involves trying to determine the relationship between two or more variables.

Math 243 Sections , 6.1 Confidence Intervals for ˆp

How to interpret results of metaanalysis

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Organizing Scientific Thinking Using the QuALMRI Framework

A Brief Guide to Writing

Chapter 13 Summary Experiments and Observational Studies

Research Methodology. Characteristics of Observations. Variables 10/18/2016. Week Most important know what is being observed.

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Introduction & Basics

Formulating Research Questions and Designing Studies. Research Series Session I January 4, 2017

Gathering. Useful Data. Chapter 3. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

Running head: RESEARCH DESIGN TO IDENTIFY BIOTHREATS USING 1

STATS8: Introduction to Biostatistics. Overview. Babak Shahbaba Department of Statistics, UCI

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 1 Data Collection

What is Psychology? chapter 1

Data collection, summarizing data (organization and analysis of data) The drawing of inferences about a population from a sample taken from

CSC2130: Empirical Research Methods for Software Engineering

GLOSSARY OF GENERAL TERMS

SOCQ121/BIOQ121. Session 8. Methods and Methodology. Department of Social Science. endeavour.edu.au

Math 1680 Class Notes. Chapters: 1, 2, 3, 4, 5, 6

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

Cochrane Pregnancy and Childbirth Group Methodological Guidelines

STATISTICS AND RESEARCH DESIGN

Chapter 2. The Research Enterprise in Psychology 8 th Edition

AP STATISTICS 2007 SCORING GUIDELINES

PHYSICAL FUNCTION A brief guide to the PROMIS Physical Function instruments:

Statistical Analysis Plans

Research Strategies. What We Will Cover in This Section. Research Purposes. Overview. Research techniques. Literature Reviews. Basic research.

Chapter 8 Estimating with Confidence

Unit 1 Exploring and Understanding Data

Transcription:

Paper presentation: Preliminary Guidelines for Empirical Research in Software Engineering (Kitchenham et al. 2002) Who? Eduardo Moreira Fernandes From? Federal University of Minas Gerais Department of Computer Science Belo Horizonte, State of Minas Gerais, Brazil When? April 06, 2015

Prologue Paper s name Authors Publicated in Preliminary Guidelines for Empirical Research in Software Engineering Kitchenham, B., Pfleeger, S., Pickard, L., Jones, P., Hoaglin, D., El Emam, K., and Rosenberg, J. IEEE Transactions on Software Engineering (TSE), Vol. 28, No. 8, Pages 723-734, August 2002

Overview Paper goals Work s basis To stimulate discussion about empirical research practices in software engineering, To support the design, conduction and evaluation of empirical studies, To assist in improving of empirical studies and stimulate the critical evaluation of works, and To support the combination of different and related empirical researches. Review of guidelines for medical researches, and Authors experience in reviewing empirical researches in software engineering.

Paper s motivation The problem An example Accordingly to the authors view: The standard of empirical software engineering research [case studies, surveys and others] is poor. This is not a particular problem in software research. High rate of error in medical and clinical empirical researches (as in 1987, 1988 and 1993): Methodologic errors, Statistical errors, and Lack of details about the applied statistical techniques.

Paper s motivation In TSE papers Approach to solve the problem Observed deficiencies (just like in medical researches): Poor experimental design, Inappropriate use of statistical techniques, and Discordant conclusions from the obtained results. Focus in medical research guidelines: medical statisticians are aware of the poverty of statistical analysis in journals.

Paper s content Problems origin The solution The origin of statistical problems in software research is justified by: Difficulty of applying standards of statistical procedures in software experiments, and Lack of statistical skills by empirical research people. Proposal of preliminary guidelines covering: experimental context, experimental design, conduct of the experiment and data collection, analysis, presentation of results, and interpretation of results.

Experimental context Composition The authors discuss about: 1 Background information about the context of the industry or of the developed software engineering technique, 2 Discussion of the research hypotheses, and 3 Information about related research. Goals To ensure a sufficient definition of the research objectives and a sufficient description of the research.

Experimental context C1. Be sure to specify as much of the industrial context as possible. [...] clearly define the entities, attributes, and measures that are capturing the [...] information. C2. If a specific hypothesis is being tested, state it clearly prior to performing the study and discuss the theory from which it is derived [...]. Without the link theory-hypothesis, empirical results cannot contribute to a wider body of knowledge.

Experimental context C3. If the research is exploratory, state clearly and, prior to data analysis, what questions the investigation is intended to address and how it will address them. Many subsidiary analyses vs single purpose study C4. Describe research that is similar to, or has a bearing on, the current research and how current work relates to it. It is the integration of the body of knowledge.

Experimental design Composition It describes the products, resources and processes involved in the study, such as: 1 The studied population, the rationale and technique for population sampling, 2 The intervention process, and 3 The methods applied to reduce bias and determine sample size. Goals To ensure that the design is appropriate for the study objectives.

Experimental design D1. Identify the population from which the subjects and objects are drawn. It is necessary to be drawn inferences from the experiment results (e.g., generalization of the results to a defined population). D2. Define the process by which the subjects and objects were selected. Random sampling or another method that should be explained and justified. Definition of inclusion and exclusion criteria.

Experimental design D4. Restrict yourself to simple study designs or [...] to designs that are fully analyzed in the statistical literature. If you are not using a well-documented design and analysis method, you should consult a statistician [...]. D7. Use appropriate levels of blinding.

Conducting the experiment and data collection Goals With respect to: Data collection: to ensure a well-defined data collection process to support the experiment replication. Experiment conduction: to identify any deviations from the experimental plans.

Conducting the experiment and data collection DC1. DC3. Define all software measures fully, including the entity, attribute, unit and counting rules. This will provide: a sufficient understanding of different measurements and the determination of translation among measurement schemes. Describe any quality control method used to ensure completeness and accuracy of data collection.

Conducting the experiment and data collection DC4. DC5. For surveys, monitor and report the response rate and discuss the representativeness of the responses and the impact of nonresponse. For observational studies and experiments, record data about subjects who drop out from the studies.

Analysis Main goal To ensure the correct analysis of the experimental results. Data analysis in accordance with the study design. A1. Specify any procedures used to control for multiple testing. To avoid problems with data overuse (when the data set is small and new data is hard to be obtained). There are methods for dealing with multiple tests on the same data set.

Analysis A2. Consider using blind analysis. In order to avoid the tendency to fishing for results. A5. Apply appropriate quality control procedures to verify your results. Don t manipulate your data to favor your hypothesis.

Presentation of results Main goal A sufficient presentation of results provides: The understanding of all aspects of the study, The replication of the study, and The combination of the results in a future meta-analysis. P1. Describe or cite a reference for all statistical procedures used.

Presentation of results P3. Present quantitative results as well as significance levels. Quantitative results should show the magnitude of effects and the confidence limits. P4. Present the raw data whenever possible. Otherwise, confirm that they are available for confidential review [...]. P6. Make appropriate use of graphics.

Interpretation of results Main goal To make conclusions that follow directly from the obtained results. I1. Define the population to which inferential statistics and predictive models apply. I2. Differentiate between statistical significance and practical importance. Researches may show a statistical significance in some result, but there may be no practical importance.

Interpretation of results I3. Define the type of study. In order to establish the reliance that readers should put on the conclusions of the study, and To suggest the suitability of the developed study for future meta-analysis. I4. Specify any limitations of the study. The researchers are responsible for discussing the limitations of their study.

Paper review & Discussion The good The bad The bottom line Sufficient coverage on the topic of empirical research and balance between generalization and specificity. It is hard to find bad points in this paper because of its solidity. This paper is a good starting point for those looking to venture into empirical research in software engineering.