CSC2130: Empirical Research Methods for Software Engineering

Similar documents
Conduct Rigorous and Scientific Research. Chang Xu Department of Computer Science and Technology Nanjing University Sep 21, 2018

Seminar 3: Basics of Empirical Research

Empirical Approaches, Questions & Methods

Independent Variables Variables (factors) that are manipulated to measure their effect Typically select specific levels of each variable to test

positivist phenomenological

Psych 1Chapter 2 Overview

Qualitative Research Design

Conducting Research in the Social Sciences. Rick Balkin, Ph.D., LPC-S, NCC

The Scientific Method

Hypothesis-Driven Research

MODULE 3 APPRAISING EVIDENCE. Evidence-Informed Policy Making Training

Goals of this tutorial

PRINCIPLES OF EMPIRICAL SCIENCE: A REMINDER

Formulation of Research Design


DEFINING THE CASE STUDY Yin, Ch. 1

What Case Study means? Case Studies. Case Study in SE. Key Characteristics. Flexibility of Case Studies. Sources of Evidence

Research Approach & Design. Awatif Alam MBBS, Msc (Toronto),ABCM Professor Community Medicine Vice Provost Girls Section

CHAPTER 7 QUALITATIVE RESEARCH TOOLS

Critical Thinking and Reading Lecture 15

THEORY DEVELOPMENT PROCESS

Theory, Models, Variables

382C Empirical Studies in Software Engineering. Lecture 1. Introduction. Dewayne E Perry ENS present, Dewayne E Perry

The Role of Qualitative Research in Agri-Food Systems

Survival Skills for Researchers. Study Design

School of Nursing, University of British Columbia Vancouver, British Columbia, Canada

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

Research Methods & Design Outline. Types of research design How to choose a research design Issues in research design

Science & Software Engineering

SOCIOLOGICAL RESEARCH PART I. If you've got the truth you can demonstrate it. Talking doesn't prove it. Robert A. Heinlein

Understanding Science Conceptual Framework

Evidence Informed Practice Online Learning Module Glossary

COURSE: NURSING RESEARCH CHAPTER I: INTRODUCTION

Research Methodology in Social Sciences. by Dr. Rina Astini

Insight Assessment Measuring Thinking Worldwide

Quantitative research Methods. Tiny Jaarsma

Competency Rubric Bank for the Sciences (CRBS)

Theory and Methods Question Bank

Science, Society, and Social Research (1) Benjamin Graham

Introduction Stanovich, Chapter 1

Key Ideas. Explain how science is different from other forms of human endeavor. Identify the steps that make up scientific methods.

THE SCIENTIFIC WORLD VIEW

Work, Employment, and Industrial Relations Theory Spring 2008

Lecturer: Dr. Emmanuel Adjei Department of Information Studies Contact Information:

Psychology of Dysfunctional Behaviour RESEARCH METHODS

Social Research (Complete) Agha Zohaib Khan

Goal: To become familiar with the methods that researchers use to investigate aspects of causation and methods of treatment

Introduction to the Scientific Method. Knowledge and Methods. Methods for gathering knowledge. method of obstinacy

Course summary, final remarks

Principles of Sociology

RESEARCH METHODS. Winfred, research methods, ; rv ; rv

In this chapter we discuss validity issues for quantitative research and for qualitative research.

4.0 INTRODUCTION 4.1 OBJECTIVES

HOW IS HAIR GEL QUANTIFIED?

Worldviews, Research Methods, and their Relationship to Validity in Empirical Software Engineering Research

Communication Research Practice Questions

The Role and Importance of Research

RESEARCH METHODS. Winfred, research methods,

Making the Case for Writing Case Reports Spring MNAPTA State Conference Brooklyn Center, MN April 15, 2011

2 Critical thinking guidelines

PSYCHOLOGY (413) Chairperson: Sharon Claffey, Ph.D.

Research Methodologies

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Sociological Research Methods and Techniques Alan S.Berger 1

Science in Natural Resource Management ESRM 304

POLI 343 Introduction to Political Research

Glossary of Research Terms Compiled by Dr Emma Rowden and David Litting (UTS Library)

Educational Research

NATURE OF SCIENCE. Professor Andrea Garrison Biology 3A

PSYC1024 Clinical Perspectives on Anxiety, Mood and Stress

Types of Research (Quantitative and Qualitative)

METHODOLOGY FOR DISSERTATION

Social Research Strategies

Chapter 1.1. The Process of Science. Essential Questions

Goal: To become familiar with the methods that researchers use to investigate aspects of causation and methods of treatment

Who? What? What do you want to know? What scope of the product will you evaluate?

Applying the Experimental Paradigm to Software Engineering

The Current State of Our Education

The Scientific Method. Philosophy of Science. Philosophy of Science. Epistemology: the philosophy of knowledge

PLANNING THE RESEARCH PROJECT

Chapter 1 Introduction to Educational Research

Nature of Science and Scientific Method Guided Notes

MSc Psychological Research Methods/ MPsych Advanced Psychology Module Catalogue / 2018

Qualitative and Quantitative Approaches Workshop. Comm 151i San Jose State U Dr. T.M. Coopman Okay for non-commercial use with attribution

HUMAN-COMPUTER INTERACTION EXPERIMENTAL DESIGN

Lab 2: The Scientific Method. Summary

Seminar 11: Critical Systems

Qualitative Data Analysis. Richard Boateng, PhD. Arguments with Qualitative Data. Office: UGBS RT18 (rooftop)

SEMINAR ON SERVICE MARKETING

CONCEPTUAL FRAMEWORK, EPISTEMOLOGY, PARADIGM, &THEORETICAL FRAMEWORK

Disposition. Quantitative Research Methods. Science what it is. Basic assumptions of science. Inductive and deductive logic

Research Design & Protocol Development

Quantitative research Methods. Tiny Jaarsma

9 research designs likely for PSYC 2100

2 Research Methodology 1.2 OBJECTIVES OF RESEARCH The purpose of research is to discover answers to questions through the application of scientific pr

Research Landscape. Qualitative = Constructivist approach. Quantitative = Positivist/post-positivist approach Mixed methods = Pragmatist approach

Department of Psychological Sciences Learning Goals and Outcomes

Lecture 4: Research Approaches

Transcription:

CSC2130: Empirical Research Methods for Software Engineering Steve Easterbrook sme@cs.toronto.edu www.cs.toronto.edu/~sme/csc2130/ 2004-5 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 1 Course Goals Prepare students for advanced research (in SE): Learn how to plan, conduct and report on empirical investigations. Understand the key steps of a research project: formulating research questions, theory building, data analysis (using both qualitative and quantitative methods), building evidence, assessing validity, publishing. Motivate the need for an empirical basis for SE Cover all principal empirical methods applicable to SE: controlled experiment, case studies, surveys, archival analysis, action research, ethnographies, Relate these methods to relevant meta-theories in the philosophy and sociology of science. 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 2 1

Intended Audience This is an advanced software engineering course: assumes a strong grasp of the key ideas of software engineering and the common methods used in software practice. Focus: how do software developers work how do new tools and techniques affect their ability to construct high quality software efficiently qualitative and quantitative techniques from behavioural sciences The course is aimed at students who: plan to conduct SE research that demands some empirical validation wish to establish an empirical basis for an existing SE research programme wish to apply these techniques in related fields (e.g. HCI, Cog Sci) Note: we will *not* cover the kinds of experimental techniques used in CS systems areas. 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 3 Format Seminars: 1 three-hour seminar per week Mix of discussion, lecture, student presentations Readings Major component is discussion of weekly readings Please read the set papers before the seminar Assessment: 10% Class Participation 30% Oral Presentation - *critique a published empirical study 60% Written paper - design an empirical study for a SE research question *As part of a mock conference program committee meeting 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 4 2

Course Outline 1. Introduction & Orientation 2. What is Science Philosophy of Science Sociology of Science Meta-theories 3. What is software engineering Engineering & Design Disciplinary Analogies for SE Evidence-based software engineering 4. Basics of Doing Research Finding good research questions Theory building Research Design Ethics Evidence and Measurement Peer Review Process 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 5 Course Outline (cont) 5. Experiments Controlled Experiments Quasi-experiments Sampling Replication 6. Case Studies Single and Multi-case Longitudinal Case Studies Approaches to Data Collection 7. Histories and Simulations Artifact Analysis Archival Analysis and Post-mortems Simulation Techniques 8. Survey and Observation Surveys Focus Groups Field Studies / Ethnographies 9. Interventions Action Research Pilot Studies Benchmarking 10. Qualitative Analysis Grounded Theory Phenomenography Mixed Methods Research 11. Quantitative Analysis Stats, power analysis, meta-analysis 12. Publishing and Reviewing (mock PC meeting) 13. Replication and Beyond Internal and External Replication Biases and Influences Threats to Validity When to use empirical methods 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 6 3

Is this your research plan Step 1: Build a new tool Step 2: Step 3: Profit 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 7 Scientific Method No single official scientific method Somehow, scientists are supposed to do this: Observation Theory World Validation 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 8 4

Observe! 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 9 Scientific Inquiry Prior Knowledge (Initial Hypothesis) Observe (what is wrong with the current theory) Experiment (manipulate the variables) Theorize (refine/create a better theory) Design (Design empirical tests of the theory) 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 10 5

Some Characteristics of Science Science seeks to improve our understanding of the world. Explanations are based on observations Scientific truths must stand up to empirical scrutiny Sometimes scientific truth must be thrown out in the face of new findings Theory and observation affect one another: Our perceptions of the world affect how we understand it Our understanding of the world affects how we perceive it Creativity is important Theories, hypotheses, experimental designs Search for elegance, simplicity 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 11 All Methods are flawed E.g. Laboratory Experiments Cannot study large scale software development in the lab! Too many variables to control them all! E.g. Case Studies How do we know what s true in one project generalizes to others Researcher chose what questions to ask, hence biased the study E.g. Surveys Self-selection of respondents biases the study Respondents tell you what they think they ought to do, not what they actually do etc... 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 12 6

Strategies to overcome weaknesses Theory-building Testing a hypothesis is pointless (single flawed study!) unless it builds evidence for a clearly stated theory Empirical Induction Series of studies over time Each designed to probe more aspects of the theory together build evidence for a clearly stated theory Mixed Methods Research Use multiple methods to investigate the same research question Each method compensates for the flaws of the others together build evidence for a clearly stated theory 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 13 Source: http://xkcd.com/882/" 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 14 7

What is a research contribution A better understanding of how software engineers work Identification of problems with the current state-ofthe-art A characterization of the properties of new tools/ techniques Evidence that approach A is better than approach B How will you validate your claims" 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 15 Meet Stuart Dent Name: Stuart Dent (a.k.a. Stu ) Advisor: Prof. Helen Back Topic: Merging Stakeholder views in Model Driven Development Status: 2 years into his PhD Has built a tool Needs an evaluation plan 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 16 8

Stu s Evaluation Plan Formal Experiment Independent Variable: Stu-Merge vs. Rational Architect Dependent Variables: Correctness, Speed, Subjective Assessment Task: Merging Class Diagrams from two different stakeholders models Subjects: Grad Students in SE H 1 : Stu-Merge produces correct merges more often than RA H 2 : Subjects produce merges faster with Stu-Merge than with RA H 3 : Subjects prefer using Stu-Merge to RA Results H 1 accepted (strong evidence) H 2 & H 3 rejected Subjects found the tool unintuitive 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 17 Threats to Validity Construct Validity What do we mean by a merge What is correctness 5-point scale for subjective assessment - insufficient discriminatory power (both tools scored very low) Internal Validity Confounding variables: Time taken to learn the tool; familiarity Subjects were all familiar with RA, not with Stu-merge External Validity Task representativeness class models were of a toy problem Subject representativeness Grad students as sample of what population Theoretical Reliability Researcher bias subjects knew Stu-merge was Stu s own tool 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 18 9

What went wrong What was the research question Is tool A better than tool B What would count as an answer What use would the answer be How is it a contribution to knowledge How does this evaluation relate to the existing literature 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 19 Experiments as Clinical Trials Why would we expect it to be better Why do we need to know What will we do with the answer Is drug A better than drug B Better at doing what Better in what way Better in what situations 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 20 10

Why would we expect it to be better You gotta have a theory! 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 21 Some Definitions A model is an abstract representation of a phenomenon or set of related phenomena Some details included, others excluded A theory is a set of statements that explain a set of phenomena Serves to explain and predict Precisely defined terminology Concepts, relationships, causal inferences (operational definitions for theoretical terms) A hypothesis is a testable statement derived from a theory A hypothesis is not a theory! In SE, we have mostly folk theories 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 22 11

A simpler definition A Theory is the best explanation of all the available evidence" 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 23 The Role of Theory Building Theories lie at the heart of what it means to do science. Production of generalizable knowledge Theory provides orientation for data collection Cannot observe the world without a theoretical perspective Theories allow us to compare similar work Theories include precise definition for the key terms Theories provide a rationale for which phenomena to measure Theories support analytical generalization Provide a deeper understanding of our empirical results and hence how they apply more generally Much more powerful than statistical generalization 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 24 12

Stu s Theory Background Assumptions Large team projects, models contributed by many actors Models are fragmentary, capture partial views Partial views are inconsistent and incomplete most of the time Basic Theory (Brief summary:) Model merging is an exploratory process, in which the aim is to discover intended relationships between views. Goodness of a merge is a subjective judgment. If an attempted merge doesn t seem good, many need to change either the models, or the way in which they were mapped together. [Still needs some work] Derived Hypotheses Useful merge tools need to represent relationships explicitly Useful merge tools need to be complete (work for any models, even if inconsistent) 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 25 What type of question are you asking Existence: Does X exist Description & Classification What is X like What are its properties How can it be categorized How can we measure it What are its components Descriptive-Comparative How does X differ from Y Frequency and Distribution How often does X occur What is an average amount of X Descriptive-Process How does X normally work By what process does X happen What are the steps as X evolves Relationship Are X and Y related Do occurrences of X correlate with occurrences of Y Causality Does X cause Y Does X prevent Y What causes X What effect does X have on Y Causality-Comparative Does X cause more Y than does Z Is X better at preventing Y than is Z Does X cause more Y than does Z under one condition but not others Design What is an effective way to achieve X How can we improve X 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 26 13

What type of question are you asking Existence: Does X exist Exploratory Description & Classification What is X like What are its properties How can it be categorized How can we measure it What are its components Descriptive-Comparative How does X differ from Y Frequency and Distribution Baserate How often does X occur What is an average amount of X Descriptive-Process How does X normally work By what process does X happen What are the steps as X evolves Relationship Are X and Y related Do occurrences of X correlate with occurrences of Y Causality Does X cause Y Does X prevent Y What causes X What effect does X have on Y Causality-Comparative Does X cause more Y than does Z Is X better at preventing Y than is Z Does X cause more Y than does Z under one condition but not others Design Correlation Causal Relationship Design What is an effective way to achieve X How can we improve X 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 27 Stu s Research Question(s) Pick just one for now " Existence Does model merging ever happen in practice Description/Classification What are the different types of model merging that occur in practice on large scale systems Descriptive-Comparative How does model merging with explicit representation of relationships differ from model merging without such representation Causality Does an explicit representation of the relationship between models cause developers to explore different ways of merging models Causality-Comparative Does the algebraic representation of relationships in Stu s tool lead developers to explore more than do pointcuts in AOM 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 28 14

Putting the Question in Context How does this relate to the established literature Existing Theories Philosophical Context Positivist Constructivist Critical theory Pragmatist The Research Question What methods are appropriate for answering this question What will you accept as valid truth Methodological Choices New Paradigms What new perspectives are you bringing to this field Empirical Method Data Collection Techniques Data Analysis Techniques 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 29 Many available methods Common in the lab Methods Common in the wild Methods Controlled Experiments Rational Reconstructions Exemplars Benchmarks Simulations Quasi-Experiments Case Studies Survey Research Ethnographies Action Research Artifact/Archive Analysis ( mining!) 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 30 15

Stu s Method(s) Selection Existence Does model merging ever happen in practice Description/Classification What are the different types of model merging that occur in practice on large scale systems Descriptive-Comparative How does model merging with explicit representation of relationships differ from model merging without such representation Causality Does an explicit representation of the relationship between models cause developers to explore different ways of merging models Causality-Comparative Does the algebraic representation of relationships in Stu s tool lead developers to explore more than do pointcuts in AOM Case study Survey Research Ethnography Action Research Controlled Experiment 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 31 Warning No method is perfect Don t get hung up on methodological purity Pick something and get on with it Some knowledge is better than none 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 32 16

Okay, but 2004-5 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 33 Why Build a Tool Build a Tool to Test a Theory Tool is part of the experimental materials needed to conduct your study Build a Tool to Develop a Theory Theory emerges as you explore the tool Build a Tool to Explain your Theory Theory as a concrete instantiation of (some aspect of) the theory 2012 Steve Easterbrook. This presentation is available free for non-commercial use with attribution under a creative commons license. 34 17