RNA-seq. Design of experiments

Size: px
Start display at page:

Download "RNA-seq. Design of experiments"

Transcription

1 RNA-seq Design of experiments

2 Experimental design

3 Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in which researchers can manipulate the conditions of the experiment and can control the factors that are irrelevant to the research objectives.

4 Statistical design of experiments Experimental design is the process of planning a study to meet specified objectives. Planning an experiment properly is very important in order to ensure that the right type of data and a sufficient sample size and power are available to answer the research questions of interest as clearly and efficiently as possible.

5 Designing an experiment Perform the following steps when designing an experiment: 1. Define the problem and the questions to be addressed 2. Define the population of interest 3. Determine the need for sampling 4. Define the experimental design

6 Define problem Before data collection begins, specific questions that the researcher plans to examine must be clearly identified. In addition, a researcher should identify the sources of variability in the experimental conditions. One of the main goals of a designed experiment is to partition the effects of the sources of variability into distinct components in order to examine specific questions of interest. The objective of designed experiments is to improve the precision of the results in order to examine the research hypotheses.

7 Define population A population is a collective whole of people, animals, plants, or other items that researchers collect data from. Before collecting any data, it is important that researchers clearly define the population, including a description of the members. The designed experiment should designate the population for which the problem will be examined. The entire population for which the researcher wants to draw conclusions will be the focus of the experiment.

8 Determine the need for sampling A sample is one of many possible sub-sets of units that are selected from the population of interest. In many data collection studies, the population of interest is assumed to be much larger in size than the sample so, potentially, there are a very large (usually considered infinite) number of possible samples. The results from a sample are then used to draw valid inferences about the population.

9 Determine the need for sampling A random sample is a sub-set of units that are selected randomly from a population. A random sample represents the general population or the conditions that are selected for the experiment because the population of interest is too large to study in its entirety. Using techniques such as random selection after stratification or blocking is often preferred.

10 Determine the need for sampling Determining the sample size requires some knowledge of the observed or expected variance among sample members in addition to how large a difference among treatments you want to be able to detect. Another way to describe this aspect of the design stage is to conduct a prospective power analysis, which is a brief statement about the capability of an analysis to detect a practical difference. A power analysis is essential so that the data collection plan will work to enhance the statistical tests primarily by reducing residual variation, which is one of the key components of a power analysis study.

11 Define experimental design Defining the experimental design consists of the following steps: 1. Identify the experimental unit. 2. Identify the types of variables. 3. Define the treatment structure. 4. Define the design structure.

12 Experimental units An Experimental (or sampling) units is the person or object that will be studied by the researcher. This is the smallest unit of analysis in the experiment from which data will be collected (e.g. patient, mouse, plant, or cell line).

13 Experimental units An entity receiving an independent application of a treatment is called an experimental unit. An experimental run is the process of applying a particular treatment combination to an experimental unit and recording its response. A replicate is an independent run carried out on a different experimental unit under the same conditions.

14 Example: Two pots Experimental unit: plant on the pot No replication

15 Types of variables A data collection plan considers how four important variables: background, constant, uncontrollable, and primary, fit into the study. The explanatory variables are referred to as factors. Inconclusive results are likely to result if any of these classifications are not adequately defined. It is important to consider all the relevant variables before the final data collection plan is approved in order to maximize confidence in the final results.

16 Background variables Background variables can be identified and measured yet cannot be controlled; they will influence the outcome of an experiment. Background variables will be treated as covariates in the model rather than primary variables.

17 Primary variables Primary variables are the variables of interest to the researcher. Primary variables are independent variables that are possible sources of variation in the response. These variables comprise the treatment and design structures and are referred to as factors. When background variables are used in an analysis, better estimates of the primary variables should result because the sources of variation that are supplied by the covariates have been removed.

18 Constant variables Constant variables can be controlled or measured but, for some reason, will be held constant over the duration of the study. This action increases the validity of the results by reducing extraneous sources of variation from entering the data. For this data collection plan, some of the variables that will be held constant include: the use of standard operating procedures the use of one operator for each measuring device all measurements taken at specific times and locations

19 Uncontrollable variables Uncontrollable variables are those variables that are known to exist, but conditions prevent them from being manipulated, or it is very difficult (due to cost or physical constraints) to measure them. The experimental error is due to the influential effects of uncontrollable variables, which will result in less precise evaluations of the effects of the primary and background variables. The design of the experiment should eliminate or control these types of variables as much as possible in order to increase confidence in the final results.

20 Explanatory and response variables XX YY - Explanatory variables - Factors - Response variables

21 Factors - Noise factor - Blocking factor ZZ Treatment factor or design factor XX YY Response variables Levels: XX = xx Treatment combination or treatment: a particular combination of factor levels (e.g. xx 1, xx 2 if there are two treatment factors)

22 Primary factors The treatment structure consists of factors that the researcher wants to study and about which the researcher will make inferences. The primary (treatment or design) factors are controlled by the researcher and are expected to show the effects of greatest interest on the response variable(s).

23 Levels The levels of the primary factors represent the range of the inference space relative to a study. The levels of the primary factors can represent the entire range of possibilities or a random sub-set. It is also important to recognize and define when combinations of levels of two or more treatment factors are illogical or unlikely to exist.

24 Fixed effects Fixed effects treatment factors are usually considered to be "fixed" in the sense that all levels of interest are included in the study because they are selected by some non-random process, they consist of the whole population of possible levels, or other levels were not feasible to consider as part of the study. The fixed effects represent the levels of a set of precise hypotheses of interest in the research. A fixed factor can have only a small number of inherent levels; for example, the only relevant levels for gender are male and female. A factor should also be considered fixed when only certain values of it are of interest, even though other levels might exist. Treatment factors can also be considered "fixed" as opposed to "random" because they are the only levels about which you would want to make inferences.

25 Three basic principles of experimental design Replication Randomization Blocking

26 Replication By replication we mean an independent repeat run of each treatment combination. Replication is essential for estimating experimental error. If a treatment condition appears more than one time, it is defined to be replicated. Misconceptions about the number of replications have often occurred in experiments where sub-samples or repeated observations on a unit have been mistaken as additional experimental units.

27 Randomization By randomization we mean that both the assignment of treatments to units and the order in which the individual runs of the experiments are to be performed are randomly determined. A completely randomized design is an experimental design in which treatments are assigned to all units by randomization.

28 Example: Randomized Experimental unit: plant on the pot 4 replicates for each treatment

29 Blocking Most experimental designs require experimental units to be allocated to treatments either randomly or randomly with constraints, as in blocked designs. Blocks are groups of experimental units that are formed to be as homogeneous as possible with respect to the block characteristics. The term block comes from the agricultural heritage of experimental design where a large block of land was selected for the various treatments, that had uniform soil, drainage, sunlight, and other important physical characteristics. Homogeneous clusters improve the comparison of treatments by randomly allocating levels of the treatments within each block.

30 Blocking Blocking is an experimental design strategy used to reduce or eliminate the variability transmitted from nuisance factors, which may influence the response variable but in which we are not directly interested. Blocking is the grouping of experimental units that have similar properties. Within each block, treatments are randomly assigned to experimental units. The resulting design is called a randomized block design. This design enables more precise estimates of the treatment effects because comparisons between treatments are made among homogeneous experimental units in each block.

31 Blocking ZZ XX YY

32 Blocking example Blocking removes the variation in response among chambers, allowing more precise estimates and more powerful tests of the treatment effects.

33 Design structure The design structure consists of those factors that define the blocking of the experimental units into clusters. The types of commonly used design structures: Completely randomized design Randomized complete block design Factorial design

34 Completely randomized design Subjects are assigned to treatments completely at random.

35 Randomized complete block design Subjects are divided into b blocks (see description of blocks above) according to demographic characteristics. Subjects in each block are then randomly assigned to treatments so that all treatment levels appear in each block.

36 Factorial design Many experiments in biology investigate more than one treatment factor, because: 1. answering two questions from a single experiment rather than just one makes more efficient use of time, supplies, and other costs 2. the factors might interact.

37 Factorial design An experiment having a factorial design investigates all treatment combinations of two or more treatment factors. A factorial design can measure interactions between factors. An interaction between two (or more) explanatory variables means that the effect of one variable on the response depends on the state of the other variable.

38 Factorial design XX 2 XX 1 YY

39 Analyzing data

40 A unified model: general linear model EE[yy] = ββ 0 + ββ 1 xx ββ pp 1 xx pp 1

41 Basic linear models Model formula Model Design yy~xx Linear regression Dose-response yy~t One-way ANOVA Completely randomized yy~t + b Two-way ANOVA Randomized block yy~t 1 + t 2 + t 1 t 2 Two-way, fixed-effect ANOVA Factorial design yy~tt + xx ANCOVA Observation study with one known noise factor yy~xx 1 + xx 2 + xx 1 xx 2 Multiple linear regression Dose-response xx: numerical, t: categorical treatment factor, b: categorical blocking factor

42 Randomized complete block design How does fish abundance affects the abundance and diversity of prey species?

43 Design 3mm 3mm 30 fish 90 fish Control Low High

44 Data: Zooplankton diversity in three fish abundance treatments Control Low High

45 Model: yy~t + b yy ii = ββ 0 + ββ 1 tt ii + ββ 2 b i + εε ii H0: Mean zooplankton diversity is the same in every abundance treatment yy~b H1: Mean zooplankton diversity is not the same in every abundance treatment yy~t + b

46 Fitting the model to data

47 Adjusting for a known confounding factor

48 Adjusting for a known confounding factor Mole rats are the only known mammals with distinct social castes. - A single queen and a small number of males are the only reproducing individuals in a colony. - Workers gather food, defend the colony, care for the young, and maintain the burrows. - Two worker castes in the Damaraland mole rat: - Frequent workers : do almost all of the work in the colony - Infrequent workers : do little work except on rare occasions after rains

49 Adjusting for a known confounding factor To assess the physiological differences between the two types of workers, researchers compared daily energy expenditures of wild mole rats during a dry season. Known noise factor: Energy expenditure appears to vary with body mass in both groups, but infrequent workers are heavier than frequent workers Research question: How different is mean daily energy expenditure between the two groups when adjusted for differences in body mass?

50 Data

51 Data

52 Model: yy~tt + xx H0: Castes do not differ in energy expenditure yy~xx H1: Castes differ in energy expenditure yy~tt + xx

53 Fitting the model to data

54 Example: RNA-seq

55 Multiple factors Experiments with more than one factor influencing the counts can be analyzed using design formula that include the additional variables. In fact, DESeq2 can analyze any possible experimental design that can be expressed with fixed effects terms (multiple factors, designs with interactions, designs with continuous variables, splines, and so on are all possible). By adding variables to the design, one can control for additional variation in the counts. For example, if the condition samples are balanced across experimental batches, by including the batch factor to the design, one can increase the sensitivity for finding differences due to condition. There are multiple ways to analyze experiments when the additional variables are of interest and not just controlling factors.

56 Including type

57 Accounting for type We can account for the different types of sequencing, and get a clearer picture of the differences attributable to the treatment. As condition is the variable of interest, we put it at the end of the formula. Thus the results function will by default pull the condition results unless contrast or name arguments are specified. Then we can rerun DESeq.

58 Accounting for type

59 Accounting for type

60 Accounting for type

61 Accounting for type It is also possible to retrieve the log2 fold changes, p values and adjusted p values of the type variable. The contrast argument of the function results takes a character vector of length three: the name of the variable, the name of the factor level for the numerator of the log2 ratio, and the name of the factor level for the denominator.

62 Accounting for type

63 Gene Ontology

64 Annotating and exporting results Our result table only contains information about Ensembl gene IDs, but alternative gene names may be more informative for collaborators. Bioconductor s annotation packages help with mapping various ID schemes to each other.

65 Annotating and exporting results

66 Annotating and exporting results

67 Running topgo

68 Running topgo

69 Running topgo

70 Running topgo

71 Downregulated GO

72 Upregulated GO

73 Published results The top 10 most significant terms are shown for downregulated (D) and upregulated (E) genes, respectively.

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

RNA-seq. Differential analysis

RNA-seq. Differential analysis RNA-seq Differential analysis Data transformations Count data transformations In order to test for differential expression, we operate on raw counts and use discrete distributions differential expression.

More information

Experimental Studies. Statistical techniques for Experimental Data. Experimental Designs can be grouped. Experimental Designs can be grouped

Experimental Studies. Statistical techniques for Experimental Data. Experimental Designs can be grouped. Experimental Designs can be grouped Experimental Studies Statistical techniques for Experimental Data Require appropriate manipulations and controls Many different designs Consider an overview of the designs Examples of some of the analyses

More information

Experimental Design for Immunologists

Experimental Design for Immunologists Experimental Design for Immunologists Hulin Wu, Ph.D., Dean s Professor Department of Biostatistics & Computational Biology Co-Director: Center for Biodefense Immune Modeling School of Medicine and Dentistry

More information

Statistics 2. RCBD Review. Agriculture Innovation Program

Statistics 2. RCBD Review. Agriculture Innovation Program Statistics 2. RCBD Review 2014. Prepared by Lauren Pincus With input from Mark Bell and Richard Plant Agriculture Innovation Program 1 Table of Contents Questions for review... 3 Answers... 3 Materials

More information

investigate. educate. inform.

investigate. educate. inform. investigate. educate. inform. Research Design What drives your research design? The battle between Qualitative and Quantitative is over Think before you leap What SHOULD drive your research design. Advanced

More information

9.0 L '- ---'- ---'- --' X

9.0 L '- ---'- ---'- --' X 352 C hap te r Ten 11.0 10.5 Y 10.0 9.5 9.0 L...- ----'- ---'- ---'- --' 0.0 0.5 1.0 X 1.5 2.0 FIGURE 10.23 Interpreting r = 0 for curvilinear data. Establishing causation requires solid scientific understanding.

More information

QA 605 WINTER QUARTER ACADEMIC YEAR

QA 605 WINTER QUARTER ACADEMIC YEAR Instructor: Office: James J. Cochran 117A CAB Telephone: (318) 257-3445 Hours: e-mail: URL: QA 605 WINTER QUARTER 2006-2007 ACADEMIC YEAR Tuesday & Thursday 8:00 a.m. 10:00 a.m. Wednesday 8:00 a.m. noon

More information

STATISTICAL CONCLUSION VALIDITY

STATISTICAL CONCLUSION VALIDITY Validity 1 The attached checklist can help when one is evaluating the threats to validity of a study. VALIDITY CHECKLIST Recall that these types are only illustrative. There are many more. INTERNAL VALIDITY

More information

Multiple choice questions: 1. Which of the following is a key distinction between well designed experiments and observational studies?

Multiple choice questions: 1. Which of the following is a key distinction between well designed experiments and observational studies? Experimental Design Be sure you understand that: Experiments are studies in which the researcher imposes a treatment on experimental units. Sometimes different treatments are simply compared with one another.

More information

Principles of Experimental Design

Principles of Experimental Design Principles of Experimental Design Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 15, 2011 Designing Experiments 1 / 31 Experimental Design Many interesting

More information

Principles of Experimental Design

Principles of Experimental Design Principles of Experimental Design Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 15, 2011 Designing Experiments 1 / 31 Experimental Design Many interesting

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

Conducting a Good Experiment I: Variables and Control

Conducting a Good Experiment I: Variables and Control CHAPTER SIX Conducting a Good Experiment I: Variables and Control 1 The Nature of Variables! Variable! A variable is an event or behavior that can assume at least two values.! Bridgman (1927) suggested

More information

Chapter 5: Field experimental designs in agriculture

Chapter 5: Field experimental designs in agriculture Chapter 5: Field experimental designs in agriculture Jose Crossa Biometrics and Statistics Unit Crop Research Informatics Lab (CRIL) CIMMYT. Int. Apdo. Postal 6-641, 06600 Mexico, DF, Mexico Introduction

More information

Where does "analysis" enter the experimental process?

Where does analysis enter the experimental process? Lecture Topic : ntroduction to the Principles of Experimental Design Experiment: An exercise designed to determine the effects of one or more variables (treatments) on one or more characteristics (response

More information

Biostatistics for Med Students. Lecture 1

Biostatistics for Med Students. Lecture 1 Biostatistics for Med Students Lecture 1 John J. Chen, Ph.D. Professor & Director of Biostatistics Core UH JABSOM JABSOM MD7 February 14, 2018 Lecture note: http://biostat.jabsom.hawaii.edu/education/training.html

More information

The Practice of Statistics 1 Week 2: Relationships and Data Collection

The Practice of Statistics 1 Week 2: Relationships and Data Collection The Practice of Statistics 1 Week 2: Relationships and Data Collection Video 12: Data Collection - Experiments Experiments are the gold standard since they allow us to make causal conclusions. example,

More information

The essential focus of an experiment is to show that variance can be produced in a DV by manipulation of an IV.

The essential focus of an experiment is to show that variance can be produced in a DV by manipulation of an IV. EXPERIMENTAL DESIGNS I: Between-Groups Designs There are many experimental designs. We begin this week with the most basic, where there is a single IV and where participants are divided into two or more

More information

9 research designs likely for PSYC 2100

9 research designs likely for PSYC 2100 9 research designs likely for PSYC 2100 1) 1 factor, 2 levels, 1 group (one group gets both treatment levels) related samples t-test (compare means of 2 levels only) 2) 1 factor, 2 levels, 2 groups (one

More information

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

VALIDITY OF QUANTITATIVE RESEARCH

VALIDITY OF QUANTITATIVE RESEARCH Validity 1 VALIDITY OF QUANTITATIVE RESEARCH Recall the basic aim of science is to explain natural phenomena. Such explanations are called theories (Kerlinger, 1986, p. 8). Theories have varying degrees

More information

26:010:557 / 26:620:557 Social Science Research Methods

26:010:557 / 26:620:557 Social Science Research Methods 26:010:557 / 26:620:557 Social Science Research Methods Dr. Peter R. Gillett Associate Professor Department of Accounting & Information Systems Rutgers Business School Newark & New Brunswick 1 Overview

More information

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Keele University School of Psychology Staffordshire, United Kingdom f WILEY A JOHN WILEY & SONS, INC., PUBLICATION Contents Acknowledgments

More information

Chapter 8 Statistical Principles of Design. Fall 2010

Chapter 8 Statistical Principles of Design. Fall 2010 Chapter 8 Statistical Principles of Design Fall 2010 Experimental Design Many interesting questions in biology involve relationships between response variables and one or more explanatory variables. Biology

More information

Biostatistics 2 nd year Comprehensive Examination. Due: May 31 st, 2013 by 5pm. Instructions:

Biostatistics 2 nd year Comprehensive Examination. Due: May 31 st, 2013 by 5pm. Instructions: Biostatistics 2 nd year Comprehensive Examination Due: May 31 st, 2013 by 5pm. Instructions: 1. The exam is divided into two parts. There are 6 questions in section I and 2 questions in section II. 2.

More information

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process Research Methods in Forest Sciences: Learning Diary Yoko Lu 285122 9 December 2016 1. Research process It is important to pursue and apply knowledge and understand the world under both natural and social

More information

Name: Experimental Design

Name: Experimental Design Name: Experimental Design Period: 2001 Number 4 1. Students are designing an experiment to compare the productivity of two varieties of dwarf fruit trees. The site for the experiment is a field that is

More information

Causal Research Design- Experimentation

Causal Research Design- Experimentation In a social science (such as marketing) it is very important to understand that effects (e.g., consumers responding favorably to a new buzz marketing campaign) are caused by multiple variables. The relationships

More information

THE USE OF NONPARAMETRIC PROPENSITY SCORE ESTIMATION WITH DATA OBTAINED USING A COMPLEX SAMPLING DESIGN

THE USE OF NONPARAMETRIC PROPENSITY SCORE ESTIMATION WITH DATA OBTAINED USING A COMPLEX SAMPLING DESIGN THE USE OF NONPARAMETRIC PROPENSITY SCORE ESTIMATION WITH DATA OBTAINED USING A COMPLEX SAMPLING DESIGN Ji An & Laura M. Stapleton University of Maryland, College Park May, 2016 WHAT DOES A PROPENSITY

More information

Use of the Quantitative-Methods Approach in Scientific Inquiry. Du Feng, Ph.D. Professor School of Nursing University of Nevada, Las Vegas

Use of the Quantitative-Methods Approach in Scientific Inquiry. Du Feng, Ph.D. Professor School of Nursing University of Nevada, Las Vegas Use of the Quantitative-Methods Approach in Scientific Inquiry Du Feng, Ph.D. Professor School of Nursing University of Nevada, Las Vegas The Scientific Approach to Knowledge Two Criteria of the Scientific

More information

Randomized Block Designs 1

Randomized Block Designs 1 Randomized Block Designs 1 STA305 Winter 2014 1 See last slide for copyright information. 1 / 1 Background Reading Optional Photocopy 2 from an old textbook; see course website. It s only four pages. The

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

AP Statistics Unit 4.2 Day 3 Notes: Experimental Design. Expt1:

AP Statistics Unit 4.2 Day 3 Notes: Experimental Design. Expt1: AP Statistics Unit 4.2 Day 3 Notes: Experimental Design OBSERVATION -observe outcomes without imposing any treatment EXPERIMENT -actively impose some treatment in order to observe the response I ve developed

More information

Chapter 4 DESIGN OF EXPERIMENTS

Chapter 4 DESIGN OF EXPERIMENTS Chapter 4 DESIGN OF EXPERIMENTS 4.1 STRATEGY OF EXPERIMENTATION Experimentation is an integral part of any human investigation, be it engineering, agriculture, medicine or industry. An experiment can be

More information

Designing Experiments... Or how many times and ways can I screw that up?!?

Designing Experiments... Or how many times and ways can I screw that up?!? www.geo.uzh.ch/microsite/icacogvis/ Designing Experiments... Or how many times and ways can I screw that up?!? Amy L. Griffin AutoCarto 2012, Columbus, OH Outline When do I need to run an experiment and

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

1. The Role of Sample Survey Design

1. The Role of Sample Survey Design Vista's Approach to Sample Survey Design 1978, 1988, 2006, 2007, 2009 Joseph George Caldwell. All Rights Reserved. Posted at Internet website http://www.foundationwebsite.org. Updated 20 March 2009 (two

More information

Topic #6. Quasi-experimental designs are research studies in which participants are selected for different conditions from pre-existing groups.

Topic #6. Quasi-experimental designs are research studies in which participants are selected for different conditions from pre-existing groups. ARTHUR PSYC 204 (EXPERIMENTAL PSYCHOLOGY) 17A LECTURE NOTES [03/08/17] QUASI-EXPERIMENTAL DESIGNS PAGE 1 Topic #6 QUASI-EXPERIMENTAL DESIGNS Again, central issue is one of research validity. Quasi-experimental

More information

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study. You can t fix by analysis what you bungled by design. Light, Singer and Willett Or, not as catchy but perhaps more accurate: Fancy analysis can t fix a poorly designed study. Producing Data The Role of

More information

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box Assignment #6 Chapter 10: 14, 15 Chapter 11: 14, 18 Due tomorrow Nov. 6 th by 2pm in your TA s homework box Assignment #7 Chapter 12: 18, 24 Chapter 13: 28 Due next Friday Nov. 13 th by 2pm in your TA

More information

1a Materials come in different forms (states) including solids,

1a Materials come in different forms (states) including solids, Physical Sciences 1a Materials come in different forms (states) including solids, liquids, and gases. As a basis for understanding this concept: Students know solids, liquids, and gases have different

More information

Part 8 Logistic Regression

Part 8 Logistic Regression 1 Quantitative Methods for Health Research A Practical Interactive Guide to Epidemiology and Statistics Practical Course in Quantitative Data Handling SPSS (Statistical Package for the Social Sciences)

More information

Fading Affect Bias (FAB):

Fading Affect Bias (FAB): Fading Affect Bias (FAB): The intensity of affect associated with a recalled event generally decreases over time, but this affective fading is greater for negative events than for positive events. 6 5.5

More information

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity Measurement & Variables - Initial step is to conceptualize and clarify the concepts embedded in a hypothesis or research question with

More information

10 Intraclass Correlations under the Mixed Factorial Design

10 Intraclass Correlations under the Mixed Factorial Design CHAPTER 1 Intraclass Correlations under the Mixed Factorial Design OBJECTIVE This chapter aims at presenting methods for analyzing intraclass correlation coefficients for reliability studies based on a

More information

Applied Analysis of Variance and Experimental Design. Lukas Meier, Seminar für Statistik

Applied Analysis of Variance and Experimental Design. Lukas Meier, Seminar für Statistik Applied Analysis of Variance and Experimental Design Lukas Meier, Seminar für Statistik About Me Studied mathematics at ETH. Worked at the statistical consulting service and did a PhD in statistics (at

More information

Reliability of Ordination Analyses

Reliability of Ordination Analyses Reliability of Ordination Analyses Objectives: Discuss Reliability Define Consistency and Accuracy Discuss Validation Methods Opening Thoughts Inference Space: What is it? Inference space can be defined

More information

Completely randomized designs, Factors, Factorials, and Blocking

Completely randomized designs, Factors, Factorials, and Blocking Completely randomized designs, Factors, Factorials, and Blocking STAT:5201 Week 2: Lecture 1 1 / 35 Completely Randomized Design (CRD) Simplest design set-up Treatments are randomly assigned to EUs Easiest

More information

ANCOVA with Regression Homogeneity

ANCOVA with Regression Homogeneity ANCOVA with Regression Homogeneity The purpose of the study was to compare the effectiveness of two different treatments in two populations. Both treatments have been repeatedly shown to work better than

More information

CONSORT 2010 checklist of information to include when reporting a randomised trial*

CONSORT 2010 checklist of information to include when reporting a randomised trial* CONSORT 2010 checklist of information to include when reporting a randomised trial* Section/Topic Title and abstract Introduction Background and objectives Item No Checklist item 1a Identification as a

More information

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu What you should know before you collect data BAE 815 (Fall 2017) Dr. Zifei Liu Zifeiliu@ksu.edu Types and levels of study Descriptive statistics Inferential statistics How to choose a statistical test

More information

USE AND MISUSE OF MIXED MODEL ANALYSIS VARIANCE IN ECOLOGICAL STUDIES1

USE AND MISUSE OF MIXED MODEL ANALYSIS VARIANCE IN ECOLOGICAL STUDIES1 Ecology, 75(3), 1994, pp. 717-722 c) 1994 by the Ecological Society of America USE AND MISUSE OF MIXED MODEL ANALYSIS VARIANCE IN ECOLOGICAL STUDIES1 OF CYNTHIA C. BENNINGTON Department of Biology, West

More information

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ OBJECTIVES Definitions Stages of Scientific Knowledge Quantification and Accuracy Types of Medical Data Population and sample Sampling methods DEFINITIONS

More information

Lecture 1 An introduction to statistics in Ichthyology and Fisheries Science

Lecture 1 An introduction to statistics in Ichthyology and Fisheries Science Lecture 1 An introduction to statistics in Ichthyology and Fisheries Science What is statistics and why do we need it? Statistics attempts to make inferences about unknown values that are common to a population

More information

Validity and Reliability. PDF Created with deskpdf PDF Writer - Trial ::

Validity and Reliability. PDF Created with deskpdf PDF Writer - Trial :: Validity and Reliability PDF Created with deskpdf PDF Writer - Trial :: http://www.docudesk.com Validity Is the translation from concept to operationalization accurately representing the underlying concept.

More information

HOW STATISTICS IMPACT PHARMACY PRACTICE?

HOW STATISTICS IMPACT PHARMACY PRACTICE? HOW STATISTICS IMPACT PHARMACY PRACTICE? CPPD at NCCR 13 th June, 2013 Mohamed Izham M.I., PhD Professor in Social & Administrative Pharmacy Learning objective.. At the end of the presentation pharmacists

More information

STATISTICS & PROBABILITY

STATISTICS & PROBABILITY STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness

More information

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: II. DESIGN OF STUDIES Observational studies and experiments are two types of studies that aim to describe or explain the variation of responses under the hypothesized factors, without or with manipulation.

More information

STATS8: Introduction to Biostatistics. Overview. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Overview. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Overview Babak Shahbaba Department of Statistics, UCI The role of statistical analysis in science This course discusses some biostatistical methods, which involve

More information

Design of Experiments & Introduction to Research

Design of Experiments & Introduction to Research Design of Experiments & Introduction to Research 1 Design of Experiments Introduction to Research Definition and Purpose Scientific Method Research Project Paradigm Structure of a Research Project Types

More information

MTH 225: Introductory Statistics

MTH 225: Introductory Statistics Marshall University College of Science Mathematics Department MTH 225: Introductory Statistics Course catalog description Basic probability, descriptive statistics, fundamental statistical inference procedures

More information

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing Research Methods in Psychology Chapter 6: Independent Groups Designs 1 Why Psychologists Conduct Experiments? What is your ideas? 2 Why Psychologists Conduct Experiments? Testing Hypotheses derived from

More information

PSY 250. Experimental Design: The Basic Building Blocks. Simple between subjects design. The Two-Group Design 7/25/2015. Experimental design

PSY 250. Experimental Design: The Basic Building Blocks. Simple between subjects design. The Two-Group Design 7/25/2015. Experimental design Experimental Design: The Basic Building Blocks PSY 250 Experimental design The general plan for selecting participants, assigning participants to experimental conditions, controlling extraneous variables,

More information

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to

More information

Statistics and Probability

Statistics and Probability Statistics and a single count or measurement variable. S.ID.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). S.ID.2: Use statistics appropriate to the shape

More information

Previous Example. New. Tradition

Previous Example. New. Tradition Experimental Design Previous Example New Tradition Goal? New Tradition =? Challenges Internal validity How to guarantee what you have observed is true? External validity How to guarantee what you have

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

Lecture 4: Research Approaches

Lecture 4: Research Approaches Lecture 4: Research Approaches Lecture Objectives Theories in research Research design approaches ú Experimental vs. non-experimental ú Cross-sectional and longitudinal ú Descriptive approaches How to

More information

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 DETAILED COURSE OUTLINE Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 Hal Morgenstern, Ph.D. Department of Epidemiology UCLA School of Public Health Page 1 I. THE NATURE OF EPIDEMIOLOGIC

More information

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies. Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam May 28 th, 2015: 9am to 1pm Instructions: 1. There are seven questions and 12 pages. 2. Read each question carefully. Answer

More information

Exercises: Differential Methylation

Exercises: Differential Methylation Exercises: Differential Methylation Version 2018-04 Exercises: Differential Methylation 2 Licence This manual is 2014-18, Simon Andrews. This manual is distributed under the creative commons Attribution-Non-Commercial-Share

More information

Psychology Research Process

Psychology Research Process Psychology Research Process Logical Processes Induction Observation/Association/Using Correlation Trying to assess, through observation of a large group/sample, what is associated with what? Examples:

More information

Sampling Obtaining a portion that is representative of the whole. Some basic terms. Objectives of Sampling

Sampling Obtaining a portion that is representative of the whole. Some basic terms. Objectives of Sampling Sampling Obtaining a portion that is representative of the whole The total quantity from which sample is obtained is the population West Africa Graduate Course on Food Composition and Biodiversity, Ghana,

More information

Dr. Allen Back. Oct. 7, 2016

Dr. Allen Back. Oct. 7, 2016 Dr. Allen Back Oct. 7, 2016 al Was it Fair? The first draft lottery during the Vietnam War: 366 balls labeled by dates. Mixed up and pulled out in a random order. al Was it Fair? Scatterplot al Was it

More information

Introduction At times when pollen is scarce in the natural environment

Introduction At times when pollen is scarce in the natural environment Artificial Bee Diets Taryn Major Linda Eaton Kathy Haskard Philip Vlaskovsky Rob Manning (Department of Agriculture and Food) Introduction At times when pollen is scarce in the natural environment Bees

More information

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis

More information

Role of Statistics in Research

Role of Statistics in Research Role of Statistics in Research Role of Statistics in research Validity Will this study help answer the research question? Analysis What analysis, & how should this be interpreted and reported? Efficiency

More information

User Guide. Association analysis. Input

User Guide. Association analysis. Input User Guide TFEA.ChIP is a tool to estimate transcription factor enrichment in a set of differentially expressed genes using data from ChIP-Seq experiments performed in different tissues and conditions.

More information

Key questions when starting an econometric project (Angrist & Pischke, 2009):

Key questions when starting an econometric project (Angrist & Pischke, 2009): Econometric & other impact assessment approaches to policy analysis Part 1 1 The problem of causality in policy analysis Internal vs. external validity Key questions when starting an econometric project

More information

INTRODUCTION TO STATISTICS

INTRODUCTION TO STATISTICS Basic Statistics Introduction to Statistics Basic Statistical Formulas Commonly used Ecological Equations INTRODUCTION TO STATISTICS Statistics is the branch of mathematics that deals with the techniques

More information

EXPERIMENTAL RESEARCH DESIGNS

EXPERIMENTAL RESEARCH DESIGNS ARTHUR PSYC 204 (EXPERIMENTAL PSYCHOLOGY) 14A LECTURE NOTES [02/28/14] EXPERIMENTAL RESEARCH DESIGNS PAGE 1 Topic #5 EXPERIMENTAL RESEARCH DESIGNS As a strict technical definition, an experiment is a study

More information

Designing an experiment 7 TH /8 TH GRADE SCIENCE

Designing an experiment 7 TH /8 TH GRADE SCIENCE Designing an experiment 7 TH /8 TH GRADE SCIENCE Scientific inquiry 1. Make an observation 2. Ask a question 3. Create a hypothesis 4. Design an experiment 5. Gather and analyze data 6. Draw conclusions

More information

lab exam lab exam Experimental Design Experimental Design when: Nov 27 - Dec 1 format: length = 1 hour each lab section divided in two

lab exam lab exam Experimental Design Experimental Design when: Nov 27 - Dec 1 format: length = 1 hour each lab section divided in two lab exam when: Nov 27 - Dec 1 length = 1 hour each lab section divided in two register for the exam in your section so there is a computer reserved for you If you write in the 1st hour, you can t leave

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Two Factor Analysis of Variance

Two Factor Analysis of Variance BIOL 310 Two Factor Analysis of Variance In the previous discussions of analysis of variance (ANOVA), only one factor was involved. For example, in Chapter 7 the variable of interest in the sample problem

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Statistical Primer for Cardiovascular Research

Statistical Primer for Cardiovascular Research Statistical Primer for Cardiovascular Research Repeated Measures Lisa M. Sullivan, PhD A repeated-measures design is one in which multiple, or repeated, measurements are made on each experimental unit.

More information

Experimental Design There is no recovery from poorly collected data!

Experimental Design There is no recovery from poorly collected data! Experimental Design There is no recovery from poorly collected data! Vocabulary List n Look over the list of words. n Count how many you feel you know. n Place a dot on the number line above that number.

More information

Chapter 9: Experiments

Chapter 9: Experiments Chapter 9: Experiments WHAT IS EXPERIMENTATION? Experiments are studies involving intervention by the researcher beyond that required for measurement. The usual intervention is to manipulate some variable

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

Profile Analysis. Intro and Assumptions Psy 524 Andrew Ainsworth

Profile Analysis. Intro and Assumptions Psy 524 Andrew Ainsworth Profile Analysis Intro and Assumptions Psy 524 Andrew Ainsworth Profile Analysis Profile analysis is the repeated measures extension of MANOVA where a set of DVs are commensurate (on the same scale). Profile

More information

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings.

DATA GATHERING. Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings. DATA GATHERING Define : Is a process of collecting data from sample, so as for testing & analyzing before reporting research findings. 2012 John Wiley & Sons Ltd. Measurement Measurement: the assignment

More information

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research Chapter 11 Nonexperimental Quantitative Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.) Nonexperimental research is needed because

More information

Introduction, Evidence, and Sampling

Introduction, Evidence, and Sampling Motivation: Why analyze data? Introduction, Evidence, and Sampling Clinical trials/drug development: compare existing treatments with new methods Agriculture: enhance crop yields, improve pest resistance

More information

Biological Models of Anxiety

Biological Models of Anxiety The biomedical model of human health primarily considers physiological factors when attempting to understand illness. When applied to mental disorders, this model assumes that such disorders can be conceptualized

More information

UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4

UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4 UNIT 4 ALGEBRA II TEMPLATE CREATED BY REGION 1 ESA UNIT 4 Algebra II Unit 4 Overview: Inferences and Conclusions from Data In this unit, students see how the visual displays and summary statistics they

More information