Lecture Notes Module 2


Two-group Experimental Designs

The goal of most research is to assess a possible causal relation between the response variable and another variable called the independent variable. In experimental designs, the response variable is usually called a dependent variable. Three basic conditions must be satisfied to demonstrate a causal relation between a dependent variable and an independent variable. First, there must be a relation between the dependent variable and the independent variable. Second, the observed change in the dependent variable must have occurred after the change in the independent variable. Third, no variable other than the independent variable can be responsible for the relation between the dependent variable and the independent variable.

An experiment can be used to assess a causal relation. The simplest type of experiment involves just two treatment conditions that represent the levels of the independent variable. In a two-group experiment, a random sample of n participants is selected from a study population. The random sample is then randomized (i.e., randomly divided) into two groups, and each group receives one of the two treatments, with participants treated identically within each group. If one group does not receive any treatment, it is called a control group. Following treatment, a measurement of the dependent variable is obtained for each participant.

In a two-group experiment with a quantitative dependent variable, a population mean can be estimated from each group. In an experimental design, the population means have interesting interpretations: $\mu_1$ is the population mean of the dependent variable assuming all participants in the study population had received level 1 of the independent variable, and $\mu_2$ is the population mean of the dependent variable assuming all participants in the study population had received level 2 of the independent variable.
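The randomization step described above can be sketched in a few lines of Python. This is only an illustration: the function name `randomize_two_groups`, the participant IDs, and the seed are hypothetical and do not come from the notes.

```python
import random

def randomize_two_groups(participants, seed=None):
    """Randomly divide a sample into two groups of (nearly) equal size."""
    rng = random.Random(seed)      # a seed makes the assignment reproducible
    shuffled = list(participants)  # copy so the caller's list is not modified
    rng.shuffle(shuffled)          # every ordering is equally likely
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]

# Hypothetical example: 80 participant IDs split into two groups of 40.
sample_ids = list(range(1, 81))
group1, group2 = randomize_two_groups(sample_ids, seed=2)
print(len(group1), len(group2))
```

Because group membership is decided by the shuffle alone, no participant characteristic can systematically determine which treatment a participant receives.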
The difference in population means for the two treatment conditions, $\mu_1 - \mu_2$, is called the effect size and describes the strength of the relation between the dependent and independent variables. In an experiment, a nonzero effect size is evidence that the independent variable has a causal effect on the dependent variable because all three conditions required for a causal association will have been satisfied: 1) a nonzero effect size implies a relation between the dependent and independent variables, 2) the change in the dependent variable occurred after the change in the independent variable, and 3) because the participants were randomized into the levels of the independent variable, no other variable could have caused the nonzero effect size. A confidence interval for $\mu_1 - \mu_2$ provides information about the direction and magnitude of the effect size.

Confidence Interval for Mean Difference

A $100(1 - \alpha)\%$ confidence interval for $\mu_1 - \mu_2$ is

$$\hat{\mu}_1 - \hat{\mu}_2 \pm t_{\alpha/2;df}\, SE_{\hat{\mu}_1 - \hat{\mu}_2} \qquad (2.1)$$

where $t_{\alpha/2;df}$ is a critical t-value, $SE_{\hat{\mu}_1 - \hat{\mu}_2} = \sqrt{\hat{\sigma}_p^2/n_1 + \hat{\sigma}_p^2/n_2}$ is the estimated standard error of $\hat{\mu}_1 - \hat{\mu}_2$, $df = n_1 + n_2 - 2$, and $\hat{\sigma}_p^2 = [(n_1 - 1)\hat{\sigma}_1^2 + (n_2 - 1)\hat{\sigma}_2^2]/(n_1 + n_2 - 2)$. The standard error in Equation 2.1 is called a pooled-variance standard error because it assumes equal population variances and uses a pooled estimate of the common population variance.

In an experiment, recall that all participants within a particular treatment group should be treated identically. The within-group variance estimates, $\hat{\sigma}_1^2$ and $\hat{\sigma}_2^2$, represent unexplained variability in the dependent variable. The within-group variance is also referred to as error variance.

Example 2.1. A psychologist believes that it is important for 2nd grade students to overlearn the multiplication tables so that these computations can be made rapidly and without thought when students later begin working on more complex math problems. A population of 2nd grade students was identified in a particular school district, and 80 students were randomly selected from this study population. The 80 students were randomized into two groups of equal size. The first group was a control group and received no additional multiplication table training. The second group received 15 minutes per day of extra multiplication table training for 60 days. At the end of the 60-day training period, all 80 students were given a multiplication test, and the time (in seconds) to complete the test was recorded for each student. The sample means and standard deviations are given below.
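Group 1: $\hat{\mu}_1 =$ [value missing], $\hat{\sigma}_1 = 27.2$
Group 2: $\hat{\mu}_2 =$ [value missing], $\hat{\sigma}_2 = 20.8$

The 95% confidence interval for $\mu_1 - \mu_2$ is

$$\hat{\mu}_1 - \hat{\mu}_2 \pm t_{.05/2;df}\, SE_{\hat{\mu}_1 - \hat{\mu}_2} = [150.0,\ 171.6]$$

where $df = 40 + 40 - 2 = 78$, $t_{.05/2;78} = 2.00$, and $\hat{\sigma}_p^2 = [(39)(27.2)^2 + (39)(20.8)^2]/78 = 586.24$. The psychologist is 95% confident that in the study population of 2nd grade students, the average time to complete the multiplication tables test would be 150.0 to 171.6 seconds faster if they had all received the extra math training for 60 days.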
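The arithmetic behind Equation 2.1 can be checked with a short Python sketch. The function name `pooled_ci` is hypothetical. Because the two sample means are not available in this copy of the notes, the mean difference of 160.8 used below is an assumption: it is simply the midpoint of the reported interval [150.0, 171.6]. The critical value 2.00 is the one quoted in Example 2.1.

```python
import math

def pooled_ci(mean_diff, sd1, sd2, n1, n2, t_crit):
    """Pooled-variance confidence interval for mu1 - mu2 (Equation 2.1)."""
    df = n1 + n2 - 2
    # Pooled estimate of the common population variance.
    pooled_var = ((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df
    # Pooled-variance standard error of the difference in sample means.
    se = math.sqrt(pooled_var / n1 + pooled_var / n2)
    margin = t_crit * se
    return mean_diff - margin, mean_diff + margin, pooled_var, se, df

# mean_diff = 160.8 is ASSUMED (midpoint of the reported interval).
lower, upper, pooled_var, se, df = pooled_ci(
    mean_diff=160.8, sd1=27.2, sd2=20.8, n1=40, n2=40, t_crit=2.00
)
print(round(lower, 1), round(upper, 1))  # 150.0 171.6
```

Under these assumptions the sketch reproduces the interval reported in Example 2.1, with a pooled variance of 586.24 and a standard error of about 5.41.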

Hypothesis Testing

The confidence intervals for $\mu_1 - \mu_2$ described above can be used to test hypotheses. For instance, a confidence interval for $\mu_1 - \mu_2$ may be used to implement a three-decision rule for the following hypotheses.

H0: $\mu_1 = \mu_2$
H1: $\mu_1 > \mu_2$
H2: $\mu_1 < \mu_2$

If the lower limit for $\mu_1 - \mu_2$ is greater than 0, then reject H0 and accept H1: $\mu_1 > \mu_2$.
If the upper limit for $\mu_1 - \mu_2$ is less than 0, then reject H0 and accept H2: $\mu_1 < \mu_2$.
If the confidence interval includes 0, H0: $\mu_1 = \mu_2$ cannot be rejected (an inconclusive result).

In a two-group design, the test of H0: $\mu_1 = \mu_2$ is commonly referred to as an independent-samples t-test and involves the computation of the test statistic $t = (\hat{\mu}_1 - \hat{\mu}_2)/SE_{\hat{\mu}_1 - \hat{\mu}_2}$. Statistical packages such as SPSS and R compute a p-value for the t statistic. If the p-value is less than $\alpha$, then H0 is rejected and it is common to declare the results to be "significant"; otherwise, the results are declared to be "nonsignificant". Of course, a significant result does not imply that an important difference in population means has been detected, and a nonsignificant result does not imply that the null hypothesis is true. Most psychology journals now require authors to report the t-value, df, and p-value along with a confidence interval for $\mu_1 - \mu_2$.

Two-group Nonexperimental Designs

The confidence interval for $\mu_1 - \mu_2$ (Equation 2.1) can also be applied to nonexperimental designs where participants are classified into two groups according to some preexisting characteristic (male/female, democrat/republican, freshman/sophomore, etc.) rather than being randomly assigned to treatment conditions. In nonexperimental designs, the magnitude of $\mu_1 - \mu_2$ describes the strength of a relation between the dependent variable and independent variable.
In nonexperimental designs, an observed relation between the independent variable and the dependent variable cannot be interpreted as a causal relation because the relation may be due to one or more unmeasured variables, called confounding variables, that are related to both the dependent variable and the independent variable. For example, many nonexperimental studies have compared moderate alcohol drinkers with nondrinkers and found that moderate drinkers live longer. However, moderate drinkers may differ from nondrinkers in education level, income, access to health care, moderation in consumption of unhealthy foods, and many other characteristics. It is possible that one or more of these confounding variables is responsible for the observed relation between alcohol consumption and life expectancy. Therefore, the nonexperimental finding that alcohol consumption is related to life expectancy does not imply that a nondrinker will live longer if that person begins to drink alcohol in moderation.

In a nonexperimental design, the parameters also have a different interpretation. Specifically, $\mu_1$ is the population mean of the dependent variable for all people in the study subpopulation who belong to one category (e.g., male, democrat, freshman), and $\mu_2$ is the population mean of the dependent variable for all people in the study subpopulation who belong to the other category (e.g., female, republican, sophomore). The subtle but important differences in parameter interpretation between experimental and nonexperimental designs will affect how the psychologist describes the results of a confidence interval or hypothesis test.

Assumptions for Confidence Intervals and Tests

The confidence interval for $\mu_1 - \mu_2$ assumes: 1) random sampling, 2) independence among participants, 3) the dependent variable has an approximate normal distribution in the study population for each treatment condition or subpopulation, and 4) equal population variances for each treatment condition or subpopulation (the equal variance assumption). Violating the normality assumption will not be a concern if the sample sizes per group are not too small ($n_j > 20$). Violating the equal variance assumption will not be a concern if $n_1$ and $n_2$ are not too dissimilar. However, the confidence interval for $\mu_1 - \mu_2$ can perform very poorly when the population variances are unequal and the sample sizes are unequal.
This problem is most serious when the smaller sample size is used in the treatment with the larger population variance. Both SPSS and R will compute a confidence interval for $\mu_1 - \mu_2$ that uses a separate-variance standard error

$$SE_{\hat{\mu}_1 - \hat{\mu}_2} = \sqrt{\hat{\sigma}_1^2/n_1 + \hat{\sigma}_2^2/n_2}$$

that does not require equal population variances, and this option should be used when one sample is considerably larger than the other sample. When the separate-variance standard error is used, the degrees of freedom are computed as

$$df = \left(\frac{\hat{\sigma}_1^2}{n_1} + \frac{\hat{\sigma}_2^2}{n_2}\right)^2 \Big/ \left[\frac{\hat{\sigma}_1^4}{n_1^2(n_1 - 1)} + \frac{\hat{\sigma}_2^4}{n_2^2(n_2 - 1)}\right]$$

rather than $df = n_1 + n_2 - 2$.
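The pooled-variance and separate-variance formulas, the t statistic, and the three-decision rule from the Hypothesis Testing section can be compared side by side in a short Python sketch. All function names here are hypothetical, and the unequal sample sizes (20 and 60) are invented for illustration; only the standard deviations 27.2 and 20.8 come from Example 2.1.

```python
import math

def pooled_se(sd1, sd2, n1, n2):
    """Pooled-variance standard error and its degrees of freedom."""
    df = n1 + n2 - 2
    var_p = ((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df
    return math.sqrt(var_p / n1 + var_p / n2), df

def separate_se(sd1, sd2, n1, n2):
    """Separate-variance standard error with the adjusted df."""
    v1, v2 = sd1**2 / n1, sd2**2 / n2
    se = math.sqrt(v1 + v2)
    # v1**2/(n1-1) equals sd1**4 / (n1**2 * (n1-1)), matching the df formula.
    df = (v1 + v2) ** 2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))
    return se, df

def t_statistic(mean_diff, se):
    """Independent-samples t statistic for H0: mu1 = mu2."""
    return mean_diff / se

def three_decision(lower, upper):
    """Three-decision rule applied to a confidence interval for mu1 - mu2."""
    if lower > 0:
        return "accept H1: mu1 > mu2"
    if upper < 0:
        return "accept H2: mu1 < mu2"
    return "inconclusive: H0 not rejected"

# Equal sample sizes: the two standard errors coincide.
se_p, df_p = pooled_se(27.2, 20.8, 40, 40)
se_s, df_s = separate_se(27.2, 20.8, 40, 40)

# HYPOTHETICAL unequal sizes, smaller sample paired with the larger SD.
se_p_uneq, _ = pooled_se(27.2, 20.8, 20, 60)
se_s_uneq, df_s_uneq = separate_se(27.2, 20.8, 20, 60)

print(three_decision(150.0, 171.6))
```

With equal sample sizes the two standard errors are identical and only the degrees of freedom differ. In the hypothetical unequal case the pooled standard error is smaller than the separate-variance one, which is the situation the notes warn about.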

A transformation of the dependent variable scores can reduce skewness and unequal variability between groups, but then $\mu_1$ and $\mu_2$ may become difficult to interpret. Interpretation difficulty is usually not an issue in hypothesis testing applications where the goal is simply to decide if $\mu_1$ is less than $\mu_2$ or if $\mu_1$ is greater than $\mu_2$.

Sample Size Requirements

The sample size requirement per group to estimate $\mu_1 - \mu_2$ with desired confidence and precision is approximately

$$n_j = 8\tilde{\sigma}^2 (z_{\alpha/2}/w)^2 \qquad (2.2)$$

where $w$ is the desired width of the confidence interval and $\tilde{\sigma}^2$ is a planning value of the average within-group variance of the dependent variable for the two groups. This planning value can be specified using information from published research reports, a pilot study, or the opinions of experts. If prior estimates of the dependent variable variance are unavailable but the maximum and minimum values of the dependent variable are known, the planning value of the variance could be set to $[(\max - \min)/4]^2$.

Example 2.2. A psychologist wants to conduct a study to determine the effect of achievement motivation on the types of tasks a person chooses to undertake. The study will ask participants to play a ring-toss game where they try to throw a small plastic ring over an upright post. The participants will choose how far away from the post they are when they make their tosses. The chosen distance from the post is the dependent variable. The independent variable is degree of achievement motivation (high or low) and will be manipulated by the type of instructions given to the participants. The results of a pilot study suggest that the variance of the distance scores is about [value missing] in each condition. The psychologist wants a 99% confidence interval for $\mu_1 - \mu_2$ to have a width of about 1 foot. The required sample size per group is approximately $n_j = 8\tilde{\sigma}^2(2.58/1)^2 \approx 53.25\,\tilde{\sigma}^2$, with the pilot-study variance substituted for $\tilde{\sigma}^2$.

Unequal Sample Sizes

Using equal sample sizes has two major benefits: if the population variances are approximately equal, confidence intervals are narrowest and hypothesis tests are most powerful when the sample sizes are equal, and the negative effects of violating the equal variance assumption are less severe when the sample sizes are equal. However, there are situations when equal sample sizes are less desirable. If one treatment is more expensive or risky than another treatment, the psychologist might decide to use fewer participants in the more expensive or risky treatment condition. Also, in experiments that include a control group, it might be easy and inexpensive to obtain a larger sample size for the control group.
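Equation 2.2 is easy to evaluate in Python. The sketch below is an illustration only: the function names are hypothetical, and the planning variance of 0.75 is an invented value, since the pilot-study variance from Example 2.2 is not available in this copy of the notes. The critical z-value comes from the standard library's `statistics.NormalDist`.

```python
import math
from statistics import NormalDist

def n_per_group(planning_var, width, alpha):
    """Approximate per-group sample size from Equation 2.2, rounded up."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical z-value
    return math.ceil(8 * planning_var * (z / width) ** 2)

def range_planning_var(min_value, max_value):
    """Planning value of the variance from the range: [(max - min)/4]^2."""
    return ((max_value - min_value) / 4) ** 2

# HYPOTHETICAL planning variance of 0.75, 99% confidence, width of 1 foot.
n_j = n_per_group(planning_var=0.75, width=1.0, alpha=0.01)
print(n_j)  # 40

# HYPOTHETICAL scores known to range from 0 to 20.
print(range_planning_var(0, 20))  # 25.0
```

Rounding up with `math.ceil` is a conservative choice, since the formula rarely yields a whole number and rounding down would give slightly less precision than requested.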

Graphing Results

The sample means for each group can be presented graphically using a bar chart. A bar chart for a two-group design consists of two bars, one for each group, with the height of each bar representing the value of the sample mean. Bar charts of sample means can be misleading because the sample means contain sampling error of unknown magnitude and direction. There is a tendency to incorrectly interpret the difference in bar heights as representing a difference in the population means. This misinterpretation can be avoided by graphically presenting the imprecision of the sample means with 95% confidence interval lines for each population mean, as shown in the graph below.

[Figure: bar chart of the two sample means with 95% confidence interval lines]

Internal Validity

Recall that one of the fundamental requirements for declaring a relation between two variables to be a causal relation is that the independent variable must be the only variable affecting the dependent variable. When this requirement is not satisfied, we say the internal validity of the study has been compromised. In nonexperimental designs, there will be many obvious confounding variables. However, in an experimental design, a confounding variable might go undetected. Consider the following example. Suppose a two-group experiment for the treatment of anxiety is conducted with one group receiving a widely used medication and the second group receiving a promising new drug. Suppose a statistical analysis suggests that the new drug is more effective in reducing anxiety than the old drug. However, the psychologist cannot be sure that the new drug will cause an improvement in anxiety because patients who received the new drug also received extra safety precautions to monitor for possible negative side effects. These extra precautions involved more supervision and contact with the patients. It is possible that the additional supervision, and not the new drug, caused the improvement in the patients.

Differential attrition is another problem that threatens internal validity. Differential attrition occurs when the independent variable causes the participants in one treatment condition to withdraw from treatment with higher probability than participants in another treatment. With differential attrition, participants who complete the study could differ across treatment conditions in terms of some important attribute that would then be confounded with the independent variable. Consider the following example. Suppose a psychologist conducts an experiment to evaluate two different methods of helping people overcome their fear of public speaking. One method requires participants to practice with an audience of size 20 and the other method requires participants to practice with an audience of size 5. Fifty participants were randomly assigned to each of these two training conditions, but ten dropped out of the first group and only one dropped out of the second group. The results showed that public speaking fear was lower under the first method of training (audience size of 20). However, it is possible that participants who stayed in the first group were initially less fearful than those who dropped out, and that this difference in initial fearfulness resulted in lower fear scores in the first training condition.

External Validity

External validity is the extent to which the results of a study can be generalized to different types of participants and different types of research settings. In terms of random sampling, it is usually easier to sample from a small homogeneous study population than from a larger and more heterogeneous study population. However, the external validity of the study will be greater if the psychologist samples from a larger and more diverse study population. Other ways to increase the external validity of a study will be discussed in Module 3.

Nonrandom attrition occurs when certain types of participants, regardless of treatment condition, drop out of the study with a higher probability than other participants. With nonrandom attrition, the participants who complete the study are no longer a random sample from the original study population. The remaining participants could be assumed to be a random sample from a smaller study population of participants who would have completed the study. This change in the size and nature of the study population decreases the external validity of the study. With random attrition in both groups, the samples remain random samples from the original study population with no loss in external or internal validity. However, a random loss of participants will result in a loss of power and confidence interval precision due to the smaller sample size.
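The contrast between selective and random attrition can be illustrated with a small simulation. Everything in this sketch is hypothetical: the fear-score scale (mean 50, standard deviation 10), the dropout rule, the dropout rates, and the function name are invented for illustration. The treatment has no effect at all in the simulation, so any difference between the completers' means is produced by attrition alone.

```python
import random
from statistics import mean

def simulate_attrition(n=5000, seed=1):
    """Compare selective dropout in group 1 with random dropout in group 2."""
    rng = random.Random(seed)
    # Initial fear scores; both groups come from the same population.
    group1 = [rng.gauss(50, 10) for _ in range(n)]
    group2 = [rng.gauss(50, 10) for _ in range(n)]
    # Selective attrition: highly fearful participants (score > 60)
    # in group 1 drop out with probability 0.8.
    completers1 = [x for x in group1 if not (x > 60 and rng.random() < 0.8)]
    # Random attrition: every group 2 participant drops out with
    # probability 0.13, regardless of fear score.
    completers2 = [x for x in group2 if rng.random() > 0.13]
    return mean(group1), mean(completers1), mean(group2), mean(completers2)

m1_all, m1_done, m2_all, m2_done = simulate_attrition()
print(round(m1_all, 1), round(m1_done, 1))
print(round(m2_all, 1), round(m2_done, 1))
```

In this sketch the completers in group 1 have a noticeably lower mean fear score than the full group even though no treatment was applied, while the completers in group 2 have nearly the same mean as the full group and lose only sample size.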

Ethical Issues

Any study that uses human subjects should advance knowledge and potentially lead to improvements in the quality of life, but the psychologist also has an obligation to protect the rights and welfare of the participants in the study. These two goals are often in conflict and lead to ethical dilemmas. The most widely used approach to resolving ethical dilemmas is to weigh the potential benefits of the research against the costs to the participants. Evaluating the costs and benefits of a proposed research project that involves human subjects can be extremely difficult, and this task is assigned to the Institutional Review Board (IRB) at most universities. Psychologists who plan to use human subjects in their research must submit a written proposal to the IRB for approval. The IRB carefully examines all proposals in terms of the following issues:

Informed consent: Are participants informed of the nature of the study, have they explicitly agreed to participate, and are they allowed to freely decline to participate?
Coercion to participate: Were participants coerced into participating or offered excessive inducements?
Confidentiality: Will the data collected from participants be used only for research purposes and not divulged to others?
Physical and mental stress: Does the study involve more than minimal risk? Minimal risk is defined as risk that is no greater in probability or severity than that ordinarily encountered in daily life or during a routine physical or psychological exam.
Deception: Is deception truly needed in the study? If deception is used, are participants debriefed? Debriefing is used to: 1) clarify the nature of the study to the participants, 2) reduce any stress or anxiety caused by the study, and 3) obtain feedback from participants about the nature of the study.

In addition to principles governing the treatment of human subjects, psychologists are bound by a set of ethical standards. Violation of these standards is called scientific misconduct. There are three basic types of scientific misconduct:

Scientific dishonesty: Examples include the fabrication or falsification of data, and plagiarism. Plagiarism is the use of another person's ideas, processes, results, or words without giving appropriate credit.
Unethical behavior: Examples include sexual harassment of research assistants or research participants, abuse of authority, failure to follow university or government regulations, inappropriately including or excluding authors on a research report or conference presentation, and providing a biased review of a manuscript or grant proposal.
Questionable research practices: Examples include performing an exploratory analysis of many dependent and independent variables and reporting only the variables that yield a significant result, deleting legitimate data that adversely affect the desired result, and reporting an unexpected finding as if it had been predicted from theory.


More information

Final Exam: PSYC 300. Multiple Choice Items (1 point each)

Final Exam: PSYC 300. Multiple Choice Items (1 point each) Final Exam: PSYC 300 Multiple Choice Items (1 point each) 1. Which of the following is NOT one of the three fundamental features of science? a. empirical questions b. public knowledge c. mathematical equations

More information

Confounding and Bias

Confounding and Bias 28 th International Conference on Pharmacoepidemiology and Therapeutic Risk Management Barcelona, Spain August 22, 2012 Confounding and Bias Tobias Gerhard, PhD Assistant Professor, Ernest Mario School

More information

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID # STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus

More information

Variability. After reading this chapter, you should be able to do the following:

Variability. After reading this chapter, you should be able to do the following: LEARIG OBJECTIVES C H A P T E R 3 Variability After reading this chapter, you should be able to do the following: Explain what the standard deviation measures Compute the variance and the standard deviation

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information
