THIS CHAPTER COVERS: The importance of sampling. Populations, sampling frames, and samples. Qualities of a good sample.

Similar documents
I. INTRODUCING CROSSCULTURAL RESEARCH

II. WHAT S YOUR QUESTION?

Chapter 5: Producing Data

Chapter Three: Sampling Methods

Handbook of Methods in Cultural Anthropology. Copyrighted Material. Not for Reproduction

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics

Lecturer: Dr. Emmanuel Adjei Department of Information Studies Contact Information:

Chapter 1 Data Types and Data Collection. Brian Habing Department of Statistics University of South Carolina. Outline

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Principle underlying all of statistics

Lecture Start

Outline. Chapter 3: Random Sampling, Probability, and the Binomial Distribution. Some Data: The Value of Statistical Consulting

AP Psychology -- Chapter 02 Review Research Methods in Psychology

Practitioner s Guide To Stratified Random Sampling: Part 1

Math 140 Introductory Statistics

III. WHAT ANSWERS DO YOU EXPECT?

Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009

Chapter 1 Data Collection

BIOSTATISTICAL METHODS

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

Chapter 1: Exploring Data

Sta 309 (Statistics And Probability for Engineers)

REVIEW FOR THE PREVIOUS LECTURE

MATH 2300: Statistical Methods. What is Statistics?

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

PubH 7405: REGRESSION ANALYSIS. Propensity Score

Unit 1 Exploring and Understanding Data

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

Analysis A step in the research process that involves describing and then making inferences based on a set of data.

World Cultures ejournal UC Irvine

COWLEY COLLEGE & Area Vocational Technical School

STA Module 1 The Nature of Statistics. Rev.F07 1

STA Rev. F Module 1 The Nature of Statistics. Learning Objectives. Learning Objectives (cont.

Introduction, Evidence, and Sampling

Chapter 7. Sampling Techniques

MATH-134. Experimental Design

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Protocol Development: The Guiding Light of Any Clinical Study

Causal Research Design- Experimentation

Sample Size, Power and Sampling Methods

This exam consists of three parts. Provide answers to ALL THREE sections.

Introduction to Statistics

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

STA Module 1 Introduction to Statistics and Data

STA 291 Lecture 4 Jan 26, 2010

CHAPTER 3 METHOD AND PROCEDURE

Standard Cross-Cultural Sample. International Encyclopedia of the Social Sciences, 2 nd Edition

Chapter 3. Producing Data

Chapter 5: Field experimental designs in agriculture

Statistical Power Sampling Design and sample Size Determination

Confidence Intervals and Sampling Design. Lecture Notes VI

Unit 3: Collecting Data. Observational Study Experimental Study Sampling Bias Types of Sampling

Student Performance Q&A:

Research Methodologies

Survey Research Methodology

Experimental Research in HCI. Alma Leora Culén University of Oslo, Department of Informatics, Design

Strategic Sampling for Better Research Practice Survey Peer Network November 18, 2011

AP STATISTICS 2008 SCORING GUIDELINES (Form B)

INTRODUCTION TO STATISTICS SORANA D. BOLBOACĂ

SOCI 323 Social Psychology

Assignment 4: True or Quasi-Experiment

Experimental Research. Types of Group Comparison Research. Types of Group Comparison Research. Stephen E. Brock, Ph.D.

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Vocabulary. Bias. Blinding. Block. Cluster sample

Chapter 02. Basic Research Methodology

AP STATISTICS 2013 SCORING GUIDELINES

SOCIAL AND CULTURAL ANTHROPOLOGY

aps/stone U0 d14 review d2 teacher notes 9/14/17 obj: review Opener: I have- who has

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

Modeling Sentiment with Ridge Regression

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 13 Summary Experiments and Observational Studies

Basic Concepts in Research and DATA Analysis

Research Synthesis and meta-analysis: themes. Graham A. Colditz, MD, DrPH Method Tuuli, MD, MPH

Math 124: Module 3 and Module 4

Behavioral Finance 1-1. Chapter 5 Heuristics and Biases

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Scoring Adults Secure Base Use And Support

Lecture (chapter 1): Introduction

Christopher Cairns and Elizabeth Plantan. October 9, 2016

EPSE 594: Meta-Analysis: Quantitative Research Synthesis

An analysis of the use of animals in predicting human toxicology and drug safety: a review

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

Incorporation of Imaging-Based Functional Assessment Procedures into the DICOM Standard Draft version 0.1 7/27/2011

SINGLE-CASE RESEARCH. Relevant History. Relevant History 1/9/2018

Houghton Mifflin Harcourt Avancemos!, Level correlated to the

Chapter 2: Research Methods in I/O Psychology Research a formal process by which knowledge is produced and understood Generalizability the extent to

Introduction. 1.1 Facets of Measurement

Answers to end of chapter questions

Chapter 3. Producing Data

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

You must answer question 1.

COWLEY COLLEGE & Area Vocational Technical School

Math 124: Modules 3 and 4. Sampling. Designing. Studies. Studies. Experimental Studies Surveys. Math 124: Modules 3 and 4. Sampling.

The Research Roadmap Checklist

Statistical Sampling: An Overview for Criminal Justice Researchers April 28, 2016

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Culminating Assessments. Option A Individual Essay. Option B Group Research Proposal Presentation

Holt McDougal Avancemos!, Level correlated to the. Crosswalk Alignment of the National Standards for Learning Languages

Transcription:

VII. WHY SAMPLE?

THIS CHAPTER COVERS: The importance of sampling Populations, sampling frames, and samples Qualities of a good sample Sampling size Ways to obtain a representative sample based on probability methods Galton s Problem

ALL RESEARCHERS SAMPLE. Consciously or unconsciously, all researchers choose some cases out of a universe of cases to study. It may seem preferable to study all the cases in a population, but sampling has distinct advantages.

It is more economical to study a sample of cases than it is to study all the cases available. Studying all the cases is often impractical or prohibitively expensive.

Sampling may yield higher quality data. If you choose to study more cases than you reasonably have time for, you will likely feel rushed and make mistakes. It is better to sample and spend an adequate amount of time on each case.

Very accurate results can be obtained from samples if they are representative. For example, political polling in the U.S. has p r e d i c t e d v o t i n g r e s u l t s accurately using very small samples of the voting population (samples are usually in the hundreds to low thousands).

LIKE THE U.S. POPULATION, THE ETHNOGRAPHIC RECORD IS TOO LARGE TO STUDY IN ITS ENTIRETY. But what is the right sample size and how do we draw a sample out of thousands of cases? In the following slides, we briefly discuss important concepts and a variety of probability sampling techniques that allows us to create representative samples.

RELEVANT TERMS CONCEPTS TO KNOW: The universe of cases that one wants to generalize to is called a population (e.g. all cultures ethnographically described, all countries, or all adults in a society). A population is made up of cases, or more formally, units of analysis. The units of analysis are the units of the population that are actually studied (e.g. societies, countries, individuals). To sample from a population, we need to know all the units of analysis. At best, we have a list, called a sampling frame, that approximates a complete list of the cases to be sampled.

POPULATION OR UNIVERSE SAMPLING FRAME SAMPLE Units of Analysis* Sampling Method EXAMPLE: ALL THE WORLD S ETHNOGRAPHICALLY- DESCRIBED CULTURES LIST OF CULTURES WE SAMPLE FROM THE SAMPLE *The diagrams are symbolic. There are almost always many more cases or units of analysis in the population, the sampling frame, and the final sample.

WHAT MAKES A GOOD SAMPLE?

A good sample should represent a larger set of cases in an unbiased and representative way; this makes it possible to generalize results to the larger set from which the sample was drawn. Example: A researcher has a population of 12 people and randomly chooses 4 by using a table of random numbers (the first set of random numbers is 2, 5, 8, and 10). Visualization of simple random population sampling.

HOW DOES A RESEARCHER DETERMINE APPROPRIATE SAMPLE SIZE?

THE SIZE OF YOUR SAMPLE DEPENDS ON WHAT YOUR PURPOSE IS, THE TYPE OF MEASURES YOU HAVE, THE STATISTICAL TEST YOU PLAN TO USE, YOUR CRITERION FOR SIGNIFICANCE, AND HOW ACCURATE YOU HAVE TO BE. Representativeness is almost always more important than sample size. You can consult with advanced reference works to find formulas that help you determine the sample size you will need. There are different formulas for different types of statistical tests, and you can specify how accurate you want your results to be. Here are some general guidelines: The more accuracy you require, the larger the sample needs to be. o If you want to claim that a culture trait is universal, a relatively large sample is required. o If two presidential candidates are close in the polls, a larger representative sample will be needed to correctly predict the outcome; if the candidates are far apart, smaller samples will be accurate enough. If you are looking for strong associations between culture traits, relatively small representative cross-cultural samples (e.g., between 20-50 cases) are probably sufficient. If you expect weak associations between culture traits or you want to control on many variables, larger samples are needed.

WHICH TECHNIQUE SHOULD YOU USE?

In order to create a representative sample, we must adopt a sampling procedure that is not based on subjective criteria. Some type of probability sampling is the sampling method of choice for cross-cultural research. Probability sampling means that every case has a known (and non-zero) chance of being chosen.

SAMPLING FOR COMPARISONS USING SECONDARY DATA What population should you use cross-cultural research methods to generalize to? This can be referred to as your scale of analysis and depends upon your research question. Some cross-cultural researchers might want to generalize to all of the world s cultures of the recent past and present. Or, for example, a regional researcher might want to generalize to the cultures of Africa specifically. The problem is that the population of all the world s cultures is not all that clear. Descriptive information about the entirety of the world s cultures does not exist; nor is there a complete list of cultures that we could use as a sampling frame that represents all of the world s cultures. The Outline of World Cultures (Murdock 1983) was intended to be a complete list (or sampling frame) of ethnographically-documented cultures, but it incomplete and needs updating. Lacking a complete list, most cross-cultural researchers opt to use an existing published sample that claims to be either to be relatively complete or representative of the population of world s cultures. That published sample becomes their sampling frame from which they can draw a smaller final sample.

COMMONLY-USED SAMPLES IN CROSS-CULTURAL RESEARCH There are about 8 published cross-cultural samples [for a more complete discussion, see Ember and Ember (2009: 79-84). Below we discuss a few of the most commonly used samples today in order of their publication: The HRAF Collection of Ethnography is derived from the Cross- Cultural Survey, a project established at Yale University s Institute of Human Relations in 1935. Societies were chosen from Murdock s Outline of World Cultures. The paper/fiche version contains 385 cultures. The online version, ehraf World Cultures, contains data from almost 300 cultures and is growing annually. a.one distinctive feature of the HRAF collections is that its ethnographic data is subject-indexed to the paragraph level, facilitating rapid retrieval of information. b.the HRAF collection grew somewhat opportunistically; for this reason, it is not claimed that the entire collection is representative of all world cultures. However, the subset of cultures called the Probability Sample Files (PSF) does make such a claim (see the next slide).

COMMONLY-USED CROSS-CULTURAL SAMPLES, CONT D The Ethnographic Atlas (EA), with over 1200 societies, was intended by Murdock (1962-1980) to be a relatively complete list of the world s described societies, each of which was pinpointed to a time and place focus. A considerable number of coded variables are included. The EA was later truncated to better-described cases and was published as the Ethnographic Atlas Summary (Murdock 1967). Probability Sample Files (PSF), a subset of the HRAF collection, was designed by Naroll (1967) to be a representative sample of 60 world culture areas. Ethnographic information had to meet certain data quality control criteria to be included. After identifying which cultures met these criteria, one case was randomly chosen from each area. The Standard Cross-Cultural Sample (SCCS) of 186 societies (Murdock and White 1969) claimed to have chosen the best-described society for each of 186 culture areas. There are now over 2000 coded variables for this sample. The sampling method was subjective.

SAMPLING TYPES INCLUDE: Simple random sampling: guarantees that every case in the sampling frame has an equal chance of being chosen, often by assigning random numbers to the cases. If you proceed in a random order and run out of time before studying all the cases, your results are still generalizable to the sample frame. This is considered the method of choice. Online generators can be used to obtain a set of random case numbers. Systematic sampling: every nth case is chosen after a random start. All the cases within the sample must be studied to maintain generalizability to the sample frame. Stratified sampling: the sample is first divided into subgroups or strata and then randomly sampled from each. There are two types of stratified sampling: A. Proportionate stratified sampling: each subgroup is represented in proportion to its occurrence in the total population. B. Disproportionate stratified sampling: some subgroups are overrepresented and some are underrepresented. This technique is used when a researcher needs to over-represent a rare type of case in order to have enough cases of that type to study.

GALTON S PROBLEM

Some researchers have suggested that the validity of cross-cultural findings may be threatened due to cases in the sample being historically related (sharing a recent common origin or being near enough to each other for cultural diffusion to occur). When Edward Tylor presented the first cross-cultural study in 1889, Francis Galton (1889) suggested that many of Tylor s cases were duplicates of one another because they had similar histories, and for this reason many correlations could be falsely inflated. This idea is now known as Galton s Problem. Researchers disagree about the seriousness of Galton s problem as a threat to cross-cultural research. The following slide presents the two sides of the argument.

Galton s Problem is a serious threat Galton s Problem is not a serious threat Statistical associations cannot be causal if they can be attributed to common ancestry or diffusion. The societies in most cross-cultural studies speak mutually unintelligible languages. If language has diverged, other aspects of culture must have diverged as well. Steps must be taken to ensure that samples contain one culture per identified culture area. (The Standard Cross-Cultural Sample and HRAF Probability Sample both take this approach.) Random sampling of cases is the best way to prevent sampling bias; culture areas are chosen subjectively. Tests must be done to test for the effects of diffusion and common ancestry. There are two requirements of statistical inference: that sample cases be independent and the measures on them be independent. To achieve independence of sample cases, the choice of one case for a sample must not influence the choice of another. To achieve independence of measures, each case s score must be arrived at separately.

WAYS TO ENSURE THAT GALTON S PROBLEM IS NOT AFFECTING YOUR RESULTS: Test for the effects of spatial distance or linguistic distance. Use a small random sample selected from a larger list. This method makes it unlikely that more than a few cultures, at worst, will be selected from the same culture area. If you randomly sample from a larger sampling frame, redo your analyses by randomly omitting more than a single case from the same culture area. If the results are not substantially different, your original results are not affected by historical relatedness.

SUMMARY Sampling from a larger population is economical in terms of time and expense and often yields higher quality data. Sample representativeness is almost always more important than sample size Larger samples are needed for greater accuracy, for small differences or weak associations, for many control variables Probability sampling is the method of choice Lacking a complete list of the world s societies, most cross-cultural researchers opt to use a published sample that claims to be relatively complete or representative. The most commonly used cross-cultural samples and sampling frames are: The HRAF Collection of Ethnography (the digital version on ehraf World Cultures). The subset called the Probability Sample Files is based on probability sampling The Ethnographic Atlas with over 1200 societies claims to be relatively complete and can act as a sampling frame The Standard Cross-Cultural Sample Types of probability sampling include: simple random sampling, systematic sampling, proportionate stratified sampling, and disproportionate stratified sampling. The idea that correlations may be distorted by having multiple cultures with similar languages and histories in your sample is known as Galton s Problem. There is disagreement about how serious the problem may be. Various solutions have been proposed for those who are concerned about this issue.

REFERENCES Ember, Carol R. and Melvin Ember. 2009. Cross-Cultural Research Methods, 2nd edition. AltaMira/Roman & Littlefield. Galton, Francis. 1889. Comment in Discussion after Edward B. Tylor (1889). Murdock, George P. 1962-1980. Ethnographic Atlas. Ethnology 1-20. Murdock, George P. 1983. Outline of World Cultures, 6th rev. ed. Human Relations Area Files. Murdock, George P. and Douglas R. White. 1969. Standard Cross-Cultural Sample. Ethnology 8: 329-69. Naroll, Raoul. 1967. The Proposed HRAF Probability Sample. Cross-Cultural Research 2:70-80. Tylor, Edward B. 1889. On a Method of Investigating the Development of Institutions Applied to the Laws of Marriage and Descent. Journal of the Royal Anthropological Institute of Great Britain and Ireland 18: 245-72.

CITATION AND TERMS OF USE Using this course: The material in this course is intended for educational purposes only--either by individuals for personal use or for classroom use by instructors with appropriate attribution. For any commercial use or other uses please contact HRAF. (Email: <hraf@yale.edu>). This chapter is from: Carol R. Ember. 2016. Introducing Cross- Cultural Research. Human Relations Area Files. <http://hraf@yale.edu/ccc/> 2016. Human Relations Area Files.