What's Data Got to Do With It?


What's Data Got to Do With It? Health and Demographic Data Sources
Dakota Conference on Rural and Public Health, June 2017
Andrea Huseth-Zosel, PhD; Abby Gold, PhD, MPH; Mary Larson, PhD, MPH; Rick Jansen, PhD

Source: www.med.uottawa.ca/sim/data/Models/PH_Framework.png

Acquiring Needs Assessment Data

Types of Data
Primary: data that is collected to answer unique questions related to your specific needs assessment.
Secondary: data already collected by someone else (for another reason) and available for your use.

Sources of Primary Data
Single-step (single-contact) or cross-sectional (point-in-time) surveys gather primary data from individuals or groups with a single contact.
Single-step (or cross-sectional) surveys can be administered via

Sources of Primary Data
Windshield Tour or Walk-Through (Eng & Blanchard, 1990-1991, p. 96-97)
Useful indicators of community health and well-being include:
Housing types and conditions
Recreational and commercial facilities
Private and public sector services
Social and civic activities
Identifiable neighborhoods or residential clusters
Maintenance of buildings, grounds, and yards

Sources of Secondary Data: Data Collected by Government Agencies
Some data collection is mandated by law; other data are collected voluntarily.
Federal level
State and local levels

Census Bureau: American Community Survey
Federal Government
Small percentages of households
Monthly
Mandated
Helps determine how $400 billion in funds are spent

CDC: Behavioral Risk Factor Surveillance System (BRFSS)
The nation's premier system of health-related telephone surveys that collect state data about residents' health-related risk behaviors, chronic health conditions, and use of preventive services.

CDC: Community Health Status Indicators (CHSI)
The goals of the current CHSI:
1) Assess community health status and identify disparities;
2) Promote a shared understanding of the wide range of factors that are associated with health; and
3) Mobilize multi-sector partnerships to work collaboratively to improve population health.

Sources of Secondary Data: Data from Nongovernment Agencies & Organizations
Health care systems
Voluntary health agencies (e.g., facts & figures booklets)
Business, civic, and commerce groups
Local agencies and organizations often have data they have collected for their own use.
http://www.nrhi.org/news/2016countyhealthrankingsreleased/

Evaluating Secondary Data Sources

Is secondary data useful? YES!
Inexpensive: primary data collection can be EXPENSIVE; use resources already available.
Convenient: primary data collection can be time-intensive.
Availability of longitudinal data (trending): can compare trends over time.
Triangulate different sources.
Can use to make comparisons with different states, counties, cities, populations, etc.
Sources: Boslaugh, S. (2007). An introduction to secondary data analysis. Secondary data sources for public health: A practical guide, 2-10. Institute for Work and Health. (2017). What researchers mean by primary and secondary data. Retrieved from: http://www.iwh.on.ca/wrmb/primary-data-and-secondary-data

Why evaluate secondary data sources?
Source: McGarry, J. (2017). Local news graph really wants to make it seem like people don't care about Zika. Retrieved from: http://mashable.com/2016/08/15/zika-graph-gets-it-wrong/#c41bjjhddqqr

Why evaluate secondary data sources?
Garbage in = garbage out
Bad information = bad decisions

How to evaluate secondary data sources

Relevance or Fitness for Purpose
First and foremost: is the information relevant to your specific issue, specific geography, specific population?
Does the data represent what you NEED?
Example: diabetes in Wells County, ND
Source: Chen, H., Hailey, D., Wang, N., & Yu, P. (2014). A review of data quality assessment methods for public health information systems. International Journal of Environmental Research and Public Health, 11(5), 5170-5207.

Relevance or Fitness for Purpose
Example: diabetes in Wells County, ND
http://www.healthdata.org/sites/default/files/files/county_profiles/us/2015/county_report_wells_county_north_dakota.pdf

Timeliness
When was the data collected? Last year, 10 years ago? Potentially out of date.
QUESTION: Can you use data that is more than 5 years old?
Source: Exploratory Research Design: Secondary Data. Retrieved from: https://www.fep.up.pt/disciplinas/lge508/presentation%204%20secondary%20data.pdf

Timing
How long has the data been collected? One time, every other year for 10 years, every 10 years, etc.?
Is there enough data to detect trends?
Example: Is your diabetes education program effective? 2009 and 2011 data: is this enough to show a trend?
Source: https://www.ndhealth.gov/nutrphyact/north_dakota_hb1443_final_draft.pdf
Source: Exploratory Research Design: Secondary Data. Retrieved from: https://www.fep.up.pt/disciplinas/lge508/presentation%204%20secondary%20data.pdf

[Figure: Wells County, ND Census Population, 1890-2016. x-axis: census year (2016 is an estimate); y-axis: population. Data labels: 1890: 1,212; 1900: 8,310; 1910: 11,814; 1920: 12,957; 1930: 13,285; 1940: 11,198; 1950: 10,417; 1960: 9,237; 1970: 7,847; 1980: 6,979; 1990: 5,864; 2000: 5,102; 2010: 4,207; 2016 (est.): 4,098. Source: U.S. Census Bureau]

Accuracy
Completeness: Did all facilities report, if appropriate? Was there any missing data?
Correctness: Does it make sense? Is it logical?
Example: 131 motor vehicle fatalities in 2015 (NDDOT, 2015); another source says 78 fatalities in 2016.
Consistency: Is the data similar to previous time periods, or at least somewhat close? Are there outliers?
Example: 9% in 2014, 30% in 2015, 7% in 2016
Sources: Chen, H., Hailey, D., Wang, N., & Yu, P. (2014). A review of data quality assessment methods for public health information systems. International Journal of Environmental Research and Public Health, 11(5), 5170-5207. North Dakota Department of Transportation. (2015). North Dakota Crash Summary. Retrieved from: https://www.dot.nd.gov/divisions/safety/docs/crash-summary.pdf
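The consistency question above (9% in 2014, 30% in 2015, 7% in 2016) can be turned into a quick screening check. The sketch below is only an illustration: the function name and the 10-percentage-point tolerance are assumptions made for this example, not anything prescribed in the presentation, and a flagged year is a prompt to look at the documentation, not proof of an error.

```python
from statistics import median

# Yearly percentages from the consistency example in the slides.
yearly_percent = {2014: 9.0, 2015: 30.0, 2016: 7.0}

# Hypothetical tolerance (percentage points from the series median).
# This threshold is an assumption for illustration, not from the slides.
TOLERANCE_POINTS = 10.0


def flag_inconsistent_years(series, tolerance):
    """Return the years whose value is more than `tolerance` away from the median."""
    center = median(series.values())
    return sorted(
        year for year, value in series.items()
        if abs(value - center) > tolerance
    )


flagged_years = flag_inconsistent_years(yearly_percent, TOLERANCE_POINTS)
print(flagged_years)
```

With only three data points the median is a crude yardstick; a longer series would support a more careful rule.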

Interpretability/Accessibility
Is there enough information/documentation for you to determine how the data was collected and how to interpret the information?
What has been provided for you to prevent the misuse or misinterpretation of the information?

Credibility
Who collected the data? Who is posting the data? Are they reputable?
http://www.healthdata.org/sites/default/files/files/county_profiles/us/2015/county_report_wells_county_north_dakota.pdf
Source: Tang, T. Data Quality and Why We Should Care. Retrieved from: https://www.fhwa.dot.gov/policyinformation/presentations/his2013/data_quality_and_why_we_should_care.pdf

BRFSS
https://www.cdc.gov/brfss/
Prevalence Data/Data Analysis Tools
Prevalence/Trends Tools
North Dakota
1. Relevance: depends on public health issue of interest
2. Timeliness/Timing: frequency depends on question/area
3. Accuracy (completeness, correctness, consistency)
4. Interpretability/accessibility: BRFSS Data User Guide
5. Credibility: survey data and documentation

Main Uses
Assessing the health status of the population
Establish priorities, allocate resources, and plan and evaluate effectiveness

Generating Hypotheses
Specific statement that attempts to explain observations
Compare groups with very different disease rates: infant mortality and access to care
Look for common factors among different groups of people: occupational exposure to solvents
Concomitant variation: prevalence of smoking and lung cancer during same period

Examine disease distribution to look for causes
Person
  Age
  Sex/gender
  Socioeconomic status
  Measurement
Place
  International
  Geographic (within country) variation
  Urban/rural
  Localized occurrence of disease

Time
  Cyclic fluctuations/seasonal trends
  Common source and point epidemics
  Secular time trends
  Clustering: temporal and spatial

Presentation
Counts
Ratio
Proportion and percentage
Rate
Trends

Absolute vs. Relative Measures
Absolute: based on the difference between two measures of disease frequency
Relative: based on the ratio of two measures of disease frequency

Measuring Disease Occurrence
Consider:
1. Number of people affected by disease
2. Size of population at risk
3. Length of time population followed

Types of data               County A            County B
Number of diabetes cases    100                 75
Population size             50,000              5,000
Follow-up period            1 year              3 years
Comparable frequency        200/100,000/year    500/100,000/year

$RR = \frac{200/100{,}000/\text{year}}{500/100{,}000/\text{year}} = 0.4$

Crude vs. Adjusted Measures
Crude measurements are a direct comparison of numbers.
E.g., how many more cases of diabetes are seen in County A vs. County B.
Adjusted measurements are a comparison of numbers within a specific characteristic group.
E.g., how many more cases of diabetes are seen in County A vs. County B, adjusted for age categories.

Measuring Disease Occurrence
What about age distribution differences?
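The County A / County B comparison can be reproduced with a few lines of arithmetic. The following is a minimal sketch, not the presenters' own calculation: the function names are made up for this example, the age-specific counts are the ones given in the age-stratified tables of this presentation, and the second age band is assumed to mean age 50 and over.

```python
def rate_per_100k(cases, population, years):
    """Incidence rate per 100,000 person-years of follow-up."""
    person_years = population * years
    return cases * 100_000 / person_years


def rate_ratio(rate_a, rate_b):
    """Relative measure: ratio of two disease frequencies (County A over County B)."""
    return rate_a / rate_b


# Crude comparison: (cases, population, follow-up years) from the slide table.
crude_a = rate_per_100k(100, 50_000, 1)   # County A
crude_b = rate_per_100k(75, 5_000, 3)     # County B
crude_rr = rate_ratio(crude_a, crude_b)

# Age-specific comparison. The "50 and over" label is an assumption;
# the transcript only shows "Age 50" for the second band.
strata = {
    "under 50": {"A": (30, 25_000, 1), "B": (15, 2_500, 3)},
    "50 and over": {"A": (70, 25_000, 1), "B": (60, 2_500, 3)},
}
stratum_rr = {
    name: rate_ratio(rate_per_100k(*groups["A"]), rate_per_100k(*groups["B"]))
    for name, groups in strata.items()
}

print(f"crude: A={crude_a:.0f}, B={crude_b:.0f} per 100,000/year, RR={crude_rr:.2f}")
for name, rr in stratum_rr.items():
    print(f"age {name}: RR={rr:.2f}")
```

The age-specific ratios differ from the crude ratio, which is the reason for asking about age distribution before comparing the two counties.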

Age <50     Diabetes cases    Person-time    Incidence
County A    30                25,000*1       30/25,000
County B    15                2,500*3        15/7,500

$RR = \frac{30/25{,}000}{15/7{,}500} = 0.6$

Age ≥50     Diabetes cases    Person-time    Incidence
County A    100 - 30 = 70     25,000*1       70/25,000
County B    75 - 15 = 60      2,500*3        60/7,500

$RR = \frac{70/25{,}000}{60/7{,}500} = 0.35$

Types of data
Qualitative: Data that is usually measured and expressed in the form of words, concepts, themes, or categories rather than numbers. Qualitative data are often used to gain a more in-depth understanding of a particular incident or phenomenon; they answer how or why something is occurring.
Qualitative techniques include, but are not limited to:
Observation
Ethnography: the study and subsequent recording of information about human culture.
Case study: a study based on an intensive observation of one (or a few) cases or examples, such as organizations or events.
Open-ended interview
Focus group: a group of individuals led through a structured discussion of a particular topic or event. Focus groups are often used to assess social needs, develop hypotheses and survey questions, investigate the meaning of survey results, and assess the range of opinions.

Types of data
Quantitative: Data that is usually measured and expressed in the form of numbers or statistics and which usually answers the who, what, when, and where questions of a research problem.
Quantitative techniques include, but are not limited to:
Census: a complete enumeration of the population.
Survey: a systematic way of collecting information from a defined population, usually by means of interviews or questionnaires administered to a sample of the population.
Questionnaire: a method of collecting data by asking participants identical questions about a particular issue or issues. Questions may be open-ended (the answer is completely left up to the respondent) or closed-ended (respondents are presented with a limited number of options to reply, such as yes/no, true/false, or Likert scale responses).
Closed-ended interview

Bias
A systematic error in the design or conduct of a study that leads to an erroneous association between the exposure and disease.
After calculating a measure of effect, epidemiologists have to determine whether the observed result is true. This means making sure that any form of bias is minimized or eliminated.
Key facts:
Alternative explanation for an association
Not the result of a prejudiced investigator
Can pull the association estimate in either direction from the null
Varying impact of bias (can be small or large)
Avoided through careful study design and conduct

Random Error
Leads to a false association between the exposure and disease that arises from chance.
Unsystematic because it arises from an unpredictable process.
Two sources:
1. Measurement error in assessing exposure and/or disease. E.g., error when identifying cases of disease in the population or calculating person-time of follow-up.
2. Random error when sampling particular subjects for study. The study sample is unrepresentative of the target population by chance.
Random and nonrandom methods
Precision is a lack of random error.
P values and confidence intervals are probability-based statistics created so that inferences beyond the actual data can be made.

Sample size and power
In the proposal/study design phase, epidemiologists use sample size calculations to plan the enrollment numbers needed to detect a specific measure of effect.
Formulas take into account:
1. Expected magnitude of the association
2. The outcome rate in the comparison group or exposure prevalence in the control group
3. Probability of rejecting the null when it is true (alpha)
4. Probability of missing a true association (beta)
5. Relative size of the compared groups
Use previously collected data to estimate values.
Larger sample sizes give you more power to detect a smaller effect.
Power of a study refers to the ability of a test to correctly reject the null when the alternative is true.
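To make the five inputs listed above concrete, here is a rough sketch using one common textbook approximation for comparing two proportions with equal-sized groups. The presentation does not say which formula it has in mind, so the formula choice, the function name, and the example outcome rates (5%, 7%, 10%) are all assumptions for illustration.

```python
from math import ceil
from statistics import NormalDist


def sample_size_per_group(p_comparison, p_exposed, alpha=0.05, power=0.80):
    """Approximate n per group for a two-sided test of two proportions.

    Assumes equal group sizes and the unpooled-variance normal approximation:
    n = (z_{1-alpha/2} + z_{power})^2 * [p1(1-p1) + p2(1-p2)] / (p1 - p2)^2
    """
    if p_comparison == p_exposed:
        raise ValueError("the two proportions must differ")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)   # type I error (alpha)
    z_beta = NormalDist().inv_cdf(power)            # power = 1 - beta
    variance = p_comparison * (1 - p_comparison) + p_exposed * (1 - p_exposed)
    effect = p_exposed - p_comparison
    return ceil((z_alpha + z_beta) ** 2 * variance / effect ** 2)


# Hypothetical outcome rates, chosen only to show the pattern.
n_large_effect = sample_size_per_group(0.05, 0.10)   # 5% vs. 10%
n_small_effect = sample_size_per_group(0.05, 0.07)   # 5% vs. 7%
n_more_power = sample_size_per_group(0.05, 0.10, power=0.90)

print(n_large_effect, n_small_effect, n_more_power)
```

A smaller expected difference or a higher target power both push the required enrollment up, which is the same point the slide makes about larger samples and smaller effects. Unequal group sizes would need a different formula.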

Precision (Lack of Random Error)
Reduction of random error
Improved by:
Study size: enlarge the study population.
Study efficiency: make sure that the factors of interest are represented in the data and the correct population is enrolled.

Validity (Lack of Systematic Error)
Internal validity
Selection bias: factors that determine enrollment in the study distort study conclusions because they are also factors that affect the outcome.
Self-selection bias: occurs if self-enrollment into the study is related to the outcome of interest or potential risk factors.
Diagnostic bias: criteria for diagnosis are not uniform across all populations. E.g., oral contraceptive use and venous thromboembolism.
Confounders: a third variable influences both exposure and outcome (see previous lecture).
Generalizability
Use representative study groups.
The observations of one study apply to a broader experience across time and place.

Improving Precision
Efficiency and apportionment: picking how to sample individuals to improve the precision of estimates.
Precision and stratification: how are apportionments influenced when we subdivide our group into smaller groups?

Improving Validity
Choice of reference groups
Avoiding selection bias
Estrogen and endometrial cancer controversy:
Case-control studies reported a 10-fold increased risk.
Detection bias due to uterine bleeding.
Look at a group with benign gynecologic diseases.
Still found a significant increased risk.

Social Math

Social Math Process
Define your issue
Determine the focus of your data
Identify meaningful comparison
Identify your audience
Find reliable data
Visualize your story

Break the numbers down by time
"The food and beverage industry spends $2 billion a year just targeting kids; that's more than $5 million every day just to reach children and youth."

Break the number down by place
Personalize or localize your numbers
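The "break the numbers down by time" quote above is simple division, and the same pattern works for any annual figure. This is a minimal sketch: the $2 billion figure comes from the quote, while the variable names and the hourly and per-minute breakdowns are additions for illustration.

```python
# Annual figure taken from the quoted example in the slides.
ANNUAL_SPEND_DOLLARS = 2_000_000_000
DAYS_PER_YEAR = 365

# Break the annual number down into smaller time units.
per_day = ANNUAL_SPEND_DOLLARS / DAYS_PER_YEAR
per_hour = per_day / 24
per_minute = per_hour / 60

print(f"${per_day:,.0f} per day")
print(f"${per_hour:,.0f} per hour")
print(f"${per_minute:,.0f} per minute")
```

The daily figure comes out above $5 million, consistent with the quote; which time unit to present depends on what the audience can picture most easily.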

Provide comparisons to familiar things
Center for Science in the Public Interest

Provide ironic comparisons

Let's try some social math!