Analysis of Categorical Data from the Ashe Center Student Wellness Survey

Lab 6: Analysis of Categorical Data from the Ashe Center Student Wellness Survey

Before starting this lab, you should be familiar with the difference between categorical and quantitative variables, and with how to describe distributions of categorical variables using frequency counts.

About the Data

The dataset used in this lab comes from the Ashe Center Student Wellness Survey. Load this dataset into Stata:

    . use http://www.stat.ucla.edu/labs/datasets/ashedata.dta

There should be five variables in the dataset:

drink: The survey asks, "Have you ever drunk alcohol, other than a sip or two?" Those who answered yes are coded 1; those who answered no are coded 0.

howmany: The survey asks, "During the academic year, about how many times would you say you have gotten drunk after drinking?" The response is recorded as a numerical value.

BMI: Body-mass index. Participants in the survey gave self-reported heights and weights. From these, BMI is calculated as weight in kilograms divided by the square of height in meters.

gender: A 0 represents female; a 1 represents male.

bodysat: The self-reported weight satisfaction of the participant. A 0 represents unsatisfied with body weight; a 1 represents satisfied with body weight.

The variables drink, gender, and bodysat are categorical. BMI and howmany are numerical.

Constructing and Interpreting a Two-Way Table

To evaluate the categorical variables in the dataset, we will use two-way tables. Type:

    . tabulate gender drink

The table shows the frequencies for gender cross-classified by whether the respondent has ever drunk alcohol (other than a sip or two). The values in the first row are the counts of females in the survey, and the values in the second row are the counts of males. For example, there are 106 females in the survey (gender = 0) who have not used alcohol (drink = 0), while 332 females in the survey have used alcohol (drink = 1). The totals show that 153 students in the dataset have never drunk alcohol and that 438 of the students in the dataset are women.

Question 1: Interpret the values in the second row.
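The totals and the single cell count quoted above can be checked directly. The following is a minimal sketch, assuming the lab dataset is already loaded in memory and matches the counts stated in the lab text. The one-way `tabulate` and `count` commands are not part of this lab's command list and are shown only as a cross-check.

```stata
* Assumption: the lab dataset has already been loaded with -use-.

* One-way tables give the row and column totals of the two-way table.
* The lab text reports 438 respondents with gender == 0.
tabulate gender
* The lab text reports 153 respondents with drink == 0.
tabulate drink

* Count a single cell of the two-way table directly.
* The lab text reports 106 females who have not used alcohol.
count if gender == 0 & drink == 0
```

If a count differs from the value in the lab text, the dataset in memory is probably not the version the lab was written against.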

Question 2: How many students in the dataset have drunk alcohol?

Now we will consider the row relative frequencies. Type:

    . tabulate gender drink, row

In addition to the counts from before, we now have row percentages. Consider the first column of relative frequencies. The percentages can be interpreted as follows: of the females in the survey, 24.20% (106/438) have never drunk alcohol; of the males in the survey, 23.86% (47/197) have never drunk alcohol; and 24.09% of all the students in the dataset have not used alcohol.

Question 3: Interpret the second column of relative frequencies.

Question 4: According to the SHS survey, are college-age men or college-age women more likely to drink alcohol, or are they the same? What numbers did you use to reach your conclusion?

Now let's consider the column percentages. Type:

    . tabulate gender drink, column

Again we have the counts from before, but now we have another set of percentages: the relative frequencies for the columns. These percentages can be interpreted as follows: 69.28% (106/153) of the students who have never drunk alcohol are women. Of the students who have drunk alcohol, 68.88% (332/482) are women, and 68.98% (438/635) of all the students in the dataset are women.

Question 5: Interpret the second row of relative frequencies.
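The row and column percentages quoted above differ only in which total is used as the denominator. As a rough sketch, the arithmetic can be reproduced with Stata's `display` command, using only counts that appear in the lab text. `display` is not one of the commands introduced in this lab, and the `%5.2f` format is just one reasonable choice for showing two decimal places.

```stata
* Row percentages: cell count divided by the ROW total.
* Females who have never drunk alcohol, out of all females (24.20 in the text).
display %5.2f 100*106/438
* Males who have never drunk alcohol, out of all males (23.86 in the text).
display %5.2f 100*47/197

* Column percentages: cell count divided by the COLUMN total.
* Women among students who have never drunk alcohol (69.28 in the text).
display %5.2f 100*106/153
* Women among students who have drunk alcohol (68.88 in the text).
display %5.2f 100*332/482

* Percentages in the Total row and Total column use the grand total of 635.
* Students who have never drunk alcohol, out of everyone (24.09 in the text).
display %5.2f 100*153/635
* Women out of everyone (68.98 in the text).
display %5.2f 100*438/635
```

The same numerator, 106, yields 24.20% as a row percentage and 69.28% as a column percentage, which is why it matters to state which group a percentage is "out of".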

Stata can compute one more set of percentages: cell percentages. Type:

    . tabulate gender drink, cell

Again you will see the counts, but now they are accompanied by cell percentages, each of which is taken out of the entire dataset. For example, 332 females have drunk alcohol, so the cell percentage is 52.28% (332/635). (Remember that there are 635 respondents in the survey.) Every cell relative frequency is calculated by dividing the cell count by the total number of students in the dataset.

Question 6: Interpret three of the cell frequencies.

Another question in the survey asked how often the respondents had gotten drunk in the past academic year. We can continue to investigate whether men and women are comparable in their drinking habits, but we cannot use a two-way table for this because howmany is numerical, not categorical. Instead we will use side-by-side box plots:

    . graph box howmany, by(gender)

Question 7: Describe the box plots.

Question 8: According to the box plots, do college-age men drink more than college-age women?

Commands used for Analysis of Categorical Data

Here is a list of the Stata commands we used in this Analysis of Categorical Data lab. Use the space next to each command to make notes on what that command does.

tabulate
tabulate, row
tabulate, column
tabulate, cell
graph box, by
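When describing the box plots, it can help to have the underlying numbers next to the picture. The sketch below assumes the lab dataset is loaded in memory. `display`, `tabstat`, and the `over()` option are additions for illustration and are not part of the command list above.

```stata
* Cell percentage check: females who have drunk alcohol, out of all
* 635 respondents (52.28 in the lab text).
display %5.2f 100*332/635

* Numeric summary of howmany for each gender: the median, quartiles,
* minimum, and maximum are the quantities a box plot is built from.
tabstat howmany, by(gender) statistics(median p25 p75 min max)

* Alternative layout: over() draws both boxes on a single shared axis,
* whereas by() draws one panel per group.
graph box howmany, over(gender)
```

Drawing the boxes on a shared axis with `over()` can make it easier to compare medians and spreads by eye; `by()` is the form the lab itself uses.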

Assignment

The survey and dataset also include questions about body image. The variable bodysat represents body satisfaction: a 0 means the person is unsatisfied with their body weight, and a 1 means the person is satisfied with their body weight. The variable BMI is body-mass index, a numerical variable. Using these variables and the variables from before, answer the following questions.

What percent of women are satisfied with their weight?

What percent of men are unsatisfied with their weight?

Who is more satisfied with their weight, men or women?

Do students who are unsatisfied with their weight have higher BMIs?