Here are the various choices. All of them are found in the Analyze menu in SPSS, under the sub-menu for Descriptive Statistics :

Similar documents
Explore. sexcntry Sex according to country. [DataSet1] D:\NORA\NORA Main File.sav

ANOVA in SPSS (Practical)

Examining differences between two sets of scores

Using SPSS for Correlation

One-Way Independent ANOVA

Two-Way Independent ANOVA

Intro to SPSS. Using SPSS through WebFAS

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

Section 6: Analysing Relationships Between Variables

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

Analysis and Interpretation of Data Part 1

Descriptive statistics

Lampiran 4. Validitas dan Reliabilitas Prosoial Variabel Prososial (Putaran Pertama)

HW 1 - Bus Stat. Student:

Introduction to SPSS: Defining Variables and Data Entry

Before we get started:

2.4.1 STA-O Assessment 2

Undertaking statistical analysis of

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Introduction to SPSS S0

Introduction to Quantitative Methods (SR8511) Project Report

Business Research Methods. Introduction to Data Analysis

7. Bivariate Graphing

1. To review research methods and the principles of experimental design that are typically used in an experiment.

Simple Linear Regression One Categorical Independent Variable with Several Categories

Daniel Boduszek University of Huddersfield

Section I: Multiple Choice Select the best answer for each question.

Making charts in Excel

Data Analysis. A cross-tabulation allows the researcher to see the relationships between the values of two different variables

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

APPENDIX 1 ETHICAL CLEARANCE

Daniel Boduszek University of Huddersfield

CHAPTER ONE CORRELATION

Introduction to Statistical Data Analysis I

Getting a DIF Breakdown with Lertap

Chapter 5. Describing numerical data

NEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003

Statistics. Nur Hidayanto PSP English Education Dept. SStatistics/Nur Hidayanto PSP/PBI

bivariate analysis: The statistical analysis of the relationship between two variables.

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Probability and Statistics. Chapter 1

Chapter 1: Exploring Data

Use Survey Happiness Complete. SAV file on Blackboard Assigned to Mistake 8

Psychology Research Methods Lab Session Week 10. Survey Design. Due at the Start of Lab: Lab Assignment 3. Rationale for Today s Lab Session

Multiple Linear Regression Analysis

CHAPTER TWO REGRESSION

Survey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.

Survey research (Lecture 1)

PRINTABLE VERSION. Quiz 1. True or False: The amount of rainfall in your state last month is an example of continuous data.

Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0

22CHAPTER 1 SPSS Exercises Explore additional data sets

Chapter 7: Correlation

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Chapter Eight: Multivariate Analysis

Announcement. Homework #2 due next Friday at 5pm. Midterm is in 2 weeks. It will cover everything through the end of next week (week 5).

Unit 7 Comparisons and Relationships

Charts Worksheet using Excel Obesity Can a New Drug Help?

4.3 Measures of Variation

Still important ideas

Understandable Statistics

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES

Chapter Eight: Multivariate Analysis

MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE:

Daniel Boduszek University of Huddersfield

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Measurement Error 2: Scale Construction (Very Brief Overview) Page 1

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

Lab #7: Confidence Intervals-Hypothesis Testing (2)-T Test

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14

The Effectiveness of Captopril

YSU Students. STATS 3743 Dr. Huang-Hwa Andy Chang Term Project 2 May 2002

Still important ideas

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Skala Stress. Putaran 1 Reliability. Case Processing Summary. N % Excluded a 0.0 Total

Impact of Group Rejections on Self Esteem

EPS 625 INTERMEDIATE STATISTICS TWO-WAY ANOVA IN-CLASS EXAMPLE (FLEXIBILITY)

V. Gathering and Exploring Data

Appendix B. Nodulus Observer XT Instructional Guide. 1. Setting up your project p. 2. a. Observation p. 2. b. Subjects, behaviors and coding p.

Homework Exercises for PSYC 3330: Statistics for the Behavioral Sciences

MATH 1040 Skittles Data Project

Name AP Statistics UNIT 1 Summer Work Section II: Notes Analyzing Categorical Data

ROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way:

Chapter 1: Exploring Data

Statisticians deal with groups of numbers. They often find it helpful to use

0= Perempuan, 1= Laki-Laki

STP226 Brief Class Notes Instructor: Ela Jackiewicz

EXAMINING THE SOURCES OF RELIGIOSITY

Appendix 1. A. Procedure for preparing histopathology slides. The liver removed and stored immediately in buffered formalin 10 % for

VU Biostatistics and Experimental Design PLA.216

SPSS Correlation/Regression

SPSS GUIDE FOR THE DATA ANALYSIS TARGIL Michael Shalev January 2005

HS Exam 1 -- March 9, 2006

Business Statistics Probability

Measuring the User Experience

Statistical Techniques. Masoud Mansoury and Anas Abulfaraj

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Transcription:

Descriptive Statistics in SPSS When first looking at a dataset, it is wise to use descriptive statistics to get some idea of what your data look like. Here is a simple dataset, showing three different variables type of psychological problem someone was treated for, type of treatment approach used, and symptom level at the end of treatment. One of these variables (symptom) is on an interval-level scale; the other two are grouping variables (i.e., they are measured on a nominal-level scale). The type of descriptive statistics you can calculate depends on the level of data that you have to work with. Here are the options: For simple description of nominal-level variables (groups) use frequencies For more complex description of nominal-level variables use crosstabs For simple description of interval- or ratio-level variables (items measured on a scale) use the descriptives command For more complex description of interval- or ratio-level variables use the explore command Each of these options will be shown in more detail below. One Important Note: Even though problem and treatment are nominal-level variables, they have to be coded as numbers (not as text) in order to use the following procedures. See the instructions on How to Recode Variables if you need to convert from a text variable to a number variable.

Here are the various choices. All of them are found in the Analyze menu in SPSS, under the sub-menu for Descriptive Statistics :

We will start by looking at frequencies for a nominal-level variable, like problem in this dataset. Open the Frequencies dialog box: Select the variable that you are interested in from the left-hand lists, and use the arrow button to move it from the left-hand list into the right-hand list. Use the Charts button to see a graphical output of the frequencies on your variable. In this case, I have selected a histogram. You can also use pie charts or bar charts. If you aren t happy with the way your data are displayed (ascending vs. descending order, etc.), try some of the options found by clicking on the Format button.

Hit OK in the main dialog box to continue. The output for this variable looks like this: Patient Diagnosis Valid Cumulative Frequency Percent Valid Percent Percent Anxiety 5 31.3 31.3 31.3 Depression 5 31.3 31.3 62.5 Eating Disorder 6 37.5 37.5 100.0 Total 16 100.0 100.0 This is the frequency table, which is the basic output from this procedure. It shows the number of people in each group, the percent of the total in each group, the percent in each group as a proportion of just the people with complete data (that s the valid percent ), and the cumulative percent reached as you add each new group to the previous ones (that s the cumulative percent ). Notice that the numeric values have been replaced with text labels these were entered in the values column in the variable view of the data. Histogram 6 5 4 Frequency 3 2 1 0 0.5 1 1.5 2 2.5 Patient Diagnosis Here s the graphical output: This histogram shows the number of people in each category. 3 3.5 Mean = 2.06 Std. Dev. = 0.854 N = 16

For a more detailed breakdown of nominal-level data, use the crosstabs command in the Analyze/Descriptive Statistics sub-menu. This is appropriate if you want to consider two variables in combination for example, what % of the patients with an anxiety disorder were treated with cognitive-behavioral therapy? The Crosstabs dialog box lets you enter one variable (or more) as rows in a frequency table, and another variable as the columns in the same table. Use the Cells command to get percentages for each row and column:

Hit OK in the main dialog box to continue. The output from this procedure looks like this: Patient Diagnosis * Type of Therapy Crosstabulation Patient Diagnosis Total Anxiety Depression Eating Disorder Count % within Patient Diagnosis % within Type of Therapy % of Total Count % within Patient Diagnosis % within Type of Therapy % of Total Count % within Patient Diagnosis % within Type of Therapy % of Total Count % within Patient Diagnosis % within Type of Therapy % of Total Type of Therapy CBT IPT Total 3 2 5 60.0% 40.0% 100.0% 37.5% 25.0% 31.3% 18.8% 12.5% 31.3% 2 3 5 40.0% 60.0% 100.0% 25.0% 37.5% 31.3% 12.5% 18.8% 31.3% 3 3 6 50.0% 50.0% 100.0% 37.5% 37.5% 37.5% 18.8% 18.8% 37.5% 8 8 16 50.0% 50.0% 100.0% 100.0% 100.0% 100.0% 50.0% 50.0% 100.0% The central area of this table shows you the total number of patients who have each individual combination of the various levels of the two variables. The far-right column shows you the total for each diagnosis, as a percent of all patients. The bottom-most row slices the data the other way, showing you the total for each type of treatment, as a percent of all patients. The interior cells of the table also show you each individual cell s total as a percentage of the patients with that particular diagnosis, as a percentage of the patients who received that particular type of therapy, and as a percentage of the overall total.

For interval- or ratio-level variables (i.e., scales ), use the descriptives sub-command on the Analyze/Descriptive Statistics menu: This opens a dialog box where you can select the variable that you want descriptive statistics on. Make sure that what you are selecting is actually an interval- or ratio-level variable. Because all of the data are entered as numbers, SPSS will actually calculate descriptive statistics on any of these variables; but these results are only meaningful for interval- or ratio-level variables (in this case, just the score variable).

As usual, select the variable(s) that you are interested in from the left-hand column, and move them to the right-hand column. Use the Options button to select the specific descriptive statistics that you are interested in. Usually, good choices include the mean, standard deviation, maximum, and minimum. If you are concerned about the impact of outliers on your data, a measure of skewness may also be appropriate. The output from this procedure looks like this: Descriptive Statistics N Minimum Maximum Mean Std. Deviation Symptom Severity (higher = worse) 16 19.00 33.00 24.9375 4.15482 Valid N (listwise) 16 This shows you the specific results for each variable that you entered into the analysis. It is possible to get a table that gives you basic descriptive statistics for many variables simultaneously, just by moving them all at once from the left-hand to the right-hand list in the main dialog box.

Finally, to get more complex descriptive results for an interval- or ratio-level variable, use the explore command in the Analyze/Descriptive Statistics sub-menu. The following dialog box will appear: The variable that you are interested in goes into the dependent list. For now, we won t use the other lists, but we will use the plots button to select additional options.

The plots dialog box lets you select options for graphical display of the data, including a stem-and-leaf plot like this: Frequency Stem & Leaf 1.00 1. 9 7.00 2. 0112444 5.00 2. 55567 3.00 3. 123 Stem width: 10.00 Each leaf: 1 case(s)

A boxplot like this: 33 30 27 24 21 18 Symptom Severity (higher = worse) A histogram like this: Histogram 7 6 5 Frequency 4 3 2 1 0 18.00 21.00 24.00 27.00 30.00 Symptom Severity (higher = worse) 33.00 Mean = 24.9375 Std. Dev. = 4.15482 N = 16

Or a plot to test whether the data are normally distributed, like this: Normal Q-Q Plot of Symptom Severity (higher = worse) 2 1 Expected Normal 0-1 -2 18 21 24 27 Observed Value (This result is showing a slight deviation from normality, based on the difference between the actual data points and the theoretical line they should fit, but probably nothing to worry about. Some tests of normality are found earlier in the printout, and give you an actual test to determine how far off the data are from a normal distribution. If these tests are statistically significant [p <.05], you may need to do a data transformation to correct for non-normality, before you run any statistical tests): Tests of Normality 30 33 Kolmogorov-Smirnov(a) Shapiro-Wilk Statistic df Sig. Statistic df Sig. Symptom Severity (higher = worse).181 16.166.924 16.196 a Lilliefors Significance Correction

The explore command also gives you descriptive statistics for each variable, with more extensive results (median, interquartile range, etc.) than the descriptives command: Descriptives Symptom Severity (higher = worse) Mean 95% Confidence Interval for Mean Lower Bound Upper Bound Statistic Std. Error 24.9375 1.03870 22.7236 27.1514 5% Trimmed Mean Median Variance Std. Deviation Minimum Maximum Range Interquartile Range Skewness Kurtosis 24.8194 24.5000 17.263 4.15482 19.00 33.00 14.00 5.50.669.564 -.205 1.091 One additional feature of the explore command is the ability to get separate descriptive results for different sub-groups within your dataset. The way to do this is by grouping your data based on some additional variable, which is treated as a factor for breaking down the analyses. Go back to the main dialog box to add one of your nominal-level variables as a factor (the grouping variable always has to be nominal-level): After you add this factor, the results will show you two separate sets of statistics and plots one for just those patients in the CBT group, and a second set for just those patients in the IPT group.