Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.
Descriptive Statistics Numerical facts or observations that are organized describe the frequencies, measures of central tendency, and degree of dispersion of variables in a sample of a larger population.
Levels of Measurement Reflects type of information measured and helps determine what descriptive statistics and which statistical test can be used.
Four Levels of Measurement NOIR -- no one is ready Nominal lowest level, categories, no rank Ordinal second lowest, ranked categories Interval next to highest, ranked categories with known units between rankings Ratio highest level, ranked categories with known intervals and an absolute zero
Descriptives for nominal and ordinal data Frequencies and percentages Frequencies absolute number of cases Percentages relative number of cases
Frequencies for a nominal variable Reason for referral Valid mental illness sexual abuse physical abuse neglect parentreturn Total Cumulative Frequency Percent Valid Percent Percent 7 18.9 18.9 18.9 7 18.9 18.9 37.8 8 21.6 21.6 59.5 6 16.2 16.2 75.7 9 24.3 24.3 100.0 37 100.0 100.0
Descriptives for ordinal data Quality of outcome Valid Positive Neutral Negative Total Cumulative Frequency Percent Valid Percent Percent 23 62.2 62.2 62.2 4 10.8 10.8 73.0 10 27.0 27.0 100.0 37 100.0 100.0
Descriptives for interval/ratio (scale) variables Measures of central tendency Mean -- sum of all cases divided by number of cases Median case for which half of all other cases are above and half of all other cases are below. Mode most frequently occurring case
Descriptives for scale variables Measures of dispersion Range Value of cases from minimum to maximum Standard Deviation number which when added or taken away from each case adds up to zero. Variance Standard deviation squared
Descriptives for a ratio variable MONTHS Valid 3.00 4.00 5.00 6.00 7.00 8.00 9.00 11.00 12.00 13.00 14.00 15.00 16.00 17.00 18.00 19.00 20.00 Total Cumulative Frequency Percent Valid Percent Percent 1 2.7 2.7 2.7 2 5.4 5.4 8.1 2 5.4 5.4 13.5 4 10.8 10.8 24.3 5 13.5 13.5 37.8 1 2.7 2.7 40.5 1 2.7 2.7 43.2 3 8.1 8.1 51.4 3 8.1 8.1 59.5 4 10.8 10.8 70.3 3 8.1 8.1 78.4 1 2.7 2.7 81.1 1 2.7 2.7 83.8 1 2.7 2.7 86.5 2 5.4 5.4 91.9 1 2.7 2.7 94.6 2 5.4 5.4 100.0 37 100.0 100.0
More descriptives for a ratio variable Statistics MONTHS N Mean Median Mode Std. Deviation Variance Range Minimum Maximum Valid Missing 37 0 10.8919 11.0000 7.00 4.92603 24.26577 17.00 3.00 20.00
Inferential statistics Procedures used to make inferences from sample data and generalize findings to the population
Probability Statistical significance the probability that the difference or the association found in the sample would be present in the population. Three common probabilities used <.05 <.01 <.001
Sampling bias The systematic differences between sample in study and the larger population of interest. The use of inferential statistics allows us to calculate the odds that what is found in the sample is due to sampling bias.
Statistical significance (p-levels) When p <.05, the degree of difference or association being tested would only occur by chance alone five times out of a hundred. When p <.01, the difference or association being observed would only occur by chance alone one time out of a hundred. When p <.001
Testing for statistically significant differences When you want to see if there is a difference in outcome by group membership, or by treatment approach. SPSS Analyze Compare means Independent t-test
Is there a significant difference in months of service and type of outcome? Group Statistics MONTHS Was outcome positive? yes no Std. Error N Mean Std. Deviation Mean 23 11.2174 5.11643 1.06685 14 10.3571 4.73298 1.26494 Independent Samples Test MONTHS Equal variances assumed Equal variances not assumed Levene's Test for Equality of Variances F Sig. t df Sig. (2-tailed) t-test for Equality of Means Mean Difference 95% Confidence Interval of the Std. Error Difference Difference Lower Upper.123.728.510 35.613.8602 1.68725-2.56506 4.28556.520 29.309.607.8602 1.65476-2.52258 4.24307
Statistically significant differences i.v. nominal and d.v. interval/ratio Analyze Univariate (One d.v.; multiple predictors) Multivariate (Multiple d.v.; multiple predictors) Repeated measures (time series of dependent measures; one predictor.
Statistically significant associations at higher levels of measurement Analyze Correlate Bi-variate Pearson s (two interval/ratio variables) Kendall s tau (two ordinal variables) Spearman s rho (two ordinal variables)
Test of Pearson Correlation Coefficient (r) Correlations MONTHS Percent of positive cases for each referral reasons Pearson Correlation Sig. (2-tailed) N Pearson Correlation Sig. (2-tailed) N Percent of positive cases for each referral MONTHS reasons 1.069..685 37 37.069 1.685. 37 37
Independent t-test to determine statistical significance Independent Samples Test MONTHS Equal variances assumed Equal variances not assumed Levene's Test for Equality of Variances F Sig. t df Sig. (2-tailed) t-test for Equality of Means Mean Difference 95% Confidence Interval of the Std. Error Difference Difference Lower Upper.464.509-13.516 12.000-13.0000.96186-15.09571-10.90429-13.516 10.955.000-13.0000.96186-15.11809-10.88191
Differences between groups at lower levels of measurement Analyze Descriptives Crosstabs Identify variable in row and column Select statistics Two nominal (dichotomized) Chi-Square Nominal by ordinal Kendal s tau-b Nominal by interval Eta
Difference in LOS by referral Group Statistics MONTHS Reason for referral mental illness sexual abuse Std. Error N Mean Std. Deviation Mean 7 5.2857 2.05866.77810 7 18.2857 1.49603.56544
Crosstabs to determine difference between groups Reason for referral * Was outcome positive? Crosstabulation Reason for referral Total mental illness sexual abuse physical abuse neglect parentreturn Count % within Reason for referral Count % within Reason for referral Count % within Reason for referral Count % within Reason for referral Count % within Reason for referral Count % within Reason for referral Was outcome positive? yes no Total 6 1 7 85.7% 14.3% 100.0% 5 2 7 71.4% 28.6% 100.0% 5 3 8 62.5% 37.5% 100.0% 3 3 6 50.0% 50.0% 100.0% 4 5 9 44.4% 55.6% 100.0% 23 14 37 62.2% 37.8% 100.0%
Chi-Square tests Pearson Chi-Square Likelihood Ratio Linear-by-Linear Association N of Valid Cases a. Chi-Square Tests Asymp. Sig. Value df (2-sided) 3.485 a 4.480 3.696 4.449 3.334 1.068 37 9 cells (90.0%) have expected count less than 5. The minimum expected count is 2.27.
Which test to use when? Decision is made by what the question is, the level of measurement of the variable and the extent to which assumptions of parametric statistics are met. Question: Difference or Association? Level of measurement: NOIR? Sample size and distribution (normal?)
Tests comparing difference between 2 or more groups Test Paired (dependent) t-test Unpaired (independent t- test Dependent variable Interval/ratio pre and post tests Interval/ratio Independent variable Nominal Nominal (2 grps) ANOVA F-test Interval/ratio Nominal (>2 grps) Chi-Square (Nonparametric) Nominal (Dichotomous) Nominal
Tests demonstrating association between two groups Test Dependent var. Independent var. Spearman rho Ordinal Ordinal Mann-Whitney U Ordinal Nominal Non-parametric Pearson s r Interval/ratio Interval/ratio
Tests demonstrating association between two groups, controlling for third variable Test Dependent Independent Logistic Nominal Nominal regression Linear regression Interval/ratio Interval/ratio Pearson partial r Interval/ratio Interval/ratio Kendall s partial r Ordinal Ordinal
Posted with Permission from: Bernie S. Newman, Ph.D., L.C.S.W. Associate Professor MSW Program Director Temple University School of Social Administration 1301 Cecil B. Moore Ave. 505 Ritter Annex Philadelphia, PA 19122 215-204-1205