Internal structure evidence of validity

Similar documents
The Concept of Validity

The Concept of Validity

Questionnaire Basics

Questionnaire Basics

Analysis of the Reliability and Validity of an Edgenuity Algebra I Quiz

ASSESSING THE UNIDIMENSIONALITY, RELIABILITY, VALIDITY AND FITNESS OF INFLUENTIAL FACTORS OF 8 TH GRADES STUDENT S MATHEMATICS ACHIEVEMENT IN MALAYSIA

Psychometric Instrument Development

Psychometric Instrument Development

The Development of Scales to Measure QISA s Three Guiding Principles of Student Aspirations Using the My Voice TM Survey

Psychometric Instrument Development

Modeling the Influential Factors of 8 th Grades Student s Mathematics Achievement in Malaysia by Using Structural Equation Modeling (SEM)

Testing the Multiple Intelligences Theory in Oman

Psychometric Instrument Development

TheIntegrationofEFAandCFAOneMethodofEvaluatingtheConstructValidity

Making a psychometric. Dr Benjamin Cowan- Lecture 9

Applications of Structural Equation Modeling (SEM) in Humanities and Science Researches

Gezinskenmerken: De constructie van de Vragenlijst Gezinskenmerken (VGK) Klijn, W.J.L.

Development and Psychometric Properties of the Relational Mobility Scale for the Indonesian Population

By Hui Bian Office for Faculty Excellence

Factor Analysis. MERMAID Series 12/11. Galen E. Switzer, PhD Rachel Hess, MD, MS

Confirmatory Factor Analysis. Professor Patrick Sturgis

Basic concepts and principles of classical test theory

Survey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.

Survey research (Lecture 1)

Rong Quan Low Universiti Sains Malaysia, Pulau Pinang, Malaysia

Research Brief Reliability of the Static Risk Offender Need Guide for Recidivism (STRONG-R)

Anumber of studies have shown that ignorance regarding fundamental measurement

Validity and reliability of measurements

The Influence of Psychological Empowerment on Innovative Work Behavior among Academia in Malaysian Research Universities

Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research

Statistics for Psychosocial Research Session 1: September 1 Bill

No part of this page may be reproduced without written permission from the publisher. (

Paul Irwing, Manchester Business School

Introduction to Factor Analysis. Hsueh-Sheng Wu CFDR Workshop Series June 18, 2018

10 Exploratory Factor Analysis versus Confirmatory Factor Analysis

Questionnaire Translation

Manifestation Of Differences In Item-Level Characteristics In Scale-Level Measurement Invariance Tests Of Multi-Group Confirmatory Factor Analyses

Teachers Sense of Efficacy Scale: The Study of Validity and Reliability

HANDOUTS FOR BST 660 ARE AVAILABLE in ACROBAT PDF FORMAT AT:

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

CHAPTER VI RESEARCH METHODOLOGY

Development of self efficacy and attitude toward analytic geometry scale (SAAG-S)

existing statistical techniques. However, even with some statistical background, reading and

Examining the efficacy of the Theory of Planned Behavior (TPB) to understand pre-service teachers intention to use technology*

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

Assessing the Validity and Reliability of a Measurement Model in Structural Equation Modeling (SEM)

The Multidimensionality of Revised Developmental Work Personality Scale

Internal Consistency: Do We Really Know What It Is and How to. Assess it?

Research Questions and Survey Development

Factorial Validity and Reliability of 12 items General Health Questionnaire in a Bhutanese Population. Tshoki Zangmo *

An Assessment of the Mathematics Information Processing Scale: A Potential Instrument for Extending Technology Education Research

NORMATIVE FACTOR STRUCTURE OF THE AAMR ADAPTIVE BEHAVIOR SCALE-SCHOOL, SECOND EDITION

LISREL analyses of the RIASEC model: Confirmatory and congeneric factor analyses of Holland's self-directed search

SEM-Based Composite Reliability Estimates of the Crisis Acuity Rating Scale with Children and Adolescents

Item Analysis & Structural Equation Modeling Society for Nutrition Education & Behavior Journal Club October 12, 2015

The measurement of media literacy in eating disorder risk factor research: psychometric properties of six measures

Estimating Individual Rater Reliabilities John E. Overall and Kevin N. Magee University of Texas Medical School

Confirmatory factor analysis (CFA) of first order factor measurement model-ict empowerment in Nigeria

FACTOR ANALYSIS Factor Analysis 2006

Reliability and Exploratory Factor Analysis of Psychological Well-being in a Persian Sample

University of Nevada, Reno. A Factor Analysis of the Student School Uniform Survey

PSYCHOMETRIC PROPERTIES OF CLINICAL PERFORMANCE RATINGS

IS THE POTENTIAL BEING FULLY EXPLOITED? ANALYSIS OF THE USE OF EXPLORATORY FACTOR ANALYSIS IN MANAGEMENT RESEARCH: YEAR 2000 TO YEAR 2006 PERSPECTIVE.

Factor analysis of alcohol abuse and dependence symptom items in the 1988 National Health Interview survey

Use of Structural Equation Modeling in Social Science Research

Unit outcomes. Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2018 Creative Commons Attribution 4.0.

Unit outcomes. Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2018 Creative Commons Attribution 4.0.

A Review and Evaluation of Exploratory Factor Analysis Practices in Organizational Research

The Construct Validity and Internal Consistency of the Adult Learning Inventory (AL-i) among Medical Students

The Asian Conference on Education & International Development 2015 Official Conference Proceedings. iafor

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Re-Assessing the Usability Metric for User Experience (UMUX) Scale

The effects of ordinal data on coefficient alpha

PÄIVI KARHU THE THEORY OF MEASUREMENT

Class 7 Everything is Related

Scale Building with Confirmatory Factor Analysis

EDINBURGH HANDEDNESS INVENTORY SHORT FORM: A REVISED VERSION BASED ON CONFIRMATORY FACTOR ANALYSIS

DEVELOPMENT AND VERIFICATION OF VALIDITY AND RELIABILITY OF

Original Article. Relationship between sport participation behavior and the two types of sport commitment of Japanese student athletes

Recommended Sample Size for Conducting Exploratory Factor Analysis on Dichotomous Data

Reliability Analysis: Its Application in Clinical Practice

Methodological Issues in Measuring the Development of Character

THE USE OF CRONBACH ALPHA RELIABILITY ESTIMATE IN RESEARCH AMONG STUDENTS IN PUBLIC UNIVERSITIES IN GHANA.

An Exploratory Factor Analysis of the Global Perspective Inventory. Kelli Samonte and Dena Pastor. James Madison University

Quantitative Research. By Dr. Achmad Nizar Hidayanto Information Management Lab Faculty of Computer Science Universitas Indonesia

Validity and reliability of measurements

Hierarchical Linear Models: Applications to cross-cultural comparisons of school culture

Panel: Using Structural Equation Modeling (SEM) Using Partial Least Squares (SmartPLS)

Chapter 9. Youth Counseling Impact Scale (YCIS)

LANGUAGE TEST RELIABILITY On defining reliability Sources of unreliability Methods of estimating reliability Standard error of measurement Factors

a, Emre Sezgin a, Sevgi Özkan a, * Systems Ankara, Turkey

Validity of the Risk & Protective Factor Model

Confirmatory Factor Analysis of the Procrastination Assessment Scale for Students

The Effect of Attitude toward Television Advertising on Materialistic Attitudes and Behavioral Intention

A Short Form of Sweeney, Hausknecht and Soutar s Cognitive Dissonance Scale

The complete Insight Technical Manual includes a comprehensive section on validity. INSIGHT Inventory 99.72% % % Mean

Constructing Indices and Scales. Hsueh-Sheng Wu CFDR Workshop Series June 8, 2015

A Comparison of First and Second Generation Multivariate Analyses: Canonical Correlation Analysis and Structural Equation Modeling 1

PTHP 7101 Research 1 Chapter Assignments

Transcription:

Internal structure evidence of validity Dr Wan Nor Arifin Lecturer, Unit of Biostatistics and Research Methodology, Universiti Sains Malaysia. E-mail: wnarifin@usm.my Wan Nor Arifin, 2017. Internal structure evidence of validity by Wan Nor Arifin is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/4.0/. Outlines Measurement validity and reliability The classical view of measurement validity The validity Factor analysis Reliability Measurement validity and reliability Measurement is the process observing and recording the observations that are collected as part of a research effort. (Trochim, 2006) Measurement validity is "the degree to which the data measure what they were intended to measure", or in other words, how close the data reflect the true state of what being measured (Fletcher, Fletcher and Wagner, 1996). It is synonymous to accuracy. Measurement reliability means repeatability, reproducibility, consistency or precision (Fletcher, Fletcher and Wagner, 1996; Gordis, 2009; Trochim, 2006). It is the extent to which repeated measurements of a stable phenomenon by different people and instruments, at different times and places get similar result (Fletcher, Fletcher and Wagner, 1996). Classical way viewing validation process. The classical view of measurement validity Validity used to be divided into 3Cs (DeVellis, 1991; Fletcher, Fletcher and Wagner, 1996): 1. Content validity. 2. Criterion validity. 3. Construct validity. Nowadays, validity is described differently under the unitary concept of validity (Cook, & Beckman, 2006; American Educational Research Association, American Psychological Association, & National Council on Measurement in Education [AERA, APA & NCME], 1999). Wan Nor Arifin Internal structure evidence of validity 1

The validity Validity is the degree to which all the accumulated evidence supports the intended interpretation of test scores for the proposed purpose (AERA, APA & NCME, 1999). The validity evidence can be obtained from five sources (AERA, APA & NCME, 1999; Cook, & Beckman, 2006): 1. Content. 2. Internal structure. 3. Relations to other variables 4. Response process. 5. Consequences. Our focus Internal structure. Construct is the concept or characteristic that a test is designed to measure (AERA, APA & NCME, 1999). Construct = Domain = Concept = Idea Internal structure evidence the extent of how the relationships between the test items and components reflect the construct (AERA, APA & NCME, 1999). Evidence based on internal structure can be obtained from (Cook, & Beckman, 2006): 1. Factor analysis. 2. Reliability. Factor analysis Factoring We tend to group things that have something in common. Simplify long list of items into smaller groups. Factoring = Grouping = Clustering. The factor/group may represent the construct. Intuitive factoring List of items: Orange, motorcycle, bus, durian, banana, car Do these six items have something in common? Group the items: [Orange, durian, banana] [Motorcycle, bus, car] Wan Nor Arifin Internal structure evidence of validity 2

Name the groups: Fruit Motor-vehicle Orange, durian, banana Motorcycle, bus, car Finding something in common among the items, factoring the items and naming the factors are basically factor analysis! Factor out the common idea from the items. Correlation matrix: Let say the same items are rated on Likert-scale responses from 1 to 5 on their characteristics of being fruit or motor vehicle. Then the Pearson's correlation coefficients among the items are tabulated: Items 1 2 3 4 5 6 1. Orange 1.00 2. Durian.67 1.00 3. Banana.70.81 1.00 4. Motorcycle.11.08.05 1.00 5. Bus.08.12.09.75 1.00 6. Car.18.12.22.89.83 1.00 We examine pattern of correlation in the correlation matrix, then group highly correlated items into factors. Factors Items Fruit Motor vehicle 1. Orange X - 2. Durian X - 3. Banana X - 4. Motorcycle - X 5. Bus - X 6. Car - X However such approach is tedious for a large number of items, for example for 43 items, we must examine 43(43-1)/2 = 903 correlations. Factor analysis enables objective assessment of these correlations and factor/group the items. Factor analysis It is a multivariate statistical analysis i.e. many outcomes. Factors can be determined in mathematical way. The basis is the determination of number and nature of factors that are responsible for the Wan Nor Arifin Internal structure evidence of validity 3

correlations among items (Brown, 2006). From a number of outcomes (observed variables), factors are extracted and determined. These factors are unobserved/latent independent factors. In contrast to multiple linear regression, the one outcome and many independent factors are measurable. The analysis can be (Brown, 2006): Exploratory Exploratory Factor Analysis (EFA). Confirmatory Confirmatory Factor Analysis (CFA). Exploratory factor analysis (EFA) An exploratory procedure. Aims to explore the items, factor common concepts and generate theory. Rotation of factors is used to allow simpler solution. Orthogonal method uncorrelated factors. Varimax, Quartimax, Equamax Oblique method correlated factors. Promax, Direct Oblimin Generally there are two models (Gorsuch, 1983): Full Component Model. Common Factor Model. The types of EFA determine extraction methods. Full Component Model Extraction method: Principal component analysis (PCA) Account for all variances, suitable for data reduction, e.g. items are condensed into a factor then used as a single variable for other statistical analysis. Does not account for error in measurement. Not the 'real' factor analysis (Gorsuch, 1983; Brown, 2006). Common factor model Extraction methods: Classical: Principal axis analysis. Other variants: Image analysis, alpha analysis, maximum likelihood analysis. Attempts to account for common variance and error variance. Common variance - variance shared between the related items. Error/Unique variance - variance specific to the item. It can be further partitioned into systematic error and random error variances. The 'real' factor analysis. The maximum likelihood variant allows assessment of model fit. The common factor model will be used throughout the workshop. Wan Nor Arifin Internal structure evidence of validity 4

Confirmatory factor analysis (CFA) A confirmatory procedure. Also based on common factor model. A type of Structural Equation Modeling (SEM) analysis: Measurement model (CFA) - dealing with latent variables (factors) and the relationships between the items and the factors. Structural model (path analysis) - dealing with how latent variables are related to each other. Maximum likelihood method is commonly used for estimation. Allows assessment of measurement model fit, as well as other aspects of the validity. The main difference between EFA and CFA is that by using CFA, the researcher already established the factors and which items belong to the factors No longer exploratory. For example, CFA items: I love fast food I hate vegetable I hate eating fruits I hate exercise Obesity The items are probably based on his exploratory procedure (EFA), literature reviews, theories, or experience strong theoretical basis for the items and factors. For example, EFA items: I love cat I hate snake I love traveling I love snorkeling I support ABC football team I love driving car I love computer game I like to have everything normally distributed I am a strong believer of central limit theorem My favorite food is nasi ayam I enjoy eating pisang goreng I spend most of my time in front of computer I love R??? No idea? Use EFA. Wan Nor Arifin Internal structure evidence of validity 5

EFA The differences between EFA and CFA can be summarized in the table below: CFA Explorative procedure. No pre-requisite to specify theoretical factors for a collections of items. Aims to explore the items and extract common ideas. Theory generating based on empirical findings. Items free loading. Rotation of factors is used to allow simpler solution. Explicit hypothesis is not tested. Confirmatory procedure. Pre-specified theoretical factors. Strong theory. Just want to confirm. No cross loading of items. Fixed item loadings to pre-specified factors. Rotation not used. Explicit hypothesis testing. Allows assessment of model fit (X 2 GOF, Fit indices). Reliability In the current framework, part of validity evidence from internal structure source. Reliability are generally divided into types (Trochim, 2006; Kline, 2011): 1. Test-retest reliability 2. Parallel-forms reliability 3. Interrater reliability 4. Internal consistency reliability Internal Consistency It is the degree to which responses are consistent across the items within a construct i.e. measure the same thing (Kline, 2011) in similar direction for a particular subject. In other words, how homogenous the items in a construct in term of their variance. Low internal consistency means that the items are heterogeneous within a construct i.e. do not measure the same factor, thus the total score is not the best way to summarize the construct (Kline, 2011). When responses for items within a construct are positively correlated to each other, they may measure the same factor. In this case, high internal consistency is obtained. In comparison to the rest of reliability types, it only requires measurement on a single occasion. Wan Nor Arifin Internal structure evidence of validity 6

Cronbach's Alpha Cronbach's alpha coefficient is a common way to indicate internal consistency of a construct. Ranges from 0 to 1. When α=1, the items are all identical and perfectly correlated to each other, i.e measure the same thing. When α=0, the items are all independent and none related to each other, i.e do not measure the same thing. A generally acceptable cutoff value is 0.7 and above, while 0.6 is acceptable in exploratory research (Hair et al., 2010). However, it should not exceed 0.9 (Streiner, 2003). Raykov s rho For a CFA model with good fit, it indicates the construct/composite reliability of a factor. Reliability by Raykov's rho (Raykov, 2001) is one of the reliability indices in CFA context. It also accounts for correlated errors, if specified in the model. Construct reliability 0.7 (Hair et al., 2010) is acceptable. References American Educational Research Association, American Psychological Association, & National Council on Measurement in Education (1999). Standards for educational and psychological testing. Washington DC: American Educational Research Association. Brown, T. A. (2015). Confirmatory factor analysis for applied research (2nd ed.). New York: The Guilford Press. Cook, D. A., & Beckman, T. J. (2006). Current concepts in validity and reliability for psychometric instruments: theory and application. The American journal of medicine, 119, 166.e7-166.e16. Cronbach, L. J. & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52, 281-302. DeVellis, R. F. (1991). Scale development: Theory and applications. California: Sage Publications. Fletcher, R. H., Fletcher, S. W., & Wagner, E. H. (1996). Clinical epidemiology: the essentials (3rd ed.). Maryland: Williams & Wilkins. Gordis, L. (2009). Epidemiology (4th ed.). Philadelphia: Saunders. Hair Jr., J. F., Black, W. C., Babin, B. J. & Anderson, R. E. (2010). Multivariate data analysis (7th ed.) Upper Saddle River, NJ: Pearson Prentice-Hall. Kline, R. B. (2011). Principles and practice of structural equation modeling (3rd ed.). New York: Guilford Publications. Raykov, T. (2001). Estimation of congeneric scale reliability using covariance structure analysis with nonlinear constraints. British Journal of Mathematical and Statistical Psychology, 54, 315-323. Streiner, D. L. & Norman, G. R. (2008). Health measurement scales: a practical guide to their development and use. New York: Oxford University Press. Trochim, W. M. K. (2006). Research methods knowledge base. [Online] Retrieved from http://www.socialresearchmethods.net Wan Nor Arifin Internal structure evidence of validity 7