CEMO RESEARCH PROGRAM

Similar documents
PSYCHOLOGY (413) Chairperson: Sharon Claffey, Ph.D.

Methodological Issues in Measuring the Development of Character

Theory and Methods Question Bank

Selecting for Medical Education

Kantor Behavioral Profiles

THEORY DEVELOPMENT PROCESS

Master of Arts in Integrative Health Studies Course Descriptions

AU TQF 2 Doctoral Degree. Course Description

EXECUTIVE SUMMARY 9. Executive Summary

The four chapters in Part I set the stage. Chapter 1 moves from the implicit common sense theories of everyday life to explicit theories that are

EDP 548 EDUCATIONAL PSYCHOLOGY. (3) An introduction to the application of principles of psychology to classroom learning and teaching problems.

E-BOOK EXAMPLES OF SOCIAL PSYCHOLOGY

Measurement Invariance (MI): a general overview

PROGRAMME SPECIFICATION MSc The Psychology of Child & Adolescent Development. As above

PSYCHOMETRIC PROPERTIES OF CLINICAL PERFORMANCE RATINGS

Audio: In this lecture we are going to address psychology as a science. Slide #2

EDUCATION (EDUC) Education (EDUC) 1. EDUC EDUCATIONAL PSYCHOLOGY Short Title: EDUCATIONAL PSYCHOLOGY

Introduction. 1.1 Facets of Measurement

Psychological Approach to Comparative Education Aneela Farooq Afshan Nisar

Social Determinants and Consequences of Children s Non-Cognitive Skills: An Exploratory Analysis. Amy Hsin Yu Xie

Strategy Department of Psychology University of Copenhagen

INVESTIGATING FIT WITH THE RASCH MODEL. Benjamin Wright and Ronald Mead (1979?) Most disturbances in the measurement process can be considered a form

LEARNING. Learning. Type of Learning Experiences Related Factors

GENERALIZABILITY AND RELIABILITY: APPROACHES FOR THROUGH-COURSE ASSESSMENTS

COGNITIVE DEVELOPMENT AP PSYCHOLOGY

Origins of Sociology. Chapter 1B

FUNDAMENTALS OF PSYCHOANALYTIC THOUGHT COURSE CATALOG ACADEMIC YEAR

EXECUTIVE SUMMARY INTERPRETING FUND SCOPING PROJECT LAW INSTITUTE OF VICTORIA

Chapter 1. Lesson 2. Leadership Reshuffled. What You Will Learn to Do. Linked Core Abilities. Skills and Knowledge You Will Gain Along the Way

Module 2: Types of Groups Used in Substance Abuse Treatment. Based on material in Chapter 2 of TIP 41, Substance Abuse Treatment: Group Therapy

MENTAL TOUGHNESS. Steve Oakes

Department of Epidemiology and Population Health

Check List: B.A in Sociology

Examiner concern with the use of theory in PhD theses

Eurydice activities in the field of Early Childhood Education and Care (KD ECEC 2014)

introduction to the CFS PROCESS

Cognitive Science (COG SCI)

Research Designs and Frameworks for Population Health Improvement. Setting a Research Agenda for Population Health

Roskilde University. Publication date: Document Version Early version, also known as pre-print

PSYCHOLOGY TSAP COMPETENCY MAPPING 1. Knowledge Base in Psychology

Facilitated Discussion Notes Autism and Mental Health May 12, 2014

Most candidates were able to gain marks on this question, though there were relatively few who were able to explain interpretive sociology.

CONCEPTUAL FRAMEWORK, EPISTEMOLOGY, PARADIGM, &THEORETICAL FRAMEWORK

Introduction to Geoscience Education Research Methods

Mini Lecture Week 14 VALUES ETHICS

Item Response Theory. Steven P. Reise University of California, U.S.A. Unidimensional IRT Models for Dichotomous Item Responses

Ursuline College Accelerated Program

PSYCHOLOGY. The Psychology Major. Preparation for the Psychology Major. The Social Science Teaching Credential

Peer counselling A new element in the ET2020 toolbox

Ability to link signs/symptoms of current patient to previous clinical encounters; allows filtering of info to produce broad. differential.

Issues That Should Not Be Overlooked in the Dominance Versus Ideal Point Controversy

APSY445 Adolescent Psychology

Shaping the Economics of Happiness: The Fundamental Contributions of Richard A. Easterlin

Randomized Comparison of Parent-Teacher Consultation for Students with Autism

!"#$%&'()*+,-#$(.$/+)01$2)+3,$4#5#0('&#,*6 $%,$!"#$%&'()"#*%+,+%-.

Understanding the coach-athlete relationship from a cross-cultural perspective

Child Mental Health: A Review of the Scientific Discourse

Empirical Validation in Agent-Based Models

Carrying out an Empirical Project

Addendum Valorization paragraph

TITLE: Competency framework for school psychologists SCIS NO: ISBN: Department of Education, Western Australia, 2015

Worcestershire's Autism Strategy

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

Running Head: STEREOTYPE THREAT AND THE RACIAL ACHIEVEMENT GAP 1

VIOLENCE PREVENTION ALLIANCE TERMS OF REFERENCE

Institute: Symbiosis School for Liberal Arts. Course Name : Psychology (Major/Minor) Introduction :

The following case example illustrates the practical applicability of the DEP Model for

Subject module in Psychology

EUROPEAN ORTHODONTIC TEACHERS FORUM 2016

Understanding Science Conceptual Framework

Empowering Students to Author their Lives

HUMAN DEVELOPMENT ( ) Highlighted units are for future completion

Dimensions of Health and Illness: Toward an Integrated Model

The course is a compulsory component of semester 3 to 10 of the Master of Science programme in Psychology.

Reliability and Validity of the Hospital Survey on Patient Safety Culture at a Norwegian Hospital

COWLEY COLLEGE & Area Vocational Technical School

Table 2. Mapping graduate curriculum to Graduate Level Expectations (GDLEs) - MSc (RHBS) program

The syllabus was approved by the board of the Department of Psychology on to be valid from , autumn semester 2018.

AP PSYCH Unit 11.2 Assessing Intelligence

Key gender equality issues to be reflected in the post-2015 development framework

In 1980, a new term entered our vocabulary: Attention deficit disorder. It

COUNSELING FOUNDATIONS INSTRUCTOR DR. JOAN VERMILLION

Autism Spectrum Disorders Teacher License (proposed): Minnesota model for teacher preparation

Global Learning at Hope College Background, definitions, criteria

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth

Self-Assessment: Critical Skills for Inclusion Practitioners Developed by Kathy Obear, Ed. D., 2014

Teacher satisfaction: some practical implications for teacher professional development models

MSc Psychological Research Methods/ MPsych Advanced Psychology Module Catalogue / 2018

Field 052: Social Studies Psychology Assessment Blueprint

Why is He Doing That?

SUMMARY AND DISCUSSION

Overcoming barriers. Our strategy for

Division of Clinical Psychology The Core Purpose and Philosophy of the Profession

Multiple Act criterion:

MEASURES REGISTRY RESOURCES FOR MEASURING DIET AND PHYSICAL ACTIVITY

Leadership Practices Inventory: LPI

- Types of Conflict - Sources of Conflict - Five Styles of Intervention - Handling Conflict - Things to Remember

Curriculum for the Continuing Education Programme in Propedeutic Studies in Psychotherapy at the University of Innsbruck

Consulting Skills. Part 1: Critical assessment of Peter Block and Edgar Schein s frameworks

Transcription:

1 CEMO RESEARCH PROGRAM Methodological Challenges in Educational Measurement CEMO s primary goal is to conduct basic and applied research seeking to generate new knowledge in the field of educational measurement. In order to achieve this goal, the centre is staffed with researchers comprising a variety of backgrounds and research interests. Although CEMO is primarily concerned with measurement in the context of education, our research is also applicable to other substantive areas. CEMO research can be categorized into two major strands that are linked to each other: Basic research related to educational measurement. This includes research on both classical and modern test theory providing new insights into issues including factor analysis, item response theory, structural equation modeling or latent class analysis, as well as research on advances that combine and extend these techniques across their default boundaries. Applied research related to educational assessment. This involves research on the psychometric quality of existing national and international large-scale assessments as well as on methodological issues and challenges relating to novel assessment formats (e.g. CAT), measurement of new constructs (e.g. 21 st century skills), and how new types of data (such as logfiles) may be used to address substantive research questions.

2 Exemplary research fields Measurement (non-)invariance over time and across groups Longitudinal modelling In everyday educational work or clinical practice, development is something that is aspired, monitored, and worked on to make sure that students learn or that patients improve. Usually, progression addresses the question of how far a student or patient moved along a ruler and assumes that as long as the same ruler (i.e., measurement instrument) is used, scores can naturally be compared across time. If the ruler changes or what the ruler is trying to measure changes throughout the process, the common ground for comparisons disappears. Yet, in some situations, such changes would be a sign of development. For instance, progressing from the level of a novice student-teacher to the level of an expert veteran teacher may not simply correspond to growing more of the same competence, but may require redefining teaching practice. Similarly, the reported quality-oflife of patients may undergo a response shift as they redefine what quality of life means for them while a disease progresses or impactful events like operations happen. In such situations, the traditional assumption of measurement invariance over time does not hold and alternative ways to measure and model developmental patterns need to be provided. CEMO aims to develop the procedures involved in the measurement of such attributes and their change over time. Across-group comparisons In addition to longitudinal modelling, it is essential for educational assessment to account for potentially differing interpretations of constructs and measures across cultures or subgroups within a culture. This is not only important for clearly communicating research outcomes on group differences, but it is also essential for enabling valid and fair comparisons based on these measures. CEMO focuses on research about measurement equivalence across groups in particular with respect to national and international large-scale assessments.

3 The issue of measurement equivalence is not a binary yes/no question because measurements will always be approximately invariant only and comparable to some degree. A core objective in CEMO s research is to bridge the gap between the methodology used to study these comparability issues and the statistical modeling used to formalize measurement equivalence (e.g., factorial invariance, differential item functioning, or linking and equating errors) and the actual application of these procedures in practice. This includes both investigating reasons why some subsets of educational assessments are not comparable across countries or subgroups, as well as assessing policy implications of measurement nonequivalence, because not every detected discrepancy needs to be directly relevant for a particular comparison. The latter research objectives connect to essential gaps in research literature and practice with respect to approximate invariance and effect sizes, and the follow-up of non-invariance and biased indicators. Computer-based assessment There is a growing concern in the field of measurement that assessments should do more than provide information about how much students know or are able to do at critical stages in their educational careers (summative assessment of largely cognitive attributes). Future developments in the methodology and practice of assessment should also increasingly aim at providing students, teachers and schools with information supporting and driving their continuous learning process (formative assessment including also non-cognitive attributes). In addition to this demand for increasing the information value and widening the scope of educational tests, a demand for making data collection, processing and reporting as efficient as possible exists. Technological advances in recent years support and enable such a shift in methodology and practices of assessment. Computer-based assessment can open up opportunities to construct a new generation of educational tests, for instance with new dynamic or interactive assessment formats (possibly implemented as integrated elements of students learning activities) and with access to, modeling of and analysis of additional data

4 streams (such as response time or activity logs). Although computer-based assessments are already used to measure new areas of skills such as problem solving competences, the potential of these measures has not yet been fully exploited. Core objectives in CEMO s research are to investigate and study the potential added value and implementation challenges of new assessment formats, such as dynamic and interactive formats; and to develop an expanded psychometric toolbox for dealing with these new forms of assessment. Whereas this might require new and/or adapted statistical measurement models, the new generation of tests will still have to be evaluated in terms of reliability and validity. Thus, although educational testing purposes and formats might change, several of the fundamental measurement concerns will prevail. SUBSTANTIVE AREAS An overall rationale of CEMO s applied research is to unpack the Nordic model. The Nordic countries provide a unique social and educational context. At a macro level, economies are mostly thriving, there is low income inequality and unemployment, and there is a broad consensus that education across the life-span is mostly a public responsibility. Thus, education, including both early- and higher education, is largely free or heavily subsidized, and accessible for everyone. Moreover, the Nordic pedagogical model differs in considerable ways from educational practice in most other countries. In early education, this includes a strong focus on play-based activities and children s participation. In primary, secondary and higher education, this includes a strong focus on social skills and a positive class climate. CEMO takes two main strategies to unpack this Nordic model: the first is through international comparisons, the other is by addressing research questions investigated in other sociopolitical contexts, and analyze them with the specifics of the Nordic contexts in focus.

5 Early childhood education and care (ECEC) While considerable research on ECEC has been conducted internationally, the research base is rather scarce in the Nordic countries both regarding effects of ECEC on cognitive and language development and academic achievement, and on potentially negative side effects. Moreover, little is known about variability in the quality of children s ECEC-experiences in the Nordic countries and the consequences of this variability for children s educational attainment. Development of high-quality methodological approaches to measure socioemotional and cognitive outcomes prior to school age, and to estimate short- and long-term causal effects of ECEC on child outcomes are on CEMO s agenda to enhance knowledge of both positive and negative effects of the Nordic ECEC model. Primary and secondary education In Norway, as in many other countries, international large-scale assessments such as PISA and TIMSS, have a central role in monitoring the development of the school system. There is, however, much debate and controversy with respect to the comparability of Norwegian students performance on these tests to the results of their peers in other countries. Moreover, it is of interest, to what extent cultural differences ( are all Nordic countries the same? ), response styles, translation/language issues, within-country differences ( are all Norwegians the same? ), testing traditions, and more affect these comparisons. Such questions of measurement equivalence have high priority on CEMO s research agenda. The dominant element of national assessment in education is trust in teachers own judgements of students. In practical terms this means that teachers are given a mandate for both formative and summative assessment purposes. Grading starts in year 8, and students have to sit for selected central exams at the end of compulsory schooling. Early screening tests and national assessments are applied to gather standardized information about student achievement. CEMO research focuses on the reliability and validity of the national assessments for system-wide monitoring, as well as on the reliability and validity of teacher grades and central exams. In addition, a need exists for studies that investigate how the different types of assessments over grades and years can be linked in order to measure student progress and its antecedents and outcomes.

6 Furthermore, many research questions in the field of education are causal in nature (e.g., effects of programs or policies), and educational researchers often use observational (non-experimental data), when addressing these questions. Modeling of causes and effects requires the application of sophisticated designs. Objectives of CEMO are to develop and use state-of-the-art research designs involving experimental and longitudinal components as well as causal modeling approaches with observational data. Specifically, there is a need for research aiming at transferring to or redeveloping the latter approaches in the educational context by emphasizing more strongly the psychometrical issues so often neglected in econometric analysis of education. Higher education In cooperation with the Faculty of Medicine at the University of Oslo, CEMO examines the reliability and validity of the examination and grading system in medical education. This includes studying the benefits and limitations of different assessment formats as well as the structure, level and development of knowledge and skills during medical education along with its impact on workplace performance. Dimensionality and growth of medical knowledge and skills within a conceptual framework of medical competence, analyses of methods and rater effects within a multi-trait-multi-method framework, and the examination of content, criterion and construct validity will be major research topics in this context. This also includes the development of a feedback system to improve the formative purpose of assessment.