Maltreatment Reliability Statistics last updated 11/22/05

Size: px
Start display at page:

Download "Maltreatment Reliability Statistics last updated 11/22/05"

Transcription

1 Maltreatment Reliability Statistics last updated 11/22/05 Historical Information In July 2004, the Coordinating Center (CORE) / Collaborating Studies Coordinating Center (CSCC) identified a protocol to assess the reliability of RNA/MB coding. This was approved by 4 of 5 sites (plus CORE) in August The plan was to run reliability statistics on: (1) number of codeable allegations, (2) number of codeable findings, (3) Type of maltreatment (allegations), (4) type of maltreatment (findings), and (5) conclusion codes. All sites coded all selected observations to equal a final number of 129 cases (X 5 sites + the original data). Case Selection Five percent of cases were selected from the RNA/MB pool (for 0407 data). Selection criteria was as follows: (1) must be an allegation and findings narrative, (2) must be at least 1 valid NIS2 & MMCS code from the allegations sections and findings sections, (3) there is a valid date of referral and/or incident, and (4) the allegation/findings narratives are available in the dataset or can be obtained from the sites. Cases were randomly selected according to the criteria above. The CSCC selected the cases. On the initial run, all sites were able to provide narratives or narratives were in the datasets with the exception of the Southern Site (SO). They were unable to provide narratives on two of their selected cases. A second round of random selection for SO occurred and two additional cases were identified that did have narratives in the dataset. Data Entry The CSCC created a data entry system so that the reliability coding from the five sites could be entered. The data entry system was set up using FSEDIT (in SAS). Laura Respess (CORE Program Assistant) was charged with entering the data. Jamie Smith (CORE Applications Analyst) reviewed 10% (n = 60) of the entered cases for data entry accuracy. There were NO data entry errors in any of the 60 cases. During data entry, Jamie Smith reviewed the coding from the sites for any (nonreliability) errors. Jamie (and Liz Knight, CORE co-investigator) communicated with the sites to correct the errors and/or clarify the instructions. A subsample of cases required additional information (not otherwise included in the narrative) to enable sites to accurately code the narrative (n = 8). Sites who contributed the original data provided the additional information. All sites were requested to use the supplemental information to code the reliability narratives for the identified subsample. Reliability Statistics Reliability statistics were conducted for allegation and findings data separately. Kappas and Interclass Correlation Coefficients were conducted depending on the type of data. The specific analysis variables (and necessary statistics) are as follows:

2 1. Number of codeable allegations ICC 2. Number of codeable findings ICC 3. Conclusion codes Kappa 4. Maltreatment type allegations NIS2 - Kappa 5. Maltreatment type findings NIS2 - Kappa 6. Maltreatment type allegations MMCS - Kappa 7. Maltreatment type findings MMCS- Kappa 8. Maximum Severity Codes by Maltreatment Type for MMCS Allegations - ICC Note that sites coded conclusions and maltreatment types according to the RNA/MB reliability codebook as if they were coding an original case file. However, analyses will be conducted only on the broad types of maltreatment (not the subtypes) and whether an allegation was substantiated or not (yes or no). See below: NIS2 allegation & findings codes = broad type Physical Abuse = Sexual Abuse = Emotional Abuse = Physical Neglect = Educational Neglect = Emotional Neglect = Other maltreatment = MMCS allegation & findings codes = broad type Physical Abuse = Sexual Abuse = 200 Physical Neglect, failure to provide = Physical Neglect, lack of supervision = Emotional maltreatment = 500 Moral/Legal Maltreatment = 600 Educational Maltreatment = 700 Drugs/Alcohol = 800 Conclusion Codes Codes of 1 or 3 = substantiated Codes of 2, 4, 5, 6, or 7 = not substantiated Severity Codes The maximum severity value for a particular type of maltreatment across a record Results Analyses were completed on 4/29/05. See Tables 1 and 2 for Kappa statistics for type of allegation, finding, and conclusion codes for MMCS and NIS2 codes. The methodology used for computing kappa statistics are presented in Fleiss (1981). The macro used to compute interclass correlations were based on Shrout and Fleiss (1979). The Shrout- Fleiss Reliability Random Set are reported for the current analyses. Analyses on the Maximum Severity Codes were conducted on 9/22/05. Table 3 details the maximum severity code reliability statistics.

3 Allegations Kappas for MMCS codes from the allegations narrative ranged from (M =.76). All of the categories with the exception of the moral/legal category have Kappas exceeding.70 (the value typically considered acceptable). Similar results were obtained for Kappas of NIS2 allegation codes with a range of (M =.77). Again, with the exception of one category (emotional neglect) all Kappas were above.70. Interclass correlation coefficients were obtained for the number of allegations coded for MMCS and NIS2 and =.79 and.74 respectively. Findings Kappas for MMCS codes from the findings narrative ranged from (M =.72). All of the categories with the exception of the moral/legal category had Kappas exceeding.70. Similar results were obtained for Kappas of NIS2 allegation codes with a range of (M =.73). Again, with the exception of one category (emotional neglect) all Kappas were above.70. Interclass correlation coefficients were obtained (Shrout-Fleiss Reliability Random Set) for the number of findings coded for MMCS and NIS2 and =.75 and.65 respectively. Conclusion Codes Kappas for conclusion codes based on the MMCS coding of the findings narrative ranged from (M =.54). The lowest value was obtained for the moral/legal category (k=.14), the highest for educational maltreatment (k =.73). Only two kappa values were at or above.70. Kappas for conclusion codes based on the NIS2 coding of the findings narrative ranged from (M =.56). The lowest value was obtained for the emotional abuse category (k=.34), the highest for educational neglect (k =.73). Only one kappa value was at or above.70. Maximum Severity Reliability statistics were conducted on the maximum severity codes by each MMCS maltreatment category except Moral/Legal and Drugs/Alcohol. Two sets of analyses were conducted. The first set included the substitution of a missing value if the maltreatment type was not coded from the allegation (range of values = 1 6, depending on the maltreatment type). The second set included the substitution of a value of 0 if the maltreatment type was not coded from the allegation (range of values = 0-6). Note that the first set are a much more restrictive set of analyses given any disagreement over the maltreatment type are subsequently thrown out before assessing agreement on maximum severity. The latter set of runs may potentially inflate agreement. ICCs for the first set ranged from.30 (Educational Neglect) to.65 (Lack of supervision). Most were in the.6 range. ICCs from the second set ranged from.57 (Educational Neglect) to.87 (Sexual Abuse).

4 A Word about Kappas Landis and Koch (1977) attempted to provide some measure of agreement for kappa values in various ranges: < = 0 = poor = slight = fair = moderate = substantial = almost perfect However, Munoz and Bangdiwala (1997) compared the Kappa statistics by Cohen (1960) and the B statistic by Bangdiwala (1985) and offer an alternative labeling system for the Kappa statistic = poor = fair = moderate = substantial = almost perfect 1.0 = perfect Conclusions With either interpretation, the Kappas obtained for the current analyses appear to range from moderate to almost perfect in most instances. Caution is suggested for use of the conclusion codes. Assessment of severity codes proved more difficult given the conditional nature of the presence of severity codes (maltreatment type had to be coded). Analyses were run two ways, one that is highly conservative and one that may potentially inflate agreement. Although neither of these approaches is optimal, arguments can be made that either approach is sufficient. From the conservative perspective, reliability was only assessed for those coders/records who agreed that a maltreatment type was codeable from the referral narrative. The denominator for these analyses will be significantly lower than that used in the second set. Thus disagreements have a greater impact on the reliability. In contrast, those who agreed that a maltreatment type did not occur are getting credit for agreeing there is no severity code (= 0). Thus all records (N = 129*6 raters) were included in the analyses potentially minimizing disagreements as compared to the first approach. Either is defensible and reportable, however the CSCC recommends reporting the range (or upper and lower figure) from these two sets of analyses when describing reliability in manuscripts with a brief description of the assessment process. Given the complexity of coding CPS records, the span of the ages represented in the current sample, and coder turnover at the sites, we are very encouraged with these figures and congratulate (and thank) everyone for their efforts with this process.

5 Reliability Summary In winter of a formal assessment of CPS narrative coding reliability was conducted among all active coders at each of the five sites plus the original data. Approximately five percent of CPS records (N = 129) currently in the LONGSCAN cross-site database, as of 9/20/04, were selected for review. Analyses were conducted to measure agreement on (a) the number of allegations and substantiations, (b) the type of maltreatment at referral and the investigation by CPS, (c) conclusions about maltreatment based on CPS investigation, and (d) the severity of maltreatment based on the referral information. These categories are consistent with the way the CPS data is commonly used for analyses within LONGSCAN and for classifying the maltreatment experiences of the study child participants. Reliability analyses focused on coding using the MMCS and NIS2 classification systems. Results indicated reliability ranged from moderate to almost perfect for nearly every category of analysis except coding of substantiated maltreatment based on the CPS findings narratives. Given the complexity of coding CPS records across agencies and states, the span of the ages at the time of referral, and the change in coders inherent in a longitudinal study, these figures are encouraging and represent the quality and consistency of training. Results Table 1. Kappas for MMCS Allegation, Findings, and Conclusion Codes Type of Maltx Allegation Findings Conclusion Code Physical Abuse Sexual Abuse Neglect Failure to Provide Lack of Supervision Emotional Maltx Moral/Legal Educational Maltx Drugs/Alcohol Table 2. Kappas for NIS2 Allegation, Findings, and Conclusion Codes Type of Maltx Allegation Findings Conclusion Code Physical Abuse Sexual Abuse Emotional Abuse Physical Neglect Educational Neglect Emotional Neglect Other Maltx

6 Table 3. ICCs for Maximum Severity Codes for MMCS Maltreatment Types Set 1 (severity without a maltreatment code =.) Set 2 (severity without a maltreatment code = 0) Type of Maltx Physical Abuse Sexual Abuse Failure to Provide Lack of Supervision Emotional Maltx Educational Maltx Note: Maximum severity codes were selected from MMCS Allegations Bibliography Cohen, J. (1960). A coefficient for agreement for nominal scales. Educational and Psychological Measurement, 20, Fleiss, J. (1981). Balanced incomplete block designs for inter-rater reliability studies. Applied-Psychological-Measurement, 5, Landis, J.R., & Koch, G.G. (1977). The measure of observer agreement for categorical data. Biometrics, 33, Munoz, S.R., & Bangdiwala, S.I. (1997). Interpretation of Kappa and B Statistics measures of agreement. Journal of Applied Statistics, 24, Shrout, P.E., & Fleiss, J.L. (1979). Interclass correlations: Uses in assessing rater reliability. Psychological Bulletin, 86,

COMPUTING READER AGREEMENT FOR THE GRE

COMPUTING READER AGREEMENT FOR THE GRE RM-00-8 R E S E A R C H M E M O R A N D U M COMPUTING READER AGREEMENT FOR THE GRE WRITING ASSESSMENT Donald E. Powers Princeton, New Jersey 08541 October 2000 Computing Reader Agreement for the GRE Writing

More information

COMMITMENT &SOLUTIONS UNPARALLELED. Assessing Human Visual Inspection for Acceptance Testing: An Attribute Agreement Analysis Case Study

COMMITMENT &SOLUTIONS UNPARALLELED. Assessing Human Visual Inspection for Acceptance Testing: An Attribute Agreement Analysis Case Study DATAWorks 2018 - March 21, 2018 Assessing Human Visual Inspection for Acceptance Testing: An Attribute Agreement Analysis Case Study Christopher Drake Lead Statistician, Small Caliber Munitions QE&SA Statistical

More information

English 10 Writing Assessment Results and Analysis

English 10 Writing Assessment Results and Analysis Academic Assessment English 10 Writing Assessment Results and Analysis OVERVIEW This study is part of a multi-year effort undertaken by the Department of English to develop sustainable outcomes assessment

More information

Unequal Numbers of Judges per Subject

Unequal Numbers of Judges per Subject The Reliability of Dichotomous Judgments: Unequal Numbers of Judges per Subject Joseph L. Fleiss Columbia University and New York State Psychiatric Institute Jack Cuzick Columbia University Consider a

More information

Closed Coding. Analyzing Qualitative Data VIS17. Melanie Tory

Closed Coding. Analyzing Qualitative Data VIS17. Melanie Tory Closed Coding Analyzing Qualitative Data Tutorial @ VIS17 Melanie Tory A code in qualitative inquiry is most often a word or short phrase that symbolically assigns a summative, salient, essence capturing,

More information

LEVEL ONE MODULE EXAM PART TWO [Reliability Coefficients CAPs & CATs Patient Reported Outcomes Assessments Disablement Model]

LEVEL ONE MODULE EXAM PART TWO [Reliability Coefficients CAPs & CATs Patient Reported Outcomes Assessments Disablement Model] 1. Which Model for intraclass correlation coefficients is used when the raters represent the only raters of interest for the reliability study? A. 1 B. 2 C. 3 D. 4 2. The form for intraclass correlation

More information

Ryan Mattek, PhD Letitia Johnson PhD. SRA-FV: Evidence of Inter-rater Reliability in a Combined SOMMI Sample

Ryan Mattek, PhD Letitia Johnson PhD. SRA-FV: Evidence of Inter-rater Reliability in a Combined SOMMI Sample Ryan Mattek, PhD Letitia Johnson PhD SRA-FV: Evidence of Inter-rater Reliability in a Combined SOMMI Sample Declarations We have no financial interests to declare Goals/Objectives 1. Participants will

More information

Statistical Validation of the Grand Rapids Arch Collapse Classification

Statistical Validation of the Grand Rapids Arch Collapse Classification Statistical Validation of the Grand Rapids Arch Collapse Classification David Burkard, BS Michelle Padley, CRTM John Anderson, MD Donald Bohay, MD John Maskill, MD Daniel Patton, MD Orthopaedic Associates

More information

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many

More information

Running head: ATTRIBUTE CODING FOR RETROFITTING MODELS. Comparison of Attribute Coding Procedures for Retrofitting Cognitive Diagnostic Models

Running head: ATTRIBUTE CODING FOR RETROFITTING MODELS. Comparison of Attribute Coding Procedures for Retrofitting Cognitive Diagnostic Models Running head: ATTRIBUTE CODING FOR RETROFITTING MODELS Comparison of Attribute Coding Procedures for Retrofitting Cognitive Diagnostic Models Amy Clark Neal Kingston University of Kansas Corresponding

More information

2 Philomeen Weijenborg, Moniek ter Kuile and Frank Willem Jansen.

2 Philomeen Weijenborg, Moniek ter Kuile and Frank Willem Jansen. Adapted from Fertil Steril 2007;87:373-80 Intraobserver and interobserver reliability of videotaped laparoscopy evaluations for endometriosis and adhesions 2 Philomeen Weijenborg, Moniek ter Kuile and

More information

Comparing Vertical and Horizontal Scoring of Open-Ended Questionnaires

Comparing Vertical and Horizontal Scoring of Open-Ended Questionnaires A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

(true) Disease Condition Test + Total + a. a + b True Positive False Positive c. c + d False Negative True Negative Total a + c b + d a + b + c + d

(true) Disease Condition Test + Total + a. a + b True Positive False Positive c. c + d False Negative True Negative Total a + c b + d a + b + c + d Biostatistics and Research Design in Dentistry Reading Assignment Measuring the accuracy of diagnostic procedures and Using sensitivity and specificity to revise probabilities, in Chapter 12 of Dawson

More information

Relationship Between Intraclass Correlation and Percent Rater Agreement

Relationship Between Intraclass Correlation and Percent Rater Agreement Relationship Between Intraclass Correlation and Percent Rater Agreement When raters are involved in scoring procedures, inter-rater reliability (IRR) measures are used to establish the reliability of measures.

More information

Validity and reliability of measurements

Validity and reliability of measurements Validity and reliability of measurements 2 Validity and reliability of measurements 4 5 Components in a dataset Why bother (examples from research) What is reliability? What is validity? How should I treat

More information

Victoria YY Xu PGY-3 Internal Medicine University of Toronto. Supervisor: Dr. Camilla Wong

Victoria YY Xu PGY-3 Internal Medicine University of Toronto. Supervisor: Dr. Camilla Wong Validity, Reliability, Feasibility and Acceptability of Using the Consultation Letter Rating Scale to Assess Written Communication Competencies Among Geriatric Medicine Postgraduate Trainees Victoria YY

More information

AAPOR Exploring the Reliability of Behavior Coding Data

AAPOR Exploring the Reliability of Behavior Coding Data Exploring the Reliability of Behavior Coding Data Nathan Jurgenson 1 and Jennifer Hunter Childs 1 Center for Survey Measurement, U.S. Census Bureau, 4600 Silver Hill Rd. Washington, DC 20233 1 Abstract

More information

Comparison of the Null Distributions of

Comparison of the Null Distributions of Comparison of the Null Distributions of Weighted Kappa and the C Ordinal Statistic Domenic V. Cicchetti West Haven VA Hospital and Yale University Joseph L. Fleiss Columbia University It frequently occurs

More information

Victoria YY Xu PGY-2 Internal Medicine University of Toronto. Supervisor: Dr. Camilla Wong

Victoria YY Xu PGY-2 Internal Medicine University of Toronto. Supervisor: Dr. Camilla Wong Validity, Reliability, Feasibility, and Acceptability of Using the Consultation Letter Rating Scale to Assess Written Communication Competencies Among Geriatric Medicine Postgraduate Trainees Victoria

More information

reproducibility of the interpretation of hysterosalpingography pathology

reproducibility of the interpretation of hysterosalpingography pathology Human Reproduction vol.11 no.6 pp. 124-128, 1996 Reproducibility of the interpretation of hysterosalpingography in the diagnosis of tubal pathology Ben WJ.Mol 1 ' 2 ' 3, Patricia Swart 2, Patrick M-M-Bossuyt

More information

A profiling system for the assessment of individual needs for rehabilitation with hearing aids

A profiling system for the assessment of individual needs for rehabilitation with hearing aids A profiling system for the assessment of individual needs for rehabilitation with hearing aids WOUTER DRESCHLER * AND INGE BRONS Academic Medical Center, Department of Clinical & Experimental Audiology,

More information

NIH Public Access Author Manuscript Tutor Quant Methods Psychol. Author manuscript; available in PMC 2012 July 23.

NIH Public Access Author Manuscript Tutor Quant Methods Psychol. Author manuscript; available in PMC 2012 July 23. NIH Public Access Author Manuscript Published in final edited form as: Tutor Quant Methods Psychol. 2012 ; 8(1): 23 34. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial

More information

Psychotherapy research historically focused predominantly

Psychotherapy research historically focused predominantly Rater Agreement on Interpersonal Psychotherapy Problem Areas John C. Markowitz, M.D. Andrew C. Leon, Ph.D. Nina L. Miller, Ph.D. Sabrina Cherry, M.D. Kathleen F. Clougherty, A.C.S.W. Liliana Villalobos,

More information

7/17/2013. Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course

7/17/2013. Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course David W. Dowdy, MD, PhD Department of Epidemiology Johns Hopkins Bloomberg School of Public Health

More information

THE ESTIMATION OF INTEROBSERVER AGREEMENT IN BEHAVIORAL ASSESSMENT

THE ESTIMATION OF INTEROBSERVER AGREEMENT IN BEHAVIORAL ASSESSMENT THE BEHAVIOR ANALYST TODAY VOLUME 3, ISSUE 3, 2002 THE ESTIMATION OF INTEROBSERVER AGREEMENT IN BEHAVIORAL ASSESSMENT April A. Bryington, Darcy J. Palmer, and Marley W. Watkins The Pennsylvania State University

More information

02a: Test-Retest and Parallel Forms Reliability

02a: Test-Retest and Parallel Forms Reliability 1 02a: Test-Retest and Parallel Forms Reliability Quantitative Variables 1. Classic Test Theory (CTT) 2. Correlation for Test-retest (or Parallel Forms): Stability and Equivalence for Quantitative Measures

More information

Examining Inter-Rater Reliability of a CMH Needs Assessment measure in Ontario

Examining Inter-Rater Reliability of a CMH Needs Assessment measure in Ontario Examining Inter-Rater Reliability of a CH Needs Assessment measure in Ontario CAHSPR, Halifax, ay 2011 Team: Janet Durbin, Elizabeth Lin, Carolyn Dewa, Brenda Finlayson, Stephen Gallant, April Collins

More information

Figure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b)

Figure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b) Page 1 of 1 Diagnostic test investigated indicates the patient has the Diagnostic test investigated indicates the patient does not have the Gold/reference standard indicates the patient has the True positive

More information

Pain Assessment in Elderly Patients with Severe Dementia

Pain Assessment in Elderly Patients with Severe Dementia 48 Journal of Pain and Symptom Management Vol. 25 No. 1 January 2003 Original Article Pain Assessment in Elderly Patients with Severe Dementia Paolo L. Manfredi, MD, Brenda Breuer, MPH, PhD, Diane E. Meier,

More information

Seemingly isolated greater trochanter fractures do not exist

Seemingly isolated greater trochanter fractures do not exist Seemingly isolated greater trochanter fractures do not exist Poster No.: B-0950 Congress: ECR 2012 Type: Scientific Paper Authors: D. Dunker, J. H. Göthlin, M. Geijer ; Gothenburg/SE, Lund/SE Keywords:

More information

Agreement Coefficients and Statistical Inference

Agreement Coefficients and Statistical Inference CHAPTER Agreement Coefficients and Statistical Inference OBJECTIVE This chapter describes several approaches for evaluating the precision associated with the inter-rater reliability coefficients of the

More information

Evaluating the Endoscopic Reference Score for eosinophilic esophagitis: moderate to substantial intra- and interobserver reliability

Evaluating the Endoscopic Reference Score for eosinophilic esophagitis: moderate to substantial intra- and interobserver reliability Original article 1049 Evaluating the Endoscopic Reference Score for eosinophilic esophagitis: moderate to substantial intra- and interobserver reliability Authors Institution submitted 29. January 2014

More information

A Coding System to Measure Elements of Shared Decision Making During Psychiatric Visits

A Coding System to Measure Elements of Shared Decision Making During Psychiatric Visits Measuring Shared Decision Making -- 1 A Coding System to Measure Elements of Shared Decision Making During Psychiatric Visits Michelle P. Salyers, Ph.D. 1481 W. 10 th Street Indianapolis, IN 46202 mpsalyer@iupui.edu

More information

Evaluating Quality in Creative Systems. Graeme Ritchie University of Aberdeen

Evaluating Quality in Creative Systems. Graeme Ritchie University of Aberdeen Evaluating Quality in Creative Systems Graeme Ritchie University of Aberdeen Graeme Ritchie {2007} Some Empirical Criteria for Attributing Creativity to a Computer Program. Minds and Machines 17 {1}, pp.67-99.

More information

Validity and reliability of measurements

Validity and reliability of measurements Validity and reliability of measurements 2 3 Request: Intention to treat Intention to treat and per protocol dealing with cross-overs (ref Hulley 2013) For example: Patients who did not take/get the medication

More information

alternate-form reliability The degree to which two or more versions of the same test correlate with one another. In clinical studies in which a given function is going to be tested more than once over

More information

Model 1: Subject As Single Factor

Model 1: Subject As Single Factor 3 Model 1: Subject As Single Factor 3.1 The Model In this section, I am considering reliability experiments where each subject is scored by a different group of r raters 1. The experimenter controls which

More information

Repeatability of a questionnaire to assess respiratory

Repeatability of a questionnaire to assess respiratory Journal of Epidemiology and Community Health, 1988, 42, 54-59 Repeatability of a questionnaire to assess respiratory symptoms in smokers CELIA H WITHEY,' CHARLES E PRICE,' ANTHONY V SWAN,' ANNA 0 PAPACOSTA,'

More information

2012 Summary Report of the San Francisco Eligible Metropolitan Area. Quality Management Performance Measures

2012 Summary Report of the San Francisco Eligible Metropolitan Area. Quality Management Performance Measures San Francisco Department of Public Health HIV Health Services 2012 Summary Report of the San Francisco Eligible Metropolitan Area Health Resource Service Administration s HIV/AIDS Bureau's Quality Management

More information

2. How do different moderators (in particular, modality and orientation) affect the results of psychosocial treatment?

2. How do different moderators (in particular, modality and orientation) affect the results of psychosocial treatment? Role of psychosocial treatments in management of schizophrenia: a meta-analytic review of controlled outcome studies Mojtabai R, Nicholson R A, Carpenter B N Authors' objectives To investigate the role

More information

Evidence-Based Practice Fidelity Site Visit Tools

Evidence-Based Practice Fidelity Site Visit Tools Evidence-Based Practice Fidelity Site Visit Tools This product was supported by Florida Department of Children and Families Substance Abuse and Mental Health Program Office funding. Evidence-Based Practice

More information

County of Santa Cruz: Serving Families Involved with Family and Children s Services and Alcohol and Drug Programs

County of Santa Cruz: Serving Families Involved with Family and Children s Services and Alcohol and Drug Programs County of Santa Cruz Alcohol & Drug Program March 6, 2014 1 County of Santa Cruz: Serving Families Involved with Family and Children s Services and Alcohol and Drug Programs Sherra Clinton, MSW Senior

More information

Magnetic Resonance Imaging Interpretation in Patients With Symptomatic Lumbar Spine Disc Herniations

Magnetic Resonance Imaging Interpretation in Patients With Symptomatic Lumbar Spine Disc Herniations Magnetic Resonance Imaging Interpretation in Patients With Symptomatic Lumbar Spine Disc Herniations Comparison of Clinician and Radiologist Readings Jon D. Lurie, MD, MS,* David M. Doman, MD, Kevin F.

More information

Research with the SAPROF

Research with the SAPROF SAPROF 2nd Edition manual updated Research chapter May 2012 M. de Vries Robbé & V. de Vogel Research with the SAPROF Retrospective file studies Research with the SAPROF is being conducted in various settings

More information

Using Direct Behavior Ratings in a Middle School Setting

Using Direct Behavior Ratings in a Middle School Setting Using Direct Behavior Ratings in a Middle School Setting P R E S E N T E R S : R I V K A H R O S E N, U N I V E R S I T Y O F C O N N E C T I C U T N I C H O L A S C R O V E L L O, U N I V E R S I T Y

More information

Nova Scotia Board of Examiners in Psychology. Custody and Access Evaluation Guidelines

Nova Scotia Board of Examiners in Psychology. Custody and Access Evaluation Guidelines Nova Scotia Board of Examiners in Psychology Custody and Access Evaluation Guidelines We are grateful to the Ontario Psychological Association and to the College of Alberta Psychologists for making their

More information

It s a New World New PT and PTA Coursework Tools

It s a New World New PT and PTA Coursework Tools It s a New World New PT and PTA Coursework Tools This article is based on a presentation by Susan K. Lindeblad, PhD, PT, FSBPT Staff, and Emilee Tison, PhD, DCI Consulting Group, Inc., at the 2016 FSBPT

More information

Authorship Guidelines for CAES Faculty Collaborating with Students

Authorship Guidelines for CAES Faculty Collaborating with Students Authorship Guidelines for CAES Faculty Collaborating with Students College of Agriculture and Environmental Sciences Table of Contents Authorship Guidelines for CAES Faculty Collaborating with Students...

More information

A Validated Classification for External Immobilization of the Cervical Spine

A Validated Classification for External Immobilization of the Cervical Spine 72 Original Research A Validated Classification for External Immobilization of the Cervical Spine Micha Holla 1 Joske M. R. Huisman 1 Allard J. F. Hosman 1 1 Department of Orthopaedics, Radboud University

More information

DATA is derived either through. Self-Report Observation Measurement

DATA is derived either through. Self-Report Observation Measurement Data Management DATA is derived either through Self-Report Observation Measurement QUESTION ANSWER DATA DATA may be from Structured or Unstructured questions? Quantitative or Qualitative? Numerical or

More information

Shoplifting Inventory: Standardization Study

Shoplifting Inventory: Standardization Study Shoplifting Inventory: Standardization Study Donald D Davignon, Ph.D. 10-2-02 Abstract The Shoplifting Inventory (SI) is an adult shoplifting offender assessment test that accurately measures offender

More information

Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies

Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies Received: 6 September 2017 Revised: 23 January 2018 Accepted: 20 March 2018 DOI: 10.1002/sim.7679 RESEARCH ARTICLE Performance of intraclass correlation coefficient (ICC) as a reliability index under various

More information

Psychometric qualities of the Dutch Risk Assessment Scales (RISc)

Psychometric qualities of the Dutch Risk Assessment Scales (RISc) Summary Psychometric qualities of the Dutch Risk Assessment Scales (RISc) Inter-rater reliability, internal consistency and concurrent validity 1 Cause, objective and research questions The Recidive InschattingsSchalen

More information

Reliability of Obituaries as a Data Source in Epidemiologic Studies: Agreement in Age, Residence and Occupation

Reliability of Obituaries as a Data Source in Epidemiologic Studies: Agreement in Age, Residence and Occupation Science Journal of Medicine and Clinical Trials ISSN: 2276-7487 http://www.sjpub.org Author(s) 2013. CC Attribution 3.0 License. Research Article Published By Science Journal Publication International

More information

TURNING POINT ASSESSMENT/TREATMENT WOMAN ABUSE PROTOCOL DEPARTMENT OF JUSTICE AND PUBLIC SAFETY

TURNING POINT ASSESSMENT/TREATMENT WOMAN ABUSE PROTOCOL DEPARTMENT OF JUSTICE AND PUBLIC SAFETY J&PS-03-05 February 2001 Cover TURNING POINT ASSESSMENT/TREATMENT WOMAN ABUSE PROTOCOL DEPARTMENT OF JUSTICE AND PUBLIC SAFETY Revised March 31 2010 J&PS-03-05 February 2001 Table of Contents 1.0 PREAMBLE...

More information

SBIRT IOWA THE IOWA CONSORTIUM FOR SUBSTANCE ABUSE RESEARCH AND EVALUATION. Iowa Army National Guard. Biannual Report Fall 2015

SBIRT IOWA THE IOWA CONSORTIUM FOR SUBSTANCE ABUSE RESEARCH AND EVALUATION. Iowa Army National Guard. Biannual Report Fall 2015 SBIRT IOWA Iowa Army National Guard THE IOWA CONSORTIUM FOR SUBSTANCE ABUSE RESEARCH AND EVALUATION Iowa Army National Guard Biannual Report Fall 2015 With Funds Provided By: Iowa Department of Public

More information

BMC Medical Research Methodology

BMC Medical Research Methodology BMC Medical Research Methodology BioMed Central Research article Dealing with missing data in a multi-question depression scale: a comparison of imputation methods Fiona M Shrive 1,3,4, Heather Stuart

More information

Reliability and Validity checks S-005

Reliability and Validity checks S-005 Reliability and Validity checks S-005 Checking on reliability of the data we collect Compare over time (test-retest) Item analysis Internal consistency Inter-rater agreement Compare over time Test-Retest

More information

Evaluation of a clinical test. I: Assessment of reliability

Evaluation of a clinical test. I: Assessment of reliability British Journal of Obstetrics and Gynaecology June 2001, Vol. 108, pp. 562±567 COMMENTARY Evaluation of a clinical test. I: Assessment of reliability Introduction Testing and screening are critical parts

More information

An Exploratory Case Study of the Use of Video Digitizing Technology to Detect Answer-Copying on a Paper-and-Pencil Multiple-Choice Test

An Exploratory Case Study of the Use of Video Digitizing Technology to Detect Answer-Copying on a Paper-and-Pencil Multiple-Choice Test An Exploratory Case Study of the Use of Video Digitizing Technology to Detect Answer-Copying on a Paper-and-Pencil Multiple-Choice Test Carlos Zerpa and Christina van Barneveld Lakehead University czerpa@lakeheadu.ca

More information

A study of adverse reaction algorithms in a drug surveillance program

A study of adverse reaction algorithms in a drug surveillance program A study of adverse reaction algorithms in a drug surveillance program To improve agreement among observers, several investigators have recently proposed methods (algorithms) to standardize assessments

More information

Trauma Symptom Checklist for Children Briere, J Purpose To assess the effects of childhood trauma through the child s self-report.

Trauma Symptom Checklist for Children Briere, J Purpose To assess the effects of childhood trauma through the child s self-report. Description of Measure Trauma Symptom Checklist for Children Briere, J. 1996 Purpose To assess the effects of childhood trauma through the child s self-report. Conceptual Organization The 54-item Trauma

More information

Workforce Analysis: Children and Young People s Mental Health and Wellbeing Wider system

Workforce Analysis: Children and Young People s Mental Health and Wellbeing Wider system Workforce Analysis: Children and Young People s Mental Health and Wellbeing Wider system This questionnaire is aimed at any member of the workforce supporting the mental health and wellbeing for children

More information

The study of communication is interdisciplinary, sharing topics, literatures,

The study of communication is interdisciplinary, sharing topics, literatures, Lombard et al. / CONTENT ANALYSIS 587 Content Analysis in Mass Communication Assessment and Reporting of Intercoder Reliability MATTHEW LOMBARD Temple University JENNIFER SNYDER-DUCH Carlow College CHERYL

More information

Agreement Between Retrospective Accounts

Agreement Between Retrospective Accounts Agreement Between Retrospective Accounts of Substance Use and Earlier Reported Substance Use Linda M. Collins, John W. Graham, William B. Hansen, and C. Anderson Johnson University of Southern California

More information

Chapter 2. Traumatic stress symptomatology after child maltreatment and single traumatic events: Different profiles. Slightly adapted for consistency:

Chapter 2. Traumatic stress symptomatology after child maltreatment and single traumatic events: Different profiles. Slightly adapted for consistency: Chapter 2 Traumatic stress symptomatology after child maltreatment and single traumatic events: Different profiles. Slightly adapted for consistency: Jonkman, C.S., Verlinden, E., Bolle, E.A., Boer, F.

More information

Measurement and Reliability: Statistical Thinking Considerations

Measurement and Reliability: Statistical Thinking Considerations VOL 7, NO., 99 Measurement and Reliability: Statistical Thinking Considerations 8 by John J. Bartko Abstract Reliability is defined as the degree to which multiple assessments of a subject agree (reproducibility).

More information

Chapter IR:VIII. VIII. Evaluation. Laboratory Experiments Logging Effectiveness Measures Efficiency Measures Training and Testing

Chapter IR:VIII. VIII. Evaluation. Laboratory Experiments Logging Effectiveness Measures Efficiency Measures Training and Testing Chapter IR:VIII VIII. Evaluation Laboratory Experiments Logging Effectiveness Measures Efficiency Measures Training and Testing IR:VIII-1 Evaluation HAGEN/POTTHAST/STEIN 2018 Retrieval Tasks Ad hoc retrieval:

More information

Update on the Reliability of Diagnosis in Older Psychiatric Outpatients Using the Structured Clinical Interview for DSM IIIR

Update on the Reliability of Diagnosis in Older Psychiatric Outpatients Using the Structured Clinical Interview for DSM IIIR Journal of Clinical Geropsychology, Vol., No. 4, 995 Update on the Reliability of Diagnosis in Older Psychiatric Outpatients Using the Structured Clinical Interview for DSM IIIR Daniel L. Segal, Robert

More information

Assessment of Interrater Agreement for Multiple Nominal Responses Among Several Raters Chul W. Ahn, City of Hope National Medical Center

Assessment of Interrater Agreement for Multiple Nominal Responses Among Several Raters Chul W. Ahn, City of Hope National Medical Center Assessment of Interrater Agreement for Multiple Nominal Responses Among Several Raters Chul W. Ahn, City of Hope National Medical Center ABSTRACT An interrater agreement coefficient is computed using a

More information

CEMO RESEARCH PROGRAM

CEMO RESEARCH PROGRAM 1 CEMO RESEARCH PROGRAM Methodological Challenges in Educational Measurement CEMO s primary goal is to conduct basic and applied research seeking to generate new knowledge in the field of educational measurement.

More information

Children s Advocacy Centers: A Natural (and Local) Partner for Youth-Serving Organizations

Children s Advocacy Centers: A Natural (and Local) Partner for Youth-Serving Organizations Children s Advocacy Centers: A Natural (and Local) Partner for Youth-Serving Organizations This presentation was supported [in part] by Grant No. 2015-CI-FX-K003 awarded by the Office of Juvenile Justice

More information

RECOMMENDATIONS FOR THE DIAGNOSIS AND MANAGEMENT OF CHURG-STRAUSS SYNDROME METHODS AND SCORING

RECOMMENDATIONS FOR THE DIAGNOSIS AND MANAGEMENT OF CHURG-STRAUSS SYNDROME METHODS AND SCORING PROJECT 1 March 2009 Prepared by Jean-François Cordier (Prof of Pneumology) and Loïc Guillevin (Prof of Medicine) RECOMMENDATIONS FOR THE DIAGNOSIS AND MANAGEMENT OF CHURG-STRAUSS SYNDROME METHODS AND

More information

A practical tool for locomotion scoring in sheep: Reliability when used by veterinary surgeons and sheep farmers

A practical tool for locomotion scoring in sheep: Reliability when used by veterinary surgeons and sheep farmers See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/272945558 A practical tool for locomotion scoring in sheep: Reliability when used by veterinary

More information

DEPRESSION-FOCUSED INTERVENTION FOR PREGNANT SMOKERS 1. Supplemental Material For Online Use Only

DEPRESSION-FOCUSED INTERVENTION FOR PREGNANT SMOKERS 1. Supplemental Material For Online Use Only DEPRESSION-FOCUSED INTERVENTION FOR PREGNANT SMOKERS 1 Supplemental Material For Online Use Only Effects of an Intensive Depression-Focused Intervention for Smoking Cessation in Pregnancy DEPRESSION-FOCUSED

More information

Interpreting Kappa in Observational Research: Baserate Matters

Interpreting Kappa in Observational Research: Baserate Matters Interpreting Kappa in Observational Research: Baserate Matters Cornelia Taylor Bruckner Sonoma State University Paul Yoder Vanderbilt University Abstract Kappa (Cohen, 1960) is a popular agreement statistic

More information

Research Article The Study on the Agreement between Automatic Tongue Diagnosis System and Traditional Chinese Medicine Practitioners

Research Article The Study on the Agreement between Automatic Tongue Diagnosis System and Traditional Chinese Medicine Practitioners Evidence-Based Complementary and Alternative Medicine Volume 2012, Article ID 505063, 9 pages doi:10.1155/2012/505063 Research Article The Study on the Agreement between Automatic Tongue Diagnosis System

More information

Assessment of Peer Rejection and Externalizing Behavior Problems in Preschool Boys: A Short-Term Longitudinal Study

Assessment of Peer Rejection and Externalizing Behavior Problems in Preschool Boys: A Short-Term Longitudinal Study Journal of Abnormal Child Psychology, Vol. 19, No. 4, 1991 Assessment of Peer Rejection and Externalizing Behavior Problems in Preschool Boys: A Short-Term Longitudinal Study Sheryl L. Olson 1,2 and Pearl

More information

A profile of young Albertans with Fetal Alcohol Spectrum Disorder

A profile of young Albertans with Fetal Alcohol Spectrum Disorder A profile of young Albertans with Fetal Alcohol Spectrum Disorder Child and Youth Data Laboratory (CYDL) Key findings This report is an overview of the experiences of young Albertans (0 to 25 years) with

More information

Brief Report: Interrater Reliability of Clinical Diagnosis and DSM-IV Criteria for Autistic Disorder: Results of the DSM-IV Autism Field Trial

Brief Report: Interrater Reliability of Clinical Diagnosis and DSM-IV Criteria for Autistic Disorder: Results of the DSM-IV Autism Field Trial Journal of Autism and Developmental Disorders, Vol. 30, No. 2, 2000 Brief Report: Interrater Reliability of Clinical Diagnosis and DSM-IV Criteria for Autistic Disorder: Results of the DSM-IV Autism Field

More information

Gambler Addiction Index: Gambler Assessment

Gambler Addiction Index: Gambler Assessment Gambler Addiction Index: Gambler Assessment Donald D Davignon, Ph.D. 8-2-02 Abstract The Gambler Addiction Index (GAI) is an adult gambler assessment test that accurately measures gambler risk of gambling

More information

MODEL CHURCH POLICIES

MODEL CHURCH POLICIES MODEL CHURCH POLICIES Model Church Policies Policy for the Methodist Church 2010 Approved by the Methodist Conference 2010 The Methodist Church, Methodist Church House, 25 Marylebone Road, London NW1 5JR

More information

Diagnostic concordance among dermatopathologists in basal cell carcinoma subtyping: Results of a study in a skin referral hospital in Tehran, Iran

Diagnostic concordance among dermatopathologists in basal cell carcinoma subtyping: Results of a study in a skin referral hospital in Tehran, Iran Original Article Diagnostic concordance among dermatopathologists in basal cell carcinoma subtyping: Results of a study in a skin referral hospital in Azita Nikoo, MD 1 Zahra Naraghi, MD 1 Kambiz Kamyab,

More information

2019 COLLECTION TYPE: MIPS CLINICAL QUALITY MEASURES (CQMS) MEASURE TYPE: Process High Priority

2019 COLLECTION TYPE: MIPS CLINICAL QUALITY MEASURES (CQMS) MEASURE TYPE: Process High Priority Quality ID #181: Elder Maltreatment Screen and Follow-Up Plan National Quality Strategy Domain: Patient Safety Meaningful Measure Area: Preventive Care 2019 COLLECTION TYPE: MIPS CLINICAL QUALITY MEASURES

More information

Reliability of motor development data in the WHO Multicentre Growth Reference Study

Reliability of motor development data in the WHO Multicentre Growth Reference Study Acta Pædiatrica, 2006; Suppl 450: 47 /55 Reliability of motor development data in the WHO Multicentre Growth Reference Study WHO MULTICENTRE GROWTH REFERENCE STUDY GROUP 1,2 1 Department of Nutrition,

More information

SFHPT24 Undertake an assessment for family and systemic therapy

SFHPT24 Undertake an assessment for family and systemic therapy Undertake an assessment for family and systemic therapy Overview This standard is about systemic assessment. It is not a once-only event and may change as the therapeutic work proceeds. Systemic assessment

More information

doi: /j.jad

doi: /j.jad doi: 10.1016/j.jad.2013.09.006 Title Reconsidering the effects of blue-light installation for prevention of railway suicides Author names and affiliations Masao Ichikawa a, *, Haruhiko Inada a, Minae Kumeji

More information

CareerCruising. MatchMaker Reliability and Validity Analysis

CareerCruising. MatchMaker Reliability and Validity Analysis CareerCruising MatchMaker Reliability and Validity Analysis 1 Career Cruising MatchMaker Analysis Reliability and Validity Analysis CECS contracted with Career Cruising to conduct psychometric analyses

More information

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data

Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data TECHNICAL REPORT Data and Statistics 101: Key Concepts in the Collection, Analysis, and Application of Child Welfare Data CONTENTS Executive Summary...1 Introduction...2 Overview of Data Analysis Concepts...2

More information

Cuyahoga County Division of Children and Family Services (CCDCFS) Policy Statement

Cuyahoga County Division of Children and Family Services (CCDCFS) Policy Statement Cuyahoga County Division of Children and Family Services (CCDCFS) Policy Statement Policy Chapter: Case Requirements Policy Number: 5.01.03 Policy Name: Family Cases Involving Substance Use Original Effective

More information

Hounslow Safeguarding Children Board. Training Strategy Content.. Page. Introduction 2. Purpose 3

Hounslow Safeguarding Children Board. Training Strategy Content.. Page. Introduction 2. Purpose 3 Hounslow Safeguarding Children Board. Training Strategy 2018-2020. Content.. Page Introduction 2 Purpose 3 What does the Training Strategy hope to achieve?. 4 Review.. 4 Local context.. 4 Training sub

More information

Juvenile Pre-Disposition Evaluation: Reliability and Validity

Juvenile Pre-Disposition Evaluation: Reliability and Validity Juvenile Pre-Disposition Evaluation: Reliability and Validity Donald D Davignon, Ph.D. Abstract The Juvenile Pre-Disposition Evaluation (JPE) is a juvenile defendant assessment test that accurately measures

More information

Sandwell Safeguarding Adults Board. ANNUAL REPORT 2016/2017 Executive Summary

Sandwell Safeguarding Adults Board. ANNUAL REPORT 2016/2017 Executive Summary Sandwell Safeguarding Adults Board SSAB@SSAdultsBoard ANNUAL REPORT 2016/2017 Executive Summary SEE SOMETHING DO SOMETHING Safeguarding is everyone s business SEE SOMETHING If you are concerned that an

More information

and The 95% confidence intervals are calculated as follows: where

and The 95% confidence intervals are calculated as follows: where Using SAS 8 to Calculate Kappa and Confidence Intervals for Binary Data with Multiple Raters, and for the Consensus of Multiple Diagnoses Art Noda, Stanford University School of Medicine, Stanford, CA

More information

Making the Subjective Objective? Computer-Assisted Quantification of Qualitative Content Cues to Deception

Making the Subjective Objective? Computer-Assisted Quantification of Qualitative Content Cues to Deception Making the Subjective Objective? Computer-Assisted Quantification of Qualitative Content Cues to Deception Siegfried L. Sporer Department of Psychology and Sports Science University of Giessen, Germany

More information

Applying the Risk of Bias Tool in a Systematic Review of Combination Long-Acting Beta-Agonists and Inhaled Corticosteroids for Persistent Asthma

Applying the Risk of Bias Tool in a Systematic Review of Combination Long-Acting Beta-Agonists and Inhaled Corticosteroids for Persistent Asthma Applying the Risk of Bias Tool in a Systematic Review of Combination Long-Acting Beta-Agonists and Inhaled Corticosteroids for Persistent Asthma Lisa Hartling 1 *, Kenneth Bond 1, Ben Vandermeer 1, Jennifer

More information

Victim Index Reliability and Validity Study

Victim Index Reliability and Validity Study Victim Index Reliability and Validity Study Abstract The validity of the Victim Index (VI) was investigated in a sample of 666 participants. The VI has eight scales for measuring morale, suicide ideation,

More information

What Smokers Who Switched to Vapor Products Tell Us About Themselves. Presented by Julie Woessner, J.D. CASAA National Policy Director

What Smokers Who Switched to Vapor Products Tell Us About Themselves. Presented by Julie Woessner, J.D. CASAA National Policy Director What Smokers Who Switched to Vapor Products Tell Us About Themselves Presented by Julie Woessner, J.D. CASAA National Policy Director The CASAA Consumer Testimonials Database Collection began in 2013 through

More information

Week 2 Video 2. Diagnostic Metrics, Part 1

Week 2 Video 2. Diagnostic Metrics, Part 1 Week 2 Video 2 Diagnostic Metrics, Part 1 Different Methods, Different Measures Today we ll focus on metrics for classifiers Later this week we ll discuss metrics for regressors And metrics for other methods

More information

Exploring Normalization Techniques for Human Judgments of Machine Translation Adequacy Collected Using Amazon Mechanical Turk

Exploring Normalization Techniques for Human Judgments of Machine Translation Adequacy Collected Using Amazon Mechanical Turk Exploring Normalization Techniques for Human Judgments of Machine Translation Adequacy Collected Using Amazon Mechanical Turk Michael Denkowski and Alon Lavie Language Technologies Institute School of

More information