STUDY. Reflectance Confocal Microscopy and Features of Melanocytic Lesions. An Internet-Based Study of the Reproducibility of Terminology

Similar documents
Reflectance-Mode Confocal Microscopy for the In Vivo Characterization of Pagetoid Melanocytosis in Melanomas and Nevi

STUDY. Morphologic Features of Melanophages Under In Vivo Reflectance Confocal Microscopy

Management of patients with melanocytic and non-melanocytic neoplasms

Sensitivity and Specificity of Confocal Laser-Scanning Microscopy for In Vivo Diagnosis of Malignant Skin Tumors

EARLY ONLINE RELEASE

F006 Imaging in Dermatology Melanocytic Neoplasia Clinical-Confocal-Pathological-Correlations

Morphologic Features of Melanocytes, Pigmented Keratinocytes, and Melanophages by In Vivo Confocal Scanning Laser Microscopy

STUDY. Microscopic In Vivo Description of Cellular Architecture of Dermoscopic Pigment Network in Nevi and Melanomas

F109 Imaging in Dermatology Melanocytic Neoplasia Clinical-Confocal-Pathological-Correlations

In vivo confocal scanning laser microscopy of pigmented Spitz nevi: Comparison of in vivo confocal images with dermoscopy and routine histopathology

ORIGINAL ARTICLE Journal of Investigative Dermatology (2010), Volume 130 & 2010 The Society for Investigative Dermatology

Validation Study of Automated Dermal/Epidermal Junction Localization Algorithm in Reflectance Confocal Microscopy Images of Skin

Confocalist. Why this is important? No Relevant Conflict of Interest Dermpath Lab

Diagnostic Applicability of In Vivo Confocal Laser Scanning Microscopy in Melanocytic Skin Tumors

NIH Public Access Author Manuscript Br J Dermatol. Author manuscript; available in PMC 2015 April 01.

Diagnosis of Lentigo Maligna Melanoma. Steven Q. Wang, M.D. Memorial Sloan-Kettering Cancer Center Basking Ridge, NJ

Dermoscopy: Recognizing Top Five Common In- Office Diagnoses

Confocal Microscopy in Skin Cancer

Basics in Dermoscopy

DIFFERENCES IN DERMOSCOPIC IMAGES FROM NON-POLARIZED DERMOSCOPE AND POLARIZED DERMOSCOPE INFLUENCE THE DIAGNOSTIC ACCURACY AND CONFIDENCE LEVEL.

Confocalist. Why this is important? 7/17/2017. No Relevant Conflict of Interest. Dermpath Lab

Reflectance Confocal Microscopy Real-Time In Vivo Imaging of Basal and Squamous Cell Carcinomas

Benign and malignant epithelial lesions: Seborrheic keratosis: A common benign pigmented epidermal tumor occur in middle-aged or older persons more

Keywords: Microscopy, confocal - Skin neoplasms - Melanoma. Corrisponding author: Francesca Farnetani

STUDY. Correlation of Dermoscopy With In Vivo Reflectance Confocal Microscopy of Streaks in Melanocytic Lesions

MODULE 1. LOCAL AND GENERAL CRITERIA IN PIGMENTED MELANOCYTIC LESIONS.

Histopathology of Melanoma

INCREASE IN incidence and mortality rates for

Disclosure. Objectives. PAFP CME Conference Lou Mancano MD, FAAFP Reading Health System November 18, 2016

Diagnostics Assessment Programme

October 8 th, :00-21:00 EPIDERMAL JUNCTION IN REFLECTANCE CONFOCAL MICROSCOPIC IMAGES OF SKIN

Noninvasive imaging devices have emerged as powerful

Pathology of the skin. 2nd Department of Pathology, Semmelweis University

Associate Clinical Professor of Dermatology MUSC

Appendix : Dermoscopy

Confocal Reflectance Microscopy in Dermatology: Promise and Reality of Non-Invasive Diagnosis and Monitoring

Abrupt Intralesional Color Change on Dermoscopy as a New Indicator of Early Superficial Spreading Melanoma in a Japanese Woman

Dermoscopy. Enhanced Diagnostic Ability: Pigmented Lesions. Ted Rosen, MD Baylor College of Medicine Houston, Texas

Clinical and Dermoscopic Features of Thin Nodular Melanoma

Corporate Overview. January 2014 OTC/QB: LCDX

Dermoscopy. Sir William Osler. Dermoscopy. Dermoscopy. Melanoma USA Primary Care Update Faculty Disclosure Statement

Diagnostics guidance Published: 11 November 2015 nice.org.uk/guidance/dg19

Age-related prevalence of dermatoscopic patterns of acral melanocytic nevi

Histopathology: skin pathology

Algorithmic reproduction of asymmetry and border cut-off parameters according to the ABCD rule for dermoscopy

HIGH-RESOLUTION OPTICAL COHERENCE TOMOGRAPHY FOR THE DIAGNOSIS OF ACTINIC KERATOSIS

Management of Atypical Pigmented Lesions

Multispectral Digital Skin Lesion Analysis. Summary

Female 18. Deeply pigmented lesion on trunk.?warty naevus?seborrhoeic keratosis?malignant melanoma. The best diagnosis is:

SPECIAL TOPIC. Institut National de la Sante et de la Recherche Medicale (INSERM U895), Nice, France c

STUDY. Frequency of Dermoscopic Nevus Subtypes by Age and Body Site

Malignant tumors of melanocytes: Part 1. Deba P Sarma, MD., Omaha

DERMATOLOGY PRACTICAL & CONCEPTUAL. Introduction. Dermoscopy. Hiroshi Sakai 1, Kyoko Tonomura 1, Hirotsugu Shirabe 1, Masaru Tanaka 2

Assisting diagnosis of melanoma through the noninvasive biopsy of skin lesions

Accepted Article. Dermoscopic diagnosis of amelanotic/hypomelanotic melanoma

Morphologic characteristics of nevi associated with melanoma: a clinical, dermatoscopic and histopathologic analysis

INTRODUCTION HOUSEKEEPING June 11 th Dr John Adams Dermatologist/Dermoscopist MOLEMAP NZ/Australia MOLESAFE USA

Apps and Telemedicine H. Peter Soyer Dermatology Research Centre

Desmoplastic Melanoma R/O BCC. Clinical Information. 74 y.o. man with lesion on left side of neck r/o BCC

Case Report A Case of Cystic Basal Cell Carcinoma Which Shows a Homogenous Blue/Black Area under Dermatoscopy

STUDY. Dermatologists Accuracy in Early Diagnosis of Melanoma of the Nail Matrix

Malignant tumors of melanocytes : Part 3. Deba P Sarma, MD., Omaha

What is Dermoscopy? Early Dermoscopes. Deciphering Dermoscopy: Terminology, Features & Algorithms 6/17/2018

The application of confocal microscopy for optical sectioning

ISPUB.COM. Seborrheic Keratosis: A Pictorial Review of the Histopathologic Variations. D Sarma, S Repertinger

STUDY. Characteristic Epiluminescent Microscopic Features of Early Malignant Melanoma on Glabrous Skin

Actinic keratosis (AK) is the most common precancerous

FACTORS ASSOCIATED WITH NEVUS VOLATILITY IN EARLY ADOLESCENCE

Description of Some Dermatoscopic Features of Acral Pigmented Lesions in Iranian Patients: A Preliminary Study

ELECTRON MICROSCOPY OF A SMALL PIGMENTED CUTANEOUS LESION*

Interrater and intrarater agreement of confocal microscopy imaging in diagnosing and subtyping basal cell carcinoma

المركب النموذج--- سبيتز وحمة = Type Spitz's Nevus, Compound SPITZ NEVUS 1 / 7

4 Skin and Body Membranes Study Guide

Case Report Micromelanomas: A Review of Melanomas 2mmand a Case Report

22/04/2015. Dermoscopy of Melanoma. Ilsphi Browne. Overview

STUDY. Confocal Examination of Untreated Fresh Specimens From Basal Cell Carcinoma

6/17/2018. Breaking Bad (Part 1) Dermoscopy of Brown(ish) Things. Bad?

Regression 2/3/18. Histologically regression is characterized: melanosis fibrosis combination of both. Distribution: partial or focal!

Clinics in Dermatology

Melanoma-Back to Basics I Thought I Knew Ya! Paul K. Shitabata, M.D. Dermatopathologist APMG

STUDY. Identification of Clinically Featureless Incipient Melanoma Using Sequential Dermoscopy Imaging

Multiple Primary Melanoma in a Thai Male: A Case Report

Dermatopathology. Dr. Rafael Botella Estrada. Hospital La Fe de Valencia

Cover Page. The handle holds various files of this Leiden University dissertation.

Fluorescence spectroscopy and microscopy of cutaneous tumours correlation between micro- and macro- spectral measurements

Brief Report. Shivanand Gundalli 1, Smita Kadadavar 1, Somil Singhania 1, Rutuja Kolekar 2 INTRODUCTION. Melanocytic Nevus

Strategies for early recognition of cutaneous melanoma present and future

Type IV collagen and laminin staining patterns in benign

Maligna Melanoma and Atypical Fibroxanthoma: An Unusual Collision Tumour G Türkcü 1, A Keleş 1, U Alabalık 1, D Uçmak 2, H Büyükbayram 1 ABSTRACT

Dermoscopy of Acral Melanoma: A Multicenter Study on Behalf of the International Dermoscopy Society

Triage amalgamated dermoscopic algorithm (TADA) for skin cancer screening

Dermatoscopic features of cutaneous non-facial non-acral lentiginous growth pattern melanomas

OBSERVATION. Detection of Clinically Amelanotic Malignant Melanoma and Assessment of Its Margins by In Vivo Confocal Scanning Laser Microscopy

Lichenoid Tissue Reaction in Malignant Melanoma A Potential Diagnostic Pitfall

STUDY. Scott W. Menzies, MB,BS, PhD; Karin Westerhoff, MD; Harold Rabinovitz, MD; Alfred W. Kopf, MD; William H. McCarthy, MBBS, MEd; Brian Katz

Differences in the Known Cellular Composition of Benign Pigmented Skin Lesions Reflected in Computer-Aided Image Analysis

insight Exceptional SKINTELL in-depth, non-invasive dermatological solution HealthCare

Dermoscopic patterns of melanocytic nevi in children and adolescents: a cross-sectional study *

Transcription:

STUDY Reflectance Confocal Microscopy and Features of Melanocytic Lesions An Internet-Based Study of the Reproducibility of Terminology Giovanni Pellacani, MD; Marco Vinceti, MD; Sara Bassoli, MD; Ralph Braun, MD; Salvador Gonzalez, MD; Pascale Guitera, MD; Caterina Longo, MD; Ashfaq A. Marghoob, MD; Scott W. Menzies, MD; Susana Puig, MD; Alon Scope, MD; Stefania Seidenari, MD; Josep Malvehy, MD Objective: To test the interobserver and intraobserver reproducibility of the standard terminology for description and diagnosis of melanocytic lesions in in vivo confocal microscopy. Design: A dedicated Web platform was developed to train the participants and to allow independent distant evaluations of confocal images via the Internet. Setting: Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy. Participants: The study population was composed of 15 melanomas, 30 nevi, and 5 Spitz/Reed nevi. Six expert centers were invited to participate at the study. Intervention: Evaluation of 36 features in 345 confocal microscopic images from melanocytic lesions. Main Outcome Measure: Interobserved and intraobserved agreement, by calculating the Cohen statistics measure for each descriptor. Results: High overall levels of reproducibility were shown for most of the evaluated features. In both the training and test sets there was a parallel trend of decreasing values as deeper anatomic skin levels were evaluated. All of the features, except 1, used for melanoma diagnosis, including roundish pagetoid cells, nonedged papillae, atypical cells in basal layer, cerebriform clusters, and nucleated cells infiltrating dermal papillae, showed high overall levels of reproducibility. However, less-than-ideal reproducibility was obtained for some descriptors, such as grainy appearance of the epidermis, junctional thickening, mild atypia in basal layer, plump bright cells, small bright cells, and reticulated fibers in the dermis. Conclusion: The standard consensus confocal terminology useful for the evaluation of melanocytic lesions was reproducibly recognized by independent observers. Arch Dermatol. 2009;145(10):1137-1143 Author Affiliations are listed at the end of this article. IN VIVO REFLECTANCE MODE CONfocal microscopy (RCM) is an emerging technique that allows for noninvasive imaging of the epidermis and the upper dermis at near-histologic resolution. 1 The clinical application of RCM in skin oncology seems to be very promising, particularly for evaluation of melanocytic lesions, owing to the high contrast provided by hyperreflective melanin and melanosomes. 2 Since 2001, RCM features that are useful for the distinction between melanocytic nevi and melanomas (MMs) have been identified and have been correlated with histopathologic features. 3-7 A diagnostic algorithm was recently proposed and blindly tested on a large population of equivocal melanocytic lesions, resulting in high diagnostic accuracy. 8,9 Reflectance mode confocal microscopy has the potential to improve diagnostic confidence and diagnostic accuracy. 9-11 Similar to other branches of medicine that require visual analysis of images, RCM image interpretation remains subjective and operator dependent. However, despite this subjectivity, a high degree of agreement among different observers is achievable. Toward this end, it is paramount to first establish a glossary of terms for RCM evaluation of melanocytic lesions; a consensus terminology along with illustrative images was recently published by experts in the field. 12 The next step requires validation of this new terminology. We tested the interobserver and intraobserver reproducibility of the standard terms used for description and diagnosis of melanocytic lesions. For this purpose, we developed a dedicated Web-platform that allowed for training of study participants with an online RCM tutorial and for testing the reproducibility of their RCM image evaluations by interobserver and intraobserver agreement. 1137

METHODS Based on a critical review of the scientific literature, a list of 36 RCM descriptive terms was previously agreed on by expert consensus. 12 A Web-based tutorial was provided to the participants, and it contained the definitions of the RCM descriptive terms and illustrative images for each term. The tutorial and the whole data set are available online. 13 The study was entirely conducted online, using the dedicated Web site. In Italy, ethics committee review of a deidentified database is not required. STUDY POPULATION AND STUDY DESIGN The tutorial and training sets were based on individual images that were previously used in a published study by Pellacani et al. 8 The training set consisted of 45 images that were acquired from benign and malignant melanocytic lesions at 3 different depths based on the anatomic levels of the skin (15 images from superficial epidermis, 15 from the dermal epidermal junction, and 15 from the papillary dermis). No mosaics were evaluated. The test set consisted of 270 images, corresponding to 90 images per skin layer. It included images from 50 new cases of melanocytic lesions that were not used in previous studies, corresponding to 15 consecutive MMs, 5 consecutive Spitz nevi, and a random set of 30 melanocytic nevi, acquired from March 1, 2006, to June 30, 2006, at the Department of Dermatology at the University of Modena and Reggio Emilia, Modena, Italy. All lesions were diagnostically challenging on clinical and/or dermoscopic examination and were excised to rule out MM. All RCM imaging was acquired according to a standardized protocol previously published. 8 On average, 250 RCM images were acquired per lesion, and all images were classified according to the anatomic depth or level into superficial epidermis, dermal epidermal junction (including the basal cell layer of the epidermis), or papillary dermis. In addition, all images were also classified according to image quality. The test set consisted of randomly selected images of good quality, and for each lesion at least 1 image was selected from the superficial epidermis level, 1 from the dermal epidermal junction, and 1 from the papillary dermis. An image was judged to have good quality based on a lack of artifacts and on a good overall contrast (excluding too bright or too dim images derived from acquisition mistakes). Finally, from each lesion, no more than 3 images per anatomical layer were selected and included in the final test set cases. Overall, each participant performed 300 test evaluations. To test for intraobserver agreement, 30 randomly selected images were duplicated within the test set. The order of images in the test set was randomized. Overall, more than 20 000 observations were considered for statistical analysis, enough to acquire a satisfactory statistical power. Study participants were invited to view the tutorial. Six centers, of which 4 were in Europe (Madrid and Barcelona, Spain; Modena, Italy; and Zurich, Switzerland), 1 in the United States (New York, New York), and 1 in Australia (Sydney), with renowned expertise in the use of RCM for diagnosing melanocytic lesions, were involved. Subsequently, the training set of 45 images was used to allow participants to test and refine their skills of applying the descriptive terms to the RCM images. Each participant filled out an online electronic response form consisting of questions and accompanying choices for each image. The form included questions regarding all descriptive terms used in the study stratified by the anatomic skin level. Participants were required to first submit their evaluation of each image, after which they were able to compare their evaluation with that proposed in consensus by 2 of us (G.P. and J.M.). This realtime feedback was designed to assist the participants in applying the criteria to the RCM images. After the completion of the training set evaluations, the investigators were asked to access the test set. The same electronic data sheets were used, this time without feedback. Participants could evaluate only 1 image at a time, and each case had to be submitted before participants could access the next image. Although images from the tutorial and training sets were accessible at all times, participants could not go back to previously analyzed images in the test set. Each participant was given 4 months ( January 1 April 30, 2007) to complete the evaluation of the images. STATISTICAL ANALYSIS Data analysis was carried out using the SPSS statistical package (release 10.0.06; SPSS Inc, Chicago, Illinois) and Stata statistical software (release 10.0; Stata Corp LP; College Station, Texas). We computed the statistics for evaluation of interobserver and intraobserved agreement by calculating the Cohen statistics measure for each descriptor and considering P.01 as significant. The interobserver reproducibility on the training set was assessed for each study participant and for each RCM feature by calculating the percentage of ratings concordant with the proposed ones considered as gold standards. The overall interrater reproducibility was determined on the test set by calculating the proportion of concordant outcomes, considering reproducibility to be in full agreement if evaluation of an RCM feature in an image was concordant for all 6 participants and in good agreement if 5 of the 6 participants were concordant. Disagreement was defined as images that resulted in agreement inferior to good agreement. For each feature, the frequency of positive observations, corresponding to the percentage of observations reporting the feature as present, was calculated. For each RCM feature across the entire test set, we considered a feature to have a high overall level of reproducibility if more than 70% of the images showing this RCM feature resulted in good agreement (ie, disagreement in no more than 30% of images). The intraobserver reproducibility was assessed in a subset of 30 images, randomly selected from the test set, and submitted twice to each study participant. To overcome the intrinsic limitations of the statistics interpretation in evaluating reproducibility on observations lacking a gold standard and showing an imbalance of positive vs negative responses, we decided to consider also the proportion of concordant evaluations. Thus, the interpretation of the results should take into account both the proportion of concordant evaluations (full agreement and good agreement) and the value. To avoid misinterpretation, only significant values were considered, and a very low P value (P.01) was considered significant in order to obtain more consistent and stable results. RESULTS Training set results are summarized in Table 1. A high proportion of evaluations were concordant with the proposed ones for each parameter, ranging from 68% for isolated cells in the papillary dermis to 99% for a honeycombed epidermal pattern. Although not significant (P =.17), reproducibility of RCM features showed a decreasing trend from the superficial layers to the dermis; overall, concordance in RCM feature recognition was 86% for the features observable in the superficial epidermis, 83% for terms related to the dermal-epidermal junction, and 81% for dermal aspects. 1138

Test set results are reported in Table 2. In the superficial epidermis, high overall levels of reproducibility ( 70% of the images showing the feature achieved agreement by at least 5 participants) were obtained for honeycombed and cobblestone patterns and their variants (broadened honeycombed and cobblestone with small nucleated cells) (Figure 1A). Complete disarrangement or uneven pattern of the superficial epidermis (ie, atypical honeycombed and atypical cobblestone patterns) resulted in high overall levels of reproducibility, with good to full agreement seen in over 74% of images. A grainy pattern was not reproducibly recognized, resulting in disagreement in almost half of images showing this feature. For pagetoid cells, high overall levels of reproducibility were obtained both for the recognition of the presence of pagetoid cells and for their morphologic description (Figure 1B), obtaining good agreement in more than 80% of images, with the exception of the descriptors small cells and cell density. Evaluating the dermal-epidermal junction (including the basal cell layer of the epidermis), edged and nonedged papillae, junctional clusters, marked atypia, and sheetlike structures showed high overall levels of reproducibility, whereas the distinction of junctional thickening, typical cells, or mild atypia resulted in disagreement in approximately half of the images (Figure 2). Within the papillary dermis, identification of cell clusters and their distinction into different subtypes showed high overall levels of reproducibility, with the exception of dishomogeneous clusters, which showed disagreement in approximately 35% of cases. Although nucleated cells within the papillary dermis were reproducibly recognized, identification of plump bright cells or small bright cells and particles resulted in disagreement in almost 45% of image evaluations. For collagen structures, identification of collagen bundles showed high overall levels of reproducibility, whereas identification of reticulated fibers was not reproducible. Table 3 lists the values for the intraobserver reproducibility. With the exception of a grainy pattern in superficial epidermis, mild atypia, sheetlike structures at dermal-epidermal junction, dishomogeneous and sparse clusters, and small bright cells and particles in the dermis, all the other 28 parameters showed good intraobserver reproducibility, with values greater than 0.4. In 11 features, the value was at least 0.7. Two parameters (collagen bundles and cerebriform clusters) were not eligible for intraobserver evaluation owing to their scarcity in the set of 30 reevaluated images. COMMENT In vivo RCM is a noninvasive, morphologic characteristic based technique that may have application in the field of dermatological oncology. The use of this technique resulted in high diagnostic accuracy rates for MM 7,9 and basal cell carcinoma. 14 In a recent publication, 15 a blinded evaluation of a series of more than 300 lesions, including 136 MMs, showed that the diagnosis based on RCM was much more specific than dermoscopic diagnosis, whereas sensitivity values were comparable between the 2 modalities. Table 1. Training Set: Proportion of Evaluations Coincident With the Proposed Evaluation and Mean Cohen Value for Each Parameter Calculated on 45 Images (15 per Layer) Evaluated by 6 Observers Reflectance Mode Confocal Microscopy Features Coincident Evaluations, % Cohen Value a Superficial epidermal layers Honeycombed 98.9 0.94 Honeycombed atypical 94.4 0.37 Honeycombed broadened 96.7 0.48 Cobblestone 93.3 0.85 Cobblestone atypical 80.0 0.64 Cobblestone with small nucleated 73.3 0.33 cells Disarranged 87.8 0.68 Grainy 80.0 0.40 Pagetoid cells 92.2 0.84 Roundish pagetoid cells 88.9 0.73 Dendritic pagetoid cells 87.8 0.72 Cell size: small 77.8 0.20 Cell size: large 81.1 0.52 Pleomorphism 76.7 0.44 Cell density 82.2 0.67 Dermal-epidermal junction Edged papillae 87.8 0.63 Nonedged papillae 81.1 0.49 Junctional nests 83.3 0.61 Junctional clusters 88.9 0.54 Junctional thickenings 74.4 0.40 Typical cells 83.3 0.57 Mild atypia 74.4 0.25 Marked atypia 85.6 0.58 Sheetlike structures 94.4 0.65 Papillary dermis Dermal cell clusters 83.3 0.46 Dense 85.6 0.51 Dishomogeneous 85.6 0.53 Sparse 84.4 0.21 Cerebriform 88.9 0.46 Isolated cells in the papilla 67.8 0.37 Nucleated 76.7 0.17 Small bright cells and particles 68.9 0.32 Plump bright cells 81.1 0.29 Collagen structures 81.1 0.40 Collagen reticulated 87.8 0.30 Collagen bundles 91.1 0.59 a All Cohen values displayed were significant at P.01. These findings suggest that the use of RCM may be a useful adjunct to clinical and dermoscopic examination. Although an overall good agreement was reported in the previous studies, 9,15 both studies were conducted in close collaboration by 2 medical centers, and each evaluation concerned the whole lesion in context, introducing a possible bias in image interpretation by adjusting the interpretations of individual features accordingly with the overall diagnostic impression. In the study described herein, we focused on the reproducibility of RCM terminology, evaluating the possibility of recognizing features within single confocal images out of the lesion contest, including evaluators from independent medical centers. Although physicians are becoming increasingly aware of the value of RCM, with more and more groups engaging in RCM research, interpretation of RCM features has lacked standardization and has relied mostly on personal expertise. For this reason, 1139

Table 2. Test Set: Proportion of Full Agreement and Good Agreement (5 of 6 Coincident Evaluations), Percentage of Positive Observations, and Cohen Value for Each Parameter Evaluated on 270 Images (90 per Layer) by 6 Observers Patterns Full Agreement, % Good Agreement, % Positive Observations, % Cohen Value a Superficial epidermal layers Honeycombed 72.2 18.9 33.1 0.75 Honeycombed atypical 65.6 12.2 15.6 0.37 Honeycombed broadened 90.0 4.4 3.1 0.26 Cobblestone 78.9 11.1 52.2 0.81 Cobblestone atypical 53.3 28.9 23.1 0.45 Cobblestone with small nucleated cells 74.4 14.4 10.2 0.40 Disarranged 52.2 22.2 28.5 0.48 Grainy 31.1 22.2 27.4 0.16 Pagetoid cells 64.4 17.8 44.6 0.68 Roundish pagetoid cells 64.4 20.0 23.1 0.57 Dendritic pagetoid cells 66.7 15.6 38.9 0.69 Cell size: small 45.6 23.3 27.8 0.37 Cell size: large 55.6 27.8 27.2 0.53 Pleomorphism 64.4 18.9 20.4 0.51 Cell density 44.5 23.3 44.6 0.71 Dermal-epidermal junction Edged papillae 66.7 22.2 40.2 0.54 Nonedged papillae 48.9 30.0 61.5 0.29 Junctional nests 25.6 37.8 63.9 0.30 Junctional clusters 42.2 30.0 23.7 0.30 Junctional thickenings 16.7 33.3 51.5 0.22 Typical cells 33.3 25.6 49.1 0.37 Mild atypia 28.9 18.9 37.8 0.24 Marked atypia 74.4 15.6 10.6 0.43 Sheetlike structures 81.1 13.3 4.4 0.11 Papillary dermis Dermal cell clusters 45.6 35.6 55.9 0.55 Dense 53.3 25.6 30.4 0.52 Dishomogeneous 52.2 12.2 21.9 0.31 Sparse 68.9 16.7 8.9 0.31 Cerebriform 90.0 6.7 3.0 0.30 Isolated cells in the papilla 54.4 22.2 80.7 0.35 Nucleated 54.4 18.9 24.6 0.44 Small bright cells and particles 30.0 25.6 65.2 0.27 Plump bright cells 30.0 25.6 34.8 0.27 Collagen structures 22.2 30.0 44.8 0.25 Collagen reticulated 28.9 30.0 28.9 0.19 Collagen bundles 58.9 21.1 17.0 0.34 a All Cohen values displayed were significant at P.01. we previously proposed a standard RCM terminology for melanocytic lesions that was based on a consensus among experts. 12 In the present study, we evaluated the interobserver and intraobserver reproducibility in recognition of these standard RCM parameters. The fact that RCM analysis is image-based enabled us to design the study as a Webbased platform, offering remote tutorials for participants, followed by blinded evaluation of a large test set of images. The intraobserver agreement was assessed in 30 images and showed good values for all assessed parameters. However, cerebriform clusters and collagen bundles were not assessed because there were not enough cases present in the test set to allow for intraobserver statistical analysis. On the training set of images, all study participants showed consistency in RCM feature recognition. Preselection of a highly illustrative set of images may account for the overall higher agreement on feature recognition compared with the subsequent test set analysis. However, in both the training and test sets there was a parallel trend of decreasing values as deeper anatomic skin levels were evaluated. This may be partially explained by decreasing image resolution with increasing imaging depth. Notably, pagetoid infiltration (Figure 1B) showed high overall levels of reproducibility; over 80% of the images that included pagetoid infiltration resulted in agreement among at least 5 of the 6 observers. The good agreement was maintained in morphologic characterization of the pagetoid cells (eg, cell size and shape). Because pagetoid infiltration was previously reported to be one of the most relevant RCM features for MM diagnosis, 9 these data further support the inclusion of this parameter in an RCMbased diagnostic algorithm. Distinction of typical and mildly atypical cells at the dermal-epidermal junction was difficult, showing good agreement in a little more than 50% of evaluated images and suggesting a deficiency in the definition or intrinsic limitations in the differentiation of both cellular morphologic characteristics in the dermal-epidermal junction; however, marked atypia, a specific feature for MM diagnosis, 9 was strongly repro- 1140

A A 50 µm 50 µm B B Figure 1. Confocal images from the test set (0.5 0.5 mm). Level: superficial layer. A, Honeycombed pattern in a nevus. The honeycombed architecture is clearly distinguishable throughout the whole image. This image achieved 100% agreement. B, Pagetoid cells in a melanoma. Pleomorphic large cells with bright cytoplasm and dark nucleus (white circles) are present within a disarranged epidermal architecture. This image also achieved 100% agreement (original magnification 30; numerical aperture, 0.9). ducible. The greater variability in the background architecture of the dermal-epidermal junction, comprising edged papillae, nonedged papillae, and/or junctional clusters or thickenings, could be responsible for the greater difficulty in consistently assessing cellular morphologic characteristics in the absence of striking atypia (Figure 2). In contrast, pagetoid cells usually stand out in sharp contrast with the more homogeneous background arrangement of the keratinocytes in the superficial epidermis (ie, small polygonal hyporeflective structures in a honeycombed pattern or round bright structures with a cobblestone architecture). Single, large, nucleated bright cells, Figure 2. Confocal images from the test set (0.5 0.5 mm). Level: dermalepidermal junction. A, Mild atypical cells. Three round to oval cells with bright cytoplasm and a dark nucleus (arrowheads) can be identified. In this case, the observers showed disagreement for the presence of mild atypia. B, Marked atypia. Numerous and pleomorphic bright nucleated cells are clearly present in this picture within the basal layer. The presence of inflammatory plump bright cells within dermal papillae (asterisk) seemed not to be a confounding factor for this parameter. This image achieved 100% agreement (original magnification 30; numerical aperture, 0.9). corresponding to atypical melanocytes, were recognizable with good agreement within dermal papillae, probably due again to the sharp contrast between the bright cells and the usually dark homogeneous background of the dermis. Cellular aggregate recognition was reproducible, particularly for large, roundish to oval structures forming junctional or dermal clusters. Analyzing different cluster subtypes, a good agreement was maintained in all the 3 subtypes (dense, sparse-cell, and cerebriform). The differences observed in values were influenced by the 1141

Table 3. Intraobserver Agreement Calculated on 30 Images (10 per Layer) Evaluated by 6 Observers Patterns Cohen Value a Superficial epidermal layers Honeycombed 0.84 Honeycombed atypical 0.62 Honeycombed broadened 0.49 Cobblestone 0.89 Cobblestone atypical 0.53 Cobblestone with small nucleated cells 0.70 Disarranged 0.54 Grainy 0.26 Pagetoid cells 0.77 Roundish pagetoid cells 0.69 Dendritic pagetoid cells 0.83 Cell size: small 0.51 Cell size: large 0.71 Pleomorphism 0.56 Cell density 0.54 Dermal-epidermal junction Edged papillae 0.93 Nonedged papillae 0.77 Junctional nests 0.85 Junctional clusters 0.56 Junctional thickenings 0.59 Typical cells 0.70 Mild atypia 0.29 Marked atypia 0.44 Sheetlike structures 0.38 Papillary dermis Dermal cell clusters 0.54 Dense 0.73 Dishomogeneous 0.34 Sparse 0.38 Cerebriform NS Isolated cells in the papilla 0.48 Nucleated 0.49 Small bright cells and particles 0.40 Plump bright cells 0.41 Collagen structures 0.42 Collagen reticulated 0.48 Collagen bundles NS Abbreviation: NS, not significant. a All Cohen values displayed were significant at P.01. prevalence of the feature in the image set, which was high for dense clusters and rare for sparse-cell and cerebriform clusters. Junctional thickenings were not reproducibly recognized. Similarly, evaluating dermalepidermal architecture, edged papillae, which are rings of bright cells (mostly pigmented keratinocytes at the basal layer) surrounding dermal papillae, were highly reproducible, whereas nonedged papillae, lacking a clear rim of bright polygonal cells, obtained a good, but lower, agreement. Both small bright and plump bright cells, which correspond to an inflammatory infiltrate, and collagen structures obtained a good agreement in just over the 50% of observed images. Comparing intraobserver and interobserver reproducibility for features showing lower concordance, we speculate that some features were not sufficiently described to obtain an unequivocal interpretation (grainy epidermis or mild atypia), whereas others were probably subjectively but consistently interpreted (junctional thickenings, plump bright cells, and small bright particles). Better definitions and consequent further tutorial and training are required to improve agreement on these attributes. On the whole, 5 of the 6 features used for MM diagnosis in the recently proposed RCM algorithm, 8,9 corresponding to roundish pagetoid cells, nonedged papillae, atypical cells in basal layer, cerebriform clusters, and nucleated cells infiltrating dermal papillae, showed high overall levels of reproducibility, with concordance in 5 of 6 observers in over 70% of evaluated images; identification of mild atypia in basal layers showed lower agreement and is potentially a weaker diagnostic criterion. Diffuse pagetoid infiltration, corresponding to the sixth feature of the RCM algorithm, was not evaluated because it requires the evaluation of a wider lesion area (using RCM mosaic images) than that assessed in the individual high-resolution study images. In conclusion, the standard consensus RCM terms useful for the evaluation of melanocytic lesions were reproducibly recognized by independent RCM observers. Specifically, the parameters that were recently described in an RCM algorithm for MM diagnosis showed a high level of agreement among observers. Considering the disagreement among pathologists on the diagnosis of MM vs benign melanocytic lesions and the less-than-ideal reproducibility of some histologic features of dysplastic nevi and MM, 16 we may state that the identified RCM features can be reproducibly identified by trained observers. The good level of concordance for RCM descriptors was notably obtained after allowing participants access to a common training resource that consisted of a tutorial and a short training set of images. Of note, all participants were skilled in RCM use and interpretation of its images and knowledgeable of the literature and standard RCM terminology. Thus, more elaborate training may be required for beginners to achieve this level of feature recognition. Such training may be offered via didactic lectures and online tutorials. Finally, it is also important to stress that the final diagnosis with RCM is based not only on the evaluation of an isolated image but on the global evaluation of much more numerous images obtained per tumor, including VivaBlocks (mosaics of 1 1 to 8 8 mm of images at the same level of the epidermis), VivaStacks (several images at different depth), or videos. In the present study, only the reproducibility of isolated images was evaluated and not the reproducibility of a complete evaluation. Accepted for Publication: March 17, 2009. Author Affiliations: Departments of Dermatology (Drs Pellacani, Bassoli, Longo, and Seidenari) and Hygiene and Public Health (Dr Vinceti), University of Modena and Reggio Emilia, Modena, Italy; Department of Dermatology, University of Zurich, Zurich, Switzerland (Dr Braun); Dermatology Service, Ramon y Cajal Hospital, Alcala University, Madrid, Spain (Dr Gonzales); Department of Dermatology, Memorial Sloan Kettering Cancer Center, New York, New York (Drs Gonzales, Marghoob, and Scope); Sydney Melanoma Diagnostic Centre, Sydney Cancer Centre and Department of Dermatology, Royal Prince Alfred Hospital, Faculty of Medicine, University of Sydney, Sydney, Australia (Drs Guitera and Menzies); 1142

Department of Dermatology, Melanoma Unit, Hospital Clínic, Institute de Investigacions Biomèdiques August Pi-Sunyer, Barcelona, Spain (Drs Puig and Malvehy); and Centro de Investigacion Biomedica en Red Enfermedades Raras, Barcelona (Drs Puig and Malvehy). Correspondence: Giovanni Pellacani, MD, Department of Dermatology, University of Modena and Reggio Emilia, Via del Pozzo 71, 41100 Modena, Italy (pellacani.giovanni @unimo.it). Author Contributions: All of the authors contributed to the study in a manner that meets authorship criteria, saw the final draft of the manuscript, and approved the validity of the data presented. Dr Pellacani had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. Study concept and design: Pellacani, Puig, Seidenari, and Malvehy. Acquisition of data: Pellacani, Bassoli, Braun, Longo, Marghoob, and Malvehy. Analysis and interpretation of data: Pellacani, Vinceti, Bassoli, Braun, Gonzalez, Guitera, Marghoob, Menzies, Puig, Scope, and Malvehy. Drafting of the manuscript: Pellacani. Critical revision of the manuscript for important intellectual content: Pellacani, Vinceti, Bassoli, Braun, Gonzalez, Guitera, Longo, Marghoob, Menzies, Puig, Scope, Seidenari, and Malvehy. Statistical analysis: Vinceti. Study supervision: Pellacani, Puig, Seidenari, and Malvehy. Financial Disclosure: None reported. Funding/Support: This study was supported in part by a grant from the Istituto Superiore di Sanità (ISS), Italy (project No. 527/B/3A/4). Additional Contributions: Fabio Cannillo, informatics engineer, created the dedicated Web site and provided technical assistance. REFERENCES 1. González S, Gilaberte-Calzada Y, González-Rodríguez A, Torres A, Mihm MC Jr. In vivo reflectance-mode confocal scanning laser microscopy in dermatology. Adv Dermatol. 2004;20:371-387. 2. Rajadhyaksha M, Grossman M, Esterowitz D, Webb RH, Anderson RR. In vivo confocal scanning laser microscopy of human skin: melanin provides strong contrast. J Invest Dermatol. 1995;104(6):946-952. 3. Langley RGB, Rajadhyaksha M, Dwyer PJ, Sober AJ, Flotte TJ, Anderson RR. Confocal scanning laser microscopy of benign and malignant melanocytic skin lesions in vivo. J Am Acad Dermatol. 2001;45(3):365-376. 4. Pellacani G, Cesinaro AM, Longo C, Grana C, Seidenari S. Microscopic in vivo description of cellular architecture of dermoscopic pigment network in nevi and melanomas. Arch Dermatol. 2005;141(2):147-154. 5. Pellacani G, Cesinaro AM, Seidenari S. In vivo assessment of melanocytic nests in nevi and melanomas by reflectance confocal microscopy. Mod Pathol. 2005; 18(4):469-474. 6. Pellacani G, Cesinaro AM, Seidenari S. Reflectance-mode confocal microscopy for the in vivo characterization of pagetoid melanocytosis in melanomas and nevi. J Invest Dermatol. 2005;125(3):532-537. 7. Gerger A, Koller S, Kern T, et al. Diagnostic applicability of in vivo confocal laser scanning microscopy in melanocytic skin tumors. J Invest Dermatol. 2005; 124(3):493-498. 8. Pellacani G, Cesinaro AM, Seidenari S. Reflectance-mode confocal microscopy of pigmented skin lesions: improvement in melanoma diagnostic specificity. J Am Acad Dermatol. 2005;53(6):979-985. 9. Pellacani G, Guitera P, Longo C, Avramidis M, Seidenari S, Menzies S. The impact of in vivo reflectance confocal microscopy for the diagnostic accuracy of melanoma and equivocal melanocytic lesions. J Invest Dermatol. 2007;127 (12):2759-2765. 10. Langley RG, Walsh N, Sutherland AE, et al. The diagnostic accuracy of in vivo confocal scanning laser microscopy compared to dermoscopy of benign and malignant melanocytic lesions: a prospective study. Dermatology. 2007;215(4): 365-372. 11. Gerger A, Hofmann-Wellenhof R, Langsenlehner U, et al. In vivo confocal laser scanning microscopy of melanocytic skin tumours: diagnostic applicability using unselected tumour images. Br J Dermatol. 2008;158(2):329-333. 12. Scope A, Benvenuto-Andrade C, Agero AL, et al. In vivo reflectance confocal microscopy imaging of melanocytic skin lesions: consensus terminology glossary and illustrative images. J Am Acad Dermatol. 2007;57(4):644-658. 13. In vivo reflectance confocal microscopy: pattern definition and reproducibility of descriptors. http://www.skinconfocalmicroscopy.org/. Accessed July 20. 2009. 14. Nori S, Rius-Díaz F, Cuevas J, et al. Sensitivity and specificity of reflectancemode confocal microscopy for in vivo diagnosis of basal cell carcinoma: a multicenter study. J Am Acad Dermatol. 2004;51(6):923-930. 15. Guitera P, Pellacani G, Longo C, Seidenari S, Avramidis M, Menzies SW. In vivo reflectance confocal microscopy enhances secondary evaluation of melanocytic lesions. J Invest Dermatol. 2009;129(1):236-247. 16. Corona R, Mele A, Amini M, et al. Interobserver variability on the histopathologic diagnosis of cutaneous melanoma and other pigmented skin lesions. J Clin Oncol. 1996;14(4):1218-1223. 1143