Clinical Validation of a Gene Expression Signature that Differentiates Benign Nevi from Malignant Melanoma

Similar documents
Clinical validation of a gene expression signature that differentiates benign nevi from malignant melanoma

Page 1 of 3. We suggest the following changes:

6/22/2015. Original Paradigm. Correlating Histology and Molecular Findings in Melanocytic Neoplasms

Vernon K. Sondak. Department of Cutaneous Oncology Moffitt Cancer Center Tampa, Florida

Molecular Methods in the Diagnosis and Prognostication of Melanoma: Pros & Cons

RNA preparation from extracted paraffin cores:

> 6000 Mutations in Melanoma. Tests That Cay Be Employed. FISH for Additions/Deletions. Comparative Genomic Hybridization

I have no relevant conflicts of interest to disclose. John T. Seykora MD PhD Departments of Dermatology & Pathology and Laboratory Medicine

MAPK Pathway. CGH Next Generation Sequencing. Molecular Tools in Care of Patients with Pigmented Lesions 7/20/2017

Update on Genetic Testing for Melanoma

Molecular classification of melanomas and nevi using gene expression microarray signatures and formalin-fixed and paraffin-embedded tissue

Melanocytic proliferations in sundamaged

There is NO single Melanoma Stain. > 6000 Mutations in Melanoma. What else can be done to discriminate atypical nevi from melanoma?

Impact of Prognostic Factors

A PRACTICAL APPROACH TO ATYPICAL MELANOCYTIC LESIONS BIJAN HAGHIGHI M.D, DIRECTOR OF DERMATOPATHOLOGY, ST. JOSEPH HOSPITAL

SUPPLEMENTARY APPENDIX

Genetic Testing: When should it be ordered? Julie Schloemer, MD Dermatology

Case 26 Male 37. Right jawline 5mm nodule?keloid. The best diagnosis is:

David B. Troxel, MD. Common Medicolegal Situations: Misdiagnosis of Melanoma

Supplementary webappendix

Contemporary Classification of Breast Cancer

Lentigo Maligna: Striking a Balance With the Risk-Benefit Ratio. Glen M. Bowen, MD Huntsman Cancer Institute University of Utah

Integrating Fluorescence in situ Hybridization and Genomic Array Results into the Diagnostic Workup of Melanoma

RECENT ADVANCES IN THE MOLECULAR DIAGNOSIS OF BREAST CANCER

Melanocytic lesions on Genital Skin Melanoma vs. Melanocytic Nevus, Revisited. Timothy H. McCalmont, MD University of California, San Francisco

FAQs for UK Pathology Departments

Michael T. Tetzlaff MD, PhD

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

The Pathology of Neoplasia Part II

Gene Expression Profiling for Cutaneous Melanoma

Medical Policy An independent licensee of the Blue Cross Blue Shield Association

Which melanoma patients benefit from genetic testing?

Approximately 2% of the United States population

Molecular Aspects of Melanocytic Neoplasia. Iwei Yeh MD, PhD University of California, San Francisco

Helping you make better-informed decisions 1-5

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the

Supplementary Online Content

THE SPITZ NEVUS OFTEN POSES

Talk to Your Doctor. Fact Sheet

Diagnosis of melanocytic proliferations remains one of

Growth rate of melanoma in vivo and correlation with dermatoscopic and dermatopathologic findings

MEDICAL POLICY Genetic and Protein Biomarkers for Diagnosis and Risk Assessment of

Management of pediatric melanocytic lesions

Melanocytic Lesions: Use of Immunohistochemistry and Special Studies Napa Valley 2018

Update on Lymph Node Management in Melanoma

Supplementary Information Titles Journal: Nature Medicine

Melanoma and the genes: Molecular alterations informing the diagnosis of melanocytic tumors

STUDY. Sensitivity of Fluorescence In Situ Hybridization for Melanoma Diagnosis Using RREB1, MYB, Cep6, and 11q13 Probes in Melanoma Subtypes

Reviewers' comments: Reviewer #1 (Remarks to the Author):

Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:

Dermatopathology. Dr. Rafael Botella Estrada. Hospital La Fe de Valencia

Update on Spitzoid and Blue nevus-like melanocytic lesions Emphasis on molecular studies informing diagnosis, prognosis and therapy

Gene Expression Profiling for Cutaneous Melanoma

Personalized Therapy for Prostate Cancer due to Genetic Testings

Time to reconsider Spitzoid neoplasms?

Decipher Bladder Predicts Which Patients May Benefit from Neoadjuvant Chemotherapy Prior to Radical Cystectomy

PROSTATE CANCER METRICS

Corporate Medical Policy

Novel Biomarkers (Kallikreins) for Prognosis and Therapy Response in Ovarian cancer

Patnaik SK, et al. MicroRNAs to accurately histotype NSCLC biopsies

PODIATRIC PATHOLOGY REPORT IS;RL;MMR; 1 of 1. A Copy was sent to: DR. JANE DOE 456 SAMPLE BLVD NEW YORK, NY DIAGNOSIS

Artur Zembowicz, MD, PhD; Sung-Eun Yang, MD; Antonios Kafanas, MD; Stephen R. Lyle, MD, PhD

Detection of Anaplastic Lymphoma Kinase (ALK) gene in Non-Small Cell lung Cancer (NSCLC) By CISH Technique

Histotechnological problems in dermatopathology and their possible consequences

Associate Clinical Professor of Dermatology MUSC

The Enigmatic Spitz Lesion

Utility of Adequate Core Biopsy Samples from Ultrasound Biopsies Needed for Today s Breast Pathology

Determination of gene signatures to subgroup melanoma patients using novel branched DNA hybridization assays

WHAT SHOULD WE DO WITH TUMOUR BUDDING IN EARLY COLORECTAL CANCER?

Transform genomic data into real-life results

Molecular Genetics of Paediatric Tumours. Gino Somers MBBS, BMedSci, PhD, FRCPA Pathologist-in-Chief Hospital for Sick Children, Toronto, ON, CANADA

Limitations of nonsurgical treatment modalities. Nonsurgical Treatments (Table V) 1/31/2018

2/6/2018. Original Paradigm. Clonal Chromosomal A berrations. Only 20% of Spitz Nevi 95% 6p, 7q, 17q, 20q, 4q,8q, 1q, 11q. Isolated Gain in 11p

Corporate Medical Policy

Medical Policy An independent licensee of the Blue Cross Blue Shield Association

MEDICAL POLICY Gene Expression Profiling for Cancers of Unknown Primary Site

Published Ahead of Print on December 14, 2009 as /JCO J Clin Oncol by American Society of Clinical Oncology

Melanoma Underwriting Presented at 2018 AHOU Conference. Hank George FALU

Simulators of melanoma

ACE ImmunoID. ACE ImmunoID. Precision immunogenomics. Precision Genomics for Immuno-Oncology

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Good Old clinical markers have similar power in breast cancer prognosis as microarray gene expression profilers q

Dr. dr. Primariadewi R, SpPA(K)

CHAPTER 7 Concluding remarks and implications for further research

Regression 2/3/18. Histologically regression is characterized: melanosis fibrosis combination of both. Distribution: partial or focal!

RNA extraction, RT-PCR and real-time PCR. Total RNA were extracted using

VL Network Analysis ( ) SS2016 Week 3

Melanoma-Back to Basics I Thought I Knew Ya! Paul K. Shitabata, M.D. Dermatopathologist APMG

Interesting Case Series. Desmoplastic Melanoma

Interpretation of Breast Pathology in the Era of Minimally Invasive Procedures

Copy number and somatic mutations drive tumors

Diagnosis of Lentigo Maligna Melanoma. Steven Q. Wang, M.D. Memorial Sloan-Kettering Cancer Center Basking Ridge, NJ

A Genomic Approach to Active Surveillance

Applying Risk Management Principles to QA in Surgical Pathology: From Principles to Practice

Molecular in vitro diagnostic test for the quantitative detection of the mrna expression of ERBB2, ESR1, PGR and MKI67 in breast cancer tissue.

For additional information on meeting the criteria for Mohs, see Appendix 2.

Patricia Chevez-Barrrios AAOOP-USCAP /12/2016

Ways to get into trouble, ideas on avoiding trouble, and diagnostic approaches to keep trouble at bay

Gene Expression Profiling and Protein Biomarkers for Prostate Cancer Management

Prognostic value of an RNA expression signature derived from cell cycle proliferation genes in patients with prostate cancer: a retrospective study

Transcription:

Clinical Validation of a Gene Expression Signature that Differentiates Benign Nevi from Malignant Melanoma Running Title: Validation of Melanoma Diagnostic Assay Loren E. Clarke, M.D. 1 *, M. Bryan Warf, PhD 1 *, Darl D. Flake II, MS 2, Anne-Renee Hartman, M.D. 1, Steven Tahan, M.D. 3, Christopher R. Shea, M.D. 4, Pedram Gerami, M.D. 5, Jane Messina, M.D. 6, Scott R. Florell, M.D. 7, Richard J. Wenstrup, M.D. 1, Kristen Rushton, MBA 1, Kirstin M. Roundy, MS 1, Colleen Rock, PhD 1, Benjamin Roa, PhD 1, Kathryn A. Kolquist, M.D. 1, Alexander Gutin, PhD 2, Steven Billings, M.D. 8, Sancy Leachman, M.D. 9 * Both authors contributed equally to this work. 1-Myriad Genetic Laboratories, Inc., Salt Lake City, UT 2-Myriad Genetics, Inc., Salt Lake City, UT 3-Harvard Medical School and Beth Israel Deaconess Medical Center, Boston, MA 4-The University of Chicago, Chicago, IL 5-Northwestern Memorial Hospital, Chicago, IL 6-Moffitt Cancer Center, Tampa, FL 7-University of Utah, Salt Lake City, UT 8-Cleveland Clinic, Cleveland, OH 9-Oregon Health & Science University, Portland, OR Corresponding Author: Loren E. Clarke 320 Wakara Way Salt Lake City, UT 84108 Phone: 801-883-3470 Fax: 801-584-3099 lclarke@myriad.com

2 ABSTRACT Background Histopathologic examination is sometimes inadequate for accurate and reproducible diagnosis of certain melanocytic neoplasms. As a result, more sophisticated and objective methods have been sought. The goal of this study was to identify a gene expression signature that reliably differentiated benign and malignant melanocytic lesions and evaluate its potential clinical applicability. Here, we describe the development of a gene expression signature and its clinical validation using multiple independent cohorts of melanocytic lesions representing a broad spectrum of histopathologic subtypes. Methods Using quantitative reverse-transcription PCR on a selected set of 23 differentially expressed genes, and by applying a threshold value and weighting algorithm, we developed a gene expression signature that produced a score that differentiated benign nevi from malignant melanomas. Results The gene expression signature classified melanocytic lesions as benign or malignant with a sensitivity of 89% and a specificity of 93% in a training cohort of 464 samples. The signature was validated in an independent clinical cohort of 437 samples, with a sensitivity of 90% and specificity of 91%. Conclusions The performance, objectivity, reliability, and minimal tissue requirements of this test suggest that it could have clinical application as an adjunct to histopathology in the diagnosis of melanocytic neoplasms.not For Distribution

3 INTRODUCTION Skin cancer is the most common cancer worldwide with melanoma being the most fatal form. Accurate diagnosis of the vast majority of melanocytic lesions can be accomplished with conventional light microscopy by a skilled histopathologist. However, numerous studies have reported inter-observer variability in some cases even among experienced dermatopathologists. 1-4 Farmer and colleagues illustrated this discordance with a review of 40 benign and malignant melanocytic lesions; in a panel of eight expert dermatopathologists, diagnostic discordance between two or more panel members was observed in 38% of cases. 1 While the observed frequency of discordance varies across additional studies, evidence suggests it is a significant problem, even among expert dermatopathologists. Moreover, diagnosis by expert consensus may not reliably classify some types of melanocytic tumors. 5, 6 In a study by Cerroni et al, experts sought histopathologic criteria that could reliably classify atypical melanocytic tumors as benign or malignant. 7 Pathologists incorrectly classified 53% of cases with a favorable outcome as malignant and 27% of cases with an unfavorable outcome (local recurrence, metastasis, or death) as benign. Similarly, in a recent study of atypical Spitz tumors, 33% of cases with evidence of advanced disease failed to be diagnosed as melanoma by a panel of experts. 8 Histopathologic assessment may be further limited by the apparent trend toward increasingly smaller biopsies. 9 Conventional histopathologic criteria that differentiate nevus from melanoma (e.g. symmetry, circumscription, maturation) require evaluation of all or most of the neoplasm. Small samples often contain only part of a lesion and may compel even the best diagnostician to withhold a definitive diagnosis and recommend re-excision.

4 Due to these issues, adjunctive methods have been sought to supplement histopathology for the diagnosis of melanocytic neoplasms. These have included comparative genomic hybridization (CGH), fluorescence in situ hybridization (FISH), and microarray technologies. 10-17 Each has shown utility in some situations but limitations still exist. 18, 19 We sought to determine if quantitative reverse transcription PCR (qrt-pcr) could be used to identify groups of genes that are differentially expressed between malignant and benign melanocytic lesions in a manner that could be clinically applicable. We sought solely to explore the gene expression differences between benign nevi and primary melanoma samples (not examining the differences between benign nevi, primary melanomas and metastatic melanomas), as this differentiation would be of the highest clinical value. With a training cohort of 464 melanocytic lesions, we developed a 23-gene expression signature that effectively differentiated benign and malignant melanocytic neoplasms. We then assessed performance of the signature in an independent cohort of 437 malignant melanomas and benign nevi. MATERIALS AND METHODS Sample Cohorts All testing was performed on archival formalin-fixed paraffin-embedded (FFPE) tissue sections of melanocytic lesions. Specimens were selected by experienced dermatopathologists and represented a broad spectrum of clinical and histopathologic subtypes, including banal/common nevi (junctional, compound, and intradermal), junctional and compound dysplastic nevi, Spitz nevi, Reed nevi, blue nevi and other subtypes. Non-melanocytic lesions,

5 metastatic melanomas, and melanocytic lesions not primary to skin were excluded. Re-excision specimens were excluded but melanocytic neoplasms that were inflamed and/or traumatized were included. The training cohort initially contained 595 melanocytic lesions acquired from The University of Munich and Provitro (Berlin, Germany). The validation cohort initially consisted of 571 melanocytic lesions acquired from The Cleveland Clinic, University of South Florida, Northwestern University, and the University of Utah. Each case underwent review by a second expert dermatopathologist (S.T.) who was blinded to the diagnosis of the contributing dermatopathologist. If there was discordance between the diagnoses, the case was reviewed in a blinded manner by a third expert dermatopathologist (C.S.) for adjudication. Regardless of their preferred terminology for various subtypes, the reviewing dermatopathologists were encouraged to designate each case as either benign or malignant (rather than indeterminate or of uncertain malignant potential ) such that difficult cases would not be excluded from the cohorts. Diagnostic concordance was maximized since the contributing dermatopathologists provided lesions for which they felt a definitive diagnosis could be established. This study was conducted with institutional IRB oversight. Measurement of Gene Expression Archival FFPE melanocytic lesions were used for these studies. Each case required one H&E stained slide and 2-5 slides containing a total of 20 µm of unstained tissue, with a minimal requirement of 0.5 mm by 0.1 mm of melanocytic tissue. An area representative of the lesions was identified and circled on the H&E slide by an anatomic pathologist, and the corresponding area was macro-dissected from the unstained tissue slides and pooled into a single tube. RNA was extracted from the tissue using the RNeasy FFPE kit (QIAGEN) [run on a QIACube

6 instrument (QIAGEN), then nuclease treated using DNAse I (Sigma-Aldrich), and quantified on a Nanodrop spectrophotometer (Thermo Scientific)]. RNA samples were normalized to 40 ng/µl, or were tested at the current concentration if <40 ng/µl. RNA was next used for cdna synthesis using the High Capacity cdna Reverse Transcription Kit (Life Technologies). Genes of interest were pre-amplified for 14 cycles, according to the manufacturer s guidelines (Life Technologies) using the TaqMan PreAmp Master Mix and an associated TaqMan amplicon for each target gene [Note, the specific sequence of these primers are proprietary (Life Technologies)]. Finally, TaqMan quantitative PCR was used to measure the expression of each gene using the TaqMan Universal PCR Master Mix and custom TaqMan Low Density Array (TLDA) cards, analyzed on an Applied Biosystems 7900HT instrument. All samples were run in triplicate by dividing each sample into 3 aliquots after cdna synthesis, but prior to preamplification. Expression levels for each gene were calculated using the standard C T method. 20 In brief, the expression values of each gene were measured by determining the C T (crossing threshold) of each gene. The C T of each gene for each sample replicate was normalized by the average C T of housekeeper genes on the same replicate, yielding C T. The C T was calculated by centering the C T values of each gene by the average of the C T values of the gene across all samples in the training cohort. The median replicate C T value was used for each amplicon. Two of the three replicates of each gene on each sample were required to be within 2 C T units of each other to be considered appropriately measured. Development of the Gene Expression Signature Genes with highly correlated expression and similar biological function were averaged into a single measurement to differentiate benign and malignant melanocytic tumors (see

7 Supplemental Materials for more information). The selection of these genes will be addressed in more detail the Results section. Consolidated gene groups and single genes were subjected to forward selection in a series of logistic regression models. The p-value from a likelihood ratio test comparing the models with and without each new variable was used as the criterion for inclusion. Variables were sequentially selected by their ability to enhance the discriminatory power of the previously selected variables, until the addition did not result in significant improvement (Bonferroni adjusted p-value <0.05). The logistic regression model resulting from the final selection of genes was refined to allow for a more realistic fit of the model to the data. Specifically, parameters were added to estimate the false negative and the false positive rates of the best set of genes. Maximum likelihood estimation was used to fit this non-standard model. A single melanoma diagnostic score (MDS) was produced by applying the optimal linear combination of the individual components expression, as derived in the generalized logistic regression model. An MDS of zero (inflection point of the probability curve) was chosen as the threshold to differentiate nevi and melanoma. The signature was further refined to improve its technical robustness by supplementing the measurements of the components in the model that were represented by one measurement (see Supplemental Materials for more detail). A final TLDA card with all the selected amplicons was designed and the component weights were adjusted to make the score from the final TLDA card (after the refinements) match the score from the original TLDA card (prior to the refinements) so that the components with supplemental measurements were not over-weighted. Validation of the Gene Expression Signature

8 Samples from the validation cohort were used to validate the predefined gene expression signature with a pre-defined cutoff of zero to differentiate melanoma and nevi. The association between the MDS and pathologic diagnosis was assessed using the Wilcoxon rank-sum test. Exact 95% confidence intervals were computed for sensitivity and specificity based on the binomial distribution. The MDS was then used to assess the performance of the gene expression signature within specific histopathologic subtypes. Data analysis was only performed on subtypes with 30 samples to avoid issues with sampling artifacts. RESULTS Development of a Multivariate Gene Expression Signature We first identified 79 candidate biomarker genes whose expression might prove useful in a gene signature to differentiate benign nevi and malignant melanoma. These biomarkers were selected because they have been previously observed in literature to have differential expression in benign nevi and malignant melanoma, or were observed to have increased expression in aggressive tumors. The RNA expression of these 79 biomarkers was initially evaluated in a dataset of 83 melanocytic lesions, and the 79 markers were narrowed to a smaller set of the 40 most promising biomarkers that could be evaluated in greater detail. See the Supplemental Materials for additional details on the selection of candidate biomarkers and their initial evaluation. We then evaluated the RNA expression of a refined set of the 40 most promising candidate genes in a large training cohort of 595 melanoma and nevus samples (see Supplemental Materials for details on the selection of these 40 genes). In the initial cohort of 595

9 melanocytic lesions, 51 samples were excluded due to re-excision, lack of sufficient tissue for processing, or because classification as benign or malignant could not be made definitively. Of the remaining 544 samples, 42 were excluded because we could not reliably measure the housekeeping genes expression. Additionally, expression for one or more genes of interest could not be detected in 38 samples. Thus, 464 samples produced sufficient data for analysis. See Table 1 for the histopathology of the samples used in this study. In the 464 sample training cohort, 27 of the 40 candidate genes were able to differentiate the melanoma and nevi samples with an area under the curve (AUC) that was >70% (Supplemental Figure 1). To build a diagnostic signature from this dataset, we first determined which genes had correlated expression and could be assessed as a single averaged component (see Supplemental Materials for more information on this analysis). In brief, genes with both highly correlated expression and similar biological functions were assessed as a single averaged component. All 10 cell cycle progression genes were averaged into one component (Supplemental Table 1), and 8 genes within the immune cluster were averaged in a second component (CCL5, CD38, CXCL10, CXCL9, IRF1, LCP2, PTPRC and SELL). All the remaining genes were assessed on an individual basis. We next used forward selection and traditional logistic regression to explore the performance of various diagnostic models (preliminary signature) that included different combinations of the individual genes and 2 gene components. PRAME expression was the single most effective differentiating feature (P = 1.2x10-68 ) and was the first gene added to the preliminary signature. Through forward selection, the preliminary signature was improved first by the addition of S100A9 and was further improved by the addition of the 8 gene immune component (PRAME: P = 4.5x10-68 S100A9: P = 3.9x10-12 immune average: P = 7.2x10-5 )

1 (Figure 1A). The diagnostic power of the preliminary signature was not improved by incorporating any additional genes, or by adding the cell cycle progression gene component (Supplemental Table 1). Thus, the optimal diagnostic gene signature consisted of three components: PRAME, S100A9, and the eight gene immune component (CCL5, CD38, CXCL9, CXCL10, IRF1, LCP2, PTPRC and SELL). Each component within this signature had a distinct expression profile and was not highly correlated with the expression of the other components (Figure 1B), verifying that each component contributes independent information to the signature. The three-component signature was further refined using generalized logistic regression, which significantly improved the fit (likelihood ratio test P = 6.0x10-4 ). This refined multivariate signature generated a melanoma diagnostic score (MDS) for each sample from the 464 sample training cohort, with a resulting AUC of 95%. Setting a score of zero as the threshold to differentiate nevi and melanoma, the sensitivity of the signature was determined to be 89% and the specificity to be 93% (Supplemental Figure 3). To improve the technical robustness of the signature for a production setting, additional measurements were added to three different components (see the Supplemental Materials for additional information). First, we added another amplicon to measure the expression of PRAME, with the PRAME component of the MDS being the average of the two measurements. Next, four genes with highly correlated expression (S100A7, S100A8, S100A12, and PI3) were added to the S100A9 component, with the resulting S100A9 component of the MDS being the averaged expression of all 5 genes (see Supplemental Materials). Finally, we also increased the number of housekeeper genes from five to nine. In a 77 sample concordance study, we verified that these

1 additions did not alter the performance of the gene expression signature. Thus, the refined gene signature had a total of 24 measurements of 23 different genes. Validation of the Gene Expression Signature The signature was validated in a cohort of 571 samples, derived from four different institutions that were independent from the contributing institutions in the training cohort. An MDS was determined for these samples using the pre-determined score calculation that was developed from the training cohort, with a pre-defined cutoff of zero to differentiate melanoma and nevi. Of the initial cohort of 571 melanocytic lesions, 20 samples were excluded because they were determined to be re-excision specimens. Another 24 were excluded based on insufficient tissue for processing or lack of a consensus diagnosis. Ninety samples did not produce a score based on gene signature expression levels being below the detection threshold. Thus, the final validation cohort was comprised of 437 samples, which represented several melanoma subtypes (n=211), as well as both dysplastic (n=67) and conventional nevi (n=149). See Table 1 for the histopathology of all samples in the validation cohort. [The mean and median Breslow depths of the 211 melanoma samples were 1.74 and 0.82 mm, respectively, with a range of 0.10 to 29.0 mm]. The signature performance was determined in the cohort of 437 samples by comparing the MDS to the consensus histopathologic diagnosis. Using the predefined cutoff score of zero established in the training cohort, the sensitivity and specificity were determined to be 90% (95% CI: 85%-93%) and 91% (95% CI: 87%-95%), respectively (Figure 3). The range of scores from the validation cohort followed a bimodal distribution, with the majority of malignant lesions

1 producing a score greater than zero and the benign lesions a score less than zero (Figure 2). The scores ranged from -16.7 to +11.1. The AUC value was 96% (P = 3.7 x 10-63 ). During the histopathologic review, adjudication review was necessary for 17 of the 437 validation samples, indicating that these 17 cases were histopathologically ambiguous. Nine of these cases had an adjudicated diagnosis of malignant, all of which were classified as malignant by the signature. Of the eight cases for which the adjudicated diagnosis was benign, the signature classified four as benign and four as malignant. Performance of the gene signature was also assessed within specific histopathologic subtypes (Table 3). The specificity in compound and dermal nevi were each similar to the specificity within all nevi. The sensitivity within each melanoma subtype was also similar to the overall sensitivity for all melanoma samples. Thus, the performance of the gene signature within these histopathologic subtypes was similar to the overall performance of the gene signature. The performance of a signature generally decreases upon validation due to over-fitting cohort specific differences between case and control samples in the training dataset. In this study, the performance of the signature was similar in both the training and validation cohorts, indicating that the signature was not significantly over-fit in its development. To confirm this observation, the same generalized logistic regression used in training was used to determine the optimal fit of the signature within the validation cohort. The MDS and the optimal score were nearly identical, with a correlation of 99%.

1 DISCUSSION In this study of melanocytic lesions, the MDS differentiated benign nevi from malignant melanoma with a sensitivity of 90% and a specificity of 91%. Further validation is necessary, but based upon the results in this retrospective cohort, the signature appears applicable to a broad array of melanocytic tumors, including some that might prove challenging to classify by histopathology alone. This is likely due, in part, to the fact that the original training cohort included a broad spectrum of melanocytic lesions, including lesions that were somewhat ambiguous. However, the requirement for diagnostic concordance among two expert dermatopathologists helped to ensure that diagnoses for the studied lesions were accurate. Importantly, the score established using the training cohort was evaluated in an entirely separate validation cohort. Like the training cohort, this validation cohort also included a broad spectrum of histopathologic subtypes. Some were classic examples of various nevus and melanoma subtypes, but lesions from subtypes known to be associated with diagnostic uncertainty were also included. Subset analyses of results by nevus and melanoma subtypes indicated that the signature performed with 94%-98% specificity and 86%-97% sensitivity. While the total number of samples analyzed in common subtypes is large, additional studies will be required to validate the signature s performance in uncommon nevus and melanoma subtypes. Additionally, it should be noted that metastatic lesions were purposefully excluded in these studies. Gene expression patterns within a primary melanoma can change drastically upon metastasis, causing a large number of genes (including some within this signature) to change their expression. 11, 13-15 Thus, this gene expression signature should only be used in the context of differentiating benign nevi and primary melanomas. However, there is significant clinical value in having an adjunctive tool for diagnosing primary melanomas, not metastatic melanomas.

1 The relatively high sensitivity and specificity are encouraging, but given the clinical consequences of misclassifying a malignant melanoma as a benign nevus, it is important for the sensitivity to be maximized. To this end, an indeterminate zone could be introduced. Approximately half of the malignant lesions (5%) that appear to have been misclassified by the signature in the validation cohort have a score between -2 and 0. Excluding all scores in this range from the clinical validation cohort would bring the signature sensitivity to 94%, while the specificity would remain at 90%. This signature differs from other widely used molecular diagnostic methods for melanoma in that the other assays rely upon identification of chromosomal abnormalities in melanocytes. 21-23 Analysis by qrt-pcr can detect changes in gene expression that may not result from gains or losses of DNA. This gene expression signature also contributes additional molecular information by assessing the gene expression of PRAME, a known melanoma tumor antigen. For example, PRAME expression is strongly reduced in the melanocytes of benign nevi, 11 but significantly higher in 88% of primary melanomas and 95% of metastatic melanomas. 24 However, its expression appears to result from changes in the methylation status rather than chromosomal gains or deletions. 24 In addition, it is becoming clear that the malignant potential of melanoma is at least partially determined by the interaction of the tumor with its microenvironment. The fact that both S100A9 and the other immune signaling gene group are critical components of the signature appears to reflect the importance of this interaction. Certain S100 protein subtypes, particularly S100A8 and S100A9, play a role in the inflammatory response to many types of neoplasms. 25 Melanomas expressing CCL5, CXCL9, and CXCL10 have been associated with prolonged patient

1 survival and are more likely to respond to ipilimumab, a human monoclonal antibody that 26, 27 improves overall survival in metastatic melanoma patients. One potential limitation of the current study is that it was carried out with archived FFPE tissue. mrna extracted from archived FFPE samples is more prone to fragmentation, when compared to recently prepared FFPE lesions. This might explain the relatively high sample failure rate of the signature on older samples. Indeed, we observed a much lower failure rate in contemporary samples (data not shown). Samples with low scores have an increased chance of failure if the mrna is from an older sample and is degraded, which could bias estimates of the sensitivity and specificity. To mitigate this bias, both the training and validation cohorts consisted of a variety of older archival samples and newer contemporary samples. Given the prevalence of melanocytic neoplasms that are ambiguous or difficult to classify by histopathology alone, there is a great need for adjunctive tests that could enhance reproducibility and diagnostic accuracy. The development and validation of the gene expression signature presented here consists of a total of 901 samples. The strong agreement between the performance of both the training and validation cohorts suggests that this method accurately assesses the performance of the gene expression signature to differentiate malignant melanoma and benign nevi. Additional outcomes-based and prospective studies are needed to further assess the performance of this test.

1 REFERENCES 1. Farmer ER, Gonin R, Hanna MP. Discordance in the histopathologic diagnosis of melanoma and melanocytic nevi between expert pathologists. Hum Pathol 1996;27:528-31. 2. McGinnis KS, Lessin SR, Elder DE, et al. Pathology review of cases presenting to a multidisciplinary pigmented lesion clinic. Arch Dermatol 2002;138:617-21. 3. Shoo BA, Sagebiel RW, Kashani-Sabet M. Discordance in the histopathologic diagnosis of melanoma at a melanoma referral center. J Am Acad Dermatol 2010;62:751-6. 4. Veenhuizen KC, De Wit PE, Mooi WJ, et al. Quality assessment by expert opinion in melanoma pathology: experience of the pathology panel of the Dutch Melanoma Working Party. J Pathol 1997;182:266-72. 5. Barnhill RL, Argenyi ZB, From L, et al. Atypical Spitz nevi/tumors: lack of consensus for diagnosis, discrimination from melanoma, and prediction of outcome. Hum Pathol 1999;30:513-20. 6. Pusiol T, Morichetti D, Piscioli F, Zorzi MG. Theory and practical application of superficial atypical melanocytic proliferations of uncertain significance (SAMPUS) and melanocytic tumours of uncertain malignant potential (MELTUMP) terminology: experience with second opinion consultation. Pathologica 2012;104:70-7. 7. Cerroni L, Barnhill R, Elder D, et al. Melanocytic tumors of uncertain malignant potential: results of a tutorial held at the XXIX Symposium of the International Society of Dermatopathology in Graz, October 2008. Am J Surg Pathol 2010;34:314-26. 8. Gerami P, Busam K, Cochran A, et al. Histomorphologic Assessment and Interobserver Diagnostic Reproducibility of Atypical Spitzoid Melanocytic Neoplasms With Long-term Follow-up. Am J Surg Pathol 2014. 9. Fernandez EM, Helm T, Ioffreda M, Helm KF. The vanishing biopsy: the trend toward smaller specimens. Cutis 2005;76:335-9. 10. Bangash HK, Romegialli A, Dadras SS. What's new in prognostication of melanoma in the dermatopathology laboratory? Clin Dermatol 2013;31:317-23.

1 11. Haqq C, Nosrati M, Sudilovsky D, et al. The gene expression signatures of melanoma progression. Proc Natl Acad Sci U S A 2005;102:6092-7. 12. Koh SS, Opel ML, Wei JP, et al. Molecular classification of melanomas and nevi using gene expression microarray signatures and formalin-fixed and paraffin-embedded tissue. Mod Pathol 2009;22:538-46. 13. Mauerer A, Roesch A, Hafner C, et al. Identification of new genes associated with melanoma. Exp Dermatol 2011;20:502-7. 14. Scatolini M, Grand MM, Grosso E, et al. Altered molecular pathways in melanocytic lesions. Int J Cancer 2010;126:1869-81. 15. Smith AP, Hoek K, Becker D. Whole-genome expression profiling of the melanoma progression pathway reveals marked molecular differences between nevi/melanoma in situ and advanced-stage melanomas. Cancer Biol Ther 2005;4:1018-29. 16. Talantov D, Mazumder A, Yu JX, et al. Novel genes associated with malignant melanoma but not benign melanocytic lesions. Clin Cancer Res 2005;11:7234-42. 17. Wachsman W, Morhenn V, Palmer T, et al. Noninvasive genomic detection of melanoma. Br J Dermatol 2011;164:797-806. 18. Pawitan Y, Michiels S, Koscielny S, Gusnanto A, Ploner A. False discovery rate, sensitivity and sample size for microarray studies. Bioinformatics 2005;21:3017-24. 19. Song J, Mooi WJ, Petronic-Rosic V, et al. Nevus versus melanoma: to FISH, or not to FISH. Adv Anat Pathol 2011;18:229-34. 20. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 2001;25:402-8. 21. Gerami P, Jewell SS, Morrison LE, et al. Fluorescence in situ hybridization (FISH) as an ancillary diagnostic tool in the diagnosis of melanoma. Am J Surg Pathol 2009;33:1146-56. 22. Bastian BC, Olshen AB, LeBoit PE, Pinkel D. Classifying melanocytic tumors based on DNA copy number changes. Am J Pathol 2003;163:1765-70.

1 23. Bauer J, Bastian BC. Distinguishing melanocytic nevi from melanoma by DNA copy number changes: comparative genomic hybridization as a research and diagnostic tool. Dermatol Ther 2006;19:40-9. 24. Ikeda H, Lethe B, Lehmann F, et al. Characterization of an antigen that is recognized on a melanoma showing partial HLA loss by CTL expressing an NK inhibitory receptor. Immunity 1997;6:199-208. 25. Srikrishna G. S100A8 and S100A9: new insights into their roles in malignancy. J Innate Immun 2012;4:31-40. 26. Ji RR, Chasalow SD, Wang L, et al. An immune-active tumor microenvironment favors clinical response to ipilimumab. Cancer Immunol Immunother 2012;61:1019-31. 27. Ulloa-Montoya F, Louahed J, Dizier B, et al. Predictive gene signature in MAGE-A3 antigen-specific cancer immunotherapy. J Clin Oncol 2013;31:2388-95.

1 Table 1. Distribution of melanocytic lesions by subtype. Melanocytic Lesions Training Cohort Validation Total Melanomas Superficial Spreading 167 105 272 Nodular 23 38 61 Acral 20 9 29 Lentigo Maligna / Lentigo Maligna Melanoma 39 31 70 Other 5 28 33 Total 254 211 465 Nevi* Compound 68 101 169 Junctional 38 20 58 Intradermal 28 41 69 Spitz 34 7 41 Blue 38 22 60 Other 4 35 39 Total 210 226 436 * Includes dysplastic nevi [n=117 (Training) and n=67 (Validation)].

2 Table 2. List of genes included in the final multivariate signature. Genes Component 1 Component 2 Component 3 PRAME* S100A9 CCL5 S100A7 CD38 S100A8 CXCL10 S100A12 CXCL9 PI3 IRF1 LCP2 PTPRC SELL * PRAME gene expression represents the average of two amplicon measurements. These genes were added to the gene expression signature after evaluation of the signature with the training cohort. These eight immune genes were evaluated as an averaged group in the multivariate signature. Housekeeping genes included: CLTC, MRFAP1, PPP2CA, PSMA1, RPL13A, RPL8, RPS29, SLC25A3, and TXNL1.

2 Table 3. Performance of the Signature within individual subtypes* Pathologist Classification Signature Classification Signature Performance Malignant Benign Sensitivity Specificity All Melanomas 90% Superficial Spreading 90 15 86% Nodular 37 1 97% Lentigo Maligna 28 3 90% All Nevi 91% Compound 6 95 94% Intradermal 1 40 98% *Results reported only for subtypes with 30 samples. Compound nevi group contained 52 dysplastic nevi.

2 Figure 1. Distribution of the gene expression for the three components in the best performing multivariate model. A) The individual distribution of RNA expression in benign (grey) and malignant (black) samples in the training cohort for each of the three components. B) Comparison of the RNA expression of each component to each of the other two components in the training cohort. The low correlation (cor) of each comparison is noted.

2 Figure 2. Distribution of diagnostic scores in the clinical validation cohort.

2 Figure 3. ROC curve of diagnostic scores in the clinical validation cohort.

2 Clinical Validation of a Gene Expression Signature that Differentiates Benign Nevi from Malignant Melanoma Supplemental Methods and Results Selection and Initial Evaluation of 79 Candidate Biomarker Genes We identified and evaluated 79 candidate genes from published literature whose RNA expression had the potential to differentiate benign and malignant melanocytic tumors (Supplemental Table 1). These genes were selected because they have been observed to have differential expression in benign nevi and malignant melanoma, or have increased expression in more aggressive cancers. 1-9 This list includes genes with known immune functions, cell cycle progression (CCP) genes, as well as genes that regulate cellular differentiation, Notch signaling, migration, fat metabolism and the cytoskeleton. We evaluated the RNA expression of these 79 biomarkers in 83 melanocytic lesions (52 benign and 31 malignant lesions) using quantitative RT-PCR (see Quantification of Gene Expression in Methods). Some of the genes were able to differentiate between the melanoma and nevus samples in a highly significant manner. Genes were ranked for individual performance to facilitate selection of genes for the next round of studies. By using a conservative area under the curve (AUC) cutoff of 70%, we chose 40 genes for further assessment (Supplemental Table 2). All CCP genes were similarly effective when differentiating benign and malignant melanocytic lesions (AUC >70%) and had highly correlated expressions. Thus, only the 10 most effective CCP genes were evaluated. 1, 9 The 30 most effective non-ccp genes with an AUC >70% were also chosen for additional evaluation. All genes with an AUC <70% or with

2 insufficient technical performance (missing expression values in a large number of samples) were excluded from further study. Cluster Analysis and Consolidation of Correlated Genes Since many of the 40 genes evaluated in the training cohort are involved in similar biological pathways, we determined which of the 40 genes had correlated expression across the 544 training cohort by performing a clustering analysis. Genes with both highly correlated gene expression and similar biological functions were then consolidated into averaged components for assessment during the development of the gene expression signature. To perform the cluster analysis, one minus the absolute value of the pair-wise Spearman correlation between genes was used as the distance metric and Ward s method was used as the criterion to form the clusters. We observed two different groups of strongly correlated genes, with the remaining genes forming a third loose cluster (Supplemental Figure 2). The S100A9 and all 10 CCP genes formed one cluster. This was expected as these CCP genes have been observed to have highly correlated expression in other cancer. 1, 9, 10 It was not expected for the CCP genes to cluster with S100A9, which is a multifunctional protein that is thought to be involved in various immune pathways. However, the average correlation of S100A9 with each of the CCP genes was less than the average pairwise correlation between CCP genes, and the range in expression of S100A9 was much greater than any of the CCP genes. Thus, due to functional differences and a lower degree of correlation with the CCP genes, S100A9 was further evaluated on an individual basis. In contrast, all 10 CCP gene expression values were averaged into a single component for further analysis, as we have also 1, 9, 10 done in previous studies.

2 The second main cluster consisted of 12 genes with known immune functions. This subset of genes could represent a specific immune pathway (as opposed to a generic immune signature), as only 12 of the 23 genes with known immune functions are clustered within this group. Similar to the CCP genes, 10 of the 12 immune genes had highly correlated expression across the 544 samples, and we excluded these two genes that were not highly correlated (HCLS1 and PECAM1) from an averaged gene group. The PTPN22 and CXCL13 genes were also excluded from an averaged group, due to insufficient technical performance (i.e. too many samples missing values, due to low expression and/or technical failures). Thus, we created an averaged group of eight immune genes for further analysis, consisting of the genes CCL5, CD38, CXCL9, CXCL10, IRF1, LCP2, PTPRC and SELL. The four immune genes excluded from this averaged set were evaluated on an individual basis. The remaining 17 genes were within a loose miscellaneous cluster that consisted of genes with a variety of known functions, such as immune response, signaling, structural regulation, fat metabolism and cell differentiation. As this group was only loosely clustered and did not have functional similarities, each gene within the cluster was evaluated on an individual basis. Gene Signature Refinement The benefit of the eight gene immune component is that eight averaged measurements improve the technical reproducibility of that component. We wanted to also improve the technical robustness of the PRAME and S100A9 components by similarly having multiple measurements for each gene. To this end, we added a second TaqMan amplicon to measure PRAME, with the PRAME measurement being an average of the two amplicons. Unfortunately, no other TaqMan amplicons were available for S100A9, so we added measurements of four genes

2 that were highly correlated with S100A9, which were S100A7, S100A8, S100A12 and PI3 (which were not initially evaluated in these studies). This created an averaged five gene S100A9-related component. To improve the technical robustness of the normalization genes, we also increased the number of housekeeper genes from five to nine, using additional housekeeper genes that are well characterized in previous studies. 1, 9, 10 We wanted to verify that these refinements did not alter the performance of the signature, so a concordance study was performed on 77 samples. We compared the data generated with and without the technical refinements and observed a 99% concordance when comparing samples across the two datasets. This indicates the technical improvements do not alter the performance of the gene signature. Thus, the refined gene signature had a total of 24 measurements, from 23 different genes: PRAME (two averaged measurements), an averaged group of five S100A9-related genes (S100A7, S100A8, S100A9, S100A12 and PI3), an averaged group of eight immune genes (CCL5, CD38, CXCL10, CXCL9, IRF1, LCP2, PTPRC and SELL), and nine averaged housekeeper genes used for normalization.

2 Supplemental Table 1. List of the 79 candidate biomarker genes ARPC2 CXCL12 IRF1 PRC1 ASF1B CXCL13 IRF4 PTN ASPM CXCL9 ITGB2 PTPN22 BCL2A1 DLGAP5 KIAA0101 PTPRC BIRC5 DTL KIF11 PTTG1 BUB1B FABP7 KIF20A RAD51 CCL19 FN1 KRT15 RAD54L CCL3 FOXM1 LCP2 RGS1 CCL5 GDF15 MCM10 RRM2 CD38 HCLS1 NCOA3 S100A9 CDC20 HEY1 NR4A1 SELL CDCA3 HLA-DMA NUSAP1 SERPPINB4 CDCA8 HLA-DPA1 ORC6L SKA1 CDK1 HLA-DPB1 PBK SOCS3 CDKN3 HLA-DRA PECAM1 SPP1 CENPF HLA-E PHACTR1 TK1 CENPM IFI6 PHIP TOP2A CEP55 IGHM PLK1 WIF1 CFH IGJ POU5F1 WNT2 CXCL10 IGLL5;CKAP2 PRAME

3 Supplemental Table 2. Performance of the candidate diagnostic genes in a multivariate model using the training cohort. Gene Function P-value in multivariate model CENPF Cell Cycle Progression NS CEP55 Cell Cycle Progression NS DLGAP5 Cell Cycle Progression NS DTL Cell Cycle Progression NS FOXM1 Cell Cycle Progression NS MCM10 Cell Cycle Progression NS PBK Cell Cycle Progression NS PLK1 Cell Cycle Progression NS RRM2 Cell Cycle Progression NS SKA1 Cell Cycle Progression NS PRAME Cell Differentiation 4.5 x 10-28 PTN Cell Differentiation NS FABP7 Fat Metabolism NS S100A9 Immune 3.9 x 10-12 CCL5 Immune 7.2 x 10-5 * CD38 Immune 7.2 x 10-5 * CXCL10 Immune 7.2 x 10-5 * CXCL9 Immune 7.2 x 10-5 * IRF1 Immune 7.2 x 10-5 * LCP2 Immune 7.2 x 10-5 * PTPRC Immune 7.2 x 10-5 * SELL Immune 7.2 x 10-5 * BCL2A1 Immune NS CCL3 Immune NS CFH Immune NS CXCL13 Immune NS HCLS1 Immune NS HLA-DMA Immune NS HLA-DRA Immune NS IFI6 Immune NS IGJ Immune NS ITGB2 Immune NS PECAM1 Immune NS PTPN22 Immune NS RGS1 Immune NS SPP1 Immune NS FN1 Signaling NS HEY1 Signaling NS KRT15 Structural NS PHACTR1 Structural NS P-values are Bonferonni adjusted. NS, not significant (p-value >0.05). All 10 cell cycle progression genes were evaluated as an averaged group in the multivariate model. * These eight immune genes were evaluated as averaged group in the multivariate model.

Supplemental Figure 1. Performance of the 40 genes tested in the training cohort. The performance of each gene is measured by the AUC of each gene when differentiating benign and malignant melanocytic lesions.

Supplemental Figure 2. Clustering of the gene expression of the 40 genes evaluated in the training cohort. The three main clusters are annotated. CCP=cell cycle progression.

Supplemental Figure 3. Performance of the best multivariate model in the training cohort. A) Distribution of the diagnostic score from the model in malignant and benign samples. B) The ROC curve of the model, with the sensitivity and specificity displayed at the chosen cutoff point. The AUC of the ROC is also noted.

REFERENCES 1. Cuzick J, Swanson GP, Fisher G, Brothman AR, Berney DM, Reid JE et al. Prognostic value of an RNA expression signature derived from cell cycle proliferation genes in patients with prostate cancer: a retrospective study. Lancet Oncol 2011;12:245-55. 2. Haqq C, Nosrati M, Sudilovsky D, Crothers J, Khodabakhsh D, Pulliam BL et al. The gene expression signatures of melanoma progression. Proc Natl Acad Sci U S A 2005;102:6092-7. 3. Koh SS, Opel ML, Wei JP, Yau K, Shah R, Gorre ME et al. Molecular classification of melanomas and nevi using gene expression microarray signatures and formalin-fixed and paraffin-embedded tissue. Mod Pathol 2009;22:538-46. 4. Mauerer A, Roesch A, Hafner C, Stempfl T, Wild P, Meyer S et al. Identification of new genes associated with melanoma. Exp Dermatol 2011;20:502-7. 5. Scatolini M, Grand MM, Grosso E, Venesio T, Pisacane A, Balsamo A et al. Altered molecular pathways in melanocytic lesions. Int J Cancer 2010;126:1869-81. 6. Smith AP, Hoek K, Becker D. Whole-genome expression profiling of the melanoma progression pathway reveals marked molecular differences between nevi/melanoma in situ and advanced-stage melanomas. Cancer Biol Ther 2005;4:1018-29. 7. Talantov D, Mazumder A, Yu JX, Briggs T, Jiang Y, Backus J et al. Novel genes associated with malignant melanoma but not benign melanocytic lesions. Clin Cancer Res 2005;11:7234-42. 8. Wachsman W, Morhenn V, Palmer T, Walls L, Hata T, Zalla J et al. Noninvasive genomic detection of melanoma. Br J Dermatol 2011;164:797-806.

9. Wistuba, II, Behrens C, Lombardi F, Wagner S, Fujimoto J, Raso MG et al. Validation of a proliferation-based expression signature as prognostic marker in early stage lung adenocarcinoma. Clin Cancer Res 2013;19:6261-71. 10. Cuzick J, Berney DM, Fisher G, Mesher D, Moller H, Reid JE et al. Prognostic value of a cell cycle progression signature for prostate cancer death in a conservatively managed needle biopsy cohort. Br J Cancer 2012;106:1095-9.