SUPPLEMENTARY APPENDIX

Similar documents
Nature Methods: doi: /nmeth.3115

Nature Immunology: doi: /ni Supplementary Figure 1. RNA-Seq analysis of CD8 + TILs and N-TILs.

Impact of Prognostic Factors

Melanoma Update: 8th Edition of AJCC Staging System

SUPPLEMENTARY FIGURES: Supplementary Figure 1

Supplemental materials

Patnaik SK, et al. MicroRNAs to accurately histotype NSCLC biopsies

A Versatile Algorithm for Finding Patterns in Large Cancer Cell Line Data Sets

Digitizing the Proteomes From Big Tissue Biobanks

Genetic Testing: When should it be ordered? Julie Schloemer, MD Dermatology

SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer.

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the

Supplementary Figures

Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles

Gene-microRNA network module analysis for ovarian cancer

Supplementary Figure 1. Metabolic landscape of cancer discovery pipeline. RNAseq raw counts data of cancer and healthy tissue samples were downloaded

Nature Medicine: doi: /nm.3967

Journal: Nature Methods

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

ARTICLE RESEARCH. Macmillan Publishers Limited. All rights reserved

Dermatopathology: The tumor is composed of keratinocytes which show atypia, increase mitoses and abnormal mitoses.

Benign and malignant epithelial lesions: Seborrheic keratosis: A common benign pigmented epidermal tumor occur in middle-aged or older persons more

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Malignant tumors of melanocytes : Part 3. Deba P Sarma, MD., Omaha

Introduction to Discrimination in Microarray Data Analysis

Supplementary Materials for

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

PBZ FT01_PBZ FT01_TZ FT01_NZ. interface zone (I) tumor zone (TZ) necrotic zone (NZ)

SUPPLEMENTARY INFORMATION

Supplementary Tables. Supplementary Figures

SUPPLEMENTARY INFORMATION

6/22/2015. Original Paradigm. Correlating Histology and Molecular Findings in Melanocytic Neoplasms

underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The

Update on 8 th Edition Cutaneous AJCC Staging of Primary Cutaneous Melanoma. Michael T. Tetzlaff MD, PhD

SUPPLEMENTARY INFORMATION

User s Manual Version 1.0

Identifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma.

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Integrative analysis of survival-associated gene sets in breast cancer

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Page 1 of 3. We suggest the following changes:

Cancer Informatics Lecture

Putative effectors for prognosis in lung adenocarcinoma are ethnic and gender specific

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni.

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

<10. IL-1β IL-6 TNF + _ TGF-β + IL-23

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Breslow Thickness and Clark Level Evaluation in Albanian Cutaneous Melanoma

An Overview of Melanoma. Harriet Kluger, M.D. Associate Professor Section of Medical Oncology Yale Cancer Center

Evaluation of public cancer datasets and signatures identifies TP53 mutant signatures with robust prognostic and predictive value

- 1 - Reviewers' comments: Reviewer #1 (Remarks to the Author):

Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA

Chapter 1. Introduction

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

المركب النموذج--- سبيتز وحمة = Type Spitz's Nevus, Compound SPITZ NEVUS 1 / 7

COMPUTATIONAL OPTIMISATION OF TARGETED DNA SEQUENCING FOR CANCER DETECTION

Expanded View Figures

Predicting Diabetes and Heart Disease Using Features Resulting from KMeans and GMM Clustering

SCATTER PLOTS AND TREND LINES

Pathology of the skin. 2nd Department of Pathology, Semmelweis University

F006 Imaging in Dermatology Melanocytic Neoplasia Clinical-Confocal-Pathological-Correlations

DNA methylation signatures for 2016 WHO classification subtypes of diffuse gliomas

Early Learning vs Early Variability 1.5 r = p = Early Learning r = p = e 005. Early Learning 0.

Desmoplastic Melanoma R/O BCC. Clinical Information. 74 y.o. man with lesion on left side of neck r/o BCC

I have no relevant conflicts of interest to disclose. John T. Seykora MD PhD Departments of Dermatology & Pathology and Laboratory Medicine

SubLasso:a feature selection and classification R package with a. fixed feature subset

SUPPLEMENTARY INFORMATION

Abrupt Intralesional Color Change on Dermoscopy as a New Indicator of Early Superficial Spreading Melanoma in a Japanese Woman

Protocol applies to melanoma of cutaneous surfaces only.

Transcriptional Profiles from Paired Normal Samples Offer Complementary Information on Cancer Patient Survival -- Evidence from TCGA Pan-Cancer Data

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Accepted Manuscript. Pancreatic Cancer Subtypes: Beyond Lumping and Splitting. Andrew J. Aguirre

Identification of Tissue Independent Cancer Driver Genes

MIR retrotransposon sequences provide insulators to the human genome

Melanoma-Back to Basics I Thought I Knew Ya! Paul K. Shitabata, M.D. Dermatopathologist APMG

Expert-guided Visual Exploration (EVE) for patient stratification. Hamid Bolouri, Lue-Ping Zhao, Eric C. Holland

Supplementary. properties of. network types. randomly sampled. subsets (75%

Patient age and cutaneous malignant melanoma: Elderly patients are likely to have more aggressive histological features and poorer survival

MECHANISMS OF HUMAN DISEASE: LABORATORY SESSION PATHOLOGY OF THE SKIN LAB. Friday, February 12, :30 am 11:00 am

Analysis of skin basalioma and melanoma by multispectral imaging

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

Self reported ethnicity

Nature Neuroscience: doi: /nn Supplementary Figure 1

Histopathology of Melanoma

Supplementary Figures

10/15/2012. Biologic Subtypes of TNBC. Topics. Topics. Histopathology Molecular pathology Clinical relevance

Expanded View Figures

Supplementary Figure 1: Classification scheme for non-synonymous and nonsense germline MC1R variants. The common variants with previously established

National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant

(a) Significant biological processes (upper panel) and disease biomarkers (lower panel)

ASMS 2015 ThP 459 Glioblastoma Multiforme Subtype Classification: Integrated Analysis of Protein and Gene Expression Data

Associate Clinical Professor of Dermatology MUSC

ELECTRON MICROSCOPY OF A SMALL PIGMENTED CUTANEOUS LESION*

MicroRNA expression profiling and functional analysis in prostate cancer. Marco Folini s.c. Ricerca Traslazionale DOSL

NOD-like receptor signaling and inflammasome-related pathways are highlighted in psoriatic epidermis

Supplementary Materials for

Clinical characteristics

Transcription:

SUPPLEMENTARY APPENDIX 1) Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort 2) Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature into Differential Gene Expression and Functional Pathway Analysis 3) Supplemental Figure 3. Differentially Expressed Endogenous Elements between Melanoma vs. Nevus Subsets 4) Supplemental Figure 4. Identification of the Epgn3 (High-Risk) Cluster in Independent Melanoma Datasets 5) Supplemental Figure 5. Prognostic Value of the 122-Epigenetic Gene Signature 6) Supplemental Figure 6. TP53, TP63, and TP73 Correlate with LINE1 and the Epigenetic Signature

Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort Figure S1. H&E staining of representative examples of the tumors samples within our cohort are shown. (A, B) Common acquired nevus (intradermal melanocytic nevus). (C, D) Primary melanoma, nodular type, with multifocal brisk tumor infiltrating lymphocytes located at the periphery. (E, F) Primary melanoma, nodular type, with segmental brisk tumor infiltrating lymphocytes throughout the entire neoplasm. (G, H) Primary melanoma, nodular type, with focal scattered non-brisk tumor infiltrating lymphocytes.

Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature into Differential Gene Expression and Functional Pathway Analysis Skin biopsies of melanocytic nevus and primary melanoma of the skin harbor cell types other than nevus and melanoma cells, respectively (i.e. epidermal keratinocytes and stromal cells). Moreover, nevus and melanoma cell contents vary across different samples. Although all our samples had an overlying epidermis and the nevus samples were selected to harbor large melanocytic content (compound and intradermal nevi), dissecting the transcriptional signal output of different cell types within human tumors can be challenging. In order to measure the influence of variable melanocyte content on differential gene expression, we used the results of Reemann, et al. (1) and defined gene expression signatures of normal epidermal melanocytes (normal melanocytic signature, NMS) relative to dermal fibroblasts (FB), epidermal keratinocytes (KC) and whole skin (WS). Additionally, we utilized the RNA-sequencing data of the TCGA (http://cancergenome.nih.gov/) as well as RNA-sequencing of normal skin (NS-extremely low melanocytic content) and dysplastic compound nevus (DN-low melanocytic content) samples that were collected as a part of our study. We performed differential expression analysis using limma-voom for regional cutaneous metastasis (RCM), regional lymph node metastasis (RLNM), and distant metastasis to skin (DMs). For this analysis, we considered only genes with count per million > 10 in at least 20 samples. We considered only DEG with Benjamini-Hochberg corrected P value < 0.005 and a fold change > 1.5. We performed PCA plots to confirm effective differentiation of subtypes and annotated genes by melanocytic expression in the NMS. For each pair of subtypes, we determined similarity of the set of differentially expressed genes (DEG) to the NMS. We confirmed that for all subtypes, our DEGs were significantly enriched for NMS genes by Fisher s exact test. We refer to the NMS genes present in our DEGs as NMS/DEG. We next examined whether the over- or under-expression of DEGs of a given subtype was aligned with the over-/under-expression of genes expressed on melanocytes. To this end, we compared the principal component analysis (PCA) plots of factor scores and loadings. For each pair of subtypes, the plot of the factor scores demonstrated that samples are clustered by subtype predominantly along the first component. Similarly, the corresponding plot of variable loadings also showed that all DEGs are clustered into over- and under-expressed groups of genes mostly along the first principal component. To better determine how well these genes clustered along the first principal component, we performed linear discriminant analysis on the variable loadings with differential expression by

subtype as a class label. The resulting linear discriminant describes a supervised projection of the genes that explicitly maximizes the distance between the centroids of the two classes, unlike PCA that preserves covariance and thereby clusters genes in an unsupervised manner. The decision boundary separating the two classes is orthogonal to the linear discriminant and is depicted in as a solid black line in Figure S2. A decision boundary that is also orthogonal to the first principal component indicates that the linear discriminant and the first principal component coincide, and that clustering by subtype is maximal along the first principal component. For each comparison of subtypes, we determined whether the two clusters of over- and underexpressed DEGs coincided with the over- or under-expressed melanocytic genes. We inferred that clustering of over- or under-expressed NMS/DEG gene along the first principal component (or linear discriminant) would indicate that differential expression of that pair of subtypes is heavily influenced by the melanocytic content. Conversely, clustering of over- or under-expressed NMS/DEG genes along the second component (or linear discriminant decision boundary) would suggest that melanocytic content did not play an essential role in the differentiation between subtypes. To quantify how strongly NMS/DEG genes contribute to each component, we considered the two distributions of over- or under-expressed NMS/DEG genes projected onto the given principal component. We calculated the corresponding Kolmogorov-Smirnov statistic KS n as a measure for how much principal component n explains normal melanocytic expression in the two samples. Plotting KS1 and KS2 for each pair of subtypes revealed two distinct clusters, as highlighted by a minimum-distance decision boundary. In the dense cluster with high KS1 and low KS2, the first (and second) principal component maximally (and minimally) explained the differentiation of both subtypes (i.e. T4 vs. T1) and NMS (i.e. MC vs. WS). On the other hand, the second cluster characterized by either high KS2 or low KS1 included comparisons between PM vs. CAN, DN vs. NS, RCM vs. PM, and RLNM vs. PM. Similar results were found for KS statistics corresponding to the projected distribution of genes on the linear discriminant and orthogonal decision boundary. We therefore concluded that differential gene expression in subtype comparisons with high KS2 or low KS1 were not heavily influenced by the NMS, likely due to a smaller difference in the melanocytic cell content.

Figure S2. (A) Selected PCA plots of PM vs. CAN (first row), RCM vs. PM (second row), PM vs. NS (third row), and T4 vs. T1 (fourth row). Left column: Factor scores show clustering of samples by subtype. Right column: Variable loadings show clustering of genes by differential expression across subtype, with genes annotated by melanocytic expression in green and red. (B) Plot of Kolmogorov-Smirnov statistics (top) along the first and second principal components (bottom) along the linear discriminant and orthogonal decision boundary.

Supplemental Figure 3. Differentially Expressed Endogenous Elements between Melanoma vs. Nevus Subsets Figure S3. Heatmap represents differentially expressed noncoding repetitive element transcripts (endogenous retroviruses (ERVs), short interspersed nuclear elements (SINEs), and long interspersed nuclear elements (LINEs).

Supplemental Figure 4. Identification of the Epgn3 (high-risk) Cluster in Independent Melanoma Datasets Figure S4. TCGA dataset of melanomas (n = 473) were downloaded from https://cancergenome.nih.gov/. (A) and (B) Samples were selected based on sample type as primary tumor or metastasis. TMM normalization was utilized across the samples. Heatmaps represent unsupervised hierarchical clustering of the tumor samples using the 122-epigenetic gene signature.

Supplemental Figure 5. Prognostic Significance of the 122-Epigenetic Gene Signature on Patient Survival

Figure S5. (A) The discovery cohort consisted of 51 primary melanoma patients. We excluded the outlier group that consisted of a few number of cases (Epgn2 group, n = 6) and examined the resultant 45 patients that clustered within either Epgn1 (low-risk) or Epgn3 (high-risk) groups. Survival outcome data was available in forty of the 45 patients. Relatively thin tumors (T1/T2) fell mostly in the Epgn1 group (Epgn1 n =15, Epgn3 n =1) whereas the thicker tumors (T3/T4) had a uniform distribution of Epgn1 (n =11) and Epgn3 (n =13) samples. (B) Thus, we first examined the 5-year survival of T3/T4 cases stratified by the Epgn1/3 classification and found a significant prognostic value on patient outcomes (P = 0.000173). Of importance, none of the patients in the Epgn1 (low-risk) group died within this time frame. (C) When the 5-year survival rates of T3/T4 patients were stratified by tumor thickness categories (thinner tumors n =11, thicker tumors n =13), as expected, we found that tumor thickness showed a significant prognostic value (P = 0.0221). Of note, some patients in the Epgn1 group (low-risk) died within the 5-year time interval. (D-E) Similar analysis was applied to the external dataset (2). Relatively thin tumors (T1) consisted of Epgn1 cases (Epgn1 n =7, Epgn3 n =1), whereas the thicker tumors (T2/T3/T4) had a uniform distribution (Epgn1 n =14, Epgn3 n =22). When stratified either by Epgn1/3 or tumor thickness categories, Epgn1/3 classification performed better in predicting patient outcome (P = 00358); all patients in the Epgn3 group died within this interval. Log-rank test was used for the analyses.

Supplemental Figure 6. TP53, TP63, and TP73 Correlate with LINE1 and the Epigenetic Signature Figure S6. Expression correlation matrix of 126 total genes in primary melanoma samples from the discovery cohort (Epgn1 and Epgn3 samples, n = 45). Positive correlation is indicated by increasing red color and negative correlation by increasing blue color intensity. 1. Reemann, P., et al. 2014. Melanocytes in the skin- - comparative whole transcriptome analysis of main skin cell types. PLoS One 9:e115717. 2. Raskin, L., et al. 2013. Transcriptome profiling identifies HMGA2 as a biomarker of melanoma progression and prognosis. J Invest Dermatol 133:2585-2592.