Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome

Similar documents
High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes

Predicting clinical outcomes in neuroblastoma with genomic data integration

A General Workflow to Analyze Key Cancer-Related Genes Based on NCBI illustrated by the case of gastric cancer

Multiplexed Cancer Pathway Analysis

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

GENE EXPRESSION AND microrna PROFILING IN LYMPHOMAS. AN INTEGRATIVE GENOMICS ANALYSIS

High-Throughput Sequencing Course

Supplementary Material

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

Molecular Hematopathology Leukemias I. January 14, 2005

Deregulation of signal transduction and cell cycle in Cancer

SUPPLEMENTARY INFORMATION

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Molecular Genetics of Paediatric Tumours. Gino Somers MBBS, BMedSci, PhD, FRCPA Pathologist-in-Chief Hospital for Sick Children, Toronto, ON, CANADA

mirna Dr. S Hosseini-Asl

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Molecular Cell Biology. Prof. D. Karunagaran. Department of Biotechnology. Indian Institute of Technology Madras

BCHM3972 Human Molecular Cell Biology (Advanced) 2013 Course University of Sydney

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

Introduction to Systems Biology of Cancer Lecture 2

Molecular Markers. Marcie Riches, MD, MS Associate Professor University of North Carolina Scientific Director, Infection and Immune Reconstitution WC

Karyotype analysis reveals transloction of chromosome 22 to 9 in CML chronic myelogenous leukemia has fusion protein Bcr-Abl

MET skipping mutation, EGFR

Biol403 MAP kinase signalling

Determination Differentiation. determinated precursor specialized cell

Supplementary Figure 1.

Cancer. The fundamental defect is. unregulated cell division. Properties of Cancerous Cells. Causes of Cancer. Altered growth and proliferation

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Mechanism for targeting CDK4, BTK and PI3K in Mantle Cell Lymphoma. Selina Chen-Kiang, Ph.D. Weill Cornell Medicine

Gene expression in cancer diagnostics

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

BIMM 143. RNA sequencing overview. Genome Informatics II. Barry Grant. Lecture In vivo. In vitro.

Targeting voltage-gated K + channels in cancer reveal new biochemical pathways and therapeutic opportunities

Supporting Information

Network-assisted data analysis

Introduction to Cancer Biology

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Cancer immunity and immunotherapy. General principles

G-Protein Signaling. Introduction to intracellular signaling. Dr. SARRAY Sameh, Ph.D

* Kyoto Encyclopedia of Genes and Genomes.

The 14th Annual International Conference On Critical Assessment of Massive Data Analysis (CAMDA)

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA LEO TUNKLE *

Supplementary Information. Table of contents

The clinical relevance of circulating, cell-free and exosomal micrornas as biomarkers for gynecological tumors

R2: web-based genomics analysis and visualization platform

Supplementary Figure 1

Alternative splicing analysis for finding novel molecular signatures in renal carcinomas

Li-Xuan Qin 1* and Douglas A. Levine 2

PREPARED FOR: U.S. Army Medical Research and Materiel Command Fort Detrick, Maryland

Stochastic Simulation of P53, MDM2 Interaction

Oncogenes and Tumor. supressors

Supplementary Figure 1: Tissue of Origin analysis on 152 cell lines. (a) Heatmap representation of the 30 Tissue scores for the 152 cell lines.

Set the stage: Genomics technology. Jos Kleinjans Dept of Toxicogenomics Maastricht University, the Netherlands

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

The Avatar System TM Yields Biologically Relevant Results

Chapter 4 Cellular Oncogenes ~ 4.6 -

Chapt 15: Molecular Genetics of Cell Cycle and Cancer

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

Ets-1 identifying polynucleotide sequence for targeted delivery of anti-cancer drugs

Supplementary Material. Part I: Sample Information. Part II: Pathway Information

Biomarker development in the era of precision medicine. Bei Li, Interdisciplinary Technical Journal Club

Supplement to SCnorm: robust normalization of single-cell RNA-seq data

From reference genes to global mean normalization

Gene-microRNA network module analysis for ovarian cancer

Cancer. The fundamental defect is. unregulated cell division. Properties of Cancerous Cells. Causes of Cancer. Altered growth and proliferation

Protein Domain-Centric Approach to Study Cancer Somatic Mutations from High-throughput Sequencing Studies

Identifying Novel Targets for Non-Small Cell Lung Cancer Just How Novel Are They?

Identifying transcriptional dichotomies by "stochastic sampling"

Introduction. Cancer Biology. Tumor-suppressor genes. Proto-oncogenes. DNA stability genes. Mechanisms of carcinogenesis.

Section D: The Molecular Biology of Cancer

PCxN: The pathway co-activity map: a resource for the unification of functional biology

MRD in CML (BCR-ABL1)

Epstein-Barr virus driven promoter hypermethylated genes in gastric cancer

Award Number: W81XWH TITLE: Characterizing an EMT Signature in Breast Cancer. PRINCIPAL INVESTIGATOR: Melanie C.

On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles

p53 and Apoptosis: Master Guardian and Executioner Part 2

The Transcriptional Response to Irradiation: Time to Move the Focus from Cellular to Tissutal Level

Enzyme-coupled Receptors. Cell-surface receptors 1. Ion-channel-coupled receptors 2. G-protein-coupled receptors 3. Enzyme-coupled receptors

Tolerance, autoimmunity and the pathogenesis of immunemediated inflammatory diseases. Abul K. Abbas UCSF

Intracellular signalling pathways activated by leptin by Gema FRUHBECK. Presentation by Amnesiacs Anonymous

Cell Communication. Cell Communication. Communication between cells requires: ligand: the signaling molecule

SureSelect Cancer All-In-One Custom and Catalog NGS Assays

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

CanAssist-Breast: New test for prediction of risk of recurrence in ER+/Her2- early stage breast cancer patients Manjiri Bakre, Ph.D.

Cell cycle, signaling to cell cycle, and molecular basis of oncogenesis

Augmented Medical Decisions

Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles

GeneOverlap: An R package to test and visualize

The Tissue Engineer s Toolkit

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

Low levels of serum mir-99a is a predictor of poor prognosis in breast cancer

About OMICS Group Conferences

MicroRNA expression profiling and functional analysis in prostate cancer. Marco Folini s.c. Ricerca Traslazionale DOSL

Section D. Genes whose Mutation can lead to Initiation

Bayesian Prediction Tree Models

Transcriptome Analysis

VIII Curso Internacional del PIRRECV. Some molecular mechanisms of cancer

The Role of ploidy in neuroblastoma. Michael D. Hogarty, MD Division of Oncology Children s Hospital of Philadelphia

International Journal of Innovative Research in Computer and Communication Engineering. (An ISO 3297: 2007 Certified Organization)

Regulation of cell signaling cascades by influenza A virus

Transcription:

Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome Marta R. Hidalgo 1, Alicia Amadoz 2, Cankut Çubuk 1, José Carbonell-Caballero 2 and Joaquín Dopazo 1,3,4 1 Clinical Bioinformatics Research Area. Fundación Progreso y Salud (FPS). Hospital Virgen del Rocio. 41013. Sevilla. Spain 2 Computational Genomics Department. Centro de Investigación Principe Felipe (CIPF). 46012 Valencia. Spain 3 Functional Genomics Node (INB). FPS. Hospital Virgen del Rocio. 41013 Sevilla. Spain 4 Bioinformatics in Rare Diseases (BiER). Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER). FPS. Hospital Virgen del Rocio. 41013. Sevilla. Spain; Abstract Despite the progress in neuroblastoma therapies the mortality of high-risk patients is still high (40% - 50%) and the molecular basis of the disease remains poorly known. Here we use models of cell signalling, a key process in this cancer, to understand the molecular determinants of bad prognostic. We also show how the activity of signalling circuits can be used as a predictor of survival in neuroblastoma patients. Introduction Neuroblastoma is a tumour derived from primitive cells of the sympathetic nervous system that, despite advances in its treatment still has a poor survival for high-risk patients (Brodeur, 2003). Risk groups are defined according to disease stage, patient age, and MYCN amplification status (Øra and Eggert, 2011). Although the use of biomarkers has demonstrated a clear clinical utility, they represent statistical associations to clinical parameters and frequently lack any mechanistic relationship with the molecular mechanisms responsible for tumorigenesis or therapeutic response. On the contrary, signalling pathways play a key role in these processes. Actually, it has recently been demonstrated in neuroblastoma (Fey, et al., 2015) and other cancers (Hidalgo, et al., 2017) that the activity of specific circuits of signalling pathways was more correlated to patient survival that any of their constituent genes. Here, we have used models that produce a realistic estimation of signalling circuit activity within pathways from gene expression (Hidalgo, et al., 2017) to discover the molecular mechanisms behind the differences between the patients with MYCN amplification and the others as well as the determinants of survival in neuroblastoma.

Results Data were downloaded and processed as indicated in Methods. Overall, our results document extensive differences at the level of signalling activity between patients with MYCN amplification and those lacking this biomarker. Specifically, the MAPK and NFKβ pathways present a dramatic reduction of activity in patients with MYCN amplification. Some relevant functions significantly deactivated in high-risk patients are, for example, cellular response to DNA damage stimulus (see Figure 1), negative regulation of cell division or cell cycle arrest. Figure 1. Circuit MAPK signalling pathway: ATF2 that triggers cellular response to DNA damage stimulus is significantly downregulated (FDR-adjusted p-value=1.45x10-15 Across different pathways high-risk patients present a systematic deactivation of cellular functions related to DNA repair and immune response. Figure 2. Kaplan-Meyer plots of survival of patients with MYCN amplification which have downregulated Adipocytokine: PTPN11 (left) and camp: AFDN (right) signalling circuits Regarding survival across all the patients, processes such as negative regulation of apoptosis process, triggered by multiple signalling circuits: p53: CDK1 CCNB3 (FDR-

adjusted p-value=7.6x10-11 ) PI3K-Akt: BCL2L1* (adj. p-val.=5.1x10-7 ) Jak-STAT: RAF1 (adj. p-val.=5.4x10-3 ) Fc epsilon RI: MAPK8 (adj. p-val.=5.5x10-3 ), is associated to bad prognostic. Similarly, negative regulation of cell cycle arrest (triggered by p53: MDM2*, adj. p-val. <10-20 ) and positive regulation of cell growth (Sphingolipid:TP53 BCL2, adj. p-val.= 1.5x10-12 ) seem to play an important role in low survival. We also tried to find the molecular drivers of bad prognostic among patients with MYCN amplifications. Only two circuits, Adipocytokine: PTPN11 and camp: AFDN are clearly associated to bad prognostic. One of the effector proteins, PTPN11 has been implicated in mitogenic activation, metabolic control, transcription regulation, and cell migration (Chan, et al., 2008). The other effector protein, AFDN, is the fusion partner of acute lymphoblastic leukemia (ALL-1) gen involved in acute myeloid leukemias with t(6;11)(q27;q23) translocation, with a known role in cell adhesion (Mandai, et al., 2013). Conclusions The use of models that quantify cell behavioural outcomes provides a unique opportunity to understand the molecular mechanisms of cancer development and progression as well as open the possibility to highly specific, individualized therapeutic interventions. Methods Data Source and data preprocessing The matrix GSE49711_SEQC_NB_TUC_G_log2.txt, with gene expression levels estimated by Cufflinks (Trapnell, et al., 2012) and quantified as log 2 (1+FPKM), was downloaded from the GEO database. Batch effect was corrected with COMBAT (Johnson, et al., 2007). Finally, the values were normalized between 0 and 1. Signalling circuit activity model Circuit activities are modelled from gene expression values as described in (Hidalgo, et al., 2017). Briefly, KEGG pathways (Kanehisa, et al., 2014) are used to define circuits connecting receptor proteins to effector proteins. A total of 98 KEGG pathways involving a total of 3057 gene products that compose 4726 nodes were used to define a total of 1287 signalling circuits. Normalized gene expression values are used as proxies of protein activity (Efroni, et al., 2007; Montaner, et al., 2009; Sebastian-Leon, et al., 2014). The signal transmission is estimated by starting with an initial signal of 1, which is propagated along the nodes of the signalling circuits according to the following recursive rule:

S n = υ n (1 (1 s a )) (1 s i ) s a A s i I (1) Where S n is the signal intensity for the current node n, v n is its normalized gene expression value, A is the total number of activation signals (s a ), arriving to the current node from activation edges, I is the total number of inhibitory signals (s i ) arriving to the node from inhibition edges (Hidalgo, et al., 2016). In addition to circuit activities, the signal received by specific cell functions -according to either Gene Ontology (Ashburner, et al., 2000) or Uniprot (UniProt_Consortium, 2015) definitions-, triggered by more than one circuit, can also be estimated. Survival analysis Kaplan-Meier (K-M) curves (Kaplan and Meier, 1958) are used to relate module activity to patient survival in the different cancers. The value of the activity estimated for each module in each individual was used to assess its relationship with individual patient survival. Calculations were carried out using the function survdiff from the survival R package (https://cran.r-project.org/web/packages/survival/). References Ashburner, M., et al. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium, Nat Genet, 25, 25-29. Brodeur, G.M. (2003) Neuroblastoma: biological insights into a clinical enigma, Nature Reviews Cancer, 3, 203-216. Chan, G., Kalaitzidis, D. and Neel, B.G. (2008) The tyrosine phosphatase Shp2 (PTPN11) in cancer, Cancer and metastasis reviews, 27, 179-192. Efroni, S., Schaefer, C.F. and Buetow, K.H. (2007) Identification of key processes underlying cancer phenotypes using biologic pathway analysis, PLoS ONE, 2, e425. Fey, D., et al. (2015) Signaling pathway models as biomarkers: Patient-specific simulations of JNK activity predict the survival of neuroblastoma patients, Sci Signal, 8, ra130. Hidalgo, M., et al. (2016) High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes, Oncotarget, In press. Hidalgo, M.R., et al. (2017) High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes, Oncotarget, 8, 5160-5178. Johnson, W.E., Li, C. and Rabinovic, A. (2007) Adjusting batch effects in microarray expression data using empirical Bayes methods, Biostatistics, 8, 118-127. Kanehisa, M., et al. (2014) Data, information, knowledge and principle: back to metabolism in KEGG, Nucleic Acids Res, 42, D199-205. Kaplan, E. and Meier, P. (1958) Nonparametric estimation from incomplete observations, Journal of the American Statistical Association, 53, 457-481. Mandai, K., et al. (2013) Afadin/AF-6 and canoe: roles in cell adhesion and beyond, Progress in molecular biology and translational science, 116, 433-454. Montaner, D., et al. (2009) Gene set internal coherence in the context of functional profiling, BMC Genomics, 10, 197.

Øra, I. and Eggert, A. (2011) Progress in treatment and risk stratification of neuroblastoma: impact on future clinical and basic research. Seminars in cancer biology. Elsevier, pp. 217-228. Sebastian-Leon, P., et al. (2014) Understanding disease mechanisms with models of signaling pathway activities, BMC Syst Biol, 8, 121. Trapnell, C., et al. (2012) Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks, Nat Protoc, 7, 562-578. UniProt_Consortium (2015) UniProt: a hub for protein information, Nucleic Acids Res, 43, D204-212.