Elevated RNA Editing Activity Is a Major Contributor to Transcriptomic Diversity in Tumors

Similar documents
Supplementary Figure 1: LUMP Leukocytes unmethylabon to infer tumor purity

Nature Genetics: doi: /ng Supplementary Figure 1. Workflow of CDR3 sequence assembly from RNA-seq data.

Machine-Learning on Prediction of Inherited Genomic Susceptibility for 20 Major Cancers

Exploring TCGA Pan-Cancer Data at the UCSC Cancer Genomics Browser

Nature Getetics: doi: /ng.3471

The 16th KJC Bioinformatics Symposium Integrative analysis identifies potential DNA methylation biomarkers for pan-cancer diagnosis and prognosis

Solving Problems of Clustering and Classification of Cancer Diseases Based on DNA Methylation Data 1,2

RNA SEQUENCING AND DATA ANALYSIS

Supplementary. properties of. network types. randomly sampled. subsets (75%

User s Manual Version 1.0

LinkedOmics. A Web-based platform for analyzing cancer-associated multi-dimensional data. Manual. First edition 4 April 2017 Updated on July 3, 2017

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Identification of Tissue Independent Cancer Driver Genes

Ahrim Youn 1,2, Kyung In Kim 2, Raul Rabadan 3,4, Benjamin Tycko 5, Yufeng Shen 3,4,6 and Shuang Wang 1*

RNA SEQUENCING AND DATA ANALYSIS

Supplementary Figures

Nicholas Borcherding, Nicholas L. Bormann, Andrew Voigt, Weizhou Zhang 1-4

TCGA. The Cancer Genome Atlas

File Name: Supplementary Information Description: Supplementary Figures and Supplementary Tables. File Name: Peer Review File Description:

LncRNA TUSC7 affects malignant tumor prognosis by regulating protein ubiquitination: a genome-wide analysis from 10,237 pancancer

Extensive RNA editing and splicing increase immune self-representation diversity in medullary thymic epithelial cells

Comprehensive analyses of tumor immunity: implications for cancer immunotherapy

Nature Genetics: doi: /ng Supplementary Figure 1. Sample selection procedure and TL ratio across cancer.

Tissue-specific DNA methylation loss during ageing and carcinogenesis is linked to chromosome structure, replication timing and cell division rates

OncoLnc: linking TCGA survival data to mrnas, mirnas, and lncrnas

Supplemental Figures. Figure S1: 2-component Gaussian mixture model of Bourdeau et al. s fold-change distribution

LncMAP: Pan-cancer atlas of long noncoding RNA-mediated transcriptional network perturbations

High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes

How do they reconcile their data with Raghu Khalluri's data in Nature Cell Biology?

Evidence Based Medicine

Chapter 2 Autophagy in Cancer Cells vs. Cancer Tissues: Two Different Stories

Distinct cellular functional profiles in pan-cancer expression analysis of cancers with alterations in oncogenes c-myc and n-myc

An integrative pan-cancer-wide analysis of epigenetic enzymes reveals universal patterns of epigenomic deregulation in cancer

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB

TP53 mutations, expression and interaction networks in human cancers

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

SRARP and HSPB7 are epigenetically regulated gene pairs that function as tumor suppressors and predict clinical outcome in malignancies

Promoter methylation of DNA damage repair (DDR) genes in human tumor entities: RBBP8/CtIP is almost exclusively methylated in bladder cancer

Pan-cancer analysis of expressed somatic nucleotide variants in long intergenic non-coding RNA

Supplemental Information For: The genetics of splicing in neuroblastoma

Type of file: XLSX Size of file: 0 KB Title of file for HTML: Supplementary Data 7 Description: Genes reversed by effective compounds.

Genomic and Functional Approaches to Understanding Cancer Aneuploidy

Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR)

Endogenous retroviral signatures predict immunotherapy response in clear cell renal cell carcinoma

MRHCA: a nonparametric statistics based method for hub and co-expression module identification in large gene co-expression network

TCGA-Assembler: Pipeline for TCGA Data Downloading, Assembling, and Processing. (Supplementary Methods)

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Supplementary Table S1. List of PTPRK-RSPO3 gene fusions in TCGA's colon cancer cohort. Chr. # of Gene 2. Chr. # of Gene 1

EBioMedicine 7 (2016) Contents lists available at ScienceDirect. EBioMedicine. journal homepage:

SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer.

Figure S1. Analysis of endo-sirna targets in different microarray datasets. The

Package mirlab. R topics documented: June 29, Type Package

Genomic alterations underlie a pan-cancer metabolic shift associated with tumour hypoxia

Systematic analyses of glutamine and glutamate metabolisms across different cancer types

Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time.

Supplementary Figure 1

Studio delle modificazioni post-trascrizionali mediante tecnologia RNA-seq

DiffVar: a new method for detecting differential variability with application to methylation in cancer and aging

The Cancer Genome Atlas Pan-cancer analysis Katherine A. Hoadley

Epigenetics & Chromatin. Open Access RESEARCH

Supplementary Figure 1: High-throughput profiling of survival after exposure to - radiation. (a) Cells were plated in at least 7 wells in a 384-well

Supplementary Materials for

Pan-organ transcriptome variation across 21 cancer types

Transcriptome sequencing reveals altered long intergenic non-coding RNAs in lung cancer

Cancer develops as a result of the accumulation of somatic

Personalised Medicine in Practice. Dr Ingrid Slade MBChB, PhD, MRCPCH, MFPH Dr Chris Spencer DPhil Dr Gabriele De Luca MD, DPhil

Everolimus for the Treatment of Advanced Renal Cell Carcinoma (arcc ) - Additional Analysis Using RPSFT

Large-scale DNA methylation expression analysis across 12 solid cancers reveals hypermethylation in the calcium-signaling pathway

Clustered mutations of oncogenes and tumor suppressors.

Analyse de données de séquençage haut débit

Fast dimension reduction and integrative clustering of multi-omics data using lowrank approximation: application to cancer molecular classification

LncRNA PVT1 as an effective biomarker for cancer diagnosis and detection based on transcriptome data and meta-analysis

Impact of in vitro driven expression signatures of CD133 stem cell marker and tumor stroma on clinical outcomes in gastric cancers

w ª wy xvwz A ª vw xvw P ª w} xvw w Æ w Æ V w,x Æ w Æ w Æ y,z Æ { Æ y,z, w w w~ w wy}æ zy Æ wyw{ xæ wz w xywæ xx Æ wv Æ } w x w x w Æ w Æ wy} zy Æ wz

Clinical roles of the aberrantly expressed lncrnas in lung squamous cell carcinoma: a study based on RNA-sequencing and microarray data mining

Correlation of gene expression and associated mutation profiles of APOBEC3A, chemosensitivity of cancer cell lines to drug treatment

SUPPLEMENTARY INFORMATION

Predictive Assays in Radiation Therapy

Enterprise Interest Thermo Fisher Scientific / Employee

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supporting Information. Electrophoretic Deformation of Individual Transfer. RNA Molecules Reveals Their Identity

Supplementary Table 1 Clinicopathological characteristics of 35 patients with CRCs

Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing

The Tandem Duplicator Phenotype Is a Prevalent Genome-Wide Cancer Configuration Driven by Distinct Gene Mutations

Transcriptional landscape of human cancers

RNA-seq: filtering, quality control and visualisation. COMBINE RNA-seq Workshop

Alternative splicing analysis for finding novel molecular signatures in renal carcinomas

Transcriptional response profiles of paired tumor-normal samples offer novel perspectives in pan-cancer analysis

Molecular in vitro diagnostic test for the quantitative detection of the mrna expression of ERBB2, ESR1, PGR and MKI67 in breast cancer tissue.

Only Estrogen receptor positive is not enough to predict the prognosis of breast cancer

Hands-On Ten The BRCA1 Gene and Protein

Supplementary Figure S1 Expression of mir-181b in EOC (A) Kaplan-Meier

MIR retrotransposon sequences provide insulators to the human genome

Package IMAS. March 10, 2019

Graphical Methods for Meta-analysis: Looking at the forest and the trees

SUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation.

Predicting clinical outcomes in neuroblastoma with genomic data integration

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Landscape of tumor-infiltrating T cell repertoire of human cancers

Transcription:

Cell Reports Supplemental Information Elevated RNA Editing Activity Is a Major Contributor to Transcriptomic Diversity in s Nurit Paz-Yaacov, Lily Bazak, Ilana Buchumenski, Hagit T. Porath, Miri Danan-Gotthold, Binyamin A. Knisbacher, Eli Eisenberg, and Erez Y. Levanon

A B 1 Mismatches intersecting Refseq in sense orientation (%) 75 25 Percent of SNPs in unique mismatches 4 3 2 1 C D 1 1 Percent of mismatches in genes that are on sense strand 75 25 Mismatches in plus orientation (%) 75 25 Mismatch A2G Control

Figure S1, Related to Figure 2. Clean editing signal in A-to-G compared to other mismatches A clear editing signal in Alu repeats reflected in (A) A-to-G mismatches originate from the expressed strand of the Alu repeat. Percentage of mismatches intersecting Refseq in sense orientation. As the sequencing data is not stranded, generally one cannot distinct A-to-G mismatches from T-to-C ones. However, for those Alu repeats that overlap a RefSeq gene, we plot the fraction of A-to-G mismatches (A on the expressed strand) among all A-to-G and T-to-C mismatches. The clear bias towards the expressed strand (many more A-to-G than T-to-C mismatches) supports the notion that these mismatches are due to RNA editing. In contrast, the control (G-to-A) shows no dominance to the expressed strand (roughly same number of G-to-A and C-to-T) on the expressed strand. (B) Many of the non-a-to-g mismatches are known SNP. Percent of SNPs in mismatch sites for A-to-G and control mismatches detected by the hyper-editing scheme show, for the different tissues, a minute fraction of the A-to-G sites (average 2%) overlapping known SNPs. Control sites exhibit a high SNPs content (average 1%). (C) Percentage of mismatches intersecting RefSeq in sense orientation (same as panel A) for the hyper-editing detection approach. (D) Hyperediting A-to-G signal does not show read-strand preference. Mismatches in plus and minus strand of sequencing reads of A-to-G and control were analyzed. Reads are generated in the sequencing reaction from both orientation, and thus mismatches originating from the RNA molecule are expected to be equally distributed among the two sequencing directions. Indeed, A-to-G mismatches show no strand preference, while control mismatches (the second most frequent mismatch in each specific RNAseq) suffer from high and variable strand preference (and huge variation) suggesting these mismatches are partly due to a sequencing artifact.

A 1. BRCA p =.446.75 Survival Probability..25 Index <.51 Index >=.51. 44 88 132 176 22 264 38 352 396 Days Index <.51 Index >=.51 32 18 17 12 8 6 5 3 3 1 63 51 33 21 8 5 3 2 1 Numbers at risk B AZIN1 C Index Cutoff QuantileCutoff PValue FDR BLCA.53.75 1.6E-1 2.8E-1 BRCA.51.29 4.5E-2 1.3E-1 COAD.232.2 2.8E-1 3.5E-1 HNSC.3724.29 7.5E-4 5.5E-3 KIRC.44.5 2.6E-1 3.5E-1 LIHC.692.65 1.2E-3 5.5E-3 LUAD.312.26 1.1E-1 2.4E-1 PRAD.37.2 1 1 THCA.52.33 3.1E-1 3.5E-1

Figure S2, Related to Figure 3. Kaplan-Meier analysis for the different cancer tissues (A) Low Alu editing index correlates with a better prognosis. Kaplan-Meier curve in breast cancer, based on an index cutoff. (B) Kaplan-Meier curve for liver cancer based on AZIN1 editing level. The cutoff is 1% over editing (patients for which AZIN1 editing level increases by 1% or more in tumor compared to its matched normal sample). (C) Kaplan-Meier analysis for the different cancer tissues. Survival analysis was done for each tissue using the Kaplan-Meier curve based on AEI Values. Breast, head and neck and liver cancers revealed significant difference in survival depending on the AEI values. Note that 8/9 of the tests are below a cutoff of FDR=.35, indicating that most of the cancer types can be successfully classified using the AEI.

Table S1, Related to table 1. Editing differences between cancer and normal samples in the hyper-editing detection scheme BLCA (n=13) BRCA (n=95) LIHC (n=3) HNSC (n=29) LUAD (n=36) %Map_raeds.86.86.78.76.774.775.822.813.81.78 Total Editing sites (ES) 7,497 1,138 11,956 17,424 1,897 17,981 3,763 1,359 7,19 9,426 uniq_eswosnp 4,792 6,421 7,87 9,864 5,926 9,721 2,3 5,93 4,759 5,824 KIRC (n=62) PRAD (n=31) THCA (n=42) COAD (n=18) %Map_raeds.73.67.85.85.77.8.82.84 Total Editing sites (ES) 126 1189 8674 9156 968 12587 445 5322 uniq ES Without SNP 8,1 7,4 5578 5855 5289 6775 293 339