SUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation.

Size: px
Start display at page:

Download "SUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation."

Transcription

1 SUPPLEMENTARY INFORMATION Intron retention is a widespread mechanism of tumor suppressor inactivation. Hyunchul Jung 1,2,3, Donghoon Lee 1,4, Jongkeun Lee 1,5, Donghyun Park 2,6, Yeon Jeong Kim 2,6, Woong-Yang Park 2,7, Dongwan Hong 1,5, Peter J. Park 8,9, Eunjung Lee 8 1 Research Institute, National Cancer Center, Gyeonggi-do, South Korea 2 Samsung Genome Institute, Samsung Medical Center, Seoul, South Korea 3 Bioinformatics and Systems Biology Graduate Program, University of California San Diego, San Diego, CA, USA 4 Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA 5 Cancer Immunology Branch, Division of Cancer Biology, National Cancer Center, Gyeonggi-do, South Korea 6 Samsung Biomedical Research Institute, Samsung Advanced Institute of Technology, Samsung Electronics Company, Ltd., Seoul, South Korea 7 Department of Molecular Cell Biology, Sungkyunkwan University School of Medicine, Suwon, South Korea 8 Department of Medicine, Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA 9 Center for Biomedical Informatics, Harvard Medical School, Boston, MA, USA Correspondence should be addressed to E.L. (elee20@partners.org), P.J.P. (peter_park@hms.harvard.edu), or D.H. (dwhong@ncc.re.kr)

2 Table of contents Supplementary Note... 3 Supplementary Figures Suppl. Figure 1 Flowchart with results of the identified SNVs associated with abnormal splicing... 8 Suppl. Figure 2 Examples of RNA-seq reads showing complex abnormal splicing... 9 Suppl. Figure 3 Comparison between the two splicing-altering SNV sets identified by AS and/or RS analysis Suppl. Figure 4 Fraction of SNVs showing reference allele expression only Suppl. Figure 5 Enrichment of splicing-altering SNVs in cancer gene sets Suppl. Figure 6 Experimentally confirmed intron retention-causing LBEMs in tumor suppressors Suppl. Figure 7 Splicing-disrupting LBEMs in TP Suppl. Figure 8 Splicing-disrupting LBEMs in SPSB Suppl. Figure 9 A diagram of TSG inactivation Suppl. Figure 10 TSGs with LBEM causing intron retention Suppl. Figures Characterization of genomic and transcriptomic patterns of LBEMs Suppl. Figure 11 Comparison of 5 splice site strength disrupted by LBEMs Suppl. Figure 12 Comparison of intronic size Suppl. Figure 13 Comparison of GC content Suppl. Figure 14 Comparison of density of intronic splicing regulatory elements Suppl. Figure 15 Comparison of expression level of adjacent up- and down-stream exons Suppl. Figure 16 Characterization of predicted LBEMs Suppl. Figure 17 Example of RNA-seq reads with a mutant allele supporting an intronic cryptic splice site Suppl. Figure 18 Exonic SNVs causing intron retention Suppl. Figure 19 Splice site SNVs causing intron retention References... 25

3 Supplementary Note Read pattern criteria for Ratio-based Spicing (RS) analysis Using the criteria described below, we classified somatic single-nucleotide variations (SNVs) into different types of abnormal splicing: intron retention, exon skipping, and intronic and exonic cryptic site activation. SNVs that showed multiple types of abnormal splicing were classified as complex abnormal splicing. As a negative control for the effect of an SNV on splicing in each sample, normally spliced transcripts were also required to be observed. Split reads supporting any alternative splicing transcripts from RefSeq annotation were not regarded as abnormal splicing. Using the original alignment data, we extracted reads mapped near SNVs and discarded PCR duplicates after marking them. Furthermore, reads with mapping quality scores less than 20 or those with indels were filtered out. For each target exon containing an SNV, we separately checked abnormal splicing (except for exon skipping) for its adjacent up- and down-stream exons. To identify an SNV significantly associated with abnormal splicing, we compared the ratio of abnormal reads to normal reads in a target mutated sample to two background ratio distributions (The definition of ratios for each category of abnormal splicing is described below). To create the background distributions, we first selected control samples from 741 normal samples or from 1,812 cancer samples that did not have mutations in the gene of interest. For each control sample, we calculated the same ratio of abnormal to normal reads; these were collected into two background ratio distributions; one for normal and one for cancer samples. When the target mutated sample s ratio was within both the top 1% and the top 5% of the normal and cancer background ratio distributions, respectively, then the SNV was considered significantly associated with the abnormal splicing. The criteria for each category of abnormal splicing are described below: Intron retention For an exon with an SNV (E*), its adjacent exon (E), and an intron between the two exons (I), 1) 2 abnormal reads that span each of the two exon-intron junctions (E*_I and I_E) were required for intron retention, covering 10 bp of the exon and 10 bp of the intron without split-read alignment. 2) 70% of bases in the intron (I) were required to be covered. 3) 2 normal reads spanning the exon-exon junction (E*_E) were required. 4) To ensure the whole intron was retained, both of the ratios of abnormal / normal reads, (E*_I) / (E*_E) and (E_I) / (E*_E), were required to be higher than those from the background distributions.

4 Exon skipping For an exon (E*) with an SNV, and its adjacent exons (E 1, E 2 ), 1) 2 abnormal reads spanning the junction of the two adjacent exons (E1_E2) were required. 2) The coverage of the two adjacent exons (E 1 and E 2 ) was required to be greater than that of exon E*. 3) 2 normal reads spanning the exon-exon junctions on each side (E1_E* and E*_E2) were required. 4) The ratio of abnormal / normal reads, (E1_E2) / (E1_E* + E*_E2), was required to be higher than that from the background distributions. Intronic cryptic splice site activation For an exon with an SNV (E*), its adjacent exon (E), and an intron between the two exons (I) which has a cryptic splice site (the parts before and after the cryptic site: I 1 and I 2 ), (when there are more than one cryptic splice sites, read pattern was examined for each splice site) 1) 2 abnormal reads that span the intronic cryptic splice site and the adjacent exon junction (I 1 _E). 2) 2 abnormal reads that span the exon and the first part of the intron junction (E*_I 1 ) covering both 10 bp of the exon and 10 bp of the intron. 3) 70% of bases of the intronic part before the cryptic splice site (I 1 ) were required to be covered. 4) 2 normal reads that span the exon-exon junction (E*_E). 5) To ensure the splice site activation, both of the ratios of abnormal / normal reads, (I 1 _E) / (E*_E) and (E*_I 1 ) / (E*_E), were required to be higher than those from the background distributions.

5 Exonic cryptic splice site activation For an exon with an SNV (E*) and an exonic cryptic splice site (the parts before and after the cryptic site: E* 1 and E* 2 ), its adjacent exon (E), and an intron between the exons (I), 1) 2 abnormal reads that span the exonic cryptic splice site and the adjacent exon junction (E* 1 _E). 2) The coverages of the exonic part before the cryptic splice site (E* 1 ) and the adjacent exon (E) should be greater than that of the exonic part after the cryptic splice site (E* 2 ). 3) 2 normal reads that span the exonic part after the cryptic splice site and the adjacent exon junction (E* 2 _E). 4) The ratio of abnormal / normal reads, (E* 1 _E) / (E* 2 _E), was required to be higher than that from the background distributions. Analysis of exonic splicing regulatory elements To test enrichment of SNVs associated with abnormal splicing in splicing regulatory elements, we first obtained splicing regulatory elements; 238 exonic splicing enhancers (ESEs) 1 and 176 exonic splicing silencers (ESS) 2. We utilized Burrows-Wheeler Aligner (BWA) 3 with default parameters, except for allowing exact match and all multiple alignment, to align the regulatory sequences to exons containing an SNV and annotated SNVs within the regulatory sequences in Supplementary Table 3. For each set of exonic SNVs causing different types of abnormal splicing (332 exon skipping, 74 exonic or 145 intronic cryptic site activation, 252 intron retention, and 45 complex splicing), we counted how often splicingaltering SNVs were within the splicing regulatory sequences. To evaluate whether the observed count was significantly different from the expected, we generated a null distribution based on randomly selecting the same number of SNVs out of 226,956 exonic somatic SNVs (from original total set; Supplementary Table 1) 100,000 times. The P value was calculated as the fraction of trials in which the number of SNVs in the splicing regulatory sequences is greater than or equal to the observed count. We found that SNVs associated with exon skipping were significantly enriched in ESEs (P = ), whereas those associated with other types of abnormal splicing were not. We did not find any enrichment of SNVs associated with

6 abnormal splicing in ESSs. Examination of relationship between abnormal splicing types Using splice-site SNVs associated with abnormal splicing, we found that exon skipping and intron retention occurred in a mutually exclusive manner (P = ). Intronic cryptic site activation and exon skipping were also mutually exclusive (P = ). The lack of co-occurrence of these aberrations suggests that mechanisms involved in exon-skipping act independently of those involving intronic elements. Enrichment of abnormal splicing in Tumor Suppressor Genes We also observed enrichment in the TSG sets when using an extended set of 453 intron retention-causing SNVs from AS analysis, including 22 non-lbems and 365 intronic splice site SNVs (P ; Fig. 4a and Supplementary Table 11; see Supplementary Fig. 18 and 19 for examples and Online Methods). In contrast, SNVs in the exon skipping or normal splicing groups were not enriched in the oncogene or tumor suppressor gene sets. Characteristics of genomic features We found that the genomic loci in the intron retention group had originally lower splicing strength than the loci in the other two groups (the scores calculated with a wild-type allele; P ; Supplementary Fig. 11b and 11c for MAXENT 4 and H-Bond scores 5, respectively). Both exons and introns in the intron retention group had higher GC content than those in the exon skipping group (P = ; Fig. 5b and Supplementary Fig. 13). However, we found that the density of intronic splicing regulatory elements (ISREs) was lower for the intron retention group than for the exon skipping group (P = ; Fig. 5b and Supplementary Fig. 14). Given that GC content dominates intrinsic nucleosome occupancy 6, the observed inverse correlation between GC content and ISRE density matches with a previous study reporting that ISREs were significantly enriched in nucleosome-disfavoring sequences 7. This suggests the possibility that in the intron retention group, shorter nucleosome-free regions limit regulatory protein access to ISREs. Although the density of intronic splicing regulatory elements (ISREs) was lower for the intron retention group than for the exon skipping group (Fig 5b and Supplementary Fig. 14), the density of exonic splicing regulatory elements (ESREs) did not show any differences between the groups (data not shown). Analysis of expression at the exon level revealed that the exon skipping group showed higher expression of adjacent exons relative to the exons with an LBEM (i.e., skipped exons) compared to the intron retention and normal splicing groups (Supplementary Fig. 15). Examination of excluded SNVs We excluded SNVs with additional SNVs or indels in the same gene, to unambiguously trace the causal SNV for each splicing event. For the excluded 2,797 SNVs near the exon-intron junctions (2,797 out of

7 19,540 SNVs with more than one SNVs or indels satisfied prerequisites of AS analysis), we performed AS analysis and found that 10 somatic SNVs caused intron retention as follows. Chr Position Gene Strand Transcript Variant type Reference Variant Sample chr COPE - NM_ Silent C T TCGA-B5-A0JY chr MBP - NM_ Missense C T TCGA-BG-A0M4 chrx KDM5C - NM_ Missense C A TCGA-B5-A11E chr SPG11 - NM_ Missense C A TCGA-D1-A103 chr NCOR2 - NM_ Missense C A TCGA-AP-A059 chr RALBP1 + NM_ Nonsense G T TCGA-AP-A051 chr JOSD2 - NM_ Missense C T TCGA-BS-A0UF chr LTBP4 + NM_ Silent C T TCGA-B5-A0JY chr NUP133 - NM_ Nonsense G A TCGA chr RALBP1 + NM_ Nonsense G T TCGA-AP-A051

8 Supplementary Figures Supplementary Figure 1. Flowchart with results of the identified SNVs associated with abnormal splicing. Allele-specific Splicing (AS) and Ratio-based Splicing (RS) analysis were performed to identify somatic SNVs associated with abnormal splicing. Given the difficulty of detecting exon skipping-causing SNVs (i.e., the absence of mutant allele in observed reads) in AS analysis, we first identified SNVs associated with exon skipping through RS analysis (n=707, outlined box). Since trasns-acting mutations are not likely to affect splicing of other genes in an allele-specific manner, we used the SNVs associated with exon skipping before removing SNVs from tumor samples with detects in splicing factors.

9

10 Supplementary Figure 2. Examples of RNA-seq reads showing complex abnormal splicing. IGV browser 8 screenshots where RNA-seq reads showed complex abnormal splicing associated with an SNV in (a) CDKN2A and (b) TP53.

11 Supplementary Figure 3. Comparison between the two splicing-altering SNV sets identified by AS and/or RS analysis. (a) The number of identified exonic SNVs within 30 bp from exon-intron junction. The number in parenthesis in RS analysis indicates the number of exonic SNVs outside of 30 bp from exon-intron junction. (b) The number of identified intronic splice-site SNVs. Our AS analysis method cannot detect intronic splice-site SNVs causing exon skipping, because neither the wild-type nor mutant allele at intronic splice sites appear in RNA-seq reads unless the intron is retained (see Online Methods for AS analysis of intronic splice site SNVs). Therefore, the number of splice-site SNVs causing intron retention is only shown for AS analysis. For exon-skipping cases, the SNVs from AS analysis are a subset of those from RS analysis. AS analysis requires SNVs to pass RS analysis first (noted in the analysis flowchart in Supplementary Fig 1). For intron retention, both analyses provide high-confidence sets. For example, there are 66 LBEMs from AS, 44 from RS, and 29 are shared. When we examine each inconsistent case, we find that the discrepancy was mainly due to different criteria rather than false positive predictions. For example, RS analysis required many reads spanning exon-intron junctions of the exon with the SNV and the next exon and required 70% of bases in the intron to be covered, whereas AS analysis required reads spanning exon-intron junction of only the exon with the SNV in an allele-specific manner. Some SNVs from AS analysis, especially under the influence of NMD, were not able to meet those RS criteria.

12 Supplementary Figure 4. Fraction of SNVs showing reference allele expression only. (a) Fraction of synonymous SNVs that showed reference allele expression only. Gray and black solid lines represent the average fraction of nonsense and synonymous SNVs. Error bars represent standard errors. Synonymous SNVs at the last base of exons showed the highest fraction of reference allele expression only, comparable to that of nonsense SNVs demonstrating their strong mutational impact. (b) Fraction of missense SNVs that showed reference allele expression only is depicted in the same way as in (a).

13 Supplementary Figure 5. Enrichment of splicing-altering SNVs in cancer gene sets. Somatic synonymous and missense SNVs within 30 bp from exon-intron junctions identified by RS analysis were tested for their enrichment in two oncogene sets (upper panel) and four TSG sets (lower panel). As we categorized intronic cryptic site activation under intron retention in AS analysis, we followed the same strategy in this analysis. Due to relatively small number of SNVs associated with exonic cryptic site activation across all positions (n=80), we did not further analyze them. P value was assessed by random permutation test. The bars exceeding the y-axis value of 5 represent an empirical p-value of zero with 100,000 random permutation tests.

14 Supplementary Figure 6. Experimentally confirmed intron retention-causing LBEMs in tumor suppressors. Validation experiments for the LBEM in CDH1 (RNA-seq reads in Fig. 1c), PTCH1, and MLL2. The whole exon with an LBEM (green box) and flanking intronic (~ bp; green line) sequences of wild-type and mutant alleles were cloned and replaced an alternative cassette in a reporter plasmid. The percentages of abnormally spliced transcripts are shown above each band (mean ± SD % from triplicate).

15 Supplementary Figure 7. Splicing-disrupting LBEMs in TP53. (a) Positions of 11 TP53 LBEMs that caused intron retention were marked in TP53 gene structure. The LBEMs were clustered in three locations (the last base of exon 4, 6, and 9) rather than being distributed across the gene. (b) Screenshot examples of three different LBEMs causing intron retention. Reads with the reference allele preferentially showed normal splicing pattern whereas reads with the mutant allele spanned the exon-intron junction.

16 Supplementary Figure 8. Splicing-disrupting LBEMs in SPSB3. IGV browser 8 screenshots showing RNAseq reads spanning the exon-intron junction caused by the same missense LBEM of SPSB3 exon 6 (chr16: C>G) in two renal cancer samples.

17 Supplementary Figure 9. A diagram of TSG inactivation by somatic SNVs causing intron retention.

18 Supplementary Figure 10. TSGs with LBEM causing intron retention. (a) The upper panels are the same as the main figures 1a-d. The lower panel shows mrna level of TP53 (RPKM) across 502 breast cancer samples was plotted (vertical black dotted line for median expression; breast cancer sample BH-A0HL whose express data was not available was not used), and the expression levels of the two cases carrying the LBEM causing intron retention were marked by red lines. The average TP53 expression of 70 tumors with truncating (nonsense, frameshift indel, or splice-site) mutations in TP53 and the average TP53 expression of 111 tumors with missense mutations in TP53 were marked by orange and purple lines, respectively. P-values for differential expression due to truncating or missense mutations were calculated by Monte Carlo simulation of 100,000 permutations. (b) ARID1A. (c) CDH1. (d) TPR with LBEM that did not disrupt splicing.

19 Supplementary Figure 11. Comparison of 5 splice site strength disrupted by LBEMs. (a) H-bond 5 score was compared between the groups of SNVs causing different types of abnormal splicing. Splicing-disrupting SNVs decreased splicing strength more than those that did not disrupt splicing. (b-c) Splicing strength of 5 splice sites calculated using wild-type alleles. (b) MaxENT 4 and (c) H-bond 5 was calculated using a wildtype allele at the last base of exons. P-values were calculated using nonparametric Mann-Whitney test (P < 0.05).

20 Supplementary Figure 12. Comparison of intronic size. Comparison of down- (a) and up-stream (b) intronic length. Intronic length was significantly shorter for the intron retention group than for the exon skipping group, whereas exonic length did not show differences between the groups (data not shown). Down- (c) and up-stream (d) intronic size relative to exon. Intronic size relative to exon was significantly shorter for the intron retention group than for the exon skipping group.

21 Supplementary Figure 13. Comparison of GC content. Comparison of exonic (a) and up-stream intronic GC content (b). Down-stream GC content was shown in the main Fig 5b. Exons and Up-stream introns for the intron retention group had higher GC content than those for the exon skipping group. Supplementary Figure 14. Comparison of density of intronic splicing regulatory elements (ISREs) in upstream intronic regions. Density of them in down-stream intronic regions was shown in main Fig 5b. Density of ISREs was significantly lower for the intron retention group than for the exon skipping group. Density of exonic regulatory splicing elements did not show any differences between the groups (data not shown).

22 Supplementary Figure 15. Comparison of expression level of adjacent up- and down-stream exons. Expression level of adjacent (a) down- and (b) up-stream exons relative to that of exons with an LBEM was compared among different abnormal splicing groups. As expected, the exon skipping group showed higher expression of adjacent exons relative to the exons with an LBEM (i.e., skipped exons) compared to the intron retention and normal splicing groups.

23 Supplementary Figure 16. Characterization of predicted LBEMs. (a) Enrichment of LBEMs predicted to cause abnormal splicing in cancer-associated gene sets. Two oncogene- (upper panel) 9,10 and four tumor suppressor-like gene sets (lower panel) 9-11 were tested. P value was assessed by random permutation test (Online Methods). The predicted LBEMs showed a consistent enrichment pattern with the confirmed LBEMs through AS analysis (Note that many TSGs were manually confirmed and not used for the prediction). (b) Analysis of sequence motifs using LBEMs predicted to cause abnormal splicing. The motif pattern of LBEMs causing abnormal splicing mirrored left-handed (higher relative entropy scores in the exon) model like the LBEMs identified by AS analysis.

24 Supplementary Figure 17. Example of RNA-seq reads with a mutant allele supporting an intronic cryptic splice site. This abnormal splicing was caused by an LBEM of TP53 exon 6 in an ovarian cancer sample (TCGA ). The first two and last two reads with a mutant allele T spanned the exon-intron junction and the intronic cryptic splice site activation, respectively.

25 Supplementary Figure 18. Exonic SNVs causing intron retention. MLL2 and MLL3 had intron retention caused by missense and synonymous SNVs at the second last base of exon 13 and 24, respectively. The exonic SNVs in MLL2 and MLL3 caused intron retention and intronic cryptic site activation, respectively. Supplementary Figure 19. Splice site SNVs causing intron retention. The SNVs in BAP1 and TP53 were identified through AS analysis. IGV browser 8 screenshots showing RNA-seq reads spanning the exon-intron junction in BAP1 and TP53. The mutant allele was only observed in RNA-seq reads mapping to the retained part of the intron (Online Methods).

26 References 1. Fairbrother, W.G., Yeh, R.F., Sharp, P.A. & Burge, C.B. Predictive identification of exonic splicing enhancers in human genes. Science 297, (2002). 2. Wang, Z. et al. Systematic identification and analysis of exonic splicing silencers. Cell 119, (2004). 3. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, (2009). 4. Yeo, G. & Burge, C.B. Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J Comput Biol 11, (2004). 5. Freund, M. et al. A novel approach to describe a U1 snrna binding site. Nucleic Acids Res 31, (2003). 6. Kaplan, N. et al. The DNA-encoded nucleosome organization of a eukaryotic genome. Nature 458, (2009). 7. Schwartz, S., Meshorer, E. & Ast, G. Chromatin organization marks exon-intron structure. Nat Struct Mol Biol 16, (2009). 8. Thorvaldsdottir, H., Robinson, J.T. & Mesirov, J.P. Integrative Genomics Viewer (IGV): highperformance genomics data visualization and exploration. Brief Bioinform 14, (2013). 9. Solimini, N.L. et al. Recurrent hemizygous deletions in cancers may optimize proliferative potential. Science 337, (2012). 10. Futreal, P.A. et al. A census of human cancer genes. Nat Rev Cancer 4, (2004). 11. Zhao, M., Sun, J. & Zhao, Z. TSGene: a web resource for tumor suppressor genes. Nucleic Acids Res 41, D970-6 (2013).

BWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space

BWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space Whole genome sequencing Whole exome sequencing BWA alignment to reference transcriptome and genome Convert transcriptome mappings back to genome space genomes Filter on MQ, distance, Cigar string Annotate

More information

Introduction. Introduction

Introduction. Introduction Introduction We are leveraging genome sequencing data from The Cancer Genome Atlas (TCGA) to more accurately define mutated and stable genes and dysregulated metabolic pathways in solid tumors. These efforts

More information

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Supplementary Materials RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Junhee Seok 1*, Weihong Xu 2, Ronald W. Davis 2, Wenzhong Xiao 2,3* 1 School of Electrical Engineering,

More information

Supplemental Information For: The genetics of splicing in neuroblastoma

Supplemental Information For: The genetics of splicing in neuroblastoma Supplemental Information For: The genetics of splicing in neuroblastoma Justin Chen, Christopher S. Hackett, Shile Zhang, Young K. Song, Robert J.A. Bell, Annette M. Molinaro, David A. Quigley, Allan Balmain,

More information

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first intron IGLL5 mutation depicting biallelic mutations. Red arrows highlight the presence of out of phase

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Frequency of alternative-cassette-exon engagement with the ribosome is consistent across data from multiple human cell types and from mouse stem cells. Box plots showing AS frequency

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Vertical Magnetic Separation of Circulating Tumor Cells and Somatic Genomic-Alteration Analysis in Lung Cancer Patients

Vertical Magnetic Separation of Circulating Tumor Cells and Somatic Genomic-Alteration Analysis in Lung Cancer Patients Vertical Magnetic Separation of Circulating Cells and Somatic Genomic-Alteration Analysis in Lung Cancer Patients Chang Eun Yoo 1,2#, Jong-Myeon Park 3#, Hui-Sung Moon 1,2, Je-Gun Joung 2, Dae-Soon Son

More information

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Supplemental Data Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Jianfu Jiang 1, Xinna Liu 1, Guotian Liu, Chonghuih Liu*, Shaohuah Li*, and Lijun

More information

CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery

CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery PacBio CoLab Session October 20, 2017 For Research Use Only. Not for use in diagnostics procedures. Copyright 2017 by Pacific Biosciences

More information

Supplementary Methods

Supplementary Methods Supplementary Methods Short Read Preprocessing Reads are preprocessed differently according to how they will be used: detection of the variant in the tumor, discovery of an artifact in the normal or for

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Effect of HSP90 inhibition on expression of endogenous retroviruses. (a) Inducible shrna-mediated Hsp90 silencing in mouse ESCs. Immunoblots of total cell extract expressing the

More information

Nature Getetics: doi: /ng.3471

Nature Getetics: doi: /ng.3471 Supplementary Figure 1 Summary of exome sequencing data. ( a ) Exome tumor normal sample sizes for bladder cancer (BLCA), breast cancer (BRCA), carcinoid (CARC), chronic lymphocytic leukemia (CLLX), colorectal

More information

Reporting TP53 gene analysis results in CLL

Reporting TP53 gene analysis results in CLL Reporting TP53 gene analysis results in CLL Mutations in TP53 - From discovery to clinical practice in CLL Discovery Validation Clinical practice Variant diversity *Leroy at al, Cancer Research Review

More information

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Introduction RNA splicing is a critical step in eukaryotic gene

More information

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Whole Genome and Transcriptome Analysis of Anaplastic Meningioma Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Outline Anaplastic meningioma compared to other cancers Whole genomes

More information

Nature Medicine: doi: /nm.4439

Nature Medicine: doi: /nm.4439 Figure S1. Overview of the variant calling and verification process. This figure expands on Fig. 1c with details of verified variants identification in 547 additional validation samples. Somatic variants

More information

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016 Session 6: Integration of epigenetic data Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016 Utilizing complimentary datasets Frequent mutations in chromatin regulators

More information

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. Supplementary Figure Legends Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. A. Screenshot of the UCSC genome browser from normalized RNAPII and RNA-seq ChIP-seq data

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Somatic coding mutations identified by WES/WGS for 83 ATL cases.

Nature Genetics: doi: /ng Supplementary Figure 1. Somatic coding mutations identified by WES/WGS for 83 ATL cases. Supplementary Figure 1 Somatic coding mutations identified by WES/WGS for 83 ATL cases. (a) The percentage of targeted bases covered by at least 2, 10, 20 and 30 sequencing reads (top) and average read

More information

The Emergence of Alternative 39 and 59 Splice Site Exons from Constitutive Exons

The Emergence of Alternative 39 and 59 Splice Site Exons from Constitutive Exons The Emergence of Alternative 39 and 59 Splice Site Exons from Constitutive Exons Eli Koren, Galit Lev-Maor, Gil Ast * Department of Human Molecular Genetics, Sackler Faculty of Medicine, Tel Aviv University,

More information

REGULATED SPLICING AND THE UNSOLVED MYSTERY OF SPLICEOSOME MUTATIONS IN CANCER

REGULATED SPLICING AND THE UNSOLVED MYSTERY OF SPLICEOSOME MUTATIONS IN CANCER REGULATED SPLICING AND THE UNSOLVED MYSTERY OF SPLICEOSOME MUTATIONS IN CANCER RNA Splicing Lecture 3, Biological Regulatory Mechanisms, H. Madhani Dept. of Biochemistry and Biophysics MAJOR MESSAGES Splice

More information

SUPPLEMENTARY FIGURES: Supplementary Figure 1

SUPPLEMENTARY FIGURES: Supplementary Figure 1 SUPPLEMENTARY FIGURES: Supplementary Figure 1 Supplementary Figure 1. Glioblastoma 5hmC quantified by paired BS and oxbs treated DNA hybridized to Infinium DNA methylation arrays. Workflow depicts analytic

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

Supplementary Figure 1. Estimation of tumour content

Supplementary Figure 1. Estimation of tumour content Supplementary Figure 1. Estimation of tumour content a, Approach used to estimate the tumour content in S13T1/T2, S6T1/T2, S3T1/T2 and S12T1/T2. Tissue and tumour areas were evaluated by two independent

More information

NGS in tissue and liquid biopsy

NGS in tissue and liquid biopsy NGS in tissue and liquid biopsy Ana Vivancos, PhD Referencias So, why NGS in the clinics? 2000 Sanger Sequencing (1977-) 2016 NGS (2006-) ABIPrism (Applied Biosystems) Up to 2304 per day (96 sequences

More information

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed.

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Supplementary Note The potential association and implications of HBV integration at known and putative cancer genes of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Human telomerase

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

The Alternative Choice of Constitutive Exons throughout Evolution

The Alternative Choice of Constitutive Exons throughout Evolution The Alternative Choice of Constitutive Exons throughout Evolution Galit Lev-Maor 1[, Amir Goren 1[, Noa Sela 1[, Eddo Kim 1, Hadas Keren 1, Adi Doron-Faigenboim 2, Shelly Leibman-Barak 3, Tal Pupko 2,

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing Last update: 05/10/2017 MODULE 4: SPLICING Lesson Plan: Title MEG LAAKSO Removal of introns from messenger RNA by splicing Objectives Identify splice donor and acceptor sites that are best supported by

More information

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. Supplementary Figure 1 Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. (a,b) Values of coefficients associated with genomic features, separately

More information

Supplementary Materials for

Supplementary Materials for www.sciencetranslationalmedicine.org/cgi/content/full/7/283/283ra54/dc1 Supplementary Materials for Clonal status of actionable driver events and the timing of mutational processes in cancer evolution

More information

Mechanisms of alternative splicing regulation

Mechanisms of alternative splicing regulation Mechanisms of alternative splicing regulation The number of mechanisms that are known to be involved in splicing regulation approximates the number of splicing decisions that have been analyzed in detail.

More information

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc. Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets

More information

UNIVERSITY OF TORINO DEPARTMENT OF ONCOLOGY. Giorgio V. Scagliotti University of Torino Dipartment of Oncology

UNIVERSITY OF TORINO DEPARTMENT OF ONCOLOGY. Giorgio V. Scagliotti University of Torino Dipartment of Oncology Giorgio V. Scagliotti University of Torino Dipartment of Oncology giorgio.scagliotti@unito.it Molecular landscape of MM not fully characterized to allow personalized treatment Recurrent genetic alterations

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma.

Nature Genetics: doi: /ng Supplementary Figure 1. Mutational signatures in BCC compared to melanoma. Supplementary Figure 1 Mutational signatures in BCC compared to melanoma. (a) The effect of transcription-coupled repair as a function of gene expression in BCC. Tumor type specific gene expression levels

More information

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. SUPPLEMENTAL FIGURE AND TABLE LEGENDS Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. A) Cirbp mrna expression levels in various mouse tissues collected around the clock

More information

Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research

Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research Application Note Authors John McGuigan, Megan Manion,

More information

MSI positive MSI negative

MSI positive MSI negative Pritchard et al. 2014 Supplementary Figure 1 MSI positive MSI negative Hypermutated Median: 673 Average: 659.2 Non-Hypermutated Median: 37.5 Average: 43.6 Supplementary Figure 1: Somatic Mutation Burden

More information

RECAP (1)! In eukaryotes, large primary transcripts are processed to smaller, mature mrnas.! What was first evidence for this precursorproduct

RECAP (1)! In eukaryotes, large primary transcripts are processed to smaller, mature mrnas.! What was first evidence for this precursorproduct RECAP (1) In eukaryotes, large primary transcripts are processed to smaller, mature mrnas. What was first evidence for this precursorproduct relationship? DNA Observation: Nuclear RNA pool consists of

More information

6/12/2018. Disclosures. Clinical Genomics The CLIA Lab Perspective. Outline. COH HopeSeq Heme Panels

6/12/2018. Disclosures. Clinical Genomics The CLIA Lab Perspective. Outline. COH HopeSeq Heme Panels Clinical Genomics The CLIA Lab Perspective Disclosures Raju K. Pillai, M.D. Hematopathologist / Molecular Pathologist Director, Pathology Bioinformatics City of Hope National Medical Center, Duarte, CA

More information

Cancer Informatics Lecture

Cancer Informatics Lecture Cancer Informatics Lecture Mayo-UIUC Computational Genomics Course June 22, 2018 Krishna Rani Kalari Ph.D. Associate Professor 2017 MFMER 3702274-1 Outline The Cancer Genome Atlas (TCGA) Genomic Data Commons

More information

ChIP-seq data analysis

ChIP-seq data analysis ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional

More information

An Increased Specificity Score Matrix for the Prediction of. SF2/ASF-Specific Exonic Splicing Enhancers

An Increased Specificity Score Matrix for the Prediction of. SF2/ASF-Specific Exonic Splicing Enhancers HMG Advance Access published July 6, 2006 1 An Increased Specificity Score Matrix for the Prediction of SF2/ASF-Specific Exonic Splicing Enhancers Philip J. Smith 1, Chaolin Zhang 1, Jinhua Wang 2, Shern

More information

Identifying Mutations Responsible for Rare Disorders Using New Technologies

Identifying Mutations Responsible for Rare Disorders Using New Technologies Identifying Mutations Responsible for Rare Disorders Using New Technologies Jacek Majewski, Department of Human Genetics, McGill University, Montreal, QC Canada Mendelian Diseases Clear mode of inheritance

More information

Cancer gene discovery via network analysis of somatic mutation data. Insuk Lee

Cancer gene discovery via network analysis of somatic mutation data. Insuk Lee Cancer gene discovery via network analysis of somatic mutation data Insuk Lee Cancer is a progressive genetic disorder. Accumulation of somatic mutations cause cancer. For example, in colorectal cancer,

More information

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. Supplementary Figure 1 Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. (a) Pearson correlation heatmap among open chromatin profiles of different

More information

V24 Regular vs. alternative splicing

V24 Regular vs. alternative splicing Regular splicing V24 Regular vs. alternative splicing mechanistic steps recognition of splice sites Alternative splicing different mechanisms how frequent is alternative splicing? Effect of alternative

More information

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the

More information

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant

More information

V23 Regular vs. alternative splicing

V23 Regular vs. alternative splicing Regular splicing V23 Regular vs. alternative splicing mechanistic steps recognition of splice sites Alternative splicing different mechanisms how frequent is alternative splicing? Effect of alternative

More information

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from Supplementary Figure 1 SEER data for male and female cancer incidence from 1975 2013. (a,b) Incidence rates of oral cavity and pharynx cancer (a) and leukemia (b) are plotted, grouped by males (blue),

More information

The search for cis-regulatory driver mutations in cancer genomes

The search for cis-regulatory driver mutations in cancer genomes /, Vol. 6, No. 32 The search for cis-regulatory driver mutations in cancer genomes Rebecca C. Poulos 1, Mathew A. Sloane 1, Luke B. Hesson 1, Jason W. H. Wong 1 1 Prince of Wales Clinical School and Lowy

More information

Using the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep

Using the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep Using the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep Tom Walsh, PhD Division of Medical Genetics University of Washington Next generation sequencing Sanger sequencing gold

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive

More information

COSMIC - Catalogue of Somatic Mutations in Cancer

COSMIC - Catalogue of Somatic Mutations in Cancer COSMIC - Catalogue of Somatic Mutations in Cancer http://cancer.sanger.ac.uk/cosmic https://academic.oup.com/nar/articl e-lookup/doi/10.1093/nar/gkw1121 Data In Large-scale systematic screens Detailed

More information

MODULE 3: TRANSCRIPTION PART II

MODULE 3: TRANSCRIPTION PART II MODULE 3: TRANSCRIPTION PART II Lesson Plan: Title S. CATHERINE SILVER KEY, CHIYEDZA SMALL Transcription Part II: What happens to the initial (premrna) transcript made by RNA pol II? Objectives Explain

More information

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015

Computer Science, Biology, and Biomedical Informatics (CoSBBI) Outline. Molecular Biology of Cancer AND. Goals/Expectations. David Boone 7/1/2015 Goals/Expectations Computer Science, Biology, and Biomedical (CoSBBI) We want to excite you about the world of computer science, biology, and biomedical informatics. Experience what it is like to be a

More information

RECAP (1)! In eukaryotes, large primary transcripts are processed to smaller, mature mrnas.! What was first evidence for this precursorproduct

RECAP (1)! In eukaryotes, large primary transcripts are processed to smaller, mature mrnas.! What was first evidence for this precursorproduct RECAP (1) In eukaryotes, large primary transcripts are processed to smaller, mature mrnas. What was first evidence for this precursorproduct relationship? DNA Observation: Nuclear RNA pool consists of

More information

Supplementary Figure 1

Supplementary Figure 1 Count Count Supplementary Figure 1 Coverage per amplicon for error-corrected sequencing experiments. Errorcorrected consensus sequence (ECCS) coverage was calculated for each of the 568 amplicons in the

More information

MEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)

MEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Ordering Information Acceptable specimen types: Fresh blood sample (3-6 ml EDTA; no time limitations associated with receipt)

More information

Inference of Isoforms from Short Sequence Reads

Inference of Isoforms from Short Sequence Reads Inference of Isoforms from Short Sequence Reads Tao Jiang Department of Computer Science and Engineering University of California, Riverside Tsinghua University Joint work with Jianxing Feng and Wei Li

More information

DNA-seq Bioinformatics Analysis: Copy Number Variation

DNA-seq Bioinformatics Analysis: Copy Number Variation DNA-seq Bioinformatics Analysis: Copy Number Variation Elodie Girard elodie.girard@curie.fr U900 institut Curie, INSERM, Mines ParisTech, PSL Research University Paris, France NGS Applications 5C HiC DNA-seq

More information

Finding subtle mutations with the Shannon human mrna splicing pipeline

Finding subtle mutations with the Shannon human mrna splicing pipeline Finding subtle mutations with the Shannon human mrna splicing pipeline Presentation at the CLC bio Medical Genomics Workshop American Society of Human Genetics Annual Meeting November 9, 2012 Peter K Rogan

More information

Correspondence to Nature Genetics: Exploring pediatric cancer mutation information using ProteinPaint

Correspondence to Nature Genetics: Exploring pediatric cancer mutation information using ProteinPaint SUPPLEMENTARY INFORMATION FOR Correspondence to Nature Genetics: Exploring pediatric cancer mutation information using ProteinPaint Xin Zhou 1, Michael Edmonson 1, Mark R. Wilkinson 1, Aman Patel 1, Gang

More information

An Analysis of MDM4 Alternative Splicing and Effects Across Cancer Cell Lines

An Analysis of MDM4 Alternative Splicing and Effects Across Cancer Cell Lines An Analysis of MDM4 Alternative Splicing and Effects Across Cancer Cell Lines Kevin Hu Mentor: Dr. Mahmoud Ghandi 7th Annual MIT PRIMES Conference May 2021, 2017 Outline Introduction MDM4 Isoforms Methodology

More information

The Epigenome Tools 2: ChIP-Seq and Data Analysis

The Epigenome Tools 2: ChIP-Seq and Data Analysis The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu http://zanglab.com PHS5705: Public Health Genomics March 20, 2017 1 Outline Epigenome: basics review ChIP-seq overview

More information

Nature Genetics: doi: /ng Supplementary Figure 1. TCGA data set on HNSCCs reanalyzed in this study.

Nature Genetics: doi: /ng Supplementary Figure 1. TCGA data set on HNSCCs reanalyzed in this study. Supplementary Figure 1 TCGA data set on HNSCCs reanalyzed in this study. Summary of the TCGA dataset on HNSCCs re-analyzed in this study and the respective numbers of samples available within each. Supplementary

More information

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq) RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome

More information

TP53 mutational profile in CLL : A retrospective study of the FILO group.

TP53 mutational profile in CLL : A retrospective study of the FILO group. TP53 mutational profile in CLL : A retrospective study of the FILO group. Fanny Baran-Marszak Hopital Avicenne Bobigny France 2nd ERIC workshop on TP53 analysis in CLL, Stresa 2017 TP53 abnormalities :

More information

REGULATED AND NONCANONICAL SPLICING

REGULATED AND NONCANONICAL SPLICING REGULATED AND NONCANONICAL SPLICING RNA Processing Lecture 3, Biological Regulatory Mechanisms, Hiten Madhani Dept. of Biochemistry and Biophysics MAJOR MESSAGES Splice site consensus sequences do have

More information

Supplementary Tables. Supplementary Figures

Supplementary Tables. Supplementary Figures Supplementary Files for Zehir, Benayed et al. Mutational Landscape of Metastatic Cancer Revealed from Prospective Clinical Sequencing of 10,000 Patients Supplementary Tables Supplementary Table 1: Sample

More information

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni.

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni. Supplementary Figure 1 Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Expression of Mll4 floxed alleles (16-19) in naive CD4 + T cells isolated from lymph nodes and

More information

RNA SEQUENCING AND DATA ANALYSIS

RNA SEQUENCING AND DATA ANALYSIS RNA SEQUENCING AND DATA ANALYSIS Download slides and package http://odin.mdacc.tmc.edu/~rverhaak/package.zip http://odin.mdacc.tmc.edu/~rverhaak/rna-seqlecture.zip Overview Introduction into the topic

More information

RNA SEQUENCING AND DATA ANALYSIS

RNA SEQUENCING AND DATA ANALYSIS RNA SEQUENCING AND DATA ANALYSIS Length of mrna transcripts in the human genome 5,000 5,000 4,000 3,000 2,000 4,000 1,000 0 0 200 400 600 800 3,000 2,000 1,000 0 0 2,000 4,000 6,000 8,000 10,000 Length

More information

TITLE: - Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition

TITLE: - Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition AD Award Number: W81XWH-13-1-0336 TITLE: - Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition PRINCIPAL INVESTIGATOR: Mary-Claire King,

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS

Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS dr sc. Ana Krivokuća Laboratory for molecular genetics Institute for Oncology and

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION Systematic investigation of cancer-associated somatic point mutations in SNP databases HyunChul Jung 1,2, Thomas Bleazard 3, Jongkeun Lee 1 and Dongwan Hong 1 1. Cancer Genomics

More information

Analyse de données de séquençage haut débit

Analyse de données de séquençage haut débit Analyse de données de séquençage haut débit Vincent Lacroix Laboratoire de Biométrie et Biologie Évolutive INRIA ERABLE 9ème journée ITS 21 & 22 novembre 2017 Lyon https://its.aviesan.fr Sequencing is

More information

Multifactorial Interplay Controls the Splicing Profile of Alu-Derived Exons

Multifactorial Interplay Controls the Splicing Profile of Alu-Derived Exons MOLECULAR AND CELLULAR BIOLOGY, May 2008, p. 3513 3525 Vol. 28, No. 10 0270-7306/08/$08.00 0 doi:10.1128/mcb.02279-07 Copyright 2008, American Society for Microbiology. All Rights Reserved. Multifactorial

More information

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11 ARID2 p.r623c KMT2D p.v650fs p.p976l p.r2922w p.l1212r p.d1400h DNA binding RFX DNA binding Zinc finger KMT2C p.a51s p.d372v p.c1103* p.d2847fs p.t2671 p.d2847fs p.r4586h PHD/ RING DHHC/ PHD PHD FYR N

More information

NGS for Cancer Predisposition

NGS for Cancer Predisposition NGS for Cancer Predisposition Colin Pritchard MD, PhD University of Washington Dept. of Lab Medicine AMP Companion Society Meeting USCAP Boston March 22, 2015 Disclosures I am an employee of the University

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study. mc mc mc mc SUP mc mc Supplemental Figure. Genes showing ectopic HK9 dimethylation in this study are DNA hypermethylated in Lister et al. study. Representative views of genes that gain HK9m marks in their

More information

Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR)

Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR) Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR) O. Solomon, S. Oren, M. Safran, N. Deshet-Unger, P. Akiva, J. Jacob-Hirsch, K. Cesarkas, R. Kabesa, N. Amariglio, R.

More information

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b) KRas;At KRas;At KRas;At KRas;At a b Supplementary Figure 1. Gene set enrichment analyses. (a) GO gene sets (MSigDB v3. c5) enriched in KRas;Atg5 fl/+ as compared to KRas;Atg5 fl/fl tumors using gene set

More information

Supplementary Figure 1: Classification scheme for non-synonymous and nonsense germline MC1R variants. The common variants with previously established

Supplementary Figure 1: Classification scheme for non-synonymous and nonsense germline MC1R variants. The common variants with previously established Supplementary Figure 1: Classification scheme for nonsynonymous and nonsense germline MC1R variants. The common variants with previously established classifications 1 3 are shown. The effect of novel missense

More information

CITATION FILE CONTENT/FORMAT

CITATION FILE CONTENT/FORMAT CITATION For any resultant publications using please cite: Matthew A. Field, Vicky Cho, T. Daniel Andrews, and Chris C. Goodnow (2015). "Reliably detecting clinically important variants requires both combined

More information

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012 Elucidating Transcriptional Regulation at Multiple Scales Using High-Throughput Sequencing, Data Integration, and Computational Methods Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder

More information

RNA-seq Introduction

RNA-seq Introduction RNA-seq Introduction DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different functional RNAs Which RNAs (and sometimes then translated

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 U1 inhibition causes a shift of RNA-seq reads from exons to introns. (a) Evidence for the high purity of 4-shU-labeled RNAs used for RNA-seq. HeLa cells transfected with control

More information

ESEfinder: a Web resource to identify exonic splicing enhancers

ESEfinder: a Web resource to identify exonic splicing enhancers ESEfinder: a Web resource to identify exonic splicing enhancers Luca Cartegni, Jinhua Wang, Zhengwei Zhu, Michael Q. Zhang, Adrian R. Krainer Cold Spring Harbor Laboratory, Cold Spring Harbor, New York,

More information

Supplementary Appendix

Supplementary Appendix Supplementary Appendix This appendix has been provided by the authors to give readers additional information about their work. Supplement to: Yatsenko AN, Georgiadis AP, Röpke A, et al. X-linked TEX11

More information

Frequency(%) KRAS G12 KRAS G13 KRAS A146 KRAS Q61 KRAS K117N PIK3CA H1047 PIK3CA E545 PIK3CA E542K PIK3CA Q546. EGFR exon19 NFS-indel EGFR L858R

Frequency(%) KRAS G12 KRAS G13 KRAS A146 KRAS Q61 KRAS K117N PIK3CA H1047 PIK3CA E545 PIK3CA E542K PIK3CA Q546. EGFR exon19 NFS-indel EGFR L858R Frequency(%) 1 a b ALK FS-indel ALK R1Q HRAS Q61R HRAS G13R IDH R17K IDH R14Q MET exon14 SS-indel KIT D8Y KIT L76P KIT exon11 NFS-indel SMAD4 R361 IDH1 R13 CTNNB1 S37 CTNNB1 S4 AKT1 E17K ERBB D769H ERBB

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi: 1.138/nature8645 Physical coverage (x haploid genomes) 11 6.4 4.9 6.9 6.7 4.4 5.9 9.1 7.6 125 Neither end mapped One end mapped Chimaeras Correct Reads (million ns) 1 75 5 25 HCC1187 HCC1395 HCC1599

More information

Supplementary Information. Supplementary Figures

Supplementary Information. Supplementary Figures Supplementary Information Supplementary Figures.8 57 essential gene density 2 1.5 LTR insert frequency diversity DEL.5 DUP.5 INV.5 TRA 1 2 3 4 5 1 2 3 4 1 2 Supplementary Figure 1. Locations and minor

More information

Supplementary Information Titles Journal: Nature Medicine

Supplementary Information Titles Journal: Nature Medicine Supplementary Information Titles Journal: Nature Medicine Article Title: Corresponding Author: Supplementary Item & Number Supplementary Fig.1 Fig.2 Fig.3 Fig.4 Fig.5 Fig.6 Fig.7 Fig.8 Fig.9 Fig. Fig.11

More information