Supplementary Figure 1. 1,000-cell Scale Nano-ChIPseq Validation Comparison between small-scale ChIPseq (ie Nano-ChIPseq) and Standard ChIPseq for

Size: px
Start display at page:

Download "Supplementary Figure 1. 1,000-cell Scale Nano-ChIPseq Validation Comparison between small-scale ChIPseq (ie Nano-ChIPseq) and Standard ChIPseq for"

Transcription

1 Supplementary Figure 1. 1,000-cell Scale Nano-ChIPseq Validation Comparison between small-scale ChIPseq (ie Nano-ChIPseq) and Standard ChIPseq for K27me3 in murine ES cells. A representative Genome Browser track is shown. Additional details are presented in Ng et al., (2013) Developmental Cell 1.

2 Supplementary Figure 2. Nano-ChIPseq Peak Calling and Peak Merging. (a) Representative UCSC browser view of Nano-ChIPseq tag density for 5 histone modifications in patient normal ( N). Peak regions are indicated above each ChIPseq track. The gene track shows RefSeq transcripts. (b) K4me3 peak profile of 6 tissue samples. Merged peak regions are indicated below the RefSeq transcript track.

3 Supplementary Figure 3. Identification and Removal of Amplified Regions. Representative views of amplified loci around the KRAS locus in tissue C (a) and MYC in tissue C (b). Regions exhibiting an abundance of sequence tags in the cancer input DNAs were removed from subsequent analysis.

4 Supplementary Figure 4. Overlap of Chromatin Marks in GC/Normal Pairs. Overlap of chromatin mark regions in normal (top) and cancer tissue (bottom) for sample pairs (a) and (b). Numbers represent the fraction of 1 st set regions (vertical) overlapping with the 2 nd set regions (horizontal). Blue = exclusive, Red = overlapping.

5 Supplementary Figure 5. Null distributions of overlap between enhancer predictions and P300/CEBP/CTCF binding sites in HeLaS3. The overlap of predicted enhancers with each different transcription factor binding site repertoire is statistically significant (P<10-4 ). Similar results were obtained for K562 cells.

6 Supplementary Figure 6. Nano-ChIPseq RPKM Pre-processing. (a-d) Boxplots of log2-transformed ChIPseq RPKM values (after ComBat normalization) for promoter and predicted enhancer regions. (e-h) Raw p-value distributions from linear model fitting.

7 Supplementary Figure 7. Validation of Nano-ChIPseq altered chromatin peaks by qpcr Using qpcr we validated 18 altered chromatin peaks identified by Nano-ChIPseq analysis as being different between tumor and normal samples (9 K27ac, 9 K4me3; corresponding to 14 genomic regions). log 2 fold change<0 is defined as Loss, log 2 fold change>0 is defined as Gain. All 18 peaks showed a similar concordance between the mean Nano-ChIPseq fold change and mean qpcr fold change. qpcr was performed on 3 GC/normal pairs for which tissue was available.

8 Supplementary Figure 8. PCA plots of GCs and Normal Tissues. (a,b) and (d,e) PCA plot of GCs (purple) and normal (blue) tissues using (a) all identified K4me3 (promoter) and (b) associated K27ac (activity); or (d) all identified K4me1 (predicted enhancer) and (e) associated K27ac regions. (c and f) PCA plot using K27ac regions exhibiting somatic alterations.

9 Supplementary Figure 9. Cryptic promoter usage in GC cell lines (a) Venn diagram of K4me3 and K27ac Nano-ChIPseq overlap with cryptic promoters in GC cell lines (e.g. N87, YCC3, OCUM1 and SNU16). Of 368 cryptic promoters identified, 157 are found in N87, 91 are found in YCC3, 187 are found in OCUM1 and 197 are found in SNU16. When combined, a total set of 249 (67.7%) cryptic promoters were observed in the cell lines. (b) Example of a cryptic promoter region (orange box) marked by both K4me3 (top brown) and K27ac (bottom purple) in GC cell lines (N87, YCC3) and tumor sample , but not in matched normal gastric tissue.

10 Supplementary Figure 10. RT-qPCR Validation of Non-RefSeq Transcripts. RT-qPCR validation results for 10 non-refseq transcripts in GCs and normal tissues, compared to FPKM values derived from RNAseq analysis. Primer sets are provided in Supplementary Table 10.

11

12 Supplementary Figure 11. MET Gene RNAseq Alignment. (a) RNAseq tag alignments supporting exon-intron structures of expressed RNAs at the MET gene locus. MET RefSeq transcripts are indicated in the top annotation. (b,c) Close-up view of RNAseq alignments. Exon-intron boundaries for cryptic promoter-driven exons were confirmed by manual inspection of sequence tag alignments. (d) Visualization of MET cryptic promoter showing location of representative RNAseq reads and K4me3 enrichment. The downstream MET RefSeq exon is shown. (e) 5 RACE analysis. Close-up cryptic promoter view with RNAseq and K4me3-enriched reads. The location of the 5 RACE primer is indicated, and the product of the 5 RACE product from GC Hs746T cells is shown*. (f) Schematic of predicted Met protein isoforms driven by RefSeq or cryptic promoters. (g) Western blotting. Full length Met and predicted isoforms were monitored using antibodies targeting the Met C-terminal cytoplasmic domain. The band size of the predicted isoform corresponding to the truncated Met variant driven by the internal cryptic promoter is 978 aa (~100 kda), while the RefSeq associated full length Met receptor is 190kDa 2. β- actin was used as a loading control. NUGC4 cells are predicted to express both the truncated Met variant and full-length Met, while KATOIII cells are predicted to predominantly express full length Met. *Hs746T cells were not used for Western blotting as these cells contain an exon 14 Met splice site mutation that also leads to other aberrant Met proteins 3.

13 Supplementary Figure 12. MET RNAseq Analysis and Full Image of Met Western blot. (a) RNAseq analysis demonstrating expression of exons associated with the MET internal cryptic promoter (red box) in GC C and NUGC4 cells (red arrows) but not in N (matched normal gastric tissue) and KATOIII cells (b) Original Met and β-actin Western blot image for NUGC4 and KATOIII as shown in Supplementary Fig. 11g. Molecular weight markers are indicated.

14

15 Supplementary Figure 13. NKX6-3 RNAseq Alignment. (a) RNAseq tag alignments supporting exon-intron structures of expressed RNAs. NKX6-3 RefSeq transcript (NM152568) is shown in green, and predicted cryptic-promoter expressed mrnas are indicated at the top. (b-f) Close-up view of RNAseq alignments. Exon-intron boundaries were confirmed by manual inspection of sequence tag alignments. (g) Gel photo of two distinct 5 RACE products from 7 GC cell-lines. (h) Genome browser view showing positions of 5 RACE fragments from NUGC3 (larger RACE product) and KATOIII (smaller RACE product) cell-lines. The location of the 5 RACE primer is indicated. Both products validate the expression of a 5 non-refseq exon, where NUGC3 has a larger product. Transcript 5 ends are shown by the red arrows (i) Predicted mrna and polypeptide structures. Location of the NKX6-3 homeodomain is indicated based on the RefSeq database.

16 Supplementary Figure 14. HOXB9 Locus RNAseq Alignment. (a) RNAseq alignments at the HOXB9 locus. Multiple spliced RNA isoforms were predicted based on RNAseq alignments. (b-h) Exon-intron boundaries were confirmed by RNAseq tag alignments. (i) Coding potential of predicted mrnas and microrna precursors are indicated. The oncogenic mirna MIR196A also falls in this region.

17 Supplementary Figure 15. Microarray Validation Cohort. Heatmap of log2-transformed expression of upregulated K4me3-marked cancer-associated genes (n =218) in independent microarray data. Cancer (n = 185) and Normal (n = 89) samples are from Singapore. Cancer and Normal samples show distinct patterns of expression where the majority of K4me3-marked cancer-associated genes are upregulated in tumors.

18 Supplementary Figure 16. Association of Cancer-associated Genes with Clinicopathologic Features. Mosaic plots of factors with significant association with K4me3-marked cancer-associated genes: (a) M- stage, (b) Lauren s histopathology, and (c) Intrinsic signature from Tan et al. 4. Higher M-stage (p = 0.033, Pearson s Chi-square test), diffuse Lauren s histopathology (p = 9.99x10-5, Pearson s Chi-square test), and GDIFF intrinsic signature classification (p =1.46x10-11, Pearson s Chi-square test) are significantly associated with high-expression of the K4me3-marked cancer-associated genes. Red: high-enrichment of signature; blue: low-enrichment of signature.

19 Supplementary Figure 17. Overlap of GC Promoters and Enhancers with ENCODE Data. (a) Frequency of TFBS in promoter regions. (b) Frequency of TFBS in predicted enhancer regions.

20 Supplementary Figure 18. Validation of somatic mutations by Sanger sequencing. (a-c) Three representative Sanger traces are shown, depicting somatic mutations. Mutation sites are indicated by arrows. C and N refer to Cancer and Normal respectively.

21 Supplementary Figure 19. CDH10 Locus Somatic Mutation Analysis. (a) Genome browser view of CDH10 locus. (b) Magnified view. (c) K4me3 ChIPseq tag alignment, showing variant and allele bias. Note that the mutation is a dinucleotide substitution compared to the hg19 reference. (d) Sanger sequencing traces of normal and cancer input DNA and K4me3 ChIP DNA. Cancer input contains a small fraction of the mutant allele, and the mutant allele is enriched by K4me3 ChIP.

22 Supplementary Figure 20. HOXA5 Locus Somatic Mutation Analysis. (a) Genome browser view of HOXA5 locus. (b) K4me3 ChIPseq tag alignment, showing variant and allele bias. (c) Sanger sequencing traces of normal and cancer input DNA and K4me3 ChIP DNA. Cancer input contains a small fraction of mutant allele, and the mutant allele is enriched by K4me3 ChIP.

23 Supplementary Figure 21. FAR2 Locus Somatic Mutation Analysis. (a) Genome browser view of FAR2 locus. (b) Magnified view. (c) K4me3 ChIPseq tag alignment, showing variant and allele bias. (d) Sanger sequencing traces of normal and cancer input DNA and K4me3 ChIP DNA. Cancer input contains a small fraction of mutant allele, and the mutant allele is enriched by K4me3 ChIP.

24 Supplementary Table 1. Clinical Data for Samples Analyzed by Nano-ChIPseq and RNAseq. Tissue ID Age at Surgery Gender site.of.tumour StageT.7 StageN.7 StageM.7 AJCC7 Grade H.pylori.status moderately male lesser curve T4a N3a m1 4 differentiated yes poorly male lesser curve T4a T3b m1 4 differentiated yes poorly female PYLORUS T4a T3b m1 4 differentiated UNKNOWN poorly female incisura T4a T3b m0 3C differentiated UNKNOWN Lauren's Classification "intestinal type" adenocarcinoma "diffuse type" adenocarcinoma "diffuse type" adenocarcinoma "intestinal type" adenocarcinoma male unknown T2 N1 m0 2A undifferentiated UNKNOWN mixed/others ChIPseq male lesser curve T3 N3a m0 3B poorly differentiated yes "intestinal type" adenocarcinoma RNAseq male antrum T2 N1 m0 2A well differentiated UNKNOWN "intestinal type" adenocarcinoma RNAseq female lesser curve T2 N1 m0 2A moderately differentiated no "intestinal type" adenocarcinoma RNAseq male CARDIA T2 N1 m0 2A poorly differentiated UNKNOWN "intestinal type" adenocarcinoma RNAseq male lesser curve T4a T3b m0 3C poorly differentiated yes "diffuse type" adenocarcinoma RNAseq male antrum T4a N3a m1 4 moderately differentiated UNKNOWN "intestinal type" adenocarcinoma RNAseq female greater curve T1b N0 m0 1A moderately differentiated no "intestinal type" adenocarcinoma RNAseq male antrum T2 N1 m0 2A moderately differentiated UNKNOWN "intestinal type" adenocarcinoma RNAseq Platform ChIPseq ChIPseq ChIPseq ChIPseq

25 Supplementary Table 2. ChIPseq Library and Peak Call Summary. Library Patient Tissue ChIP antibody Read length Uniquely mapped read CCAT3 fold-cutoff filtered CCAT Peak region CHG Cancer Input ,569,842 CHG Normal Input ,367,968 CHG Cancer Input ,306,207 CHG Normal Input ,360,435 CHG Cancer Input ,686,506 CHG Normal Input ,097,151 CHG Cancer H3K4me ,268, ,652 CHG Normal H3K4me ,235, ,336 CHG Cancer H3K4me ,437, ,339 CHG Normal H3K4me ,563, ,361 CHG Cancer H3K4me ,614, ,158 CHG Normal H3K4me ,395, ,861 CHG Cancer H3K27ac ,047, ,580 CHG Normal H3K27ac ,024, ,042 CHG Cancer H3K27ac ,004, ,222 CHG Normal H3K27ac ,336, ,208 CHG Cancer H3K27ac ,575, ,955 CHG Normal H3K27ac ,176, ,619 CHG Cancer H3K4me ,374, ,934 CHG Normal H3K4me ,100, ,932 CHG Cancer H3K4me ,464, ,087 CHG Normal H3K4me ,970, ,379 CHG Cancer H3K4me ,591, ,963 CHG Normal H3K4me ,976, ,074 CHG Cancer H3K36me ,870, ,308 CHG Normal H3K36me ,978, ,980 CHG Cancer H3K36me ,758, ,449 CHG Normal H3K36me ,309, ,879 CHG Cancer H3K36me ,619, ,912 CHG Normal H3K36me ,690, ,370 CHG Cancer H3K27me ,833, ,363 CHG Normal H3K27me ,631, ,910 CHG Cancer H3K27me ,335, ,308 CHG Normal H3K27me ,850, ,862 CHG Cancer H3K27me ,847, ,694 CHG Normal H3K27me ,898, ,682 CHG Normal Input ,653,370 CHG Cancer Input ,916,056 CHG Normal Input ,021,674 CHG Cancer Input ,781,611 CHG Normal H3K27ac ,671,862 CHG Cancer H3K27ac ,597,536 CHG Normal H3K27ac ,983,068 CHG Cancer H3K27ac ,051,266 CHG Normal H3K4me ,841, ,644 CHG Cancer H3K4me ,794, ,997 CHG Normal H3K4me ,908, ,131 CHG Cancer H3K4me ,713, ,382 CHG Normal H3K4me ,451, ,116 CHG Cancer H3K4me ,926, ,974 CHG Normal H3K4me ,946, ,007 CHG Cancer H3K4me ,434, ,167 CCAT peak call results were filtered by fold-above background cut-off to obtain peak regions.

26 Supplementary Table 3. Accuracy of Enhancer Predictions Using ENCODE ChIPseq data and CCAT version 3.0 (FDR < 0.05 and excluding H3K4me3 enriched regions). Cell Line Transcription Factor Total Enhancer Predictions % TFBS overlap w/ Enhancer Prediction Sensitivity Specificity 52% 67% 51% HeLaS3 P300 HeLaS3 CEBP 40% 65% 53% 73,929 HeLaS3 CTCF 23% 55% 50% HeLaS3 P300, CEBP, CTCF 24% 63% 54% K562 P300 30% 37% 39% K562 CEBP 29% 50% 40% 109,082 K562 CTCF 25% 53% 40% K562 P300,CEBP, CTCF 23% 43% 35%

27 Supplementary Table 4. Cryptic Promoters in Primary Gastric Adenocarcinomas. Transcript Database Cryptic Promoters Overlap with Annotated TSS RefSeq* Ensembl74** Gencodev * Values provided in Table correspond to RefSeq transcripts as of Nov The updated RefSeq database (Release 63) has 22 additional promoter regions associated with annotated transcripts ** Ensembl 74 contains Met and NKX6-3 isoforms described in Fig. 2 of the Main Text

28 Supplementary Table 5. Clinical Characteristics of Validation Cohort. Age (years) range mean ± standard deviation Singapore GC Cohort (n = 185) (1 missing) 64.5 ± 12.9 (1 missing) Gender (percent) Female 68 (36.8) Male 116 (62.7) missing 1 (0.54) Stage (percent) 1 29 (15.7) 2 30 (16.2) 3 66 (35.7) 4 59 (31.9) missing 1 (0.54) Lauren's histopathology (percent) Intestinal 92 (49.7) Diffuse 72 (38.9) Mixed/ unclassifiable 20 (10.8) missing 1 (0.54) Intrinsic signature** (percent) GINT 92 (49.7) GDIFF 70 (37.8) Ambiguous 23 (12.4) Helicobacter Pylori status (percent) Positive 59 (31.9) Negative 37 (20.0) missing 89 (48.1) Median overall survival* (months) 22.5 (1 missing) Number of overall death events 110 (2 missing) *Defined as time from surgery to last follow-up **Classified according to Tan et al. (2011)

29 Supplementary Table 6. Association of Clinicopathological Factors with K4me3-marked Cancer - Associated Genes. Clinicopathological factor class of factor Pearson's Chi-sq P- value t-test P-value Direction of association Age continuous N.A N.A. Gender categorical N.A. N.A. Site of tumor categorical N.A. N.A. Grade categorical N.A. N.A. TMN stage (AJCC7) categorical N.A. N.A. T.stage categorical N.A. N.A. M.stage categorical N.A. positive Number of lymph nodes continuous N.A N.A. Helicobacter Pylori status categorical N.A. N.A. Lauren's histopathology categorical 9.99E-05 N.A. positive Intrinsic signature* categorical 1.46E-11 N.A. positive Ulcerative status categorical N.A. N.A. Adjuvant 5FU therapy categorical N.A. N.A. *Intrinsic signature from Tan et al. (2011)

30 Supplementary Table 7. Transcription Factor Binding Site Overlap Analysis. Yellow: p<0.05 for promoter regions. Blue: p<0.05 for predicted enhancer regions. TFs all_k 4me3 all_k 4me1 gainp romot er losspr omote r gaine nhanc er losse nhanc er all_k 4me3 _per1 0kb gainp romot er_pe r10kb losspr omote r_per 10kb all_k 4me1 _per1 0kb gaine nhanc er_pe r10kb losse nhanc er_pe r10kb Pval_H4 me3_ga in corpval _H4me3 _Gain Pval_H4 me1_ga in corpval _H4me1 _Gain ARID3A E E E E-36 ATF E E E E-12 ATF E E E E-02 ATF E E E E-40 BACH E E E E-04 BATF E E E E-05 BCL11A E E E E-03 BCL E E E E-17 BCLAF E E E E-03 BDP E E E E+00 BHLHE E E E E-12 BRCA E E E E-13 BRF E E E E+00 BRF E E E E+00 CBX E E E E-08 CCNT E E E E-06 CEBPB E E E E-54 CEBPD E E E E-15 CHD E E E E+00 CHD E E E E-31 CREB E E E E-16 CTBP E E E E+00 CTCF E E E E-11 CTCFL E E E E+00 E2F E E E E-09 E2F E E E E-11 E2F E E E E-05 EBF E E E E-03 EGR E E E E-01 ELF E E E E-17 ELK E E E E-04 ELK E E E E-10 EP E E E E-71 ESR E E E E-21 ESRRA E E E E-01 ETS E E E E-12 EZH E E E E+00 FAM48A E E E E-04 FOS E E E E-72 FOSL E E E E-19 FOSL E E E E-80 FOXA E E E E-32

31 FOXA E E E E-23 FOXM E E E E-01 FOXP E E E E-03 GABPA E E E E-40 GATA E E E E-03 GATA E E E E-19 GATA E E E E-11 GRp E E E E+00 GTF2B E E E E+00 GTF2F E E E E-19 GTF3C E E E E+00 HDAC E E E E+00 HDAC E E E E-22 HDAC E E E E+00 HDAC E E E E+00 HMGN E E E E-03 HNF4A E E E E-20 HNF4G E E E E-22 HSF E E E E+00 IKZF E E E E+00 IRF E E E E-02 IRF E E E E-01 IRF E E E E+00 JUN E E E E-68 JUNB E E E E-23 JUND E E E E-59 KAP E E E E-14 KDM5A E E E E+00 KDM5B E E E E+00 MAFF E E E E-13 MAFK E E E E-14 MAX E E E E-36 MAZ E E E E-19 MBD E E E E-11 MEF2A E E E E-01 MEF2C E E E E+00 MTA E E E E+00 MXI E E E E-12 MYBL E E E E-23 MYC E E E E-48 NANOG E E E E+00 NFATC E E E E+00 NFE E E E E-01 NFIC E E E E-20 NFYA E E E E-08 NFYB E E E E-05 NR2C E E E E-02

32 NR2F E E E E-11 NR3C E E E E-55 NRF E E E E-01 PAX E E E E-03 PBX E E E E+00 PHF E E E E-01 PML E E E E-03 POLR2A E E E E-44 POLR3G E E E E+00 POU2F E E E E-02 POU5F E E E E+00 PPARGC1 A E E E E-02 PRDM E E E E-08 RAD E E E E-21 RBBP E E E E-03 RCOR E E E E-30 RDBP E E E E+00 RELA E E E E-03 REST E E E E-27 RFX E E E E-35 RPC E E E E+00 RUNX E E E E-01 RXRA E E E E-25 SAP E E E E+00 SETDB E E E E-02 SIN3A E E E E+00 SIN3AK E E E E-19 SIRT E E E E-02 SIX E E E E-07 SMARCA E E E E-05 SMARCB E E E E-16 SMARCC E E E E-35 SMARCC E E E E-05 SMC E E E E-22 SP E E E E-27 SP E E E E+00 SP E E E E+00 SPI E E E E+00 SREBP E E E E+00 SRF E E E E-04 STAT E E E E-26 STAT E E E E-02 STAT E E E E-87 STAT5A E E E E-06 SUZ E E E E+00 TAF E E E E-24

33 TAF E E E E+00 TAL E E E E-08 TBL1XR E E E E-06 TBP E E E E-34 TCF E E E E-42 TCF E E E E+00 TCF7L E E E E-63 TEAD E E E E-25 TFAP2A E E E E-27 TFAP2C E E E E-35 THAP E E E E+00 TRIM E E E E-08 UBTF E E E E+00 USF E E E E-21 USF E E E E-16 WRNIP E E E E+00 YY E E E E-21 ZBTB E E E E-40 ZBTB7A E E E E-08 ZEB E E E E+00 ZKSCAN E E E E-05 ZNF E E E E-05 ZNF E E E E-18 ZNF E E E E-01 ZNF E E E E+00 ZZZ E E E E+00

34 Supplementary Table 8. Genetic Variation in Nano-ChIPseq Data. Germline Variants Somatically altered Detected elements Somatic Mutations Somatically altered Detected elements Tissue Gain Loss Gain Loss , , , , , Total 439,160 1, Germline SNVs and somatic mutations in 5 tumor/normal pairs detected by MuTect at minimum 20x coverage and/or validated using Sanger sequencing in both tumor and normal. Sequence coverage for the last two pairs ( and ) was lower than the first three, hence the lower rate of SNV identification. 439,160 dbsnp (ie germline) SNVs were detected across the 5 patients (249,786 unique SNPs). Approximately 188,500 were heterozygous in at least one sample. Across the 5 patients, 1,760 SNPs mapped to somatically altered elements. 947 SNPs were unique, of which 714 were heterozygous. Similarly, 144 somatic mutations were detected across the 5 patients. Sanger sequencing confirmed the somatic mutation status of 55 out of 71 sites (77%).

35 Supplementary Table 9. Assessing Allele Bias at 16 Sites Using Input DNAs and ChIP-enriched Reads. 11 sites were validated using quantitative PyroMark sequencing (5 sites were not validated due to PCR or Pyrosequencing (PSQ) failure highlighted in grey). 8 sites (73%) confirm allele bias (>0.30, in bold) in ChIP-enriched reads. Tissue Region Coordinate (0-based) Ref SNP Alternate Allele Fraction by PyroMark INPUT K4me3 K27ac Tumor Normal Bias Tumor Normal Bias Tumor Normal Bias Regulome DB Score Nearby genes Gain chr2: T C b ID Gain chr2: C G b CAPN Loss chr4: T C a SORBS Gain chr6: A G f AK Gain chr6: C PCR failure 1a ZDHHC Gain chr7: C PSQ failure 1f CLDN Gain chr7: G PSQ failure 2b CLDN Gain chr7: T G b CFTR Gain chr8: T C b FER1L Gain chr9: T A b MIR31HG, MTAP Gain chr9: G C b MIR31HG, MTAP Gain chr11: C PCR failure 2b KCNE Gain chr15: G PCR RPUSD2, 2c failure CASC Gain chr18: G A b SERPINB Gain chr19: A G f KLK Gain chr20: T C b AK307192

36 Supplementary Table 10. PCR, qpcr, 5 RACE, PyroMark and Cloning Primers Description Non-RefSeq exon expression Exon1, chr1: Forward Exon1, chr1: Reverse Exon2, chr1: Forward Exon2, chr1: Reverse Exon3, chr7: Forward Exon3, chr7: Reverse Exon4, chr8: Forward Exon4, chr8: Reverse Exon5, chr8: Forward Exon5, chr8: Reverse Exon6, chr8: Forward Exon6, chr8: Reverse Exon7, chr11: Forward Exon7, chr11: Reverse Exon8, chr17: Forward Exon8, chr17: Reverse Exon9, chr19: Forward Exon9, chr19: Reverse Exon10, chr19: Forward Exon10, chr19: Reverse 5' RACE primers 1st strand MET Refseq exon 3 1st strand NKX6-3Refseq exon 1 PCR MET exon 3 PCR NKX6-3 exon 1 PCR NKX6-3 exon 1 for nested PCR qpcr and Pyrosequencing TNK2_A45_R_5btn TNK2_A45_F TNK2_A45_PSQ_F NUDT4_SN78_R_5bio NUDT4_SN78_F NUDT4_SN78_PSQ_F KLK1_E7_F436_btn KLK1_E7_R577 KLK1_E7_PSQ_R HOXA11_A64_R_5btn_2 HOXA11_A64_F HOXA11_A64_PSQ_F2 Reporter cloning FOS-99_BglII FOS-99_HindIII HOX11AS_F308_BglII HOX11AS_R645_BglII Differential promoters Promoter1, chr1: , Forward Promoter1, chr1: , Reverse Promoter2, chr1: , Forward Promoter2, chr1: , Reverse Promoter3, chr2: , Forward Promoter3, chr2: , Reverse Sequence (5'>3') AGGCAGGAGAGTAGAGTCTGGAGG GTTCTAGGAAGTCAGCGTGAAACG TGGCGGACTCAACAAGGC GGAAGGCAGGAAGTGGCT ACTTTGGCAAGGTTAGGT GTTGTAGGGATGGGTTCT CCTGCGACTCGGTCATGC ACACCCGGCCCACCTTCA GTTGTTCAGCAGGAACGT TTTGGGGCTAGAGAGTGG GATTTTGCTGTGTTCCTA CTTCTCCATTCTTCCTCT GGACCTGCCACCTCCAAAC TCCTGAAAGAGCTGCCTTC AGGTGCCCCCAGGTGACTC GCCCTTTCGCTGCTATGCC ATGCTCCCCCTTTTCCACC AATGTCCCTTCTCCCCTTG TCCCAGCCTGCATTCCAT GGTTTGCCTCGGGTTCTT CTTCAGTGCAGGG GAAGGTAGGCTCCTC GGCTCCAGGGTCTTCACCTCCA CCAGGCTGAGCACCGAGAAGGC GCTTGCGCAGCAGCAGGCGGAT [Btn]TGGGACCCCTAGGGAGAAGA CAGAGGCCGGTGCTGAGA TCTGGAAATCTCAGCAC [Btn]GCCAAGACTGGGCCATGT CCTACGAACAGAAATAAAACAATTGTGA TGGGAGAAGCAGAGTG [Btn]TTGAGAGTGCAAAAGGCAAATG GCTGTCCCTGTCTTTTATCCTTCT CCTCCCCTCCGCCTT [Btn]CCCTTTTTCCCCTAGATTTGGT CCCAAGGCTGCGCAATAT TGAATGTGAAGGTTTCTC GTAGCTGCATAGATCTGCGCGCCACCCCTCTGGCGCCACCGT GTAGCTGCATCAAGCTTGCCGGCTCAGTCTTGGCTTCTC AGCTATGCTGAGATCTGAAGATGGATGAGGGAAAAGGTT AGCTATGCTGAGATCTTGTTGGACCTCCTGCAAAGC ACCTGCGACTGTCTCTTT GGCTGTTTACCGTTGTTT CTTTGTTCTCCTCTGCTT CTGTATTTGTCCCGATTG AGGGTGTGAATGATGGTA TGTCTCCTCGTATGCTGC

37 Promoter4, chr4: , Forward Promoter4, chr4: , Reverse Promoter5, chr5: , Forward Promoter5, chr5: , Reverse Promoter6, chr5: , Forward Promoter6, chr5: , Reverse Promoter7, chr7: , Forward Promoter7, chr7: , Reverse Promoter8, chr8: , Forward Promoter8, chr8: , Reverse Promoter9, chr9: , Forward Promoter9, chr9: , Reverse Promoter10, chr11: , Forward Promoter10, chr11: , Reverse Promoter11, chr12: , Forward Promoter11, chr12: , Reverse Promoter12, chr18: , Forward Promoter12, chr18: , Reverse Promoter13, chr18: , Forward Promoter13, chr18: , Reverse Promoter14, chr20: , Forward Promoter14, chr20: , Reverse TTCAGCAAAGGAGAAACA GAGTGAGAGTGGCAAAAC ATGACTTCTGGGTTTCTG CAACAGACATCATCGTGG ATAGAGATTGGCAAACCG TGTCTCCTCGTATGCTGC AAGCACTTATCGCCAATC ATACATCCCTGACTCCAC AAAACCTGATATGTCCCT TAGTGCCAACCTCTAACC CCCACATACCCATTACTT AGCAGAAACTTCAACACA TTCTTTCCTGAATCCTGC GACTCAGGGAGAGGAGAG TGTGGCTAAGAGGTTGGT AGGGAGCCTGCGATAAGT CACCTTTACCTCCCAACA ATTCCTTTCTCTTGTTCC AACAAGAAAGCAGGTGAT GGCTCCCTCTACACACAG ATCCCACCATTAGAGAGA CGCCATCAGATCAGAATC

38 Supplementary References 1. Ng JH, et al. In vivo epigenomic profiling of germ cells reveals germ cell molecular signatures. Dev Cell 24, (2013). 2. Rodrigues GA, Naujokas MA, Park M. Alternative splicing generates isoforms of the met receptor tyrosine kinase which undergo differential processing. Mol Cell Biol 11, (1991). 3. Asaoka Y, et al. Gastric cancer cell line Hs746T harbors a splice site mutation of c-met causing juxtamembrane domain deletion. Biochem Biophys Res Commun 394, (2010). 4. Tan IB, et al. Intrinsic subtypes of gastric cancer, based on gene expression pattern, predict survival and respond differently to chemotherapy. Gastroenterology 141, , 485 e (2011).

15. Supplementary Figure 9. Predicted gene module expression changes at 24hpi during HIV

15. Supplementary Figure 9. Predicted gene module expression changes at 24hpi during HIV Supplementary Information Table of content 1. Supplementary Table 1. Summary of RNAseq data and mapping statistics 2. Supplementary Table 2. Biological functions enriched in 12 hpi DE genes, derived from

More information

Nanoscale chromatin profiling of gastric adenocarcinoma reveals cancer-associated cryptic promoters and somatically acquired regulatory elements

Nanoscale chromatin profiling of gastric adenocarcinoma reveals cancer-associated cryptic promoters and somatically acquired regulatory elements Received Nov 13 Accepted 1 Jun 14 Published 1 Jul 14 DOI: 1.138/ncomms5361 Nanoscale chromatin profiling of gastric adenocarcinoma reveals cancer-associated cryptic promoters and somatically acquired regulatory

More information

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. Supplementary Figure Legends Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. A. Screenshot of the UCSC genome browser from normalized RNAPII and RNA-seq ChIP-seq data

More information

Chromatin landscapes and genetic risk in systemic lupus

Chromatin landscapes and genetic risk in systemic lupus Hui-Yuen et al. Arthritis Research & Therapy (2016) 18:281 DOI 10.1186/s13075-016-1169-9 RESEARCH ARTICLE Chromatin landscapes and genetic risk in systemic lupus Open Access Joyce S. Hui-Yuen 1,2*, Lisha

More information

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. Supplementary Figure 1 Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. (a) Pearson correlation heatmap among open chromatin profiles of different

More information

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Effect of HSP90 inhibition on expression of endogenous retroviruses. (a) Inducible shrna-mediated Hsp90 silencing in mouse ESCs. Immunoblots of total cell extract expressing the

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b) KRas;At KRas;At KRas;At KRas;At a b Supplementary Figure 1. Gene set enrichment analyses. (a) GO gene sets (MSigDB v3. c5) enriched in KRas;Atg5 fl/+ as compared to KRas;Atg5 fl/fl tumors using gene set

More information

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22. Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.32 PCOS locus after conditioning for the lead SNP rs10993397;

More information

Supplemental Information For: The genetics of splicing in neuroblastoma

Supplemental Information For: The genetics of splicing in neuroblastoma Supplemental Information For: The genetics of splicing in neuroblastoma Justin Chen, Christopher S. Hackett, Shile Zhang, Young K. Song, Robert J.A. Bell, Annette M. Molinaro, David A. Quigley, Allan Balmain,

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

Supplementary Information

Supplementary Information Supplementary Information - chimeric fusion transcript in human gastric cancer promotes tumorigenesis through activation of PI3K/AKT signaling Sun Mi Yun, Kwiyeom Yoon, Sunghoon Lee, Eunjeong Kim, Seong-Ho

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells. Supplementary Figure 1 Characteristics of SEs in T reg and T conv cells. (a) Patterns of indicated transcription factor-binding at SEs and surrounding regions in T reg and T conv cells. Average normalized

More information

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study. mc mc mc mc SUP mc mc Supplemental Figure. Genes showing ectopic HK9 dimethylation in this study are DNA hypermethylated in Lister et al. study. Representative views of genes that gain HK9m marks in their

More information

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni.

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni. Supplementary Figure 1 Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Expression of Mll4 floxed alleles (16-19) in naive CD4 + T cells isolated from lymph nodes and

More information

Lung Met 1 Lung Met 2 Lung Met Lung Met H3K4me1. Lung Met H3K27ac Primary H3K4me1

Lung Met 1 Lung Met 2 Lung Met Lung Met H3K4me1. Lung Met H3K27ac Primary H3K4me1 a Gained Met-VELs 1.5 1.5 -.5 Lung Met 1 Lung Met Lung Met 3 1. Lung Met H3K4me1 Lung Met H3K4me1 1 Lung Met H3K4me1 Lung Met H3K7ac 1.5 Lung Met H3K7ac Lung Met H3K7ac.8 Primary H3K4me1 Primary H3K7ac

More information

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq) RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

Chromatin 3D Structure and Cancer Typing via Deep Learning

Chromatin 3D Structure and Cancer Typing via Deep Learning Chromatin 3D Structure and Cancer Typing via Deep Learning 石毅 (Shi, Yi) 2017.06.21 Center for Systems Biomedicine Shanghai Jiao Tong University USyd-SJTU Joint Research Alliance for Translational Medicine

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

A computational approach to identify microrna (mirna) based biomarker from the regulation of disease pathology.

A computational approach to identify microrna (mirna) based biomarker from the regulation of disease pathology. Research Article http://www.alliedacademies.org/biology-and-medicine-case-report/ A computational approach to identify microrna (mirna) based biomarker from the regulation of disease pathology. Anandaram

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature19360 Supplementary Tables Supplementary Table 1. Number of monoclonal reads in each sample Sample Number of cells Total reads Aligned reads Monoclonal reads

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 U1 inhibition causes a shift of RNA-seq reads from exons to introns. (a) Evidence for the high purity of 4-shU-labeled RNAs used for RNA-seq. HeLa cells transfected with control

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Expression deviation of the genes mapped to gene-wise recurrent mutations in the TCGA breast cancer cohort (top) and the TCGA lung cancer cohort (bottom). For each gene (each pair

More information

The Epigenome Tools 2: ChIP-Seq and Data Analysis

The Epigenome Tools 2: ChIP-Seq and Data Analysis The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu http://zanglab.com PHS5705: Public Health Genomics March 20, 2017 1 Outline Epigenome: basics review ChIP-seq overview

More information

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the

Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the Supplementary Figure 1. Spitzoid Melanoma with PPFIBP1-MET fusion. (a) Histopathology (4x) shows a domed papule with melanocytes extending into the deep dermis. (b) The melanocytes demonstrate abundant

More information

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Supplemental Data Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Jianfu Jiang 1, Xinna Liu 1, Guotian Liu, Chonghuih Liu*, Shaohuah Li*, and Lijun

More information

Supplementary Figure 1

Supplementary Figure 1 Count Count Supplementary Figure 1 Coverage per amplicon for error-corrected sequencing experiments. Errorcorrected consensus sequence (ECCS) coverage was calculated for each of the 568 amplicons in the

More information

ESCs were lysed with Trizol reagent (Life technologies) and RNA was extracted according to

ESCs were lysed with Trizol reagent (Life technologies) and RNA was extracted according to Supplemental Methods RNA-seq ESCs were lysed with Trizol reagent (Life technologies) and RNA was extracted according to the manufacturer's instructions. RNAse free DNaseI (Sigma) was used to eliminate

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Confirmation of Dnmt1 conditional knockout out mice. a, Representative images of sorted stem (Lin - CD49f high CD24 + ), luminal (Lin - CD49f low CD24 + )

More information

Nature Immunology: doi: /ni Supplementary Figure 1 33,312. Aire rep 1. Aire rep 2 # 44,325 # 44,055. Aire rep 1. Aire rep 2.

Nature Immunology: doi: /ni Supplementary Figure 1 33,312. Aire rep 1. Aire rep 2 # 44,325 # 44,055. Aire rep 1. Aire rep 2. a 33,312 b rep 1 rep 1 # 44,325 rep 2 # 44,055 [0-84] rep 2 [0-84] 1810043G02Rik Pfkl Dnmt3l Icosl rep 1 [0-165] rep 2 [0-165] Rps14 Cd74 Mir5107 Tcof1 rep 1 [0-69] rep 2 [0-68] Id3 E2f2 Asap3 rep 1 [0-141]

More information

Sequence and chromatin determinants of cell-type specific transcription factor binding

Sequence and chromatin determinants of cell-type specific transcription factor binding Method Sequence and chromatin determinants of cell-type specific transcription factor binding Aaron Arvey, 1 Phaedra Agius, 1 William Stafford Noble, 2 and Christina Leslie 1,3 1 Computational Biology

More information

Supplementary Information. PRC2 overexpression and PRC2-target gene repression relating to poorer prognosis in small

Supplementary Information. PRC2 overexpression and PRC2-target gene repression relating to poorer prognosis in small Sato T, et al. Supplementary Information PRC2 overexpression and PRC2-target gene repression relating to poorer prognosis in small cell lung cancer Teruyuki Sato 1,2, Atsushi Kaneda 1,3,4 *, Shingo Tsuji

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets. Supplementary Figure 1 Transcriptional program of the TE and MP CD8 + T cell subsets. (a) Comparison of gene expression of TE and MP CD8 + T cell subsets by microarray. Genes that are 1.5-fold upregulated

More information

Comprehensive nucleosome mapping of the human genome in cancer progression

Comprehensive nucleosome mapping of the human genome in cancer progression /, Vol. 7, No. 12 Comprehensive nucleosome mapping of the human genome in cancer progression Brooke R. Druliner 1,5, Daniel Vera 1,6, Ruth Johnson 2, Xiaoyang Ruan 3, Lynn M. Apone 4, Eileen T. Dimalanta

More information

Coding probability Ratio of sequences with ORF <= Size Size. Expression Levels by Class

Coding probability Ratio of sequences with ORF <= Size Size. Expression Levels by Class Supplementary Figure S Characterization of unannotated transcripts. (A) CPAT coding probability scores and (B) cummulative distribution of ORF length are plotted for each category of protein-coding genes,

More information

Table S1. Total and mapped reads produced for each ChIP-seq sample

Table S1. Total and mapped reads produced for each ChIP-seq sample Tale S1. Total and mapped reads produced for each ChIP-seq sample Sample Total Reads Mapped Reads Col- H3K27me3 rep1 125662 1334323 (85.76%) Col- H3K27me3 rep2 9176437 7986731 (87.4%) atmi1a//c H3K27m3

More information

Transcription factor p63 bookmarks and regulates dynamic enhancers during epidermal differentiation

Transcription factor p63 bookmarks and regulates dynamic enhancers during epidermal differentiation Published online: June 1, 15 Article Transcription factor p63 bookmarks and regulates dynamic enhancers during epidermal differentiation Evelyn N Kouwenhoven 1,, Martin Oti, Hanna Niehues 3, Simon J van

More information

Supplementary Information

Supplementary Information Supplementary Information Supplementary Figure 1! a! b! Nfatc1!! Nfatc1"! P1! P2! pa1! pa2! ex1! ex2! exons 3-9! ex1! ex11!!" #" Nfatc1A!!" Nfatc1B! #"!" Nfatc1C! #" DN1! DN2! DN1!!A! #A!!B! #B!!C! #C!!A!

More information

Polymorphisms associated with expression of BPIFA1/BPIFB1 and lung disease severity in cystic fibrosis

Polymorphisms associated with expression of BPIFA1/BPIFB1 and lung disease severity in cystic fibrosis Polymorphisms associated with expression of BPIFA1/BPIFB1 and lung disease severity in cystic fibrosis Aabida Saferali, Ma en Obeidat, Jean-Christophe Bérubé, Maxime Lamontagne, Yohan Bossé, Michel Laviolette,

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality. Supplementary Figure 1 Assessment of sample purity and quality. (a) Hematoxylin and eosin staining of formaldehyde-fixed, paraffin-embedded sections from a human testis biopsy collected concurrently with

More information

Introduction. Introduction

Introduction. Introduction Introduction We are leveraging genome sequencing data from The Cancer Genome Atlas (TCGA) to more accurately define mutated and stable genes and dysregulated metabolic pathways in solid tumors. These efforts

More information

Supplementary Figure 1 IL-27 IL

Supplementary Figure 1 IL-27 IL Tim-3 Supplementary Figure 1 Tc0 49.5 0.6 Tc1 63.5 0.84 Un 49.8 0.16 35.5 0.16 10 4 61.2 5.53 10 3 64.5 5.66 10 2 10 1 10 0 31 2.22 10 0 10 1 10 2 10 3 10 4 IL-10 28.2 1.69 IL-27 Supplementary Figure 1.

More information

H3K4 demethylase KDM5B regulates global dynamics of transcription elongation and alternative splicing in embryonic stem cells

H3K4 demethylase KDM5B regulates global dynamics of transcription elongation and alternative splicing in embryonic stem cells Nucleic Acids Research, 2017 1 doi: 10.1093/nar/gkx251 H3K4 demethylase KDM5B regulates global dynamics of transcription elongation and alternative splicing in embryonic stem cells Runsheng He 1,2 and

More information

SUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation.

SUPPLEMENTARY INFORMATION. Intron retention is a widespread mechanism of tumor suppressor inactivation. SUPPLEMENTARY INFORMATION Intron retention is a widespread mechanism of tumor suppressor inactivation. Hyunchul Jung 1,2,3, Donghoon Lee 1,4, Jongkeun Lee 1,5, Donghyun Park 2,6, Yeon Jeong Kim 2,6, Woong-Yang

More information

gliomas. Fetal brain expected who each low-

gliomas. Fetal brain expected who each low- Supplementary Figure S1. Grade-specificity aberrant expression of HOXA genes in gliomas. (A) Representative RT-PCR analyses of HOXA gene expression in human astrocytomas. Exemplified glioma samples include

More information

EPIGENOMICS PROFILING SERVICES

EPIGENOMICS PROFILING SERVICES EPIGENOMICS PROFILING SERVICES Chromatin analysis DNA methylation analysis RNA-seq analysis Diagenode helps you uncover the mysteries of epigenetics PAGE 3 Integrative epigenomics analysis DNA methylation

More information

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. Supplementary Figure 1 Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes. (a,b) Values of coefficients associated with genomic features, separately

More information

MSI positive MSI negative

MSI positive MSI negative Pritchard et al. 2014 Supplementary Figure 1 MSI positive MSI negative Hypermutated Median: 673 Average: 659.2 Non-Hypermutated Median: 37.5 Average: 43.6 Supplementary Figure 1: Somatic Mutation Burden

More information

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region methylation in relation to PSS and fetal coupling. A, PSS values for participants whose placentas showed low,

More information

Transcription factor Foxp3 and its protein partners form a complex regulatory network

Transcription factor Foxp3 and its protein partners form a complex regulatory network Supplementary figures Resource Paper Transcription factor Foxp3 and its protein partners form a complex regulatory network Dipayan Rudra 1, Paul deroos 1, Ashutosh Chaudhry 1, Rachel Niec 1, Aaron Arvey

More information

Plasma-Seq conducted with blood from male individuals without cancer.

Plasma-Seq conducted with blood from male individuals without cancer. Supplementary Figures Supplementary Figure 1 Plasma-Seq conducted with blood from male individuals without cancer. Copy number patterns established from plasma samples of male individuals without cancer

More information

MIR retrotransposon sequences provide insulators to the human genome

MIR retrotransposon sequences provide insulators to the human genome Supplementary Information: MIR retrotransposon sequences provide insulators to the human genome Jianrong Wang, Cristina Vicente-García, Davide Seruggia, Eduardo Moltó, Ana Fernandez- Miñán, Ana Neto, Elbert

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Immunofluorescence (IF) confirms absence of H3K9me in met-2 set-25 worms.

Nature Genetics: doi: /ng Supplementary Figure 1. Immunofluorescence (IF) confirms absence of H3K9me in met-2 set-25 worms. Supplementary Figure 1 Immunofluorescence (IF) confirms absence of H3K9me in met-2 set-25 worms. IF images of wild-type (wt) and met-2 set-25 worms showing the loss of H3K9me2/me3 at the indicated developmental

More information

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD,

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Crawford, GE, Ren, B (2007) Distinct and predictive chromatin

More information

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features Tyler Derr @ Yue Lab tsd5037@psu.edu Background Hi-C is a chromosome conformation capture (3C)

More information

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Whole Genome and Transcriptome Analysis of Anaplastic Meningioma Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Outline Anaplastic meningioma compared to other cancers Whole genomes

More information

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. SUPPLEMENTAL FIGURE AND TABLE LEGENDS Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. A) Cirbp mrna expression levels in various mouse tissues collected around the clock

More information

Table S1. Primer sequences used for qrt-pcr. CACCATTGGCAATGAGCGGTTC AGGTCTTTGCGGATGTCCACGT ACTB AAGTCCATGTGCTGGCAGCACT ATCACCACTCCGAAGTCCGTCT LCOR

Table S1. Primer sequences used for qrt-pcr. CACCATTGGCAATGAGCGGTTC AGGTCTTTGCGGATGTCCACGT ACTB AAGTCCATGTGCTGGCAGCACT ATCACCACTCCGAAGTCCGTCT LCOR Table S1. Primer sequences used for qrt-pcr. ACTB LCOR KLF6 CTBP1 CDKN1A CDH1 ATF3 PLAU MMP9 TFPI2 CACCATTGGCAATGAGCGGTTC AGGTCTTTGCGGATGTCCACGT AAGTCCATGTGCTGGCAGCACT ATCACCACTCCGAAGTCCGTCT CGGCTGCAGGAAAGTTTACA

More information

Accel-Amplicon Panels

Accel-Amplicon Panels Accel-Amplicon Panels Amplicon sequencing has emerged as a reliable, cost-effective method for ultra-deep targeted sequencing. This highly adaptable approach is especially applicable for in-depth interrogation

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi: 1.138/nature89 IFN- (ng ml ) 5 4 3 1 Splenocytes NS IFN- (ng ml ) 6 4 Lymph node cells NS Nfkbiz / Nfkbiz / Nfkbiz / Nfkbiz / IL- (ng ml ) 3 1 Splenocytes IL- (ng ml ) 1 8 6 4 *** ** Lymph node cells

More information

Recurrent noncoding regulatory mutations in pancreatic ductal adenocarcinoma

Recurrent noncoding regulatory mutations in pancreatic ductal adenocarcinoma Recurrent noncoding regulatory mutations in pancreatic ductal adenocarcinoma Michael E Feigin 1,2,22, Tyler Garvin 3,22, Peter Bailey 4, Nicola Waddell 5,6, David K Chang 4,7 9, David R Kelley 1, Shimin

More information

Supplementary Information

Supplementary Information Supplementary Information mediates STAT3 activation at retromer-positive structures to promote colitis and colitis-associated carcinogenesis Zhang et al. a b d e g h Rel. Luc. Act. Rel. mrna Rel. mrna

More information

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11

p.r623c p.p976l p.d2847fs p.t2671 p.d2847fs p.r2922w p.r2370h p.c1201y p.a868v p.s952* RING_C BP PHD Cbp HAT_KAT11 ARID2 p.r623c KMT2D p.v650fs p.p976l p.r2922w p.l1212r p.d1400h DNA binding RFX DNA binding Zinc finger KMT2C p.a51s p.d372v p.c1103* p.d2847fs p.t2671 p.d2847fs p.r4586h PHD/ RING DHHC/ PHD PHD FYR N

More information

Supplemental Material for:

Supplemental Material for: Supplemental Material for: Transcriptional silencing of γ-globin by BCL11A involves long-range interactions and cooperation with SOX6 Jian Xu, Vijay G. Sankaran, Min Ni, Tobias F. Menne, Rishi V. Puram,

More information

RNA-seq Introduction

RNA-seq Introduction RNA-seq Introduction DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different functional RNAs Which RNAs (and sometimes then translated

More information

Protein SD Units (P-value) Cluster order

Protein SD Units (P-value) Cluster order SUPPLEMENTAL TABLE AND FIGURES Table S1. Signature Phosphoproteome of CD22 E12 Transgenic Mouse BPL Cells. T-test vs. Other Protein SD Units (P-value) Cluster order ATPase (Ab-16) 1.41 0.000880 1 mtor

More information

cis-regulatory enrichment analysis in human, mouse and fly

cis-regulatory enrichment analysis in human, mouse and fly cis-regulatory enrichment analysis in human, mouse and fly Zeynep Kalender Atak, PhD Laboratory of Computational Biology VIB-KU Leuven Center for Brain & Disease Research Laboratory of Computational Biology

More information

RNA SEQUENCING AND DATA ANALYSIS

RNA SEQUENCING AND DATA ANALYSIS RNA SEQUENCING AND DATA ANALYSIS Length of mrna transcripts in the human genome 5,000 5,000 4,000 3,000 2,000 4,000 1,000 0 0 200 400 600 800 3,000 2,000 1,000 0 0 2,000 4,000 6,000 8,000 10,000 Length

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature10866 a b 1 2 3 4 5 6 7 Match No Match 1 2 3 4 5 6 7 Turcan et al. Supplementary Fig.1 Concepts mapping H3K27 targets in EF CBX8 targets in EF H3K27 targets in ES SUZ12 targets in ES

More information

Data mining with Ensembl Biomart. Stéphanie Le Gras

Data mining with Ensembl Biomart. Stéphanie Le Gras Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:

More information

Seq-ing Answers to Questions of Transcriptional Regulation and Epigenetics. Theodore J. Perkins OttBUG Seminar Nov.

Seq-ing Answers to Questions of Transcriptional Regulation and Epigenetics. Theodore J. Perkins   OttBUG Seminar Nov. Seq-ing Answers to Questions of Transcriptional Regulation and Epigenetics Theodore J. Perkins www.perkinslab.ca OttBUG Seminar Nov. 7, 2016 Outline ChIP-seq What s it for? How does it work? Two examples

More information

BWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space

BWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space Whole genome sequencing Whole exome sequencing BWA alignment to reference transcriptome and genome Convert transcriptome mappings back to genome space genomes Filter on MQ, distance, Cigar string Annotate

More information

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs ChIP-seq hands-on Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs Main goals Becoming familiar with essential tools and formats Visualizing and contextualizing raw data Understand

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 Supplementary Fig. 1: Quality assessment of formalin-fixed paraffin-embedded (FFPE)-derived DNA and nuclei. (a) Multiplex PCR analysis of unrepaired and repaired bulk FFPE gdna from

More information

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data. Supplementary Figure 1 PCA for ancestry in SNV data. (a) EIGENSTRAT principal-component analysis (PCA) of SNV genotype data on all samples. (b) PCA of only proband SNV genotype data. (c) PCA of SNV genotype

More information

Predictive PP1Ca binding region in BIG3 : 1,228 1,232aa (-KAVSF-) HEK293T cells *** *** *** KPL-3C cells - E E2 treatment time (h)

Predictive PP1Ca binding region in BIG3 : 1,228 1,232aa (-KAVSF-) HEK293T cells *** *** *** KPL-3C cells - E E2 treatment time (h) Relative expression ERE-luciferase activity activity (pmole/min) activity (pmole/min) activity (pmole/min) activity (pmole/min) MCF-7 KPL-3C ZR--1 BT-474 T47D HCC15 KPL-1 HBC4 activity (pmole/min) a d

More information

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant

More information

An Unexpected Function of the Prader-Willi Syndrome Imprinting Center in Maternal Imprinting in Mice

An Unexpected Function of the Prader-Willi Syndrome Imprinting Center in Maternal Imprinting in Mice An Unexpected Function of the Prader-Willi Syndrome Imprinting Center in Maternal Imprinting in Mice Mei-Yi Wu 1 *, Ming Jiang 1, Xiaodong Zhai 2, Arthur L. Beaudet 2, Ray-Chang Wu 1 * 1 Department of

More information

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis Jian Xu, Ph.D. Children s Research Institute, UTSW Introduction Outline Overview of genomic and next-gen sequencing technologies

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Design of isolated protein and RNC constructs, and homogeneity of purified RNCs. (a) Schematic depicting the design and nomenclature used for all the isolated proteins and RNCs used

More information

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012 Elucidating Transcriptional Regulation at Multiple Scales Using High-Throughput Sequencing, Data Integration, and Computational Methods Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder

More information

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells CAMDA 2009 October 5, 2009 STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells Guohua Wang 1, Yadong Wang 1, Denan Zhang 1, Mingxiang Teng 1,2, Lang Li 2, and Yunlong Liu 2 Harbin

More information

Peak-calling for ChIP-seq and ATAC-seq

Peak-calling for ChIP-seq and ATAC-seq Peak-calling for ChIP-seq and ATAC-seq Shamith Samarajiwa CRUK Autumn School in Bioinformatics 2017 University of Cambridge Overview Peak-calling: identify enriched (signal) regions in ChIP-seq or ATAC-seq

More information

Sudin Bhattacharya Institute for Integrative Toxicology

Sudin Bhattacharya Institute for Integrative Toxicology Beyond the AHRE: the Role of Epigenomics in Gene Regulation by the AHR (or, Varied Applications of Computational Modeling in Toxicology and Ingredient Safety) Sudin Bhattacharya Institute for Integrative

More information

Polycomb protein Ezh2 regulates pancreatic β-cell Ink4a/Arf expression and regeneration in streptozotocin-induced diabetes mellitus

Polycomb protein Ezh2 regulates pancreatic β-cell Ink4a/Arf expression and regeneration in streptozotocin-induced diabetes mellitus Chen et al. 1 Polycomb protein Ezh2 regulates pancreatic β-cell Ink4a/Arf expression and regeneration in streptozotocin-induced diabetes mellitus Hainan Chen 1, Xueying Gu 1, I-hsin Su 2, Rita Bottino

More information

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc. Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag

More information

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory Computational aspects of ChIP-seq John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory ChIP-seq Using highthroughput sequencing to investigate DNA

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Pan-cancer analysis of global and local DNA methylation variation a) Variations in global DNA methylation are shown as measured by averaging the genome-wide

More information

(A) Cells grown in monolayer were fixed and stained for surfactant protein-c (SPC,

(A) Cells grown in monolayer were fixed and stained for surfactant protein-c (SPC, Supplemental Figure Legends Figure S1. Cell line characterization (A) Cells grown in monolayer were fixed and stained for surfactant protein-c (SPC, green) and co-stained with DAPI to visualize the nuclei.

More information

Cluster Dendrogram. dist(cor(na.omit(tss.exprs.chip[, c(1:10, 24, 27, 30, 48:50, dist(cor(na.omit(tss.exprs.chip[, c(1:99, 103, 104, 109, 110,

Cluster Dendrogram. dist(cor(na.omit(tss.exprs.chip[, c(1:10, 24, 27, 30, 48:50, dist(cor(na.omit(tss.exprs.chip[, c(1:99, 103, 104, 109, 110, A Transcriptome (RNA-seq) Transcriptome (RNA-seq) 3. 2.5 2..5..5...5..5 2. 2.5 3. 2.5 2..5..5...5..5 2. 2.5 Cluster Dendrogram RS_ES3.2 RS_ES3. RS_SHS5.2 RS_SHS5. PS_SHS5.2 PS_SHS5. RS_LJ3 PS_LJ3..4 _SHS5.2

More information

DDR2 controls the epithelial-mesenchymal-transition-related gene expression via c-myb acetylation upon matrix stiffening

DDR2 controls the epithelial-mesenchymal-transition-related gene expression via c-myb acetylation upon matrix stiffening Supplementary information controls the epithelialmesenchymaltransitionrelated gene expression via cmyb acetylation upon matrix stiffening Daehwan Kim, Eunae You, Jangho Jeong, Panseon Ko, JungWoong Kim

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:.38/nature8975 SUPPLEMENTAL TEXT Unique association of HOTAIR with patient outcome To determine whether the expression of other HOX lincrnas in addition to HOTAIR can predict patient outcome, we measured

More information

The epigenetic landscape of T cell subsets in SLE identifies known and potential novel drivers of the autoimmune response

The epigenetic landscape of T cell subsets in SLE identifies known and potential novel drivers of the autoimmune response Abstract # 319030 Poster # F.9 The epigenetic landscape of T cell subsets in SLE identifies known and potential novel drivers of the autoimmune response Jozsef Karman, Brian Johnston, Sofija Miljovska,

More information

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford Lecture 8 Understanding Transcription RNA-seq analysis Foundations of Computational Systems Biology David K. Gifford 1 Lecture 8 RNA-seq Analysis RNA-seq principles How can we characterize mrna isoform

More information

underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The

underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The Supplementary Figures Figure S1. Patient cohorts and study design. To define and interrogate the genetic alterations underlying metastasis and recurrence in HNSCC, we analyzed two groups of patients. The

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information