ShinyGO: a web application for in-depth analysis of gene sets

Size: px
Start display at page:

Download "ShinyGO: a web application for in-depth analysis of gene sets"

Transcription

1 ShinyGO: a web application for in-depth analysis of gene sets Steven Xijin Ge 1,*, and Dongmin Jung 1,2 1 Department of Mathematics and Statistics, Box 2225, Brookings, SD, USA Avison Biomedical Research Center, Yonsei University, Seoul, South Korea Table 1 compares ShinyGO ( ) with several other tools for enrichment analysis using gene lists. Methods To enable gene ID conversion, we downloaded all available gene ID mappings from Ensembl. The final mapping table for the current ShinyGO v0.4 release consists of 135,832,098 rows, mapping various gene IDs, including DNA microarray probe names, into Ensembl gene IDs. Enrichment analysis is calculated based on hypergeometric distribution followed by false discovery rate (FDR) correction. Background gene-sets are all protein-coding genes in the genome. As many of the enriched GO terms are related or redundant (i.e., cell cycle and cell cycle process ), we provide two plots to summarize such correlation following a method developed in [2]. We first measure the distance among two gene-sets by 1 N i / N u, where N i and N u is the number of Table 1. Comparison of selected enrichment analysis tools. #Organisms Custom Background Enrichr 2 No [1] GOrilla 8 No [3] PlantGSEA 15 No [5] Panther 112 No [7] STRING 2031 No [8] DAVID 65,000 Yes [10] g:profiler 208 No [12] Reference ShinyGO** 208 No Present study ShinyGO new features: visualization of overlapping gene sets, gene characteristics plots (gene length, GC content, chromosomal location), KEGG pathway diagram, protein-protein interaction (PPI) network. genes in the intersect and the union of the two sets, respectively. The distance matrix is used to construct a hierarchical clustering tree using average linkage, and to construct a network of GO terms using a cutoff of 0.05 overlap ratio [2]. To identify enriched TF binding motifs, transcript annotation and promoter sequences are retrieved from Ensembl. For genes with multiple transcripts, the transcription start site (TSS) with multiple transcripts is used. If multiple TSS locations have the same number of transcripts, then the most upstream TSS is used. Promoters are scanned using TF binding motifs in CIS-BP [4]. Instead of defining a binary outcome of binding or not binding, which depends on arbitrary cutoffs, we recorded the best score for each of the TFs in every 300bp and 600bp promoter sequences. Then student s t-test is used to compare the scores observed in a group of genes against the rest of genes. The P-values are corrected for multiple testing using false discovery rate (FDR). Use case As an example, we analyzed a set of 149 genes (Table 2) up-regulated in lymphoblasts cells (TK6, WTK1, and NH32) treated with ionizing radiation [6]. This gene list is available on the MSigDB [9] website [11]. The 149 human genes are mapped to 147 Ensembl gene IDs for enrichment analysis, as suggested by the mapping information is available at the Genes tab. Below, we will use a large collection of human gene-sets (Table 3) to investigate this list. GO Biological Process Using Gene Ontology (GO) Biological Process for gene-sets, we get an enrichment results as Table 4. The top terms are related with positive regulation of cellular metabolic process, response to stress, apoptosis, etc. These terms, many of which are related, are ranked by FDR in the table. A more organized presentation of these terms is shown in Figure 1, where related terms are grouped together. For example, several apoptosis related terms are grouped in the branch at the bottom of Figure 1. The most significant groups of terms are related to nitrogen metabolism. Ionizing radiation induces reactive oxygen and nitrogen species, which might activate signaling pathways in response to DNA 1

2 damage[13]. As shown in Figure 1, other groups of terms are related to regulation of biosynthesis, response to stimulus, and apoptosis. Many of these processes are known to underpinning cellular response to ionizing radiation [14]. GO Cellular Component When switching to GO cellular component, we can detect that this list is overrepresented with 61 (41%) nuclear proteins (Table 5 and Figure 2) (FDR < , hypergeometric test). Several small and highly specific functional categories are also identified. For example, 4 out of 7 proteins involved in the I-κB/NF-κB complex are included in the gene list (FDR < ). As shown in Table 5, the 4 proteins are RELA NFKB2 NFKBIA NFKB1, which is available in the downloaded enrichment results. The I-κB/NF-κB complex plays important roles in immune response [15]. This list also contains 6 out of the 42 proteins in the Cyclin-dependent protein kinase holoenzyme complex (FDR < ). Both proteins in the PCNA-p21 complex are included in the list (FDR < ). Downstream of p53 signaling pathways, the interaction of p21 and PCNA plays a role in regulating DNA cell cycle after DNA damage [16]. Enriched terms can also be displayed as connected networks. For example, Figure 4. The network of enriched GO Cellular Component terms. It shows a big cluster of interconnected terms related to chromatin and nuclear, and a group of 3 terms related to membrane and cell surfaces. GO Molecular Function Using GO molecular function (Table 6 and Figure 3), ShinyGO reveals that 40 (27%) of the 147 genes have DNA binding transcription factor activity (FDR < ). This list contains many transcription factors such as JUNB, NFKB1, STAT1, MYC and so on, which give rise to many of the terms in the big branch in the lower side of Figure 3. Other less significant terms include kinase binding, and cytokine receptor binding. KEGG pathways Using KEGG pathways, we can detect overrepresentation of genes in cancer pathways with FDR < (Table 7). Thirty-two genes in the lists are related to tumorigenesis (Figure 5). Other significant pathways include the P53 signaling pathway, for DNA damage response, and TNF and NF-κB signaling pathways for immune response. ShinyGO retrieves the pathway diagrams from KEGG web server and highlights the user s genes (Figure 6). Transcription Factor target genes To investigate whether the 149 genes can be regulated by common transcription factors (TF)s, we choose the TF.Target gene sets. These are verified or predicted TF target genes compiled from various sources, including RegNetwork [17], CircuitsDB [18], TRED [19], ENCODE [20], and TRRUST[21]. As shown Tale 8, the most significantly enriched TF is p53, which is represented by multiple gene-sets from different databases. Among the 147 genes, 27 (18%) are target genes of p53 (FDR < ), which play a critical role in cellular response to ionizing radiation[16]. Other significant TFs are NFKB and RELA, which probably mediate the immune response via the Rela/NF-κB pathway[15]. Consistent with this enrichment, NFKB1, NFKB2, and RELA are included in the 147 query genes. Other TFs with enriched target genes include SP1 and BRCA1. These enrichment results are organized in Figure 7. The 4 gene-sets related to p53 target genes are grouped together. The 12 gene-sets of Rela/NF-κB also form a bigger branch on the tree. Another groups of TFs at the bottom of the tree includes SP1, as well as FOS, and JUN. Taking advantage of a large collection of transcription factor target genes, ShinyGO can help generate hypothesis on gene regulation. microrna target genes As shown in Table 10, 13 of the 147 genes are target genes of mir-145, with FDR < Previous studies have shown that mir-145 is involved in DNA damage repair, and is regulated by p53 [22]. As shown in Figure 8, two gene-sets are related to mir-145. mir-21 target genes are also overrepresented in the 147 genes (FDR < ). mir-21 has also been shown to be involved in DNA damage repair [23, 24], probably by targeting MSH2, a mismatch repair gene. Other micrornas in Figure 8, are less significant but may also be further investigated. For example, mir- 146 might be regulated by p53 as well [25]. Gene characteristics ShinyGO can also compare the list of 147 genes with the rest of the genes from several aspects. Figure 9A shows these genes are distributed randomly on the chromosomes (Chi-squared test, P=0.97). A detailed genomic location 2

3 map is shown in Figure 11. Figure 9C indicates that the genes are all protein-coding. These genes seem to have less exons (Chi-squared test, P= 0.023) and more transcript isoforms (Chi-squared test, P= ) than other proteincoding genes (Figure 9B and D). The distribution of the length of various gene features are shown in Figure 10. The list of 147 genes have similar lengths for coding sequences, transcripts, and genomic span, and 3 UTR (untranslated region). Their 5 UTR are slightly longer than the rest of coding-genes (T-test, P = 0.026, Figure 10B). Plotting and t- test on these gene lengths are done on a log-scale, as the transformed data are closer to normal distribution. The GC content of the genes are also similar to other genes. STRING API The STRING API recognized 138 (94%) of the 147 genes. In addition to enrichment analysis based on GO and KEGG, STRING also offers enrichment of protein domains using Pfam and InterPro databases. As shown in Table 11. Enriched Pfam domains in the query genes.table 11, this list is overrepresented with 5 proteins with Cyclin, N- terminal domain, and 5 proteins with Helix-loop-helix DNA-binding domain. A protein-protein interaction (PPI) network can be retrieved for all or some of the genes in the list. Figure 12 shows the network for 50 genes, indicating the interactions of NFKBIA, RELA and other factors underlying the immune response. This network is also richly annotated with links to original studies and even protein structures. In conclusion, using ShinyGO, users can easily perform in-depth analysis of gene lists and generating novel hypothesis for further study. Table 2. Genes upregulated by radiation in lymphoblasts [6]. ADORA3 AHR AKAP5 ARL2BP ATF5 AXL BAI3 BATF BBC3 BIRC3 BTG1 BTG2 CARTPT CASP1 CCL18 CCL20 CCL21 CCL3 CCL4 CCNE2 CCNF CCNG1 CCNG2 CCNI CCNK CCR1 CCT4 CCT7 CD164 CD59 CD79A CD80 CD83 CDK1 CDK6 CDKN1A CDKN1C CFLAR CGRRF1 CLK1 COL4A2 COL6A1 COX7B CRAT CXCL10 DDB2 DDIT3 DKC1 DLX2 DPYSL3 DUSP6 ENC1 ENO1 EP300 FAM134C FAS FCN3 FGFR1 FN1 GDF15 GRN HBEGF HOXC5 IFITM1 IFNGR1 IL2RB IRF4 IRF9 ITGA2 JAK2 JUNB KIF20B KLF4 KLRC2 KRT14 KRT19 KRT8 LGALS3 LZTR1 MAP7 MAX MDFI MED21 MSC MSH2 MYC NCOA3 NDUFB5 NFE2L1 NFKB1 NFKB2 NFKBIA NFKBIE NR4A3 PCNA PKD2L1 PLAGL2 PLAU PLCG2 PLK1 PLOD3 POU3F4 POU5F1 PPM1D PRMT1 PTN PTPN11 PTPN22 PTPN6 PTTG1 RABGAP1 RAD50 RANBP1 RB1 RELA RFX5 ROCK1 RORA RPS6KA1 RRM2B SGK1 SIPA1 SMAD3 SNAPC2 STAT1 STAT5A STK4 TANK TCF21 TFDP2 TGFBR3 TGIF1 TLE3 TNFAIP3 TNFAIP8 TNFRSF8 TNFSF10 TP53I3 TP53TG1 TRAF1 TRRAP TSC22D3 UBE2C VEGFC XPC ZBTB48 ZNF141 ZNF274 ZNF85 3

4 Table 3. Gene Set databases for enrichment analysis in human. Type Subtype/Database name #GeneSets Source Ref. Gene Ontology Biological Process(BP) Ensembl 91 [26] Cellular Component (CC) 1581 Ensembl 91 Molecular Function (MF) 3191 Ensembl 91 KEGG KEGG 321 Release 84.1 [27] MSigDB.Computational Computational gene sets 858 MSigDB 6.1 [9] MSigDB.Curated REACTOME 674 MSigDB 6.1 BIOCARTA 217 MSigDB 6.1 PID 196 MSigDB 6.1 KEGG 186 MSigDB 6.1 Literature 3465 MSigDB 6.1 MSigDB.Hallmark hallmark 50 MSigDB 6.1 MSigDB.Immune Immune system 4872 MSigDB 6.1 MSigDB.Location Cytogenetic band 326 MSigDB 6.1 MSigDB.Motif TF and mirna Motifs 836 MSigDB 6.1 MSigDB.Oncogenic Oncogenic signatures 189 MSigDB 6.1 GeneSetDB.Pathways Biocarta 197 GeneSetDB [28] EHMN 55 GeneSetDB HumanCyc 59 GeneSetDB KEGG(Disease) 81 GeneSetDB NetPath 25 GeneSetDB Reactome 776 GeneSetDB WikiPathways 152 GeneSetDB GeneSetDB.Drug CTD 1047 GeneSetDB DrugBank 176 GeneSetDB MATADOR 266 GeneSetDB SIDER 473 GeneSetDB SMPDB 81 GeneSetDB GeneSetDB.Other CancerGenes 23 GeneSetDB HPO 1639 GeneSetDB INOH 76 GeneSetDB MethCancerDB 21 GeneSetDB MethyCancer 54 GeneSetDB MPO 3134 GeneSetDB PID 195 GeneSetDB STITCH 4616 GeneSetDB T3DB 846 GeneSetDB TF.Target TFactS 109 GeneSetDB RegNetwork 1041 v.2015 [17] CircuitsDB 849 V.2012 [18] TRED 131 tftargets [19] ITFP 1974 tftargets [29] Neph tftargets [30] Marbach tftargets [31] ENCODE 157 tftargets [20] TRRUST 798 tftargets [21] mirna.targets MicroCosm 44 GeneSetDB [28] mirtarbase 62 GeneSetDB mirdb 2588 V 5.0 [32] TargetScan 583 V 7.1 [33] mirtarbase 2599 V 7.0 [34] RegNetwork 618 V [17] CircuitsDB 140 V [18] Total: 72,394 4

5 Table 4. Enriched GO Biological Process terms for the 147 genes induced by radiation. FDR Genes in list Total genes Functional Category Genes (first 6) 1.52E Positive regulation of cellular metabolic process NFE2L1 CCNK AHR CCL20 GDF15 CCL4 1.52E Positive regulation of nitrogen compound metabolic process NFE2L1 CCNK AHR CCL20 GDF15 CCL4 2.1E Negative regulation of cellular process BIRC3 NFKB2 MSH2 MDFI TNFRSF8 TSC22D3 4.86E Positive regulation of macromolecule metabolic process NFE2L1 CCNK AHR CCL20 GDF15 CCL4 1.38E Positive regulation of metabolic process NFE2L1 CCNK AHR CCL20 GDF15 CCL4 1.39E Response to stress FAS NFKB2 CD59 MSH2 JAK2 STK4 1.63E Response to organic substance FAS NFKB2 CCL20 TNFRSF8 NCOA3 GDF E Programmed cell death CFLAR BIRC3 MSH2 JAK2 TNFRSF8 PLAGL2 7.54E Cellular response to chemical stimulus CCL20 NCOA3 GDF15 CCL21 DUSP6 JUNB 1.22E Cell death CFLAR BIRC3 MSH2 JAK2 TNFRSF8 PLAGL2 1.62E Regulation of programmed cell death BIRC3 JAK2 GDF15 TSC22D3 DDIT3 CFLAR 3.56E Apoptotic process CFLAR BIRC3 MSH2 JAK2 TNFRSF8 PLAGL2 4.04E System development MSH2 JAK2 CD79A SGK1 TNFRSF8 DUSP6 4.21E Regulation of cell death BIRC3 JAK2 GDF15 TSC22D3 JUNB DDIT3 4.96E Regulation of apoptotic process BIRC3 JAK2 GDF15 TSC22D3 DDIT3 CFLAR 5.15E Regulation of cellular biosynthetic process NFKB2 NFE2L1 CCNK AHR MDFI TFDP2 5.33E Regulation of macromolecule biosynthetic process NFKB2 NFE2L1 CCNK AHR MDFI TFDP2 1.18E Regulation of biosynthetic process NFKB2 NFE2L1 CCNK AHR MDFI TFDP2 1.28E Regulation of multicellular organismal process CD83 TNFRSF8 DUSP6 VEGFC FGFR1 CDK6 1.48E Cellular response to organic substance CCL20 NCOA3 GDF15 CCL21 DUSP6 JUNB 1.49E Response to chemical FAS NFKB2 CCL20 TNFRSF8 NCOA3 GDF E Animal organ development JAK2 CD79A DUSP6 TGFBR3 CDK6 PRMT1 3.45E Positive regulation of multicellular organismal process VEGFC FGFR1 TNFRSF8 IRF4 CASP1 CARTPT 4.03E Regulation of nucleobase-containing compound metabolic process NFKB2 NFE2L1 CCNK MSH2 AHR MDFI 4.03E Regulation of molecular function RABGAP1 BIRC3 CCNK CCL20 CCL3 CCL4 4.18E Cell surface receptor signaling pathway CD79A CCL20 GDF15 CCL21 DUSP6 VEGFC 1.59E Cell proliferation FAS JAK2 HBEGF VEGFC JUNB FGFR1 2.25E Immune system development MSH2 JAK2 CD79A CDK6 PRMT1 AXL 2.91E Response to external stimulus FAS CD59 PKD2L1 CCL20 TNFAIP3 TNFRSF8 3.34E Intracellular signal transduction NFKB2 MSH2 JAK2 STK4 NFKB1 CCL20 5

6 Nitrogen metabolism Regulation of Biosynthesis Response to stimulus Apoptosis Figure 1. Hierarchical tree summarizing significant GO terms. Bigger dots at the end of branches correspond to more significant FDR values, which is printed in front of the terms. Terms sharing more genes are grouped together. 6

7 Table 5. Enriched GO Cellular Component terms. FDR Genes in list Total genes Functional Category Genes (first 6) 7.26E Nucleoplasm BIRC3 FAS RRM2B CCNK JAK2 EP E Protein complex CD79A COX7B CASP1 KIF20B XPC RELA 3.68E Nuclear lumen RAD50 JUNB BIRC3 FAS RRM2B FGFR1 1.25E Nuclear part AHR RAD50 XPC JUNB BIRC3 FAS 2.60E I-kappaB/NF-kappaB complex RELA NFKB2 NFKBIA NFKB1 9.44E Cyclin-dependent protein kinase holoenzyme complex CDK6 CDKN1A RB1 CDK1 CCNE2 CCNK 9.44E Transcription factor complex TFDP2 MAX RB1 SMAD3 JUNB RELA 5.26E Nuclear chromosome part RAD50 JUNB MSH2 STAT1 NCOA3 PCNA 9.52E Nuclear chromosome RAD50 JUNB MSH2 STAT1 NCOA3 PCNA 1.82E Nuclear chromatin RAD50 JUNB STAT1 NCOA3 KLF4 SMAD3 1.82E External side of plasma membrane CD79A TGFBR3 CD59 CCR1 FAS IL2RB 7.54E PCNA-p21 complex CDKN1A PCNA 7.99E Serine/threonine protein kinase complex CDK6 CDKN1A RB1 CDK1 CCNE2 CCNK 1.30E Chromatin RAD50 JUNB STAT1 NCOA3 KLF4 SMAD3 1.45E Protein kinase complex CDK6 CDKN1A RB1 CDK1 CCNE2 CCNK 1.71E Chromosomal part RAD50 JUNB MSH2 STAT1 NCOA3 PCNA 1.71E Nuclear transcription factor complex TFDP2 MAX RB1 JUNB DDIT3 TRRAP 2.49E Cell surface CD79A FAS TGFBR3 ENO1 CD59 HBEGF 3.59E Side of membrane JAK2 CD79A TGFBR3 CD59 PTPN22 CCR1 3.59E Transferase complex CDK6 CDKN1A MAX PRMT1 DDB2 RB1 3.72E RNA polymerase II transcription factor complex TFDP2 MAX RB1 JUNB DDIT3 TRRAP 3.91E Transcription factor AP-1 complex JUNB DDIT3 3.91E Receptor complex CD79A AHR TGFBR3 FGFR1 ITGA2 KLRC2 3.95E Centrosome RANBP1 ARL2BP CDK6 CCT4 PCNA KIF20B 5.13E Chromosome RAD50 JUNB MSH2 STAT1 NCOA3 PCNA 5.13E CD95 death-inducing signaling complex FAS CFLAR 5.29E Extracellular space CCL20 CCL21 VEGFC CXCL10 CCL4 CCL E Zona pellucida receptor complex CCT4 CCT7 1.09E Death-inducing signaling complex CFLAR FAS 1.47E Chaperonin-containing T-complex CCT4 CCT7 7

8 Figure 2. Enriched GO Cellular Component terms. 8

9 Table 6. Enriched GO Molecular Function Terms. Enrichment Genes in Total Functional Category FDR list genes 2.90E DNA binding transcription factor activity 2.90E Transcription regulator activity 3.66E RNA polymerase II transcription factor activity, sequence-specific DNA binding 3.66E Double-stranded DNA binding 6.40E Regulatory region DNA binding 6.40E Regulatory region nucleic acid binding 6.40E Transcription regulatory region DNA binding 1.90E Sequence-specific DNA binding 1.64E Sequence-specific double-stranded DNA binding 3.19E Transcription regulatory region sequence-specific DNA binding 1.00E RNA polymerase II regulatory region sequence-specific DNA binding 1.01E RNA polymerase II regulatory region DNA binding 1.51E Transcriptional activator activity, RNA polymerase II transcription regulatory region sequence-specific binding 2.66E Transcription factor binding 7.34E Kinase binding 2.25E Enzyme binding 3.93E Cytokine receptor binding 4.00E Molecular function regulator 9.50E Receptor binding 1.33E Transcription factor activity, protein binding 1.49E DNA binding 5.08E Transcription factor activity, transcription factor binding 5.08E Transcription cofactor activity 6.59E Protein complex binding 9.86E Identical protein binding 2.28E Hormone receptor binding 2.63E Core promoter proximal region sequence-specific DNA binding 2.70E Core promoter proximal region DNA binding 4.81E Transcription factor activity, RNA polymerase II core promoter proximal region sequence-specific binding 1.01E RNA polymerase II core promoter proximal region sequence-specific DNA binding 9

10 Figure 3. Enriched GO Molecular Function terms. 10

11 Figure 4. The network of enriched GO Cellular Component terms. Size of the node represents the number of genes in the geneset. Thickness of the edges indicate the percent of overlapping genes. 11

12 Table 7. Enriched KEGG pathways. FDR Genes in list Total genes Functional Category Genes 8.46E Pathways in cancer CDK6 CDKN1A COL4A2 DDB2 EP300 FGFR1 FN1 BBC3 BIRC3 IFNGR1 FAS IL2RB 2.74E Small cell lung cancer CDK6 CDKN1A COL4A2 DDB2 FN1 BIRC3 ITGA2 MAX MYC NFKB1 NFKBIA RB1 RELA TRAF1 CCNE2 4.55E Measles CDK6 IRF9 BBC3 IFNGR1 FAS IL2RB JAK2 NFKB1 NFKBIA RELA STAT1 STAT5A TNFAIP3 TNFSF10 CCNE2 1.02E Hepatitis B CDK6 CDKN1A DDB2 EP300 FAS SMAD3 MYC NFKB1 NFKBIA PCNA RB1 RELA STAT1 STAT5A CCNE2 6.51E Th17 cell differentiation AHR IFNGR1 IL2RB IRF4 JAK2 SMAD3 NFKB1 NFKBIA NFKBIE RELA RORA STAT1 STAT5A 1.54E P53 signaling pathway CDK6 CDKN1A DDB2 BBC3 FAS RRM2B PPM1D CCNG1 CCNG2 CCNE2 TP53I3 2.61E Kaposi s sarcoma-associated CDK6 CDKN1A IRF9 CCR1 EP300 IFNGR1 FAS JAK2 MYC NFKB1 NFKBIA PLCG2 RB1 RELA STAT1 herpesvirus infection 2.80E Cell cycle CDK6 CDKN1A CDKN1C EP300 SMAD3 MYC PCNA PLK1 RB1 TFDP2 CCNE2 PTTG1 CDK1 4.92E Chronic myeloid leukemia CDK6 CDKN1A DDB2 SMAD3 MYC NFKB1 NFKBIA PTPN11 RB1 RELA STAT5A 8.20E TNF signaling pathway BIRC3 FAS CXCL10 JUNB NFKB1 NFKBIA RELA CCL20 TNFAIP3 TRAF1 VEGFC CFLAR 2.90E NF-kappa B signaling pathway BIRC3 NFKB1 NFKB2 NFKBIA PLAU PLCG2 RELA CCL21 TNFAIP3 TRAF1 CFLAR 6.78E Viral carcinogenesis CDK6 CDKN1A IRF9 EP300 NFKB1 NFKB2 NFKBIA RANBP1 RB1 RELA STAT5A TRAF1 CCNE2 CDK1 1.38E HTLV-I infection CDKN1A EP300 IL2RB SMAD3 MYC NFKB1 NFKB2 NFKBIA PCNA RANBP1 RB1 RELA STAT5A TRRAP PTTG1 2.41E Transcriptional misregulation in CDKN1A DDB2 DDIT3 DUSP6 BIRC3 IL2RB MAX MYC NFKB1 PLAU RELA TRAF1 NR4A3 cancer 5.91E Epstein-Barr virus infection CDKN1A EP300 MYC NFKB1 NFKB2 NFKBIA NFKBIE PLCG2 RB1 RELA TNFAIP3 TRAF1 CDK1 8.00E AGE-RAGE signaling pathway in COL4A2 FN1 JAK2 SMAD3 NFKB1 PLCG2 RELA STAT1 STAT5A VEGFC diabetic complications 1.22E Influenza A IRF9 EP300 IFNGR1 FAS CXCL10 JAK2 NFKB1 NFKBIA RELA STAT1 CASP1 TNFSF E Herpes simplex infection IRF9 EP300 IFNGR1 FAS JAK2 NFKB1 NFKBIA PTPN11 RELA STAT1 TRAF1 CDK1 5.65E Proteoglycans in cancer CDKN1A HBEGF FGFR1 FN1 FAS ITGA2 MYC PLAU PLCG2 PTPN6 PTPN11 ROCK1 5.96E PI3K-Akt signaling pathway CDK6 CDKN1A COL4A2 COL6A1 FGFR1 FN1 IL2RB ITGA2 JAK2 MYC NFKB1 RELA SGK1 VEGFC CCNE2 5.96E Necroptosis IRF9 BIRC3 IFNGR1 FAS JAK2 STAT1 STAT5A TNFAIP3 CASP1 TNFSF10 CFLAR 5.96E Jak-STAT signaling pathway CDKN1A IRF9 EP300 IFNGR1 IL2RB JAK2 MYC PTPN6 PTPN11 STAT1 STAT5A 5.96E Th1 and Th2 cell differentiation IFNGR1 IL2RB JAK2 NFKB1 NFKBIA NFKBIE RELA STAT1 STAT5A 7.99E Prostate cancer CDKN1A EP300 FGFR1 NFKB1 NFKBIA PLAU RB1 RELA CCNE2 9.44E B cell receptor signaling NFKB1 NFKBIA NFKBIE PLCG2 PTPN6 RELA IFITM1 CD79A pathway 1.17E Apoptosis DDIT3 BBC3 BIRC3 FAS NFKB1 NFKBIA RELA TRAF1 TNFSF10 CFLAR 1.19E Human papillomavirus infection CDK6 CDKN1A IRF9 COL4A2 COL6A1 EP300 FN1 FAS ITGA2 NFKB1 RB1 RELA STAT1 CCNE2 1.46E Pancreatic cancer CDK6 CDKN1A DDB2 SMAD3 NFKB1 RB1 RELA STAT1 2.49E MicroRNAs in cancer CDK6 CDKN1A EP300 MYC NFKB1 PLAU PLCG2 ROCK1 CCNG1 CCNE2 4.18E Cellular senescence RAD50 CDK6 CDKN1A SMAD3 MYC NFKB1 RB1 RELA CCNE2 CDK1 12

13 Figure 5. Pathways in cancer from KEGG with query genes highlighted in red. 13

14 Figure 6. The gene list is enriched with genes belong to P53 signaling pathway. 14

15 Table 8. Enriched TF target genes. FDR Genes in list Total genes Functional Category Genes (first 10) 7.08E Tftargets:TF Target TP53 BIRC3 FAS CASP1 CCNG1 CDKN1A CLK1 DDB2 JAK2 LGALS3 MSH2 3.73E TRRUST:TF Target RELA ADORA3 BIRC3 BTG2 CCL20 CCL3 CCL4 CD80 CD83 CDK6 CDKN1A 1.22E RegNetwork:TF Target TP53 CDKN1A CCNG1 TP53I3 RRM2B JAK2 FAS DDB2 BTG2 XPC TP53TG1 3.78E TRRUST:TF Target NFKB1 ADORA3 BIRC3 BTG2 CCL20 CCL3 CCL4 CD80 CD83 CDKN1A CFLAR 4.05E TFactS Rel/NF-kappaB target genes FN1 FAS IRF4 JUNB LGALS3 MYC NFKB1 NFKB2 NFKBIA PLAU 1.88E TRRUST:TF Target TP53 BBC3 BTG2 CASP1 CDK1 CDKN1A DDB2 FAS GDF15 MYC NFKB1 4.36E TFactS SP1 CDKN1A CDKN1C COL4A2 COL6A1 DKC1 HBEGF AHR FN1 FAS IL2RB 4.51E Tftargets:TF Target SP1 FAS TNFRSF8 CDK1 CDKN1A CDKN1C COL4A2 DKC1 FN1 IFNGR1 IL2RB 1.81E TFactS TP53 CDKN1A DDB2 BBC3 FAS MSH2 MYC RRM2B PCNA PLK1 RB1 1.84E RegNetwork:TF Target RELA NFKB2 TRAF1 SMAD3 NFKBIA NFKB1 CXCL10 BIRC3 VEGFC TSC22D3 TANK 1.84E RegNetwork:TF Target BRCA1 PCNA JAK2 DDB2 CDK1 EP300 JUNB MED21 MYC NFKB1 RB1 2.09E Tftargets:TF Target NFKB1 BIRC3 FAS CD80 CD79A FN1 CXCL10 IRF4 JUNB MYC NFKB1 4.95E RegNetwork:TF Target NFKB1 TRAF1 VEGFC SMAD3 PLAU NFKBIA NFKB2 NFKB1 JUNB CXCL10 CD E RegNetwork:TF Target AR SMAD3 RELA POU5F1 PLAU NFKBIA NCOA3 MYC KRT19 FGFR1 CDKN1A 3.69E TRRUST:TF Target SP1 AHR CD83 CDK6 CDKN1A DDB2 FAS FGFR1 HBEGF IFNGR1 IL2RB 5.94E Tftargets:TF Target c-myc TNFRSF8 CDK6 COL4A2 CRAT DDIT3 DKC1 DPYSL3 ENO1 IFNGR1 JUNB 8.53E Tftargets:TF Target NFKB TGIF1 NFKBIA PTTG1 ARL2BP BIRC3 RFX5 TNFAIP3 JUNB STAT5A MSC 3.47E RegNetwork:TF Target JUN KRT8 CDKN1A ZNF274 VEGFC TRAF1 TNFSF10 TNFRSF8 TNFAIP3 TGIF1 STAT1 2.22E RegNetwork:TF Target NFKB2 TRAF1 NFKB2 NFKB1 TANK STAT5A SMAD3 ROCK1 RELA RANBP1 POU3F4 4.16E TFactS GLI2 CDKN1A IRF9 CLK1 IFNGR1 JUNB MYC PCNA IFITM1 CFLAR CCNG2 4.48E TFactS NFKB1 FN1 FAS CXCL10 IRF4 MYC RRM2B PLAU CCL3 CCL4 TRAF1 4.48E RegNetwork:TF Target FOS KRT8 VEGFC TRAF1 TNFAIP3 TGIF1 RABGAP1 PTN NR4A3 LGALS3 KLF4 5.52E TRRUST:TF Target BRCA1 CDKN1A DDB2 DDIT3 EP300 IRF9 JAK2 MYC STAT1 XPC 1.39E Tftargets:TF Target RELA BIRC3 FN1 CXCL10 IRF4 MYC NFKB1 NFKB2 NFKBIA CCL3 CCL E RegNetwork:TF Target IKBKB CFLAR NCOA3 NFKB1 NFKB2 NFKBIA NFKBIE RELA TANK TNFAIP3 2.55E TFactS FOXO1 CDKN1A CDKN1C COL4A2 BIRC3 KRT19 RELA TRAF1 TNFSF10 CFLAR CCNG2 2.64E TFactS E2F1 CDKN1A CDKN1C DDB2 FGFR1 BBC3 MYC PCNA PLK1 RB1 NCOA3 3.41E RegNetwork:TF Target SP1 NFKB2 SMAD3 SIPA1 RORA RELA RABGAP1 POU5F1 NFE2L1 KRT19 IFNGR1 3.77E TRRUST:TF Target RB1 CDK1 CDKN1A DDIT3 FGFR1 MYC PCNA RB1 4.17E Tftargets:TF Target RELA CXCL10 STAT5A NR4A3 CCL20 NFKBIA PTTG1 TNFAIP3 ARL2BP NFKB2 BTG2 15

16 Figure 7. Enriched TF target genes presented as a tree. 16

17 Table 9. Overrepresented TF binding motifs in the promoters of the 147 genes. Note that the NFKB2 and TFDP2 genes themselves are included in this list. Enriched motif in promoter TF TF family P val. FDR Score Note GGGGGGGGGCC PATZ1 C2H2 ZF 4.31E GGGATTTCC REL Rel GGGCGTG KLF7 C2H2 ZF GGCGGGAA E2F4 E2F GTGGGCGTGGC SP6 C2H2 ZF TGCGGG ZBTB1 C2H2 ZF CACGTG TCFL5 bhlh GGGGGCGGGGC SP2 C2H2 ZF GGGGGGT ZIC5 C2H2 ZF GGGCGGGAA E2F6 E2F GGGGATTCCCC NFKB2 Rel * Query GGCCGGAG MBD2 MBD CCCGCATACAACGAA CENPB CENPB CTGACTCAT FOSB bzip CACAGCGGGGGGTC ZIC4 C2H2 ZF GGAAGTGC ZSCAN10 C2H2 ZF GTGGGCGTGG SP8 C2H2 ZF TGTCAGGGGGC INSM1 C2H2 ZF GGGATTTCCCA HIVEP1 C2H2 ZF CTTCCGGGTC NR2C2 Nuclear receptor GCCAATCA PBX3 Homeodomain CAATAGCGGTGGTG ZBTB4 C2H2 ZF TGGGCA HIC2 C2H2 ZF CG KDM2B CxxC AAATGGCGGGAAA TFDP2 DP,E2F * Query AGCATGACTCAT BACH1 bzip TGACTCAGCA NFE2 bzip GATGACTCA BACH2 bzip

18 Table 10. Overrepresented microrna target gene-sets. FDR Genes in list Total genes Functional Category Genes 9.23E MiRTarBase:miRNA Target hsa-mir-145-5p KLF4 CDKN1A STAT1 MYC POU5F1 PPM1D SMAD3 CDK6 PLAGL2 ROCK1 BTG1 DUSP6 HBEGF 5.99E MiRTarBase:miRNA Target hsa-mir-21-5p PLOD3 CDK6 BTG2 FAS TGIF1 NCOA3 MYC TGFBR3 MSH2 TNFAIP3 CCR1 CCL20 NFKB1 TCF21 RB1 RABGAP1 CCNG1 CXCL E MiRTarBase hsa-mir-21 MSH2 MYC TGFBR3 TGIF1 BTG2 NCOA3 PLOD3 1.83E MiRTarBase:miRNA Target hsa-mir-504-5p FAS BBC3 TP53I3 RABGAP1 CDK6 PLCG2 RORA NDUFB5 1.83E MiRTarBase hsa-mir-145 CDKN1A MYC POU5F1 STAT1 KLF4 3.05E MiRTarBase hsa-mir-222 CDKN1C BBC3 STAT5A TNFSF E MiRTarBase:miRNA Target hsa-mir-29a-3p CDK6 COL4A2 PPM1D TNFAIP3 CCT4 KLF4 BTG2 SGK1 BBC3 AHR MYC 3.05E MiRTarBase:miRNA Target hsa-mir-18a-5p NCOA3 TSC22D3 SMAD3 TLE3 COX7B PCNA ITGA2 STK4 FAS RORA TNFAIP3 4.61E MiRTarBase:miRNA Target hsa-mir p CDK1 BTG2 ROCK1 SGK1 TNFAIP3 RANBP1 TGIF1 STK4 4.61E RegNetwork:miRNA Target hsa-mir-21 BTG2 CCL20 CCR1 CDK6 CDKN1A FAS MSH2 MYC NCOA3 PLOD3 TGFBR3 TGIF1 TNFAIP3 5.11E MiRTarBase:miRNA Target hsa-mir-26b-5p EP300 CDK6 ZNF141 TRAF1 PPM1D BTG1 CCNI NDUFB5 FN1 AKAP5 TP53I3 NR4A3 KLRC2 NFKBIE COL4A2 BTG2 ZNF85 TNFAIP3 DDB2 DPYSL3 CCT7 BATF KRT8 NFKB1 SIPA1 PCNA IRF4 RB1 ITGA2 TNFSF E RegNetwork:miRNA Target hsa-mir-146a CCNE2 CDKN1A FAS NFKB1 NR4A3 ROCK1 STAT1 TGIF1 TNFAIP8 1.28E MiRTarBase hsa-mir-146a CDKN1A FAS NFKB1 ROCK1 1.81E MiRTarBase:miRNA Target hsa-mir-98-5p MYC NCOA3 BIRC3 TRAF1 CDKN1A AHR FAS NFKB2 IFNGR1 TNFSF10 CCNF HBEGF PLAGL2 STK4 PLCG2 CD59 TGFBR3 2.19E MiRTarBase:miRNA Target hsa-mir-3665 TRAF1 ZNF85 BTG2 CDKN1A FGFR1 GDF15 PLAGL2 PLK1 3.41E MiRTarBase hsa-mir-34b* CDK6 MYC CCNE2 4.16E MiRTarBase:miRNA Target hsa-mir-24-3p MYC UBE2C NFKBIA CDK1 TNFAIP3 PLAGL2 PCNA RANBP1 CCNG1 TFDP2 NFE2L1 AKAP5 PPM1D STK4 CCL3 CCL4 BBC3 4.16E RegNetwork:miRNA Target hsa-mir-34b* CCNE2 CDK6 MYC 4.43E MiRTarBase:miRNA Target hsa-let-7e-5p COL6A1 CCNG1 TRRAP STK4 NCOA3 CDKN1A MYC AHR PLAGL2 PLCG2 CD59 TGFBR3 PLK1 TNFAIP3 4.50E MiRTarBase:miRNA Target hsa-mir-146a-5p ROCK1 NFKB1 CDKN1A FAS STAT1 IFITM1 CD80 MSC 4.50E MiRTarBase:miRNA Target hsa-mir-29b-3p COL4A2 CDK6 NCOA3 GRN BBC3 BTG2 SGK1 TNFAIP3 MYC 4.50E MiRTarBase hsa-mir-503 CDKN1A CCNF CCNE2 4.53E MiRTarBase hsa-mir-124 CDK6 TSC22D3 AHR RELA CD164 PLOD3 CD E MiRTarBase:miRNA Target hsa-let-7g-5p MYC FN1 STK4 NCOA3 CDKN1A AHR PLAGL2 PLCG2 CD59 TGFBR3 5.46E MiRTarBase:miRNA Target hsa-let-7a-5p CDK6 MYC NFKB1 CDKN1A TRRAP CCNG1 STK4 NCOA3 AHR PLAGL2 PLCG2 CD59 TGFBR3 TNFAIP3 6.01E MiRTarBase:miRNA Target hsa-mir-3919 ITGA2 MYC SGK1 IFNGR1 CD59 BTG1 RELA 6.26E RegNetwork:miRNA Target hsa-mir-518c CARTPT GRN NDUFB5 NFE2L1 PTN RB1 TFDP2 6.70E MiRTarBase:miRNA Target hsa-mir p CDK6 RORA CDK1 XPC 7.52E MiRTarBase hsa-mir-221 CDKN1C BBC3 TNFSF E MiRTarBase:miRNA Target hsa-let-7c-5p MYC CCNF TRRAP CCNG1 STK4 NCOA3 CDKN1A AHR PLAGL2 PLCG2 CD59 TGFBR3 18

19 Figure 8. Overrepresented mirna target genes. Most significant are two gene-sets for mir-145 and three gene-sets for mir

20 A B C D Figure 9. Gene characteristics plots. A) Distribution of the 147 genes on the chromosomes. B) Number of exons. C) Type of genes. D). Number of transcripts per gene. Chi-squared test are done to test the difference between query genes with other genes on the genome. 20

21 A B C D E F Figure 10. Distribution of gene feature lengths. Gene lengths are log-transformed. T-test are conducted to test the difference between the query and other protein-coding genes on the genome. The 147 genes have slightly longer 5 UTR when compared with other genes. 21

22 Figure 11. Display of query genes on the genome. Only part of the plot covering chromosomes and X is shown. 22

23 Table 11. Enriched Pfam domains in the query genes. adj.pval ngenes Pathways 6.10E-08 5 Cyclin, N-terminal domain 1.60E-05 5 Helix-loop-helix DNA-binding domain 3.50E-04 3 bzip transcription factor 5.80E-04 2 Rel homology domain (RHD) 7.00E-04 3 Small cytokines (intecrine/chemokine), interleukin-8 like 7.80E-04 2 BTG family 1.70E-03 2 TCP-1/cpn60 chaperonin family 2.00E-03 2 Cyclin, C-terminal domain 2.40E-03 2 Protein-tyrosine phosphatase 2.80E-03 2 Ankyrin repeat Figure 12. Protein-protein interaction network among query genes. This interactively, richly annotated network is accessible on the STRING website through a custom link generated by the API. 23

24 References: 1. Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, Koplev S, Jenkins SL, Jagodnik KM, Lachmann A, et al: Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 2016, 44:W Jung D, Ge SX: PPInfer: a Bioconductor package for inferring functionally related proteins using protein interaction networks. F1000Research 2018, 6: Eden E, Navon R, Steinfeld I, Lipson D, Yakhini Z: GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics 2009, 10: Weirauch MT, Yang A, Albu M, Cote AG, Montenegro-Montero A, Drewe P, Najafabadi HS, Lambert SA, Mann I, Cook K, et al: Determination and inference of eukaryotic transcription factor sequence specificity. Cell 2014, 158: Yi X, Du Z, Su Z: PlantGSEA: a gene set enrichment analysis toolkit for plant community. Nucleic Acids Res 2013, 41:W Tsai MH, Chen X, Chandramouli GV, Chen Y, Yan H, Zhao S, Keng P, Liber HL, Coleman CN, Mitchell JB, Chuang EY: Transcriptional responses to ionizing radiation reveal that p53r2 protects against radiation-induced mutagenesis in human lymphoblastoid cells. Oncogene 2006, 25: Mi H, Muruganujan A, Casagrande JT, Thomas PD: Large-scale gene function analysis with the PANTHER classification system. Nat Protoc 2013, 8: Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, et al: STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 2015, 43:D Liberzon A, Birger C, Thorvaldsdottir H, Ghandi M, Mesirov JP, Tamayo P: The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst 2015, 1: Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009, 4: MSigDB website [ ml] 12. Reimand J, Arak T, Adler P, Kolberg L, Reisberg S, Peterson H, Vilo J: g:profiler-a web server for functional interpretation of gene lists (2016 update). Nucleic Acids Res 2016, 44:W Mikkelsen RB, Wardman P: Biological chemistry of reactive oxygen and nitrogen and radiationinduced signal transduction mechanisms. Oncogene 2003, 22: Maier P, Hartmann L, Wenz F, Herskind C: Cellular Pathways in Response to Ionizing Radiation and Their Targetability for Tumor Radiosensitization. Int J Mol Sci 2016, Zhang Q, Lenardo MJ, Baltimore D: 30 Years of NF-kappaB: A Blossoming of Relevance to Human Pathobiology. Cell 2017, 168: Ando T, Kawabe T, Ohara H, Ducommun B, Itoh M, Okamoto T: Involvement of the interaction between p21 and proliferating cell nuclear antigen for the maintenance of G2/M arrest after DNA damage. J Biol Chem 2001, 276: Liu ZP, Wu C, Miao H, Wu H: RegNetwork: an integrated database of transcriptional and posttranscriptional regulatory networks in human and mouse. Database (Oxford) 2015, Friard O, Re A, Taverna D, De Bortoli M, Cora D: CircuitsDB: a database of mixed microrna/transcription factor feed-forward regulatory circuits in human and mouse. BMC Bioinformatics 2010, 11: Jiang C, Xuan Z, Zhao F, Zhang MQ: TRED: a transcriptional regulatory element database, new entries and other development. Nucleic Acids Res 2007, 35:D Consortium EP: An integrated encyclopedia of DNA elements in the human genome. Nature 2012, 489: Han H, Shim H, Shin D, Shim JE, Ko Y, Shin J, Kim H, Cho A, Kim E, Lee T, et al: TRRUST: a reference database of human transcriptional regulatory interactions. Sci Rep 2015, 5: Suzuki HI, Yamagata K, Sugimoto K, Iwamoto T, Kato S, Miyazono K: Modulation of microrna processing by p53. Nature 2009, 460:

25 23. Yu Y, Wang Y, Ren X, Tsuyada A, Li A, Liu LJ, Wang SE: Context-dependent bidirectional regulation of the MutS homolog 2 by transforming growth factor beta contributes to chemoresistance in breast cancer cells. Mol Cancer Res 2010, 8: Wan G, Mathur R, Hu X, Zhang X, Lu X: mirna response to DNA damage. Trends Biochem Sci 2011, 36: Ghose J, Sinha M, Das E, Jana NR, Bhattacharyya NP: Regulation of mir-146a by RelA/NFkB and p53 in STHdh(Q111)/Hdh(Q111) cells, a cell model of Huntington's disease. PLoS One 2011, 6:e Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25: Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K: KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res 2017, 45:D353-D Araki H, Knapp C, Tsai P, Print C: GeneSetDB: A comprehensive meta-database, statistical and visualisation framework for gene set analysis. FEBS Open Bio 2012, 2: Zheng G, Tu K, Yang Q, Xiong Y, Wei C, Xie L, Zhu Y, Li Y: ITFP: an integrated platform of mammalian transcription factors. Bioinformatics 2008, 24: Neph S, Stergachis AB, Reynolds A, Sandstrom R, Borenstein E, Stamatoyannopoulos JA: Circuitry and dynamics of human transcription factor regulatory networks. Cell 2012, 150: Marbach D, Lamparter D, Quon G, Kellis M, Kutalik Z, Bergmann S: Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases. Nat Methods 2016, 13: Wong N, Wang X: mirdb: an online resource for microrna target prediction and functional annotations. Nucleic Acids Res 2015, 43:D Agarwal V, Bell GW, Nam JW, Bartel DP: Predicting effective microrna target sites in mammalian mrnas. Elife 2015, Chou CH, Shrestha S, Yang CD, Chang NW, Lin YL, Liao KW, Huang WC, Sun TH, Tu SJ, Lee WH, et al: mirtarbase update 2018: a resource for experimentally validated microrna-target interactions. Nucleic Acids Res 2018, 46:D296-D

TNFSF13B tumor necrosis factor (ligand) superfamily, member 13b NF-kB pathway cluster, Enrichment Score: 3.57

TNFSF13B tumor necrosis factor (ligand) superfamily, member 13b NF-kB pathway cluster, Enrichment Score: 3.57 Appendix 2. Highly represented clusters of genes in the differential expression of data. Immune Cluster, Enrichment Score: 5.17 GO:0048584 positive regulation of response to stimulus GO:0050778 positive

More information

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo Micro-RNA web tools UBio Training Courses Gonzalo Gómez//ggomez@cnio.es Introduction mirnas, target prediction, biology Experimental data Network Filtering Pathway interpretation mirs-pathways network

More information

* Kyoto Encyclopedia of Genes and Genomes.

* Kyoto Encyclopedia of Genes and Genomes. Supplemental Material Complete gene expression data using Affymetrix 3PRIME IVT ID Chip (54,614 genes) and human immature dendritic cells stimulated with rbmasnrs, IL-8 and control (media) has been deposited

More information

Cancer. The fundamental defect is. unregulated cell division. Properties of Cancerous Cells. Causes of Cancer. Altered growth and proliferation

Cancer. The fundamental defect is. unregulated cell division. Properties of Cancerous Cells. Causes of Cancer. Altered growth and proliferation Cancer The fundamental defect is unregulated cell division. Properties of Cancerous Cells Altered growth and proliferation Loss of growth factor dependence Loss of contact inhibition Immortalization Alterated

More information

Gene-microRNA network module analysis for ovarian cancer

Gene-microRNA network module analysis for ovarian cancer Gene-microRNA network module analysis for ovarian cancer Shuqin Zhang School of Mathematical Sciences Fudan University Oct. 4, 2016 Outline Introduction Materials and Methods Results Conclusions Introduction

More information

High AU content: a signature of upregulated mirna in cardiac diseases

High AU content: a signature of upregulated mirna in cardiac diseases https://helda.helsinki.fi High AU content: a signature of upregulated mirna in cardiac diseases Gupta, Richa 2010-09-20 Gupta, R, Soni, N, Patnaik, P, Sood, I, Singh, R, Rawal, K & Rani, V 2010, ' High

More information

Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome

Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome Models of cell signalling uncover molecular mechanisms of high-risk neuroblastoma and predict outcome Marta R. Hidalgo 1, Alicia Amadoz 2, Cankut Çubuk 1, José Carbonell-Caballero 2 and Joaquín Dopazo

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

CURRICULUM VITA OF Xiaowen Chen

CURRICULUM VITA OF Xiaowen Chen Xiaowen Chen College of Bioinformatics Science and Technology Harbin Medical University Harbin, 150086, P. R. China Mobile Phone:+86 13263501862 E-mail: hrbmucxw@163.com Education Experience 2008-2011

More information

15. Supplementary Figure 9. Predicted gene module expression changes at 24hpi during HIV

15. Supplementary Figure 9. Predicted gene module expression changes at 24hpi during HIV Supplementary Information Table of content 1. Supplementary Table 1. Summary of RNAseq data and mapping statistics 2. Supplementary Table 2. Biological functions enriched in 12 hpi DE genes, derived from

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

The Transcriptional Response to Irradiation: Time to Move the Focus from Cellular to Tissutal Level

The Transcriptional Response to Irradiation: Time to Move the Focus from Cellular to Tissutal Level Laboratory of Functional Genomics and Proteomics of Model System Rodolfo Negri Sapienza Università di Roma The Transcriptional Response to Irradiation: Time to Move the Focus from Cellular to Tissutal

More information

EXPression ANalyzer and DisplayER

EXPression ANalyzer and DisplayER EXPression ANalyzer and DisplayER Tom Hait Aviv Steiner Igor Ulitsky Chaim Linhart Amos Tanay Seagull Shavit Rani Elkon Adi Maron-Katz Dorit Sagir Eyal David Roded Sharan Israel Steinfeld Yossi Shiloh

More information

Supplementary information for: Community detection for networks with. unipartite and bipartite structure. Chang Chang 1, 2, Chao Tang 2

Supplementary information for: Community detection for networks with. unipartite and bipartite structure. Chang Chang 1, 2, Chao Tang 2 Supplementary information for: Community detection for networks with unipartite and bipartite structure Chang Chang 1, 2, Chao Tang 2 1 School of Life Sciences, Peking University, Beiing 100871, China

More information

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells CAMDA 2009 October 5, 2009 STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells Guohua Wang 1, Yadong Wang 1, Denan Zhang 1, Mingxiang Teng 1,2, Lang Li 2, and Yunlong Liu 2 Harbin

More information

Cancer. The fundamental defect is. unregulated cell division. Properties of Cancerous Cells. Causes of Cancer. Altered growth and proliferation

Cancer. The fundamental defect is. unregulated cell division. Properties of Cancerous Cells. Causes of Cancer. Altered growth and proliferation Cancer The fundamental defect is unregulated cell division. Properties of Cancerous Cells Altered growth and proliferation Loss of growth factor dependence Loss of contact inhibition Immortalization Alterated

More information

Supporting Information

Supporting Information Supporting Information Retinal expression of small non-coding RNAs in a murine model of proliferative retinopathy Chi-Hsiu Liu 1, Zhongxiao Wang 1, Ye Sun 1, John Paul SanGiovanni 2, Jing Chen 1, * 1 Department

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

Inferring condition-specific mirna activity from matched mirna and mrna expression data

Inferring condition-specific mirna activity from matched mirna and mrna expression data Inferring condition-specific mirna activity from matched mirna and mrna expression data Junpeng Zhang 1, Thuc Duy Le 2, Lin Liu 2, Bing Liu 3, Jianfeng He 4, Gregory J Goodall 5 and Jiuyong Li 2,* 1 Faculty

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi: 1.138/nature89 IFN- (ng ml ) 5 4 3 1 Splenocytes NS IFN- (ng ml ) 6 4 Lymph node cells NS Nfkbiz / Nfkbiz / Nfkbiz / Nfkbiz / IL- (ng ml ) 3 1 Splenocytes IL- (ng ml ) 1 8 6 4 *** ** Lymph node cells

More information

Deregulation of signal transduction and cell cycle in Cancer

Deregulation of signal transduction and cell cycle in Cancer Deregulation of signal transduction and cell cycle in Cancer Tuangporn Suthiphongchai, Ph.D. Department of Biochemistry Faculty of Science, Mahidol University Email: tuangporn.sut@mahidol.ac.th Room Pr324

More information

Statistical Issues in Translational Cancer Research

Statistical Issues in Translational Cancer Research Statistical Issues in Translational Cancer Research Martin Bøgsted Department of Haematology Aalborg University Hospital and Department of Clinical Medicine Aalborg University The only useful function

More information

PCxN: The pathway co-activity map: a resource for the unification of functional biology

PCxN: The pathway co-activity map: a resource for the unification of functional biology PCxN: The pathway co-activity map: a resource for the unification of functional biology Sheffield Institute for Translational Neurosciences Center for Integrative Genome Translation GENOME INFORMATICS

More information

Review (V1) - The Hallmarks of Cancer

Review (V1) - The Hallmarks of Cancer Review (V1) - The Hallmarks of Cancer Robert A. Weinberg 1 Review (V1) - The Hallmarks of Cancer 2 Review (V1) - Number of somatic mutations in human cancers Top: children vs. adults Numbers in parentheses

More information

cis-regulatory enrichment analysis in human, mouse and fly

cis-regulatory enrichment analysis in human, mouse and fly cis-regulatory enrichment analysis in human, mouse and fly Zeynep Kalender Atak, PhD Laboratory of Computational Biology VIB-KU Leuven Center for Brain & Disease Research Laboratory of Computational Biology

More information

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Promoter Motif Analysis Shisong Ma 1,2*, Michael Snyder 3, and Savithramma P Dinesh-Kumar 2* 1 School of Life Sciences, University

More information

Review (V1) - The Hallmarks of Cancer

Review (V1) - The Hallmarks of Cancer Review (V1) - The Hallmarks of Cancer Robert A. Weinberg 1 Review (V1) - The Hallmarks of Cancer 2 Review (V1) - Number of somatic mutations in human cancers Top: children vs. adults Numbers in parentheses

More information

Supplementary Materials for

Supplementary Materials for www.sciencesignaling.org/cgi/content/full/7/310/ra11/dc1 Supplementary Materials for STAT3 Induction of mir-146b Forms a Feedback Loop to Inhibit the NF-κB to IL-6 Signaling Axis and STAT3-Driven Cancer

More information

Interleukin-6; pathogenesis and treatment of autoimmune inflammatory diseases

Interleukin-6; pathogenesis and treatment of autoimmune inflammatory diseases 54 Review Article Interleukin-6; pathogenesis and treatment of autoimmune inflammatory diseases Toshio Tanaka 1, 2), Masashi Narazaki 3), Kazuya Masuda 4) and Tadamitsu Kishimoto 4, ) 1) Department of

More information

RAS Genes. The ras superfamily of genes encodes small GTP binding proteins that are responsible for the regulation of many cellular processes.

RAS Genes. The ras superfamily of genes encodes small GTP binding proteins that are responsible for the regulation of many cellular processes. ۱ RAS Genes The ras superfamily of genes encodes small GTP binding proteins that are responsible for the regulation of many cellular processes. Oncogenic ras genes in human cells include H ras, N ras,

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information

Computational discovery of mir-tf regulatory modules in human genome

Computational discovery of mir-tf regulatory modules in human genome Computational discovery of mir-tf regulatory modules in human genome Dang Hung Tran* 1,3, Kenji Satou 1,2, Tu Bao Ho 1, Tho Hoan Pham 3 1 School of Knowledge Science, Japan Advanced Institute of Science

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. Supplementary Figure 1 Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. (a) Pearson correlation heatmap among open chromatin profiles of different

More information

Oncogenes and Tumor Suppressors MCB 5068 November 12, 2013 Jason Weber

Oncogenes and Tumor Suppressors MCB 5068 November 12, 2013 Jason Weber Oncogenes and Tumor Suppressors MCB 5068 November 12, 2013 Jason Weber jweber@dom.wustl.edu Oncogenes & Cancer DNA Tumor Viruses Simian Virus 40 p300 prb p53 Large T Antigen Human Adenovirus p300 E1A

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Regulation of cell signaling cascades by influenza A virus

Regulation of cell signaling cascades by influenza A virus The Second International Symposium on Optimization and Systems Biology (OSB 08) Lijiang, China, October 31 November 3, 2008 Copyright 2008 ORSC & APORC, pp. 389 394 Regulation of cell signaling cascades

More information

Ch. 18 Regulation of Gene Expression

Ch. 18 Regulation of Gene Expression Ch. 18 Regulation of Gene Expression 1 Human genome has around 23,688 genes (Scientific American 2/2006) Essential Questions: How is transcription regulated? How are genes expressed? 2 Bacteria regulate

More information

Supplementary Table 1: List of the 242 hypoxia/reoxygenation marker genes collected from literature and databases

Supplementary Table 1: List of the 242 hypoxia/reoxygenation marker genes collected from literature and databases Supplementary Tables: Supplementary Table 1: List of the 242 hypoxia/reoxygenation marker genes collected from literature and databases Supplementary Table 2: Contingency tables used in the fisher exact

More information

MicroRNA expression profiling and functional analysis in prostate cancer. Marco Folini s.c. Ricerca Traslazionale DOSL

MicroRNA expression profiling and functional analysis in prostate cancer. Marco Folini s.c. Ricerca Traslazionale DOSL MicroRNA expression profiling and functional analysis in prostate cancer Marco Folini s.c. Ricerca Traslazionale DOSL What are micrornas? For almost three decades, the alteration of protein-coding genes

More information

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis Jian Xu, Ph.D. Children s Research Institute, UTSW Introduction Outline Overview of genomic and next-gen sequencing technologies

More information

Interpreting Diverse Genomic Data Using Gene Sets

Interpreting Diverse Genomic Data Using Gene Sets Interpreting Diverse Genomic Data Using Gene Sets giovanni parmigiani@dfci.harvard.edu FHCRC February 2012 Gene Sets Leary 2008 PNAS Alterations in the combined FGF, EGFR, ERBB2 and PIK3 pathways. Red:

More information

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7 SUPPLEMENTAL DATA Figure S1. Muscle mass changes in different anatomical regions with age. (A) The TA and gastrocnemius muscle showed a significant loss of weight in aged mice (24 month old) compared to

More information

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping Melanie Rae Simpson PhD candidate Department of Public Health and General Practice Norwegian University of

More information

A bioinformatics driven cancer study

A bioinformatics driven cancer study The 11th China-Japan-Korea Bioinformatics Training Course & Symposium, June 16-18, Suzhou, China A bioinformatics driven cancer study Chao Li and Yixue Li yxli@sibs.ac.cn Shanghai Institutes for Bio-Medicine

More information

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed.

of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Supplementary Note The potential association and implications of HBV integration at known and putative cancer genes of TERT, MLL4, CCNE1, SENP5, and ROCK1 on tumor development were discussed. Human telomerase

More information

In-Silico Screening of Biomarker Genes of Hepatocellular Carcinoma Using R/Bioconductor

In-Silico Screening of Biomarker Genes of Hepatocellular Carcinoma Using R/Bioconductor Computational Biology and Bioinformatics 2017; 5(3): 36-42 http://www.sciencepublishinggroup.com/j/cbb doi: 10.11648/j.cbb.20170503.12 ISSN: 2330-8265 (Print); ISSN: 2330-8281 (Online) In-Silico Screening

More information

Supplementary Material. Table S1. Summary of mapping results

Supplementary Material. Table S1. Summary of mapping results Supplementary Material Table S1. Summary of mapping results Sample Total reads Mapped (%) HNE0-1 99687516 80786083 (81.0%) HNE0-2 94318720 77047792 (81.7%) HNE0-3 104033900 84348102 (81.1%) HNE15-1 94426598

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Supplementary Files. Novel blood-based microrna biomarker panel for early diagnosis of chronic pancreatitis

Supplementary Files. Novel blood-based microrna biomarker panel for early diagnosis of chronic pancreatitis Supplementary Files Novel blood-based microrna biomarker panel for early diagnosis of chronic pancreatitis Lei Xin 1 *, M.D., Jun Gao 1 *, Ph.D., Dan Wang 1 *, M.D., Jin-Huan Lin 1, M.D., Zhuan Liao 1,

More information

Analysis of paired mirna-mrna microarray expression data using a stepwise multiple linear regression model

Analysis of paired mirna-mrna microarray expression data using a stepwise multiple linear regression model Analysis of paired mirna-mrna microarray expression data using a stepwise multiple linear regression model Yiqian Zhou 1, Rehman Qureshi 2, and Ahmet Sacan 3 1 Pure Storage, 650 Castro Street, Suite #260,

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Figure S1. Urodynamic recording of BOO-induced LUTD patients. (A) DO group. BOO patients with increased detrusor pressure and reduced urine flow during pressure flow in combination

More information

Inferring Biological Meaning from Cap Analysis Gene Expression Data

Inferring Biological Meaning from Cap Analysis Gene Expression Data Inferring Biological Meaning from Cap Analysis Gene Expression Data HRYSOULA PAPADAKIS 1. Introduction This project is inspired by the recent development of the Cap analysis gene expression (CAGE) method,

More information

LUMC: LOPAC screen analysis

LUMC: LOPAC screen analysis LUMC: LOPAC screen analysis Preliminary results November 9, 2012 This document briefly presents the results from the LOPAC compounds screen performed at LUMC. It contains the following information: The

More information

Integrative modeling of transcriptional regulatory networks in head and neck cancer

Integrative modeling of transcriptional regulatory networks in head and neck cancer Integrative modeling of transcriptional regulatory networks in head and neck cancer Bin Yan 1*, Huai Li, Zhong Chen 3, Jiaofang Shao 1, Ming Zhan 4* 1 Department of Biology, Hong Kong Baptist University,

More information

BIO360 Fall 2013 Quiz 1

BIO360 Fall 2013 Quiz 1 BIO360 Fall 2013 Quiz 1 1. Examine the diagram below. There are two homologous copies of chromosome one and the allele of YFG carried on the light gray chromosome has undergone a loss-of-function mutation.

More information

Introduction. Cancer Biology. Tumor-suppressor genes. Proto-oncogenes. DNA stability genes. Mechanisms of carcinogenesis.

Introduction. Cancer Biology. Tumor-suppressor genes. Proto-oncogenes. DNA stability genes. Mechanisms of carcinogenesis. Cancer Biology Chapter 18 Eric J. Hall., Amato Giaccia, Radiobiology for the Radiologist Introduction Tissue homeostasis depends on the regulated cell division and self-elimination (programmed cell death)

More information

Protein SD Units (P-value) Cluster order

Protein SD Units (P-value) Cluster order SUPPLEMENTAL TABLE AND FIGURES Table S1. Signature Phosphoproteome of CD22 E12 Transgenic Mouse BPL Cells. T-test vs. Other Protein SD Units (P-value) Cluster order ATPase (Ab-16) 1.41 0.000880 1 mtor

More information

Oncomine meta-analysis of breast cancer microarray data identifies upregulation of NRF-1 expression in human breast carcinoma

Oncomine meta-analysis of breast cancer microarray data identifies upregulation of NRF-1 expression in human breast carcinoma 18 th World IMACS / MODSIM Congress, Cairns, Australia 13-17 July 2009 http://mssanz.org.au/modsim09 Oncomine meta-analysis of breast cancer microarray data identifies upregulation of NRF-1 expression

More information

Interpreting Diverse Genomic Data Using Gene Sets

Interpreting Diverse Genomic Data Using Gene Sets Interpreting Diverse Genomic Data Using Gene Sets giovanni parmigiani@dfci.harvard.edu FHCRC February 2012 Gene Sets Leary 2008 PNAS Alterations in the combined FGF, EGFR, ERBB2 and PIK3 pathways. Red:

More information

RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers

RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers Xu et al. Molecular Cancer (2019) 18:8 https://doi.org/10.1186/s12943-018-0932-8 LETTER TO THE EDITOR RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers

More information

Plasma-Seq conducted with blood from male individuals without cancer.

Plasma-Seq conducted with blood from male individuals without cancer. Supplementary Figures Supplementary Figure 1 Plasma-Seq conducted with blood from male individuals without cancer. Copy number patterns established from plasma samples of male individuals without cancer

More information

Gene Ontology 2 Function/Pathway Enrichment. Biol4559 Thurs, April 12, 2018 Bill Pearson Pinn 6-057

Gene Ontology 2 Function/Pathway Enrichment. Biol4559 Thurs, April 12, 2018 Bill Pearson Pinn 6-057 Gene Ontology 2 Function/Pathway Enrichment Biol4559 Thurs, April 12, 2018 Bill Pearson wrp@virginia.edu 4-2818 Pinn 6-057 Function/Pathway enrichment analysis do sets (subsets) of differentially expressed

More information

mirna Dr. S Hosseini-Asl

mirna Dr. S Hosseini-Asl mirna Dr. S Hosseini-Asl 1 2 MicroRNAs (mirnas) are small noncoding RNAs which enhance the cleavage or translational repression of specific mrna with recognition site(s) in the 3 - untranslated region

More information

IPA Advanced Training Course

IPA Advanced Training Course IPA Advanced Training Course October 2013 Academia sinica Gene (Kuan Wen Chen) IPA Certified Analyst Agenda I. Data Upload and How to Run a Core Analysis II. Functional Interpretation in IPA Hands-on Exercises

More information

Mechanism for targeting CDK4, BTK and PI3K in Mantle Cell Lymphoma. Selina Chen-Kiang, Ph.D. Weill Cornell Medicine

Mechanism for targeting CDK4, BTK and PI3K in Mantle Cell Lymphoma. Selina Chen-Kiang, Ph.D. Weill Cornell Medicine Mechanism for targeting CDK4, BTK and PI3K in Mantle Cell Lymphoma Selina Chen-Kiang, Ph.D. Weill Cornell Medicine Goal To develop mechanism-based therapy in mantle cell lymphoma Effective, tolerable,

More information

Supplementary Materials for

Supplementary Materials for www.sciencesignaling.org/cgi/content/full/6/264/rs4/dc1 Supplementary Materials for A Systems Approach for Decoding Mitochondrial Retrograde Signaling Pathways Sehyun Chae, Byung Yong Ahn, Kyunghee Byun,

More information

Cell cycle, signaling to cell cycle, and molecular basis of oncogenesis

Cell cycle, signaling to cell cycle, and molecular basis of oncogenesis Cell cycle, signaling to cell cycle, and molecular basis of oncogenesis MUDr. Jiří Vachtenheim, CSc. CELL CYCLE - SUMMARY Basic terminology: Cyclins conserved proteins with homologous regions; their cellular

More information

A General Workflow to Analyze Key Cancer-Related Genes Based on NCBI illustrated by the case of gastric cancer

A General Workflow to Analyze Key Cancer-Related Genes Based on NCBI illustrated by the case of gastric cancer A General Workflow to Analyze Key Cancer-Related Genes Based on NCBI illustrated by the case of gastric cancer Zhikuan Quan School of Statistics, Beijing Normal University, Beijing 100875, China 201511011114@bnu.edu.cn

More information

Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes

Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes Comparison of Gene Set Analysis with Various Score Transformations to Test the Significance of Sets of Genes Ivan Arreola and Dr. David Han Department of Management of Science and Statistics, University

More information

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

Supplementary figures legends

Supplementary figures legends Supplementary Information to Nijmegen Breakage Syndrome fibroblasts and ipscs: cellular models for uncovering diseaseassociated signaling pathways and establishing a screening platform for anti-oxidants

More information

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review

More information

Identifying transcriptional dichotomies by "stochastic sampling"

Identifying transcriptional dichotomies by stochastic sampling Identifying transcriptional dichotomies by "stochastic sampling" Kevin Janes and Joan Brugge Department of Cell Biology Harvard Medical School pakt (S473) keratin 10 DAPI National Cancer Institute (CA105134)

More information

Case Study - Informatics

Case Study - Informatics bd@jubilantbiosys.com Case Study - Informatics www.jubilantbiosys.com Validating as a Prognostic marker and Therapeutic Target for Multiple Cancers Introduction Human genome projects and high throughput

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature19360 Supplementary Tables Supplementary Table 1. Number of monoclonal reads in each sample Sample Number of cells Total reads Aligned reads Monoclonal reads

More information

To test the possible source of the HBV infection outside the study family, we searched the Genbank

To test the possible source of the HBV infection outside the study family, we searched the Genbank Supplementary Discussion The source of hepatitis B virus infection To test the possible source of the HBV infection outside the study family, we searched the Genbank and HBV Database (http://hbvdb.ibcp.fr),

More information

Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma

Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma www.bioinformation.net Volume 13(6) Hypothesis Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma Ekta Pathak 1, Bhavya 1, Divya Mishra 1, Neelam Atri 1, 2, Rajeev

More information

CELL CYCLE MOLECULAR BASIS OF ONCOGENESIS

CELL CYCLE MOLECULAR BASIS OF ONCOGENESIS CELL CYCLE MOLECULAR BASIS OF ONCOGENESIS Summary of the regulation of cyclin/cdk complexes during celll cycle Cell cycle phase Cyclin-cdk complex inhibitor activation Substrate(s) G1 Cyclin D/cdk 4,6

More information

How To Use MiRSEA. Junwei Han. July 1, Overview 1. 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2

How To Use MiRSEA. Junwei Han. July 1, Overview 1. 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2 How To Use MiRSEA Junwei Han July 1, 2015 Contents 1 Overview 1 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2 3 Discovering the dysregulated pathways(or prior gene sets) based

More information

Signal Transduction Pathway Smorgasbord

Signal Transduction Pathway Smorgasbord Molecular Cell Biology Lecture. Oct 28, 2014 Signal Transduction Pathway Smorgasbord Ron Bose, MD PhD Biochemistry and Molecular Cell Biology Programs Washington University School of Medicine Outline 1.

More information

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Supplemental Data Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Jianfu Jiang 1, Xinna Liu 1, Guotian Liu, Chonghuih Liu*, Shaohuah Li*, and Lijun

More information

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm:

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm: The clustering problem: partition genes into distinct sets with high homogeneity and high separation Hierarchical clustering algorithm: 1. Assign each object to a separate cluster. 2. Regroup the pair

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

Research Article Modeling Gene Regulation in Liver Hepatocellular Carcinoma with Random Forests

Research Article Modeling Gene Regulation in Liver Hepatocellular Carcinoma with Random Forests BioMed Volume 2016, Article ID 1035945, 6 pages http://dx.doi.org/10.1155/2016/1035945 Research Article Modeling Gene Regulation in Liver Hepatocellular Carcinoma with Random Forests Hilal Kazan Department

More information

GENE EXPRESSION AND microrna PROFILING IN LYMPHOMAS. AN INTEGRATIVE GENOMICS ANALYSIS

GENE EXPRESSION AND microrna PROFILING IN LYMPHOMAS. AN INTEGRATIVE GENOMICS ANALYSIS GENE EXPRESSION AND microrna PROFILING IN LYMPHOMAS. AN INTEGRATIVE GENOMICS ANALYSIS Dra. Margarita Sánchez-Beato Lymphoma Group Molecular Pathology Programme CNIO Agilent Microarray and Sequencing Roadshow,

More information

Pathogenesis of Natural Killer / T-cell Lymphoma

Pathogenesis of Natural Killer / T-cell Lymphoma Pathogenesis of Natural Killer / T-cell Lymphoma Prof Chng Wee Joo Provost s Chair Professor Yong Loo Lin School of Medicine, NUS Director National University Cancer Institute, Singapore Deputy Director

More information

Herpesviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics

Herpesviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics Herpesviruses Virion Genome Genes and proteins Viruses and hosts Diseases Distinctive characteristics Virion Enveloped icosahedral capsid (T=16), diameter 125 nm Diameter of enveloped virion 200 nm Capsid

More information

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Supplementary Materials RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Junhee Seok 1*, Weihong Xu 2, Ronald W. Davis 2, Wenzhong Xiao 2,3* 1 School of Electrical Engineering,

More information

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities Gábor Boross,2, Katalin Orosz,2 and Illés J. Farkas 2, Department of Biological Physics,

More information

CPM (x 10-3 ) Tregs +Teffs. Tregs alone ICOS CLTA-4

CPM (x 10-3 ) Tregs +Teffs. Tregs alone ICOS CLTA-4 A 2,5 B 4 Number of cells (x 1-6 ) 2, 1,5 1, 5 CPM (x 1-3 ) 3 2 1 5 1 15 2 25 3 Days of culture 1/1 1/2 1/4 1/8 1/16 1/32 Treg/Teff ratio C alone alone alone alone CD25 FoxP3 GITR CD44 ICOS CLTA-4 CD127

More information

Dynamic reorganization of the AC16 cardiomyocyte transcriptome in response to TNFα signaling revealed by integrated genomic analyses. Luo et al.

Dynamic reorganization of the AC16 cardiomyocyte transcriptome in response to TNFα signaling revealed by integrated genomic analyses. Luo et al. Dynamic reorganization of the AC1 cardiomyocyte transcriptome in response to TNFα signaling revealed by integrated genomic analyses Luo et al. Luo et al. BMC Genomics 1, 15:155 Luo et al. BMC Genomics

More information

Epstein-Barr virus driven promoter hypermethylated genes in gastric cancer

Epstein-Barr virus driven promoter hypermethylated genes in gastric cancer RESEARCH FUND FOR THE CONTROL OF INFECTIOUS DISEASES Epstein-Barr virus driven promoter hypermethylated genes in gastric cancer J Yu *, KF To, QY Liang K e y M e s s a g e s 1. Somatostatin receptor 1

More information

T-Cell Network Example for GeneNet (August 2015) or later

T-Cell Network Example for GeneNet (August 2015) or later T-Cell Network Example for GeneNet 1.2.13 (August 2015) or later This note reproduces the T-Cell network example from R. Opgen-Rhein and K. Strimmer. 2006a. Using regularized dynamic correlation to infer

More information

Supplementary Material Correlation matrices on FP and FN profiles

Supplementary Material Correlation matrices on FP and FN profiles Supplementary Material Correlation matrices on FP and FN profiles The following two tables give the correlation coefficients for the FP profiles and the FN profiles of a single tagging solutions against

More information

Cancer. Questions about cancer. What is cancer? What causes unregulated cell growth? What regulates cell growth? What causes DNA damage?

Cancer. Questions about cancer. What is cancer? What causes unregulated cell growth? What regulates cell growth? What causes DNA damage? Questions about cancer What is cancer? Cancer Gil McVean, Department of Statistics, Oxford What causes unregulated cell growth? What regulates cell growth? What causes DNA damage? What are the steps in

More information

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA LEO TUNKLE *

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA   LEO TUNKLE * CERNA SEARCH METHOD IDENTIFIED A MET-ACTIVATED SUBGROUP AMONG EGFR DNA AMPLIFIED LUNG ADENOCARCINOMA PATIENTS HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA Email:

More information

V16: involvement of micrornas in GRNs

V16: involvement of micrornas in GRNs What are micrornas? V16: involvement of micrornas in GRNs How can one identify micrornas? What is the function of micrornas? Elisa Izaurralde, MPI Tübingen Huntzinger, Izaurralde, Nat. Rev. Genet. 12,

More information

BIO360 Fall 2013 Quiz 1

BIO360 Fall 2013 Quiz 1 BIO360 Fall 2013 Quiz 1 Name: Key 1. Examine the diagram below. There are two homologous copies of chromosome one and the allele of YFG carried on the light gray chromosome has undergone a loss-of-function

More information

Whole liver transcriptome analysis for the metabolic adaptation of dairy cows

Whole liver transcriptome analysis for the metabolic adaptation of dairy cows Whole liver transcriptome analysis for the metabolic adaptation of dairy cows Ngoc-Thuy Ha Animal Breeding and Genetics Group Department of Animal Sciences Georg-August-University Goettingen, Germany 1

More information