Supplementary Fig. 1 Composition of small RNA populations during 2 MEF reprogramming. Continued on next page.

Size: px
Start display at page:

Download "Supplementary Fig. 1 Composition of small RNA populations during 2 MEF reprogramming. Continued on next page."

Transcription

1 Supplementary Fig. 1 Composition of small RNA populations during 2 MEF reprogramming. Continued on next page.

2 Supplementary Fig. 1 Composition of small RNA populations during 2 MEF reprogramming. (a), Proportion of tags mapping to mirna loci across the 13 samples of the high/low OSKM trajectories and controls. (b), Length distribution of tags mapped to the nuclear genome outside mirna loci in selected libraries. (c), Length distribution of tags mapped to mitochondrial DNA in selected libraries. (d), Length distribution of tags mapped to known mirna loci (mirbase v18) in selected libraries. (e), Tags were grouped by genome feature and groups then clustered by expression across the 13 samples (thresholded to groups within the top 99 th percentile of expression in at least one sample). Tag length distribution and highest proportion of library reached is given on the right for each genome feature, which are further colour-coded as follows: annotated mirna loci, orange; mtdna loci, green; other loci, black. A subset of nuclear encoded loci gave rise to tags of 22 nt modal length (dust and trf simple repeats, LTR/ERVK, LINE/L1, LTR/ERVL-MaLR, SINE/MIR and non-assigned tags), potentially representing un-annotated mirnas in some cases, and showed little change in bulk expression. Relative to other small RNA species, expression of mirna (and mirna-sized small RNAs) is relatively stable during reprogramming.

3 Supplementary Fig. 2 Small RNAs mapped to mitochondrial DNA. Continued on next page.

4 Supplementary Fig. 2 Small RNAs derived from mitochondrial DNA. (a), Tag density distribution across the mitochondrial genome with schematic of mtdna features shown above. Two differently scaled views are shown to highlight distinct tag abundances in selected samples as indicated on the right. (b), Detailed view of tag density around specific trna loci (left to right, trnas: MT-TF, MT-TV and MT-TI, MT-TQ, MT-TM) and within the D-loop region (LSP, light strand promoter, HSP, heavy strand promoter, CSBs, conserved sequence block, Ori H, Origin of DNA replication). Genomic co-ordinates of features are shown below (mm9 assembly). (Colour code for tag density: blue, heavy strand; orange, light strand) (c), Normalised expression of short RNA (sml RNA) at selected mtdna loci across the 13 samples of the high/low OSKM trajectories and controls. All tags beginning at the nucleotide position indicated in brackets were included. Normalised expression of overlapping transcripts and/or surrounding trnas derived from long RNA sequencing 2 is also included (long read). The function of mt-dna encoded small RNAs is currently not known 3, although lack of correlation with long RNA from overlapping and surrounding loci for the trna-associated small RNAs suggests they may have independent functions 4, and good correlation of LSP small RNAs with surrounding long RNA reads suggests these are light strand promoter primers 5.

5 Supplementary Fig. 3 Detailed view of mirna processing variants during reprogramming. Contribution of non-canonical processing variants to mirna expression across 13 samples represented in Fig. 1a as red, blue and black circles. (a), 5 arm bias for each mirna hairpin precursor. (b), 5 isomir expression for each mature mirna. (c), 3 isomir expression. (d), Nontemplated additions in the last 2nt of tags (NTA). Stacked bar graph shows the observed base frequency for all tags with NTA. (e), Incidence of internal editing. mir-5099, mir-107-3p and mir- 92a-3p (grey circles) were excluded due to sequence discrepancies between genome and mirbase sequences (suggesting these observations are not true editing events). NOTE: mir-5117 has a SNP consistent with the observed base substitution (rs , dbsnp). All graphs display data for 424 mirnas with an average expression of >25 counts per million (CPM) per sample. Median variant proportion of total tags across all samples is shown, bars represent 25 th and 75 th percentiles; plotted against average tag abundance). mirnas with variation of isoform proportion between samples exceeding 20% are labelled in red, although visual inspection of the individual examples did not reveal any consistent trends along reprogramming trajectories. Furthermore, the relatively constant levels of NTA for each mirna despite major changes in abundance suggest that NTA might be a mirna feature with a function other than regulating mirna turnover 6. Shaded areas indicate mirnas with a proportion of 20% isoform variant expression.

6 Supplementary Fig. 4 Contribution of processing isoforms to mirna expression at the mir- 290/95 locus and targeting of Clock mrna by mir-292-3p. Continued on next page.

7 Supplementary Fig. 4 Contribution of processing isoforms to mirna expression at the mir- 290/95 locus and targeting of Clock mrna by mir-292-3p. (a) Schematic of the locus organisation with tag density (2 ipsc), incidence of hairpin arm bias, 5 and 3 isomirs, as well as non-templated additions indicated below. (b), Detailed views of tag density (mir-291a-5p) or sequences (mir-292 hairpin) are shown to highlight non-templated additions and isomir variability. Representative data from the 2 o ips sample are shown. (c), mir-292-3p was chosen for further analysis as it is the most highly expressed mirna from this locus exhibiting the ESC-dominant AAGUGC seed hexamer, shared between members of the mir-290/95, mir-302/367 and mir-17/92 clusters 7. Custom targetscan target prediction for the mir-292-3p isomirs were performed using 3 UTRs as observed in the long RNA-seq data generated in parallel 2. Gene ontology and KEGG pathway analysis of targets (performed using DAVID 8 ) suggests involvement of both 5 isomirs in regulating transcription and the cytoskeleton, whereas the canonical form is implicated in regulating protein catabolism and cell cycle. Bold circles indicate statistically significant enrichment (p-value <0.05, Benjamini-Hochberg multiple comparison test). Gene ontology terms were derived from GO:Biological process, unless specified. (d), To shortlist putative new targets of mir-292-3p we required multiple target sites to be present with a degree of anti-correlation between expression of mir-292-3p and the target. Clock, a master regulator of circadian rhythm (which is lost during reprogramming 9 ) and is also a predicted target of other mirnas with the AAGUGC seed, emerged as the best candidate for validation: i, Schematic of Clock mrna indicating two putative mir-292-3p target sites (site 1 and site 2) within its 3 UTR as observed in the long RNA-seq data generated in parallel 2. ii, Minimum free energy (MFE) calculations for both sites are shown with the predicted structure of mir-292-3p and its putative target. The DNA conservation at each site from vertebrate and avian species is shown (light grey shading); the seed region is indicated (black shading); the predicted bases bound by mi-292-3p are underlined. iii, Expression of mir-292-3p and Clock mrna across the 13 samples of the high/low OSKM trajectories and controls. iv, Validation of mir-292-3p targets by dual-luciferase assay showing the luciferase constructs used to test mir-292-3p binding. The ratios of Renilla versus firefly luciferase values were calculated and normalised to wildtype mir-292-3p binding site transfected with control mirna mimic. Each plasmid and mirna mimic pair was tested in six to eight independent experiments, each performed in triplicate. Data represent mean ± s.e.m and statistical significance was determined by Mann-Whitney U-test.

8 Supplementary Fig. 5 Contribution of processing isoforms to mirna expression at the mir- 302/367 locus. (a), Schematic of the locus organisation with tag density (2 ipsc), incidence of hairpin arm bias, 5 and 3 isomirs, as well as non-templated additions indicated below. (b), Detailed views of tag density (mir-302c-3p) or sequences (mir-302d hairpin) are shown to highlight non-templated additions and isomir variability. Representative data from the 2 o ips sample are shown. (c), mir-302d was chosen for further analysis as it is the most highly expressed mirna from this locus exhibiting the ESC-dominant AAGUGC seed hexamer, shared between members of the mir-290/95, mir-302/367 and mir-17/92 clusters 7. Custom targetscan target prediction of three prevalent mir-302d-3p 5 isomirs (seed homology with other mirnas expressed from this locus shown on right) 3 UTR as observed in the long RNA-seq data generated in parallel 2. Gene ontology and KEGG pathway analysis of targets (performed using DAVID 8 ) suggests these isomirs cooperate in regulating many similar reprogramming-related pathways. Bold circles indicate statistically significant enrichment (p-value <0.05, Benjamini-Hochberg multiple comparison test). The term cell cycle is derived from GO: Biological Process.

9 Supplementary Fig. 6 Validation of expression changes for 17 mirnas by qpcr. RNA for qpcr validation was obtained from the same samples used for sequencing and results normalised to total RNA, then to the average expression of 5 mirnas (mir-191-5p, p, -16-5p, p and -30e-5p) chosen because of their invariant expression in NGS data and confirmed to be a relatively stable reference by qpcr (shown in Fig. 1c). Expression measured by qpcr (orange, n=1) and sequencing (blue, n=1) is shown and mirnas are divided into three groups based on relative expression in F-class and ESC-like samples. Boxed mirnas were >4-fold different between ipsc and F-class states, represented in the top 95% of mirnas in at least one ipsc or F-class sample and included as class identifying mirnas in Supplementary Fig. 12. Note, D16L data was removed due to compromised cdna.

10 Supplementary Fig. 7 Hierarchical clustering of mirna expression profiles in reprogramming cell states and related Gene Ontology. Continued on next page.

11 Supplementary Fig. 7 Hierarchical clustering of mirna expression profiles in reprogramming cell states and related Gene Ontology. (a), Expression profiles were normalised to the average expression of each individual mirna across the 13 samples of the high/low OSKM trajectories and controls and log 2 transformed prior to clustering using Pearson correlation with complete linkage and 10,000 rounds of bootstrapping 10. The heatmap contains expression profiles of 217 mirnas that were expressed within the 95 th percentile of expression in at least one sample. Discrete clusters were defined as those with the highest-level node having a greater than 60% likelihood of co-clustering. A subset of clusters were categorised based on expression in the high OSKM trajectory and ESC-like cells. These subsets are shown in Fig. 3a and all clusters are detailed in Supplementary Data 1. (b), GO term Biological Process enrichment analysis of mirna expression clusters shown in a. using experimentally verified vertebrate mirna targets (see Methods). Terms with significant enrichment in at least one cluster are shown with selected parental terms (p<0.01, Benjamini-Hochberg multiple comparison test). Equivalent enrichment patterns were seen with GO Molecular Function and Cellular Compartment (not shown).

12 Supplementary Fig. 8 Early mirna expression changes under high OSKM conditions. Continued on next page.

13 Supplementary Fig. 8 Early mirna expression changes under high OSKM conditions. Samples were taken at 24-hour intervals for four days following addition of 1500 ng/ml of doxycycline to 2 MEF (see green dots in schematic shown in Fig. 1a), followed by small RNA library preparation and NGS analysis to an average depth of ~29.4 million mapped tags (See Methods, Supplementary Methods, Supplementary Data 2 and for details). 134 mirnas that were within the 95 th percentile of expression in at least one sample were included for further analyses. (a), Principal component projection plots for five sample mirna profiles in this set. (b), Percentage of differentially expressed mirnas (> 4-fold) at successive time points (up-regulated, yellow; down-regulated, blue). (c), mirna expression profiles were clustered as detailed for Supplementary Fig. 7, and shown on the right (with key constituent mirnas indicated). (d), Gene ontology (Biological Process) enrichment analysis of mirna expression clusters shown in c using DAVID 8 and experimentally verified vertebrate mirna targets from mirtarbase 11. Terms with significant enrichment in at least one cluster are shown with selected parental terms (p<0.01, Benjamini-Hochberg multiple comparison test). Equivalent enrichment patterns were seen with GO Molecular Function and Cellular Compartment (not shown). mirnas that changed expression early had diverse known functions resulting in no enriched gene ontologies within their targets as a whole. Later mirna changes were associated with targeting of pathways and processes involved in reprogramming such as cell cycle, cell survival, gene expression, development and differentiation.

14 Supplementary Fig. 9 Expression of mature and pri-mirnas, and associated epigenetic modifications at promoters. Continued on next page.

15 Supplementary Fig. 9 Expression of mature and pri-mirnas, and associated epigenetic modifications at promoters. Shown is an analysis of mature mirnas that were expressed within the 95 th percentile of expression in at least one sample, had observable pri-mirna expression in long RNA sequencing and could be unambiguously assigned to one genomic locus (128 mature mirnas and 71 loci). Transcriptional start sites were determined from the 5 end of observed primirna. (a), Expression profiles of mature and pri-mirnas were normalised to the average expression across the 13 samples of the high/low OSKM trajectories and controls. Corresponding levels of H3K4me3, H3K27me3 and DNA methylation at the promoters of these loci are shown. (b), Global median levels of H3K4me3 at mirna promoters. (c), Global median levels of H3K27me3 at mirna promoters. (d), Global median levels of DNA methylation at mirna promoters. b-d, Line shows median and whiskers show 5-95% range. Base data for figure is in Supplementary Data 1. Note: When assigning promoter regions based on observed pri-mirnas the mirnas of the imprinted Dlk1-Dio3 cluster are the main contributors to the group exhibiting high DNA methylation at promoters (shown with asterisk). DNA methylation levels of previously reported regulatory DMRs for the region are shown in Supplementary Fig 12.

16 Supplementary Fig. 10 Posttranscriptional mechanisms and complex transcriptional regulation can explain lack of correlation between mature mirna and pri-mirna expression. (a), Heatmaps of 16 mature mirnas (from 15 genomic loci) whose expression is negatively correlated with pri-mirna expression across the 13 samples of the high/low OSKM trajectories and controls. Mature mirna (top) and pri-mirna (bottom) expression was normalised to the average expression across the 13 samples and grouped into: Novel intragenic promoter, due to an intronic H3K4me3 peak, see panel (b); post-transcriptional, where there is no evidence for complex transcriptional regulation; Multiple pri-mirnas, where multiple promoters/overlapping pri-mirnas are observed (e.g. mir-17-5p, detail shown in Supplementary Fig. 11); complex genomic region, where assignment of relevant promoter and pri-mirna was difficult. Mature mirna name, genic/intergenic location and Pearson s correlation with pri-mirna expression (orange) are indicated on the right. (b), mir-149-5p does not correlate with host gene expression but instead with the activity of a novel internal promoter. Above, Ensembl transcript of Gpc1 (mir-149 host gene) and location of mir-149. Below, Normalised read coverage of long RNA sequencing (pri-mirna), epigenetic marks (ChIP sequencing) and percentage of DNA methylation 2,12. H3K4me3 indicates presence of a novel promoter for mir-149, which is supported by RNA sequencing (not shown). mir-149 could be transcribed from either or both of these transcripts.

17 Supplementary Fig. 11 Complex control of mir-17/92 group of mirnas expression across multiple genomic loci during reprogramming. Continued on next page.

18 Supplementary Fig. 11 Complex control of mir-17/92 group of mirnas expression across multiple genomic loci during reprogramming. Above, Refseq (black) and further Ensembl (orange) annotated transcripts of the genomic region surrounding the pri-mirnas of mir-17/92 (Mir17hg), mir-106b/25 (Mcm7) and mir-106a/363 (Kis2). Below, Normalised read coverage of pri-mirna, epigenetic marks and percentage of DNA methylation 2,12, with transcriptional start sites indicated by black arrows (derived from the 5 ends of observed pri-mirna and supported by promoter H3K4me3). The mir-17/92 cluster is poorly correlated with its pri-mirna, which might result from observed pri-mirna expression being a conglomeration of multiple transcripts of unknown structure. RNA sequencing and multiple H3K4me3 promoter peaks, one of which is prominently expressed only in the F-class state (TSS indicated by arrow in D16H H3K4me3 panel), support this interpretation. (Note, RNA sequencing also suggests expression of 3 extended variants of pri-mir-17/92 than have previously been annotated). The mir-106b/25 cluster is poorly correlated with its pri-mirna, but this is likely due to post-transcriptional mechanisms (no evidence of complex transcriptional regulation). The mir-106a/363 cluster is extremely well correlated with its pri-mirna expression and is repressed in the F-class state by sequential H3K27me3 and DNA methylation (which are both relieved in ipscs/escs).

19 Supplementary Fig. 12 Expression profile of mirnas with distinct expression between F-state and ipsc cells. Continued on next page.

20 Supplementary Fig. 12 Expression profile of mirnas with distinct expression between F-state and ipsc cells. (a), The mirna profile of an established F-class cell line (clone 1 Day 30, green 13 ) and those of the 13 samples of the high/low OSKM trajectories and controls were clustered as detailed for Supplementary Fig. 7. Only the 67 mirnas with >4-fold difference in expression between F-class and ipscs from Fig. 6a were included and raw tag counts from all libraries were TMM normalized prior to clustering (as detailed in Supplementary Data 1). (b), Expression of primirna and epigenetic signatures around their transcriptional start sites (TSS). Expression profiles were normalised to the average expression of each individual pri-mirna across the 13 samples of the high/low OSKM trajectories and controls. For the mirnas in panel (a) where a TSS could be unambiguously assigned is shown the level of H3K4me3 and H3K27me3, as well as % of DNA methylation. The regulatory DMR for the Dlk1-Dio3 locus mirnas was derived from previous reports 14, mm9 coordinates Chr12:110,768, ,768,750 (see Methods). (c), Pearson s correlation of pri-mirna expression with mature mirnas listed in panel (b), over all samples or in samples groups 2 MEFs-D8H (inclusive), D8H-D18H (inclusive) or D8H-iPSC (including D8H, D16L, D21L, D21, 2 IPSCs and 1 IPSCs). Identifier mirnas with high expression in ipscs have a white background and mirnas with high expression in F-class have a blue background. Individual mature mirnas are coloured as in panel (a) and the red line indicates median level of correlation. P- values were determined using a two tailed, heteroscedastic t-test. (d), Pearsons correlation of mature mirnas with their known, experimentally validated protein targets shown in Fig. 6b&c (where protein data was available). A random background for each trajectory was created by randomly assigning mirna-protein pairings (>10,000). Dots represent individual mirna:protein expression correlations and background density plot shows distribution of correlations (as a percentage). As expected, most high ipsc expressing mirnas were negatively correlated with their targets in the ipsc trajectory (as this was a factor in the original discovery of their targeting interaction; samples included 2 MEFs-D8H, 2 IPSCs, 1 IPSCs and ESCs). Interestingly, for the mirnas highly expressed in the F-class mirnas were mostly anti-correlated or strongly correlated with targets in the F-class trajectory (samples included 2 MEFs-D18H). For the latter, ChIP sequencing data from Chen et al., 2008 suggests that mir-186, Cdk2 and Cdk6 are regulated by Klf4, mir-18 and Smad4 are regulated by Myc and Oct4, while mir-138 is proposed to be in a feed forward loop with Myc, regulating many of its target including Ccnd3 15. Transcription factor co-regulation of mirnas with their targets is thought to be a common regulatory method to contain the upper limits of transcriptional stimulation (termed an incoherent feed-forward loop 16, and highlights the complexity of mirna targeting relationships in reprogramming. [target interactions included for ipsc trajectory analysis: mir-302 and Cdk2, Mecp2, TgfBr2, Ccnd2, Ccnd1,Cdkn1a, Cdk4; mir-290/295 and Rb1, Casp2, Ei24, cdkn1a; Mir-106a/363 and Cdkn1a. Target interactions included for F-class trajectory analysis: mir-138 and Ccnd3, Trp53; mir-186 and Cdk6 and Cdk2; mir-342 and Dmnt1; mir-298 and Cdkn1a; mir-296 and Cdkn1a; mir-10b and Cdkn1a; mir-124 and Stat3; mir-18a and Smad4, Tgfbr2; mir-20a and cdkn1a, Tgfbr2, Ccnd1, Ccnd2 smad4; mir-17 and Tgfbr2, Ccnd1, Cdkn1a, Ccnd2, Smad4 11 ].

21 Supplementary Methods Mapping protocol for total genome analysis Mapping to the genome: High quality tags were mapped such that one random mapping position was selected for tags mapping to more than 1 locations by invoking the M 1 option in Bowtie command. This set of mapping referred to as genomic alignments was used for counting tag distribution in various genomic features. Bowtie Command, Genomic alignments: bowtie -f -C - Q input.qv.qual --integer-quals -l 20 --nomaqround --maxbts tryhard --chunkmbs M 1 -a --best --strata --snpfrac col-cqual --colkeepends --sam --mapq 20 --offrate 2 --threads 12 --shmem reference_bowtie_colorspace_index input.csfasta >output.sam Assigning tags to genomic features: We used the genomic alignments to determine the proportion of tags mapping to various genomic features. Coordinates for genomic features such as protein coding loci, non-coding RNA loci and genomic repeats (e.g. LINE, SINE, simple repeats) were obtained from Ensembl v67.additionally, mirbase v18 annotations were used for mirna loci, pirna coordinates were obtained from the pirnabank 17 and protrac 18,and 18/28S rrna were obtained from the GenBank. A custom BAM file parser was designed (Supplementary Software 3) by using the BamTools API 19 to identify the number of tags that overlap each genomic feature. Tags were preferentially assigned to mirna loci even if multiple, overlapping genomic features were present for that locus. If the mapped position of the tag overlapped a non-mirna locus with multiple, overlapping genomic features, the tag was randomly assigned to one genomic feature. Mitochondrial genome-focused analysis: We included only uniquely mapping tags in the analysis of mitochondria-derived tags for increased specificity (shown in Supplementary Fig. 2). Normalisation: Tag numbers assigned to each genomic feature (or mitochondrial locus) were normalised to library size and expressed as CPM. Analysis of mitochondrial gene expression in long RNA sequencing libraries: RNA was prepared as described 2, with no depletion of mitochondrial rrna. Sequence mapping was performed using Applied Biosystems LifeScope v2.5 whole transcriptome (paired-end) analysis pipeline against the NCBIM37 (mm9) genome and exon-junction libraries constructed from the Ensembl v64 gene model. This pipeline first removes potential contaminant reads by aligning to a filter set containing nuclear rrna, nuclear trna, adaptor sequences and retrotransposon sequences. Following filtering, LifeScope then aligns all reads to the genome and exon-junction library. Mapping protocol for mirna-focused analysis Mature mirnas derived from multiple genomic loci can be identical in sequence. Therefore randomly assigning tags to all of the identical loci can effectively reduce the number of tags assigned to each individual locus. This random assignment can affect the downstream expression analysis. Therefore, we chose to assign tags to all of the identical loci to retain the true levels of tag counts observed for any mature mirna in our data. To assign the number of tags at each mirna locus, tags were allowed to map to up to 20 locations. If a tag mapped equally to more than 20 locations, one location was randomly chosen. These set of alignments are referred to as mirna alignments from here on. Bowtie Command, mirna alignments:

22 bowtie -f -C - Q input.qv.qual --integer-quals -l 20 --nomaqround --maxbts tryhard --chunkmbs M 20 -a --best --strata --snpfrac col-cqual -- col-keepends --sam --mapq 20 --offrate 2 --threads 12 --shmem reference_bowtie_colorspace_index input.csfasta >output.sam Assigning tags to mirna loci: Tags were assigned to a mature mirna if their 5 mapped position was within +/-3nt of the mirbase annotated 5 start site and were between 20 26nt in length inclusive. These mature mirna tags are referred as mirna tags from here on. Tag count information tables were generated by using Supplementary Software 4. These tables were then loaded into MySQL database. Using Supplementary Software 5, which used the Supplementary Software 6 and MySQL database as input, final counts tables were generated. Normalisation: mirna tag counts for each mature mirna were first normalized against the total mirna mapped tags in each library to correct for library size differences and counts per million (CPM) were obtained. Additionally, tag counts were also normalized for the total RNA output by the trimmed mean of M-value (TMM) scaling method described in ref. 20 to obtain normalized CPM (ncpm), which were used for all expression analysis. Datasets were normalized in three batches for the following analyses: (a) core samples, 2 MEFs, D2H, D5H, D8H, D11H, D16H, D18H high doxycycline treated cells, ESCs, 1 ipscs, 2 ipscs, D16L, D21L and D21 (Fig 1-6, Supplementary Figs. 3-7,9-11, Supplementary Data 1); (b) Day0-4 focused analysis, 2 MEF E, D1H E, D1H E, D2H E, D3H E, D4H E (second sequencing run only) (Supplementary Fig. 8, Supplementary Data 2); (c) clone 1 and core samples, which is (a) plus clone 1, day 30 libraries (Supplementary Fig. 12, Supplementary Data 1: clone 1 sheets). Analysis of isomirs, arm bias, non-templated addition and editing frequencies. This analysis was restricted to mature mirnas or hairpins that had >25 average ncpm in either (a) core samples or (b) Day 0-4 focused analysis. isomirs: mirbase v18 annotated start and end positions for a mature mirna are considered as canonical start and canonical end positions, respectively, for this analysis. mirna tags differing in their 5 start positions compared to the canonical start position were considered as 5 isomirs and those that differed at 3 end positions compared to the canonical end position were considered as 3 isomirs. The percentage of canonical mirna tags for each mature mirna in each sample was calculated with respect to total tags for that mirna, which includes all 5 /3 isomirs, with 1 st and 3 rd quartiles. Arm Bias: The percentage of mirna tags derived from the 5 arm of the hairpin was determined for all mirnas in all samples. The median percentage of 5 arm-derived mirnas for each hairpin was then calculated with 1 st and 3 rd quartiles. Non-templated addition (NTA): mirna tags with a nucleotide mismatch in the last two nucleotides of tags were considered as tags containing NTA. For each mature mirna the number of tags with NTA was recorded as well as the identity of the non-templated base. The percentage of NTA tags was then determined relative to the total mirna tags for a given mature mirna. The median percentage of NTA was then calculated with 1 st and 3 rd quartiles for all mirnas. Note, only NTA distinct from the genomic template can be assessed and thus the true prevalence of NTA is likely

23 under-represented. NTA for individual mirnas can be visualized in associated mirspring 1 documents (Supplementary Data 3-21). Internal Editing: mirna tags with a nucleotide mismatch a) before the last two nucleotides of canonical end position and b) before the last two nucleotides of the tag were considered as tags containing possible editing event. For each mature mirna the number of tags with possible editing event as well as the identity of the base substitution was determined. The median percentage of internally edited micrornas was then calculated with 1 st and 3 rd quartiles. Supplementary References 1. Humphreys, D. T. & Suter, C. M. mirspring: a compact standalone research tool for analyzing mirna-seq data. Nucleic Acids Res. 41, e147, doi: /nar/gkt485 (2013). 2. Hussein, S. M. I. et al. Routes to induced pluripotency: A genome wide, multiple omics characterization. Submitted (2014). 3. Mercer, T. R. et al. The human mitochondrial transcriptome. Cell 146, , doi: /j.cell (2011). 4. Small, I. D., Rackham, O. & Filipovska, A. Organelle transcriptomes: products of a deconstructed genome. Curr. Opin. Microbiol. 16, , doi: /j.mib (2013). 5. Chang, D. D. & Clayton, D. A. Priming of human mitochondrial DNA replication occurs at the light-strand promoter. Proc. Natl. Acad. Sci. U. S. A. 82, (1985). 6. Heo, I. et al. Mono-uridylation of pre-microrna as a key step in the biogenesis of group II let-7 micrornas. Cell 151, , doi: /j.cell (2012). 7. Zheng, G. X. Y. et al. A latent pro-survival function for the mir cluster in mouse embryonic stem cells. PLoS Genet. 7, e , doi: /journal.pgen (2011). 8. Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44-57, doi: /nprot (2009). 9. Yagita, K. et al. Development of the circadian oscillator during differentiation of mouse embryonic stem cells in vitro. Proc. Natl. Acad. Sci. U. S. A. 107, , doi: /pnas (2010). 10. Shimodaira, H. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. 51, , doi: / (2002). 11. Hsu, S. D. et al. mirtarbase: a database curates experimentally validated microrna-target interactions. Nucleic Acids Res. 39, D , doi: /nar/gkq1107 (2011). 12. Lee, D. et al. DNA methylation as a reprogramming modulator: An epigenomic roadmap to induced pluripotency. Submitted (2014). 13. Tonge, P. D. et al. Divergent reprogramming routes lead to alternative stem cell states. Submitted (2014). 14. Sato, S., Yoshida, W., Soejima, H., Nakabayashi, K. & Hata, K. Methylation dynamics of IG- DMR and Gtl2-DMR during murine embryonic and placental development. Genomics 98, , doi: /j.ygeno (2011). 15. Poos, K. et al. How microrna and transcription factor co-regulatory networks affect osteosarcoma cell proliferation. PLoS Comput. Biol. 9, e , doi: /journal.pcbi (2013). 16. Ebert, M. S. & Sharp, P. A. Roles for micrornas in conferring robustness to biological processes. Cell 149, , doi: /j.cell (2012). 17. Sai Lakshmi, S. & Agrawal, S. pirnabank: a web resource on classified and clustered Piwiinteracting RNAs. Nucleic Acids Res. 36, D , doi: /nar/gkm696 (2008).

24 18. Rosenkranz, D. & Zischler, H. protrac--a software for probabilistic pirna cluster detection, visualization and analysis. BMC Bioinformatics 13, 5, doi: / (2012). 19. Barnett, D. W., Garrison, E. K., Quinlan, A. R., Stromberg, M. P. & Marth, G. T. BamTools: a C++ API and toolkit for analyzing and managing BAM files. Bioinformatics 27, , doi: /bioinformatics/btr174 (2011). 20. Robinson, M. D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 11, R25, doi: /gb r25 (2010).

Obstacles and challenges in the analysis of microrna sequencing data

Obstacles and challenges in the analysis of microrna sequencing data Obstacles and challenges in the analysis of microrna sequencing data (mirna-seq) David Humphreys Genomics core Dr Victor Chang AC 1936-1991, Pioneering Cardiothoracic Surgeon and Humanitarian The ABCs

More information

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. Supplementary Figure Legends Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. A. Screenshot of the UCSC genome browser from normalized RNAPII and RNA-seq ChIP-seq data

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. Supplementary Figure 1 Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. (a) Pearson correlation heatmap among open chromatin profiles of different

More information

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Supplemental Figure 1. Small RNA size distribution from different soybean tissues. Supplemental Figure 1. Small RNA size distribution from different soybean tissues. The size of small RNAs was plotted versus frequency (percentage) among total sequences (A, C, E and G) or distinct sequences

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. SUPPLEMENTAL FIGURE AND TABLE LEGENDS Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. A) Cirbp mrna expression levels in various mouse tissues collected around the clock

More information

Nature Genetics: doi: /ng.3731

Nature Genetics: doi: /ng.3731 Supplementary Figure 1 Circadian profiles of Adarb1 transcript and ADARB1 protein in mouse tissues. (a) Overlap of rhythmic transcripts identified in the previous transcriptome analyses. The mouse liver

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1. Differential expression of mirnas from the pri-mir-17-92a locus.

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1. Differential expression of mirnas from the pri-mir-17-92a locus. Supplementary Figure 1 Differential expression of mirnas from the pri-mir-17-92a locus. (a) The mir-17-92a expression unit in the third intron of the host mir-17hg transcript. (b,c) Impact of knockdown

More information

Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression.

Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression. Supplementary Figure 1 Lentiviral Delivery of Combinatorial mirna Expression Constructs Provides Efficient Target Gene Repression. a, Design for lentiviral combinatorial mirna expression and sensor constructs.

More information

Small RNAs and how to analyze them using sequencing

Small RNAs and how to analyze them using sequencing Small RNAs and how to analyze them using sequencing RNA-seq Course November 8th 2017 Marc Friedländer ComputaAonal RNA Biology Group SciLifeLab / Stockholm University Special thanks to Jakub Westholm for

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the

More information

EPIGENOMICS PROFILING SERVICES

EPIGENOMICS PROFILING SERVICES EPIGENOMICS PROFILING SERVICES Chromatin analysis DNA methylation analysis RNA-seq analysis Diagenode helps you uncover the mysteries of epigenetics PAGE 3 Integrative epigenomics analysis DNA methylation

More information

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq) RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome

More information

Supplementary Figure 1 IL-27 IL

Supplementary Figure 1 IL-27 IL Tim-3 Supplementary Figure 1 Tc0 49.5 0.6 Tc1 63.5 0.84 Un 49.8 0.16 35.5 0.16 10 4 61.2 5.53 10 3 64.5 5.66 10 2 10 1 10 0 31 2.22 10 0 10 1 10 2 10 3 10 4 IL-10 28.2 1.69 IL-27 Supplementary Figure 1.

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality. Supplementary Figure 1 Assessment of sample purity and quality. (a) Hematoxylin and eosin staining of formaldehyde-fixed, paraffin-embedded sections from a human testis biopsy collected concurrently with

More information

RNA interference induced hepatotoxicity results from loss of the first synthesized isoform of microrna-122 in mice

RNA interference induced hepatotoxicity results from loss of the first synthesized isoform of microrna-122 in mice SUPPLEMENTARY INFORMATION RNA interference induced hepatotoxicity results from loss of the first synthesized isoform of microrna-122 in mice Paul N Valdmanis, Shuo Gu, Kirk Chu, Lan Jin, Feijie Zhang,

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

Simple, rapid, and reliable RNA sequencing

Simple, rapid, and reliable RNA sequencing Simple, rapid, and reliable RNA sequencing RNA sequencing applications RNA sequencing provides fundamental insights into how genomes are organized and regulated, giving us valuable information about the

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells. Supplementary Figure 1 Characteristics of SEs in T reg and T conv cells. (a) Patterns of indicated transcription factor-binding at SEs and surrounding regions in T reg and T conv cells. Average normalized

More information

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Supplementary Materials RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Junhee Seok 1*, Weihong Xu 2, Ronald W. Davis 2, Wenzhong Xiao 2,3* 1 School of Electrical Engineering,

More information

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

Eukaryotic small RNA Small RNAseq data analysis for mirna identification Eukaryotic small RNA Small RNAseq data analysis for mirna identification P. Bardou, C. Gaspin, S. Maman, J. Mariette, O. Rué, M. Zytnicki INRA Sigenae Toulouse INRA MIA Toulouse GenoToul Bioinfo INRA MaIAGE

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Effect of HSP90 inhibition on expression of endogenous retroviruses. (a) Inducible shrna-mediated Hsp90 silencing in mouse ESCs. Immunoblots of total cell extract expressing the

More information

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities Gábor Boross,2, Katalin Orosz,2 and Illés J. Farkas 2, Department of Biological Physics,

More information

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets 02-716 Cross species analysis of genomics data Computational Prediction of mirnas and their targets Outline Introduction Brief history mirna Biogenesis Why Computational Methods? Computational Methods

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 Asymmetrical function of 5p and 3p arms of mir-181 and mir-30 families and mir-142 and mir-154. (a) Control experiments using mirna sensor vector and empty pri-mirna overexpression

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003)

he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003) MicroRNAs: Genomics, Biogenesis, Mechanism, and Function (D. Bartel Cell 2004) he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003) Vertebrate MicroRNA Genes (Lim et al. Science

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive

More information

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing Last update: 05/10/2017 MODULE 4: SPLICING Lesson Plan: Title MEG LAAKSO Removal of introns from messenger RNA by splicing Objectives Identify splice donor and acceptor sites that are best supported by

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information

Soft Agar Assay. For each cell pool, 100,000 cells were resuspended in 0.35% (w/v)

Soft Agar Assay. For each cell pool, 100,000 cells were resuspended in 0.35% (w/v) SUPPLEMENTARY MATERIAL AND METHODS Soft Agar Assay. For each cell pool, 100,000 cells were resuspended in 0.35% (w/v) top agar (LONZA, SeaKem LE Agarose cat.5004) and plated onto 0.5% (w/v) basal agar.

More information

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping Melanie Rae Simpson PhD candidate Department of Public Health and General Practice Norwegian University of

More information

mirna Dr. S Hosseini-Asl

mirna Dr. S Hosseini-Asl mirna Dr. S Hosseini-Asl 1 2 MicroRNAs (mirnas) are small noncoding RNAs which enhance the cleavage or translational repression of specific mrna with recognition site(s) in the 3 - untranslated region

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets.

Nature Immunology: doi: /ni Supplementary Figure 1. Transcriptional program of the TE and MP CD8 + T cell subsets. Supplementary Figure 1 Transcriptional program of the TE and MP CD8 + T cell subsets. (a) Comparison of gene expression of TE and MP CD8 + T cell subsets by microarray. Genes that are 1.5-fold upregulated

More information

Small RNAs and how to analyze them using sequencing

Small RNAs and how to analyze them using sequencing Small RNAs and how to analyze them using sequencing Jakub Orzechowski Westholm (1) Long- term bioinforma=cs support, Science For Life Laboratory Stockholm (2) Department of Biophysics and Biochemistry,

More information

Transcriptome profiling of the developing male germ line identifies the mir-29 family as a global regulator during meiosis

Transcriptome profiling of the developing male germ line identifies the mir-29 family as a global regulator during meiosis RNA BIOLOGY 2017, VOL. 14, NO. 2, 219 235 http://dx.doi.org/10.1080/15476286.2016.1270002 RESEARCH PAPER Transcriptome profiling of the developing male germ line identifies the mir-29 family as a global

More information

DNA Sequence Bioinformatics Analysis with the Galaxy Platform

DNA Sequence Bioinformatics Analysis with the Galaxy Platform DNA Sequence Bioinformatics Analysis with the Galaxy Platform University of São Paulo, Brazil 28 July - 1 August 2014 Dave Clements Johns Hopkins University Robson Francisco de Souza University of São

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 a Location of GUUUCA motif relative to RNA position 4e+05 3e+05 Reads 2e+05 1e+05 0e+00 35 40 45 50 55 60 65 Distance upstream b 1: Egg 2: L3 3: Adult 4: HES 4e+05 3e+05 2e+05 start

More information

Nature Methods: doi: /nmeth.3115

Nature Methods: doi: /nmeth.3115 Supplementary Figure 1 Analysis of DNA methylation in a cancer cohort based on Infinium 450K data. RnBeads was used to rediscover a clinically distinct subgroup of glioblastoma patients characterized by

More information

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014 Not IN Our Genes - A Different Kind of Inheritance! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014 Epigenetics in Mainstream Media Epigenetics *Current definition:

More information

Prediction of micrornas and their targets

Prediction of micrornas and their targets Prediction of micrornas and their targets Introduction Brief history mirna Biogenesis Computational Methods Mature and precursor mirna prediction mirna target gene prediction Summary micrornas? RNA can

More information

MicroRNA in Cancer Karen Dybkær 2013

MicroRNA in Cancer Karen Dybkær 2013 MicroRNA in Cancer Karen Dybkær RNA Ribonucleic acid Types -Coding: messenger RNA (mrna) coding for proteins -Non-coding regulating protein formation Ribosomal RNA (rrna) Transfer RNA (trna) Small nuclear

More information

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs ChIP-seq hands-on Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs Main goals Becoming familiar with essential tools and formats Visualizing and contextualizing raw data Understand

More information

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Promoter Motif Analysis Shisong Ma 1,2*, Michael Snyder 3, and Savithramma P Dinesh-Kumar 2* 1 School of Life Sciences, University

More information

High AU content: a signature of upregulated mirna in cardiac diseases

High AU content: a signature of upregulated mirna in cardiac diseases https://helda.helsinki.fi High AU content: a signature of upregulated mirna in cardiac diseases Gupta, Richa 2010-09-20 Gupta, R, Soni, N, Patnaik, P, Sood, I, Singh, R, Rawal, K & Rani, V 2010, ' High

More information

MIR retrotransposon sequences provide insulators to the human genome

MIR retrotransposon sequences provide insulators to the human genome Supplementary Information: MIR retrotransposon sequences provide insulators to the human genome Jianrong Wang, Cristina Vicente-García, Davide Seruggia, Eduardo Moltó, Ana Fernandez- Miñán, Ana Neto, Elbert

More information

RNA-seq Introduction

RNA-seq Introduction RNA-seq Introduction DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different functional RNAs Which RNAs (and sometimes then translated

More information

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b) KRas;At KRas;At KRas;At KRas;At a b Supplementary Figure 1. Gene set enrichment analyses. (a) GO gene sets (MSigDB v3. c5) enriched in KRas;Atg5 fl/+ as compared to KRas;Atg5 fl/fl tumors using gene set

More information

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data Bioinformatics methods, models and applications to disease Alex Essebier ChIP-seq experiment To

More information

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods International Journal of Pharmaceutical Science Invention ISSN (Online): 2319 6718, ISSN (Print): 2319 670X Volume 2 Issue 5 May 2013 PP.70-74 Identification of mirnas in Eucalyptus globulus Plant by Computational

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Frequency of alternative-cassette-exon engagement with the ribosome is consistent across data from multiple human cell types and from mouse stem cells. Box plots showing AS frequency

More information

Staged mirna re-regulation patterns during reprogramming

Staged mirna re-regulation patterns during reprogramming Henzler et al. Genome Biology 2013, 14:R149 RESEARCH Staged mirna re-regulation patterns during reprogramming Christine M Henzler 1,5, Zhonghan Li 2, Jason Dang 2, Mary Luz Arcila 1, Hongjun Zhou 1, Jingya

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 U1 inhibition causes a shift of RNA-seq reads from exons to introns. (a) Evidence for the high purity of 4-shU-labeled RNAs used for RNA-seq. HeLa cells transfected with control

More information

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Introduction RNA splicing is a critical step in eukaryotic gene

More information

Supplementary Figure 1 ITGB1 and ITGA11 increase with evidence for heterodimers following HSC activation. (a) Time course of rat HSC activation

Supplementary Figure 1 ITGB1 and ITGA11 increase with evidence for heterodimers following HSC activation. (a) Time course of rat HSC activation Supplementary Figure 1 ITGB1 and ITGA11 increase with evidence for heterodimers following HSC activation. (a) Time course of rat HSC activation indicated by the detection of -SMA and COL1 (log scale).

More information

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA LEO TUNKLE *

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA   LEO TUNKLE * CERNA SEARCH METHOD IDENTIFIED A MET-ACTIVATED SUBGROUP AMONG EGFR DNA AMPLIFIED LUNG ADENOCARCINOMA PATIENTS HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA Email:

More information

microrna PCR System (Exiqon), following the manufacturer s instructions. In brief, 10ng of

microrna PCR System (Exiqon), following the manufacturer s instructions. In brief, 10ng of SUPPLEMENTAL MATERIALS AND METHODS Quantitative RT-PCR Quantitative RT-PCR analysis was performed using the Universal mircury LNA TM microrna PCR System (Exiqon), following the manufacturer s instructions.

More information

SUPPLEMENTAL INFORMATION

SUPPLEMENTAL INFORMATION SUPPLEMENTAL INFORMATION GO term analysis of differentially methylated SUMIs. GO term analysis of the 458 SUMIs with the largest differential methylation between human and chimp shows that they are more

More information

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7 SUPPLEMENTAL DATA Figure S1. Muscle mass changes in different anatomical regions with age. (A) The TA and gastrocnemius muscle showed a significant loss of weight in aged mice (24 month old) compared to

More information

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc. Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets

More information

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first intron IGLL5 mutation depicting biallelic mutations. Red arrows highlight the presence of out of phase

More information

Supplementary Figure 1. Metabolic landscape of cancer discovery pipeline. RNAseq raw counts data of cancer and healthy tissue samples were downloaded

Supplementary Figure 1. Metabolic landscape of cancer discovery pipeline. RNAseq raw counts data of cancer and healthy tissue samples were downloaded Supplementary Figure 1. Metabolic landscape of cancer discovery pipeline. RNAseq raw counts data of cancer and healthy tissue samples were downloaded from TCGA and differentially expressed metabolic genes

More information

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB CSF-NGS January 22, 214 Contents 1 Introduction 1 2 Experimental Details 1 3 Results And Discussion 1 3.1 ERCC spike ins............................................

More information

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis Jian Xu, Ph.D. Children s Research Institute, UTSW Introduction Outline Overview of genomic and next-gen sequencing technologies

More information

Computational Modeling of mirna Biogenesis

Computational Modeling of mirna Biogenesis Computational Modeling of mirna Biogenesis Brian Caffrey and Annalisa Marsico Abstract Over the past few years it has been observed, thanks in no small part to high-throughput methods, that a large proportion

More information

Gene-microRNA network module analysis for ovarian cancer

Gene-microRNA network module analysis for ovarian cancer Gene-microRNA network module analysis for ovarian cancer Shuqin Zhang School of Mathematical Sciences Fudan University Oct. 4, 2016 Outline Introduction Materials and Methods Results Conclusions Introduction

More information

Circular RNAs (circrnas) act a stable mirna sponges

Circular RNAs (circrnas) act a stable mirna sponges Circular RNAs (circrnas) act a stable mirna sponges cernas compete for mirnas Ancestal mrna (+3 UTR) Pseudogene RNA (+3 UTR homolgy region) The model holds true for all RNAs that share a mirna binding

More information

MODULE 3: TRANSCRIPTION PART II

MODULE 3: TRANSCRIPTION PART II MODULE 3: TRANSCRIPTION PART II Lesson Plan: Title S. CATHERINE SILVER KEY, CHIYEDZA SMALL Transcription Part II: What happens to the initial (premrna) transcript made by RNA pol II? Objectives Explain

More information

Chapter 2. Investigation into mir-346 Regulation of the nachr α5 Subunit

Chapter 2. Investigation into mir-346 Regulation of the nachr α5 Subunit 15 Chapter 2 Investigation into mir-346 Regulation of the nachr α5 Subunit MicroRNA s (mirnas) are small (< 25 base pairs), single stranded, non-coding RNAs that regulate gene expression at the post transcriptional

More information

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells CAMDA 2009 October 5, 2009 STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells Guohua Wang 1, Yadong Wang 1, Denan Zhang 1, Mingxiang Teng 1,2, Lang Li 2, and Yunlong Liu 2 Harbin

More information

CRS4 Seminar series. Inferring the functional role of micrornas from gene expression data CRS4. Biomedicine. Bioinformatics. Paolo Uva July 11, 2012

CRS4 Seminar series. Inferring the functional role of micrornas from gene expression data CRS4. Biomedicine. Bioinformatics. Paolo Uva July 11, 2012 CRS4 Seminar series Inferring the functional role of micrornas from gene expression data CRS4 Biomedicine Bioinformatics Paolo Uva July 11, 2012 Partners Pharmaceutical company Fondazione San Raffaele,

More information

cis-regulatory enrichment analysis in human, mouse and fly

cis-regulatory enrichment analysis in human, mouse and fly cis-regulatory enrichment analysis in human, mouse and fly Zeynep Kalender Atak, PhD Laboratory of Computational Biology VIB-KU Leuven Center for Brain & Disease Research Laboratory of Computational Biology

More information

RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers

RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers Xu et al. Molecular Cancer (2019) 18:8 https://doi.org/10.1186/s12943-018-0932-8 LETTER TO THE EDITOR RNA-Seq profiling of circular RNAs in human colorectal Cancer liver metastasis and the potential biomarkers

More information

Cluster Dendrogram. dist(cor(na.omit(tss.exprs.chip[, c(1:10, 24, 27, 30, 48:50, dist(cor(na.omit(tss.exprs.chip[, c(1:99, 103, 104, 109, 110,

Cluster Dendrogram. dist(cor(na.omit(tss.exprs.chip[, c(1:10, 24, 27, 30, 48:50, dist(cor(na.omit(tss.exprs.chip[, c(1:99, 103, 104, 109, 110, A Transcriptome (RNA-seq) Transcriptome (RNA-seq) 3. 2.5 2..5..5...5..5 2. 2.5 3. 2.5 2..5..5...5..5 2. 2.5 Cluster Dendrogram RS_ES3.2 RS_ES3. RS_SHS5.2 RS_SHS5. PS_SHS5.2 PS_SHS5. RS_LJ3 PS_LJ3..4 _SHS5.2

More information

Arabidopsis thaliana small RNA Sequencing. Report

Arabidopsis thaliana small RNA Sequencing. Report Arabidopsis thaliana small RNA Sequencing Report September 2015 Project Information Client Name Client Company / Institution Macrogen Order Number Order ID Species Arabidopsis thaliana Reference UCSC hg19

More information

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature

Supplemental Data. Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Supplemental Data Integrating omics and alternative splicing i reveals insights i into grape response to high temperature Jianfu Jiang 1, Xinna Liu 1, Guotian Liu, Chonghuih Liu*, Shaohuah Li*, and Lijun

More information

Transcriptome-wide analysis of microrna expression in the malaria mosquito Anopheles gambiae

Transcriptome-wide analysis of microrna expression in the malaria mosquito Anopheles gambiae Biryukova et al. BMC Genomics 2014, 15:557 RESEARCH ARTICLE Open Access Transcriptome-wide analysis of microrna expression in the malaria mosquito Anopheles gambiae Inna Biryukova 1*, Tao Ye 2 and Elena

More information

Peak-calling for ChIP-seq and ATAC-seq

Peak-calling for ChIP-seq and ATAC-seq Peak-calling for ChIP-seq and ATAC-seq Shamith Samarajiwa CRUK Autumn School in Bioinformatics 2017 University of Cambridge Overview Peak-calling: identify enriched (signal) regions in ChIP-seq or ATAC-seq

More information

On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles

On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles Ying-Wooi Wan 1,2,4, Claire M. Mach 2,3, Genevera I. Allen 1,7,8, Matthew L. Anderson 2,4,5 *, Zhandong Liu 1,5,6,7 * 1 Departments of Pediatrics

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary text Collectively, we were able to detect ~14,000 expressed genes with RPKM (reads per kilobase per million) > 1 or ~16,000 with RPKM > 0.1 in at least one cell type from oocyte to the morula

More information

To test the possible source of the HBV infection outside the study family, we searched the Genbank

To test the possible source of the HBV infection outside the study family, we searched the Genbank Supplementary Discussion The source of hepatitis B virus infection To test the possible source of the HBV infection outside the study family, we searched the Genbank and HBV Database (http://hbvdb.ibcp.fr),

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 Supplementary Fig. 1: Quality assessment of formalin-fixed paraffin-embedded (FFPE)-derived DNA and nuclei. (a) Multiplex PCR analysis of unrepaired and repaired bulk FFPE gdna from

More information

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22. Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.32 PCOS locus after conditioning for the lead SNP rs10993397;

More information

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD,

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Crawford, GE, Ren, B (2007) Distinct and predictive chromatin

More information

Table S1. Total and mapped reads produced for each ChIP-seq sample

Table S1. Total and mapped reads produced for each ChIP-seq sample Tale S1. Total and mapped reads produced for each ChIP-seq sample Sample Total Reads Mapped Reads Col- H3K27me3 rep1 125662 1334323 (85.76%) Col- H3K27me3 rep2 9176437 7986731 (87.4%) atmi1a//c H3K27m3

More information

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples. Supplementary files Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples. Table S3. Specificity of AGO1- and AGO4-preferred 24-nt

More information

omiras: MicroRNA regulation of gene expression

omiras: MicroRNA regulation of gene expression omiras: MicroRNA regulation of gene expression Sören Müller, Goethe University of Frankfurt am Main Molecular Bioinformatics Group, Institute of Computer Science Plant Molecular Biology Group, Institute

More information

Analyse de données de séquençage haut débit

Analyse de données de séquençage haut débit Analyse de données de séquençage haut débit Vincent Lacroix Laboratoire de Biométrie et Biologie Évolutive INRIA ERABLE 9ème journée ITS 21 & 22 novembre 2017 Lyon https://its.aviesan.fr Sequencing is

More information

Paternal exposure and effects on microrna and mrna expression in developing embryo. Department of Chemical and Radiation Nur Duale

Paternal exposure and effects on microrna and mrna expression in developing embryo. Department of Chemical and Radiation Nur Duale Paternal exposure and effects on microrna and mrna expression in developing embryo Department of Chemical and Radiation Nur Duale Our research question Can paternal preconceptional exposure to environmental

More information

Assignment 5: Integrative epigenomics analysis

Assignment 5: Integrative epigenomics analysis Assignment 5: Integrative epigenomics analysis Due date: Friday, 2/24 10am. Note: no late assignments will be accepted. Introduction CpG islands (CGIs) are important regulatory regions in the genome. What

More information

Supplementary Materials for

Supplementary Materials for www.sciencesignaling.org/cgi/content/full/8/393/rs9/dc1 Supplementary Materials for Identification of potential drug targets for tuberous sclerosis complex by synthetic screens combining CRISPR-based knockouts

More information

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells

Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells Ondrej Podlaha 1, Subhajyoti De 2,3,4, Mithat Gonen 5, Franziska Michor 1 * 1 Department of Biostatistics

More information

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory Computational aspects of ChIP-seq John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory ChIP-seq Using highthroughput sequencing to investigate DNA

More information

Supplementary Information

Supplementary Information Supplementary Information 5-hydroxymethylcytosine-mediated epigenetic dynamics during postnatal neurodevelopment and aging By Keith E. Szulwach 1,8, Xuekun Li 1,8, Yujing Li 1, Chun-Xiao Song 2, Hao Wu

More information

SUPPLEMENTAL FILE. mir-22 and mir-29a are members of the androgen receptor cistrome modulating. LAMC1 and Mcl-1 in prostate cancer

SUPPLEMENTAL FILE. mir-22 and mir-29a are members of the androgen receptor cistrome modulating. LAMC1 and Mcl-1 in prostate cancer 1 SUPPLEMENTAL FILE 2 3 mir-22 and mir-29a are members of the androgen receptor cistrome modulating LAMC1 and Mcl-1 in prostate cancer 4 5 6 Lorenza Pasqualini 1, Huajie Bu 1,2, Martin Puhr 1, Narisu Narisu

More information

SUPPLEMENTARY APPENDIX

SUPPLEMENTARY APPENDIX SUPPLEMENTARY APPENDIX 1) Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort 2) Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature

More information

NJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees Supplementary Materials

NJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees Supplementary Materials NJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees Supplementary Materials Erin K. Molloy 1[ 1 5553 3312] and Tandy Warnow 1[ 1 7717 3514] Department

More information