Pitfalls of Mapping High-Throughput Sequencing Data to Repetitive Sequences: Piwi s Genomic Targets Still Not Identified

Size: px
Start display at page:

Download "Pitfalls of Mapping High-Throughput Sequencing Data to Repetitive Sequences: Piwi s Genomic Targets Still Not Identified"

Transcription

1 Matters Arising Pitfalls of Mapping High-Throughput Sequencing Data to Repetitive Sequences: Piwi s Genomic Targets Still Not Identified Highlights d Published ChIP-seq datasets do not reveal Piwi s genomic binding sites Authors Georgi K. Marinov, Jie Wang,..., Julius Brennecke, Katalin Fejes Toth d Loss of Piwi does not lead to a broad redistribution of Pol II to transposons Correspondence julius.brennecke@imba.oeaw.ac.at (J.B.), kft@caltech.edu (K.F.T.) In Brief Piwi silences transposon transcription in Drosophila ovaries. A previous report claimed the identification of Piwi s genomic binding sites by ChIP-seq. Marinov et al. re-analyzed the published datasets and find no support for an enrichment of Piwi at transposons. Instead, previous conclusions result from flawed bioinformatics analyses. Piwi s genomic binding sites remain unknown. Marinov et al., 2015, Developmental Cell 32, March 23, 2015 ª2015 Elsevier Inc.

2 Developmental Cell Matters Arising Pitfalls of Mapping High-Throughput Sequencing Data to Repetitive Sequences: Piwi s Genomic Targets Still Not Identified Georgi K. Marinov, 1,7 Jie Wang, 2,7 Dominik Handler, 3 Barbara J. Wold, 1 Zhiping Weng, 4 Gregory J. Hannon, 5 Alexei A. Aravin, 1 Phillip D. Zamore, 6 Julius Brennecke, 3, * and Katalin Fejes Toth 1, * 1 Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA 2 Department of Biochemistry, University at Buffalo, Buffalo, NY 14214, USA 3 Institute of Molecular Biotechnology of the Austrian Academy of Sciences IMBA, Vienna Biocenter (VBC), 1030 Vienna, Austria 4 Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA 5 Watson School of Biological Sciences, Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA 6 Howard Hughes Medical Institute, RNA Therapeutics Institute and Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA 7 Co-first author *Correspondence: julius.brennecke@imba.oeaw.ac.at (J.B.), kft@caltech.edu (K.F.T.) SUMMARY Huang et al. (2013) recently reported that chromatin immunoprecipitation sequencing (ChIP-seq) reveals the genome-wide sites of occupancy by Piwi, a pirna-guided Argonaute protein central to transposon silencing in Drosophila. Their study also reported that loss of Piwi causes widespread rewiring of transcriptional patterns, as evidenced by changes in RNA polymerase II occupancy across the genome. Here we reanalyze their data and report that the underlying deep-sequencing dataset does not support the authors genome-wide conclusions. INTRODUCTION PIWI-clade Argonaute proteins and their small RNA guides, PIWI-interacting RNAs (pirnas), collaborate to repress selfish genetic elements such as transposons in animal gonads (Malone and Hannon, 2009; Siomi et al., 2011). The nt pirnas guide PIWI proteins to targets with complementary sequences. One of the three Drosophila PIWI-clade proteins, Piwi is localized to the nucleus and represses transposon expression via transcriptional gene silencing (TGS). Target repression is accompanied by reduced RNA polymerase II (Pol II) occupancy and increased trimethylation of histone H3 Lysine 9 (H3K9me3), a mark of heterochromatin (Le Thomas et al., 2013; Rozhkov et al., 2013; Shpiz et al., 2011; Sienski et al., 2012; Wang and Elgin, 2011). By analogy to centromeric silencing in Schizosaccharomyces pombe (Bühler and Moazed, 2007; Grewal, 2010), these data suggest that pirnas guide Piwi to nascent transcripts at target loci where Piwi promotes TGS and heterochromatin formation. Such a model is intuitively consistent with the findings of Huang et al. (Huang et al., 2013), who reported strong chromatin immunoprecipitation sequencing (ChIP-seq) enrichments for Piwi at many genomic regions, typically transposons, for which complementary pirnas are observed. ChIP experiments in our laboratories, however, have consistently failed to detect significant enrichment of Piwi at Piwi-repressed transposons, despite the use of various cross-linking conditions and different antibodies and tags for immunoprecipitation. We therefore reanalyzed the published ChIP-seq data (Huang et al., 2013). We determined (1) the degree of enrichment for Piwi at transposon loci and (2) the changes in Pol II occupancy at transposon loci upon loss of Piwi. In both cases, our independent analyses failed to confirm the published conclusions. Instead, we found that different data processing methods underlie the different outcomes. We conclude that the genome-wide pattern of Piwi occupancy remains an open question despite multiple attempts to map it using contemporary ChIP-seq methods. RESULTS No Significant Enrichment of Piwi at Transposon Loci in the Huang et al. Datasets For the re-analysis of the Huang et al. deep sequencing data (Huang et al., 2013) we used standard read mapping procedures and retained only reads that align to the genome with % 2 mismatches (for details, see Supplemental Experimental Procedures). For comparative purposes, we applied this strategy to a published H3K9me3 ChIP-seq dataset from Drosophila ovaries (Muerdter et al., 2013). This histone mark is enriched in heterochromatin and on transposons and other genomic repeats. It is also present at transposon insertions repressed by nuclear Piwi via the pirna pathway. To ask whether the Piwi ChIP-seq dataset was enriched for transposon sequences, we first mapped all genome-mapping ChIP-seq and input control reads to a comprehensive list of consensus transposon sequences for Drosophila melanogaster. For each, we calculated normalized RPM values (Reads Per Million sequenced reads; for details, see Supplemental Experimental Procedures). This resulted in Piwi occupancy levels for transposons that were indistinguishable from background (Figure 1A). In contrast, the H3K9me3 mark was as much as 10-fold enriched over most transposons. These results are in marked contrast with the conclusion that 86% of the Piwi Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc. 765

3 A B C Figure 1. Piwi Is Not Enriched over Transposons in the Huang et al. Dataset (A) Absence of enrichment in the Piwi ChIP-seq dataset and high enrichment of H3K9me3 (from Muerdter et al., 2013) over consensus transposons; each dot corresponds to a transposon consensus sequence. (B) The concentration of Piwi signal over transposons in the Huang et al. dataset arises from failure to normalize multiply mapping reads. Shown is the region from Figure 2C of Huang et al. (2013). Top: Piwi ChIP-seq and background (input) data from Huang et al. showing (1) unique alignments; (2) all alignments, with reads normalized for mapping multiplicity; and (3) all alignments, with each alignment treated as a uniquely mapped read. Bottom: data processed per Huang et al. The enrichment of Piwi over repetitive elements is only observed when no multi-read normalization is applied and is seen in both ChIP and control datasets. (C) The minimal Piwi ChIP-seq enrichment observed over some individual transposable elements is well within the range of experimental noise. Shown is the cumulative distribution function (CDF) of the ratio between total ChIP RPM and control/background RPM for each DNA, LINE, or LTR repetitive element (each dot represents an individual TE insertion). Piwi ChIP-seq data from Huang et al. (red) and H3K9me3 data from Muerdter et al. (blue) are plotted alongside the cumulative distribution for 11 transcription factor ChIP-seq datasets from modencode (gray), for which there is no expectation of enrichment at repetitive elements. Only repeat instances with at least 10 RPM in at least one of the ChIP and control datasets for each ChIP/background pairing were included. H3K9me3 showed high average enrichment over background at most of the elements in all three classes. In contrast, the Piwi ChIP-seq data were well within the range of the distributions for modencode transcription factors. ChIP-seq signal overlaps with transposons and repetitive sequences (Huang et al., 2013). Our analysis of the Piwi ChIPseq data does also not support the ChIP-qPCR data presented by Huang et al. in their Figure S1 (Huang et al., 2013), which shows that DNA fragments of two transposons (F-element and 1360) were retrieved at least 10-fold more efficiently in Piwi ChIP experiments compared to control IPs; in fact, neither of these transposons was detectably enriched in the Piwi ChIPseq dataset in our analysis, although both were significantly enriched in the H3K9me3 ChIP-seq data (Figure S1). The positive H3K9me3 ChIP-seq outcome from our analysis shows that a heterochromatin-associated mark can be and was successfully captured and associated DNA efficiently sequenced. This argues against scenarios in which ChIP-enriched heterochromatic regions are detected by qpcr, even though they are missed by ChIP-seq because they are especially poor substrates for library building and/or sequencing. Of note, Huang et al. reported similar Piwi enrichments when ChIP-qPCR experiments were conducted from dissected ovaries compared to whole flies (Figure S1 of Huang et al., 2013). Because Piwi is expressed at high levels only in gonadal cells, ChIP-qPCR signals are predicted to be diluted by somatic nuclei when whole flies instead of gonads are used as experimental input. Next, we analyzed the Piwi ChIP-seq data at the genomic level. Figure 1B depicts a genomic region harboring three transposon insertions; this same region is shown in Figure 2C of Huang et al. (Huang et al., 2013). Read coverage for Piwi ChIPseq and the corresponding input datasets was calculated in three ways: (1) considering only reads that map the genome uniquely, (2) considering all reads mapping to the genome but normalizing each for the number of times it mapped to the genome, and (3) considering all alignments as if each is a unique read, without any normalization. None of the three transposon insertions nor their immediate genomic neighborhoods stood out 766 Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc.

4 A B Figure 2. Distribution of Piwi and H3K9me3 over Repetitive Elements in the Genome (A and B) The average signal distribution over LINE repetitive elements for ChIP (red) and background (yellow) datasets for Piwi from Huang et al. (2013) (A) and for H3K9me3 from Muerdter et al. (2013) (B). The background-normalized enrichment is in black. The 100 bp around the beginning and the end of individual elements are shown to scale; the rest of each LINE element is rescaled to 100 units. The repeat-masker repetitive element annotation from the UCSC Genome Browser was used. A clear enrichment over background is observed in H3K9me3 datasets, even when only uniquely aligning reads are considered. In contrast, the Piwi dataset from Huang et al. is essentially indistinguishable from background. for the transcription factors on the same set of transposons (Figure 1C). In contrast, the H3K9me3 mark was strongly enriched over all transposon classes. Taken together, these analyses show that the published Piwi ChIP-seq datasets do not support a specific enrichment of Piwi at transposons. in the Piwi ChIP data compared to the background when (1) unique reads or (2) normalized reads were considered (Figure 1B). When the reads were (3) not corrected for mapping to multiple genomic sites, transposons emerged as strong peaks relative to flanking genomic sequences. However, transposons also emerged as strong peaks when the control dataset, the input genomic DNA itself, was mapped without accounting for mapping multiplicity. We used each of the three mapping strategies to determine the genome-wide average read density for Piwi ChIP and input datasets over the three major transposable element classes in Drosophila (e.g., LINE elements in Figure 2). In all cases, we found no enrichment of Piwi over background, whereas the H3K9me3 dataset again displayed strong enrichment. Finally, we asked whether the Piwi ChIP dataset was enriched for Piwi due to Piwi occupying a subset of the thousands of transposon insertions in the Drosophila genome. Such a subset might go undetected when analyzing genome-wide average signals. We compared the enrichment of Piwi at individual transposons with that of eleven transcription factors whose genomewide occupancy has been determined from early fly embryos (modencode Consortium, 2010; Nègre et al., 2011); none of these developmental regulators is expected to be selectively enriched at transposon loci. Again, we found no specific enrichment of Piwi at transposon loci: the enrichment of Piwi at transposons was well within the range of enrichment observed The Huang et al. Computational Pipeline Generates Artificial Enrichment of ChIP-Seq Datasets at Repetitive Loci To identify the discrepancy between our standard analysis pipeline and that of Huang et al., we examined the computational pipeline used in their studies (originally described in Yin et al., 2011), which the authors kindly shared with us. Rather than defining enrichments by the ratio of ChIP versus input sample reads, the Huang et al. pipeline identifies genomic regions of Piwi enrichment via a multi-step procedure (see Figure S2 and Supplemental Experimental Procedures for details). Two features of this pipeline could artificially amplify minor differences between ChIP and control datasets into large apparent enrichments at transposons. First, the pipeline makes no correction for reads mapping to multiple genomic locations. Of course, one single read must come from a single genomic locus, no matter how many times it maps to the genome, so all widely used mapping software either randomly assign a multiply mapping read to a single locus or apportion the read among the multiple loci. Without such standard corrections for mapping multiplicity, all datasets both ChIP-seq and input genomic DNA produce artificially elevated signals at repetitive loci such as transposons. Considering that Huang et al. apply a cutoff threshold (see Experimental Procedures), this artificially elevated signal focuses the analysis strongly toward repetitive regions. Second, although the subsequent analysis does take the input datasets into account, it does so in a non-standard way by applying nonlinear transformations to the resulting signal tracks. The consequence is that the final score displays positive enrichments but sets negative enrichments (i.e., depletions) to Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc. 767

5 A C B (legend on next page) 768 Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc.

6 zero. Ultimately, the combination of these steps leads to exclusively positive enrichments preferentially at transposons (Figure 1B), while signal in the direction of depletion is obscured. The algorithm is particularly prone to creating artificial peaks from ChIP-seq datasets with low signal-to-noise ratios (see below). By way of example, we recapitulated the Huang et al. analysis, but swapping the input background and Piwi ChIP-seq data, and then calculated the percentage of signal at annotated repeats. Strikingly, treating the genomic DNA input as the experiment and the PIWI ChIP-seq as the control produced strong signal enrichment at transposons. In fact, an even higher proportion of the final signal mapped to repeats in this analysis than when the data sets were correctly assigned to experiment and control (Figure 3A). The identity of the particular repeats contributing to the final signal, however, differed as is expected if the result stems from erroneously identifying amplified positive noise for true signals. Figure 3B displays the final Huang et al. scores for Piwi ChIP-seq over background and background over Piwi ChIP-seq at three individual, fulllength transposon insertions (Figure 3B). While some transposon insertions showed high signal in the Piwi/background track (e.g., roo), others showed high enrichment in the background/piwi track (e.g., Max) and some transposon insertions showed a mixed signal, in which different portions of the element are highly enriched in either the background or the ChIP tracks (e.g., blood). These observations also suggest that the Huang et al. pipeline has the somewhat counterintuitive effect of generating much higher enrichments over transposons for ChIP datasets that contain very little or no true signal than it does for ChIP datasets that are strongly enriched at genomic features other than transposons. In the latter case, transposons are globally depleted relative to the control because a high fraction of reads is concentrated in regions of true occupancy located elsewhere in the genome. This is not the case in input and poorly enriching ChIP experiments leading to a higher apparent enrichment over TE sequences. Indeed, when we calculated the percentage of signal at transposons for the modencode transcription factor ChIP-seq dataset using the method of Huang et al., we observed highly variable results (Figure 3C). For some developmental regulators, the Huang et al., signal on repeats was similar to the Piwi dataset, while other factors displayed little signal on transposons. The experimental characterization of the true genomic distribution of Piwi on chromatin thus remains an unresolved challenge. The difficulty in obtaining high-quality Piwi ChIP-seq datasets likely reflects the complexity of recovering DNA sequences that are transiently tethered to Piwi protein via nascent RNA. The inherent difficulty in shearing heterochromatin may also contribute to the problem (Teytelman et al., 2009). No Support for Widespread Transcriptional Changes in piwi Mutants Based on the same computational pipeline, Huang et al. also reported that in piwi mutants Pol II is broadly redistributed from protein-coding genes to transposons. We calculated consensus transposon RPM values for the Pol II ChIP-seq datasets and their respective controls (Figure 4A). We found no clear differences between Pol II enrichments over transposons in wild-type versus piwi mutant flies. In both samples, Pol II was depleted at transposons compared to the input (Figures 4A and 4B), likely due to its enrichment at protein-coding genes in the Pol II ChIP-seq data but not the input control. In contrast, Huang et al. reported that Pol II concentrated on transposons in piwi mutants compared to wild-type. A meta-profile of Pol II occupancy at all protein-coding loci showed an 2-fold greater enrichment at promoters in wild-type compared to the mutant (Figure S3). For the piwi mutant dataset this means that proportionally fewer reads originate from expressed genes versus the remainder of the genome. In consequence, more background reads from transposons are recovered, and these are then amplified by the Huang et al. pipeline. Taken together, our analyses find no support for a widespread role of Piwi in specifying patterns of transcription at transposons in the published ChIP-seq datasets. On the other hand, loss of Piwi has been shown in several studies to lead to pronounced changes in Pol II occupancy at pirna-pathway-repressed transposon loci (Le Thomas et al., 2013; Rozhkov et al., 2013; Sienski et al., 2012). We note that these studies analyzed isolated ovaries or cultured ovarian somatic cells rather than entire flies. One conclusion of these studies is that biologically meaningful analyses of Piwi function using ChIP experiments require the use of isolated tissues where nuclear Piwi is highly expressed: the gonads. The biologically relevant pattern of Piwi genomic occupancy remains unknown. Piwi associates with pirnas complementary to virtually all transposon families, and loss of Piwi leads to the selective loss of the of H3K9me3 mark at several transposon insertions (Sienski et al., 2012). These observations suggest that sequence complementarity between pirnas and nascent target transcripts dictates the chromatin occupancy of Piwi. Considering the technical difficulties that have surrounded Piwi ChIP-seq, a first step toward identifying Piwi binding sites should be to verify direct occupancy at one or a few functional genomic target sites using alternative methods Figure 3. The Huang et al. Data Processing Pipeline Generates Artificial Enrichment over Repetitive Regions The Piwi ChIP-seq and input/background datasets were processed following the Huang et al. pipeline ( Piwi ChIP ). In addition, the pipeline was also run swapping the ChIP and the input, i.e., the control sample was treated as ChIP and vice versa, resulting in the background track. (A) The fraction of signal mapping to transposable elements was calculated, revealing higher enrichment in the background than in the Piwi ChIP-seq dataset. (B) Strong apparent enrichment over individual transposable elements was observed in the ChIP track (upper track), as reported by Huang et al., but also in the background track (lower track), and even over different portions of the same transposable element in both tracks (middle track), strongly arguing that the enrichment over transposable elements reported by Huang et al. is a computational artifact. Signal observed on individual copies correlates well with enrichment profiles when mapped to the consensus sequence of the respective transposons (shown below each track). Sequences showing enrichment in the background are indicated with gray blocks to depict the correlations between the signal on individual TE copies and the consensus sequence. (C) Fraction of signal (calculated with the Huang et al. pipeline) mapping to transposable elements for the modencode transcription factor set. Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc. 769

7 A Figure 4. No Redistribution of Pol II over Transposons Is Observed in piwi Mutant Files (A) Scatterplot displaying Pol II ChIP-seq RPM values versus input RPM values over consensus transposable elements in wild-type and piwi mutant flies. (B) Shown are Pol II ChIP-seq and input RPM levels over the transposon consensus sequences of F-element and mdg3. B such as Dam-ID (van Steensel and Henikoff, 2000). These validated sites could then be used as internal standards to establish approaches for the mapping of Piwi on chromatin across the genome. EXPERIMENTAL PROCEDURES Data Processing A detailed description of our computational analysis is provided in the Supplemental Experimental Procedures. In summary, the data from Huang et al. (2013) as well as from Muerdter et al. (2013) were processed using both the Huang et al. pipeline and more conventional procedures, incorporating three different signal normalization approaches. We aligned reads to the Drosophila melanogaster genome (dm3) using Bowtie (Langmead, 2010; version ) and then generated signal tracks by calculating: (1) normalized (RPM) coverage using only uniquely alignable reads; (2) RPM coverage using all alignments, weighting each according to the number of locations in the genome to which the read maps; and (3) RPM coverage using all alignments treated as if they were uniquely aligned reads (i.e., without normalization for multi-mappers, as in the Huang et al. pipeline). The Huang et al. pipeline was reproduced according to the description and parameters presented in Yin et al. (2011). Briefly, it begins by recursively aligning reads with SOAP, allowing up to five mismatches and four indels. Alignments are then converted into 5 0 coordinates, the chromosomes are split into 50 base pair (bp) bins, and each alignment contributes to ten bins according to a weighting scheme that decreases its weight in more distant bins. The scores are then normalized according to the total number of alignments (rather than the total number of reads, i.e., no multi-mapping normalization is applied) and a critical value is calculated for each ChIP/Input pair so that beyond that value the bin values are always higher in the ChIP than in the control dataset (Figure S2); a normalizer score is calculated based on the bins with values lower than the critical value, and is applied to the ChIP. The ChIP is further normalized by subtracting the background. Critically, when this step is performed, negative values are set to zero, leading to loss of data over regions of depletion relative to background. Finally, scores are divided by the trimmed mean, log-transformed, and again set to zero if negative. Repeat Analysis RepeatMasker annotation, downloaded from the UCSC Genome Browser, was used for the analysis of repetitive element coverage in genomic space. Consensus repetitive elements were downloaded from FlyBase (Marygold et al., 2013); reads were aligned against them using Bowtie, allowing for three mismatches and unlimited multi-mappers, and normalized RPM values calculated for each element. SUPPLEMENTAL INFORMATION Supplemental Information includes Supplemental Experimental Procedures and three figures and can be found with this article online at org/ /j.devcel AUTHOR CONTRIBUTIONS G.K.M., J.W., and D.H. performed the computational analyses; all authors analyzed the data and wrote the manuscript. ACKNOWLEDGMENTS We thank Haifan Lin for kindly sharing the detailed computational pipeline underlying the analyses in Huang et al. (2013). Received: July 17, 2014 Revised: December 18, 2014 Accepted: January 14, 2015 Published: March 23, Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc.

8 REFERENCES Bühler, M., and Moazed, D. (2007). Transcription and RNAi in heterochromatic gene silencing. Nat. Struct. Mol. Biol. 14, Grewal, S.I. (2010). RNAi-dependent formation of heterochromatin and its diverse functions. Curr. Opin. Genet. Dev. 20, Huang, X.A., Yin, H., Sweeney, S., Raha, D., Snyder, M., and Lin, H. (2013). A major epigenetic programming mechanism guided by pirnas. Dev. Cell 24, Langmead, B. (2010). Aligning short sequencing reads with Bowtie. Curr Protoc Bioinformatics. Chapter 11. Unit bi1107s32. Le Thomas, A., Rogers, A.K., Webster, A., Marinov, G.K., Liao, S.E., Perkins, E.M., Hur, J.K., Aravin, A.A., and Tóth, K.F. (2013). Piwi induces pirna-guided transcriptional silencing and establishment of a repressive chromatin state. Genes Dev. 27, Malone, C.D., and Hannon, G.J. (2009). Small RNAs as guardians of the genome. Cell 136, Marygold, S.J., Leyland, P.C., Seal, R.L., Goodman, J.L., Thurmond, J., Strelets, V.B., and Wilson, R.J.; FlyBase consortium (2013). FlyBase: improvements to the bibliography. Nucleic Acids Res. 41, D751 D757. modencode Consortium, Roy, S., Ernst, J., Kharchenko, P.V., Kheradpour, P., Negre, N., Eaton, M.L., Landolin, J.M., Bristow, C.A., Ma, L., et al. (2010). Identification of functional elements and regulatory circuits by Drosophila modencode. Science 330, Muerdter, F., Guzzardo, P.M., Gillis, J., Luo, Y., Yu, Y., Chen, C., Fekete, R., and Hannon, G.J. (2013). A genome-wide RNAi screen draws a genetic framework for transposon control and primary pirna biogenesis in Drosophila. Mol. Cell 50, Nègre, N., Brown, C.D., Ma, L., Bristow, C.A., Miller, S.W., Wagner, U., Kheradpour, P., Eaton, M.L., Loriaux, P., Sealfon, R., et al. (2011). A cis-regulatory map of the Drosophila genome. Nature 471, Rozhkov, N.V., Hammell, M., and Hannon, G.J. (2013). Multiple roles for Piwi in silencing Drosophila transposons. Genes Dev. 27, Shpiz, S., Olovnikov, I., Sergeeva, A., Lavrov, S., Abramov, Y., Savitsky, M., and Kalmykova, A. (2011). Mechanism of the pirna-mediated silencing of Drosophila telomeric retrotransposons. Nucleic Acids Res. 39, Sienski, G., Dönertas, D., and Brennecke, J. (2012). Transcriptional silencing of transposons by Piwi and maelstrom and its impact on chromatin state and gene expression. Cell 151, Siomi, M.C., Sato, K., Pezic, D., and Aravin, A.A. (2011). PIWI-interacting small RNAs: the vanguard of genome defence. Nat. Rev. Mol. Cell Biol. 12, Teytelman, L., Ozaydin, B., Zill, O., Lefrançois, P., Snyder, M., Rine, J., and Eisen, M.B. (2009). Impact of chromatin structures on DNA processing for genomic analyses. PLoS ONE 4, e6700. van Steensel, B., and Henikoff, S. (2000). Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nat. Biotechnol. 18, Wang, S.H., and Elgin, S.C. (2011). Drosophila Piwi functions downstream of pirna production mediating a chromatin-based transposon silencing mechanism in female germ line. Proc. Natl. Acad. Sci. USA 108, Yin, H., Sweeney, S., Raha, D., Snyder, M., and Lin, H. (2011). A high-resolution whole-genome map of key chromatin modifications in the adult Drosophila melanogaster. PLoS Genet. 7, e Developmental Cell 32, , March 23, 2015 ª2015 Elsevier Inc. 771

A Transgenerational Process Defines pirna Biogenesis in Drosophila virilis

A Transgenerational Process Defines pirna Biogenesis in Drosophila virilis Report A Transgenerational Process Defines pirna Biogenesis in Drosophila virilis Graphical Abstract Authors Adrien Le Thomas, Georgi K. Marinov, Alexei A. Aravin Correspondence aaa@caltech.edu In Brief

More information

pirna pathway targets active LINE1 elements to establish the repressive H3K9me3markingermcells

pirna pathway targets active LINE1 elements to establish the repressive H3K9me3markingermcells pirna pathway targets active LINE1 elements to establish the repressive H3K9me3markingermcells Dubravka Pezic, 1,3 Sergei A. Manakov, 1,3 Ravi Sachidanandam, 2 and Alexei A. Aravin 1,4 1 Division of Biology

More information

Repressive Transcription

Repressive Transcription Repressive Transcription The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published Publisher Guenther, M. G., and R. A.

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data

Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data Processing, integrating and analysing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data Bioinformatics methods, models and applications to disease Alex Essebier ChIP-seq experiment To

More information

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. Supplementary Figure 1 Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types. (a) Pearson correlation heatmap among open chromatin profiles of different

More information

Peak-calling for ChIP-seq and ATAC-seq

Peak-calling for ChIP-seq and ATAC-seq Peak-calling for ChIP-seq and ATAC-seq Shamith Samarajiwa CRUK Autumn School in Bioinformatics 2017 University of Cambridge Overview Peak-calling: identify enriched (signal) regions in ChIP-seq or ATAC-seq

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Effect of HSP90 inhibition on expression of endogenous retroviruses. (a) Inducible shrna-mediated Hsp90 silencing in mouse ESCs. Immunoblots of total cell extract expressing the

More information

ChIP-seq data analysis

ChIP-seq data analysis ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Immunofluorescence (IF) confirms absence of H3K9me in met-2 set-25 worms.

Nature Genetics: doi: /ng Supplementary Figure 1. Immunofluorescence (IF) confirms absence of H3K9me in met-2 set-25 worms. Supplementary Figure 1 Immunofluorescence (IF) confirms absence of H3K9me in met-2 set-25 worms. IF images of wild-type (wt) and met-2 set-25 worms showing the loss of H3K9me2/me3 at the indicated developmental

More information

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study. mc mc mc mc SUP mc mc Supplemental Figure. Genes showing ectopic HK9 dimethylation in this study are DNA hypermethylated in Lister et al. study. Representative views of genes that gain HK9m marks in their

More information

MIR retrotransposon sequences provide insulators to the human genome

MIR retrotransposon sequences provide insulators to the human genome Supplementary Information: MIR retrotransposon sequences provide insulators to the human genome Jianrong Wang, Cristina Vicente-García, Davide Seruggia, Eduardo Moltó, Ana Fernandez- Miñán, Ana Neto, Elbert

More information

High Throughput Sequence (HTS) data analysis. Lei Zhou

High Throughput Sequence (HTS) data analysis. Lei Zhou High Throughput Sequence (HTS) data analysis Lei Zhou (leizhou@ufl.edu) High Throughput Sequence (HTS) data analysis 1. Representation of HTS data. 2. Visualization of HTS data. 3. Discovering genomic

More information

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Supplementary Figure 1 7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans. Regions targeted by the Even and Odd ChIRP probes mapped to a secondary structure model 56 of the

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature23267 Discussion Our findings reveal unique roles for the methylation states of histone H3K9 in RNAi-dependent and - independent heterochromatin formation. Clr4 is the sole S. pombe enzyme

More information

Piwi function and pirna cluster regulation : Drosophila melanogaster

Piwi function and pirna cluster regulation : Drosophila melanogaster Piwi function and pirna cluster regulation : Drosophila melanogaster Adrien Le Thomas To cite this version: Adrien Le Thomas. Piwi function and pirna cluster regulation : Drosophila melanogaster. Development

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality.

Nature Genetics: doi: /ng Supplementary Figure 1. Assessment of sample purity and quality. Supplementary Figure 1 Assessment of sample purity and quality. (a) Hematoxylin and eosin staining of formaldehyde-fixed, paraffin-embedded sections from a human testis biopsy collected concurrently with

More information

MODULE 3: TRANSCRIPTION PART II

MODULE 3: TRANSCRIPTION PART II MODULE 3: TRANSCRIPTION PART II Lesson Plan: Title S. CATHERINE SILVER KEY, CHIYEDZA SMALL Transcription Part II: What happens to the initial (premrna) transcript made by RNA pol II? Objectives Explain

More information

Supplemental Figures Legends and Supplemental Figures. for. pirna-guided slicing of transposon transcripts enforces their transcriptional

Supplemental Figures Legends and Supplemental Figures. for. pirna-guided slicing of transposon transcripts enforces their transcriptional Supplemental Figures Legends and Supplemental Figures for pirn-guided slicing of transposon transcripts enforces their transcriptional silencing via specifying the nuclear pirn repertoire Kirsten-ndré

More information

Histones modifications and variants

Histones modifications and variants Histones modifications and variants Dr. Institute of Molecular Biology, Johannes Gutenberg University, Mainz www.imb.de Lecture Objectives 1. Chromatin structure and function Chromatin and cell state Nucleosome

More information

Table S1. Total and mapped reads produced for each ChIP-seq sample

Table S1. Total and mapped reads produced for each ChIP-seq sample Tale S1. Total and mapped reads produced for each ChIP-seq sample Sample Total Reads Mapped Reads Col- H3K27me3 rep1 125662 1334323 (85.76%) Col- H3K27me3 rep2 9176437 7986731 (87.4%) atmi1a//c H3K27m3

More information

Eukaryotic Gene Regulation

Eukaryotic Gene Regulation Eukaryotic Gene Regulation Chapter 19: Control of Eukaryotic Genome The BIG Questions How are genes turned on & off in eukaryotes? How do cells with the same genes differentiate to perform completely different,

More information

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012 Elucidating Transcriptional Regulation at Multiple Scales Using High-Throughput Sequencing, Data Integration, and Computational Methods Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder

More information

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016 Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf Rachel Schwope EPIGEN May 24-27, 2016 What does H3K4 methylation do? Plant of interest: Vitis vinifera Culturally important

More information

RNA-seq Introduction

RNA-seq Introduction RNA-seq Introduction DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different functional RNAs Which RNAs (and sometimes then translated

More information

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells CAMDA 2009 October 5, 2009 STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells Guohua Wang 1, Yadong Wang 1, Denan Zhang 1, Mingxiang Teng 1,2, Lang Li 2, and Yunlong Liu 2 Harbin

More information

The Epigenome Tools 2: ChIP-Seq and Data Analysis

The Epigenome Tools 2: ChIP-Seq and Data Analysis The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu http://zanglab.com PHS5705: Public Health Genomics March 20, 2017 1 Outline Epigenome: basics review ChIP-seq overview

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

Tutorial. ChIP Sequencing. Sample to Insight. September 15, 2016

Tutorial. ChIP Sequencing. Sample to Insight. September 15, 2016 ChIP Sequencing September 15, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com ChIP Sequencing

More information

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs ChIP-seq hands-on Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs Main goals Becoming familiar with essential tools and formats Visualizing and contextualizing raw data Understand

More information

EPIGENOMICS PROFILING SERVICES

EPIGENOMICS PROFILING SERVICES EPIGENOMICS PROFILING SERVICES Chromatin analysis DNA methylation analysis RNA-seq analysis Diagenode helps you uncover the mysteries of epigenetics PAGE 3 Integrative epigenomics analysis DNA methylation

More information

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. Supplementary Figure Legends Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63. A. Screenshot of the UCSC genome browser from normalized RNAPII and RNA-seq ChIP-seq data

More information

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand ChIP-seq analysis J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand Tuesday : quick introduction to ChIP-seq and peak-calling (Presentation + Practical session)

More information

Small RNAs and how to analyze them using sequencing

Small RNAs and how to analyze them using sequencing Small RNAs and how to analyze them using sequencing RNA-seq Course November 8th 2017 Marc Friedländer ComputaAonal RNA Biology Group SciLifeLab / Stockholm University Special thanks to Jakub Westholm for

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory

Computational aspects of ChIP-seq. John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory Computational aspects of ChIP-seq John Marioni Research Group Leader European Bioinformatics Institute European Molecular Biology Laboratory ChIP-seq Using highthroughput sequencing to investigate DNA

More information

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes

Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor suppressor genes Kaifu Chen 1,2,3,4,5,10, Zhong Chen 6,10, Dayong Wu 6, Lili Zhang 7, Xueqiu Lin 1,2,8,

More information

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014

Not IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014 Not IN Our Genes - A Different Kind of Inheritance! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014 Epigenetics in Mainstream Media Epigenetics *Current definition:

More information

The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome

The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome Yutao Fu 1, Manisha Sinha 2,3, Craig L. Peterson 3, Zhiping Weng 1,4,5 * 1 Bioinformatics Program,

More information

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles ChromHMM Tutorial Jason Ernst Assistant Professor University of California, Los Angeles Talk Outline Chromatin states analysis and ChromHMM Accessing chromatin state annotations for ENCODE2 and Roadmap

More information

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. SUPPLEMENTAL FIGURE AND TABLE LEGENDS Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells. A) Cirbp mrna expression levels in various mouse tissues collected around the clock

More information

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016

Session 6: Integration of epigenetic data. Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016 Session 6: Integration of epigenetic data Peter J Park Department of Biomedical Informatics Harvard Medical School July 18-19, 2016 Utilizing complimentary datasets Frequent mutations in chromatin regulators

More information

Transcript-indexed ATAC-seq for immune profiling

Transcript-indexed ATAC-seq for immune profiling Transcript-indexed ATAC-seq for immune profiling Technical Journal Club 22 nd of May 2018 Christina Müller Nature Methods, Vol.10 No.12, 2013 Nature Biotechnology, Vol.32 No.7, 2014 Nature Medicine, Vol.24,

More information

Metadata of the chapter that will be visualized online

Metadata of the chapter that will be visualized online Metadata of the chapter that will be visualized online ChapterTitle Chapter Sub-Title Advanced Analysis of Human Plasma Circulating DNA Sequences Produced by Parallel Tagged Sequencing on the 454 Platform

More information

cis-regulatory enrichment analysis in human, mouse and fly

cis-regulatory enrichment analysis in human, mouse and fly cis-regulatory enrichment analysis in human, mouse and fly Zeynep Kalender Atak, PhD Laboratory of Computational Biology VIB-KU Leuven Center for Brain & Disease Research Laboratory of Computational Biology

More information

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples. Supplementary files Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples. Table S3. Specificity of AGO1- and AGO4-preferred 24-nt

More information

Alternative splicing. Biosciences 741: Genomics Fall, 2013 Week 6

Alternative splicing. Biosciences 741: Genomics Fall, 2013 Week 6 Alternative splicing Biosciences 741: Genomics Fall, 2013 Week 6 Function(s) of RNA splicing Splicing of introns must be completed before nuclear RNAs can be exported to the cytoplasm. This led to early

More information

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing Last update: 05/10/2017 MODULE 4: SPLICING Lesson Plan: Title MEG LAAKSO Removal of introns from messenger RNA by splicing Objectives Identify splice donor and acceptor sites that are best supported by

More information

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region methylation in relation to PSS and fetal coupling. A, PSS values for participants whose placentas showed low,

More information

CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells

CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells SUPPLEMENTARY INFORMATION CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells Lusy Handoko 1,*, Han Xu 1,*, Guoliang Li 1,*, Chew Yee Ngan 1, Elaine Chew 1, Marie Schnapp 1, Charlie Wah

More information

Allelic reprogramming of the histone modification H3K4me3 in early mammalian development

Allelic reprogramming of the histone modification H3K4me3 in early mammalian development Allelic reprogramming of the histone modification H3K4me3 in early mammalian development 张戈 Method and material STAR ChIP seq (small-scale TELP-assisted rapid ChIP seq) 200 mouse embryonic stem cells PWK/PhJ

More information

An epigenetic approach to understanding (and predicting?) environmental effects on gene expression

An epigenetic approach to understanding (and predicting?) environmental effects on gene expression www.collaslab.com An epigenetic approach to understanding (and predicting?) environmental effects on gene expression Philippe Collas University of Oslo Institute of Basic Medical Sciences Stem Cell Epigenetics

More information

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and Promoter Motif Analysis Shisong Ma 1,2*, Michael Snyder 3, and Savithramma P Dinesh-Kumar 2* 1 School of Life Sciences, University

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive

More information

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Supplemental Figure 1. Small RNA size distribution from different soybean tissues. Supplemental Figure 1. Small RNA size distribution from different soybean tissues. The size of small RNAs was plotted versus frequency (percentage) among total sequences (A, C, E and G) or distinct sequences

More information

The genetics of heterochromatin. in metazoa. mutations by means of X-ray irradiation" "for the discovery of the production of

The genetics of heterochromatin. in metazoa. mutations by means of X-ray irradiation for the discovery of the production of The genetics of heterochromatin in metazoa 1 Hermann Joseph Muller 1946 Nobel Prize in Medicine: "for the discovery of the production of mutations by means of X-ray irradiation" 3 4 The true meaning of

More information

Analysis of the peroxisome proliferator-activated receptor-β/δ (PPARβ/δ) cistrome reveals novel co-regulatory role of ATF4

Analysis of the peroxisome proliferator-activated receptor-β/δ (PPARβ/δ) cistrome reveals novel co-regulatory role of ATF4 Khozoie et al. BMC Genomics 2012, 13:665 RESEARCH ARTICLE Open Access Analysis of the peroxisome proliferator-activated receptor-β/δ (PPARβ/δ) cistrome reveals novel co-regulatory role of ATF4 Combiz Khozoie

More information

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB

RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB RNA-Seq Preparation Comparision Summary: Lexogen, Standard, NEB CSF-NGS January 22, 214 Contents 1 Introduction 1 2 Experimental Details 1 3 Results And Discussion 1 3.1 ERCC spike ins............................................

More information

Nature Biotechnology: doi: /nbt.1904

Nature Biotechnology: doi: /nbt.1904 Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is

More information

2009 LANDES BIOSCIENCE. DO NOT DISTRIBUTE.

2009 LANDES BIOSCIENCE. DO NOT DISTRIBUTE. [Epigenetics 4:2, 1-6; 16 February 2009]; 2009 Landes Bioscience Research Paper Determining the conservation of DNA methylation in Arabidopsis This manuscript has been published online, prior to printing.once

More information

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Introduction RNA splicing is a critical step in eukaryotic gene

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells. Supplementary Figure 1 Characteristics of SEs in T reg and T conv cells. (a) Patterns of indicated transcription factor-binding at SEs and surrounding regions in T reg and T conv cells. Average normalized

More information

Plasticity in patterns of histone modifications and chromosomal proteins in Drosophila heterochromatin

Plasticity in patterns of histone modifications and chromosomal proteins in Drosophila heterochromatin Research Plasticity in patterns of histone modifications and chromosomal proteins in Drosophila heterochromatin Nicole C. Riddle, 1,9 Aki Minoda, 2,9 Peter V. Kharchenko, 3,9 Artyom A. Alekseyenko, 4 Yuri

More information

This is a published version of a paper published in PLoS genetics. Access to the published version may require subscription.

This is a published version of a paper published in PLoS genetics. Access to the published version may require subscription. Umeå University This is a published version of a paper published in PLoS genetics. Citation for the published paper: Holmqvist, P., Boija, A., Philip, P., Crona, F., Stenberg, P. et al. (2012) "Preferential

More information

Chip Seq Peak Calling in Galaxy

Chip Seq Peak Calling in Galaxy Chip Seq Peak Calling in Galaxy Chris Seward PowerPoint by Pei-Chen Peng Chip-Seq Peak Calling in Galaxy Chris Seward 2018 1 Introduction This goals of the lab are as follows: 1. Gain experience using

More information

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford

Lecture 8 Understanding Transcription RNA-seq analysis. Foundations of Computational Systems Biology David K. Gifford Lecture 8 Understanding Transcription RNA-seq analysis Foundations of Computational Systems Biology David K. Gifford 1 Lecture 8 RNA-seq Analysis RNA-seq principles How can we characterize mrna isoform

More information

Exploring chromatin regulation by ChIP-Sequencing

Exploring chromatin regulation by ChIP-Sequencing Exploring chromatin regulation by ChIP-Sequencing From datasets quality assessment, enrichment patterns identification and multi-profiles integration to the reconstitution of gene regulatory wires describing

More information

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni.

Supplementary Figure 1. Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Nature Immunology: doi: /ni. Supplementary Figure 1 Efficiency of Mll4 deletion and its effect on T cell populations in the periphery. Expression of Mll4 floxed alleles (16-19) in naive CD4 + T cells isolated from lymph nodes and

More information

mirna Dr. S Hosseini-Asl

mirna Dr. S Hosseini-Asl mirna Dr. S Hosseini-Asl 1 2 MicroRNAs (mirnas) are small noncoding RNAs which enhance the cleavage or translational repression of specific mrna with recognition site(s) in the 3 - untranslated region

More information

Genetics and Genomics in Medicine Chapter 6 Questions

Genetics and Genomics in Medicine Chapter 6 Questions Genetics and Genomics in Medicine Chapter 6 Questions Multiple Choice Questions Question 6.1 With respect to the interconversion between open and condensed chromatin shown below: Which of the directions

More information

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays

RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Supplementary Materials RASA: Robust Alternative Splicing Analysis for Human Transcriptome Arrays Junhee Seok 1*, Weihong Xu 2, Ronald W. Davis 2, Wenzhong Xiao 2,3* 1 School of Electrical Engineering,

More information

Sirt1 Hmg20b Gm (0.17) 24 (17.3) 877 (857)

Sirt1 Hmg20b Gm (0.17) 24 (17.3) 877 (857) 3 (0.17) 24 (17.3) Sirt1 Hmg20 Gm4763 877 (857) c d Suppl. Figure 1. Screen validation for top candidate antagonists of Dot1L (a) Numer of genes with one (gray), two (cyan) or three (red) shrna scored

More information

Comparative analyses of histone H3K9 trimethylations in the heart and spleen of normal humans

Comparative analyses of histone H3K9 trimethylations in the heart and spleen of normal humans Comparative analyses of histone H3K9 trimethylations in the heart and spleen of normal humans W. Sui 1, C. Cao 1,2, W. Che 1, J. Chen 1, W. Xue 1, P. Liu 1, L. Guo 2 and Y. Dai 3 1 Nephrology Department

More information

ddm1a (PFG_3A-51065) ATG ddm1b (PFG_2B-60109) ATG osdrm2 (PFG_3A-04110) osdrm2 osdrm2 osdrm2

ddm1a (PFG_3A-51065) ATG ddm1b (PFG_2B-60109) ATG osdrm2 (PFG_3A-04110) osdrm2 osdrm2 osdrm2 Relative expression.6.5.4.3.2.1 OsDDM1a OsDDM1b OsDRM2 TG TG TG ddm1a (PFG_3-5165) P1 P3 ddm1b (PFG_2-619) P2 531bp 5233bp P1 P4 P3 P2 F1 R1 F1 TG TG TG F1 R1 C ddm1a -/- -/- ddm1b -/- +/- +/- -/- D (PFG_3-411)

More information

Yingying Wei George Wu Hongkai Ji

Yingying Wei George Wu Hongkai Ji Stat Biosci (2013) 5:156 178 DOI 10.1007/s12561-012-9066-5 Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis,

More information

Inferring Biological Meaning from Cap Analysis Gene Expression Data

Inferring Biological Meaning from Cap Analysis Gene Expression Data Inferring Biological Meaning from Cap Analysis Gene Expression Data HRYSOULA PAPADAKIS 1. Introduction This project is inspired by the recent development of the Cap analysis gene expression (CAGE) method,

More information

Supplementary Figure 1 IL-27 IL

Supplementary Figure 1 IL-27 IL Tim-3 Supplementary Figure 1 Tc0 49.5 0.6 Tc1 63.5 0.84 Un 49.8 0.16 35.5 0.16 10 4 61.2 5.53 10 3 64.5 5.66 10 2 10 1 10 0 31 2.22 10 0 10 1 10 2 10 3 10 4 IL-10 28.2 1.69 IL-27 Supplementary Figure 1.

More information

Small RNAs and how to analyze them using sequencing

Small RNAs and how to analyze them using sequencing Small RNAs and how to analyze them using sequencing Jakub Orzechowski Westholm (1) Long- term bioinforma=cs support, Science For Life Laboratory Stockholm (2) Department of Biophysics and Biochemistry,

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

Figure 1: Final annotation map of Contig 9

Figure 1: Final annotation map of Contig 9 Introduction With rapid advances in sequencing technology, particularly with the development of second and third generation sequencing, genomes for organisms from all kingdoms and many phyla have been

More information

Measuring DNA Methylation with the MinION. Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16

Measuring DNA Methylation with the MinION. Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16 Measuring DNA Methylation with the MinION Winston Timp Department of Biomedical Engineering Johns Hopkins University 12/1/16 Epigenetics: Modern Modern Definition of epigenetics involves heritable changes

More information

User Guide. Association analysis. Input

User Guide. Association analysis. Input User Guide TFEA.ChIP is a tool to estimate transcription factor enrichment in a set of differentially expressed genes using data from ChIP-Seq experiments performed in different tissues and conditions.

More information

Circular RNAs (circrnas) act a stable mirna sponges

Circular RNAs (circrnas) act a stable mirna sponges Circular RNAs (circrnas) act a stable mirna sponges cernas compete for mirnas Ancestal mrna (+3 UTR) Pseudogene RNA (+3 UTR homolgy region) The model holds true for all RNAs that share a mirna binding

More information

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD,

Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Heintzman, ND, Stuart, RK, Hon, G, Fu, Y, Ching, CW, Hawkins, RD, Barrera, LO, Van Calcar, S, Qu, C, Ching, KA, Wang, W, Weng, Z, Green, RD, Crawford, GE, Ren, B (2007) Distinct and predictive chromatin

More information

Epigenetics DNA methylation. Biosciences 741: Genomics Fall, 2013 Week 13. DNA Methylation

Epigenetics DNA methylation. Biosciences 741: Genomics Fall, 2013 Week 13. DNA Methylation Epigenetics DNA methylation Biosciences 741: Genomics Fall, 2013 Week 13 DNA Methylation Most methylated cytosines are found in the dinucleotide sequence CG, denoted mcpg. The restriction enzyme HpaII

More information

SUPPLEMENTAL INFORMATION

SUPPLEMENTAL INFORMATION SUPPLEMENTAL INFORMATION GO term analysis of differentially methylated SUMIs. GO term analysis of the 458 SUMIs with the largest differential methylation between human and chimp shows that they are more

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

Assignment 5: Integrative epigenomics analysis

Assignment 5: Integrative epigenomics analysis Assignment 5: Integrative epigenomics analysis Due date: Friday, 2/24 10am. Note: no late assignments will be accepted. Introduction CpG islands (CGIs) are important regulatory regions in the genome. What

More information

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets 02-716 Cross species analysis of genomics data Computational Prediction of mirnas and their targets Outline Introduction Brief history mirna Biogenesis Why Computational Methods? Computational Methods

More information

Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR)

Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR) Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR) O. Solomon, S. Oren, M. Safran, N. Deshet-Unger, P. Akiva, J. Jacob-Hirsch, K. Cesarkas, R. Kabesa, N. Amariglio, R.

More information

Genome-wide Association Studies (GWAS) Pasieka, Science Photo Library

Genome-wide Association Studies (GWAS) Pasieka, Science Photo Library Lecture 5 Genome-wide Association Studies (GWAS) Pasieka, Science Photo Library Chi-squared test to evaluate whether the odds ratio is different from 1. Corrected for multiple testing Source: wikipedia.org

More information

Testi del Syllabus. Testi in italiano. Resp. Did. SCHOEFTNER STEFAN Matricola: Docente SCHOEFTNER STEFAN, 6 CFU

Testi del Syllabus. Testi in italiano. Resp. Did. SCHOEFTNER STEFAN Matricola: Docente SCHOEFTNER STEFAN, 6 CFU Testi del Syllabus Resp. Did. SCHOEFTNER STEFAN Matricola: 022775 Docente SCHOEFTNER STEFAN, 6 CFU Anno offerta: 2017/2018 Insegnamento: 676SM - REGOLAZIONE EPIGENETICA Corso di studio: SM53 - GENOMICA

More information

Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing

Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing PacBio Americas User Group Meeting Sample Prep Workshop June.27.2017 Tyson Clark, Ph.D. For Research Use Only. Not

More information

Pirna Sequence Variants Associated With Prostate Cancer In African Americans And Caucasians

Pirna Sequence Variants Associated With Prostate Cancer In African Americans And Caucasians Yale University EliScholar A Digital Platform for Scholarly Publishing at Yale Public Health Theses School of Public Health January 2015 Pirna Sequence Variants Associated With Prostate Cancer In African

More information

Package NarrowPeaks. August 3, Version Date Type Package

Package NarrowPeaks. August 3, Version Date Type Package Package NarrowPeaks August 3, 2013 Version 1.5.0 Date 2013-02-13 Type Package Title Analysis of Variation in ChIP-seq using Functional PCA Statistics Author Pedro Madrigal , with contributions

More information

A complete next-generation sequencing workfl ow for circulating cell-free DNA isolation and analysis

A complete next-generation sequencing workfl ow for circulating cell-free DNA isolation and analysis APPLICATION NOTE Cell-Free DNA Isolation Kit A complete next-generation sequencing workfl ow for circulating cell-free DNA isolation and analysis Abstract Circulating cell-free DNA (cfdna) has been shown

More information

Obstacles and challenges in the analysis of microrna sequencing data

Obstacles and challenges in the analysis of microrna sequencing data Obstacles and challenges in the analysis of microrna sequencing data (mirna-seq) David Humphreys Genomics core Dr Victor Chang AC 1936-1991, Pioneering Cardiothoracic Surgeon and Humanitarian The ABCs

More information