Chip Seq Peak Calling in Galaxy

Similar documents
DNA Sequence Bioinformatics Analysis with the Galaxy Platform

DNA-seq Bioinformatics Analysis: Copy Number Variation

ChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs

The Epigenome Tools 2: ChIP-Seq and Data Analysis

Part-II: Statistical analysis of ChIP-seq data

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

Module 3: Pathway and Drug Development

Tutorial. ChIP Sequencing. Sample to Insight. September 15, 2016

High Throughput Sequence (HTS) data analysis. Lei Zhou

Data mining with Ensembl Biomart. Stéphanie Le Gras

Training Peaks P90X and P90X2 Workout Schedule Instructions

User Guide. Association analysis. Input

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Fully Automated IFA Processor LIS User Manual

ChIP-seq data analysis

Supplemental Figure S1. Tertiles of FKBP5 promoter methylation and internal regulatory region

Managing and Taking Notes

Managing and Taking Notes

Assignment 5: Integrative epigenomics analysis

Hands-On Ten The BRCA1 Gene and Protein

Creating YouTube Captioning

Instructor Guide to EHR Go

Statistical analysis of ChIP-seq data

Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research

Utilizing the SAP ISR Report to Monitor ISRs

Student Guide to EHR Go

The Hospital Anxiety and Depression Scale Guidance and Information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Analysis with SureCall 2.1

Unit 1: Introduction to the Operating System, Computer Systems, and Networks 1.1 Define terminology Prepare a list of terms with definitions

Exploring chromatin regulation by ChIP-Sequencing

iworx Sample Lab Experiment AN-5: Cockroach Leg Mechanoreceptors

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

cis-regulatory enrichment analysis in human, mouse and fly

Click on the Case Entry tab and the Procedure Menu will display. To add new procedures, click on Add.

MODULE 3: TRANSCRIPTION PART II

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

Obstacles and challenges in the analysis of microrna sequencing data

OECD QSAR Toolbox v.4.2. An example illustrating RAAF scenario 6 and related assessment elements

Quick guide for 3shape order form

Steps to Creating a New Workout Program

HOW TO USE THE BENCHMARK CALENDAR SYSTEM

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Care Pathways User Guide

Molecular Characterization of Tumors Using Next-Generation Sequencing

AVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits

Comparison of open chromatin regions between dentate granule cells and other tissues and neural cell types.

P. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University.

Tutorial: RNA-Seq Analysis Part II: Non-Specific Matches and Expression Measures

INSTRUCTOR WALKTHROUGH

ESCs were lysed with Trizol reagent (Life technologies) and RNA was extracted according to

WebPT University Training Guide

Chronic Pain Management Workflow Getting Started: Wrenching In Assessments into Favorites (do once!)

NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation

A Quick-Start Guide for rseqdiff

Appendix B. Nodulus Observer XT Instructional Guide. 1. Setting up your project p. 2. a. Observation p. 2. b. Subjects, behaviors and coding p.

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

Table of Contents Morning Set-up (GSI equipment, only)... 2 Opening AudBase... 3 Choosing a patient... 3 Performing Pure-Tone Air & Bone

ChIP-seq analysis. J. van Helden, M. Defrance, C. Herrmann, D. Puthier, N. Servant, M. Thomas-Chollier, O.Sand

EDUCATIONAL TECHNOLOGY MAKING AUDIO AND VIDEO ACCESSIBLE

Michigan State University Alumni Communities

NIH BIOSKETCH CLINIC

Databehandling. 3. Mark e.g. the first fraction (1: 0-45 min, 2: min, 3; min, 4: min, 5: min, 6: min).

Exercise 1: Eye-Hand Reaction Times Aim: To measure the reaction time of a subject to a visual cue when responding with the hand.

Registration Instructions Thank you for joining the Million Mile Movement!

Trinity: Transcriptome Assembly for Genetic and Functional Analysis of Cancer [U24]

R2 Training Courses. Release The R2 support team

Package NarrowPeaks. August 3, Version Date Type Package

Supplementary Figure 1 IL-27 IL

ChromHMM Tutorial. Jason Ernst Assistant Professor University of California, Los Angeles

EHR Go Guide: Vitals, Pain, and Measurement

Cerner COMPASS ICD-10 Transition Guide

Crime Scene Investigation. Story

RADAR Report. Self Guided Tutorial

Analyzing the opioid crisis in America

Figure HN-5-L1: The Sequence menu containing the preprogramming light sequences used in this experiment.

User Instruction Guide

Technical Bulletin. Technical Information for Quidel Molecular Influenza A+B Assay on the Bio-Rad CFX96 Touch

Lionbridge Connector for Hybris. User Guide

[HEALTH MAINTENANCE AND INVITATIONS/REMINDERS] June 2, 2014

Tip Sheet: Investigating an Acute Hepatitis B Event using MAVEN Purpose of Surveillance and Case Investigation:

BlueBayCT - Warfarin User Guide

Lesson: A Ten Minute Course in Epidemiology

About REACH: Machine Captioning for Video

EXPression ANalyzer and DisplayER

TMWSuite. DAT Interactive interface

Term Definition Example Amino Acids

VIP: an integrated pipeline for metagenomics of virus

Clay Tablet Connector for hybris. User Guide. Version 1.5.0

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016

The Effectiveness of Captopril

This document presents the reader with instructions on using AudBase in the UCSF environment. AudBase Guidebook. Application How-To.

Navigating the Business (ISR) Workplace

Global reporting system for hepatitis (GRSH) data approval manual

Warfarin Help Documentation

Cancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer

NMR. Sample preparation. and Analysis

NERVE ACTION POTENTIAL SIMULATION version 2013 John Cornell

Transcription:

Chip Seq Peak Calling in Galaxy Chris Seward PowerPoint by Pei-Chen Peng Chip-Seq Peak Calling in Galaxy Chris Seward 2018 1

Introduction This goals of the lab are as follows: 1. Gain experience using Galaxy. 2. Teach how to map next generation (NSG) reads to a reference genome using Bowtie. 3. Demonstrate how to call peaks from Chip-Seq data. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 2

Step 0B: Logging into Galaxy Go to: http://galaxy.illinois.edu/galaxy Click Enter Click Login Input your login credentials. Click Login. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 3

Step 1B: Galaxy Start Screen The resulting screen should look like the figure below: Chip-Seq Peak Calling in Galaxy Chris Seward 2018 4

Step 2A: Importing the Data In this step, we will import the following data files: Filename G1E_ER4_CTCF_(chr19).fastqsanger G1E_ER4_input_(chr19).fastqsanger G1E_CTCF.fastqsanger G1E_input.fastqsanger Meaning A sample ChIP-seq dataset on CTCF in G1E_ER4 cells, reads have been reduced to those mapping to chr19 for demonstration use. Control DNA taken from chr19. CTCF Chip for G1E line. Control for G1E line. Note: G1E cell lines are erythroid, red blood cell, cell lines missing the GATA-1 gene. GATA-1 is crucial for the maturation of erythroid cells. G1E_E4R cell lines conditionally express GATA-1 in the presence of estradiol, enabling erythdoi maturation. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 5

Step 2B: Import Data in Galaxy Go to http://galaxy.illinois.edu/galaxy/u/ppeng5/h/compgenchip1-1 Click Import history Chip-Seq Peak Calling in Galaxy Chris Seward 2018 6

Step 2B: Import Data in Galaxy Go to http://galaxy.illinois.edu/galaxy/u/ppeng5/h/compgenchip1-1 Click Import History and on the next page click Import. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 7

Step 2C: Import Data into Galaxy Your Galaxy page should look like the following now: Tools Pane Main Pane History Pane Chip-Seq Peak Calling in Galaxy Chris Seward 2018 8

Read Mapping and Peak Calling In this exercise, we will map ChIP Reads to a reference genome and call peaks among the mapped reads using MACs. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 9

Step 3A: Summary Statistics In this step, we will gather summary statistics of ChIP data for quality control. Click NGS: QC and manipulation from the Tools pane. Then click FASTQ Summary Statistics. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 10

Step 3B: FASTQ Summary Statistics On the next page, make sure 1:G1E_ER4_CTCF_(chr9).fastqsanger is selected. Press Execute. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 11

Step 3C: FASTQ Summary Statistics The summary file will be the 5 th file in the History pane. Click to display the file in the Main pane. How long are these reads? Discussion What is the median quality at the last position? Chip-Seq Peak Calling in Galaxy Chris Seward 2018 12

Step 4A: Map ChIP-Seq Reads to MM9 Genome Next, we will map the reads in G1E_E4R_CTCF_(chr9).fastqsanger to the mouse genome. Select NGS: Mapping Then select Map with Bowtie for Illumina Chip-Seq Peak Calling in Galaxy Chris Seward 2018 13

Step 4B: Map ChIP-Seq Reads to MM9 Genome Make sure to select mm9 as the reference genome. Make sure 1: G1E_ER4_CTCF_(chr9).fastqsanger is selected. Hit Execute. It will take a few moments to complete. When done, click the view icon. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 14

Step 4C: Map ChIP-Seq Reads to MM9 Genome Your Main pane should look like the following: Chip-Seq Peak Calling in Galaxy Chris Seward 2018 15

Step 5A: Calling Peaks with MACs With our mapped ChiP-Seq reads, we now want to call peaks. Select NGS: Peak Calling Select MACS Chip-Seq Peak Calling in Galaxy Chris Seward 2018 16

Step 5B: Calling Peaks with MACs Run MACs with the default parameters. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 17

Step 5C: Calling Peaks with MACs When done, MACs will create two files. 8: Macs on data 6 (html report) is an html document with information on the peak calling process. 7: MACs on data 6 (peaks:bed) is a BED file with coordinates and scores of ChiP-Seq peaks in chr19. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 18

Step 5D: Calling Peaks with MACs In the 7: MACs on data 6 (peaks:bed) section in the History pane click main next to display at UCSC browser. The result should look similar to below: Discussion 1. Look at the BED file. How many peaks were found? Chip-Seq Peak Calling in Galaxy Chris Seward 2018 19

Call Chip-Seq Peaks with a Control Sample We will perform the same procedure we did in the previous exercise. This time though, we will work with a control sample instead of an experimental one. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 20

Step 6A: Map Control ChIP-Seq Reads to MM9 Genome Let s map the reads in G1E_E4R_input_(chr19).fastqsanger to the mouse genome. Select NGS: Mapping Then select Map with Bowtie for Illumina Chip-Seq Peak Calling in Galaxy Chris Seward 2018 21

Step 6B: Map Control ChIP-Seq Reads to MM9 Genome Make sure to select mm9 as the reference genome. Make sure 2: G1E_ER4_input_(chr19).fastqsanger is selected. Click Execute. It will take a few moments to complete. When done, click the view icon. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 22

Step 6C: Map Control ChIP-Seq Reads to MM9 Genome Your Main pane should look like the following: Chip-Seq Peak Calling in Galaxy Chris Seward 2018 23

Step 7A: Calling Peaks with MACs on Control Chip-Seq Reads Like before, we want to call peaks in our mapped Control ChiP-Seq reads. Select NGS: Peak Calling Select MACS Chip-Seq Peak Calling in Galaxy Chris Seward 2018 24

Step 7B: Calling Peaks with MACs on Control Chip-Seq Reads Select 6: Map with Bowtie for Illumina on data 1 (the experimental aligned reads) for the Chip-Seq Tag File. Select 9: Map with Bowtie for Illumina on data 2 (the control aligned reads) for the Chip-Seq Control File. Click Execute Chip-Seq Peak Calling in Galaxy Chris Seward 2018 25

Step 7C: Calling Peaks with MACs on Control Chip-Seq Reads Once again, MACs creates a BED file containing the peak coordinates and an HTML file containing information on the peak calling process. Discussion 1. Examine the BED track. 2. How many peaks are called when using a control sample? 3. How does this compare to the previous situation where we only had experimental Chip-Seqreads? Chip-Seq Peak Calling in Galaxy Chris Seward 2018 26

Workflow Extraction In this exercise, we will automate the Bowtie runs of our other 2 datasets. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 27

Step 1A: Workflow Extraction Click the located in the History pane. From the drop down menu, select Extract Workflow. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 28

Step 1B: Workflow Extraction In the resulting window, give the workflow name a new name. Click Uncheck All. Check these boxes. Press Create Wor kflow. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 29

Step 1C: Workflow Extraction Select All Workflows at the bottom of the Tool pane. Select chipseq2 > run On the next page, ensure the following settings and click Run. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 30

Step 1D: Workflow Extraction Click when done Chip-Seq Peak Calling in Galaxy Chris Seward 2018 31

Identifying Differential Binding Sites In this exercise, we will identify binding sites exclusive to undifferentiated and differentiated cell lines as well as those common to both. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 32

Step 1: Subtract Peaks Between Cell Lines Select Operate on Genomic Intervals and Subtract. Choose your G1E MACS peaks as your 2 nd dataset and your G1E-ER4 peaks as your 1 st dataset. Click Execute. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 33

Step 2: Subtract Peaks Between Cell Lines. The resulting BED file contains peaks exclusive to the differentiated cell line (G1E-ER4). Discussion 1. How many peaks are exclusive to G1E-ER4? --------- Redo Step1 only SWITCH the input order. Choose your G1E MACS peaks as your 1 st dataset and your G1E-ER4 peaks as your 2nd dataset. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 34

Step 3: Intersect Peaks Between Cell Lines Select Operate on Genomic Intervals and Intersect. Choose your G1E-ER4 MACS peaks as your 1 st dataset and your G1E peaks as your 2 nd dataset. Click Execute. Chip-Seq Peak Calling in Galaxy Chris Seward 2018 35