RmiR package vignette

Similar documents
Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities

microrna analysis Merete Molton Worren Ståle Nygård

User Guide. Association analysis. Input

How to use MiRaGE Package

SUPPLEMENTAY FIGURES AND TABLES

Data mining with Ensembl Biomart. Stéphanie Le Gras

Systematic exploration of autonomous modules in noisy microrna-target networks for testing the generality of the cerna hypothesis

Figure S1. Analysis of endo-sirna targets in different microarray datasets. The

Figure S1: Heat map based on the relative expression of genes and mirnas in human

Hands-On Ten The BRCA1 Gene and Protein

Silencing the Host The Role of Intronic micrornas

Package TargetScoreData

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

Analysis of paired mirna-mrna microarray expression data using a stepwise multiple linear regression model

SUPPLEMENTARY INFORMATION. Supp. Fig. 1. Autoimmunity. Tolerance APC APC. T cell. T cell. doi: /nature06253 ICOS ICOS TCR CD28 TCR CD28

Pathway Exercises Metabolism and Pathways

Prediction of micrornas and their targets

MethylMix An R package for identifying DNA methylation driven genes

ShadeVision v Color Map

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

Patnaik SK, et al. MicroRNAs to accurately histotype NSCLC biopsies

Human breast milk mirna, maternal probiotic supplementation and atopic dermatitis in offsrping

Bootstrapped Integrative Hypothesis Test, COPD-Lung Cancer Differentiation, and Joint mirnas Biomarkers

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

Intro to SPSS. Using SPSS through WebFAS

Immunization Scheduler Quick Start Guide

Module 3: Pathway and Drug Development

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

MicroRNA and Male Infertility: A Potential for Diagnosis

Vega: Variational Segmentation for Copy Number Detection

Let s get started with the OneTouch Reveal web app

Integration of high-throughput biological data

Selective filtering defect at the axon initial segment in Alzheimer s disease mouse models. Yu Wu

Inferring condition-specific mirna activity from matched mirna and mrna expression data

IMPaLA tutorial.

Package splicer. R topics documented: August 19, Version Date 2014/04/29

Table S2. Baseline characteristics of the Rotterdam Study for interaction analysis

Getting the Design Right Daniel Luna, Mackenzie Miller, Saloni Parikh, Ben Tebbs

Instructor Guide to EHR Go

MicroRNAs and cardiac sarcoplasmic reticulum calcium ATPase-2 in human myocardial infarction: expression and bioinformatic analysis

Profiling of the Exosomal Cargo of Bovine Milk Reveals the Presence of Immune- and Growthmodulatory Non-coding RNAs (ncrna)

OneTouch Reveal Web Application. User Manual for Patients Instructions for Use

On the Reproducibility of TCGA Ovarian Cancer MicroRNA Profiles

The application can be accessed from the pull down menu by selecting ODOT > Drafting Apps > Signs, or by the following key-in command:

sirna count per 50 kb small RNAs matching the direct strand Repeat length (bp) per 50 kb repeats in the chromosome

User s Manual Version 1.0

omiras: MicroRNA regulation of gene expression

Gene-microRNA network module analysis for ovarian cancer

Ascribe Rx Online Resource Center User Guide

Physiological Simulations: Plasma Glucose Regulation 1 Physiology Biology 390

Package citccmst. February 19, 2015

IPA Advanced Training Course

How To Use MiRSEA. Junwei Han. July 1, Overview 1. 2 Get the pathway-mirna correlation profile(pmset) and a weighting matrix 2

Exercises: Differential Methylation

Annotation of Chimp Chunk 2-10 Jerome M Molleston 5/4/2009

Synthetic microrna Reference Standards Genomics Research Group ABRF 2015

Run Time Tester Requirements Document

A Quick-Start Guide for rseqdiff

I. Setup. - Note that: autohgpec_v1.0 can work on Windows, Ubuntu and Mac OS.

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

Supplementary materials and methods.

CNV PCA Search Tutorial

starbase Pan-Cancer Tutorial

Package Actigraphy. R topics documented: January 15, Type Package Title Actigraphy Data Analysis Version 1.3.

Small RNA Sequencing. Project Workflow. Service Description. Sequencing Service Specification BGISEQ-500 SERVICE OVERVIEW SAMPLE PREPARATION

For true visualisation

National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant

R2 Training Courses. Release The R2 support team

EXPression ANalyzer and DisplayER

Overview, page 1 Shortcut Keys for Cisco Unity Connection Administration, page 1 Other Unity Connection Features, page 4

microrna PCR System (Exiqon), following the manufacturer s instructions. In brief, 10ng of

Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma

The Impact of Relevance Judgments and Data Fusion on Results of Image Retrieval Test Collections

Patrocles: a database of polymorphic mirna-mediated gene regulation

CEU MASS MEDIATOR USER'S MANUAL Version 2.0, 31 st July 2017

Lesson 3 Profex Graphical User Interface for BGMN and Fullprof

Supplementary Materials for

Data Management, Data Management PLUS User Guide

The FunCluster Package

Package xseq. R topics documented: September 11, 2015

POPULATION TRACKER MIDS USER GUIDE

CRS4 Seminar series. Inferring the functional role of micrornas from gene expression data CRS4. Biomedicine. Bioinformatics. Paolo Uva July 11, 2012

Section 6: Analysing Relationships Between Variables

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Data Management System (DMS) User Guide

Blood Glucose Monitoring System. Copyright 2016 Ascensia Diabetes Care Holdings AG diabetes.ascensia.com

Lionbridge Connector for Hybris. User Guide

High AU content: a signature of upregulated mirna in cardiac diseases

MedRx HLS Plus. An Instructional Guide to operating the Hearing Loss Simulator and Master Hearing Aid. Hearing Loss Simulator

GST: Step by step Build Diary page

Clay Tablet Connector for hybris. User Guide. Version 1.5.0

Therapy Management Software for Diabetes

Evaluating the Consistency of Differential Expression of MicroRNA Detected in Human Cancers

OECD QSAR Toolbox v.4.2. An example illustrating RAAF scenario 6 and related assessment elements

Using SPSS for Correlation

Metabolic programming. Role of micrornas. M Elizabeth Tejero, PhD Laboratory of Nutrigenetics and Nutrigenomics INMEGEN Mexico City

NYSIIS. Immunization Evaluator and Manage Schedule Manual. October 16, Release 1.0

Transcription:

RmiR package vignette Francesco Favero * October 17, 2016 Contents 1 Introduction 1 2 Coupling mirna and gene expression data 2 2.1 Default parameters.............................. 3 2.2 Change the at.least parameter....................... 3 2.3 Different genes identification......................... 4 2.4 Results by platform probe ID........................ 5 3 Correlation between series 6 3.1 Expected results............................... 6 3.2 The function RmiRtc............................. 6 4 Plot and visualization of the results 7 4.1 SVG output.................................. 7 4.2 Plot of single experiment........................... 7 4.3 Plot a time course experiment........................ 9 1 Introduction RmiR is an R package for the analysis of microrna and gene expression microarrays. The goal of this package is to couple microrna and gene expression data (coming from the same RNA). We match mirnas to corresponding gene targets using the criteria applied by different databases. The package uses various databases of microrna targets from the RmiR.Hs.miRNA package: ˆ mirbase http://microrna.sanger.ac.uk/targets/v5/ * favero.francesco@gmail.com 1

ˆ targetscan http://www.targetscan.org/ ˆ miranda from microrna.org http://www.microrna.org/microrna/ ˆ tarbase http://diana.cslab.ece.ntua.gr/tarbase/ ˆ mirtarget2 from mirdb http://mirdb.org/mirdb/download.html ˆ PicTar http://pictar.mdc-berlin.de/ To use the package a list of mirna and a list of genes are required, both with the respective expression values. It is more interesting to have two series, one for microrna expression and another for gene expression, relative to different times or different treatments. In this case RmiR is useful to investigate the correlation between mirna/target couples. It is also possible to use the databases to simply retrieve the targets or the mirnas, given a list of mirnas or a list of genes respectively. The package also includes some tools to visualise the results for coupled data and for time series experiments. The RSVGTipsDevice[1] package is required which creates plots in SVG format, that one can visualize with a SVG viewer or in a browser like Mozilla Firefox, Safari and others. For more details about using the databases have a look at RmiR.Hs.miRNA package vignette. > vignette("rmir.hs.mirna") 2 Coupling mirna and gene expression data An analysis with RmiR starts giving to the read.mir function the list of mirnas and the list of genes with the right annotation. Leaving the default values the function searches in targetscan and pictar and prints only the couples present in both databases. > genes <- data.frame(genes=c("a_23_p171258","a_23_p150053", "A_23_P150053", + "A_23_P150053", "A_23_P202435", "A_24_P90097", + "A_23_P127948")) > genes$expr <- c(1.21, -1.50, -1.34, -1.45, -2.41, -2.32, -3.03) > mirna <- data.frame(mirna=c("hsa-mir-148b", "hsa-mir-27b", "hsa-mir-25", + "hsa-mir-181a", "hsa-mir-27a", "hsa-mir-7", + "hsa-mir-32", "hsa-mir-32", "hsa-mir-7")) > mirna$expr <- c(1.23, 3.52, -2.42, 5.2, 2.2, -1.42, -1.23, -1.20, -1.37) 2

> mirna > genes genes expr 1 A_23_P171258 1.21 2 A_23_P150053-1.50 3 A_23_P150053-1.34 4 A_23_P150053-1.45 5 A_23_P202435-2.41 6 A_24_P90097-2.32 7 A_23_P127948-3.03 mirna expr 1 hsa-mir-148b 1.23 2 hsa-mir-27b 3.52 3 hsa-mir-25-2.42 4 hsa-mir-181a 5.20 5 hsa-mir-27a 2.20 6 hsa-mir-7-1.42 7 hsa-mir-32-1.23 8 hsa-mir-32-1.20 9 hsa-mir-7-1.37 2.1 Default parameters The default parameters are set to search the items present in both TargetScan and in PicTar databases, and the genes object is identified by probe name. If it is not specified, the function makes the average of the different probes identifying the same gene, computing also the coefficient of variation (CV). If there is just one result, no average can be done, so the CV will be NA. > library(rmir) > read.mir(genes=genes, mirna=mirna, annotation="hgug4112a.db") gene_id mature_mirna mirexpr mircv symbol geneexpr genecv 1 22 hsa-mir-148b 1.230 NA ABCB7 1.21 NA 4 133 hsa-mir-181a 5.200 NA ADM -3.03 NA 5 133 hsa-mir-25-2.420 NA ADM -3.03 NA 6 133 hsa-mir-32-1.215 0.01745943 ADM -3.03 NA 2.2 Change the at.least parameter We can select the result present in at least one database: > read.mir(genes=genes, mirna=mirna, annotation="hgug4112a.db", at.least=1) gene_id mature_mirna mirexpr mircv symbol geneexpr genecv 1 22 hsa-mir-148b 1.230 NA ABCB7 1.210 NA 2 59 hsa-mir-27a 2.200 NA ADD3-1.450 0.05724023 3 59 hsa-mir-27b 3.520 NA ADD3-1.450 0.05724023 4 133 hsa-mir-181a 5.200 NA ADM -3.030 NA 5 133 hsa-mir-25-2.420 NA ADM -3.030 NA 6 133 hsa-mir-32-1.215 0.01745943 ADM -3.030 NA 23 120 hsa-mir-27a 2.200 NA ACTA2-2.365 0.02690893 61 120 hsa-mir-27b 3.520 NA ACTA2-2.365 0.02690893 3

Searching with at.least equal to 1 basically gives the union of the results from the selected databases. 2.3 Different genes identification. If the result is annotated by another identification than the platform probe ID, we can specify the annotation identifiers with the parameter id. The possible values are probes, genes, alias, ensembl and unigene. An example with entrez gene id identifiers: > genes.e <- genes > genes.e$gene_id <- c(22, 59, 59, 59, 120, 120, 133) > genes.e <- genes.e[, c("gene_id", "expr")] > genes.e gene_id expr 1 22 1.21 2 59-1.50 3 59-1.34 4 59-1.45 5 120-2.41 6 120-2.32 7 133-3.03 > read.mir(genes = genes.e, mirna = mirna, annotation = "hgug4112a.db", + id="genes") gene_id mature_mirna mirexpr mircv symbol geneexpr genecv 1 22 hsa-mir-148b 1.230 NA ABCB7 1.21 NA 4 133 hsa-mir-181a 5.200 NA ADM -3.03 NA 5 133 hsa-mir-25-2.420 NA ADM -3.03 NA 6 133 hsa-mir-32-1.215 0.01745943 ADM -3.03 NA Another example mixing official HGNC symbols and others aliases: > genes.a <- genes > genes.a$alias <- c("abcb7", "ADD3", "ADDL", "ADD3", "AAT6", "ACTA2", "ADM") > genes.a <- genes.a[, c("alias", "expr")] > genes.a alias expr 1 ABCB7 1.21 2 ADD3-1.50 3 ADDL -1.34 4

4 ADD3-1.45 5 AAT6-2.41 6 ACTA2-2.32 7 ADM -3.03 > read.mir(genes = genes.a, mirna = mirna, annotation = "hgug4112a.db", + id="alias") gene_id mature_mirna mirexpr mircv symbol geneexpr genecv 1 22 hsa-mir-148b 1.230 NA ABCB7 1.21 NA 4 133 hsa-mir-181a 5.200 NA ADM -3.03 NA 5 133 hsa-mir-25-2.420 NA ADM -3.03 NA 6 133 hsa-mir-32-1.215 0.01745943 ADM -3.03 NA 2.4 Results by platform probe ID If we are using the object genes annotated by microarray probes, sometimes we do not need the average of the results, for example when we would like to test each probe separately. In this case, it is preferable to have just the result as it is, annotated by probe. This will obviously cause much redundancy: > read.mir(genes=genes, mirna=mirna, annotation="hgug4112a.db", at.least=1, + id.out="probes") gene_id mature_mirna mirexpr mircv probe_id geneexpr genecv 1 22 hsa-mir-148b 1.230 NA A_23_P171258 1.21 NA 2 59 hsa-mir-27a 2.200 NA A_23_P150053-1.50 NA 3 59 hsa-mir-27a 2.200 NA A_23_P150053-1.34 NA 4 59 hsa-mir-27a 2.200 NA A_23_P150053-1.45 NA 5 59 hsa-mir-27b 3.520 NA A_23_P150053-1.50 NA 6 59 hsa-mir-27b 3.520 NA A_23_P150053-1.34 NA 7 59 hsa-mir-27b 3.520 NA A_23_P150053-1.45 NA 8 133 hsa-mir-181a 5.200 NA A_23_P127948-3.03 NA 9 133 hsa-mir-25-2.420 NA A_23_P127948-3.03 NA 10 133 hsa-mir-32-1.215 0.01745943 A_23_P127948-3.03 NA 21 120 hsa-mir-27a 2.200 NA A_23_P202435-2.41 NA 31 120 hsa-mir-27a 2.200 NA A_24_P90097-2.32 NA 101 120 hsa-mir-27b 3.520 NA A_23_P202435-2.41 NA 11 120 hsa-mir-27b 3.520 NA A_24_P90097-2.32 NA 5

3 Correlation between series 3.1 Expected results The control of target genes by micrornas occurs at post-trascriptional level. The mirna binds to its target gene and inhibits translation. In some cases the mrna is degraded by the mirna annealing, but this is not necessary to stop the translation process. In the first case it is possible to see a decrease of the gene expression, while in the other cases we shouldn t observe any particular trend or even an increase of the expression value. With a time series of microarray data it is possible to see if a target of one or more mirnas gradually changes its expression with the equal or opposite change of the relative mirna. Looking at the correlation between the trend of the different mirna/target couples, we can obtain the correlated and the anti-correlated couples. This does not mean that there is a sure biological relevance, but it could give some hints for further inverstigation. 3.2 The function RmiRtc To use the function RmiRtc we need two or more objects created with the function read.mir. > data(rmir) > res1<- read.mir(gene=gene1, mirna=mir1, annotation="hgug4112a.db",verbose=true) In targetscan database there are 13 genes and 35 microrna which are in your files. -------------------------------------------- In pictar database there are 7 genes and 27 microrna which are in your files. -------------------------------------------- > res2<- read.mir(gene=gene2, mirna=mir2, annotation="hgug4112a.db",verbose=true) In targetscan database there are 12 genes and 23 microrna which are in your files. -------------------------------------------- In pictar database there are 6 genes and 24 microrna which are in your files. -------------------------------------------- > res3<- read.mir(gene=gene3, mirna=mir3, annotation="hgug4112a.db",verbose=true) In targetscan database there are 13 genes and 35 microrna which are in your files. -------------------------------------------- In pictar database there are 7 genes and 27 microrna which are in your files. -------------------------------------------- 6

> res_tc <- RmiRtc(timeline=c("res1", "res2", "res3"), + timevalue=c(12, 24, 48)) We can decide to filter the object by a correlation and/or a gene expression threshold: > res_fil <- readrmirtc(res_tc, correlation=-0.9, exprlev=1, + annotation="hgug4112a.db") > res_fil$reps symbol mirnas gene_id 2 APP 3 351 3 VLDLR 3 7436 1 CENPV 1 201161 The function readrmirtc filter the genes by the absolute expression value set in the arguments, and returns a list with the genes ranked by the number of mirnas satisfying the correlation threshold. To see in details the expression of mirna and genes it is possible to plot the various trends, or print the desired results: > cbind (res_fil$couples, res_fil$geneexpr, + res_fil$mirexpr)[res_fil$couples$gene_id==351 & res_fil$cor<=-0.9, ] gene_id mature_mirna 12 24 48 12 24 48 22 351 hsa-mir-20a 0.71-0.95-1.67 0.32 1.73 2.12 17 351 hsa-mir-20b 0.71-0.95-1.67 0.06 1.10 1.61 19 351 hsa-mir-93 0.71-0.95-1.67 0.30 1.25 1.19 4 Plot and visualization of the results 4.1 SVG output In order to present the results, the pachage uses a tool to visualize the data. With the package RSVGTipsDevice we can plot the data for a single mirna and gene coupled experiment or for a series of experiments. The resulting image is in svg format with the properties to have dynamic tips and hyperlinks. 4.2 Plot of single experiment > plotrmirtc(res1[res1$gene_id==351,],svgname="res1_351.svg",svgtips=t) 7

Figure 1: Visualization in a browser with SVG support of the results of the read.mir function selecting a single gene target and its mirna 8

4.3 Plot a time course experiment > plotrmirtc(res_fil,gene_id=351) APP and its mirnas expression trends Expression 1 0 1 2 15 20 25 30 35 40 45 Time Setting the coordinate of the legend: > plotrmirtc(res_fil, gene_id=351, legend.y=0, legend.x=30) 9

APP and its mirnas expression trends Expression 1 0 1 2 APP hsa mir 20a hsa mir 20b hsa mir 93 15 20 25 30 35 40 45 Time > plotrmirtc(res_fil, gene_id=351, legend.y=0, legend.x=30, svgtips=t) 10

Figure 2: Visualization in a browser with SVG support of a time course experiment of the selected gene with hyperlinks and tips 11

References [1] Tony Plate. RSVGTipsDevice: An R SVG graphics device with dynamic tips and hyperlinks, 2009. R package version 1.0-1. 12