Package bionetdata. R topics documented: February 19, Type Package Title Biological and chemical data networks Version 1.0.

Similar documents
Package GANPAdata. February 19, 2015

Package citccmst. February 19, 2015

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

Package inote. June 8, 2017

Package diggitdata. April 11, 2019

Package ORIClust. February 19, 2015

Package rvtdt. February 20, rvtdt... 1 rvtdt.example... 3 rvtdts... 3 TrendWeights Index 6. population control weighted rare-varaints TDT

Package AbsFilterGSEA

Package CompareTests

User Guide. Association analysis. Input

Package nopaco. R topics documented: April 13, Type Package Title Non-Parametric Concordance Coefficient Version 1.0.

Package propoverlap. R topics documented: February 20, Type Package

Package TargetScoreData

Journal: Nature Methods

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

Package clust.bin.pair

DrugBank: a comprehensive resource for in silico drug discovery and exploration

Package singlecase. April 19, 2009

Cancer Informatics Lecture

SpliceDB: database of canonical and non-canonical mammalian splice sites

Package PELVIS. R topics documented: December 18, Type Package

Package Actigraphy. R topics documented: January 15, Type Package Title Actigraphy Data Analysis Version 1.3.

Package xseq. R topics documented: September 11, 2015

Gene Expression Analysis Web Forum. Jonathan Gerstenhaber Field Application Specialist

EXPression ANalyzer and DisplayER

Package ordibreadth. R topics documented: December 4, Type Package Title Ordinated Diet Breadth Version 1.0 Date Author James Fordyce

ivlewbel: Uses heteroscedasticity to estimate mismeasured and endogenous regressor models

Genomes2Drugs: Identifies Target Proteins and Lead Drugs from Proteome Data

Package cssam. February 19, 2015

Package leukemiaseset

Package prognosticroc

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend

Package QualInt. February 19, 2015

Package HAP.ROR. R topics documented: February 19, 2015

Package appnn. R topics documented: July 12, 2015

Supplementary Information

Package p3state.msm. R topics documented: February 20, 2015

CoINcIDE: A framework for discovery of patient subtypes across multiple datasets

Package bgafun. R topics documented: August 12, Type Package

Package frailtyhl. February 19, 2015

A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis

R documentation. of GSCA/man/GSCA-package.Rd etc. June 8, GSCA-package. LungCancer metadi... 3 plotmnw... 5 plotnw... 6 singledc...

Data mining with Ensembl Biomart. Stéphanie Le Gras

Package speff2trial. February 20, 2015

Nature Neuroscience: doi: /nn Supplementary Figure 1

Package Interpol. R topics documented: February 19, Type Package. Title Interpolation of amino acid sequences. Version

A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data

Package RNAsmc. December 11, 2018

PCxN: The pathway co-activity map: a resource for the unification of functional biology

Facts from text: Automated gene annotation with ontologies and text-mining

CSE 255 Assignment 9

Terminology. Metabo*omics

Package ICCbin. November 14, 2017

Package AIMS. June 29, 2018

Package ExactCIdiff. February 15, 2013

Discovery of Novel Human Gene Regulatory Modules from Gene Co-expression and

DeSigN: connecting gene expression with therapeutics for drug repurposing and development. Bernard lee GIW 2016, Shanghai 8 October 2016

Chapter 1. Introduction

Package msgbsr. January 17, 2019

5 Listening to the data

DrugBank: A General Resource for Pharmaceutical and Pharmacological Research

Systems Biology in Metabolomics

Package inctools. January 3, 2018

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

Package ncar. July 20, 2018

Package pathifier. August 17, 2018

Package ClusteredMutations

Contents. 2 Statistics Static reference method Sampling reference set Statistics Sampling Types...

Predicting disease associations via biological network analysis

Package ega. March 21, 2017

Integrative analysis of survival-associated gene sets in breast cancer

Package DeconRNASeq. November 19, 2017

SubLasso:a feature selection and classification R package with a. fixed feature subset

Translational Bioinformatics: Connecting Genes with Drugs

Package cancer. July 10, 2018

MethylMix An R package for identifying DNA methylation driven genes

Package xmeta. August 16, 2017

Package memapp. R topics documented: June 2, Version 2.10 Date

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

IMPaLA tutorial.

bivariate analysis: The statistical analysis of the relationship between two variables.

Parameter Estimation of Cognitive Attributes using the Crossed Random- Effects Linear Logistic Test Model with PROC GLIMMIX

HBV. Next Generation Sequencing, data analysis and reporting. Presenter Leen-Jan van Doorn

Package SAMURAI. February 19, 2015

Nature Methods: doi: /nmeth.3115

Package labstats. December 5, 2016

Package netclass. July 2, 2014

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

SMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination

Integrating Genome and Functional Genomics Data to Reveal Perturbed Signaling Pathways in Ovarian Cancers

Package TEQR. R topics documented: February 5, Version Date Title Target Equivalence Range Design Author M.

Package NarrowPeaks. August 3, Version Date Type Package

High AU content: a signature of upregulated mirna in cardiac diseases

Package pepdat. R topics documented: March 7, 2015

The value of Omics to chemical risk assessment

Inferring Biological Meaning from Cap Analysis Gene Expression Data

Supplementary Figure 1. Metabolic landscape of cancer discovery pipeline. RNAseq raw counts data of cancer and healthy tissue samples were downloaded

MALDI Activity 4 MALDI-TOF Mass Spectrometry: Data Analysis

Module 3: Pathway and Drug Development

BIMM 143. RNA sequencing overview. Genome Informatics II. Barry Grant. Lecture In vivo. In vitro.

Transcription:

Type Package Title Biological and chemical data networks Version 1.0.1 Date 2014-09-07 Package bionetdata February 19, 2015 Author Matteo Re, Giorgio Valentini -- Universita' degli Studi di Milano Maintainer Matteo Re<re@di.unimi.it> Data Package that includes several examples of chemical and biological data networks, i.e. data graph structured. License GPL (>= 2) LazyLoad yes Depends R (>= 2.10) NeedsCompilation no Repository CRAN Date/Publication 2014-09-07 13:10:15 R topics documented: bionetdata-package..................................... 2 CGM.Cat.......................................... 2 DD.chem.data........................................ 3 DrugBank.Cat........................................ 4 FIN.data........................................... 5 Yeast.Biogrid.data...................................... 6 Yeast.Biogrid.FunCat.................................... 7 Yeast.STRING.data..................................... 8 Yeast.STRING.FunCat................................... 8 Index 10 1

2 CGM.Cat bionetdata-package bionetdata: Biological and chemical data networks Details Data Package that includes several examples of chemical and biological data networks represented through adjacency matrices of a graph Package: bionetdata Type: Package Version: 1.0.1 Date: 2014-09-07 License: GPL (>= 2) LazyLoad: yes Author(s) Giorgio Valentini and Matteo Re DI, Dipartimento di Scienze dell Informazione Universita degli Studi di Milano <{valentini,re}@di.unimi.it> Maintainer: Matteo Re CGM.Cat Cancer Gene Modules Cancer Gene Modules classes for the genes included in FIN.data. Annotations are taken from the GSEA MSigDB (Molecular Signatures Database) public repository: http://www.broadinstitute.org/gsea/msigdb The annotations are available in MSigDB C4 (computational gene sets collections) CM (cancer modules), file: c4.cgn.v3.0.symbols.gmt. This collection of gene sets is taken from a work published by Segal and colleagues aimed at the definition of a Cancer Modules Maps. According to the definition of the authors Cancer Gene Modules are groups of genes that act in concert to carry out a biological function/process. Cancer Gene Modules have been used to describe expression profiles in different tumors types in terms of the behavior of modules. Further information about specific

DD.chem.data 3 modules can be found at: http://robotics.stanford.edu/~erans/cancer/browse_by_modules.html It is worth noting that the Cancer Gene Modules annotations contained in bionetdata covers only the 2033 genes included in FIN.data. data(cgm.cat) A 2033 x 10 named matrix where rows refer to human gene symbols, columns to 10 Cancer Modules classes. Original annotations (c4.cm.v3.0.symbols.gmt) are available at: http://www.broadinstitute.org/gsea/msigdb Segal E., Friedman N., Koller D. and Regev A., A module map showing conditional activity of expression modules in cancer, Nature Genetics, 36(10), 2004 data(cgm.cat); CGM.Cat[1:10,1:10]; DD.chem.data Drug-drug chemical similarities data Chemical structure similarities between 1253 FDA approved drugs obtained from DrugBank 3.0. data(dd.chem.data) A 1253 x 1253 named matrix where both rows and columns refer to DrugBank drugs. Drugs names are DrugBank 3.0 identifiers.

4 DrugBank.Cat Details This matrix contains the Tanimoto chemical structure similarity scores between 1253 DrugBank drugs. Canonical Simplified molecular-input line-entry specifications (SMILES) of the drugs were extracted from the DrugBank Drugcards. The SMILES were then converted into molecular extended fingerprints (1024 bits) using the rcdk package. The set of fingerprints was finally converted into a Tanimoto similarity matrix using the fp.sim.matrix rcdk function. All the real values contained in this matrix represent the chemical structure similarity between each possible pair of drugs and are comprised between 0 (completely different chimical structures) and 1 (identical chemical structures). The SMILES representations of the drugs were extracted by the Smiles_String_canonical fields of the DrugCards contained in the file drugbank.txt available at: http://www.drugbank.ca/system/downloads/current/drugbank.txt.zip Wishart, D., Knox, C., Guo, A., Shrivastava, S., Hassanali, M., Stothard, P., Chang, Z., Woolsey, J.: DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 34(Jan), D668-D672 (2006) Guha, R.: Chemical Informatics Functionality in R. Journal of Statistical Software 18(6), (2007) data(dd.chem.data) DD.chem.data[1:10,1:10] DrugBank.Cat DrugBank categories DrugBank 3.0 drugs categories of the 1253 drugs contained in the DD.chem.data matrix. data(drugbank.cat) A 1253 x 45 named binary matrix where rows refer to DrugBank drugs, columns to 45 Drug categories extracted from the DrugBank DrugCards. This matrix contains a 1 if the drug associated to the considered row is annotated in the category represented by the considered column and a 0 otherwise.

FIN.data 5 Details The drug categories were extracted by parsing the Drug_Category field of the DrugBank 3.0 Drug- Cards of the 1253 FDA approved drugs contained in the DD.chem.data matrix. The DrugBank Drug categories are produced by manual curators. On the contrary of other drugs functional annotation schemes these drugs categories are not restricted to specific deseases. Indeed it is possible to find in this functional annotation scheme categories associated to drugs involved in the treatment of specific deseases (i.e. AntiParkinsons_Agents ) or to the treatment of symptoms (i.e. Anticonvulsants ) associated to many deseases. This functional labelling scheme contains also categories based on the Mode of Action (MOA) of the drugs as in the case of the Adrenergic_Uptake_Inhibitors category. The drug categories were extracted by parsing the Drug_Category field of the DrugCards contained in the file drugbank.txt available at: http://www.drugbank.ca/system/downloads/current/drugbank.txt.zip Knox, C., Law, V., Jewison, T., Liu, P., Ly, S., Frolkis, A., Pon, A., Banco, K., Mak, C., Neveu, V., Djoumbou, Y., Eisner, R., Guo, A., Wishart, D.: DrugBank 3.0: a comprehensive resource for "omics" research on drugs. Nucleic Acids Res. 39(Jan), D1035-41 (2011) data(drugbank.cat) DrugBank.Cat[1:10,1:10]; dimnames(drugbank.cat)[[2]]; FIN.data Functional Interaction Network data for human. Functional Interaction data obtained from the supplemental materials of a paper written by Wu and colleagues: A human functional protein interaction network and its application to cancer data analysis. Data are represented through a binary named matrix and represent the presence or absence of gene-gene functional interactions. The original network is composed by more than 9000 genes. The network contained in the bionetdata R package is a reduced version containing 2033 nodes and obtained using the walktrap.community function of the R package igraphi. The walktrap.community function is an implementation of a community detection method developed by Pons and Latapy based on a between vertices similarity measure computed by means of a random walk. data(fin.data)

6 Yeast.Biogrid.data Binary named matrix. Entry FIN.data[i,j] = 1 if there is a functional interaction between gene i and j, otherwise FIN.data[i,j] = 0. Wu and colleagues original data are available as : http://genomebiology.com/content/supplementary/gb- 2010-11-5-r53-s3.zip Wu G., Feng X. and Stein L. A human functional protein interaction network and its application to cancer data analysis, Genome Biology 11:R53, 2010. Pons P. and Latapy M. Computing communities in large networks using random walks, J. of Graph Alg. and App. bf, 10:284-293, 2004. data(fin.data); FIN.data[1:10,1:10]; Yeast.Biogrid.data Yeast BioGRID data Protein-protein interaction (PPI) data (BioGRID) of yeast have been downloaded from the BioGRID database, that collects PPI data from both high-throughput studies and conventional focused studies (Stark et al. 2006). Data are represented through a binary named matrix and represent the presence or absence of protein-protein interactions. Names correspond to systematic names of yeast genes. data(yeast.biogrid.data) Binary named matrix. Entry Yeast.Biogrid.data[i,j] = 1 if there is an interaction between gene i and j, otherwise Yeast.Biogrid.data[i,j] = 0. BioGRID data base: http://thebiogrid.org Stark, C., Breitkreutz, B., Reguly, T., Boucher, L., Breitkreutz, A., and Tyers, M. (2006). BioGRID: a general repository for interaction datasets. Nucleic Acids Res., 34, D535-D539.

Yeast.Biogrid.FunCat 7 data(yeast.biogrid.data); Yeast.Biogrid.data[1:10,1:5]; Yeast.Biogrid.FunCat Yeast FunCat classes for BioGRID data. FunCat classes for the genes included in Yeast.Biogrid.data. Annotations refer the funcat-2.1 scheme, and funcat-2.1 data 20070316 data, available from the MIPS web site. data(yeast.biogrid.funcat) A named matrix where rows refer to yeast genes, columns to FunCat classes. Names of yeast genes are systematic names. Names of columns correspond to FunCat IDs. http://mips.gsf.de/projects/funcat Ruepp, A., Zollner, A., Maier, D., Albermann, K., Hani, J., Mokrejs, M., Tetko, I., Guldener, U., Mannhaupt, G., Munsterkotter, M., and Mewes, H. (2004). The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Research, 32(18), 5539-5545. data(yeast.biogrid.funcat) Yeast.Biogrid.FunCat[1:10,1:6]

8 Yeast.STRING.FunCat Yeast.STRING.data Yeast interactions from STRING Binary protein-protein interactions from the STRING data base (von Mering et al. 2002), representing interaction data from yeast two-hybrid assay, mass-spectrometry of purified complexes, correlated mrna expression and genetic interactions. data(yeast.string.data) Binary named matrix. Entry Yeast.Biogrid.STRING[i,j] = 1 if there is an interaction between gene i and j, otherwise Yeast.STRING.data[i,j] = 0. http://string-db.org von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S., Fields, S., and Bork, P. (2002). Comparative assessment of large-scale data sets of protein-protein interactions. Nature, 417, 399-403. data(yeast.string.data) Yeast.STRING.FunCat Yeast FunCat classes for STRING data. FunCat classes for the genes included in Yeast.STRING.data. Annotations refer the funcat-2.1 scheme, and funcat-2.1 data 20070316 data, available from the MIPS web site. data(yeast.string.funcat)

Yeast.STRING.FunCat 9 A named matrix where rows refer to yeat genes, columns to FunCat classes. Names of yeast genes are systematic names. Names of columns correspond to FunCat IDs. http://mips.gsf.de/projects/funcat Ruepp, A., Zollner, A., Maier, D., Albermann, K., Hani, J., Mokrejs, M., Tetko, I., Guldener, U., Mannhaupt, G., Munsterkotter, M., and Mewes, H. (2004). The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Research, 32(18), 5539-5545. data(yeast.string.funcat)

Index Topic datasets CGM.Cat, 2 DD.chem.data, 3 DrugBank.Cat, 4 FIN.data, 5 Yeast.Biogrid.data, 6 Yeast.Biogrid.FunCat, 7 Yeast.STRING.data, 8 Yeast.STRING.FunCat, 8 Topic package bionetdata-package, 2 bionetdata (bionetdata-package), 2 bionetdata-package, 2 CGM.Cat, 2 DD.chem.data, 3, 4, 5 DrugBank.Cat, 4 FIN.data, 2, 3, 5 Yeast.Biogrid.data, 6, 7 Yeast.Biogrid.FunCat, 7 Yeast.STRING.data, 8, 8 Yeast.STRING.FunCat, 8 10