Facts from text: Automated gene annotation with ontologies and text-mining

Size: px
Start display at page:

Download "Facts from text: Automated gene annotation with ontologies and text-mining"

Transcription

1 1. Workshop des GI-Arbeitskreises Ontologien in Biomedizin und Lebenswissenschaften (OBML) Facts from text: Automated gene annotation with ontologies and text-mining Conrad Plake Schroeder Group (Bioinformatics), Technische Unversität Dresden 26th November 2009

2 Facts are in databases Genes, proteins, diseases,... Annotations from controlled vocabularies and ontologies MeSH Protein-protein interactions 2

3 Facts are in databases Genes, proteins, diseases,... Annotations from controlled vocabularies and ontologies MeSH Protein-protein interactions Problem: Manual curation is not enough! 3

4 The Problem: Manual curation is not enough! Literature references for function annotation Baumgartner, W. A. et al. Bioinformatics :i41-48i 4

5 Can text mining help to scale-up high-quality manual curation of gene products with ontologies? Winnenburg et al., Briefings in Bioinformatics,

6 Can text mining help to scale-up high-quality manual curation of gene products with ontologies? Winnenburg et al., Briefings in Bioinformatics, 2008 Yes, it can! 6

7 Identification of genes... 7

8 Gene Ontology terms 8

9 and diseases 9

10 and diseases How to associate genes with GO terms and diseases? 10

11 Filter for significant co-occurrences Association score: Royer, Plake, Schroeder, GCB,

12 A search engine for genes Query for: - gene names, IDs (screen) - sequence (gene or protein) - keywords (PubMed) Sorts results by GO & MeSH Gene profiling Links to literature Supports curators Trend analysis for genes Plake et al. (2009) GoGene: gene annotation in the fast lane Nucleic Acids Research 12

13 Which rat genes have implications in osteoporosis and bone resorption? Rat Genome Database: 0 genes PubMed: 859 hits 13

14 Which rat genes have implications in osteoporosis and bone resorption? Rat Genome Database: 0 genes PubMed: 859 hits GoGene: 5 genes found 14

15 Linking back to the literature Csf1 and osteoporosis: Ctsk and osteoporosis: Tnfrsb1 and bone resorption 15

16 Prediction of drug-target interactions Only ~30% of drug-target pairs in DrugBank co-occur in PubMed 16

17 Prediction of drug-target interactions Only ~30% of drug-target pairs in DrugBank co-occur in PubMed Idea: Link drugs with targets via disease, process, function! Sparfloxacin (Zagam) DNA topoisomerase II neutrophil apoptosis apoptotic chromosome condensation endopeptidase activity aspartic endopeptidase activity dermatitis, phototoxic ataxia telangiectasia 17

18 Prediction of drug-target interactions Only ~30% of drug-target pairs in DrugBank co-occur in PubMed Idea: Link drugs with targets via disease, process, function! Sparfloxacin (Zagam) DNA topoisomerase II neutrophil apoptosis apoptotic chromosome condensation endopeptidase activity aspartic endopeptidase activity dermatitis, phototoxic ataxia telangiectasia Arbutamine (GenESA) beta-1-adrenergic receptor adrenoceptor activity beta-adrenergic receptor activity blood circulation regulation of heart contraction coronary diseases myocardial infarction 18

19 Similarity between Drugs and Proteins Similarity based on shortest path length between concepts in a concept graph Drug with concept Protein with concept B Plake et al., submitted 19

20 Recovering DrugBank by similarity ranking AUC for all drugs All targets: Human targets: AUC for approved drugs All targets: Human targets: AUC for experimental drugs All targets: Human targets: Plake et al., submitted 20

21 Summary Employ ontologies to facilitate search and summarization of information Link genes and drugs to ontologies via literature Scale-up high quality annotation of gene products with text-mining Matching ontological profiles of proteins and drugs finds binding pairs GoGene: search genes at 21

22 Kudos to: Michael Schroeder Rainer Winnenburg Loic Royer 22

23 The Problem: Manual curation is not enough! Progress of GO annotations in the protein database Swiss-Prot Baumgartner, W. A. et al. Bioinformatics :i41-48i 23

24 Ontologies in the Life Sciences The Open Biomedical Ontologies ( Smith et al. (2007) The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nature Biotechnology 25,

25 Ontologies in the Life Sciences Annotation of gene products Molecular Function Cellular Component Biological Process Annotation of literature MeSH Medical Subject Headings Disease Anatomy Chemicals Organisms... 25

26 Prediction of drug-target interactions Only ~30% of drug-target pairs from DrugBank co-occur in PubMed Co-occurrence of proteins and ontological concepts Co-occurrence of drugs and ontological concepts 26

Introduction to Annotation for Gene Expression Analyses

Introduction to Annotation for Gene Expression Analyses References Introduction to Annotation for Gene Expression Analyses Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 4.1 Chapters 7 & 14 of Bioconductor Monograph (course text)

More information

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies

OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies OncoPPi Portal A Cancer Protein Interaction Network to Inform Therapeutic Strategies 2017 Contents Datasets... 2 Protein-protein interaction dataset... 2 Set of known PPIs... 3 Domain-domain interactions...

More information

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts jsci2016 Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Wutthipong Kongburan, Praisan Padungweang, Worarat Krathu, Jonathan H. Chan School of Information Technology King

More information

Biomedical resources for text mining

Biomedical resources for text mining August 30, 2005 Workshop Terminologies and ontologies in biomedicine: Can text mining help? Biomedical resources for text mining Olivier Bodenreider Lister Hill National Center for Biomedical Communications

More information

The PlantFAdb website and database are based on the superb SOFA database (sofa.mri.bund.de).

The PlantFAdb website and database are based on the superb SOFA database (sofa.mri.bund.de). A major goal of PlantFAdb is to allow users to easily explore relationships between unusual fatty acid structures and the plant species that produce them. Clicking on Tree from the home page provides an

More information

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach

Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach Building a Diseases Symptoms Ontology for Medical Diagnosis: An Integrative Approach Osama Mohammed, Rachid Benlamri and Simon Fong* Department of Software Engineering, Lakehead University, Ontario, Canada

More information

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm:

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm: The clustering problem: partition genes into distinct sets with high homogeneity and high separation Hierarchical clustering algorithm: 1. Assign each object to a separate cluster. 2. Regroup the pair

More information

Phenotype analysis in humans using OMIM

Phenotype analysis in humans using OMIM Outline: 1) Introduction to OMIM 2) Phenotype similarity map 3) Exercises Phenotype analysis in humans using OMIM Rosario M. Piro Molecular Biotechnology Center University of Torino, Italy 1 MBC, Torino

More information

Schema-Driven Relationship Extraction from Unstructured Text

Schema-Driven Relationship Extraction from Unstructured Text Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 2007 Schema-Driven Relationship Extraction from Unstructured Text Cartic

More information

PONTE: A Context-Aware Approach for Automated Clinical Trial Protocol Design

PONTE: A Context-Aware Approach for Automated Clinical Trial Protocol Design PONTE: A Context-Aware Approach for Automated Clinical Trial Protocol Design George Tsatsaronis 1, Konstantinos Mourtzoukos 2, Vassiliki Andronikou 2, Tassos Tagaris 2, Iraklis Varlamis 3, Michael Schroeder

More information

Advancing methods to develop behaviour change interventions: A Scoping Review of relevant ontologies

Advancing methods to develop behaviour change interventions: A Scoping Review of relevant ontologies Advancing methods to develop behaviour change interventions: A Scoping Review of relevant ontologies Participating organisations Emma Norris @EJ_Norris Ailbhe Finnerty Janna Hastings Gillian Stokes Susan

More information

A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data

A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data Method A Network Partition Algorithm for Mining Gene Functional Modules of Colon Cancer from DNA Microarray Data Xiao-Gang Ruan, Jin-Lian Wang*, and Jian-Geng Li Institute of Artificial Intelligence and

More information

A 10-year summary of kinase small molecule research Text mining AACR abstracts (white paper)

A 10-year summary of kinase small molecule research Text mining AACR abstracts (white paper) A 10-year summary of kinase small molecule research Text mining AACR abstracts (white paper) Approved Kinase Inhibitors September 2014 Sponsored by PamGene www.pamgene.com The Kinase Activity Profiling

More information

Bioinformatics Laboratory Exercise

Bioinformatics Laboratory Exercise Bioinformatics Laboratory Exercise Biology is in the midst of the genomics revolution, the application of robotic technology to generate huge amounts of molecular biology data. Genomics has led to an explosion

More information

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Dr. Sascha Losko, Dr. Karsten Wenger, Dr. Wenzel Kalus, Dr. Andrea Ramge, Dr. Jens Wiehler,

More information

A framework for the study of diseases and adverse drug reactions

A framework for the study of diseases and adverse drug reactions A framework for the study of diseases and adverse drug reactions Laura I. Furlong IBI group Research Programme on Biomedical Informatics (GRIB) Hospital del Mar Research Institute (IMIM) Information on

More information

Phenobridge WP 7. Crossing the species bridge between mouse and human. 17 February 2015, Helmholtz Zentrum München

Phenobridge WP 7. Crossing the species bridge between mouse and human. 17 February 2015, Helmholtz Zentrum München Phenobridge WP 7 Crossing the species bridge between mouse and human 17 February 2015, Helmholtz Zentrum München Michael Raess on behalf of the WP7 collaborators Who is WP7? Helen Parkinson, Nathalie Conte,

More information

Towards an automated procedure for annotation of gene products through SAS Methods. Henrik Tveit Norwegian University of Science and Technology

Towards an automated procedure for annotation of gene products through SAS Methods. Henrik Tveit Norwegian University of Science and Technology Towards an automated procedure for annotation of gene products through SAS Methods Henrik Tveit Norwegian University of Science and Technology Overview! Biology: the Structure of Life! Functional genomics!

More information

Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer

Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer (2017). Mitelman F, Johansson B and Mertens F (Eds.), http://cgap.nci.nih.gov/chromosomes/mitel

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

How preferred are preferred terms?

How preferred are preferred terms? How preferred are preferred terms? Gintare Grigonyte 1, Simon Clematide 2, Fabio Rinaldi 2 1 Computational Linguistics Group, Department of Linguistics, Stockholm University Universitetsvagen 10 C SE-106

More information

Data Mining in Bioinformatics Day 4: Text Mining

Data Mining in Bioinformatics Day 4: Text Mining Data Mining in Bioinformatics Day 4: Text Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen Karsten Borgwardt: Data Mining in Bioinformatics, Page 1 What is text mining?

More information

Predicting disease associations via biological network analysis

Predicting disease associations via biological network analysis Sun et al. BMC Bioinformatics 2014, 15:304 RESEARCH ARTICLE Open Access Predicting disease associations via biological network analysis Kai Sun 1, Joana P Gonçalves 1, Chris Larminie 2 and Nataša Pržulj

More information

DISCOVERING IMPLICIT ASSOCIATIONS BETWEEN GENES AND HEREDITARY DISEASES

DISCOVERING IMPLICIT ASSOCIATIONS BETWEEN GENES AND HEREDITARY DISEASES DISCOVERING IMPLICIT ASSOCIATIONS BETWEEN GENES AND HEREDITARY DISEASES KAZUHIRO SEKI Graduate School of Science and Technology, Kobe University 1-1 Rokkodai, Nada, Kobe 657-8501, Japan E-mail: seki@cs.kobe-u.ac.jp

More information

Data driven Ontology Alignment. Nigam Shah

Data driven Ontology Alignment. Nigam Shah Data driven Ontology Alignment Nigam Shah nigam@stanford.edu What is Ontology Alignment? Alignment = the identification of near synonymy relationship b/w terms from different ontologies. Mapping = the

More information

Automated Annotation of Biomedical Text

Automated Annotation of Biomedical Text Automated Annotation of Biomedical Text Kevin Livingston, Ph.D. Postdoctoral Fellow Pharmacology Department, School of Medicine University of Colorado Anschutz Medical Campus Kevin.Livingston@ucdenver.edu

More information

Following virus recombination and evolution

Following virus recombination and evolution Following virus recombination and evolution Coping with the avalanche of sequencing data Dmitry Kusnetsov, Anne Gleizes, Robin Liechti, Ioannis Xenarios, Philippe Le Mercier Vital-IT/Swiss-Prot group SIB

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 An example of the gene-term-disease network automatically generated by Phenolyzer web server for 'autism'. The largest word represents the user s input term, Autism. The pink round

More information

I. Setup. - Note that: autohgpec_v1.0 can work on Windows, Ubuntu and Mac OS.

I. Setup. - Note that: autohgpec_v1.0 can work on Windows, Ubuntu and Mac OS. autohgpec: Automated prediction of novel disease-gene and diseasedisease associations and evidence collection based on a random walk on heterogeneous network Duc-Hau Le 1,*, Trang T.H. Tran 1 1 School

More information

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text Anthony Nguyen 1, Michael Lawley 1, David Hansen 1, Shoni Colquist 2 1 The Australian e-health Research Centre, CSIRO ICT

More information

NIH Public Access Author Manuscript Stud Health Technol Inform. Author manuscript; available in PMC 2010 February 28.

NIH Public Access Author Manuscript Stud Health Technol Inform. Author manuscript; available in PMC 2010 February 28. NIH Public Access Author Manuscript Published in final edited form as: Stud Health Technol Inform. 2009 ; 150: 537 541. An Evolutionary Approach to the Representation of Adverse Events Werner Ceusters

More information

Data mining with Ensembl Biomart. Stéphanie Le Gras

Data mining with Ensembl Biomart. Stéphanie Le Gras Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:

More information

Representation of Part-Whole Relationships in SNOMED CT

Representation of Part-Whole Relationships in SNOMED CT Representation of Part-Whole Relationships in SNOMED CT A. Patrice Seyed 1, Alan Rector 2, Uli Sattler 2, Bijan Parsia 2, and Robert Stevens 2 1 Department of Computer Science and Engineering, University

More information

DrugBank: a comprehensive resource for in silico drug discovery and exploration

DrugBank: a comprehensive resource for in silico drug discovery and exploration D668 D672 Nucleic Acids Research, 2006, Vol. 34, Database issue doi:10.1093/nar/gkj067 DrugBank: a comprehensive resource for in silico drug discovery and exploration David S. Wishart*, Craig Knox, An

More information

Data and text mining applied to the computational study of protein interaction networks

Data and text mining applied to the computational study of protein interaction networks Data and text mining applied to the computational study of protein interaction networks Miguel Andrade Faculty of Biology, Johannes Gutenberg University Institute of Molecular Biology Mainz, Germany andrade@uni-mainz.de

More information

EMBASE Find quick, relevant answers to your biomedical questions

EMBASE Find quick, relevant answers to your biomedical questions EMBASE Find quick, relevant answers to your biomedical questions Piotr Golkiewicz Solution Sales Manager Life Sciences Central and Eastern Europe and Russia Embase is a registered trademark of Elsevier

More information

Evidence-based Laboratory Medicine: Finding and Assessing the Evidence

Evidence-based Laboratory Medicine: Finding and Assessing the Evidence Find Assess Decide Evidence-based Laboratory Medicine: Finding and Assessing the Evidence Pieter Vermeersch, M.D. Ph.D. Laboratory Medicine, UZ Leuven November 18th 2008 Introduction Archie Cochrane (1908-1988)

More information

Human health. Molecular mechanisms of biological systems. Teaching at. Research at. Brandeis University. Marine Biological Laboratory

Human health. Molecular mechanisms of biological systems. Teaching at. Research at. Brandeis University. Marine Biological Laboratory Human health Molecular mechanisms of biological systems Research at Marine Biological Laboratory Bay Paul Center for Comparative Molecular Biology and Evolution Woods Hole, MA Teaching at Brandeis University

More information

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 www.iedb.org Bjoern Peters bpeters@liai.org La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 Overview 1. Introduction to the IEDB 2. Application: 2009 Swine-origin influenza virus

More information

Integration of Genomic, Proteomic and Biomedical Information on the Semantic Web

Integration of Genomic, Proteomic and Biomedical Information on the Semantic Web Integration of Genomic, Proteomic and Biomedical Information on the Semantic Web Bill Andreopoulos ab, Aijun An b, iangji Huang b, and Dirk Labudde a a Biotechnological Centre, Technische Universität Dresden,

More information

OMIM The Online Mendelian Inheritance in Man Knowledgebase: A Wardrobe Full of Genes. Ada Hamosh, MD, MPH

OMIM The Online Mendelian Inheritance in Man Knowledgebase: A Wardrobe Full of Genes. Ada Hamosh, MD, MPH OMIM The Online Mendelian Inheritance in Man Knowledgebase: A Wardrobe Full of Genes Ada Hamosh, MD, MPH OMIM THE ONLINE MENDELIAN INHERITANCE IN MAN KNOWLEDGEBASE: A WARDROBE FULL OF GENES The OMIM knowledgebase

More information

How to code rare diseases with international terminologies?

How to code rare diseases with international terminologies? How to code rare diseases with international terminologies? Ana Rath Inserm US14, Paris, France ana.rath@inserm.fr Special thanks to Prof Paul Landais for his kind presentation. Needs for terminologies

More information

Cellular Reproduction

Cellular Reproduction Chapter Test A CHAPTER 9 Cellular Reproduction Part A: Multiple Choice In the space at the left, write the letter of the term or phrase that best answers each question. Part B: Matching 1. What can limit

More information

Clinical NLP, PubGene Clinical trials in Coremine Oncology Text processing and information extraction for surgery planning form

Clinical NLP, PubGene Clinical trials in Coremine Oncology Text processing and information extraction for surgery planning form Clinical NLP, PubGene Clinical trials in Coremine Oncology Text processing and information extraction for surgery planning form November 2017 Dag Are Steenhoff Hov, PubGene AS 1 PubGene, founded 2001 ArrayIt

More information

CURRICULUM VITA OF Xiaowen Chen

CURRICULUM VITA OF Xiaowen Chen Xiaowen Chen College of Bioinformatics Science and Technology Harbin Medical University Harbin, 150086, P. R. China Mobile Phone:+86 13263501862 E-mail: hrbmucxw@163.com Education Experience 2008-2011

More information

SFARI Gene 2.0 User Guide

SFARI Gene 2.0 User Guide 1 SFARI Gene 2.0 User Guide This document is designed to acquaint the new user with SFARI Gene release 4.0, an integrated resource for autism research, and to provide enough information to allow the user

More information

Vocabulary: cell division, centriole, centromere, chromatid, chromatin, chromosome, cytokinesis, DNA, interphase, mitosis

Vocabulary: cell division, centriole, centromere, chromatid, chromatin, chromosome, cytokinesis, DNA, interphase, mitosis Cell Division Answer Key Vocabulary: cell division, centriole, centromere, chromatid, chromatin, chromosome, cytokinesis, DNA, interphase, mitosis Prior Knowledge Questions (Do these BEFORE using the Gizmo.)

More information

USEFULNESS OF ONTOLOGIES FOR RARE DISEASES

USEFULNESS OF ONTOLOGIES FOR RARE DISEASES USEFULNESS OF ONTOLOGIES FOR RARE DISEASES M a n u e l P o s a d a, Ve r ó n i c a A l o n s o a n d E s t r e l l a L ó p e z M a r t í n I n s t i t u t e o f R a r e D i s e a s e s R e s e a r c h

More information

Considering a new paradigm for Alzheimer s disease research a response

Considering a new paradigm for Alzheimer s disease research a response Considering a new paradigm for Alzheimer s disease research a response Martin Hofmann Apitius Department of Bioinformatics Fraunhofer Institute for Algorithms and Scientific Computing (SCAI) BioMed 21

More information

An Evolutionary Approach to the Representation of Adverse Events

An Evolutionary Approach to the Representation of Adverse Events Medical Informatics in a United and Healthy Europe K.-P. Adlassnig et al. (Eds.) IOS Press, 2009 2009 European Federation for Medical Informatics. All rights reserved. doi:10.3233/978-1-60750-044-5-537

More information

Student Exploration: Cell Division

Student Exploration: Cell Division Name: Date: Student Exploration: Cell Division Vocabulary: cell division, centriole, centromere, chromatid, chromatin, chromosome, cytokinesis, DNA, interphase, mitosis Prior Knowledge Questions (Do these

More information

Abstract. Patricia G. Melloy*

Abstract. Patricia G. Melloy* Laboratory Exercise Using an International p53 Mutation Database as a Foundation for an Online Laboratory in an Upper Level Undergraduate Biology Class ws Patricia G. Melloy* From the Department of Biological

More information

Analyzing the Semantics of Patient Data to Rank Records of Literature Retrieval

Analyzing the Semantics of Patient Data to Rank Records of Literature Retrieval Proceedings of the Workshop on Natural Language Processing in the Biomedical Domain, Philadelphia, July 2002, pp. 69-76. Association for Computational Linguistics. Analyzing the Semantics of Patient Data

More information

UPF Bioinformatics course projects

UPF Bioinformatics course projects UPF Bioinformatics course projects Students guide 2017 Didac Santesmasses (PhD) didac.santesmasses@crg.eu Aida Ripoll (Master student) aida.ripoll@crg.eu Bioinformatics and genomics programme Roderic Guigó's

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

R2 Training Courses. Release The R2 support team

R2 Training Courses. Release The R2 support team R2 Training Courses Release 2.0.2 The R2 support team Nov 08, 2018 Students Course 1 Student Course: Investigating Intra-tumor Heterogeneity 3 1.1 Introduction.............................................

More information

Introduction to the Partners Biobank Portal. December 2016

Introduction to the Partners Biobank Portal. December 2016 Introduction to the Partners Biobank Portal December 2016 Agenda About the Partners Biobank About the Biobank Portal Data Types in the Biobank Sample types DNA. Plasma and Serum Consent Genomic Data Health

More information

CELL CYCLE REGULATION AND CANCER. Cellular Reproduction II

CELL CYCLE REGULATION AND CANCER. Cellular Reproduction II CELL CYCLE REGULATION AND CANCER Cellular Reproduction II THE CELL CYCLE Interphase G1- gap phase 1- cell grows and develops S- DNA synthesis phase- cell replicates each chromosome G2- gap phase 2- cell

More information

Pilot Study: Clinical Trial Task Ontology Development. A prototype ontology of common participant-oriented clinical research tasks and

Pilot Study: Clinical Trial Task Ontology Development. A prototype ontology of common participant-oriented clinical research tasks and Pilot Study: Clinical Trial Task Ontology Development Introduction A prototype ontology of common participant-oriented clinical research tasks and events was developed using a multi-step process as summarized

More information

PubMed and Beyond: Clinical Resources from the National Library of Medicine

PubMed and Beyond: Clinical Resources from the National Library of Medicine PubMed and Beyond: Clinical Resources from the National Library of Medicine Erin Seger, MPH, CHES National Network of Libraries of Medicine, Middle Atlantic Region November 17, 2017 Objectives Find patient

More information

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences.

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences. Section D. Custom sequence annotation After this exercise you should be able to use the annotation pipelines provided by the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) to annotate

More information

Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts

Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts Erik M. van Mulligen, Zubair Afzal, Saber A. Akhondi, Dang Vo, and Jan A. Kors Department of Medical Informatics, Erasmus

More information

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein The parsimony principle: A quick review Find the tree that requires the fewest

More information

Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods

Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods D. Weissenbacher 1, A. Sarker 2, T. Tahsin 1, G. Gonzalez 2 and M. Scotch 1

More information

Drug repurposing and therapeutic anti-mirna predictions in oxldl-induced the proliferation of vascular smooth muscle cell associated diseases

Drug repurposing and therapeutic anti-mirna predictions in oxldl-induced the proliferation of vascular smooth muscle cell associated diseases Drug repurposing and therapeutic anti-mirna predictions in oxldl-induced the proliferation of vascular smooth muscle cell associated diseases Shun-Tsung Chen, Chien-Hung Huang, Victor C. Kok, Chi-Ying

More information

A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1

A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1 A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1 1 Department of Biomedical Informatics, Columbia University, New York, NY, USA 2 Department

More information

The Proteasix Ontology

The Proteasix Ontology The Proteasix Ontology Mercedes Arguello Casteleiro 1, Julie Klein 2 and Robert Stevens 1 1 School of Computer Science, University of Manchester, Oxford road, Manchester, United Kingdom. 2 Institut National

More information

Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance *

Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance * Assessing Functional Neural Connectivity as an Indicator of Cognitive Performance * Brian S. Helfer 1, James R. Williamson 1, Benjamin A. Miller 1, Joseph Perricone 1, Thomas F. Quatieri 1 MIT Lincoln

More information

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend Table of content -Supplementary methods -Figure S1 -Figure S2 -Figure S3 -Table legend Supplementary methods Yeast two-hybrid bait basal transactivation test Because bait constructs sometimes self-transactivate

More information

Ontology-based interactive visualization of patient-generated research questions

Ontology-based interactive visualization of patient-generated research questions Ontology-based interactive visualization of patient-generated research questions David Borland Laura Christopherson Charles Schmitt RENCI, UNC-Chapel Hill RENCI, UNC-Chapel Hill NIEHS Overview Background

More information

Wikipedia-Based Automatic Diagnosis Prediction in Clinical Decision Support Systems

Wikipedia-Based Automatic Diagnosis Prediction in Clinical Decision Support Systems Wikipedia-Based Automatic Diagnosis Prediction in Clinical Decision Support Systems Danchen Zhang 1, Daqing He 1, Sanqiang Zhao 1, Lei Li 1 School of Information Sciences, University of Pittsburgh, USA

More information

SpliceDB: database of canonical and non-canonical mammalian splice sites

SpliceDB: database of canonical and non-canonical mammalian splice sites 2001 Oxford University Press Nucleic Acids Research, 2001, Vol. 29, No. 1 255 259 SpliceDB: database of canonical and non-canonical mammalian splice sites M.Burset,I.A.Seledtsov 1 and V. V. Solovyev* The

More information

The Cancer Genome Atlas Project Overview

The Cancer Genome Atlas Project Overview TheCancerGenomeAtlasProjectOverview InSeptember2006,TheCancerGenomeAtlas(TCGA)programannouncedplanstomap genomicchangesoflung,brainandovariancancertumorsusinglargescalesequencing technologies. Seven institutions

More information

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA LEO TUNKLE *

HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA   LEO TUNKLE * CERNA SEARCH METHOD IDENTIFIED A MET-ACTIVATED SUBGROUP AMONG EGFR DNA AMPLIFIED LUNG ADENOCARCINOMA PATIENTS HALLA KABAT * Outreach Program, mircore, 2929 Plymouth Rd. Ann Arbor, MI 48105, USA Email:

More information

Package bionetdata. R topics documented: February 19, Type Package Title Biological and chemical data networks Version 1.0.

Package bionetdata. R topics documented: February 19, Type Package Title Biological and chemical data networks Version 1.0. Type Package Title Biological and chemical data networks Version 1.0.1 Date 2014-09-07 Package bionetdata February 19, 2015 Author Matteo Re, Giorgio Valentini -- Universita' degli Studi di Milano Maintainer

More information

Supplementary Information

Supplementary Information Supplementary Information Modelling the Yeast Interactome Vuk Janjić, Roded Sharan 2 and Nataša Pržulj, Department of Computing, Imperial College London, London, United Kingdom 2 Blavatnik School of Computer

More information

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7 SUPPLEMENTAL DATA Figure S1. Muscle mass changes in different anatomical regions with age. (A) The TA and gastrocnemius muscle showed a significant loss of weight in aged mice (24 month old) compared to

More information

Sub-Topic Classification of HIV related Opportunistic Infections. Miguel Anderson and Joseph Fonseca

Sub-Topic Classification of HIV related Opportunistic Infections. Miguel Anderson and Joseph Fonseca Sub-Topic Classification of HIV related Opportunistic Infections Miguel Anderson and Joseph Fonseca Introduction Image collected from the CDC https://www.cdc.gov/hiv/basics/statistics.html Background Info

More information

Session 4 Rebecca Poulos

Session 4 Rebecca Poulos The Cancer Genome Atlas (TCGA) & International Cancer Genome Consortium (ICGC) Session 4 Rebecca Poulos Prince of Wales Clinical School Introductory bioinformatics for human genomics workshop, UNSW 20

More information

SNPrints: Defining SNP signatures for prediction of onset in complex diseases

SNPrints: Defining SNP signatures for prediction of onset in complex diseases SNPrints: Defining SNP signatures for prediction of onset in complex diseases Linda Liu, Biomedical Informatics, Stanford University Daniel Newburger, Biomedical Informatics, Stanford University Grace

More information

Ontologies for the Study of Neurological Disease

Ontologies for the Study of Neurological Disease Alexander P. Cox 1, Mark Jensen 1, William Duncan 1, Bianca Weinstock-Guttman 3, Kinga Szigiti 3, Alan Ruttenberg 2, Barry Smith 1 and Alexander D. Diehl 3* 1 Department of Philosophy, University at Buffalo,

More information

LUMC: LOPAC screen analysis

LUMC: LOPAC screen analysis LUMC: LOPAC screen analysis Preliminary results November 9, 2012 This document briefly presents the results from the LOPAC compounds screen performed at LUMC. It contains the following information: The

More information

ABSTRACT. mircancer: a microrna-cancer Association Database and Toolkit Based on Text Mining. by Boya Xie. November, 2010

ABSTRACT. mircancer: a microrna-cancer Association Database and Toolkit Based on Text Mining. by Boya Xie. November, 2010 ABSTRACT mircancer: a microrna-cancer Association Database and Toolkit Based on Text Mining by Boya Xie November, 2010 Director: Dr. Qin Ding DEPARTMENT OF COMPUTER SCIENCE MicroRNAs (mirnas) are a class

More information

What can we contribute to cancer research and treatment from Computer Science or Mathematics? How do we adapt our expertise for them

What can we contribute to cancer research and treatment from Computer Science or Mathematics? How do we adapt our expertise for them From Bioinformatics to Health Information Technology Outline What can we contribute to cancer research and treatment from Computer Science or Mathematics? How do we adapt our expertise for them Introduction

More information

Automatically extracting, ranking and visually summarizing the treatments for a disease

Automatically extracting, ranking and visually summarizing the treatments for a disease Automatically extracting, ranking and visually summarizing the treatments for a disease Prakash Reddy Putta, B.Tech 1,2, John J. Dzak III, BS 1, Siddhartha R. Jonnalagadda, PhD 1 1 Division of Health and

More information

Blast Searcher Formative Evaluation. March 02, Adam Klinger and Josh Gutwill

Blast Searcher Formative Evaluation. March 02, Adam Klinger and Josh Gutwill Blast Searcher Formative Evaluation March 02, 2006 Keywords: Genentech Biology Life Sciences Adam Klinger and Josh Gutwill Traits of Life \ - 1 - BLAST Searcher Formative Evaluation March 2, 2006 Adam

More information

Medical information: Where to find it, what to trust. Lewis H. Rowett Executive Editor Annals of Oncology

Medical information: Where to find it, what to trust. Lewis H. Rowett Executive Editor Annals of Oncology Medical information: Where to find it, what to trust Lewis H. Rowett Executive Editor Annals of Oncology Presentation structure The problem The question +Use Google ~better Trust and trust proxies Medline

More information

CLINICAL PROCESS IMPROVEMENT INITIATIVE (CPII) EFFICIENCY REPORT EXPLANATION January 4, 2016

CLINICAL PROCESS IMPROVEMENT INITIATIVE (CPII) EFFICIENCY REPORT EXPLANATION January 4, 2016 CLINICAL PROCESS IMPROVEMENT INITIATIVE (CPII) EFFICIENCY REPORT EXPLANATION January 4, 2016 WHAT IS AN EPISODE OF CARE? An episode of care is a grouping of a patient s health care claims for a unique

More information

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise!

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! 1. What process brought 2 divergent chlorophylls into the ancestor of the cyanobacteria,

More information

University of Pittsburgh Cancer Institute UPMC CancerCenter. Uma Chandran, MSIS, PhD /21/13

University of Pittsburgh Cancer Institute UPMC CancerCenter. Uma Chandran, MSIS, PhD /21/13 University of Pittsburgh Cancer Institute UPMC CancerCenter Uma Chandran, MSIS, PhD chandran@pitt.edu 412-648-9326 2/21/13 University of Pittsburgh Cancer Institute Founded in 1985 Director Nancy Davidson,

More information

Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses. Objective

Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses. Objective Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses Objective Upon completion of this exercise, you will be able to use the Influenza Research Database (IRD; http://www.fludb.org/) to: Search

More information

Development of Description Framework of Pharmacodynamics Ontology and its Application to Possible Drug-drug Interaction Reasoning

Development of Description Framework of Pharmacodynamics Ontology and its Application to Possible Drug-drug Interaction Reasoning Development of Description Framework of Pharmacodynamics Ontology and its Application to Possible Drug-drug Interaction Reasoning Takeshi IMAI, Ph.D Masayo Hayakawa Kazuhiko OHE, MD, Ph.D The University

More information

Translational Bioinformatics: Connecting Genes with Drugs

Translational Bioinformatics: Connecting Genes with Drugs Translational Bioinformatics: Connecting Genes with Drugs Aik Choon Tan, Ph.D. Associate Professor of Bioinformatics Division of Medical Oncology Department of Medicine aikchoon.tan@ucdenver.edu 11/14/2017

More information

Title From User Usage to Subject Analysis A Case Study on the Oncogene

Title From User Usage to Subject Analysis A Case Study on the Oncogene Title From User Usage to Subject Analysis A Case Study on the Oncogene Author Chung-Yen Yu, National Taiwan Normal University, Taiwan Jiann-Cherng Shieh, National Taiwan Normal University, Taiwan Objective

More information

Automated Social Network Epidemic Data Collector

Automated Social Network Epidemic Data Collector Automated Social Network Epidemic Data Collector Luis F Lopes, João M Zamite, Bruno C Tavares, Francisco M Couto, Fabrício Silva, and Mário J Silva LaSIGE, Universidade de Lisboa epiwork@di.fc.ul.pt Abstract.

More information

Cancer Informatics Lecture

Cancer Informatics Lecture Cancer Informatics Lecture Mayo-UIUC Computational Genomics Course June 22, 2018 Krishna Rani Kalari Ph.D. Associate Professor 2017 MFMER 3702274-1 Outline The Cancer Genome Atlas (TCGA) Genomic Data Commons

More information

Standardize and Optimize. Trials and Drug Development

Standardize and Optimize. Trials and Drug Development Informatics Infrastructure to Standardize and Optimize Quantitative Imaging in Clinical Trials and Drug Development Daniel L. Rubin, MD, MS Assistant Professor of Radiology Member, Stanford Cancer Center

More information

Development of a NGS Cancer Research Database CancerBase

Development of a NGS Cancer Research Database CancerBase Development of a NGS Cancer Research Database CancerBase Quashiya M. Soudagar 1, Akshatha Prasanna 2, V. G. Shanmuga Priya 3 1 M.Tech, Bioinformatics, KLE Dr. M.S Sheshgiri College of Engineering and Technology,

More information

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, Supplementary Information Supplementary Figures Supplementary Figure 1. a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation, gene ID and specifities are provided. Those highlighted

More information

ITS accuracy at GenBank. Conrad Schoch Barbara Robbertse

ITS accuracy at GenBank. Conrad Schoch Barbara Robbertse ITS accuracy at GenBank Conrad Schoch Barbara Robbertse Improving accuracy Barcode tag in GenBank Barcode submission tool Standards RefSeq Targeted Loci Well validated sequences already in GenBank Bacteria

More information