Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods

Size: px
Start display at page:

Download "Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods"

Transcription

1 Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods D. Weissenbacher 1, A. Sarker 2, T. Tahsin 1, G. Gonzalez 2 and M. Scotch 1 1 Department of Biomedical Informatics, Arizona State University 2 Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania

2 Outline Phylogeography Geospatial insufficiency in Sequence Databases Automatic Toponyms Detection Evaluation of Toponym Detection Next Steps

3 Learning Objectives At the end of the presentation, the audience will be able to: Define phylogeography Explain the insufficiency of the geospatial metadata in the GenBank database Define the natural language processing task of toponyms resolution List the main methods for toponym detection Report on the current performances of toponym detection when applied to free-text journal articles

4 Phylogeography Like phylogenetics, but: TX TX CA CA AZ Geography is additional dimension RNA Viruses (zoonotic) Virus DNA sequences

5

6

7 Genetic Sequence Databases GenBank, GISAID for flu, FLuDB, ViPR Contain virus sequences plus metadata: Virus Name Gene Host Location Collection Date

8

9 Insufficient Metadata Majority of records have imprecise geospatial metadata (China, USA, France, etc.) [Scotch et al., 2011; Bioinformatics] What are the options? 1. Discard record from study 2. Use it anyway (change scope) 3. Find a more precise location in the corresponding journal article

10 Our Work: Resolve Toponyms in Publications Pubmed Central Article, PMC A H1N1 virus was isolated in 2009 from a child hospitalized in Nanjing. GenBank Organism = H1N1 Host = Homo Sapiens Country = China, Nanjing

11 Toponym Resolution Overview Assign unique coordinates to all names of places in documents [Speriosu, 2013; PhD thesis] Detection Disambiguation Here a H5N1 influenza virus was isolated from a sick pigeon in Paris in We determined the complete genomic? sequence with an ABI 3730 genetic analyzer using the Sanger method.

12 Objective Previous work, disambiguation is easy but not the detection [Weissenbacher et al. 2015; Bioinformatics] Detection and Disambiguation (end-to-end): 0.68 F-score Disambiguation only (toponyms are given): 0.94 F-score Detection only: 0.72 F-score Goal: improving the toponym detection

13 Toponym Detection Difficulties Toponyms ambiguous with named entities (e.g. Paris in France vs. Paris Hilton) Other English words (e.g. May in Russia vs. May ) Acronyms, Abbreviations, Misspellings Errors when pre-processing documents

14 Detection Methods Dictionary-based: find exact match with a lexicon (e.g. Geonames) Rule-based: manually defined (e.g. stop lists, preceding words, etc.) Machine Learning (ML): weighted features (Conditional Random Fields) Deep Learning: features learned automatically

15 Conditional Random Fields Classifier (CRF) Token True Class Predicted Class Previous Token Next Token The O O:0.98 <S> H3XXX Yes H3N2 O O:0.97 The virus Yes Predict the label of each token virus O O:0.78 H3XXX was No was O O:0.99 virus infecting No infecting O O:0.98 was swines No swines O O:0.76 infecting in No in O O:0.56 swines South No South B B:0.83 in Korea Yes Korea I I:0.97 South and Yes and O O:0.56 Korea China No China B O:0.51 and. Yes. O O:0.95 China </S> No IsCapitalized given a set of features

16 Predefined Features Lexical word to classify, its lexical properties, surrounding words, POS tags Knowledge-based Geonames mapping + stop list, MetaMap mapping (UMLS) Semantic clusters of similar words

17 Gold Standard 1,881 toponyms annotated 500,000 words 60 PubMed articles on influenza 5,730 GenBank records linked Inter-annotator agreement:.97p,.98r (16 articles)

18 Results Training: 1,596 toponyms (48 documents) Testing: 285 toponyms (12 documents) Precision Recall F-Score Rule-based Naïve Bayes (ML) CRF (ML)

19 CRF Learning Curve

20 Summary Toponym detector reaching state-of-the-art performances Existing toponym resolution improved Integration of our toponym resolver in our platform to complete missing geographic metadata in GenBank

21 Ongoing Work Distance supervision: use metadata to find sentences with toponyms Deep learning training a convolutional neural network to learn the features Phylogeographic evaluation of the contribution of more precise locations to phylogeography models (in progress)

22 Thank You! Funding: NIH/NIAID R01AI s: D. Weissenbacher: M. Scotch: G. Gonzalez: A. Sarker: Resources:

A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1

A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1 A Study of Abbreviations in Clinical Notes Hua Xu MS, MA 1, Peter D. Stetson, MD, MA 1, 2, Carol Friedman Ph.D. 1 1 Department of Biomedical Informatics, Columbia University, New York, NY, USA 2 Department

More information

Asthma Surveillance Using Social Media Data

Asthma Surveillance Using Social Media Data Asthma Surveillance Using Social Media Data Wenli Zhang 1, Sudha Ram 1, Mark Burkart 2, Max Williams 2, and Yolande Pengetnze 2 University of Arizona 1, PCCI-Parkland Center for Clinical Innovation 2 {wenlizhang,

More information

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts

Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Abstracts jsci2016 Semi-Automatic Construction of Thyroid Cancer Intervention Corpus from Biomedical Wutthipong Kongburan, Praisan Padungweang, Worarat Krathu, Jonathan H. Chan School of Information Technology King

More information

Automatic Context-Aware Image Captioning

Automatic Context-Aware Image Captioning Technical Disclosure Commons Defensive Publications Series May 23, 2017 Automatic Context-Aware Image Captioning Sandro Feuz Sebastian Millius Follow this and additional works at: http://www.tdcommons.org/dpubs_series

More information

Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts

Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts Erasmus MC at CLEF ehealth 2016: Concept Recognition and Coding in French Texts Erik M. van Mulligen, Zubair Afzal, Saber A. Akhondi, Dang Vo, and Jan A. Kors Department of Medical Informatics, Erasmus

More information

Distillation of Knowledge from the Research Literatures on Alzheimer s Dementia

Distillation of Knowledge from the Research Literatures on Alzheimer s Dementia JSCI 2017 1 Distillation of Knowledge from the Research Literatures on Alzheimer s Dementia Wutthipong Kongburan, Mark Chignell, and Jonathan H. Chan School of Information Technology King Mongkut's University

More information

How preferred are preferred terms?

How preferred are preferred terms? How preferred are preferred terms? Gintare Grigonyte 1, Simon Clematide 2, Fabio Rinaldi 2 1 Computational Linguistics Group, Department of Linguistics, Stockholm University Universitetsvagen 10 C SE-106

More information

How to code rare diseases with international terminologies?

How to code rare diseases with international terminologies? How to code rare diseases with international terminologies? Ana Rath Inserm US14, Paris, France ana.rath@inserm.fr Special thanks to Prof Paul Landais for his kind presentation. Needs for terminologies

More information

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor

Text mining for lung cancer cases over large patient admission data. David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor Text mining for lung cancer cases over large patient admission data David Martinez, Lawrence Cavedon, Zaf Alam, Christopher Bain, Karin Verspoor Opportunities for Biomedical Informatics Increasing roll-out

More information

Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes

Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes Authors: Xiao Liu, Department of Management Information Systems, University of Arizona Hsinchun Chen, Department of

More information

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports

Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Memory-Augmented Active Deep Learning for Identifying Relations Between Distant Medical Concepts in Electroencephalography Reports Ramon Maldonado, BS, Travis Goodwin, PhD Sanda M. Harabagiu, PhD The University

More information

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text

A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text A Simple Pipeline Application for Identifying and Negating SNOMED CT in Free Text Anthony Nguyen 1, Michael Lawley 1, David Hansen 1, Shoni Colquist 2 1 The Australian e-health Research Centre, CSIRO ICT

More information

Brendan O Connor,

Brendan O Connor, Some slides on Paul and Dredze, 2012. Discovering Health Topics in Social Media Using Topic Models. PLOS ONE. http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0103408 Brendan O Connor,

More information

Extraction of Adverse Drug Effects from Clinical Records

Extraction of Adverse Drug Effects from Clinical Records MEDINFO 2010 C. Safran et al. (Eds.) IOS Press, 2010 2010 IMIA and SAHIA. All rights reserved. doi:10.3233/978-1-60750-588-4-739 739 Extraction of Adverse Drug Effects from Clinical Records Eiji Aramaki

More information

Deep Learning based Information Extraction Framework on Chinese Electronic Health Records

Deep Learning based Information Extraction Framework on Chinese Electronic Health Records Deep Learning based Information Extraction Framework on Chinese Electronic Health Records Bing Tian Yong Zhang Kaixin Liu Chunxiao Xing RIIT, Beijing National Research Center for Information Science and

More information

Appendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations

Appendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations From OPENFLU to OPENFMD a resource for automatic and curated nomenclature and tools for the FMD community Filip Claes, Philippe Le Mercier, Dmitry Kuznetsov, Robin Liechti, Anne Gleizes, Ioannis Xenarios,

More information

Building a framework for handling clinical abbreviations a long journey of understanding shortened words "

Building a framework for handling clinical abbreviations a long journey of understanding shortened words Building a framework for handling clinical abbreviations a long journey of understanding shortened words " Yonghui Wu 1 PhD, Joshua C. Denny 2 MD MS, S. Trent Rosenbloom 2 MD MPH, Randolph A. Miller 2

More information

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences.

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences. Section D. Custom sequence annotation After this exercise you should be able to use the annotation pipelines provided by the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) to annotate

More information

WHO s code of conduct for open and timely sharing of pathogen genetic sequence data during outbreaks of infectious disease

WHO s code of conduct for open and timely sharing of pathogen genetic sequence data during outbreaks of infectious disease 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 WHO s code of conduct for open and timely sharing of pathogen genetic sequence data during outbreaks of infectious

More information

Chapter 12 Conclusions and Outlook

Chapter 12 Conclusions and Outlook Chapter 12 Conclusions and Outlook In this book research in clinical text mining from the early days in 1970 up to now (2017) has been compiled. This book provided information on paper based patient record

More information

DATA, TOOLS AND RESOURCES FOR MINING SOCIAL MEDIA DRUG CHATTER

DATA, TOOLS AND RESOURCES FOR MINING SOCIAL MEDIA DRUG CHATTER DATA, TOOLS AND RESOURCES FOR MINING SOCIAL MEDIA DRUG CHATTER Abeed Sarker Health Language Processing Lab Institute of Biomedical Informatics Email: abeed@upenn.edu Overview Social media data and pharmacovigilance

More information

KNOWLEDGE-BASED METHOD FOR DETERMINING THE MEANING OF AMBIGUOUS BIOMEDICAL TERMS USING INFORMATION CONTENT MEASURES OF SIMILARITY

KNOWLEDGE-BASED METHOD FOR DETERMINING THE MEANING OF AMBIGUOUS BIOMEDICAL TERMS USING INFORMATION CONTENT MEASURES OF SIMILARITY KNOWLEDGE-BASED METHOD FOR DETERMINING THE MEANING OF AMBIGUOUS BIOMEDICAL TERMS USING INFORMATION CONTENT MEASURES OF SIMILARITY 1 Bridget McInnes Ted Pedersen Ying Liu Genevieve B. Melton Serguei Pakhomov

More information

Sub-Topic Classification of HIV related Opportunistic Infections. Miguel Anderson and Joseph Fonseca

Sub-Topic Classification of HIV related Opportunistic Infections. Miguel Anderson and Joseph Fonseca Sub-Topic Classification of HIV related Opportunistic Infections Miguel Anderson and Joseph Fonseca Introduction Image collected from the CDC https://www.cdc.gov/hiv/basics/statistics.html Background Info

More information

Medical Knowledge Attention Enhanced Neural Model. for Named Entity Recognition in Chinese EMR

Medical Knowledge Attention Enhanced Neural Model. for Named Entity Recognition in Chinese EMR Medical Knowledge Attention Enhanced Neural Model for Named Entity Recognition in Chinese EMR Zhichang Zhang, Yu Zhang, Tong Zhou College of Computer Science and Engineering, Northwest Normal University,

More information

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains

Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Knowledge networks of biological and medical data An exhaustive and flexible solution to model life sciences domains Dr. Sascha Losko, Dr. Karsten Wenger, Dr. Wenzel Kalus, Dr. Andrea Ramge, Dr. Jens Wiehler,

More information

READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT

READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT READ-BIOMED-SS: ADVERSE DRUG REACTION CLASSIFICATION OF MICROBLOGS USING EMOTIONAL AND CONCEPTUAL ENRICHMENT BAHADORREZA OFOGHI 1, SAMIN SIDDIQUI 1, and KARIN VERSPOOR 1,2 1 Department of Computing and

More information

Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain

Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain Original Article Healthc Inform Res. 2015 January;21(1):35-42. pissn 2093-3681 eissn 2093-369X Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical

More information

Following virus recombination and evolution

Following virus recombination and evolution Following virus recombination and evolution Coping with the avalanche of sequencing data Dmitry Kusnetsov, Anne Gleizes, Robin Liechti, Ioannis Xenarios, Philippe Le Mercier Vital-IT/Swiss-Prot group SIB

More information

Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments

Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments University of Wisconsin Milwaukee UWM Digital Commons Theses and Dissertations December 2014 Text Mining of Patient Demographics and Diagnoses from Psychiatric Assessments Eric James Klosterman University

More information

Modeling the Antigenic Evolution of Influenza Viruses from Sequences

Modeling the Antigenic Evolution of Influenza Viruses from Sequences Modeling the Antigenic Evolution of Influenza Viruses from Sequences Taijiao Jiang Center of Systems Medicine, Chinese Academy of Medical Sciences Suzhou Institute of Systems Medicine October 8-10, 2015.

More information

Origins and evolutionary genomics of the novel avian-origin H7N9 influenza A virus in China: Early findings

Origins and evolutionary genomics of the novel avian-origin H7N9 influenza A virus in China: Early findings Origins and evolutionary genomics of the novel 2013 avian-origin H7N9 influenza A virus in : Early findings Jiankui He*, Luwen Ning, Yin Tong Department of Biology, South University of Science and Technology

More information

Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016

Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016 Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016 Loading Virus Pathogen Database and Analysis About Resource

More information

Wikipedia-Based Automatic Diagnosis Prediction in Clinical Decision Support Systems

Wikipedia-Based Automatic Diagnosis Prediction in Clinical Decision Support Systems Wikipedia-Based Automatic Diagnosis Prediction in Clinical Decision Support Systems Danchen Zhang 1, Daqing He 1, Sanqiang Zhao 1, Lei Li 1 School of Information Sciences, University of Pittsburgh, USA

More information

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 www.iedb.org Bjoern Peters bpeters@liai.org La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 Overview 1. Introduction to the IEDB 2. Application: 2009 Swine-origin influenza virus

More information

Animal Disease Event Recognition and Classification

Animal Disease Event Recognition and Classification 19th World Wide Web Conference WWW-2010 26-30 April 2010: Raleigh Conference Center, Raleigh, NC, USA Animal Disease Event Recognition and Classification Svitlana Volkova, Doina Caragea, William H. Hsu,

More information

CLAMP-Cancer an NLP tool to facilitate cancer research using EHRs Hua Xu, PhD

CLAMP-Cancer an NLP tool to facilitate cancer research using EHRs Hua Xu, PhD CLAMP-Cancer an NLP tool to facilitate cancer research using EHRs Hua Xu, PhD School of Biomedical Informatics The University of Texas Health Science Center at Houston 1 Advancing Cancer Pharmacoepidemiology

More information

TeamHCMUS: Analysis of Clinical Text

TeamHCMUS: Analysis of Clinical Text TeamHCMUS: Analysis of Clinical Text Nghia Huynh Faculty of Information Technology University of Science, Ho Chi Minh City, Vietnam huynhnghiavn@gmail.com Quoc Ho Faculty of Information Technology University

More information

HIV Informatics Service

HIV Informatics Service HIV Informatics Service Understanding HIV with high tech computational molecular algorithms HIV NGS data analysis Drug resistance prediction Treatment strategy suggestion Geographical movement tracking

More information

Detecting Patient Complexity from Free Text Notes Using a Hybrid AI Approach

Detecting Patient Complexity from Free Text Notes Using a Hybrid AI Approach Detecting Patient Complexity from Free Text Notes Using a Hybrid AI Approach Malcolm Pradhan, CMO MBBS, PhD, FACHI Daniel Padilla, ML Engineer BEng,, PhD Alcidion Corporation Overview Alcidion s Natural

More information

Standardize and Optimize. Trials and Drug Development

Standardize and Optimize. Trials and Drug Development Informatics Infrastructure to Standardize and Optimize Quantitative Imaging in Clinical Trials and Drug Development Daniel L. Rubin, MD, MS Assistant Professor of Radiology Member, Stanford Cancer Center

More information

Health monitoring and surveillance from social media

Health monitoring and surveillance from social media Health monitoring and surveillance from social media A discussion of past work, progress, easy and difficult tasks Abeed Sarker (@sarkerabeed ) Research Associate Department of Biostatistics, Epidemiology

More information

Mapping the Antigenic and Genetic Evolution of Influenza Virus

Mapping the Antigenic and Genetic Evolution of Influenza Virus Mapping the Antigenic and Genetic Evolution of Influenza Virus Derek J. Smith, Alan S. Lapedes, Jan C. de Jong, Theo M. Bestebroer, Guus F. Rimmelzwaan, Albert D. M. E. Osterhaus, Ron A. M. Fouchier Science

More information

Use of Twitter to Assess Sentiment toward Waterpipe Tobacco Smoking

Use of Twitter to Assess Sentiment toward Waterpipe Tobacco Smoking @ColditzJB #SBM2016 Use of Twitter to Assess Sentiment toward Waterpipe Tobacco Smoking Jason B. Colditz, MEd Maharsi Naidu, Class of 2018 Noah A. Smith, PhD Joel Welling, PhD Brian A. Primack, MD, PhD

More information

SNOMED CT and Orphanet working together

SNOMED CT and Orphanet working together SNOMED CT and Orphanet working together Ian Green Business Services Executive, IHTSDO Dr. Romina Armando INSERM Session outline What is Orphanet? Rare disorders Orphanet nomenclature Mappings to other

More information

Connecting Distant Entities with Induction through Conditional Random Fields for Named Entity Recognition: Precursor-Induced CRF

Connecting Distant Entities with Induction through Conditional Random Fields for Named Entity Recognition: Precursor-Induced CRF Connecting Distant Entities with Induction through Conditional Random Fields for Named Entity Recognition: Precursor-Induced Wangjin Lee 1 and Jinwook Choi 1,2,3 * 1 Interdisciplinary Program for Bioengineering,

More information

SpliceDB: database of canonical and non-canonical mammalian splice sites

SpliceDB: database of canonical and non-canonical mammalian splice sites 2001 Oxford University Press Nucleic Acids Research, 2001, Vol. 29, No. 1 255 259 SpliceDB: database of canonical and non-canonical mammalian splice sites M.Burset,I.A.Seledtsov 1 and V. V. Solovyev* The

More information

Synthetic Genomics and Its Application to Viral Infectious Diseases. Timothy Stockwell (JCVI) David Wentworth (JCVI)

Synthetic Genomics and Its Application to Viral Infectious Diseases. Timothy Stockwell (JCVI) David Wentworth (JCVI) Synthetic Genomics and Its Application to Viral Infectious Diseases Timothy Stockwell (JCVI) David Wentworth (JCVI) Outline Using informatics to predict drift (strain selection) Synthetic Genomics: Preparedness

More information

Natural Language Processing for EHR-Based Computational Phenotyping

Natural Language Processing for EHR-Based Computational Phenotyping IEEE TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (ACCEPTED) 1 Natural Language Processing for EHR-Based Computational Phenotyping Zexian Zeng, Yu Deng, Xiaoyu Li, Tristan Naumann, Yuan Luo

More information

Biomedical resources for text mining

Biomedical resources for text mining August 30, 2005 Workshop Terminologies and ontologies in biomedicine: Can text mining help? Biomedical resources for text mining Olivier Bodenreider Lister Hill National Center for Biomedical Communications

More information

FDA Workshop NLP to Extract Information from Clinical Text

FDA Workshop NLP to Extract Information from Clinical Text FDA Workshop NLP to Extract Information from Clinical Text Murthy Devarakonda, Ph.D. Distinguished Research Staff Member PI for Watson Patient Records Analytics Project IBM Research mdev@us.ibm.com *This

More information

RITM Laboratory Response During the 2009 Influenza H1N1 Pandemic

RITM Laboratory Response During the 2009 Influenza H1N1 Pandemic RITM Laboratory Response During the 2009 Influenza H1N1 Pandemic REMIGIO M. OLVEDA, MD Director IV Research Institute for Tropical Medicine-DOH Philippines RITM Pre-Pandemic Preparedness Plan Ongoing surveillance

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Sirkka-Liisa Varvio sirkka-liisa.varvio@helsinki.fi Autumn 2009, I period www.cs.helsinki.fi/mbi/courses/09-10/itb 582606 Introduction to Bioinformatics, Autumn 2009 8. Sept

More information

Regional bioinformatics network. Aurelie Brioudes On behalf of Filip Claes, Regional Laboratory Coordinator

Regional bioinformatics network. Aurelie Brioudes On behalf of Filip Claes, Regional Laboratory Coordinator Regional bioinformatics network Aurelie Brioudes On behalf of Filip Claes, Regional Laboratory Coordinator Virus evolution Novel serotypes H5N6 H5N8 New reassortments H5N2 Other EIDs 2 Need to know which

More information

VIP: an integrated pipeline for metagenomics of virus

VIP: an integrated pipeline for metagenomics of virus VIP: an integrated pipeline for metagenomics of virus identification and discovery Yang Li 1, Hao Wang 2, Kai Nie 1, Chen Zhang 1, Yi Zhang 1, Ji Wang 1, Peihua Niu 1 and Xuejun Ma 1 * 1. Key Laboratory

More information

Big Data in Healthcare: motivation, current state and specific use cases

Big Data in Healthcare: motivation, current state and specific use cases Big Data in Healthcare: motivation, current state and specific use cases Alejandro Rodríguez González Centro de Tecnología Biomédica Universidad Politécnica de Madrid Who we are?

More information

Epimining: Using Web News for Influenza Surveillance

Epimining: Using Web News for Influenza Surveillance Epimining: Using Web News for Influenza Surveillance Didier Breton 1, Sandra Bringay 2 3, François Marques 1, Pascal Poncelet 2, Mathieu Roche 2 1 Nevantropic, France 2 LIRMM CNRS, Univ. Montpellier 2,

More information

Pandemic Preparedness

Pandemic Preparedness Pandemic Preparedness Disclosure Dr. Michael Braida is a full-time International SOS / MedAire employee Regional Medical Director Northern Seas based in Aberdeen MedAire Medical Director for Europe Medical

More information

Social Media Mining for Toxicovigilance

Social Media Mining for Toxicovigilance Social Media Mining for Toxicovigilance Automatic Monitoring of Prescription Medication Abuse from Twitter Abeed Sarker (@sarkerabeed) Health Language Processing Lab Research Associate Department of Biostatistics,

More information

Identifying, Preparing for & Reducing Pandemic Risk

Identifying, Preparing for & Reducing Pandemic Risk Identifying, Preparing for & Reducing Pandemic Risk Jonna Mazet, DVM, MPVM, PhD Professor of Epidemiology & Disease Ecology One Health Institute School of Veterinary Medicine University of California,

More information

Palindromes drive the re-assortment in Influenza A

Palindromes drive the re-assortment in Influenza A www.bioinformation.net Hypothesis Volume 7(3) Palindromes drive the re-assortment in Influenza A Abdullah Zubaer 1, 2 * & Simrika Thapa 1, 2 1Swapnojaatra Bioresearch Laboratory, DataSoft Systems, Dhaka-15,

More information

Tweet Location Detection

Tweet Location Detection Tweet Location Detection Bahareh Rahmanzadeh Heravi Insight Centre for Data Analytics National University of Ireland Galway, Ireland Bahareh.Heravi@insightcentre.org Ihab Salawdeh Insight Centre for Data

More information

Retrieving disorders and findings: Results using SNOMED CT and NegEx adapted for Swedish

Retrieving disorders and findings: Results using SNOMED CT and NegEx adapted for Swedish Retrieving disorders and findings: Results using SNOMED CT and NegEx adapted for Swedish Maria Skeppstedt 1,HerculesDalianis 1,andGunnarHNilsson 2 1 Department of Computer and Systems Sciences (DSV)/Stockholm

More information

. Semi-automatic WordNet Linking using Word Embeddings. Kevin Patel, Diptesh Kanojia and Pushpak Bhattacharyya Presented by: Ritesh Panjwani

. Semi-automatic WordNet Linking using Word Embeddings. Kevin Patel, Diptesh Kanojia and Pushpak Bhattacharyya Presented by: Ritesh Panjwani Semi-automatic WordNet Linking using Word Embeddings Kevin Patel, Diptesh Kanojia and Pushpak Bhattacharyya Presented by: Ritesh Panjwani January 11, 2018 Kevin Patel WordNet Linking via Embeddings 1/22

More information

Planning Informed by Epidemiological Simulation: The Geography of Avian Flu in Nigeria

Planning Informed by Epidemiological Simulation: The Geography of Avian Flu in Nigeria Planning Informed by Epidemiological Simulation: The Geography of Avian Flu in Nigeria Jeanne Fair, Ph.D. Biosecurity & Public Health Group (B-7) Los Alamos National Laboratory Slide 1 Geospatial Epidemiology

More information

Exploiting Task-Oriented Resources to Learn Word Embeddings for Clinical Abbreviation Expansion

Exploiting Task-Oriented Resources to Learn Word Embeddings for Clinical Abbreviation Expansion Exploiting Task-Oriented Resources to Learn Word Embeddings for Clinical Abbreviation Expansion Yue Liu 1, Tao Ge 2, Kusum Mathews 3, Heng Ji 1, Deborah L. McGuinness 1 1 Department of Computer Science,

More information

Avian Influenza/Newcastle Disease Virus Subcommittee

Avian Influenza/Newcastle Disease Virus Subcommittee Avian Influenza/Newcastle Disease Virus Subcommittee David L. Suarez D.V.M., Ph.D. Patti Miller D.V.M., Ph.D. Claudio Afonso Ph.D. Southeast Poultry Research Laboratory United States National Poultry Research

More information

MedDRA Coding/ AE Log Item 1 Refresher. ASPIRE Protocol Team Meeting February 10, 2013

MedDRA Coding/ AE Log Item 1 Refresher. ASPIRE Protocol Team Meeting February 10, 2013 MedDRA Coding/ AE Log Item 1 Refresher ASPIRE Protocol Team Meeting February 10, 2013 MedDRA Coding Overview MedDRA: standardized dictionary of medical terminology Results from ICH initiative to standardize

More information

Contribution of avian influenza data through OFFLU network

Contribution of avian influenza data through OFFLU network Dr Gounalan Pavade Chargé de Mission, OIE Headquarters Contribution of avian influenza data through OFFLU network Asia-Pacific Workshop on surveillance, prevention and control of zoonotic influenza Paro,

More information

Influenza A(H5N1) activity from 27 September 2010 to 15 February 2011

Influenza A(H5N1) activity from 27 September 2010 to 15 February 2011 Antigenic and genetic characteristics of influenza A(H5N1) and influenza A(H9N2) viruses for the development of candidate vaccine viruses for pandemic preparedness February 2011 The development of representative

More information

The Impact of Belief Values on the Identification of Patient Cohorts

The Impact of Belief Values on the Identification of Patient Cohorts The Impact of Belief Values on the Identification of Patient Cohorts Travis Goodwin, Sanda M. Harabagiu Human Language Technology Research Institute University of Texas at Dallas Richardson TX, 75080 {travis,sanda}@hlt.utdallas.edu

More information

Project PRACE 1IP, WP7.4

Project PRACE 1IP, WP7.4 Project PRACE 1IP, WP7.4 Plamenka Borovska, Veska Gancheva Computer Systems Department Technical University of Sofia The Team is consists of 5 members: 2 Professors; 1 Assist. Professor; 2 Researchers;

More information

Outbreaks and control of swine influenza viruses in Korea

Outbreaks and control of swine influenza viruses in Korea Inception meeting of the OIE/JTF Project for Controlling Zoonoses in Asia under One Health Concept Outbreaks and control of swine influenza viruses in Korea Tokyo, Japan, 19-20, December 2013 Yeun-Kyung

More information

Ambiguity of Human Gene Symbols in LocusLink and MEDLINE: Creating an Inventory and a Disambiguation Test Collection

Ambiguity of Human Gene Symbols in LocusLink and MEDLINE: Creating an Inventory and a Disambiguation Test Collection Ambiguity of Human Gene Symbols in LocusLink and MEDLINE: Creating an Inventory and a Disambiguation Test Collection Marc Weeber, PhD, Bob J. A. Schijvenaars, PhD, Erik M. van Mulligen, PhD, Barend Mons,

More information

Automatically extracting, ranking and visually summarizing the treatments for a disease

Automatically extracting, ranking and visually summarizing the treatments for a disease Automatically extracting, ranking and visually summarizing the treatments for a disease Prakash Reddy Putta, B.Tech 1,2, John J. Dzak III, BS 1, Siddhartha R. Jonnalagadda, PhD 1 1 Division of Health and

More information

Object Detectors Emerge in Deep Scene CNNs

Object Detectors Emerge in Deep Scene CNNs Object Detectors Emerge in Deep Scene CNNs Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba Presented By: Collin McCarthy Goal: Understand how objects are represented in CNNs Are

More information

Boundary identification of events in clinical named entity recognition

Boundary identification of events in clinical named entity recognition Boundary identification of events in clinical named entity recognition Azad Dehghan School of Computer Science The University of Manchester Manchester, UK a.dehghan@cs.man.ac.uk Abstract The problem of

More information

Headings: Information Extraction. Natural Language Processing. Blogs. Discussion Forums. Named Entity Recognition

Headings: Information Extraction. Natural Language Processing. Blogs. Discussion Forums. Named Entity Recognition Het R. Mehta. Quantitative Analysis of Physician language and Patient language in Social Media. A Master s Paper for the M.S. in IS degree. April 2017. 33 pages. Advisor: Stephanie W. Haas There has been

More information

Learning the Fine-Grained Information Status of Discourse Entities

Learning the Fine-Grained Information Status of Discourse Entities Learning the Fine-Grained Information Status of Discourse Entities Altaf Rahman and Vincent Ng Human Language Technology Research Institute The University of Texas at Dallas Plan for the talk What is Information

More information

Guidelines for Wildlife Disease Surveillance: An Overview 1

Guidelines for Wildlife Disease Surveillance: An Overview 1 Guidelines for Wildlife Disease Surveillance: An Overview 1 Purpose of Wildlife Disease Surveillance Wildlife disease surveillance can be a useful and complementary component of human and animal disease

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

Exploiting Task-Oriented Resources to Learn Word Embeddings for Clinical Abbreviation Expansion

Exploiting Task-Oriented Resources to Learn Word Embeddings for Clinical Abbreviation Expansion Exploiting Task-Oriented Resources to Learn Word Embeddings for Clinical Abbreviation Expansion Yue Liu 1, Tao Ge 2, Kusum S. Mathews 3, Heng Ji 1, Deborah L. McGuinness 1 1 Department of Computer Science,

More information

Schema-Driven Relationship Extraction from Unstructured Text

Schema-Driven Relationship Extraction from Unstructured Text Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 2007 Schema-Driven Relationship Extraction from Unstructured Text Cartic

More information

Clinical Event Detection with Hybrid Neural Architecture

Clinical Event Detection with Hybrid Neural Architecture Clinical Event Detection with Hybrid Neural Architecture Adyasha Maharana Biomedical and Health Informatics University of Washington, Seattle adyasha@uw.edu Meliha Yetisgen Biomedical and Health Informatics

More information

A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD)

A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD) Journal of the American Medical Informatics Association, 24(e1), 2017, e79 e86 doi: 10.1093/jamia/ocw109 Advance Access Publication Date: 18 August 2016 Research and Applications Research and Applications

More information

Clinical Narratives Context Categorization: The Clinician Approach using RapidMiner

Clinical Narratives Context Categorization: The Clinician Approach using RapidMiner , pp.128-138 http://dx.doi.org/10.14257/astl.2014.51.30 Clinical Narratives Context Categorization: The Clinician Approach using RapidMiner Osama Mohammed 1, Sabah Mohammed 2, Jinan Fiaidhi 2, Simon Fong

More information

WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions

WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions WikiWarsDE: A German Corpus of Narratives Annotated with Temporal Expressions Jannik Strötgen, Michael Gertz Institute of Computer Science, Heidelberg University Im Neuenheimer Feld 348, 69120 Heidelberg,

More information

Influenza; tracking an emerging pathogen by popularity of Google Searches

Influenza; tracking an emerging pathogen by popularity of Google Searches Davids 1 Influenza; tracking an emerging pathogen by popularity of Google Searches Background Influenza is a wide spread and occasionally fatal disease, typically infecting children and the elderly. Each

More information

May All Your Wishes Come True: A Study of Wishes and How to Recognize Them

May All Your Wishes Come True: A Study of Wishes and How to Recognize Them May All Your Wishes Come True: A Study of Wishes and How to Recognize Them Andrew B. Goldberg, Nathanael Fillmore, David Andrzejewski, Zhiting Xu, Bryan Gibson & Xiaojin Zhu Computer Sciences Department

More information

Workshop/Hackathon for the Wordnet Bahasa

Workshop/Hackathon for the Wordnet Bahasa Workshop/Hackathon for the Wordnet Bahasa Francis Bond Computational Linguistics Group Linguistics and Multilingual Studies, Nanyang Technological University 2014-10-25 Wordnet Bahasa Boleh! Overview Thanks

More information

The Inclusion of Seasonal Influenza Viruses and Genetic Sequence Data (GSD) in the Context of the Pandemic Influenza Preparedness (PIP) Framework

The Inclusion of Seasonal Influenza Viruses and Genetic Sequence Data (GSD) in the Context of the Pandemic Influenza Preparedness (PIP) Framework The Inclusion of Seasonal Influenza Viruses and Genetic Sequence Data (GSD) in the Context of the Pandemic Influenza Preparedness (PIP) Framework A position paper from the Directors of the GISRS WHO Collaborating

More information

Shu Kong. Department of Computer Science, UC Irvine

Shu Kong. Department of Computer Science, UC Irvine Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge and philosophy 4. Fine-grained classification with

More information

Current Vaccines: Progress & Challenges. Influenza Vaccine what are the challenges?

Current Vaccines: Progress & Challenges. Influenza Vaccine what are the challenges? Current Vaccines: Progress & Challenges Influenza Vaccine what are the challenges? Professor John S. Tam The Hong Kong Polytechnic University Asia-Pacific Alliance for the Control of Influenza (APACI)

More information

General Symptom Extraction from VA Electronic Medical Notes

General Symptom Extraction from VA Electronic Medical Notes General Symptom Extraction from VA Electronic Medical Notes Guy Divita a,b, Gang Luo, PhD c, Le-Thuy T. Tran, PhD a,b, T. Elizabeth Workman, PhD a,b, Adi V. Gundlapalli, MD, PhD a,b, Matthew H. Samore,

More information

DOWNLOAD EASYCIE AND DATASET

DOWNLOAD EASYCIE AND DATASET DOWNLOAD EASYCIE AND DATASET https://goo.gl/evdf1c https://goo.gl/xjr7xe EASYCIE: A DEVELOPMENT PLATFORM TO SUPPORT QUICK AND EASY, RULE-BASED CLINICAL INFORMATION EXTRACTION JIANLIN SHI, MS MD DANIELLE

More information

TWITTER SENTIMENT ANALYSIS TO STUDY ASSOCIATION BETWEEN FOOD HABIT AND DIABETES. A Thesis by. Nazila Massoudian

TWITTER SENTIMENT ANALYSIS TO STUDY ASSOCIATION BETWEEN FOOD HABIT AND DIABETES. A Thesis by. Nazila Massoudian TWITTER SENTIMENT ANALYSIS TO STUDY ASSOCIATION BETWEEN FOOD HABIT AND DIABETES A Thesis by Nazila Massoudian Bachelor of Science, Tehran Azad University of Engineering and Technology, 2006 Submitted to

More information

GENETIC TESTING FOR PREDICTING RISK OF NONFAMILIAL BREAST CANCER

GENETIC TESTING FOR PREDICTING RISK OF NONFAMILIAL BREAST CANCER Non-Discrimination Statement and Multi-Language Interpreter Services information are located at the end of this document. Coverage for services, procedures, medical devices and drugs are dependent upon

More information

Study the Evolution of the Avian Influenza Virus

Study the Evolution of the Avian Influenza Virus Designing an Algorithm to Study the Evolution of the Avian Influenza Virus Arti Khana Mentor: Takis Benos Rachel Brower-Sinning Department of Computational Biology University of Pittsburgh Overview Introduction

More information

Building Evaluation Scales for NLP using Item Response Theory

Building Evaluation Scales for NLP using Item Response Theory Building Evaluation Scales for NLP using Item Response Theory John Lalor CICS, UMass Amherst Joint work with Hao Wu (BC) and Hong Yu (UMMS) Motivation Evaluation metrics for NLP have been mostly unchanged

More information

OIE Situation Report for Avian Influenza

OIE Situation Report for Avian Influenza OIE Situation Report for Avian Influenza Latest update: 24/04/2017 This report presents an overview of current disease events reported to the OIE by its Members. The objective is to describe what is happening

More information

arxiv: v1 [cs.ai] 27 Apr 2015

arxiv: v1 [cs.ai] 27 Apr 2015 Concept Extraction to Identify Adverse Drug Reactions in Medical Forums: A Comparison of Algorithms Alejandro Metke-Jimenez Sarvnaz Karimi CSIRO, Australia Email: {alejandro.metke,sarvnaz.karimi}@csiro.au

More information