Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR)
Yun Zhang, J. Craig Venter Institute
ASV 2016, June 19, 2016


Virus Pathogen Database and Analysis Resource (ViPR): home page overview

Free virus database and analysis web resource funded by the US National Institute of Allergy and Infectious Diseases.

Home page functions:
- Search the comprehensive database for: genomes, genes and proteins, immune epitopes, 3D protein structures, host factor data, antiviral drugs.
- Analyze data online: sequence alignment, phylogenetic tree, sequence variation (SNP), metadata-driven comparative analysis, BLAST.
- Save to Workbench: store and share data, combine working sets, integrate your data with ViPR data, store and share analyses, set custom search alerts.

Slide callouts:
- Broad coverage of human pathogenic and related viruses.
- Customized portals for high-priority featured viruses.
- Tight integration of a wide variety of data types with a broad array of analysis and visualization tools.
- Novel derived data available only in ViPR.
- Individual personal online Workbenches.

Virus families shown on the home page, grouped by genome type (single-stranded positive-sense RNA, single-stranded negative-sense RNA, double-stranded RNA, double-stranded DNA): Caliciviridae, Hepeviridae, Arenaviridae, Paramyxoviridae, Reoviridae, Herpesviridae, Coronaviridae, Flaviviridae, Picornaviridae, Togaviridae, Bunyaviridae, Filoviridae, Rhabdoviridae, Poxviridae.

Featured viruses: Dengue, Ebolavirus, Hepatitis C virus, Zika virus, Enterovirus D68.

Highlights: Mature Peptides. Mature viral protein products resulting from protease cleavage and self-cleavage of the polyprotein are displayed according to their location in the genome and the polyprotein. Key highlights: visualize viral mature peptides, display the location of each mature peptide, and access additional information relating to the mature peptide.

Virus News (as shown on the home page):
- Zika virus (updated 7 Apr 2016): Sirohi et al. reported a 3.8 Å resolution cryo-EM structure of the mature virion from ZIKV H/PF/2013. Zhu et al. compared pre-epidemic and epidemic ZIKV sequences, showing amino acid changes in epidemic strains. Faria et al. used phylogenetic analysis of new, complete genomes to estimate a single entry into Brazil in 2013, prior to the 2014 World Cup and coincident with reported outbreaks in the Pacific islands. ViPR supports research on ZIKV with weekly data updates (293 genomes as of 4 Apr), consistent mature peptide annotations, and a suite of bioinformatics analysis tools.
- Ebola virus (updated 7 Apr 2016): Corti et al. reported that human mAbs from blood samples of a 1995 EVD survivor neutralize outbreak variants of Ebola virus and protect macaques as late as 5 days after lethal challenge. ViPR supports research on Ebola virus with weekly data updates (1,794 genomes as of 4 Apr) and a suite of bioinformatics analysis tools.
- MERS-CoV (updated 7 Apr 2016): van Doremalen et al. extended earlier findings to show that when 4 amino acids in hamster DPP4 are changed to the human DPP4 residues, the hamster protein acts as a MERS-CoV receptor.

Large user community:
- 1,375 sessions/week (2015 average)
- 36% from the U.S., 64% international (Top 10 - India, UK, China, Canada, France, Brazil, Australia, Italy, South Korea)
- 152 citations in scientific publications (13JUN2016)

Reoviridae Genome Search Result: RVA Annotations

Page: ViPR Home > Reoviridae Home > Genome Search Results. The search returned 4,033 genomes, displayed 500 records per page (page 9 of 9), sorted by GenBank Protein Name in ascending order.

Slide callouts on the result columns:
- ViPR curation (>600 records)
- RotaC genotyping & ViPR concatenation
- ViPR curation
- VIGOR annotation

Example row as captured: 107E1B | Rotavirus | AB081594 | 1062 | G3-P[4] | -N/A- | Human | Homo | India | genomic | VP7 | VP7 protein

Reoviridae Genome Search

Page: ViPR Home > Reoviridae Home > Genome Search

Search for virus genomic sequences and related information. You can search the whole virus family or a specified genus, species, etc. You can also find a strain or genome record if you have its information, such as strain name or accession.

Rotavirus A (RVA), the major rotavirus group that infects humans, remains a major cause of morbidity in infants and children worldwide. Given its public health significance, ViPR has introduced dedicated options for searching RVA genomes. Make a selection in the taxonomy browser that includes RVA, or check the Rotavirus A checkbox found below the browser.

Results matching your criteria: 238

Select virus(es) to include in search (slide callout: browse to and select Rotavirus A):
- Rotavirus A: 15507/15507 strains selected (15507 strains, 26963 complete segments)
- Rotavirus B: 0/403 strains selected (403 strains, 477 complete segments)
- Rotavirus C: 0/888 strains selected (888 strains, 1114 complete segments)
- Rotavirus D: 0/45 strains selected (45 strains, 34 complete segments)
- Rotavirus F

General search options (slide callout: ViPR curated controlled vocabulary):
- Collection year: start and end (YYYY); to add a month, see Advanced Search Options: Month Range
- Geographic grouping and country
- Host selection
- Complete sequences only

Rotavirus A specific options (slide callout: ViPR annotations):
- Genotype: G (e.g., G1, G3, G%), P (e.g., P[2], P[7], P%), or mixed genotype only
- Segments
- Minimum segment length: specify segment lengths
- Complete genome: genomes with all segments

RefSeq segment lengths (nt):
VP1 (R)        3302
VP2 (C)        2693
VP3 (M)        2591
VP4 (P)        2362
NSP1 (A)       1614
VP6 (I)        1356
NSP3 (T)       1105
NSP2 (N)       1059
VP7 (G)        1062
NSP4 (E)        751
NSP5/NSP6 (H)   667
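The RVA-specific search options can be illustrated with a small filter. This is only a sketch, not ViPR's implementation: it assumes the `%` in the genotype examples (G%, P%) behaves like an SQL-style "any characters" wildcard, and the `min_fraction` completeness rule is a hypothetical stand-in for the user-specified minimum segment lengths. The function names are made up for illustration; the RefSeq lengths are the ones listed on the search page.

```python
import re

# RefSeq segment lengths in nucleotides, as listed on the search page.
REFSEQ_LENGTH_NT = {
    "VP1": 3302, "VP2": 2693, "VP3": 2591, "VP4": 2362,
    "NSP1": 1614, "VP6": 1356, "NSP3": 1105, "NSP2": 1059,
    "VP7": 1062, "NSP4": 751, "NSP5/NSP6": 667,
}


def wildcard_to_regex(pattern):
    """Translate a '%' wildcard pattern (e.g. 'G%') into an anchored regex.

    Assumption: '%' matches zero or more characters, as in SQL LIKE.
    """
    literal_parts = [re.escape(part) for part in pattern.split("%")]
    return re.compile("^" + ".*".join(literal_parts) + "$")


def matches_genotype(genotype, pattern):
    """Return True if a genotype such as 'P[8]' matches the pattern."""
    return wildcard_to_regex(pattern).match(genotype) is not None


def meets_min_length(segment, length_nt, min_fraction=0.9):
    """Hypothetical rule: length is at least min_fraction of the RefSeq length."""
    return length_nt >= min_fraction * REFSEQ_LENGTH_NT[segment]


# Toy records (segment, genotype, length in nt); values are illustrative only.
records = [("VP7", "G3", 1062), ("VP7", "G12", 900), ("VP4", "P[8]", 2362)]
hits = [r for r in records
        if matches_genotype(r[1], "G%") and meets_min_length(r[0], r[2])]
```

With the toy records above, only the full-length G3 VP7 record passes both filters.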

Genotype-based Analysis & Visualization VP7 tree

RVA Genotyping Tool

Page: ViPR Home > Reoviridae Home > Rotavirus A Genotype Determination Results. The page links to the Rotavirus A Genotyping Report (SOP) and a Download Raw Result option.

Results of Genotyped Sequences

Sequence Identifier | Segment Number | Gene Name | Genotype | Closest Strain | Query Coverage % | Ident % | E Value
gb:eu984102 Organism:Rotavirus | 11 | NSP5/NSP6 | H1 | RVA/Human-wt/BGD/Dhaka25/2002/G12P8 | 89.46 | 98.99 | 0E0
gb:eu984100 Organism:Rotavirus | 7 | NSP3 | T1 | RVA/Human-tc/USA/P/1974/G3P8 | 86.87 | 97.75 | 0E0
gb:eu984109 Organism:Rotavirus | 9 | VP7 | G1 | RVA/Human-tc/JPN/KU/1974/G1P8 | 92.37 | 93.58 | 0E0
gb:eu984098 Organism:Rotavirus | 5 | NSP1 | A1 | RVA/Human-wt/BGD/Dhaka25/2002/G12P8 | 93.41 | 97.60 | 0E0
gb:eu984103 Organism:Rotavirus | 1 | VP1 | R1 | RVA/Human-tc/USA/Wa/1974/G1P8 | 98.94 | 98.19 | 0E0
gb:eu984104 Organism:Rotavirus | 2 | VP2 | C1 | RVA/Human-tc/USA/P/1974/G3P8 | 93.22 | 97.68 | 0E0
gb:eu984099 Organism:Rotavirus | 8 | NSP2 | N1 | RVA/Human-wt/BGD/Dhaka25/2002/G12P8 | 90.08 | 92.34 | 0E0
gb:eu984101 Organism:Rotavirus | 10 | NSP4 | E1 | RVA/Human-tc/USA/D/1974/G1P8 | 70.40 | 98.67 | 0E0
gb:eu984105 Organism:Rotavirus | 3 | VP3 | M1 | RVA/Human-wt/BEL/B4633/2003/G12P8 | 96.80 | 99.04 | 0E0
gb:eu984108 Organism:Rotavirus | 6 | VP6 | I1 | RVA/Human-tc/GBR/ST3/1975/G4P6 | 88.05 | 97.49 | 0E0
gb:eu984106 Organism:Rotavirus | 4 | VP4 | P[8] | RVA/Human-wt/BEL/B3458/2003/G9P8 | 94.11 | 96.76 | 0E0

Results of Non-Genotyped Sequences

None: every sequence in the input FASTA was genotyped.
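The per-segment genotypes in the report can be combined into a whole-genome constellation. The sketch below assumes the standard RCWG ordering Gx-P[x]-Ix-Rx-Cx-Mx-Ax-Nx-Tx-Ex-Hx (VP7, VP4, VP6, VP1, VP2, VP3, NSP1, NSP2, NSP3, NSP4, NSP5/NSP6); the function name and the placeholder convention for missing segments are illustrative choices, not part of the ViPR tool.

```python
# Gene order of the genotype constellation and the placeholder used
# when a segment has no genotype call (assumed convention).
CONSTELLATION_ORDER = [
    ("VP7", "Gx"), ("VP4", "P[x]"), ("VP6", "Ix"), ("VP1", "Rx"),
    ("VP2", "Cx"), ("VP3", "Mx"), ("NSP1", "Ax"), ("NSP2", "Nx"),
    ("NSP3", "Tx"), ("NSP4", "Ex"), ("NSP5/NSP6", "Hx"),
]


def constellation(genotype_by_gene):
    """Join per-gene genotype calls into a constellation string."""
    return "-".join(
        genotype_by_gene.get(gene, placeholder)
        for gene, placeholder in CONSTELLATION_ORDER
    )


# Genotype calls copied from the report above (gene name -> genotype).
report = {
    "NSP5/NSP6": "H1", "NSP3": "T1", "VP7": "G1", "NSP1": "A1",
    "VP1": "R1", "VP2": "C1", "NSP2": "N1", "NSP4": "E1",
    "VP3": "M1", "VP6": "I1", "VP4": "P[8]",
}

print(constellation(report))
```

For the eleven sequences in the report this yields a G1-P[8] strain whose internal genes are all genotype 1.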

Exploration of RVA Genotype Data

Genotype Constellation Preference

Human RVA G-P genotypes (pie chart):
G1-P[8]    34%
G2-P[4]    14%
G9-P[8]    11%
G3-P[8]    10%
G12-P[8]    7%
G4-P[8]     4%
Others     20%

Human RVA internal gene genotypes (pie chart):
I1-R1-C1-M1-A1-N1-T1-E1-H1   70%
I2-R2-C2-M2-A2-N2-T2-E2-H2   20%
Others                       10%

G1 to G4, G9, and G12 combined with P[4] or P[8] account for 80% of all human RVA strains. Others: the majority are considered reassortants.
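Shares like those in the pie charts can be tallied from a list of per-strain genotype labels. The following is a minimal sketch with toy input; the function name and the `min_share` cutoff for lumping rare genotypes into "Others" are assumptions, not the procedure used for the slide. The last lines only check that the six named G-P genotypes on the slide add up to the stated 80%.

```python
from collections import Counter


def genotype_shares(genotypes, min_share=4.0):
    """Return percentage shares per genotype, lumping rare ones into 'Others'."""
    counts = Counter(genotypes)
    total = sum(counts.values())
    shares = {}
    others = 0.0
    for genotype, n in counts.most_common():
        pct = 100.0 * n / total
        if pct >= min_share:
            shares[genotype] = pct
        else:
            others += pct
    if others:
        shares["Others"] = others
    return shares


# Percentages as read from the G-P pie chart on the slide.
slide_shares = {
    "G1-P[8]": 34, "G2-P[4]": 14, "G9-P[8]": 11, "G3-P[8]": 10,
    "G12-P[8]": 7, "G4-P[8]": 4, "Others": 20,
}
named_total = sum(v for k, v in slide_shares.items() if k != "Others")
```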

Genotype Geographic Distribution

Figure: stacked bars (0% to 100%) showing the proportion of G-P genotypes per region. Legend: G1-P[8], G2-P[4], G9-P[8], G3-P[8], G12-P[8], G12-P[6], G4-P[8], G8-P[6], Others.

Strains per region:
Africa           783
Asia           1,314
Europe           727
North America    698
Oceania          217
South America    614
Global         4,353

Genotype Variation in the Context of Vaccination

Figure: stacked bars (0% to 100%) of VP7 genotype proportions (G1, G2, G3, G4, G8, G9, G12) per year from 2005 to 2013, with the number of VP7 sequences per year on a secondary axis. Panels: USA, Canada. Panel labels: Vaccine, No vaccine.

Antigenic Drift in the Context of Vaccination

Genetic changes in VP7/G1 (residue counts per collection period):

Country | Vaccine program | Position | Chi-square Value | P-value | <=2005 | 2006-2008 | >=2009 | B-cell Epitope
USA | Vaccine introduced in 2006 | 72 | 172.631 | 3.26E-38 | 201 Q, 6 R | 23 Q, 4 R | 4 Q, 32 R | Yes
Brazil | Vaccine introduced in 2006 | 72 | 86.739 | 1.46E-19 | 79 Q | 7 Q | 6 Q, 24 R | Yes
Thailand | No vaccine program | 72 | 14.914 | 5.77E-04 | 1 Q, 2 R | 26 Q, 5 R | 205 Q, 16 R | Yes

Figure: bar chart of Q and R counts at position 72 by period (<=2005, 2006-2008, >=2009) for the USA (vaccine), Brazil (vaccine), and Thailand (no vaccine program).
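The slide does not name the test behind the chi-square values. One plausible reading, offered here only as an assumption, is a Pearson chi-square test of independence on a 2 x 3 table (residue Q or R by collection period). The sketch below uses the counts from the table; periods with no R listed for Brazil are taken as 0 R. For 2 degrees of freedom the upper-tail p-value has the closed form exp(-x/2), so no statistics library is needed.

```python
import math


def pearson_chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a count table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


def p_value_two_dof(stat):
    """Upper-tail chi-square probability; this closed form holds only for dof == 2."""
    return math.exp(-stat / 2.0)


# Rows: Q counts, R counts. Columns: <=2005, 2006-2008, >=2009.
counts = {
    "USA": [[201, 23, 4], [6, 4, 32]],
    "Brazil": [[79, 7, 6], [0, 0, 24]],
    "Thailand": [[1, 26, 205], [2, 5, 16]],
}

for country, table in counts.items():
    stat, dof = pearson_chi_square(table)
    print(country, round(stat, 3), dof, p_value_two_dof(stat))
```

Under this assumption the statistics come out close to, though not identical with, the values on the slide, which suggests the original analysis used this test or a near variant.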

Suggestions

- Strain name: follow "Uniformity of rotavirus strain nomenclature proposed by the Rotavirus Classification Working Group (RCWG)". Arch. Virol., 2011, 156(8):1397-413. PMID: 21597953
- Metadata: missing, format, field
- Sequence data: scarcity, sampling bias
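Uniform strain names make metadata machine-readable. The sketch below parses names in the RCWG layout (RV group/species of origin/country/common name/year/G- and P-type), such as the closest-strain names in the genotyping report. The class and function names are hypothetical, and the parser accepts P-types written with or without brackets because the names captured here appear without them.

```python
import re
from typing import NamedTuple


class StrainName(NamedTuple):
    group: str        # e.g. "RVA"
    host: str         # species of origin, e.g. "Human"
    origin: str       # e.g. "wt" (wild type) or "tc" (tissue-culture adapted)
    country: str      # country code, e.g. "BGD"
    common_name: str  # e.g. "Dhaka25"
    year: int
    g_type: str       # e.g. "G12"
    p_type: str       # e.g. "P[8]"


# G and P numbers, or X when unknown; brackets around the P number are optional.
GP_PATTERN = re.compile(r"^G(\d+|X)P\[?(\d+|X)\]?$")


def parse_strain_name(name):
    """Split an RCWG-style strain name into its six slash-separated fields."""
    fields = name.split("/")
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}: {name!r}")
    group, host_origin, country, common_name, year, gp = fields
    host, _, origin = host_origin.rpartition("-")
    match = GP_PATTERN.match(gp)
    if not host or match is None:
        raise ValueError(f"unrecognized host or genotype field in {name!r}")
    return StrainName(group, host, origin, country, common_name, int(year),
                      f"G{match.group(1)}", f"P[{match.group(2)}]")


parsed = parse_strain_name("RVA/Human-wt/BGD/Dhaka25/2002/G12P8")
```

Names that do not follow the layout raise a `ValueError`, which is one simple way to flag records whose strain name needs curation.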

Acknowledgements

- Richard Scheuermann, PI
- Tim Stockwell, Viral Group Lead
- Dan Katzel
- Karla Stucker
- Seth Schobel
- Ed Klem, Technical Lead
- Sanjeev Kumar
- Hongtao Zhao
- Sherry He
- Lei Tong
- Rotavirus Classification Working Group

No. HHSN272201400028C