Influenza Virus HA Subtype Numbering Conversion Tool and the Identification of Candidate Cross-Reactive Immune Epitopes

Similar documents
Module 3. Genomic data and annotations in public databases Exercises Custom sequence annotation

SEQUENCE FEATURE VARIANT TYPES

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences.

EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT ENHANCEMENTS. R. Burke Squires

Fondation Merieux J Craig Venter Institute Bioinformatics Workshop. December 5 8, 2017

PROTOCOL FOR INFLUENZA A VIRUS GLOBAL SWINE H1 CLADE CLASSIFICATION

Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses. Objective

Conserved Epitope Regions

Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016

Automated Quantification and Description of the Evolutionary Patterns of Influenza Viruses in U.S. Swine

User Guide. Protein Clpper. Statistical scoring of protease cleavage sites. 1. Introduction Protein Clpper Analysis Procedure...

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and Worldwide.

Final Report. Identification of Residue Mutations that Increase the Binding Affinity of LSTc to HA RBD. Wendy Fong ( 方美珍 ) CNIC; Beijing, China

Section B. Comparative Genomics Analysis of 2013 H7N9 Influenza A Viruses. Objective

Evolution of influenza

Bioinformation Volume 5

Hands-On Ten The BRCA1 Gene and Protein

SMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012

Title: Prediction of HIV-1 virus-host protein interactions using virus and host sequence motifs

Broadly protective influenza vaccines for pandemic preparedness. Suresh Mittal Department of Comparative Pathobiology Purdue University

Section D. Identification of serotype-specific amino acid positions in DENV NS1. Objective

Mapping the Antigenic and Genetic Evolution of Influenza Virus

Going Nowhere Fast: Lentivirus genetic sequence evolution does not correlate with phenotypic evolution.

Crystal structure of the neutralizing antibody HK20 in complex with its gp41 antigen

Data Mining in Bioinformatics Day 9: String & Text Mining in Bioinformatics

Influenza IDO. Influenza Ontology

It is well known that some pathogenic microbes undergo

Appendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations

Influenza H3N2 Virus Variation Analysis

7/14/2014 VACCINE-INDUCED ANTI-HA2 ANTIBODIES PROMOTE VIRUS FUSION AND ENHANCE INFLUENZA VIRUS RESPIRATORY DISEASE (VAERD)

Origins and evolutionary genomics of the novel avian-origin H7N9 influenza A virus in China: Early findings

Multiple sequence alignment

YUMI YAMAGUCHI-KABATA AND TAKASHI GOJOBORI* Center for Information Biology, National Institute of Genetics, Mishima , Japan

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise!

COMPUTER PREDICTION OF PROTEIN SURFACE FEATURES AND ANTIGENIC DETERMINANTS

Min Levine, Ph. D. Influenza Division US Centers for Disease Control and Prevention. June 18, 2015 NIBSC

Rajesh Kannangai Phone: ; Fax: ; *Corresponding author

Cover Page. The handle holds various files of this Leiden University dissertation

Patterns of hemagglutinin evolution and the epidemiology of influenza

A HLA-DRB supertype chart with potential overlapping peptide binding function

CoCoLysis: A Web-Accessible Coiled-Coil Protein Database with Analysis Tools

Immunogenicity of Avian Influenza H7N9 Virus in Birds

Evolutionary interactions between haemagglutinin and neuraminidase in avian influenza

Bioinformatics and Comparative Genomics Applications

Bioinformatic analyses: methodology for allergen similarity search. Zoltán Divéki, Ana Gomes EFSA GMO Unit

Colorspace & Matching

VIP: an integrated pipeline for metagenomics of virus

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Data mining with Ensembl Biomart. Stéphanie Le Gras

Hemagglutinin-stalk specific antibodies: How to induce them and how to measure them

H C. C α. Proteins perform a vast array of biological function including: Side chain

COMPUTATIONAL ANALYSIS OF CONSERVED AND MUTATED AMINO ACIDS IN GP160 PROTEIN OF HIV TYPE-1

A Method to Determine an E-score Threshold to Identify Potential Cross Reactive Allergens

Viral and Host Factors in Vulvar Disease DR MICHELLE ETHERSON 26 TH OF APRIL 2016

Study the Evolution of the Avian Influenza Virus

CONSTRUCTION OF PHYLOGENETIC TREE USING NEIGHBOR JOINING ALGORITHMS TO IDENTIFY THE HOST AND THE SPREADING OF SARS EPIDEMIC

Public health relevant virological features of Influenza A(H7N9) causing human infection in China

Sebastian Jaenicke. trnascan-se. Improved detection of trna genes in genomic sequences

Phylogenetic Methods

The Influenza Virus Resource at the National Center for Biotechnology Information. Running Title: NCBI Influenza Virus Resource ACCEPTED

Cerner COMPASS ICD-10 Transition Guide

HIV-1 acute infection: evidence for selection?

SUPPLEMENTARY INFORMATION

Influenza Sequence Feature Variant Type (Flu-SFVT) analysis: evidence for a role of

Structural Analysis of TCRpMHC Complexes Using Computational Tools. Feroze Mohideen Briarcliff High School

BIRKBECK COLLEGE (University of London)

Received 25 April 2002/Accepted 21 May 2002

Ras and Cell Signaling Exercise

Four melanocyte-stimulating hormones have the following amino acid sequences:

For all of the following, you will have to use this website to determine the answers:

Problem Set 2 September 18, 2009

PAirwise Sequence Comparison (PASC) and Its Application in the Classification of Filoviruses

ABSTRACT 1. INTRODUCTION. Wei Hu. JBiSE

Re-engineering HA as a strategy to develop universal influenza vaccines

Phenotype Inference from Genotype in RNA Viruses

Newborn Screening Issues and Answers Workshop: Laboratory Health Information Exchange November 1-2, Utah Department of Health

Tool for Pandemic Influenza Risk Assessment (TIPRA) Presented by Gina Samaan Global Influenza Programme

Crystal Structure of the Subtilisin Carlsberg: OMTKY3 Complex

Bioinformatics Laboratory Exercise

PHARMACOPHORE MODELING

Development of Real Time RT-PCR Assays for Neuraminidase Subtyping of Avian Influenza Virus

Supplementary Fig. 1: Ex vivo tetramer enrichment with anti-c-myc beads

Annotation of Chimp Chunk 2-10 Jerome M Molleston 5/4/2009

Supplementary Figure 1. SC35M polymerase activity in the presence of Bat or SC35M NP encoded from the phw2000 rescue plasmid.

Supplementary Materials for

LIPOPREDICT: Bacterial lipoprotein prediction server

Cross-Reactivity to Field Isolates of Canine Influenza Virus by a Killed Canine Influenza Virus (H3N8, Iowa05) Vaccine

Project PRACE 1IP, WP7.4

!"#$%&' (#%) /&'(2+"( /&3&4,, ! " #$% - &'()!% *-sheet -(!-Helix - &'(&') +,(-. - &'()&+) /&%.(0&+(! - &'(1&2%( Basic amino acids

Palindromes drive the re-assortment in Influenza A

Recombinant Baculovirus Derived HIV-1 Virus-Like Particles Elicit Potent Neutralizing Antibody Responses

Section B. Comparative Genomics Analysis of 2013 H7N9 Influenza A Viruses. Objective

Phenylketonuria (PKU) Structure of Phenylalanine Hydroxylase. Biol 405 Molecular Medicine

Prospect of Universal Influenza Vaccine Designed by Broadly Neutralizing Monoclonal Antibodies (1) Yoshinobu Okuno

Plasmodium and host glutathione reductase: molecular function and biological process

Aquaporins. Fatemeh Khalili Elizabeth Villa VMD Developers: John Stone Dan Wright John Eargle

Supplementary Appendix

IN SILICO GENETIC VARIATION PATHOGENICITY ANALYSIS OF HEMAGGLUTININ, MATRIX 1, AND NON STRUCTURAL 1 PROTEIN OF HUMAN H5N1 INDONESIAN STRAIN

In Silico Sequence Analysis, Structure Prediction and Evaluation of Fatty Acid Synthase Protein (FAS) of Gallus gallus (Chicken) by Homology Modelling

Transcription:

Influenza Virus HA Subtype Numbering Conversion Tool and the Identification of Candidate Cross-Reactive Immune Epitopes Brian J. Reardon, Ph.D. J. Craig Venter Institute breardon@jcvi.org

Introduction: Cross-Subtype Comparison of Hemagglutinin Background: Interest in comparing amino acid substitutions across influenza A hemagglutinin (HA) subtypes. Amino acid equivalence across HA subtypes difficult to determine. Sequence alignment tools unreliable for cross-subtype comparison of HA residues. High sequence variability and differing sequence lengths. Algorithms do not use known structural information. Numbering scheme proposed by David Burke and Derek Smith defining amino acid equivalence across HA subtypes (PLoS ONE; 2014).

The Burke-Smith Alignment Method: Algorithm: 1. Collect representative HA sequences from 18 subtypes of influenza A 2. Trim signal peptide 3. Align sequences with known crystal structural based on structure similarity using PyMOL 4. Align remaining sequences to structure using FUGUE Group 2 Group 1 Alpha helix Beta strand Structurally variable region Used sequence and structural information utilized to infer amino acid equivalence

The HA Subtype Numbering Conversion Tool Goal at Influenza Research Database (IRD): Develop a tool incorporating Burke and Smith s amino acid numbering scheme to facilitate comparison of amino acid substitutions across different HA subtypes The HA Subtype Numbering Conversion tool: Developed by and freely available at the IRD webpage: fludb.org Allows IRD users to computationally convert between HA subtype coordinate systems.

Workflow for the IRD HA Numbering Algorithm 1 BLAST Align Map 2 Query sequence 3 Query sequence --BLAST against-- Burke-Smith reference sequences --Pairwise alignment-- BLAST hit Query sequence --mapped to-- Burke-Smith reference sequence Query H1 Closest BLAST hit - 1-2 - 3 1 4 2 5 H1 query position 1 maps to position 4 of BLAST hit (Burke-Smith sequence) Query H1 Closest BLAST hit H3 - - 1 - - 2 - - 3 - - 4 - - 5 - - 6 - - 7 - - 8 - - 9 - - 10-1 11-2 12-3 13 1 4 14 2 5 15 Position 1 of queried H1 sequence maps to position 14 in H3 subtype

HA Subtype Numbering Tool: Cross-Subtype Conservation of H1 Epitopes Objective: Use the HA subtype numbering tool in IRD to investigate sequence conservation of putative broadly cross-reactive H1 B-Cell Epitopes across 18 HA subtypes. General Approach: 1. Obtain H1 B-cell epitope sequences 2. Use HA subtype numbering tool to align H1 B-cell epitopes with the equivalent positions in each HA subtype. 3. Calculate H1 B-cell epitope conservation across HA subtypes

HA Subtype Numbering Conversion 93 H1 B-cell epitopes available in IRD Download sequence and coordinates in H1 reference strain Navigate to HA Subtype Numbering Conversion page in IRD 1 Input (paste) sequence for H1 reference strain 2 Select HA subtypes to identify coordinates of amino acid positions equivalent to the H1 reference strain. 3 Click

HA Subtype Numbering Conversion Result Query sequence alignment with the closest reference sequence and other HA subtype sequence Mapping between positions in epitope And HA sequences

Analyzing H1 B-Cell Epitope Conservation Across 18 HA Subtypes Analysis: Organize H1 B-cell epitope and HA subtype sequence data into a spreadsheet so equivalent amino acids are located in the same column Count number of matching amino acids in each HA subtype sequence matching epitope E.g., Conservation of 15 amino acid H1 epitope, SF355 in H2 subtype Amino Acid Position 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 H1 Epitope: SF355 E E A K L N R E E I D G V K L HA subtype: H2 E E S K L N R N E I K G V K L 12 out of 15 amino acids in SF355 match with the corresponding position in the H2 subtype sequence; calculate percent matching amino acids Percent amino acid identity (per epitope) # matches between epitope and HA subtype = = 12 epitope length 15 x 100 = 80% For every epitope, calculate percent identity in each HA subtype sequence

Results: Conservation of each Epitope Across All HA Subtypes H1 B-cell epitope percent amino acid identity in HA subtypes; averaged across all HA subtypes Hemagglutinin Signal peptide HA1 Fusion peptide HA2 Transmembrane Cytoplasmic Domain Domain Average Percent Amino Acid Identity 1 100 200 300 400 500 No obvious correlation between epitope conservation and secondary structure Epitopes in HA2 stem region highly conserved across HA subtypes; particularly near fusion peptide Lower overall conservation in HA1 head region, but several epitopes highly conserved across subtypes

Results: Average Epitope Conservation in Each HA Subtype For each HA subtype, average percent amino acid identity in all 93 H1 epitopes H1 Average Percent Identity (Standard Error) H5 H7 Phylogenetic Group 1 Phylogenetic Group 2 H1 B-cell Epitope conservation not equal in all HA subtypes H1 epitopes more highly conserved in phylogenetic group 1 (of which H1 is a member)

Conclusions: HA Subtype Numbering Conversion tool: Identification and comparison of functionally equivalent amino acids across HA subtypes straightforward using HA Subtype Numbering Conversion tool in IRD Analysis of H1 B-cell Epitope Conservation across HA subtypes H1 epitopes more conserved in HA subtypes in phylogenetic group 1. High epitope conservation across HA subtypes in stem region. Lower overall conservation in head region, but several epitopes highly conserved across subtypes Including epitope (amino acids 252-270) adjacent to sialic acid binding site Next Steps: Integrate this numbering conversion scheme into other existing tools in IRD Extend analysis to include HA sequence data from all strains

Acknowledgements: J. Craig Venter Institute Alexandra Lee Brian Aevermann Yun Zhang Brett Pickett Richard Scheuermann (PI) Northrop Grumman Sherry He University of Auckland Catherine Macken University of Cambridge David Burke Derek Smith Thank you To learn more about the Influenza Research Database, go to fludb.org or stop by our table in the Commonwealth Ballroom (Squires)