Project PRACE 1IP, WP7.4
|
|
- Candace Butler
- 5 years ago
- Views:
Transcription
1 Project PRACE 1IP, WP7.4 Plamenka Borovska, Veska Gancheva Computer Systems Department Technical University of Sofia
2 The Team is consists of 5 members: 2 Professors; 1 Assist. Professor; 2 Researchers; Coordinator: Prof. PhD Plamenka Borovska
3 Task1: Parallel computer simulation for investigating the interaction of influenza virus type A genome with the host genome. Task2: Parallel performance evaluation, scaling and profiling of software packages for genomic sequence processing.
4 Biological sequence processing is a key of information technology for molecular biology. As the number of DNA and protein sequences in databases increases, it is important to be able to create sequence alignments and searching for very large number of sequences. Bioinformatics algorithms used in processing wide range data require large computational resources. Comparison of the existing sequences: pairwise alignment; multiple alignment; phylogenetic reconstruction; recombination analysis.
5 Major goal: Find the similarity between two (or more) DNA-sequences by finding a good alignment between them. DNA-sequence-1 tcctctgcctctgccatcat---caaccccaaagt tcctgtgcatctgcaatcatgggcaaccccaaagt DNA-sequence-2 Alignment
6 The Basic Local Alignment Search Tool is by far the most widely used to look at sequence alignments and similarities. BLAST searches to find sequences in the database that are similar to the query sequence. BLAST efficiently calculates local pairwise alignments between sequences. BLAST utilize heuristics approach for increasing the performance of the alignment searching.
7 Multiple alignment is an extension of pairwise alignment to incorporate more than two sequences into an alignment. Multiple alignment methods try to align all of the sequences in a specified set. The most popular multiple alignment tool is CLUSTAL. Multiple sequence alignment is computationally difficult and is classified as an NP-Hard problem.
8 ClustalW proceeds in three stages: pairwise alignment (PA), guide tree (GT) and multiple alignment (MA). Pairwise alignment computes the optimal alignment cost for each sequences pairs and a distance matrix is built up. An evolutionary tree is computed from the distance matrix and is used as guide tree. The algorithm aligns the sequences progressively according to the branching order in the guide tree.
9 Thousands of influenza viral genome sequences originating from different viral isolates are accumulated in GeneBank. The huge amount of biological sequences requires efficient parallel tools and powerful computers for structural genomic and functional analysis. The great challenges are: to optimize parallel computational models; to optimize parallel program implementations; to evaluate the scalability of parallel software.
10 To achieve the aim the following specific subtasks have been solved: Local database of all influenza A viral genomes sequences Local database of common influenza A virus hosts Consensus motifs and variable domains Homology between viral RNAs and host genome Identification of recombination hot-spots in the influenza virus
11 The following methods have been applied. DNA and RNA databases have been screened by local searching algorithm based on mpiblast. Consensus and variable domains in influenza A viral genomes have been searched by comparative analysis of influenza RNA using multiple sequence alignment method ClustalW; Recombination sites have been identified and phylogenetic tree have been studied by the RAT (Recombination Analysis Tool) and Phylip respectively.
12 To achieve the aim the following specific subtasks have been solved: Local database of 35 types nucleotide mosaic virus sequences Local database of 35 types protein nucleotide mosaic virus sequences local alignment and homology searching of all mosaic virus isolates Parallel performance evaluation and scaling of software mpiblast and ScalaBlast in various modes: blastn, blastp, blastx, tblastn, tblastx, searching of consensus domains in protein sequences, limited multiple alignment Parallel performance evaluation and scaling of multiple sequence alignment method ClustalW
13 The main goal of this research is to investigate the scalability of sequence alignment software on supercomputer BlueGene/P. mpiblast for sequence alignment and searching Clistalw for multiple sequence alignment The case study is comparison of all available influenza virus A aimed to finding the consensus and variable domains and searching the similarity of consensus motifs of influenza virus A in the human genome.
14 Parallel computational models; Parallel program implementations; Experimental datasets; Supercomputer BlueGene/P. Parallel computational models Experimental datasets Parallel program implementations BlueGene/P
15 Bulgarian IBM Blue Gene/P supercomputer two racks, 2048 PowerPC 450 computing nodes, 8192 cores; 4 TB random access memory; maximum LINPACK performance Rmax= Tflops; MPICH2 distribution of the MPI standard; Operation System - SuSE Linux Enterprise Server 10.
16 In order to achieve the aims, next software is installed, tested, verified and experimentally studied: mpiblast for local alignment and homology searching; Scalablast for local alignment and homology searching; ClustalW for multiple sequence alignment; Phylip for phylogenetic studies; RAT (Recombination Analysis Tool) for investigation and visualization of the "hot" spots of recombination; Scalasca for performance analysis; Cube 3.1 for performance parameters visualization.
17 A mirror local database in working format of the existing database installed on the supercomputer BlueGene/P, allows online updating of data. Nucleotide and protein sequences obtained from different isolates of 35 types mosaic virus (tomato, potato, tobacco, celery, etc.); Nucleotide and protein sequences obtained from different isolates of influenza virus - all the available isolates of the 8 segments of the influenza virus A (all subtypes); Complete human genome extracted from GenBank.
18 All existing sequences of influenza virus obtained from various isolates have been segmented based on specified criteria such as subtype, segment, host, and region. Influenza Virus Type Host Segment Subtype Type A All Hosts PB2 All Subtypes Type B Human PB1 H1N1 Type C Avian PA H3N8 Swine HA H7N7 Horse NP NA MP NS
19 Different data sets of influenza virus nucleotide sequences have been combined based on specified criteria such as subtype, segment, host, region, into a batched virtual query in order to search for similarities with the human genome. This allows: comparing a large set of sequences against a sequence database simultaneously by sending batched virtual query; reducing the execution time, respectively improving the performance.
20 Dividing up the database (human genome) into multiple fragments 64 small segments of approximately equal size stored in the shared memory. The searching of a fragment is independent of any other fragment.
21 The master process: distributes the virtual query and part of the database to the worker processes; receives the results of the comparison; sends new parts of the database for the next comparison to the worker processes; integrates the results obtained from them. Each worker process: compares the virtual query with a segment of the database; sends the results of the comparison to the master process.
22 The model is based on: data parallelism using the message passing programming model Master process Termination Termination Send results Task distribution Send results Termination Task distribution Send results Task distribution Slave process Slave process Slave process
23 The model is based on: phase based parallel algorithmic paradigm PROCESS PROCESS PROCESS PROCESS PAIRWISE ALIGNMENT INTERACTION PHASE PROCESS PROCESS PROCESS PROCESS GUIDE TREE INTERACTION PHASE PROCESS PROCESS PROCESS PROCESS MULTIPLE ALIGNMENT INTERACTION PHASE
24 Local alignment and homology searching of all nucleotide sequences of mosaic virus using mpiblast
25 Local alignment and homology searching of protein sequences of mosaic virus using mpiblast
26 Local alignment and homology searching of protein sequences of mosaic virus using scalablast
27 Local alignment and homology searching of nucleotide sequences of mosaic virus using scalablast
28 Database of nucleotide sequences: human_genomic Query of protein sequence: alpha_1_interferon_[homo_sapiens] Query of nucleotide sequence: alcoholgene Using formula is: Speedup = T (1)/T (p) Number of cores Scalability of the parallel system in respect of the speedup for various sizes of queries and numbers of cores
29 Execution time in the case of human and avian influenza virus A/H1N1 nucleotide segment 4 (HA) used for searching in the human genome in respect to various virtual batched queries size and numbers of cores (4, 128, 256 and 512). Time [min] Time [min] Batch queries Batch queries
30 Database of nucleotide sequences: human_genomic 3.4 billion bases Query of nucleotide sequences: human and avian influenza virus A/H1N1 segment 4 (HA) query sequence length bases batch query size varies from 60 to 6729 sequences Scalability of the parallel system in respect to the machine size and batch query size
31 The results show that the batch size impacts the execution time, because more sequences need to be searched. The parallel system shows good scalability in respect to both the number of cores size and the workload size.
32 128 experiments comprise nucleotide sequences of all the available isolates of the 8 segments of viruses A (avian, human, swine, horse) Scalability of the parallel system in respect to the machine size and number of sequences
33 Increasing the number of cores results in accelerating computations is 26.84% for the case of 1024 cores and 34.9% for the case of 2048 cores respectively Parallel execution time decrease for the cases of 1024 and 2048 cores vs. 512 cores
34 process rank 0 sends messages (function MPI_send())
35 slave processes receive massages function MPI_Recv()).
36 well balanced except for the process rank 0 that performs synchronization, communication and data sending to all other processors
37 computational imbalance of the parallel system is 0 i.e. the workload is evenly distributed among the cores
38 The recombination sites of all existing nucleotide sequences of influenza virus, after multiple alignment by ClustalW algorithm, have been investigated and visualized using the software package Recombination Analysis Tool. Recombination in the case of all Hosts Influenza Virus A/H1N1 segment 2
39 The consensus motifs and variable domains of Influenza virus A have been determined and output by utilizing the biological sequence alignment editor UGENE UniPro.
40 Representation of circular phylogenetic trees by PHYLIP software package
41 The WP7.4 team results can be summarized as follows: Parallel computational model for biological sequence processing have been suggested; Parallel software for in-silico molecular biology experiments have been installed, configured and verified experimentally. Mirror local database is developed and installed on the supercomputer BlueGene/P: all the available isolates of the 8 segments of the influenza virus A (all subtypes); all the available isolates of 35 types of mosaic virus; complete human genome;
42 Performance parameters such as execution time, speedup, and scalability of sequence alignment have been estimated experimentally; The performance estimation and analyses show that the parallel system has good scalability and is well balanced both in respect to the workload and machine size; The case study is investigating the influenza virus and mosaic virus variability, including: finding out consensus motifs and variable domains in the different segments of influenza virus A and mosaic virus; searching for homology with human genome; identification of recombination hot-spots; phylogenetic studies of the influenza virus genome.
43 The WP7.4 results on Task 1 and Task 2 are published in 6 papers in Proceedings of International Conferences: IEEE - International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications WSEAS World Scientific and Engineering Academy and Society; ACM - International Conference CompSysTech; IADIS - International Conference Applied Computing; Computer Science - organized by Computer System Department, Technical University Sofia.
44 P. Borovska, O.Nakov, V. Gancheva, I. Georgiev, Parallel Genome Sequence Searching on Supercomputer BlueGene/P, European Conference of Computer Science (ECCS'10), 30 November 2 December 2010, Puerto De La Cruz, Tenerife. P. Borovska, V. Gancheva, St. Markov, Parallel Performance Evaluation of Sequence Nucleotide Alignment on the Supercomputer BlueGene/P, 5th EUROPEAN COMPUTING CONFERENCE (ECC '11), April 2011, Paris, France. P. Borovska, V. Gancheva, G. Dimitrov, K. Chintov, S. Gurov, Parallel Performance Evaluation of Multithreaded Local Sequence Alignment, International Conference CompSysTech, June 2011, Vienna, Austria. P. Borovska, I. Georgiev, Influenza Virus Investigation Using Visualization Methods and Tools, International Conference Computer Science, 1-3 September 2011, Ohrid, Macedonia. P. Borovska, V. Gancheva, St. Markov, I. Georgiev, E. Assenov, Parallel Performance and Profiling of Multiple Sequence Nucleotide Alignment on the Supercomputer Blue Gene/P, The 6 th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, September 2011, Prague, Czech Republic. P. Borovska, O.Nakov, V. Gancheva, I. Georgiev, St. Markov, In Silico Biological Experiments for Investigating the Pandemic Influenza Virus A Variability on the Supercomputer BlueGene/P, IADIS International Conference Applied Computing 2011, 6-8 November 2011, Rio de Janeiro, Brazil.
45
Origins and evolutionary genomics of the novel avian-origin H7N9 influenza A virus in China: Early findings
Origins and evolutionary genomics of the novel 2013 avian-origin H7N9 influenza A virus in : Early findings Jiankui He*, Luwen Ning, Yin Tong Department of Biology, South University of Science and Technology
More informationFollowing virus recombination and evolution
Following virus recombination and evolution Coping with the avalanche of sequencing data Dmitry Kusnetsov, Anne Gleizes, Robin Liechti, Ioannis Xenarios, Philippe Le Mercier Vital-IT/Swiss-Prot group SIB
More informationPhylogenetic Methods
Phylogenetic Methods Multiple Sequence lignment Pairwise distance matrix lustering algorithms: NJ, UPM - guide trees Phylogenetic trees Nucleotide vs. amino acid sequences for phylogenies ) Nucleotides:
More informationa. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences.
Section D. Custom sequence annotation After this exercise you should be able to use the annotation pipelines provided by the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) to annotate
More informationEvolution of influenza
Evolution of influenza Today: 1. Global health impact of flu - why should we care? 2. - what are the components of the virus and how do they change? 3. Where does influenza come from? - are there animal
More informationStudy the Evolution of the Avian Influenza Virus
Designing an Algorithm to Study the Evolution of the Avian Influenza Virus Arti Khana Mentor: Takis Benos Rachel Brower-Sinning Department of Computational Biology University of Pittsburgh Overview Introduction
More informationDIGITIZING HUMAN BRAIN: BLUE BRAIN PROJECT
DIGITIZING HUMAN BRAIN: BLUE BRAIN PROJECT Diwijesh 1, Ms. Pooja Khanna 2 1 M.tech CSE, Amity University 2 Associate Professor, ASET ABSTRACT Human brain is most complex and unique creation of nature which
More informationMultiple sequence alignment
Multiple sequence alignment Bas. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, February 18 th 2016 Protein alignments We have seen how to create a pairwise alignment of two sequences
More informationCONSTRUCTION OF PHYLOGENETIC TREE USING NEIGHBOR JOINING ALGORITHMS TO IDENTIFY THE HOST AND THE SPREADING OF SARS EPIDEMIC
CONSTRUCTION OF PHYLOGENETIC TREE USING NEIGHBOR JOINING ALGORITHMS TO IDENTIFY THE HOST AND THE SPREADING OF SARS EPIDEMIC 1 MOHAMMAD ISA IRAWAN, 2 SITI AMIROCH 1 Institut Teknologi Sepuluh Nopember (ITS)
More informationModeling the Antigenic Evolution of Influenza Viruses from Sequences
Modeling the Antigenic Evolution of Influenza Viruses from Sequences Taijiao Jiang Center of Systems Medicine, Chinese Academy of Medical Sciences Suzhou Institute of Systems Medicine October 8-10, 2015.
More informationExploring HIV Evolution: An Opportunity for Research Sam Donovan and Anton E. Weisstein
Microbes Count! 137 Video IV: Reading the Code of Life Human Immunodeficiency Virus (HIV), like other retroviruses, has a much higher mutation rate than is typically found in organisms that do not go through
More informationEVOLUTIONARY TRAJECTORY ANALYSIS: RECENT ENHANCEMENTS. R. Burke Squires
EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT ENHANCEMENTS R. Burke Squires Pandemic H1N1 2009 Origin? April / May 2009 Cases of an Influenza-like Illness (ILI) occurred in California, Texas and Mexico New
More informationThe BLAST search on NCBI ( and GISAID
Supplemental materials and methods The BLAST search on NCBI (http:// www.ncbi.nlm.nih.gov) and GISAID (http://www.platform.gisaid.org) showed that hemagglutinin (HA) gene of North American H5N1, H5N2 and
More informationExistence of reassortant A (H1N2) swine influenza viruses in Saitama Prefecture, Japan
International Congress Series 1263 (2004) 749 753 Existence of reassortant A (H1N2) swine influenza viruses in Saitama Prefecture, Japan Shin ichi Shimada a, *, Takayasu Ohtsuka b, Masayuki Tanaka b, Munehito
More informationSynthetic Genomics and Its Application to Viral Infectious Diseases. Timothy Stockwell (JCVI) David Wentworth (JCVI)
Synthetic Genomics and Its Application to Viral Infectious Diseases Timothy Stockwell (JCVI) David Wentworth (JCVI) Outline Using informatics to predict drift (strain selection) Synthetic Genomics: Preparedness
More informationAppendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations
From OPENFLU to OPENFMD a resource for automatic and curated nomenclature and tools for the FMD community Filip Claes, Philippe Le Mercier, Dmitry Kuznetsov, Robin Liechti, Anne Gleizes, Ioannis Xenarios,
More informationPROTOCOL FOR INFLUENZA A VIRUS GLOBAL SWINE H1 CLADE CLASSIFICATION
PROTOCOL FOR INFLUENZA A VIRUS GLOBAL SWINE H1 CLADE CLASSIFICATION January 23, 2017 1. Background Swine H1 viruses have diversified into three major genetic lineages over time. Recently, Anderson et al.
More informationEvolutionary distances between proteins of the Influenza A Virus
Evolutionary distances between proteins of the Influenza A Virus Hatem Nassrat B00393388 Bioinformatics: Winter 06/07 Dr. Christian Blouin Table of Contents Evolutionary distances between proteins of the
More informationIn April 2009, a new strain of
managing and reducing Uncertainty in an Emerging Influenza Pandemic 2. Brundage JF, Shanks GD. Deaths from bacterial pneumonia during 1918 19 influenza pandemic. Emerg Infect Dis 2008;14:1193-9. 3. Update:
More informationComputational Analysis and Visualization of the Evolution of Influenza Virus
Computational Analysis and Visualization of the Evolution of Influenza Virus A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Ham Ching, Lam IN PARTIAL FULFILLMENT
More informationIntroduction to Bioinformatics
Introduction to Bioinformatics Sirkka-Liisa Varvio sirkka-liisa.varvio@helsinki.fi Autumn 2009, I period www.cs.helsinki.fi/mbi/courses/09-10/itb 582606 Introduction to Bioinformatics, Autumn 2009 8. Sept
More informationVIP: an integrated pipeline for metagenomics of virus
VIP: an integrated pipeline for metagenomics of virus identification and discovery Yang Li 1, Hao Wang 2, Kai Nie 1, Chen Zhang 1, Yi Zhang 1, Ji Wang 1, Peihua Niu 1 and Xuejun Ma 1 * 1. Key Laboratory
More informationSection B. Comparative Genomics Analysis of Influenza H5N2 Viruses. Objective
Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses Objective Upon completion of this exercise, you will be able to use the Influenza Research Database (IRD; http://www.fludb.org/) to: Search
More informationRelative activity (%) SC35M
a 125 Bat (H17N) b 125 A/WSN (H1N1) Relative activity (%) 0 75 50 25 Relative activity (%) 0 75 50 25 0 Pos. Neg. PA PB1 Pos. Neg. NP PA PB1 PB2 0 Pos. Neg. NP PA PB1 PB2 SC35M Bat Supplementary Figure
More informationModule 3. Genomic data and annotations in public databases Exercises Custom sequence annotation
Module 3. Genomic data and annotations in public databases Exercises Custom sequence annotation Objectives Upon completion of this exercise, you will be able to use the annotation pipelines provided by
More informationModern Technology of Internet
Modern Technology of Internet Jiří Navrátil, Josef Vojtěch, Jan Furman, Tomáš Košnar, Sven Ubik, Milan Šárek, Jan Růžička, Martin Pustka, Laban Mwansa, Rudolf Blažek Katedra počítačových systémů FIT České
More informationReassortment of influenza A virus genes linked to PB1 polymerase gene
International Congress Series 1263 (2004) 714 718 Reassortment of influenza A virus genes linked to PB1 polymerase gene Jean C. Downie* www.ics-elsevier.com Centre for Infectious Diseases and Microbiology,
More informationHost Dependent Evolutionary Patterns and the Origin of 2009 H1N1 Pandemic Influenza
Host Dependent Evolutionary Patterns and the Origin of 2009 H1N1 Pandemic Influenza The origin of H1N1pdm constitutes an unresolved mystery, as its most recently observed ancestors were isolated in pigs
More informationName: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise!
Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! 1. What process brought 2 divergent chlorophylls into the ancestor of the cyanobacteria,
More informationPrinciples of phylogenetic analysis
Principles of phylogenetic analysis Arne Holst-Jensen, NVI, Norway. Fusarium course, Ås, Norway, June 22 nd 2008 Distance based methods Compare C OTUs and characters X A + D = Pairwise: A and B; X characters
More informationRNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang
RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang Table of Contents Overview RSpredict JAVA RSpredict WebServer RNAstructure
More informationChapter 19: The Genetics of Viruses and Bacteria
Chapter 19: The Genetics of Viruses and Bacteria What is Microbiology? Microbiology is the science that studies microorganisms = living things that are too small to be seen with the naked eye Microorganisms
More informationBjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012
www.iedb.org Bjoern Peters bpeters@liai.org La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 Overview 1. Introduction to the IEDB 2. Application: 2009 Swine-origin influenza virus
More informationCristina Cassetti, Ph.D.
NIAID Extramural Research Update: Recombinant Influenza Viruses and Biosafety Cristina Cassetti, Ph.D. Influenza Program Officer Division of Microbiology and Infectious Diseases NIAID Influenza virus DMID
More informationERA: Architectures for Inference
ERA: Architectures for Inference Dan Hammerstrom Electrical And Computer Engineering 7/28/09 1 Intelligent Computing In spite of the transistor bounty of Moore s law, there is a large class of problems
More informationHands-On Ten The BRCA1 Gene and Protein
Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such
More informationThe Influenza Virus Resource at the National Center for Biotechnology Information. Running Title: NCBI Influenza Virus Resource ACCEPTED
JVI Accepts, published online ahead of print on 17 October 2007 J. Virol. doi:10.1128/jvi.02005-07 Copyright 2007, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved.
More informationCloud Computing. Hwajung Lee. Key Reference: Prof. Jong-Moon Chung s Lecture Notes at Yonsei University
Cloud Computing Hwajung Lee Key Reference: Prof. Jong-Moon Chung s Lecture Notes at Yonsei University Cloud Computing Cloud Introduction Cloud Service Model Big Data Hadoop MapReduce HDFS (Hadoop Distributed
More informationGene Finding in Eukaryotes
Gene Finding in Eukaryotes Jan-Jaap Wesselink jjwesselink@cnio.es Computational and Structural Biology Group, Centro Nacional de Investigaciones Oncológicas Madrid, April 2008 Jan-Jaap Wesselink jjwesselink@cnio.es
More informationOFFLU Technical Meeting Coordinating world-wide surveillance for influenza in swine. OIE headquarters, Paris, France March 27-28, 2012
OFFLU Technical Meeting Coordinating world-wide surveillance for influenza in swine OIE headquarters, Paris, France March 27-28, 2012 Brazil update Janice Reis Ciacci Zanella Brazilian Agricultural Research
More informationSMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination
Final Examination You have a choice between A, B, or C. Please email your solutions, as a pdf attachment, by May 13, 2015. In the subject of the email, please use the following format: firstname_lastname_x
More informationHBV. Next Generation Sequencing, data analysis and reporting. Presenter Leen-Jan van Doorn
HBV Next Generation Sequencing, data analysis and reporting Presenter Leen-Jan van Doorn HBV Forum 3 October 24 th, 2017 Marriott Marquis, Washington DC www.forumresearch.org HBV Biomarkers HBV biomarkers:
More informationComparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationBioinformation by Biomedical Informatics Publishing Group
Predicted RNA secondary structures for the conserved regions in dengue virus Pallavi Somvanshi*, Prahlad Kishore Seth Bioinformatics Centre, Biotech Park, Sector G, Jankipuram, Lucknow 226021, Uttar Pradesh,
More informationLecture 19 Evolution and human health
Lecture 19 Evolution and human health The evolution of flu viruses The evolution of flu viruses Google Flu Trends data US data Check out: http://www.google.org/flutrends/ The evolution of flu viruses the
More informationApplication of the Mole-8.5 supercomputer: Probing the whole influenza virion at the atomic level
Article Molecular Biology July 2011 Vol.56 No.20: 2114 2118 doi: 10.1007/s11434-011-4543-7 SPECIAL TOPICS: Application of the Mole-8.5 supercomputer: Probing the whole influenza virion at the atomic level
More informationParametric Inference using Persistence Diagrams: A Case Study in Population Genetics
: A Case Study in Population Genetics Kevin Emmett Daniel Rosenbloom Pablo Camara University of Barcelona, Barcelona, Spain. Raul Rabadan KJE2109@COLUMBIA.EDU DSR2131@COLUMBIA.EDU PABLO.G.CAMARA@GMAIL.COM
More informationStudent Handout Bioinformatics
Student Handout Bioinformatics Introduction HIV-1 mutates very rapidly. Because of its high mutation rate, the virus will continue to change (evolve) after a person is infected. Thus, within an infected
More informationIntraseasonal Dynamics and Dominant Sequences in H3N2 Influenza
Intraseasonal Dynamics and Dominant Sequences in H3N2 Influenza Nicole Creanza 1., Jason S. Schwarz 1. *, Joel E. Cohen 1,2 1 Laboratory of Populations, Rockefeller University, New York, New York, United
More informationPatricia Fitzgerald-Bocarsly
FLU Patricia Fitzgerald-Bocarsly October 23, 2008 Orthomyxoviruses Orthomyxo virus (ortho = true or correct ) Negative-sense RNA virus (complementary to mrna) Five different genera Influenza A, B, C Thogotovirus
More informationSupplementary Figure 1. SC35M polymerase activity in the presence of Bat or SC35M NP encoded from the phw2000 rescue plasmid.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 Supplementary Figure 1. SC35M polymerase activity in the presence of Bat or SC35M NP encoded from the phw2000 rescue plasmid. HEK293T
More informationClass 34: Computing with Life (and the Chicken Flu)
Class 34: Computing with Life (and the Chicken Flu) CS150: Computer Science University of Virginia Computer Science David Evans http://www.cs.virginia.edu/evans DNA Sequence of nucleotides: adenine (A),
More informationPatterns of hemagglutinin evolution and the epidemiology of influenza
2 8 US Annual Mortality Rate All causes Infectious Disease Patterns of hemagglutinin evolution and the epidemiology of influenza DIMACS Working Group on Genetics and Evolution of Pathogens, 25 Nov 3 Deaths
More informationPhylogenetic Tree Practical Problems
Phylogenetic Tree Practical Problems Software Tools: MEGA A software package for constructing phylogenetic trees using neighbor-joining, UPGMA, and maximum parsimony. ClustalW A tool for constructing multiple
More informationOFFLU the joint OIE-FAO network of expertise on animal influenza
OFFLU the joint OIE-FAO network of expertise on animal influenza FAO/OIE HPAI Laboratory Network Meeting for SE Asia Keith Hamilton OIE Scientific and Technical Department, OIE Paris OFFLU Objectives Share
More informationJ. A. Sands, 21 October 2013 Lehigh University
J. A. Sands, 21 October 2013 Lehigh University Cryptococcus, Candidiasis, Aspergillosis Tuberculosis Cholera Plague Bact. Meningitis Salmonella Listeria Leptospirosis Staph. (MRSA) E. coli Clostridium
More informationDevelopment of an Epileptic Seizure Detection Application based on Parallel Computing
Development of an Epileptic Seizure Detection Application based on Parallel Computing K. Sivasankari #1, Dr. K. Thanushkodi *2 #1 Department of ECE, Akshaya College of Engineering and Technology, Coimbatore
More informationThe Open Bioinformatics Journal, 2014, 8, 1-5 1
Send Orders for Reprints to reprints@benthamscience.net The Open Bioinformatics Journal, 214, 8, 1-5 1 Open Access Performances of Bioinformatics Pipelines for the Identification of Pathogens in Clinical
More informationAvian Influenza Virus H7N9. Dr. Di Liu Network Information Center Institute of Microbiology Chinese Academy of Sciences
Avian Influenza Virus H7N9 Dr. Di Liu Network Information Center Institute of Microbiology Chinese Academy of Sciences Avian Influenza Virus RNA virus, Orthomyxoviruses Influenza A virus Eight Gene segments
More informationA Bioinformatics Method for Identifying RNA Structures within Human Cells
Wright State University CORE Scholar Physics Seminars Physics 11-1-2013 A Bioinformatics Method for Identifying RNA Structures within Human Cells Stephen Donald Huff Follow this and additional works at:
More informationAnnotation of Drosophila mojavensis fosmid 8 Priya Srikanth Bio 434W
Annotation of Drosophila mojavensis fosmid 8 Priya Srikanth Bio 434W 5.1.2007 Overview High-quality finished sequence is much more useful for research once it is annotated. Annotation is a fundamental
More informationSebastian Jaenicke. trnascan-se. Improved detection of trna genes in genomic sequences
Sebastian Jaenicke trnascan-se Improved detection of trna genes in genomic sequences trnascan-se Improved detection of trna genes in genomic sequences 1/15 Overview 1. trnas 2. Existing approaches 3. trnascan-se
More informationVirusDetect pipeline - virus detection with small RNA sequencing
VirusDetect pipeline - virus detection with small RNA sequencing CSC webinar 16.1.2018 Eija Korpelainen, Kimmo Mattila, Maria Lehtivaara Big thanks to Jan Kreuze and Jari Valkonen! Outline Small interfering
More informationIdentifying, Preparing for & Reducing Pandemic Risk
Identifying, Preparing for & Reducing Pandemic Risk Jonna Mazet, DVM, MPVM, PhD Professor of Epidemiology & Disease Ecology One Health Institute School of Veterinary Medicine University of California,
More informationNanoparticulate Vaccine Design: The VesiVax System
Nanoparticulate Vaccine Design: The VesiVax System Gary Fujii, Ph.D. President and CEO Molecular Express, Inc. May 16, 2006 Orlando, Florida Influenza Each year up to 20% of the world's population contracts
More informationNermin Ashraf Shaltout
THE AMERICAN UNIVERSITY IN CAIRO SCHOOL OF SCIENCES AND ENGINEERING Improving Machine Learning Techniques for Influenza-A Classification A THESIS DEFENSE SUBMITTED TO DEPARTMENT OF COMPUTER SCIENCE AND
More informationAutoOrthoGen. Multiple Genome Alignment and Comparison
AutoOrthoGen Multiple Genome Alignment and Comparison Orion Buske, Yogesh Saletore, Kris Weber CSE 428 Spring 2009 Martin Tompa Computer Science and Engineering University of Washington, Seattle, WA Abstract
More information19 2 Viruses Slide 1 of 34
1 of 34 What Is a Virus? What Is a Virus? Viruses are particles of nucleic acid, protein, and in some cases, lipids. Viruses can reproduce only by infecting living cells. 2 of 34 What Is a Virus? Viruses
More informationExtracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods
Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods D. Weissenbacher 1, A. Sarker 2, T. Tahsin 1, G. Gonzalez 2 and M. Scotch 1
More informationIdentification of single de novo drug candidate for dengue and filaria on Aedes aegypti and Culex quinquefasciatus mosquitoes using insilico Protocols
Available online at www.ijntps.org ISSN: 2277 2782 INTERNATIONAL JOURNAL OF NOVEL TRENDS IN PHARMACEUTICAL SCIENCES RESEARCH ARTICLE Identification of single de novo drug candidate for dengue and filaria
More informationEvolution of hepatitis C virus in blood donors and their respective recipients
Journal of General Virology (2003), 84, 441 446 DOI 10.1099/vir.0.18642-0 Short Communication Correspondence Jean-François Cantaloube jfc-ets-ap@gulliver.fr Evolution of hepatitis C virus in blood donors
More informationAbstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction
Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant
More informationMicrobiological Societies (IUMS) Congress This conference took place in Montreal, CANADA during
(Abroad Domestic)Official trip report form(student) 2014/08/19 (Year/Month/Day) Name Laboratory Year (Grade) Destination Nipawit Karnbunchob Division of Bioinformatics DC2 Montreal, CANADA Period of trip
More informationBioinformatics Laboratory Exercise
Bioinformatics Laboratory Exercise Biology is in the midst of the genomics revolution, the application of robotic technology to generate huge amounts of molecular biology data. Genomics has led to an explosion
More informationREADME file for GRASTv1.0.pl
README file for GRASTv.0.pl Genome Reduction Analysing Software Tool (GRAST). Produced by Christina Toft and Mario A. Fares Date 03/04/06 Reference and more information: Toft, C and Fares, MA (2006). GRAST:
More information100 years of Influenza Pandemic and the prospects for new influenza vaccines
100 years of Influenza Pandemic and the prospects for new influenza vaccines Dr John McCauley Director, WHO Collaborating Centre for Reference and Research on influenza The Francis Crick Institute London
More informationMapping of the influenza A hemagglutinin serotypes evolution by the ISSCOR method
Regular paper Supplementary Materials pp 441 452 Vol. 61, 2014 on-line at: www.actabp.pl Mapping of the influenza A hemagglutinin serotypes evolution by the ISSCOR method Jan P. Radomski 1,4 *, Piotr P.
More informationFinal Report. Identification of Residue Mutations that Increase the Binding Affinity of LSTc to HA RBD. Wendy Fong ( 方美珍 ) CNIC; Beijing, China
Final Report Identification of Residue Mutations that Increase the Binding Affinity of LSTc to HA RBD Wendy Fong ( 方美珍 ) CNIC; Beijing, China Influenza s Viral Life Cycle 1. HA on virus binds to sialic
More informationMapping the Antigenic and Genetic Evolution of Influenza Virus
Mapping the Antigenic and Genetic Evolution of Influenza Virus Derek J. Smith, Alan S. Lapedes, Jan C. de Jong, Theo M. Bestebroer, Guus F. Rimmelzwaan, Albert D. M. E. Osterhaus, Ron A. M. Fouchier Science
More informationEvolutionary Programming
Evolutionary Programming Searching Problem Spaces William Power April 24, 2016 1 Evolutionary Programming Can we solve problems by mi:micing the evolutionary process? Evolutionary programming is a methodology
More informationSupplementary Figure 1. Prevalence of U539C and G540A nucleotide and E172K amino acid substitutions among H9N2 viruses. Full-length H9N2 NS
Supplementary Figure 1. Prevalence of U539C and G540A nucleotide and E172K amino acid substitutions among H9N2 viruses. Full-length H9N2 NS nucleotide sequences (a, b) or amino acid sequences (c) from
More informationBioinformation Volume 5
Identification of sequence mutations affecting hemagglutinin specificity to sialic acid receptor in influenza A virus subtypes Usman Sumo Friend Tambunan*, Ramdhan Department of Chemistry, Faculty of Mathematics
More informationBlue Brain - The Future Generation
Blue Brain - The Future Generation Siva Kumar Avula 1, Vedrucha S Pakale 2 and Sheetal V Kashid 3 1 Assistant Professor, Department of Computer Science & Engineering, Ashokrao Mane Group of Institutions
More informationRecognition of HIV-1 subtypes and antiretroviral drug resistance using weightless neural networks
Recognition of HIV-1 subtypes and antiretroviral drug resistance using weightless neural networks Caio R. Souza 1, Flavio F. Nobre 1, Priscila V.M. Lima 2, Robson M. Silva 2, Rodrigo M. Brindeiro 3, Felipe
More informationDe Novo Viral Quasispecies Assembly using Overlap Graphs
De Novo Viral Quasispecies Assembly using Overlap Graphs Alexander Schönhuth joint with Jasmijn Baaijens, Amal Zine El Aabidine, Eric Rivals Milano 18th of November 2016 Viral Quasispecies Assembly: HaploClique
More informationGENOME-WIDE DETECTION OF ALTERNATIVE SPLICING IN EXPRESSED SEQUENCES USING PARTIAL ORDER MULTIPLE SEQUENCE ALIGNMENT GRAPHS
GENOME-WIDE DETECTION OF ALTERNATIVE SPLICING IN EXPRESSED SEQUENCES USING PARTIAL ORDER MULTIPLE SEQUENCE ALIGNMENT GRAPHS C. GRASSO, B. MODREK, Y. XING, C. LEE Department of Chemistry and Biochemistry,
More informationInfluenza H3N2 Virus Variation Analysis
Influenza H3N2 Virus Variation Analysis Objective Upon completion of this exercise, you will be able to: search for virus sequences and view detailed information about these sequences in IRD, build a phylogenetic
More informationGenetic alterations of histone lysine methyltransferases and their significance in breast cancer
Genetic alterations of histone lysine methyltransferases and their significance in breast cancer Supplementary Materials and Methods Phylogenetic tree of the HMT superfamily The phylogeny outlined in the
More informationNature Medicine: doi: /nm.4322
1 2 3 4 5 6 7 8 9 10 11 Supplementary Figure 1. Predicted RNA structure of 3 UTR and sequence alignment of deleted nucleotides. (a) Predicted RNA secondary structure of ZIKV 3 UTR. The stem-loop structure
More informationFrom Mosquitos to Humans: Genetic evolution of Zika Virus
Article: From Mosquitos to Humans: Genetic evolution of Zika Virus Renata Pellegrino, PhD Director, Sequencing lab Center for Applied Genomics The Children s Hospital of Philadelphia Journal Club Clinical
More informationVirtual Screening of Potential Inhibitors of Anti-Influenza A H1N1
Virtual Screening of Potential Inhibitors of Anti-Influenza A H1N1 Praveen Kumar Guttula a *, Soumyakantapanda b a Department of Biotechnology and Medical Engineering, National Institute of Technology
More informationContribution of avian influenza data through OFFLU network
Dr Gounalan Pavade Chargé de Mission, OIE Headquarters Contribution of avian influenza data through OFFLU network Asia-Pacific Workshop on surveillance, prevention and control of zoonotic influenza Paro,
More informationGkikas Magiorkinis, 1 Dimitrios Paraskevis, 1 Anne-Mieke Vandamme, 2 Emmanouil Magiorkinis, 1 Vana Sypsa 1 and Angelos Hatzakis 1 INTRODUCTION
Journal of General Virology (2003), 84, 2715 2722 DOI 10.1099/vir.0.19180-0 In vivo characteristics of human immunodeficiency virus type 1 intersubtype recombination: determination of hot spots and correlation
More informationReliable Prediction of Viral RNA Structures
Reliable Prediction of Viral RNA Structures Winterseminar 2016, Bled Roman Ochsenreiter TBI Wien Bled, 15.2.2016 Roman Ochsenreiter TBI Wien Reliable Prediction of Viral RNA Structures Bled, 15.2.2016
More informationInfluenza hemagglutinin protein stability correlates with fitness and seasonal evolutionary dynamics
Influenza hemagglutinin protein stability correlates with fitness and seasonal evolutionary dynamics Eili Y. Klein, Ph.D. Assistant Professor, Department of Emergency Medicine & Fellow, Center for Disease
More informationChapter 1. Introduction
Chapter 1 Introduction 1.1 Motivation and Goals The increasing availability and decreasing cost of high-throughput (HT) technologies coupled with the availability of computational tools and data form a
More informationInfluenza Genome Sequencing Project Proposal
Date of Proposal (MM/DD/YY): 06/13/12 TITLE: Role of the Untranslated Regions of the Influenza A Virus Replication and Vaccines Purpose/Objective-please provide a brief one to two paragraph description
More informationI. H. Brown*, S. M. Reid, G. Simon, W. Loeffen, L. E. Larsen, P. Kellam, S.Watson, N. Lewis, S. M. Brookes, O.Pybus & ESNIP3 Consortium
Coordinated surveillance of influenza viruses in European pigs: Enhanced Virological and Epidemiological analysis from the European Surveillance Network for Influenza in Pigs (ESNIP3) I. H. Brown*, S.
More informationAPHL Next Generation Sequencing (NGS) Survey
Next Generation Sequencing 1. How long has your lab had a sequencer? [If lab does not have a sequencer go to 1a1 through 1a3 and then end survey] [If lab does have a sequencer continue to 1a and the rest
More informationCS 312: Algorithms Analysis. Gene Sequence Alignment. Overview: Objectives: Code:
CS 312: Algorithms Analysis Gene Sequence Alignment Overview: In this project, you will implement dynamic programming algorithms for computing the minimal cost of aligning gene sequences and for extracting
More information