CONSTRUCTION OF PHYLOGENETIC TREE USING NEIGHBOR JOINING ALGORITHMS TO IDENTIFY THE HOST AND THE SPREADING OF SARS EPIDEMIC

Similar documents
Phylogenetic Methods

Molecular Evolution of the SARS Coronavirus During the Course of the SARS Epidemic in China

Study the Evolution of the Avian Influenza Virus

Evolution of influenza

The BLAST search on NCBI ( and GISAID

Principles of phylogenetic analysis

CS 312: Algorithms Analysis. Gene Sequence Alignment. Overview: Objectives: Code:

Emerging Diseases. Biosciences in the 21 st Century Dr. Amber Rice October 26, 2012

Exploring HIV Evolution: An Opportunity for Research Sam Donovan and Anton E. Weisstein

Antigenic Epitope Prediction towards SARS Corona Virus using Bioinformatics Analysis *1

Linfa Wang CSIRO Australian Animal Health Laboratory (AAHL) 5740 papers. in 12 minutes?!

Multiple sequence alignment

Project PRACE 1IP, WP7.4

Mapping the Antigenic and Genetic Evolution of Influenza Virus

MERS. G Blackburn DO, MACOI Clinical Professor of Medicine MSUCOM

Novel Coronavirus 2012

SEA/CD/154 Distribution : General. Avian Influenza in South-East Asia Region: Priority Areas for Research

An Evolutionary Story about HIV

FACT SHEET FOR ADDITIONAL INFORMATION CONTACT

Phylogenetic Tree Practical Problems

Contact Tracing in Health-care Information System - with SARS as a Case Study

بسم هللا الرحمن الرحيم

Distinguishing epidemiological dependent from treatment (resistance) dependent HIV mutations: Problem Statement

Trends of Pandemics in the 21 st Century

It is well known that some pathogenic microbes undergo

LESSON 4.4 WORKBOOK. How viruses make us sick: Viral Replication

Origins and evolutionary genomics of the novel avian-origin H7N9 influenza A virus in China: Early findings

virus infectious severe sample praise isolate spreads expert

Existence of reassortant A (H1N2) swine influenza viruses in Saitama Prefecture, Japan

EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT ENHANCEMENTS. R. Burke Squires

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network

Global updates on avian influenza and tools to assess pandemic potential. Julia Fitnzer Global Influenza Programme WHO Geneva

Update: Porcine epidemic diarrhoea and swine influenza. Eric Neumann DVM PhD Epi-Insight Limited Palmerston North, New Zealand

VIP: an integrated pipeline for metagenomics of virus

The mathematics of diseases

Early February Surveillance of severe atypical pneumonia in Hospital Authority in Hong Kong. Initiate contacts in Guangdong

INFLUENZA SURVEILLANCE

Cough, Cough, Sneeze, Wheeze: Update on Respiratory Disease

Pandemic Preparedness

What do epidemiologists expect with containment, mitigation, business-as-usual strategies for swine-origin human influenza A?

SARS Outbreaks in Ontario, Hong Kong and Singapore

Outbreaks: Identification, Investigation, Management

Chapter 19: The Genetics of Viruses and Bacteria

Lesson 20 Study Guide: Medical Biotechnology Pandemic Flu & Emergent Disease

Characterizing intra-host influenza virus populations to predict emergence

Viruses. Instructions fill in the blanks with the appropriate term to have the sentence make sense.

Rajesh Kannangai Phone: ; Fax: ; *Corresponding author

INFLUENZA-2 Avian Influenza

LEARNING FROM OUTBREAKS: SARS

De Novo Viral Quasispecies Assembly using Overlap Graphs

Going Nowhere Fast: Lentivirus genetic sequence evolution does not correlate with phenotypic evolution.

Ebola Virus. Emerging Diseases. Biosciences in the 21 st Century Dr. Amber Rice December 4, 2017

Mapping Evolutionary Pathways of HIV-1 Drug Resistance. Christopher Lee, UCLA Dept. of Chemistry & Biochemistry

TITLE: Influenza A (H7N9) virus evolution: Which genetic mutations are antigenically important?

The Open Bioinformatics Journal, 2014, 8, 1-5 1

Appendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations

Genome Sequencing and Analysis of a Porcine Delta Coronavirus from Eastern China

A Bioinformatics Approach to Analyzing Influenza Ray Hylock

aM (modules 1 and 10 are required)

Supplemental Information Dose Response Parameters for Gain of Function Pathogens

a-hV. Code assigned:

In April 2009, a new strain of

Background Describing epidemics Modelling epidemics Predicting epidemics Manipulating epidemics

Swine Acute Diarrhea Syndrome Coronavirus (SADS-CoV) in China

Mapping evolutionary pathways of HIV-1 drug resistance using conditional selection pressure. Christopher Lee, UCLA

7.014 Problem Set 7 Solutions

Introduction to Bioinformatics

Influenza: The Threat of a Pandemic

Influenza Virus HA Subtype Numbering Conversion Tool and the Identification of Candidate Cross-Reactive Immune Epitopes

Coordinating Global Surveillance for Influenza in Swine. OFFLU Swine Influenza Group

Finding protein sites where resistance has evolved

Influenza B viruses are not divided into subtypes, but can be further broken down into different strains.

Patterns of hemagglutinin evolution and the epidemiology of influenza

LESSON 4.5 WORKBOOK. How do viruses adapt Antigenic shift and drift and the flu pandemic

SISEA Surveillance and Investigation of Epidemic Situations in South-East Asiaww

ph1n1 H3N2: A Novel Influenza Virus Reassortment

OFFLU contribution to the WHO vaccine selection meeting

2.1 VIRUSES. 2.1 Learning Goals

Unit 2: Lesson 2 Case Studies: Influenza and HIV LESSON QUESTIONS

Influenza hemagglutinin protein stability correlates with fitness and seasonal evolutionary dynamics

Emergence and Fixing of Antiviral Resistance in Influenza A Via Recombination and Hitch Hiking. Henry L Niman

The mathematics of diseases

Evolution of hepatitis C virus in blood donors and their respective recipients

Emergence and Changes in swine and avian influenza viruses of Pandemic Potential

Structure & Function of Viruses

Maximum Likelihood ofevolutionary Trees is Hard p.1

Influenza: Ecology and Continuing Evolution

SEIQR-Network Model with Community Structure

Emerging infectious disease: trends in the literature on SARS and H7N9 influenza

Avian influenza Avian influenza ("bird flu") and the significance of its transmission to humans

Fifty-third session Johannesburg, South Africa, 1 5 September 2003

Molecular Characterization Confirms the Presence of a Divergent Strain of Canine Coronavirus (UWSMN-1) in Australia

OIE work in pathogen differentiation (ISA as example)

Coordinating Global Surveillance for Influenza in Swine. OFFLU Swine Influenza Virus Group

Modeling the Antigenic Evolution of Influenza Viruses from Sequences

Outbreak evaluation of highly pathogenic avian influenza in Bangladesh. Mymensingh *Corresponding author:

HS-LS4-4 Construct an explanation based on evidence for how natural selection leads to adaptation of populations.

aV (modules 1 and 9 are required)

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise!

Image of Ebola viruses exiting host cells HUMAN VIRUSES & THE LIMITATION OF ANTIVIRAL DRUG AGENTS

Transcription:

CONSTRUCTION OF PHYLOGENETIC TREE USING NEIGHBOR JOINING ALGORITHMS TO IDENTIFY THE HOST AND THE SPREADING OF SARS EPIDEMIC 1 MOHAMMAD ISA IRAWAN, 2 SITI AMIROCH 1 Institut Teknologi Sepuluh Nopember (ITS) Faculty of Mathematics and Natural Sciences Department of Mathematics 2 Darul Ulum Islamic University Faculty of Mathematics and Natural Sciences Department of Mathematics E-mail: 1 mii@its.ac.id, 2 amirast_117@yahoo.com ABSTRACT The epidemic of SARS is a dramatic example of how quickly the virus spread throughout the world. The pattern of spread of the SARS virus can be likened to a tree called a phylogenetic tree. All of SARS virus related to one another are connected to a network that is transmitted from one individual to another. The results of this branching process to determine the beginning of the epidemic as well as the host of the corona virus. To construct a phylogenetic tree takes inputs in the form of a distance matrix. In this study the distance matrix is derived from the alignments between sequence by using Super Pairwise Alignment (SPA). The output of the alignment is in the form of number of differences of sequences that determine in genetic distance. From the distance matrix, the genetic distance is converted to evolutionary distances using the Jukes Cantor model of which was established by phylogenetic tree by using the neighbor joining algorithm. From the results of the phylogenetic tree formation are known as host SARS Co-V is a palm civet with evolutionary distance of 0.1314. The epidemic began on December 16, 2002 in Guangzhou of South China and then spread to Zhongshan. Rallying point of the epidemic in Guangzhou Hospital and the Metropole hotel subsequently spread to Hanoi, Toronto, Singapore, Taiwan, and Hong Kong so that SARS to be international case. Furthermore, from the result of alignments of hosts and human SARS Co-V with SPA is known that the mutation is a mutation that occurs with type I protein replacement at position 77, 139, 147, 244, 344, 360, 472, 480, 487, 577, 609, 613, 665, 765, 778, and 1163. Keywords: SARS Coronavirus, Phylogenetic Tree, Neighbor Joining Algorithm, Super Pairwise Alignment. 1. INTRODUCTION Allegations of SARS (Severe Acute Respiratory Syndrome) which was originally caused by influenza virus because the symptoms are similar to the symptoms of people with influenza breakable because influenza viruses are not isolated from the patient. Finally, it was found that most of SARS patients were isolated by Corona virus when it is seen from its morphology. Virus of genome sequence is then analyzed and confirmed that the cause of SARS is Corona virus that has changed (mutation) of Corona viruses are around for this [9]. The pattern of spread of the SARS virus during the winter and spring of 2003 can be described as a tree. All of SARS virus related to one another is a branched network of relationships that are transmitted from one individual to another. The results of this process of branching of tree diagram are to determine the beginning of the epidemic and the host of the Corona virus. Along with technological advances in the technology of DNA sequences, it was concluded that diagram of the phylogenetic tree is a task that is associated with DNA analysis and is based on a comparison of the whole genome between different SARS virus. The diagram of this tree is known to 424

be able to be analyzed of comparative DNA sequence of an organism because it is always changing and mutating [1]. Bioinformatics as a science that studies the application of computational techniques to manage and to analyze biological information includes the application of the methods of mathematics, statistics, and informatics to solve biological problems, especially with the use of DNA sequences and amino acid as well as information related to it [12]. In this case, bioinformatics plays an important role in the process of reading of the Corona virus genome. Then it searches for sequence similarity (homology alignment) with other human Corona virus that is carried by chickens, birds, dogs, pigs, rats, cats and ferrets. Further results of analysis is known that the genome of Corona virus causes SARS Corona virus is different from the other, so that the virus is called a virus SARS (SARS Co-V). Furthermore it is formed phylogenetic tree to identify the host and to determine the spread of the epidemic. This study using the neighbor joining method with the input of distance matrix which is generated from protein sequence alignments of human SARS Co-V and SARS virus sequences from several animals suspected with Super Pairwise Alignment. Having obtained the phylogenetic tree, the virus in the closest branching to the human SARS Co-V will be analyzed for mutations that happened to him. 2. REVIEW OF RELATED LITERATURE 3. 2.1 Sequence Alignment Sequence alignment is an important method in the analysis of the position and type of mutation that is hidden in biological sequences and allow appropriate comparison is done. The most important thing in a sequence alignment is to determine the displacement of mutation [12]. Given two sequences and which are defined as follows:,,,,,,, Insertion of symbol "-" to the sequences and, and aims to establish two new sequences, namely, the sequences and. Furthermore, the elements of the sequences and that represent the DNA sequence having the range V 5 = (0,1,2,3,4) or (a, c, g, t,-). As for the protein sequence has the range V 21 = {A, R, N, D, C, Q, E, G, H, I, L, K, M, F, P, S, T, W, Y, V, - }. 2.2 Super Pairwise Alignment Super Pairwise Alignment or SPA predicts the existence of insertions or deletions and long 425 insertions that rely on sequence local similarity. 2.3 Model Jukes Cantor Model of Jukes Cantor is one of the simplest model of evolution of sequence that assumes that the four nucleotide has the same frequency, and that all substitutions are equally likely [5]. In this model the distance between two nucleotide sequences is given by: 1 Where p is the proportion of different nucleotides in the two sequences. As for the protein sequence, the model used is: 1 2.4 Neighbor-Joining Algorithm Neighbor-joining method is a distance-based method used to construct phylogenetic tree [10]. For the set of N sequences, computational steps are given as follows: 1. Determine the distance matrix of N sequences. 2. Assume a tree with all OTU (operational taxonomic units) in the matrix as branching from a central point, then form of the star-like pattern as in the schematic representation in Figure 2.2a. 3. For each OTU, compute S, where S is the sum of the distance (D) between the OTU one with other OTU, divided by (N-2), where N is the total number of OTU. 1 2 Picture. 2.2 a, b. Neighbor-joining. a. initial structure of star-like b. structure of tree-like after joining nodes 1 and 2. 4. Identify OTU pair with the minimum value: 5. Combine 2 taxa at a node in a sub-tree 6. Calculate the length of the branch: 2 2 7. Calculate a new distance matrix by connecting i and j and replace it with a node (x) which connect:

2 8. Repeat the above steps until N = 3. 3. ANALYSIS AND DISCUSSION 3.1 Results of Phylogenetic Tree Formation To Identify Host Before constructing phylogenetic tree, all of protein sequences are aligned first with the SPA method to determine the difference (genetic distance). Furthermore, genetic distances represented in the form of a distance matrix is first converted into a matrix of evolutionary distance with Jukes Cantor method. Evolutionary distance matrix is formed, to be input that is used for the process of the formation of the phylogenetic tree. Table 3.1 Matrix Of Evolutionary Distance Cycle 1 Figure 3.1 Tree on cycle 1 e. Step 5: Combine the new distance of all of taxa to U1 The result of the new distance U1 to all taxa for subsequent inclusion in a new evolutionary distance matrix. The next step is the same calculation as in cycle 1, the value is different for different N, the smallest Mij is different, the new distance to the end of each taxa is also different. In the same way as in cycle 1 was obtained tree for each cycle as follows: A B C D... L A 0 0,119 2,205 1,188 0,458 B 0,119 0 1,200 1,192 0,459 C 2,205 1,200 0 0,131 1,218 D 1,188 1,192 0,131 0 1,212 E 1,361 1,444 1,401 1,392 1,387 L 0,458 0,459 1,218 1,213 0 a. Step 1: Obtained results of the calculation of each Si is: S A = 1.08618 ; S B = 1.0002 ; S C = 1.3991 ; S D = 1.2942 ; S E = 1.42744 ; S F = 1.4156 ; S G = 1.4312 ; S H = 1.4889 ; S I = 0.95125 ; S J = 0.9172 ; S K = 0.9158 ; S L = 0.9252. b. Step 2: Wanted minimum value for each pair of sequences: M is the smallest pair obtained = -2.5620 c. Step 3: Define a new OTU, namely, U1 replaces the smallest pair (C and D). Furthermore, these taxa are combined as U1 following formula: 0.5 0.11815 0.5 0.01325 d. Step 4: Connect taxa U1 with C and U1 with D, each with an edge length or distance follow as the result of the calculation in step 3. Figure 3.2 Figure 3.3 Tree on cycle 2 Tree on cycle 3 Figure 3.4 Figure 3.5 Tree on cycle 4 Tree on cycle 5 Figure 3.6 Figure 3.7 Tree on cycle 6 Tree on cycle 7 Figure 3.8 Figure 3.9 Tree on cycle 8 Tree on cycle 9 426

Persejajaran Host Figure 3.10 Tree on cycle 10 And the last branching is U10 to L. After all branches combined, phylogenetic trees obtained the following: L U8 U10 A U7 U9 U6 I K B U5 J U1 Figure 3.11 Phylogenetic Tree Of SARS Co-V Host When depicted in the form of a Cladogram are as follows: Figure 3.12: Phylogenetic Tree Of Host SARS Co-V Results Of Manual Workmanship. C U4 H U3 D E U6 G F 0.052597 0.060066 0.079688 0.23035 0.039391 0.11815 0.58347 0.013242 0.40538 0.44231 0.026127 0.39313 0.26141 0.42717 0.074677 0.64015 0.10446 0.0028128 0.036249 0.0023369 0.051259 1 2 3 4 5 6 7 8 Murine HV 1 Murine HV 2 Human SARS CoV Palm Civet Canine Co-V Feline CoV4 Porcine PEDV Figure 3.13: Phylogenetic Tree Of Host SARS Co-V Are Simulated In Matlab. The phylogenetic tree in Figure 3.12 and 3.13 are explaining kinship between species to be estimated as the host of the Corona virus. Each leaf (A, B, C, D, E, F, G, H, I, J, K, L) in the tree is also called OTU (Operational Taxonomy Unit) which represents the aligned species. In the images appear closest branching is between OTU A with B, OTU C with D OTU, OTU E to F, and OUT J with K. At the picture 3.12 or picture 3.13, OTU C representing Human SARS Co-V are on the same branch with the OTU D, namely, Palm Civet. Human SARS Co-V is Corona virus that attack humans, while Palm Civet is a Himalayan palm civet. Both of evolutionary is distance of 0.1314. This figure can be seen from a distance summation of branching of Palm Civet to branch U1 (0.01325) with a distance of Human SARS Co-V to branch U1 (0.11815). By looking at the results of phylogenetic tree analysis turns SARS Co-V relationship of evolution is closest to Palm Civet. This indicates that Palm Civet is the host to be sought in this case. Based on the phylogenetic tree analysis above can be concluded that the evolutionary distance that is closest to the human SARS Co-V is the Palm Civet. This is because the protein sequences in Palm Civet when it is juxtaposed with the human protein sequences of SARS Co-V have been mutated as much as 0.1314 which indicates the smallest number when it is compared with the protein mutation that occurs in other host sequences. And the smallest evolutionary distance indicates the host is sought. In Figure 3.13 the simulated phylogenetic tree in matlab gives the same results with the results of manual calculation. It looks all leaf and branch was formed in the same position. The value of the IBV 3 Porcine HEV 3 Bovine CoV1 Bovine CoV2 Human CoV OC43 427

evolutionary distance between species also showed the same value as the manual calculation and computation, both of them used neighbor joining algorithm with Jukes Cantor correction model. 3.2 Analysis Of Results Of Human Alignment Sars Co-V With Palm Civet From the resulting phylogenetic tree, it is known that the host of the SARS epidemic is Palm Civet. The sequence alignment 3 which represents Human SARS Co-V (Corona viruses that infect humans) and sequences 4 representing Palm Civet (Corona virus that attacks the weasel), both of them have the same length of sequence is 1255aa. The difference of protein of both of is 16 with length of alignment 1255aa, this indicates the genetic distance of two strains of the virus by 0.0128 which is the quotient between different genetic with length of its alignment. In the alignment occurs mutation of type I (replacement of protein) and occurs in the following positions: a. position 77 mutation of protein G to D, b. position 139 mutation of protein A to V, c. position 147 mutation of protein Q into R, d. position 244 mutation of protein I to T, e. position 344 mutation of protein K into R, f. position 360 mutation of protein F to S, g. position 472 mutation of protein L to P, h. position 480 mutation of protein D to G, i. position 487 mutation of protein T to S, j. position 577 mutation of protein A to S, k. position 609 mutation of protein A to L, l. position 613 mutation of protein D to E, m. position 665 mutation of protein L to S, n. position 765 mutation of protein A to V, o. position 778 mutation of protein Y to D, p. position 1163 mutation of protein K into E. Alignment above when it is compared with the BLAST software will show the results as follows: Table 3.2: Comparison of alignment of Human SARS Co- V with Palm Civet Alignment The level of homologous Gap Genetic Distance SPA Global alignment 87,3 % 0 % 0.0127 BLAST Local Alignment 99 % 0 % 0.01 3.3 Results Of Phylogenetic Tree Establishment For Spreading Sars Epidemic Here is Phylogenetic tree with neighbor joining algorithm and Jukes Cantor model for the spread of the SARS epidemic of the simulation results from Matlab. 3.2717e-12 1.366e-08 5.0757e-08-8.4599e-08 Pensejajaran Epidemi 1.4814e-07 0.00013306 0.00053131 0.00026599 0.0013027 0.0002652 0.00013297 0.0002651 0.00029164 0.00013232-2.5924e-07-1.4814e-07 0.00053113-5.0757e-08-6.4829e-08 3.7108e-08 0.000531 0.00026547 0 0.00026547 0.00026547 0 2 4 6 8 10 12 Guangzhou 16/12/02 Palm Civet Zhongshan 26/12/02 Ghuangzhou 18/2/03 Ghuangzhou Hospital Metropole 21/02/03 Zhongshan 4/01/03 Hong Kong 19/03/03 Singapore 01/03/03 Taiwan 08/03/03 Hong Kong 15/05/03 Hanoi 26/02/03 Toronto 27/02/03 Ghuangzhou 24/01/03 Figure 3.16: Phylogenetic tree for the spread of SARS epidemic Phylogenetic trees above depict the story of an epidemic. Palm Civet is included in the alignment because it has been known as the host. Infection begins in Guangzhou (16/12/2002) and Zhongshan (12/26/2002), respectively it is abbreviated GZ and ZS. The epidemic began to spread in Guangzhou hospital (31/01/2003) as indeed all coronavirus-infected patients being treated there. Further it spreads to Zhongshan (01/04/2003), Guangzhou (18/02/2003), and the hotel Metropole (02/21/2003). Doctor Carlo Urbani is assigned to the Guangzhou hospital to treat SARS patients who stay at the hotel Metropole eventually contracting as well, and the other guests staying at the hotel is going home to their countries with the disease and Corona viruses on them. From there, the virus eventually spreads to Hanoi (02/26/2003), Toronto (02/27/2003), Singapore (03/01/2003), Taiwan (03/08/2003), Hong Kong (19/03/2003) and (15/05/2003) that the case of SARS to be an international outbreak. Phylogenetic tree was simulated in matlab and results of running software clustal W2 was different because in the neighbor joining algorithm used was without distance correction. The results shown are also different from the results of running matlab program due to some residual value has the smallest Mij more than one. Selection of different Mij values will also result the different phylogenetic tree. The value of the distances between sequences on the results of running software clustal w in some strains showed the 428

number 0, but the results of the matlab indicate a higher level of accuracy. 4. CONCLUSION From the discussions that have been done, the conclusions obtained are as follows : a. Based on the data processing of the protein sequences of various animals were suspected of hosts of SARS Corona virus and the DNA sequence of data of SARS to be infected patients were taken according to the location and date of the sample, can be proven that the phylogenetic tree constructed by using the neighbor joining algorithm with Jukes Cantor models used successfully to identify hosts and spread of the SARS epidemic. b. The application of alignment SPA (Super Pairwise Alignment) is successfully applied to align the sequences of human SARS Corona virus with another carried by animals. From the results of alignment of hosts and human SARS Co-V with SPA is known that the mutation that occurs is a mutation of type I with the protein replacement at positions of 77, 139, 147, 244, 344, 360, 472, 480, 487, 577, 609, 613, 665, 765, 778, and 1163. c. To construct the phylogenetic tree of the data sequence which is very high similarity is not recommended to use neighbor joining algorithm as it will be obtained several different tree (tree is not stable). REFRENCES: [1] Christianini, N., Hahn, M.W, 2006, Introduction to Computational Genomics A Case studies Approach, Cambridge University Press, New York. [2] Durbin R., Eddy S.R., Krogh A., Mitchigan G., 1998, Biological Sequence Probabilistic Models of Proteins and Nucleic Acids, Cambridge University Press. [3] Guan, et al., 2003, Isolation and Characterization of Viruses Related to the SARS Coronavirus from Animals in Shouthern China. Science 302, 276 (2003), The American Association for the Advancement of science, Washington, (www.sciencemag.org), accessed on 29 January 2014. [4] Isaev, A., 2006, Introduction to Mathematical Methods in Bioinformatics, Springer. [5] Lemey P, Salemi M, Vandamme M, 2009, The Phylogenetic Handbook: a Practical Approach to Phylogenetic Analysis and Hypothesis Testing. Cambridge University Press. [6] Marra, et al., 2003, The Genome Sequence of the SARS-Associated Coronavirus. Science 300, 1399 (2003), The American Association for the Advancement of science, Washington, (www.sciencemag.org), accessed on 29 January 2014. [7] Narshingh Deo, 1989, Graph Theory with Application to Engineering and Computer Science. Prentice Hallof India Private Limited, New Delhi. [8] Page and Holmes, 1998. Molecular Evolution: A Phylogenetic Approach, Blackwell Publishing Ltd. [9] Rota, et al., 2003, Characterization of a Novel Coronavirus Associated with Severe Acute Respiratory Syndrome. Science 300, 1394 (2003), The American Association for the Advancement of science, Washington, (www.sciencemag.org), accessed on 29 January 2014. [10] Saitou N, Nei M, The Neighbor-Joining Method: A New Method for Reconstructing Phylogenetic Trees. Mol. Biol. Evol. 4(4), 1987, PP. 406-425. [11] Severe Acute Respiratory Syndrome (SARS), Pubmed Health, (http://www.ncbi.nlm.nih.gov/ pubmedhealth/pmh0004460), accessed on 29 January 2014. [12] Shen, S.N., (2007), Theory and Mathematical Methods for Bioinformatics, Biological and Medical Physics, biomedical Engineering, Springer. 429