ALIGNMENTS AND COMPARATIVE PROFILESOF PICORNAVIRUS GENERA

Size: px
Start display at page:

Download "ALIGNMENTS AND COMPARATIVE PROFILESOF PICORNAVIRUS GENERA"

Transcription

1 In: Molecular Biology of Picornaviruses. Eds: B. Semler & E. Wimmer, ASM Press, pp (2002) ALIGNMENTS AND COMPARATIVE PROFILESOF PICORNAVIRUS GENERA Ann C. Palmenberg* and Jean-Yves Sgro Insti. for Molecular Virology, University of Wisconsin-Madison,, 1525 Linden Drive, Madison, WI Contact Information: Ann Palmenberg, Ph: , FAX: , Jean-Yves Sgro, Ph: , FAX: , *Corresponding Author (for galley proofs): A. Palmenberg, Institute for Molecular Virology, University of Wisconsin-Madison, 1525 Linden Drive, Madison, WI Background : Picornavirus sequence data have accumulated at an astonishing rate. Since the first 1060 bases at the 3 terminus of PV-1 Mahoney genome were published in 1980 (10), the polio database alone has grown to include at least 12 complete genomes and more than 112,000 bases from >36 strains. A minimum of 1 other complete picornavirus genomes also have been sequenced, with >90 additional strains available as significant sequence fragments (>3000 b), in a tally that does not even begin to include several hundred thousand additional bases from a multitude of smaller segments generated in epidemiological or evolutionary studies. A rough estimate of 2,000,000 bases, would not undervalue the current dataset. International sequence agencies like GenBank ( record the published picornavirus offerings under the broad sub-group classification of viruses. The annotated information may be searched, compared and retrieved with standard Internet tools. But if you don t know what you re looking for, or whether a favored strain is actually on deposit in GenBank, a better place to start is The Picornavirus Sequence Database ( a beautifully designed web page compiled by Nick Knowles at the Institute for Animal Health, Pirbright, UK. This comprehensive site catalogues all known family member into appropriate genera and species according to the latest taxonomic designations. GenBank accession numbers, published (and unpublished) references, fragment size and strain origin are registered for every sequence. Should you need to know, for example, that a 327 bp fragment from the HRV-72 1B gene (GenBank: Z47574) was available for comparison (9), The Picornavirus Home Page is the place to inquire. This outstanding resource, like a huge technical library, represents an invaluable research service contribution by Nick, and we are certainly indebted for his extraordinary efforts. Nevertheless, placing books on shelves is not the same as reading them. Companion literary tools such as dictionaries or thesauruses can also help make any text more informative, especially when the contents are written in a foreign language. Comparative sequence analysis, in some respects, is analogous to compiling a thesaurus for Nature s language. Although various sequence segments may appear superficially to encode similar biological meanings, it is true in Nature, as in English, that there are very few exact synonyms. That is, few words in either language are ever observed to exactly substitute for each other in alternative contexts without changing their logic or specific meaning. Just as the etymology of words reflects the sum of their linguistic histories, the subtle peculiarities of each sequence are indicative of unique evolutionary histories. Sequence context and content can both contribute to phenotype. Genomics is a young field, and unfortunately, when it comes to the language of life, our current interpretive skills more closely resemble gauche tourists in Paris, rather than fluent natives. Any singular ribosome still translates the delicate nuances of the basic genetic code with more precision and finesse than our best bioinformaticists. Yet we continue to compare sequences because (hubris aside), it s still one of the small number of available tools with any possibility of breaking down biological syntax

2 into the underlying codes. Within such compilations, we vest the continual hope of teasing from shadowy precedents, those common elements of circumstance and function that perhaps foretell a significant phenotypic impact. Why does this gene convey persistence? How does this IRES attenuate the virus? What specific capsid changes might allow compromised immunity? Since the authors earliest feeble pair wise comparisons between the 3 UTRs of EMCV (3) and PV1 (10), it s been clear that properly formed picornavirus alignments can have some practical use in this regard (15). Good alignments are never trivial to generate, but given that one can learn to understand and respect the principles that underlie algorithmic reconstruction of evolution, reasonable datasets can be created by judicious application of computing power and logic. Another rather trickier problem lies in the definition of a suitable presentation format for effective public data mining. In 1989, with our (then) comparative sequences refined by capsid crystallographic data, we published an 11 page figure summarizing the relative orientations of 33 picornavirus P1-region sequences (16), an exercise we thought rather impressive at the time. With today s much larger alignments, that density would translate into 1 published pages, just for the current protein sequences. Obviously, the printing logistics for huge alignments are hopelessly impractical. Instead, the updated picornavirus datasets cited in this chapter have been placed on a public web site ( and mirrored by the Picornavirus Home Page (above). This site also lists the strain abbreviations, gives accession numbers, sequence bibliographies and algorithms, as well as provides tables of base compositions, codon frequencies, genome RNA folds (17)and other comparative picornavirus data. Reference use of this information or its future updates should cite this chapter as the published source. How the Alignments were Formed: These days anyone with a hard drive and an Internet connection can align sequences. The commonly available CLUSTAL algorithm and similar pair wise distance methods give a reasonable fit for closely related sequences or for proteins that are obvious homologs of similar length. Very nice alignments for 3C pro, 3D pol, or within individual genera have been generated in this way (19). Genome connoisseurs however, are always looking for new methods to tweak the data, especially in sequence regions where good fits are harder to achieve or recognize. Our previous capsid alignments were based on superimposition of virion crystal structure hydrogen-bonding maps, then extended by multiple, reiterative pair wise comparisons to include similar related sequences (16). More recently, we used those alignments as founder data for profile hidden Markov model analyses (HMM) aimed at the optimal inclusion of newer, less-related sequences (4). Profiles are position-specific scoring matrices that sum the variability for each column in an existing alignment (7) creating chained likelihood tables (a Markov model) of substitution probabilities (4). New sequences are then fit snugly against the total framework of the profile. Using the HMMER program suite (Wisconsin Package Version 10.2, Genetics Computer Group, Madison, Wisc.), full-length picornavirus genome HMM-profiles (lengths of b) were calculated from the previous alignments (16). The profiles proved remarkably powerful at positioning new 5 and 3 UTRs and fitting them into the composites. Parallel polyprotein HMM-profiles (lengths of aa) also helped ease new protein-coding sequences into logical, in-frame slots that were by definition, consistent and optimal with regard to all other members of a given genus. Back-translation of the protein alignments and linkage with the UTR fragments, gave genome-length RNA alignments that were in agreement with the best fits for the proteins and for the UTRs. The new files include (nearly) all available non-redundant sequence fragments of at least 2000 bases in length, as listed in The Picornavirus Sequence Database (January, 2001). The alignments were checked for consistency, and then refined mathematically and heuristically to (a) maximize the number of matched bases and encoded amino acids, (b) minimize the location and frequency of indels, and (c) emphasize the conservation of homologous features such as catalytic sites and proteolytic cleavage sites. - 2-

3 Current Picornavirus Alignments: The current iterations now include genome-length alignments for the seven most populous picornaviral genera (Enterovirus, Rhinovirus, Cardiovirus, Aphthovirus, Hepatovirus, Parechovirus, and Teschovirus), and extend over 173 different strains, providing formats for about 1,000,000 bases. The correlate polyprotein alignments, derived by translation of the aligned RNAs, include about 291,000 amino acids. Kobuvirus and Erbovirus data will be added when there are sufficient strains for comparison. The new datasets, clearly impractical in printed form, are Internet accessible at: The web files are displayed as (edited) output from the GCG Wisconsin Package PRETTY program and should be downloaded with a monospace font to maintain alignment continuity. No consensus is shown, but upper case letters highlight alignment columns where at least 2/3 of the sequences conserve an identical residue. The exact plurality required for these votes varies by alignment, as described within each file. Alternative GCG file formats (*.msf or *.rsf) may be requested by (jsgro@facstaff.wisc.edu). The web site also includes RNA and protein alignments for composite picornaviral genera, covering all genome regions with common evolutionary origins (e.g. P1, 2C, 3B, 3C and 3D). Similarity Plots: What then, do the alignments teach us about synonymous regions in picornaviruses? As avowed sequence addicts, we ve spent countless hours staring at alignment columns and marveling at the complex variation versus conservation in the lexicon of viral proteins. Anyone who revels in this level of detail is guaranteed to find personal enlightenment in the web files. Admittedly however, even for the book of life, the repetitive text can become mind numbing after a while, and there is a certain truth to the adage, One picture is worth a thousand words. In this case, we chose graphs instead of pictures, but the point is similar (or is that identical?). After many false starts with different summary methods, it was ultimately decided that only full-length polyproteins should be allowed to vote in the graph-building process. This culled many sequences from the deeper, more redundant P1 regions and kept them from overpowering the P2+P3 regions, where invariably, there were fewer strains in each alignment. The chosen sequences (n = 3 to 33, depending upon the genus), were run through the GCG program, PLOTSIMILARITY, to capture an averaged identity score within a window (win =20) that slid across the alignment. Window averaging smoothes the plot contours and makes it easier to identify to regions with different identity profiles. Figure 1, A-G shows respectively, the relative amino acid conservation in the Enterovirus, Rhinovirus, Aphthovirus, Hepatovirus, Parechovirus, Cardiovirus, and Teschovirus polyproteins. The full-length sequences for individual species were then segregated further and independently recalculated to emphasize how they varied among themselves, while contributing to the contours of the parental genera. All plots for all species with at least two complete members are available on the web. For brevity, only some are shown here. The number of participating sequences (n), the averaged polyprotein identity (average), and the count of alignment columns where all residues were identical (identities), is shown relative to a polyprotein map, punctuated with appropriate cleavage sites. The protein distance scales are numbered according to alignment positions in the files for each genus. To state the obvious, conserved residues in the P2+P3 regions generally outnumber those in the P1 region for every alignment. However, the details in the distribution of conserved elements display a much more subtle and powerfully pattern. Not surprising, and without exception, there is a 1:1 correspondence between all mapped picornavirus antigenic sites and the observed P1 troughs of limited sequence identity. In the Rhinoviruses, for example, the first three segments with less than % identity correspond exactly to the EF-loop of 1B, the BC-loop of 1C, and the BC-loop of 1D. Better known as Nim-II, Nim-III, Nim- IA, respectively, these are the antigenic neutralization sites on HRV-14 (18;20), and presumably other HRVs as well. The shorter Nim-1B site in the DE-loop of 1D (alignment position ~720), and the COOHend of 1D are hypervariable in all rhinos. The Enterovirus P1 troughs, as typified by poliovirus, again correspond to the Nim sites mapped on the surface of the PV-1 particle (14). Also as expected, the deepest - 3-

4 trough in the Aphthovirus 1D protein (position ~9), is the antigenic FMDV-loop between the GH strands (1). Less dominant antigenic contributions from other FMDV P1 segments are consistent with this profile. Relative to these genera, the P1 regions of Cardioviruses and Hepatoviruses are almost flat line. The EMCV and HAV are both represented by multiple sequences (n = 9, n = 15 respectively) but each has only one serotype. Nevertheless, the little blips within the 1B and 1D (EMCV), and at the amino end of 1D (HAV), indeed denote known neutralization sites (2;5). The ThV, which share less antigenic crossreactivity than the EMCV, have deeper troughs in the same locations, presumably from the residues that contribute to this diversity. The Teschoviruses and Parechoviruses have not yet been mapped for epitopes, but since the graphs mirror the sequence variability in the alignments, extrapolation from the better-known viruses clearly point to several probable antigenic sites. Tracings for the HPeV P1 region plot, actually superimpose quite with well with many elements of the HRV plot. In contrast, the Teschovirus plot suggests a prominent, immunogenic FMDV loop, as well as another very reactive site in 1B. In several of these graphs, the P1-P2 boundary marks an apparently abrupt transition in sequence conservation. To illustrate with the Teschoviruses, the data from 30 complete polyproteins, representing 11 PTV serotypes (21) share an average identity of 82% in the P1 region, and 97% in the P2+P3 regions. The PV (83% and 96%), HEV-B (72% and 95%), FMDV (74% and 94%) and HPeV (81% and 92%) also have abrupt transitions. Again, it s the individual sequence patterns, not the averages that should be of interest here. Surely, immunogenic pressures are responsible for the high mutation fixation rates in the epitope-encoding regions. That s why there s local variability in the loops. But the P1 segments between these troughs, representing internal beta sheets or helices have, in fact, identity values quite similar to the P2+P3 region averages. That is, the infrastructure of each genome as a whole, including internal portions of the capsid region, tends to fix mutations at a uniform rate across the species lineages. On the other hand, the non-structural regions aren't uniformly equivalent either. All the plots show the central segment of protein 2C with higher than average conservation levels. The sacrosanct region spans about 1 amino acids, with 97-99% averaged identity, and includes the NTP binding motif (Prosite # PDOC00017, [AG]- x(4)-g-k-[st]), which probably marks the functional core of this protein. The lower than average variability here, again reflects a special selective pressure that works to moderate mutation fixation, with the goal of conserving some key structure or activity that is apparently vital for all picornaviruses. If one looks closely, each genus shows additional regions with greater or lesser genetic stability, as reflected by variability in the alignments. The FMDV 3A region and the EMCV 2A region are among the most variable in their respective species, while the polio, HEV-B and PTV 3D proteins are almost invariant. These plots and the alignments that formed them record the sum of the selective pressures that honed these segments during evolution. There is nothing new about the concept of different mutational fixation rates for different regions of a genome. The concept becomes important only when we want to accurately compare genomes or use their collective histories to retrace evolution. Family history, genus phylogeny, and relationships between and among species, should be followed by using the largest possible sequence fragments that are true homologs and have developed under similar selective pressures. Stated another way, long-term relationships are best fished out of the quiet end of the gene pool. Invariant regions like 2C lend themselves to comparisons outside of the family. Proteins 3B, 3C, 3D and the beta regions of P1 have obvious shared family ancestry with similar fixation rates and their relative mutations reasonably record the history of each genus. The chaotic mutational fixation in the epitope regions, like 1D, are better suited for following short term associations, like strain variation during outbreaks. The remaining genome regions like L, 2A, 3A proteins, and the 5 and 3 UTRs may be analogs in their functions, but they are not always genetic homologs. These regions are the best examples of loosely structured biological synonyms that can be quite useful in ordering species within genera, or strains - 4-

5 within species, but their informational content is very sensitive to specific genome context, and is rarely directly comparable. Nucleotide Preferences: If the alignments and similarity plots tabulate the thesaurus of sequence synonyms, then base composition and codon frequency data are like dictionaries of regional dialects, reflecting nucleotide preferences and codon biases that are characteristic and diagnostic of each species. The tendency for Hepatoviruses or Rhinoviruses to have overabundant A+U sequences, while the Aphthoviruses are rich in G+C, for example, has been well-documented (16). While we stirred the picornavirus sequence pot for the new alignments, it seemed opportune to revisit these parameters along the lines of the updated family taxonomy. Base composition and codon frequency data were collected for 164 strains, representing 19 of the 20 (current) picornaviral species (11). Porcine enterovirus A (PEV-A, 1 kb) was excluded only because there were too few bases for reliable comparison. Within the other species, several redundant strain iterations were sometimes excluded to avoid over weighting by any particular serotype. The final dataset (919 kb total) included: Bovine enterovirus (BEV; 23 kb), Human enterovirus A (HEV-A; 18 kb), Human enterovirus B (HEV-B; 144 kb), Human enterovirus C (HEV-C; 16 kb), Human enterovirus D (HEV-D; 17 kb), Poliovirus (PV; 59 kb), Porcine enterovirus B (PEV-B; 7 kb), Human rhinovirus A (HRV-A; 58 kb), Human rhinovirus B (HRV-B; 10 kb), Encephalomyocarditis virus (EMCV; 77 kb), Theilovirus (ThV; 49 kb), Foot-and-mouth disease virus (FMDV; 161 kb), Equine rhinitis A virus (ERAV; 31 kb), Hepatitis A virus (HAV; 131 kb), Avian encephalomyelitis virus (AEV; 7 kb), Human parechovirus (HePV; 17 kb), Equine rhinitis B virus (ERBV; 9 kb), Porcine teschovirus (PTV; 77 kb), Aichi virus (AiV; 8 kb). The genome base compositions (omitting 5 polyc tracts and 3 polya tails) were very consistent within each tabulated species. The standard deviation of averaged values for any given species (e.g. %A content among all FMDV) was never greater than 2%, and more commonly, was less than 0.5%. The same was true for codon frequencies, in that the codon distribution of each sequence was closely mirrored by other members of that species. Among species, however, there was a strong skew for particular base compositions and codon frequencies (Fig 2). At the far end of the scale, the AiV clearly had the most anomalous composition. Although admittedly, there is only one current complete sequence in this species (GenBank: AB010145), additional data from smaller 3CD-region fragments are consistent in that the C content (38%), nearly exceeds the combined purine content (19% A, 21% G) of the entire genome. Pyrimidines therefore, make up an astonishing 62% of the AiV base count. The bias extends through both UTRs (A:G:C:U of 19%:32%:24%:25%), but is most evident in the coding region, where more than half of the reading frame triplets end in C (xxa = 11%, xxg = 17%, xxc = 54%, xxu = 18%) To understand the magnitude of this extraordinary skew, it s important to remember that five of the twenty amino acids (Glu, Lys, Met, Gln and Trp) representing 14% of the Aichi virus polyprotein composition can t even be encrypted by xxc codons (e.g. Met = AUG, Lys = AAR, etc.), so the bias can only be exerted through encryption of the other fifteen amino acids. The triplet assignments in the standard genetic code are not random. The code is an evolutionary masterpiece that generally works to minimize the impact of mutational and error frequencies caused by polymerase and trna misincorporations. Sterically, the third codon position is the most difficult to read accurately by trna-mrna pairings. Although this is the most degenerate position of the present code (perhaps originally a spacer base), more often than not, mispairings allow the same or similar amino acid to be inserted, regardless of specific nucleotide selection. The second codon position is the least error prone and probably most closely reflects the conservation of an original singlet code (4 bases, 4 amino acids), which discriminated functionality based on "inside" (hydrophobic: xcx or xux) and "outside" (hydrophilic: xax) amino acids. The first position of a codon tends to signify the relative size of the encoded amino acid R-chain (small: Axx or Gxx; large: Cxx or Uxx), with a special partiality towards - 5-

6 Gxx in eucaryotes, as a possible ribosomal proof checking mechanism (12). Because of this codon balance, degeneracy, transitions or errors in the first or second codon positions are likely to cause only minimum coding damage (UUA6CUA = Leu) or at worst, permit conservative substitutions (UUA6GUA = Val). The average eucaryotic or procaryotic protein composition tends to distribute more or less equivalently among inside, outside, large, and small amino acids, so it is very unusual to find a significant nucleotide preferences in the first two codon positions (12;13). Among all other picornaviruses, except for AiV, this rule is generally observed, and the variance is only a few percent among other sequences, in their selection of first or second codon bases (see Fig 2). The average picornavirus ratios for Axx:Gxx:Cxx:Uxx are 30:31:19:20 (StDev of <4% for any base). For xax:xgx:xcx:xux the average ratios are 31:16:25:28 (StDev of <1.5% for any base). The AiV on the other hand, have much higher than average C compositions (Cxx = 28%, xcx = 34%) at all codon bases, signifying an extraordinary selective pressure on this genome s content, that carries even into the first and second codon bases as well as the third base. Other picornaviruses also have different atypical skews, though none is as dramatic as the AiV. The HAVs have high A+U content (29% and 33%) that presents as a marked preference in third codon position (xxa = 28%, xxu = 42%, xxc = 10%), but is also influenced by an unusual under-representation of any codons with CG dinucleotides, such as GCG (Ala), UCG (Ser), ACG (Thr), CGX (Arg), and CCG (Pro). No codons with CG dinucleotides are used in any HAV sequence more than 2-3 times per reading frame, and certain codons, most notably CGG (Arg), GCG (Ala) and CCG (Pro), are excluded entirely from many genomes. The Rhinoviruses (HRV-A and HRV-B) have similar prejudices in their preference for high A+U content, but with the ratios reversed (A = 31%, U = 29%). Again these genomes discriminate against CGG (Arg), CCG (Pro), UCG (Ser), ACG (Thr) and GCG (Ala) codons, selecting nearly any alternative synonymous triplet at a -60 fold higher frequency than those with CG dinucleotides. The CGG (Arg), UCG (Thr), and CCG (Pro) codons are absent from most HRV sequences, and none is used more 20 times in total among the 21,000 codons surveyed for this genus. At the other end of the scale, the FMDV have high G+C contents and favor codons ending with these bases (xxg = 27%, xxc = 37%). Consequently, the FMDV shun a different cohort of triplets, like CUA (Leu), GUA (Val), UUA (Leu) and AUA (Ile), but are rich in CG-containing codons. Each species of Cardioviruses, Enteroviruses, Parechoviruses and Teschoviruses likewise can be seen to have individual characteristics that border on diagnostic, with regard to base and codon preferences. The ThV prefer U+C in the third codon position; the AEV prefer U but not C; the PV and EMCV are reasonably equivalent in their base distributions, etc. So why should we care about these sequence tendencies? Evolution leaves its fingerprints on every page in the book of life. The clustering of base compositions and codon preferences are telling us volumes about shared lineages among viral strains, just like gene maps, IRES types, and proteolytic processing schemes are recognized predicators of familial homology. The peculiar pressures or bottlenecks that shaped one genome, will have also shaped its brethren and progeny. It s hard to imagine any biological situation that would favor 54% of the codons ending in C, but in truth, even stranger skews are common in other organisms. The sequence of Microplasma capricolum is 25% G+C, while Micrococcus leuteus is 75% G+C (12). Originally thought to be a logical adaptation to growth temperature, most evidence now suggests that genome G+C content is more closely related to phylogeny (6), and follows whatever twisted history was survived by the parental lineage. In the procaryotes, the presence of mutagens, suboptimal polymerase base preferences, defective repair mechanisms or other directional mutational pressures are known to vary considerably among phylogenetic lines, and each of these pressures can contribute to a different mutational fixation rate that manifests in characteristic composition preferences (12). However bizarre the outcome, we are formed by the trials and cultures that were survived by our parents. - 6-

7 For eucaryotes, there is very little information about the specific pressures that might shape a given lineage. Certainly, the symmetrical methylation of CG dinucleotides has a unique structural significance in many higher DNA genomes, reducing the value and frequency of this sequence in coding regions (8). There would be no reason for a cell to support CG codons, if they weren t present in their own mrnas. Moreover, unlike procaryotes, where entire genomes usually skew as a unit, the chromosomes of vertebrates are complex mosaics of G+C rich regions and G+C poor regions. Eucaryotic replication occurs in multiple phases, and each phase can use different enzymes, with the G+C rich regions generally copied before the G+C poor regions, in every cell cycle (6). The repertoire of required trnas needed to translate expressed messages, must therefore follow these cycles, particularly in cells with slow division times, like neurons. In short, the cumulative environment of A+U and G+C pressures can vary widely among vertebrates and tissues according to host age and cell cycle timing. We should not be surprised that our viruses adapt to these circumstances, we should only regret that we don t yet understand what these tendencies are telling us about their niches. Acknowledgments. Comparative sequence analyses of picornaviruses is supported by NIH grant AI to ACP. References: 1.Barnett, P.V., E.J. Ouldridge, D.J. Rowlands, F. Brown, and N.R. Parry Neutralizing epitopes of type O foot-and-mouth disease virus. I. Identification and characterization of three functionally independent, conformational sites. J.Gen.Virol. 70: Boege, U., S. Onodera, D.G. Scraba, G.D. Parks, and A.C. Palmenberg Characterization of Mengo virus neutralization epitopes. Virology 181: Drake, N.L., A.C. Palmenberg, A. Ghosh, D.R. Omilianowski, and P. Kaesberg Identification of the polyprotein termination site on encephalomyocarditis viral RNA. J.Virol. 41: Eddy, S.R Profile hidden Markov models. Bioinformatics 14: Emini, E.A., J.V. Hughes, D.S. Perlow, and J. Boger Induction of hepatitis A virusneutralizing antibody by a virus-specific synthetic peptide. J.Virol 55: Filipski, J Evolution of DNA sequences. Contribution of mutational bias and selection to the origin of chromosomal compartments. Adv. Mutagenesis Res. 2: Gribskov, M. and S. Veretnik Identification of sequence patterns with profile analysis. Methods in Enzymology 266: Herbert, A. and A. Rich Topology and formation of left-handed Z-DNA. J.Biol.Chem. 271: Horsnell, C., R.E. Gama, P.J. Hughes, and G. Stanway Molecular relationships berween 21 human rhinovirus serotypes. J.Gen. Virology 76: Khesin, Y.E., A.M. Amchenkova, N.E. Gulevich, and A.E. Narovlysanskii Association between group G chromosomes, especially chromosome G21, and susceptibility of human cells to coxsackie B viruses. Bulletin of Experimental Biology and Medicine. 86: King, A.M.Q., F. Brown, P. Christian, T. Hovi, T. Hyypiä, N.J. Knowles, S.M. Lemon, P.D. Minor, A.C. Palmenberg, T. Skern, and G. Stanway Picornaviridae, p In AnonymousVirus Taxonomy. Seventh Report of the International Committee for the Taxonomy of Viruses. Academic Press, New York. 12. Osawa, S Evolution of the Genetic Code. Oxford University Press, New York. 13. Osawa, S., T.H. Jukes, K. Watanabe, and A. Muto Recent evidence for evolution of the genetic code. Microbiological Rev. 56: Page, G.S., A.G. Mosser, J.M. Hogle, D.J. Filman, R.R. Rueckert, and M. Chow Threedimensional structure of poliovirus serotype 1 neutralizing determinants. J.Virol. 62: Palmenberg, A.C Genome organization, translation and processing in picornaviruses, p

8 15. In D.J. Rolands, B.W.J. Mahy, and M. Mayo (eds.), The Molecular Biology of Positive Strand RNA Viruses. Academic Press, London. 16. Palmenberg, A.C Sequence alignments of picornaviral capsid proteins, p In B. Semler and E. Ehrenfeld (eds.), Molecular Aspects of Picornavirus Infection and Detection. ASM Publications, Washington,DC. 17. Palmenberg, A.C. and J.-Y. Sgro Topological organization of picornaviral genomes: statistical prediction of RNA structural signals. Seminars in Virology 8: Rossmann, M.G., E. Arnold, J.W. Erickson, E.A. Frankenberger, J.P. Griffith, H.-J. Hecht, J.E. Johnson, G. Kamer, M. Luo, A.G. Mosser, R.R. Rueckert, B. Sherry, and G. Vriend The structure of a human common cold virus (Rhinovirus 14) and its functional relations to other picornaviruses. Nature 317: Ryan, M.D. and Flint.M Virus-encoded proteinases of the picornavirus super-group. J. Gen. Virology 78: Sherry, B., A.G. Mosser, R.J. Colonno, and R.R. Rueckert Use of monoclonal antibodies to identify four neutralization immunogens on a common cold picornavirus, human rhinovirus 14. J.Virol. 57: Zell, R., M. Dauber, A. Krumbholz, A. Henke, E. Birch-Hirschfeld, A. Stelzner, D. Prager, and R. Wurm Porcine teschoviruses comprise at least eleven distinct serotypes: molecular and evoutionary aspects. J.Virol. 75: Figure Legends Figure 1. Polyprotein similarity plots. The protein alignments for picornaviral genera were culled to include only full-length sequences. The GCG program PLOTSIMILARITY calculated the average shared identity (window = 20) along the length of each alignment. The data for each genus and for separately calculated select species were plotted relative to a polyprotein map showing the cleavage site divisions and numbered according to the parental alignment columns. The dashed horizontal line and the average value on each plot refer to the overall identity shared among included sequences (calculated with window = 1). The identities value gives the percent of alignment columns where all (n) sequences were the same. This value (occasionally) includes some locations with shared gaps, in addition to locations with identical amino acids. The Parechovirus plot is partial (1ABC only), because a full-length sequence of Ljungan virus, the type member of the LV species (GenBank: AF020541), and the only non-hpev strain in this genus, was not available. Figure 2. Codon and genome base compositions. Base compositions for 19 picornavirus species were summed separately then averaged for (a) the whole genomes, (b) 1 st codon base, (c) 2 nd codon base, and (d) 3 rd codon base. Each concentric ring represents an individual species, subdivided according to the averaged observed frequency (percent) of A:G:C:U. The data were sorted in ascending order by A content in the 3 rd codon position, and maintain the same order in each plot: (inside ring) AiV, ThV, FMDV, ERBV, BEV, AEV, ERAV, HEV-A, EMCV, PTV, HEV-B, PV, PEV-B, HEV-C, HAV, HPeV, HEV-D, HRV-B, HRV-A (outside ring). - 8-

9 A Sequence Identity for Aligned Enterovirus Polyproteins B Sequence Identity for Aligned Rhinovirus Polyproteins polio (n=8) average 93.4% identities 86.7% Rhinovirus A (n=7) average 82.6% identities 66.0% HEVB (n=10) average 91.1% identities 79.0% Rhinoviruses (n=8) average 75.0% identities 43.7% C AB 1C 1D 2A 2B 2C 3 AB 3C 3D Sequence Identity for Aligned Aphthovirus Polyproteins HEVA+C (n=4) average 66.3% identities 51.4% FMDV (n=8) average 89.5% identities 74.1% AB 1C 1D 2A 2B 2C Enteroviruses (n=33) average 69.8% identities 30.0% 3 AB 3C 3D Aphthoviruses (n=10) average 70.5% identities 29.9% L 1AB 1C 1D 2AB 2C 3ABBB 3C 3D

10 D Sequence Identity for Aligned Hepatovirus Polyproteins F Sequence Identity for Aligned Cardiovirus Polyproteins HAV (n=15) average 97.9% identities 90.1% EMCV (n=9) average 97.2% identities 92.0% Hepatoviruses (n=16) average 90.7% identities 38.2% ThV (n=4) average 96.3% identities 93.5% AB 1C 1D 2A 2B 2C 3 AB 3C 3D E Sequence Identity for Aligned Parechovirus Polyproteins HPeV (n=2) average 89.8% identities 89.8% L 1AB 1C 1D 2A 2B 2C Cardioviruses (n=13) average 77.6% identities 52.0% AB 3C 3D G Sequence Identity for Aligned Teschovirus Polyproteins Parechoviruses (n=3) (1ABC region only) average 88.4% identities 80.7% Teschoviruses (n=30) average 91.6% identities 74.7% AB 1C 1D 2A 2B 2C 3AB 3C 3D L 1AB 1C 1D 2AB 2C 3AB 3C 3D

11

Translation. Host Cell Shutoff 1) Initiation of eukaryotic translation involves many initiation factors

Translation. Host Cell Shutoff 1) Initiation of eukaryotic translation involves many initiation factors Translation Questions? 1) How does poliovirus shutoff eukaryotic translation? 2) If eukaryotic messages are not translated how can poliovirus get its message translated? Host Cell Shutoff 1) Initiation

More information

Picornaviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics

Picornaviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics Picornaviruses Virion Genome Genes and proteins Viruses and hosts Diseases Distinctive characteristics Virion Naked icosahedral capsid (T=1) Diameter of 30 nm Genome Linear single-stranded RNA, positive

More information

ICTVdB Virus Descriptions

ICTVdB Virus Descriptions [Home] [Index of Viruses ] [Descriptions] [ Character List ] [ Picture Gallery ] [ Interactive Key ] [ Data Entry] [ 2002 ICTV] ICTVdB Virus Descriptions Descriptions are generated automatically from the

More information

aV. Code assigned:

aV. Code assigned: This form should be used for all taxonomic proposals. Please complete all those modules that are applicable (and then delete the unwanted sections). For guidance, see the notes written in blue and the

More information

Recombination and Selection in the Evolution of Picornaviruses and Other Mammalian Positive-Stranded RNA Viruses

Recombination and Selection in the Evolution of Picornaviruses and Other Mammalian Positive-Stranded RNA Viruses JOURNAL OF VIROLOGY, Nov. 2006, p. 11124 11140 Vol. 80, No. 22 0022-538X/06/$08.00 0 doi:10.1128/jvi.01076-06 Copyright 2006, American Society for Microbiology. All Rights Reserved. Recombination and Selection

More information

This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is worth 2 points.

This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is worth 2 points. MBB 407/511 Molecular Biology and Biochemistry First Examination - October 1, 2002 Name Social Security Number This exam consists of two parts. Part I is multiple choice. Each of these 25 questions is

More information

TRANSLATION: 3 Stages to translation, can you guess what they are?

TRANSLATION: 3 Stages to translation, can you guess what they are? TRANSLATION: Translation: is the process by which a ribosome interprets a genetic message on mrna to place amino acids in a specific sequence in order to synthesize polypeptide. 3 Stages to translation,

More information

Patterns of hemagglutinin evolution and the epidemiology of influenza

Patterns of hemagglutinin evolution and the epidemiology of influenza 2 8 US Annual Mortality Rate All causes Infectious Disease Patterns of hemagglutinin evolution and the epidemiology of influenza DIMACS Working Group on Genetics and Evolution of Pathogens, 25 Nov 3 Deaths

More information

Sequencing of Porcine Enterovirus Groups II and III Reveals Unique Features of Both Virus Groups

Sequencing of Porcine Enterovirus Groups II and III Reveals Unique Features of Both Virus Groups REFERENCES CONTENT ALERTS Sequencing of Porcine Enterovirus Groups II and III Reveals Unique Features of Both Virus Groups Andi Krumbholz, Malte Dauber, Andreas Henke, Eckhard Birch-Hirschfeld, Nick J.

More information

Chapter 12-4 DNA Mutations Notes

Chapter 12-4 DNA Mutations Notes Chapter 12-4 DNA Mutations Notes I. Mutations Introduction A. Definition: Changes in the DNA sequence that affect genetic information B. Mutagen= physical or chemical agent that interacts with DNA to cause

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Spring 2016 Protein Structure February 7, 2016 Introduction to Protein Structure A protein is a linear chain of organic molecular building blocks called amino acids. Introduction to Protein Structure Amine

More information

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise!

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! 1. What process brought 2 divergent chlorophylls into the ancestor of the cyanobacteria,

More information

SURVEILLANCE TECHNICAL

SURVEILLANCE TECHNICAL CHAPTER 5 SURVEILLANCE TECHNICAL ASPECTS 55 Protect - detect - protect Polio eradication strategies can be summed up as protect and detect protect children against polio by vaccinating them, and detect

More information

Evolution of influenza

Evolution of influenza Evolution of influenza Today: 1. Global health impact of flu - why should we care? 2. - what are the components of the virus and how do they change? 3. Where does influenza come from? - are there animal

More information

Point total. Page # Exam Total (out of 90) The number next to each intermediate represents the total # of C-C and C-H bonds in that molecule.

Point total. Page # Exam Total (out of 90) The number next to each intermediate represents the total # of C-C and C-H bonds in that molecule. This exam is worth 90 points. Pages 2- have questions. Page 1 is for your reference only. Honor Code Agreement - Signature: Date: (You agree to not accept or provide assistance to anyone else during this

More information

Bioinformatics Laboratory Exercise

Bioinformatics Laboratory Exercise Bioinformatics Laboratory Exercise Biology is in the midst of the genomics revolution, the application of robotic technology to generate huge amounts of molecular biology data. Genomics has led to an explosion

More information

... Department of Biological Sciences, John Tabor Laboratories, University of Essex, Colchester CO4 3SQ, UK

... Department of Biological Sciences, John Tabor Laboratories, University of Essex, Colchester CO4 3SQ, UK Journal of General Virology (2000), 81, 201 207. Printed in Great Britain... The 2A proteins of three diverse picornaviruses are related to each other and to the H-rev107 family of proteins involved in

More information

YUMI YAMAGUCHI-KABATA AND TAKASHI GOJOBORI* Center for Information Biology, National Institute of Genetics, Mishima , Japan

YUMI YAMAGUCHI-KABATA AND TAKASHI GOJOBORI* Center for Information Biology, National Institute of Genetics, Mishima , Japan JOURNAL OF VIROLOGY, May 2000, p. 4335 4350 Vol. 74, No. 9 0022-538X/00/$04.00 0 Copyright 2000, American Society for Microbiology. All Rights Reserved. Reevaluation of Amino Acid Variability of the Human

More information

Viral Taxonomic Classification

Viral Taxonomic Classification Viruses Part I Viral Taxonomic Classification Order>> -virales Family>> - viridae Subfamily>> -virinae Genus>> -virus Species Order>> Picornavirales Family>> Picornaviridae Subfamily>> Picornavirinae Genus>>

More information

Vincent Racaniello

Vincent Racaniello Vincent Racaniello vrr1@columbia.edu www.virology.ws Poliomyelitis Polio (grey), myelon (marrow) = Greek itis (inflammation of) = Latin A common, acute viral disease characterized clinically by a brief

More information

Sequences in the 5" Non-coding Region of Human Rhinovims 14 RNA that Affect in vitro Translation

Sequences in the 5 Non-coding Region of Human Rhinovims 14 RNA that Affect in vitro Translation J. gen. Virol. (1989), 70, 2799-2804. Printed in Great Britain 2799 Key words: rhinovirus, human type 14/5' non-coding region~in vitro translation Sequences in the 5" Non-coding Region of Human Rhinovims

More information

Review article. Structure, function and evolution of picornaviruses. Glyn Stanway. Introduction. Structure of picornaviruses

Review article. Structure, function and evolution of picornaviruses. Glyn Stanway. Introduction. Structure of picornaviruses Journal of General Virology (1990), 71, 2483-2501. Printed in Great Britain 2483 Review article Structure, function and evolution of picornaviruses Glyn Stanway Department of Biology, University of Essex,

More information

Molecular analysis of duck hepatitis virus type 1 reveals a novel lineage close to the genus Parechovirus in the family Picornaviridae

Molecular analysis of duck hepatitis virus type 1 reveals a novel lineage close to the genus Parechovirus in the family Picornaviridae Journal of General Virology (2006), 87, 3307 3316 DOI 10.1099/vir.0.81804-0 Molecular analysis of duck hepatitis virus type 1 reveals a novel lineage close to the genus Parechovirus in the family Picornaviridae

More information

Susanne Johansson, 1 Bo Niklasson, 2 Jacob Maizel, 3 Alexander E. Gorbalenya, 4 * and A. Michael Lindberg 1 *

Susanne Johansson, 1 Bo Niklasson, 2 Jacob Maizel, 3 Alexander E. Gorbalenya, 4 * and A. Michael Lindberg 1 * JOURNAL OF VIROLOGY, Sept. 2002, p. 8920 8930 Vol. 76, No. 17 0022-538X/02/$04.00 0 DOI: 10.1128/JVI.76.17.8920 8930.2002 Copyright 2002, American Society for Microbiology. All Rights Reserved. Molecular

More information

Coronaviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics

Coronaviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics Coronaviruses Virion Genome Genes and proteins Viruses and hosts Diseases Distinctive characteristics Virion Spherical enveloped particles studded with clubbed spikes Diameter 120-160 nm Coiled helical

More information

Objective: You will be able to explain how the subcomponents of

Objective: You will be able to explain how the subcomponents of Objective: You will be able to explain how the subcomponents of nucleic acids determine the properties of that polymer. Do Now: Read the first two paragraphs from enduring understanding 4.A Essential knowledge:

More information

numbe r Done by Corrected by Doctor

numbe r Done by Corrected by Doctor numbe r 5 Done by Mustafa Khader Corrected by Mahdi Sharawi Doctor Ashraf Khasawneh Viral Replication Mechanisms: (Protein Synthesis) 1. Monocistronic Method: All human cells practice the monocistronic

More information

بسم هللا الرحمن الرحيم

بسم هللا الرحمن الرحيم - 1 - - - 1 P a g e بسم هللا الرحمن الرحيم This sheet was made from record section 1 all information are included - Introduction Our respiratory tract is divided anatomically to upper (URT),middle and

More information

Enterovirus D68-US Outbreak 2014 Laboratory Perspective

Enterovirus D68-US Outbreak 2014 Laboratory Perspective Enterovirus D68-US Outbreak 2014 Laboratory Perspective Allan Nix Team Lead Picornavirus Laboratory Polio and Picornavirus Laboratory Branch PAHO / SARInet Webinar November 6, 2014 National Center for

More information

The Basics: A general review of molecular biology:

The Basics: A general review of molecular biology: The Basics: A general review of molecular biology: DNA Transcription RNA Translation Proteins DNA (deoxy-ribonucleic acid) is the genetic material It is an informational super polymer -think of it as the

More information

LESSON 4.4 WORKBOOK. How viruses make us sick: Viral Replication

LESSON 4.4 WORKBOOK. How viruses make us sick: Viral Replication DEFINITIONS OF TERMS Eukaryotic: Non-bacterial cell type (bacteria are prokaryotes).. LESSON 4.4 WORKBOOK How viruses make us sick: Viral Replication This lesson extends the principles we learned in Unit

More information

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc. Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets

More information

Biochemistry 2000 Sample Question Transcription, Translation and Lipids. (1) Give brief definitions or unique descriptions of the following terms:

Biochemistry 2000 Sample Question Transcription, Translation and Lipids. (1) Give brief definitions or unique descriptions of the following terms: (1) Give brief definitions or unique descriptions of the following terms: (a) exon (b) holoenzyme (c) anticodon (d) trans fatty acid (e) poly A tail (f) open complex (g) Fluid Mosaic Model (h) embedded

More information

Bioinformation by Biomedical Informatics Publishing Group

Bioinformation by Biomedical Informatics Publishing Group Predicted RNA secondary structures for the conserved regions in dengue virus Pallavi Somvanshi*, Prahlad Kishore Seth Bioinformatics Centre, Biotech Park, Sector G, Jankipuram, Lucknow 226021, Uttar Pradesh,

More information

Introduction: RNA viruses

Introduction: RNA viruses SECTION I Introduction: RNA viruses Carol Shoshkes Reiss Viruses that infect the central nervous system may cause acute, chronic, or latent infections. In some cases, the diseases manifested are attributable

More information

T he family Picornaviridae is comprised of

T he family Picornaviridae is comprised of 214 REVIEW Picornavirus uncoating M S Smyth, J H Martin... Recently, much has been learned about the molecular mechanisms involved in the pathogenesis of picornaviruses. This has been accelerated by the

More information

HAV HBV HCV HDV HEV HGV

HAV HBV HCV HDV HEV HGV Viral Hepatitis HAV HBV HCV HDV HEV HGV Additional well-characterized viruses that can cause sporadic hepatitis, such as yellow fever virus, cytomegalovirus, Epstein-Barr virus, herpes simplex virus, rubella

More information

Pre-mRNA has introns The splicing complex recognizes semiconserved sequences

Pre-mRNA has introns The splicing complex recognizes semiconserved sequences Adding a 5 cap Lecture 4 mrna splicing and protein synthesis Another day in the life of a gene. Pre-mRNA has introns The splicing complex recognizes semiconserved sequences Introns are removed by a process

More information

number Done by Corrected by Doctor Ashraf

number Done by Corrected by Doctor Ashraf number 4 Done by Nedaa Bani Ata Corrected by Rama Nada Doctor Ashraf Genome replication and gene expression Remember the steps of viral replication from the last lecture: Attachment, Adsorption, Penetration,

More information

Biological systems interact, and these systems and their interactions possess complex properties. STOP at enduring understanding 4A

Biological systems interact, and these systems and their interactions possess complex properties. STOP at enduring understanding 4A Biological systems interact, and these systems and their interactions possess complex properties. STOP at enduring understanding 4A Homework Watch the Bozeman video called, Biological Molecules Objective:

More information

Complete Student Notes for BIOL2202

Complete Student Notes for BIOL2202 Complete Student Notes for BIOL2202 Revisiting Translation & the Genetic Code Overview How trna molecules interpret a degenerate genetic code and select the correct amino acid trna structure: modified

More information

Identification of Mutation(s) in. Associated with Neutralization Resistance. Miah Blomquist

Identification of Mutation(s) in. Associated with Neutralization Resistance. Miah Blomquist Identification of Mutation(s) in the HIV 1 gp41 Subunit Associated with Neutralization Resistance Miah Blomquist What is HIV 1? HIV-1 is an epidemic that affects over 34 million people worldwide. HIV-1

More information

Last time we talked about the few steps in viral replication cycle and the un-coating stage:

Last time we talked about the few steps in viral replication cycle and the un-coating stage: Zeina Al-Momani Last time we talked about the few steps in viral replication cycle and the un-coating stage: Un-coating: is a general term for the events which occur after penetration, we talked about

More information

Hepadnaviruses: Variations on the Retrovirus Theme

Hepadnaviruses: Variations on the Retrovirus Theme WBV21 6/27/03 11:34 PM Page 377 Hepadnaviruses: Variations on the Retrovirus Theme 21 CHAPTER The virion and the viral genome The viral replication cycle The pathogenesis of hepatitis B virus A plant hepadnavirus

More information

Peptide hydrolysis uncatalyzed half-life = ~450 years HIV protease-catalyzed half-life = ~3 seconds

Peptide hydrolysis uncatalyzed half-life = ~450 years HIV protease-catalyzed half-life = ~3 seconds Uncatalyzed half-life Peptide hydrolysis uncatalyzed half-life = ~450 years IV protease-catalyzed half-life = ~3 seconds Life Sciences 1a Lecture Slides Set 9 Fall 2006-2007 Prof. David R. Liu In the absence

More information

Phenylketonuria (PKU) Structure of Phenylalanine Hydroxylase. Biol 405 Molecular Medicine

Phenylketonuria (PKU) Structure of Phenylalanine Hydroxylase. Biol 405 Molecular Medicine Phenylketonuria (PKU) Structure of Phenylalanine Hydroxylase Biol 405 Molecular Medicine 1998 Crystal structure of phenylalanine hydroxylase solved. The polypeptide consists of three regions: Regulatory

More information

Bioinformatics. Sequence Analysis: Part III. Pattern Searching and Gene Finding. Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute

Bioinformatics. Sequence Analysis: Part III. Pattern Searching and Gene Finding. Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute Bioinformatics Sequence Analysis: Part III. Pattern Searching and Gene Finding Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute Course Syllabus Jan 7 Jan 14 Jan 21 Jan 28 Feb 4 Feb 11 Feb 18

More information

Severe Acute Respiratory Syndrome (SARS) Coronavirus

Severe Acute Respiratory Syndrome (SARS) Coronavirus Severe Acute Respiratory Syndrome (SARS) Coronavirus Coronaviruses Coronaviruses are single stranded enveloped RNA viruses that have a helical geometry. Coronaviruses are the largest of RNA viruses with

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Foot-and-Mouth Disease

Foot-and-Mouth Disease CLINICAL MICROBIOLOGY REVIEWS, Apr. 2004, p. 465 493 Vol. 17, No. 2 0893-8512/04/$08.00 0 DOI: 10.1128/CMR.17.2.465 493.2004 Foot-and-Mouth Disease Marvin J. Grubman* and Barry Baxt Plum Island Animal

More information

SMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination

SMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination Final Examination You have a choice between A, B, or C. Please email your solutions, as a pdf attachment, by May 13, 2015. In the subject of the email, please use the following format: firstname_lastname_x

More information

Rajesh Kannangai Phone: ; Fax: ; *Corresponding author

Rajesh Kannangai   Phone: ; Fax: ; *Corresponding author Amino acid sequence divergence of Tat protein (exon1) of subtype B and C HIV-1 strains: Does it have implications for vaccine development? Abraham Joseph Kandathil 1, Rajesh Kannangai 1, *, Oriapadickal

More information

Malik Sallam. Ola AL-juneidi. Ammar Ramadan. 0 P a g e

Malik Sallam. Ola AL-juneidi. Ammar Ramadan. 0 P a g e 1 Malik Sallam Ola AL-juneidi Ammar Ramadan 0 P a g e Today's lecture will be about viral upper respiratory tract infections. Those include: common cold, sinusitis, otitis, etc. Infections in the upper

More information

Multiple sequence alignment

Multiple sequence alignment Multiple sequence alignment Bas. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, February 18 th 2016 Protein alignments We have seen how to create a pairwise alignment of two sequences

More information

LESSON 3.2 WORKBOOK. How do normal cells become cancer cells? Workbook Lesson 3.2

LESSON 3.2 WORKBOOK. How do normal cells become cancer cells? Workbook Lesson 3.2 For a complete list of defined terms, see the Glossary. Transformation the process by which a cell acquires characteristics of a tumor cell. LESSON 3.2 WORKBOOK How do normal cells become cancer cells?

More information

Virus Genetic Diversity

Virus Genetic Diversity Virus Genetic Diversity Jin-Ching Lee, Ph.D. 李 jclee@kmu.edu.tw http://jclee.dlearn.kmu.edu.t jclee.dlearn.kmu.edu.tw TEL: 2369 Office: N1024 Faculty of Biotechnology Kaohsiung Medical University Outline

More information

7.012 Quiz 3 Answers

7.012 Quiz 3 Answers MIT Biology Department 7.012: Introductory Biology - Fall 2004 Instructors: Professor Eric Lander, Professor Robert A. Weinberg, Dr. Claudette Gardel Friday 11/12/04 7.012 Quiz 3 Answers A > 85 B 72-84

More information

To test the possible source of the HBV infection outside the study family, we searched the Genbank

To test the possible source of the HBV infection outside the study family, we searched the Genbank Supplementary Discussion The source of hepatitis B virus infection To test the possible source of the HBV infection outside the study family, we searched the Genbank and HBV Database (http://hbvdb.ibcp.fr),

More information

paper and beads don t fall off. Then, place the beads in the following order on the pipe cleaner:

paper and beads don t fall off. Then, place the beads in the following order on the pipe cleaner: Beady Pipe Cleaner Proteins Background: Proteins are the molecules that carry out most of the cell s dayto-day functions. While the DNA in the nucleus is "the boss" and controls the activities of the cell,

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Bio 111 Study Guide Chapter 17 From Gene to Protein

Bio 111 Study Guide Chapter 17 From Gene to Protein Bio 111 Study Guide Chapter 17 From Gene to Protein BEFORE CLASS: Reading: Read the introduction on p. 333, skip the beginning of Concept 17.1 from p. 334 to the bottom of the first column on p. 336, and

More information

Polyomaviridae. Spring

Polyomaviridae. Spring Polyomaviridae Spring 2002 331 Antibody Prevalence for BK & JC Viruses Spring 2002 332 Polyoma Viruses General characteristics Papovaviridae: PA - papilloma; PO - polyoma; VA - vacuolating agent a. 45nm

More information

Introduction retroposon

Introduction retroposon 17.1 - Introduction A retrovirus is an RNA virus able to convert its sequence into DNA by reverse transcription A retroposon (retrotransposon) is a transposon that mobilizes via an RNA form; the DNA element

More information

Central Dogma. Central Dogma. Translation (mrna -> protein)

Central Dogma. Central Dogma. Translation (mrna -> protein) Central Dogma Central Dogma Translation (mrna -> protein) mrna code for amino acids 1. Codons as Triplet code 2. Redundancy 3. Open reading frames 4. Start and stop codons 5. Mistakes in translation 6.

More information

Cells N5 Homework book

Cells N5 Homework book 1 Cells N5 Homework book 2 Homework 1 3 4 5 Homework2 Cell Ultrastructure and Membrane 1. Name and give the function of the numbered organelles in the cell below: A E B D C 2. Name 3 structures you might

More information

Catalysis & specificity: Proteins at work

Catalysis & specificity: Proteins at work Catalysis & specificity: Proteins at work Introduction Having spent some time looking at the elements of structure of proteins and DNA, as well as their ability to form intermolecular interactions, it

More information

Copyright 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings

Copyright 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range of functions Proteins account for more than 50% of the dry mass of most cells Protein functions include structural support, storage,

More information

Genetic information flows from mrna to protein through the process of translation

Genetic information flows from mrna to protein through the process of translation Genetic information flows from mrn to protein through the process of translation TYPES OF RN (RIBONUCLEIC CID) RN s job - protein synthesis (assembly of amino acids into proteins) Three main types: 1.

More information

NEXT GENERATION SEQUENCING OPENS NEW VIEWS ON VIRUS EVOLUTION AND EPIDEMIOLOGY. 16th International WAVLD symposium, 10th OIE Seminar

NEXT GENERATION SEQUENCING OPENS NEW VIEWS ON VIRUS EVOLUTION AND EPIDEMIOLOGY. 16th International WAVLD symposium, 10th OIE Seminar NEXT GENERATION SEQUENCING OPENS NEW VIEWS ON VIRUS EVOLUTION AND EPIDEMIOLOGY S. Van Borm, I. Monne, D. King and T. Rosseel 16th International WAVLD symposium, 10th OIE Seminar 07.06.2013 Viral livestock

More information

Lecture 2: Virology. I. Background

Lecture 2: Virology. I. Background Lecture 2: Virology I. Background A. Properties 1. Simple biological systems a. Aggregates of nucleic acids and protein 2. Non-living a. Cannot reproduce or carry out metabolic activities outside of a

More information

Translational Control of Viral Gene Expression in Eukaryotes

Translational Control of Viral Gene Expression in Eukaryotes MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, June 2000, p. 239 280 Vol. 64, No. 2 1092-2172/00/$04.00 0 Copyright 2000, American Society for Microbiology. All Rights Reserved. Translational Control of Viral

More information

RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang

RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang Table of Contents Overview RSpredict JAVA RSpredict WebServer RNAstructure

More information

7.014 Problem Set 7 Solutions

7.014 Problem Set 7 Solutions MIT Department of Biology 7.014 Introductory Biology, Spring 2005 7.014 Problem Set 7 Solutions Question 1 Part A Antigen binding site Antigen binding site Variable region Light chain Light chain Variable

More information

Mutants and HBV vaccination. Dr. Ulus Salih Akarca Ege University, Izmir, Turkey

Mutants and HBV vaccination. Dr. Ulus Salih Akarca Ege University, Izmir, Turkey Mutants and HBV vaccination Dr. Ulus Salih Akarca Ege University, Izmir, Turkey Geographic Distribution of Chronic HBV Infection 400 million people are carrier of HBV Leading cause of cirrhosis and HCC

More information

Insulin Resistance. Biol 405 Molecular Medicine

Insulin Resistance. Biol 405 Molecular Medicine Insulin Resistance Biol 405 Molecular Medicine Insulin resistance: a subnormal biological response to insulin. Defects of either insulin secretion or insulin action can cause diabetes mellitus. Insulin-dependent

More information

Evidence for enteroviral persistence in humans

Evidence for enteroviral persistence in humans Journal of General Virology (1997), 78, 307 312. Printed in Great Britain...... SHORT COMMUNICATION Evidence for enteroviral persistence in humans Daniel N. Galbraith, Carron Nairn and Geoffrey B. Clements

More information

The Structure and Function of Large Biological Molecules Part 4: Proteins Chapter 5

The Structure and Function of Large Biological Molecules Part 4: Proteins Chapter 5 Key Concepts: The Structure and Function of Large Biological Molecules Part 4: Proteins Chapter 5 Proteins include a diversity of structures, resulting in a wide range of functions Proteins Enzymatic s

More information

Exploring HIV Evolution: An Opportunity for Research Sam Donovan and Anton E. Weisstein

Exploring HIV Evolution: An Opportunity for Research Sam Donovan and Anton E. Weisstein Microbes Count! 137 Video IV: Reading the Code of Life Human Immunodeficiency Virus (HIV), like other retroviruses, has a much higher mutation rate than is typically found in organisms that do not go through

More information

Development of a predictive model for vaccine matching for serotype O FMDV from serology and capsid sequence

Development of a predictive model for vaccine matching for serotype O FMDV from serology and capsid sequence Development of a predictive model for vaccine matching for serotype O FMDV from serology and capsid sequence D. Borley, S. Upadhyaya, D. Paton, R. Reeve and Mana Mahapatra Pirbright Laboratory United Kingdom

More information

Amino acids. Side chain. -Carbon atom. Carboxyl group. Amino group

Amino acids. Side chain. -Carbon atom. Carboxyl group. Amino group PROTEINS Amino acids Side chain -Carbon atom Amino group Carboxyl group Amino acids Primary structure Amino acid monomers Peptide bond Peptide bond Amino group Carboxyl group Peptide bond N-terminal (

More information

Envelope e_ :------, Envelope glycoproteins e_ ~ Single-stranded RNA ----, Nucleocapsid

Envelope e_ :------, Envelope glycoproteins e_ ~ Single-stranded RNA ----, Nucleocapsid Virus Antib(tdies Your Expertise, Our Antibodies, Accelerated Discovery. Envelope e_---------:------, Envelope glycoproteins e_-------1~ Single-stranded RNA ----, Nucleocapsid First identified in 1989,

More information

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc. Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able

More information

Bacteria and Viruses

Bacteria and Viruses CHAPTER 13 LESSON 3 Bacteria and Viruses What are viruses? Key Concepts What are viruses? How do viruses affect human health? What do you think? Read the two statements below and decide whether you agree

More information

AP Biology. Viral diseases Polio. Chapter 18. Smallpox. Influenza: 1918 epidemic. Emerging viruses. A sense of size

AP Biology. Viral diseases Polio. Chapter 18. Smallpox. Influenza: 1918 epidemic. Emerging viruses. A sense of size Hepatitis Viral diseases Polio Chapter 18. Measles Viral Genetics Influenza: 1918 epidemic 30-40 million deaths world-wide Chicken pox Smallpox Eradicated in 1976 vaccinations ceased in 1980 at risk population?

More information

LAB#23: Biochemical Evidence of Evolution Name: Period Date :

LAB#23: Biochemical Evidence of Evolution Name: Period Date : LAB#23: Biochemical Evidence of Name: Period Date : Laboratory Experience #23 Bridge Worth 80 Lab Minutes If two organisms have similar portions of DNA (genes), these organisms will probably make similar

More information

NOMENCLATURE & CLASSIFICATION OF PLANT VIRUSES. P.N. Sharma Department of Plant Pathology, CSK HPKV, Palampur (H.P.)

NOMENCLATURE & CLASSIFICATION OF PLANT VIRUSES. P.N. Sharma Department of Plant Pathology, CSK HPKV, Palampur (H.P.) NOMENCLATURE & CLASSIFICATION OF PLANT VIRUSES P.N. Sharma Department of Plant Pathology, CSK HPKV, Palampur (H.P.) What is the purpose of classification? To make structural arrangement comprehension for

More information

Translation Activity Guide

Translation Activity Guide Translation Activity Guide Student Handout β-globin Translation Translation occurs in the cytoplasm of the cell and is defined as the synthesis of a protein (polypeptide) using information encoded in an

More information

Bioinformatics for molecular biology

Bioinformatics for molecular biology Bioinformatics for molecular biology Structural bioinformatics tools, predictors, and 3D modeling Structural Biology Review Dr Research Scientist Department of Microbiology, Oslo University Hospital -

More information

HOST-PATHOGEN CO-EVOLUTION THROUGH HIV-1 WHOLE GENOME ANALYSIS

HOST-PATHOGEN CO-EVOLUTION THROUGH HIV-1 WHOLE GENOME ANALYSIS HOST-PATHOGEN CO-EVOLUTION THROUGH HIV-1 WHOLE GENOME ANALYSIS Somda&a Sinha Indian Institute of Science, Education & Research Mohali, INDIA International Visiting Research Fellow, Peter Wall Institute

More information

Protein Structure Monday, March

Protein Structure Monday, March Michael Morales ffice: 204/202 ary; knock hard if you visit as my office is some door. Lab: 238 & 205 ary ffice hours Either make an appointment or just drop by Phone: 829-3965 E-Mail: moralesm@buffalo.edu

More information

Host Dependent Evolutionary Patterns and the Origin of 2009 H1N1 Pandemic Influenza

Host Dependent Evolutionary Patterns and the Origin of 2009 H1N1 Pandemic Influenza Host Dependent Evolutionary Patterns and the Origin of 2009 H1N1 Pandemic Influenza The origin of H1N1pdm constitutes an unresolved mystery, as its most recently observed ancestors were isolated in pigs

More information

SEQUENCE FEATURE VARIANT TYPES

SEQUENCE FEATURE VARIANT TYPES SEQUENCE FEATURE VARIANT TYPES DEFINITION OF SFVT: The Sequence Feature Variant Type (SFVT) component in IRD (http://www.fludb.org) is a relatively novel approach that delineates specific regions, called

More information

7.012 Problem Set 6 Solutions

7.012 Problem Set 6 Solutions Name Section 7.012 Problem Set 6 Solutions Question 1 The viral family Orthomyxoviridae contains the influenza A, B and C viruses. These viruses have a (-)ss RNA genome surrounded by a capsid composed

More information

PROTEIN SYNTHESIS. It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms.

PROTEIN SYNTHESIS. It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms. PROTEIN SYNTHESIS It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms.» GENES = a sequence of nucleotides in DNA that performs

More information

RNA Processing in Eukaryotes *

RNA Processing in Eukaryotes * OpenStax-CNX module: m44532 1 RNA Processing in Eukaryotes * OpenStax This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 By the end of this section, you

More information

Protein Synthesis and Mutation Review

Protein Synthesis and Mutation Review Protein Synthesis and Mutation Review 1. Using the diagram of RNA below, identify at least three things different from a DNA molecule. Additionally, circle a nucleotide. 1) RNA is single stranded; DNA

More information

Antiviral agent blocks breathing of the common cold virus

Antiviral agent blocks breathing of the common cold virus Proc. Natl. Acad. Sci. USA Vol. 95, pp. 6774 6778, June 1998 Biophysics Antiviral agent blocks breathing of the common cold virus J. KATHLEEN LEWIS*, BRIAN BOTHNER*, THOMAS J. SMITH, AND GARY SIUZDAK*

More information