A Bioinformatics Method for Identifying RNA Structures within Human Cells

Similar documents
LESSON 4.4 WORKBOOK. How viruses make us sick: Viral Replication

Study the Evolution of the Avian Influenza Virus

Chapter 18. Viral Genetics. AP Biology

Synthetic Genomics and Its Application to Viral Infectious Diseases. Timothy Stockwell (JCVI) David Wentworth (JCVI)

LESSON 4.5 WORKBOOK. How do viruses adapt Antigenic shift and drift and the flu pandemic

VIRUSES. Biology Applications Control. David R. Harper. Garland Science Taylor & Francis Group NEW YORK AND LONDON

Overview: Chapter 19 Viruses: A Borrowed Life


Islamic University Faculty of Medicine

Bi 8 Lecture 17. interference. Ellen Rothenberg 1 March 2016

Viruses. Instructions fill in the blanks with the appropriate term to have the sentence make sense.

Evolution of influenza

Viruses Tomasz Kordula, Ph.D.

VIROLOGY PRINCIPLES AND APPLICATIONS WILEY. John B. Carter and Venetia A. Saunders

Chapter 6- An Introduction to Viruses*

Chapter 19: Viruses. 1. Viral Structure & Reproduction. 2. Bacteriophages. 3. Animal Viruses. 4. Viroids & Prions

Chapter13 Characterizing and Classifying Viruses, Viroids, and Prions

AP Biology. Viral diseases Polio. Chapter 18. Smallpox. Influenza: 1918 epidemic. Emerging viruses. A sense of size

Chapter 13 Viruses, Viroids, and Prions. Biology 1009 Microbiology Johnson-Summer 2003

7.012 Problem Set 6 Solutions

Lecture 19 Evolution and human health

Bioinformation by Biomedical Informatics Publishing Group

Phenomena first observed in petunia

Ch. 19 Viruses & Bacteria: What Is a Virus?

Name Section Problem Set 6

LESSON 1.4 WORKBOOK. Viral sizes and structures. Workbook Lesson 1.4

Some living things are made of ONE cell, and are called. Other organisms are composed of many cells, and are called. (SEE PAGE 6)

Chapter 13B: Animal Viruses

Modeling the Antigenic Evolution of Influenza Viruses from Sequences

DNA codes for RNA, which guides protein synthesis.

Viral Genetics. BIT 220 Chapter 16

Viruses. An Illustrated Guide to Viral Life Cycles to Accompany Lecture. By Noel Ways

Viruses. Picture from:

number Done by Corrected by Doctor Ashraf

Lecture 2: Virology. I. Background

Last time we talked about the few steps in viral replication cycle and the un-coating stage:

Viral reproductive cycle

علم األحياء الدقيقة Microbiology Introduction to Virology & Immunology

What is influenza virus? 13,000 base RNA genome: 1/ the size of the human genome

Unit 2: Lesson 2 Case Studies: Influenza and HIV LESSON QUESTIONS

VIRUSES. 1. Describe the structure of a virus by completing the following chart.

Chapter 19: The Genetics of Viruses and Bacteria

19/06/2013. Viruses are not organisms (do not belong to any kingdom). Viruses are not made of cells, have no cytoplasm, and no membranes.

Grade Level: Grades 9-12 Estimated Time Allotment Part 1: One 50- minute class period Part 2: One 50- minute class period

Intrinsic cellular defenses against virus infection

7.013 Spring 2005 Problem Set 7

LESSON 4.6 WORKBOOK. Designing an antiviral drug The challenge of HIV

Class 34: Computing with Life (and the Chicken Flu)

Revisiting the Definition of Living Thing

Palindromes drive the re-assortment in Influenza A

Exploring Energy and Fitness Landscapes of Proteins

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Chapter 19: Viruses. 1. Viral Structure & Reproduction. What exactly is a Virus? 11/7/ Viral Structure & Reproduction. 2.

Viruses. Properties. Some viruses contain other ingredients (e.g., lipids, carbohydrates), but these are derived from their host cells.

AP Biology Reading Guide. Concept 19.1 A virus consists of a nucleic acid surrounded by a protein coat

Reoviruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics

1) DNA unzips - hydrogen bonds between base pairs are broken by special enzymes.

III. What are the requirements for taking and passing this course?

WHY? Viruses are considered non-living because they do:

Systems Biology and Animal Models: STRIDE

CONTENTS. 1. Introduction. 4. Virology. 2. Virus Structure. 5. Virus and Medicine. 3. Virus Replication. 6. Review

Viruses defined acellular organisms genomes nucleic acid replicate inside host cells host metabolic machinery ribosomes

Bacteriophage Reproduction

The Flu: The Virus, The Vaccine, and Surveillance. Joanna Malukiewicz GK12 Program School of Life Sciences Arizona State University

5/6/17. Diseases. Disease. Pathogens. Domain Bacteria Characteristics. Bacteria Viruses (including HIV) Pathogens are disease-causing organisms

HS-LS4-4 Construct an explanation based on evidence for how natural selection leads to adaptation of populations.

Purpose: To describe the characteristics of viruses and how they infect a host cell.

Part Of A Virus That Contains The Instructions For Making New Viruses

INTERfering and Co-Evolving Prevention and Therapy (INTERCEPT) Proposers Day

Early Diagnosis: A Critical Step in Bird Flu Prevention

Virus Basics. General Characteristics of Viruses. Chapter 13 & 14. Non-living entities. Can infect organisms of every domain

*viruses have no cell wall and made up of nucleic acid components.

Phylogenetic Methods

Viruses. CLS 212: Medical Microbiology Miss Zeina Alkudmani

Viral structure م.م رنا مشعل

Virus Basics. General Characteristics of Viruses 5/9/2011. General Characteristics of Viruses. Chapter 13 & 14. Non-living entities

Exam 1-Key. Biology II Winter 2013

Protein Modeling Event

Patricia Fitzgerald-Bocarsly

Section 9. Junaid Malek, M.D.

Chapter 25. 바이러스 (The Viruses)

Influenza viruses. Virion. Genome. Genes and proteins. Viruses and hosts. Diseases. Distinctive characteristics

Biology 350: Microbial Diversity

Human Genome Complexity, Viruses & Genetic Variability

PROCEDURE Follow the instructions as you proceed through the Click and Learn, and answer the questions in the spaces provided.

HOST-PATHOGEN CO-EVOLUTION THROUGH HIV-1 WHOLE GENOME ANALYSIS

2.1 VIRUSES. 2.1 Learning Goals

Protein Synthesis

Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012

RNA and Protein Synthesis Guided Notes

NEXT GENERATION SEQUENCING OPENS NEW VIEWS ON VIRUS EVOLUTION AND EPIDEMIOLOGY. 16th International WAVLD symposium, 10th OIE Seminar

IPA Advanced Training Course

Viral Replication and Genetics

RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang

MedChem 401~ Retroviridae. Retroviridae

Biol115 The Thread of Life"

LESSON 1.4 WORKBOOK. Viral structures. Just how small are viruses? Workbook Lesson 1.4 1

Lecture Guide Viruses (CH13)

7.014 Problem Set 7 Solutions

VIRUS TAXONOMY AND REPLICATION

Transcription:

Wright State University CORE Scholar Physics Seminars Physics 11-1-2013 A Bioinformatics Method for Identifying RNA Structures within Human Cells Stephen Donald Huff Follow this and additional works at: http://corescholar.libraries.wright.edu/physics_seminars Part of the Physics Commons Repository Citation Huff, S. D. (2013). A Bioinformatics Method for Identifying RNA Structures within Human Cells.. This Presentation is brought to you for free and open access by the Physics at CORE Scholar. It has been accepted for inclusion in Physics Seminars by an authorized administrator of CORE Scholar. For more information, please contact corescholar@www.libraries.wright.edu.

A Bioinform atics Method for Identifying RNA Stru ctu res within Human Cells Stephen Huff Biological Informatics Group USAFRL - RHDJ WPAFB

Agenda Background Overview The Problem The Solution (?) Ideally, Resultant Knowledge (Bioinformatics = biology + computers)

Background - The BIG Picture The "Law s" of The "Laws" of Chem istry

Background - The Human Picture Imag es courtesy of MS Office ClipArt, per fair use

Background - The Dogma Images courtesy of MS Office ClipArt, per fair use.

Background - The Dogma Images courtesy of MS Office ClipArt, per fair use.

Background - Viral Infection Viral infection can be defined as the process of "hijacking" a host cell to change it into a VIRAL FACTORY (producing mainly viral genomes and proteins NOT host genomes and proteins).

Remember the Dogma... DNA -> RNA -> Protein (The Stuff" of Life) Background - Viral Infection A Cell -> Viral Attachment Nucleus Genomeg -^(dsdna)^ Viral Transcripti on & Translation Viral Payload Delivery Viral Budding (Release) Viral Assembly

Background - Viruses I II III IV V VI VI dsdna ssdna dsrna ssrna + + ssrna- ssdna + dsdjnia dsdn A ssrna D N A / R N dsdn A ssrna- Messenger RNA (Translates to Proteins)

Overview - Virus vs. YOU Influenza virus as example vector - ssrna-, replicates in nucleus - Genome ~ 14Kb (kilo-bases), segmented (8) - Proteome ~ 12 Human as example host - dsdna, multiple organs, tissues and cell types - Genome ~ 3Gb (giga-bases), segmented (22+1) - Proteome ~ 21,000 + 10

Overview - Curious Question SO... how can something so SMALL (virus) so completely overwhelm something so LARGE (host)??? Traditional view focuses on proteins. - 12 proteins overwhelm and undermine 21K+??? Could other phenomena affect the system? - Nucleotide phenomena. specifically, formation of secondary, tertiary (and 1 q u a rte n a ry?) stru c tu r e s

Overview - Nucleotide Pair Bonding Single-Stranded DNA/RNA T A C G A C G T C C G G T A A T A T T C G A C T ^ AGCTGA A T G C T G C A G G C C A T T A T A Single-Stranded DNA/RNA (-) Double-Stranded DNA/RNA (+/-) 12

Overview - Nucleotide Self-Bonding T. thermophila telomerase RNA The _"Pseudoknot" Multiple Hairpins" Single-stranded nucleotides free to bond... w ith ju st about A N Ything biological Images courtesy of WikiPedia, per fair use.

Overview - Interesting Possibilities Many examples of nucleotide-structure-based control have recently been discovered - Silencing RNA, micro RNA, nucleolar RNA, etc... - Exert surprising biological control (start/stop/alter protein synthesis and other functions) - Others probably exist (so called "junk" DNA) Potential for biological exploitation (therapies) is VAST but unknown 14

The Problem Given the human transcriptome/proteome (RNA -> protein), identify key control structures - Compared to what? - 21K+ candidates, each with 10K-10M + nucleotides - Permutations may be more numerous than stars in the known COSMOS (given mutational variants) Currently, this problem is intractable

The Solution (?) Viruses have been "picking biological locks" for eons - More than just proteins - nucleotide structures are probably involved, too - They have evolved to become VERY GOOD at this Use smaller viral genomes to identify keys oldviral&hostranscriptsthen F - Presumably they are RICH in such structures

The Solution - Method 1. Use evolutionary distance to choose viral candidates (proprietary software) 2. Fold viral candidates (Rosetta/ViennaRNA) 3. Fold host transcriptome (Same as 2) 4. Use GORS to identify conserved structures within virus (proprietary software) - These are likely to be vital to virus-host relation 5. Compare to human 6. Validate in wet-lab via high-throughput screens 17

The Solution Graphically Images courtesy of MS Office ClipArt, per fair use.

Solution - Fofanov-Distance Influenza Type-A as example parasite - Segmented genome, currently attenuating to human host (SOMEtimes) Some segments attenuated to human, others to avian/swine or other host(s) Temporal changes - Use of wrong segments at wrong times = noise F-Distance is novel tool for non-heuristic analysis of genomic distance, computationally intense due to exhaustive mutational analysis 19

F-Distances for Segment 5 (NP) Human Sero-Types Isolated from Human or Avian Hosts for the Type-A Influenza Virus (Human Background) Normalized Mean Genomic Distance Note: Error bars represent one standard error, +/- 20

Mean Normalized F-Distances for Segment 4 (HA) H1N1 for Human Hosts by Annual Cohorts (a) Figure I inferred isolated 2008. In 2011 d i m e CLi o H1N1 1909-2008 o H1N1 p-value : 2.6e-05 o H1N1 Adj. R-Sq.: 3.3e-01 o H1N1 Slope r GO 03 n CD CD : -8.3e-04 GO CN 03 GO CO 03 OO 03 XI 03 CD CD 03 GO r - 03 CD CD 03 CD 03 03 CD o CM lumans, q lienees 973 and a n t i g,, --------- Biological Sciences, Online May ' J '=-J 21

Segment 5 (NP) Fifty Year Timelines with Analytically Bifurcated Regressions for F-Distances of H3N2 Type A Influenza (Human Background) Mean Normalized Distance CD CD CD CD CD I D CD N - CD 0 5 O) O i CD CD CD 2008

Solution - RNA Folding ViennaRNA and Rosetta are best candidates - Open source software for Windows and Linux Accepts RNA sequences as input, uses lab-validated thermodynamics and chemistry Folds primary structure (sequence) into secondary and tertiary structures

m nf hnrlr nr ^ nvirf inn m nf hnrlr Solution - GORS Primarily, need to identify changing patterns of structures over time (within parasite) - Conservation indicates need (survival pressure) - This indicates something KEY to structures in host GORS is statistical method for such analysis May have to invent new statistical 24

Solution - Products F-Distance - R-extension (dll), overlaid by custom GUI (Windows, Mac, smart-devices) - Publication ViennaRNA/Rosetta - Custom computational pipeline GORS - R-extension (dll), GUI, pipeline, publication Final results - publication and software suite 25

Ideally - Resultant Knowledge Improved grasp of virus-host relationships - Influenza vaccine, new generation? Toxin mitigation (RHDJ focus) - Structures bond to metallic ions, small molecules, proteins, other nucleotides, etc... Exploit discovered structures for therapies Investigate biology from a new

Questions 27

Sequence Background Array IO r Binary Version Index Present A A A A A A A A A A A A A A A A 00000000000000000000000000000000 0 0 A A A A A A A A A A A A A A A T 00000000000000000000000000000001 1 0 A A A A A A A A A A A A A A A G 00000000000000000000000000000010 2 0 A A A A A A A A A A A A A A A C 00000000000000000000000000000011 3 0 A A A A A A A A A A A A A A T A 0000000000000000000000000000000100 4 1 A A A A A A A A A A A A A A T T 00000000000000000000000000000101 5 0 A A A A A A A A A A A A A A T G 00000000000000000000000000000110 6 0 A A A A A A A A A A A A A A T C 00000000000000000000000000000111 7 0 A A A A A A A A A A A A A A G A 000000000000000000000000000001000 8 1 A A A A A A A A A A A A A A G T 00000000000000000000000000001001 9 0 A A A A A A A A A A A A A A G G 00000000000000000000000000001010 10 0 A A A A A A A A A A A A A A G C 00(300000000000000000000000001011 11 0 T C C C C C C C C C C C C C C C 010011111111111111111111111111 i CO 0 G C C C C C C C C C C C C C C C A01111101111111111111111111111 4n - 2 0 C C C C C C C C C C C C C C C C 111111111111111111111111111111 4n - 1 0 1

Foreground Array IO T G A T C G C C A C G T A G C T G A A T G A T C G C C A C G T A T A C G T. (Foreground Sequence) 1 1 1 2 3 2 1 1 1 1 1 1 2 1 1 1 0. (Foreground Array) Background Array Index Present 0 0...... 410,766,922 0 410,766,923 1... 1,643,067,690 1 1,643,067,691 0 1,643,067,692 1,643,067,693 0...... 2,173,156,579 0 2,173,156,580 f \ v w 2,173,156,581 0...... 4n - 1 0 2,173,156,588 10100001111001111011110011100101 G A A T G A T C G C C A C G T A 29