ITS accuracy at GenBank. Conrad Schoch Barbara Robbertse

Size: px
Start display at page:

Download "ITS accuracy at GenBank. Conrad Schoch Barbara Robbertse"

Transcription

1 ITS accuracy at GenBank Conrad Schoch Barbara Robbertse

2 Improving accuracy Barcode tag in GenBank Barcode submission tool Standards RefSeq Targeted Loci Well validated sequences already in GenBank Bacteria all type sequences Limited fungal sequences

3 Formal selection of the fungal DNA barcode Schoch et al

4

5 ITS sequence standards 1. Standardized sequence title should be "Fungal ITS barcode". 2. Annotation 3. Length 4. Quality of sequence 5. Unique or not? 6. Meta data

6

7 Difference Between GenBank and RefSeq Targeted Loci GenBank Not curated Author submits Only author can revise Multiple records for same loci common Records can contradict each other No limit to species included Data exchanged among INSDC members Akin to primary literature Proteins identified and linked Access via NCBI Nucleotide databases RefSeq Curated NCBI creates from existing data NCBI revises as new data emerge Single records for each molecule of major organisms Limited to model organisms Exclusive NCBI database Akin to review articles Proteins and transcripts identified and linked Access via Nucleotide & Protein databases

8

9 150 Yeast sequences 1. Same ITS accession associated with different species/strains in the list. 2. Taxon name associated with accession in provide list not found in NCBI Strain name indicated on GenBank accession not found at culture collection database (wrong strain name in GenBank?) 3. Incomplete accession identifier in the list. 4. A few accessions in the list does not exist in Genbank.

10 Checklist for ITS Accessions added to the target loci RefSeq project: ) Source from a type specimen. 2) Primary GenBank name and Current name at CBS is the same. 3) Strand in the correct orientation. 4) Type info added from CBS to /note. 5) Added feature /culture_collection 6) Added feature /identified_by (source CBS) 7) Moved information in note to /isolation_source 7) All 26S labled 28S 8) Reannotated (used 5.8S Rfam borders; used 3 18S boundaries (CATTA motif) and 5 28S border (GACCT motif) as guide in an alignment). 9) Added PMID if available. 10) Checked hits with moleblast. 11) Example defline (note it has no strain info): DEFINITION Trichosporon veenhuisii 18S ribosomal RNA gene, partial sequence; internal transcribed spacer 1, 5.8S ribosomal RNA gene, and internal transcribed spacer 2, complete sequence; and 28S ribosomal RNA gene, partial sequence. 12) Features used: /rrna for 18S and 28S and /misc_rna for ITS1 and ITS2 13) Standardized names used in product qualifier: 18S ribosomal RNA, 5.8S ribosomal RNA, 28S ribosomal RNA, internal transcribed spacer 1, internal transcribed spacer 2

11 Annotation of 150 ITS records in GenBank #records Features Annotation in note or product 127 /misc_rna contains 18S ribosomal RNA, internal transcribed spacer 1, 5.8S ribosomal RNA, internal transcribed spacer 2, and 28S ribosomal RNA (or 26S ribosomal RNA) 4 /misc_rna or /misc_feature 8 /misc_rna and /rrna 2 /misc_feature and /gene and /rrna 6 /rrna and /gene and /misc_feature 3 /rrna and /misc_rna contains internal transcribed spacer 1, 5.8S ribosomal RNA and internal transcribed spacer 2 internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2 internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2 18S ribosomal RNA internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2 26S ribosomal RNA 18S ribosomal RNA internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2 28S ribosomal RNA (or 26S ribosomal RNA)

12 Annotation of 150 ITS records in RefSeq #records Features Annotation in product 138 /rrna and /misc_rna 18S ribosomal RNA internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2 28S ribosomal RNA 2 /rrna and /misc_rna 18S ribosomal RNA internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2 10 /misc_rna and /rrna internal transcribed spacer 1 5.8S ribosomal RNA internal transcribed spacer 2

13 RefSeq Accession number Project number RefSeq references Expanded qualifiers

14 Algorithm: 1. For each query run BLAST search against nr and collect top five hits. 2. Cluster query sequences into groups corresponding to different loci 3. For each locus: Compute multiple alignment for queries and their top five BLAST hits Compute phylogenetic tree based on the multiple alignment MOLE-BLAST

15 Adding microbial type strain data to the taxonomy database Upload all types together with names in NCBI Taxonomy Cross reference this as a property in Entrez Enable search restricted to ex-type sequences Start with Euzeby list

16 What next? Expand other markers for the known universe Secondary barcode-type markers list and communicate resources Highlight problematic ITS taxa Provide barcodes for all genomes Ensure genome samples are correctly identified Integrate sequences with fungal names

17 BaG (Barcode all genera) of Fungi, proposed goals Sequence for more than 3000 genera in GenBank Compare GenBank and MycoBank taxonomies Highlight types in GenBank taxonomy Target lists for all fungal genera focused on type species Genera (5000 with full meta-data in MycoBank)

18 One name one fungus = opportunity

19 Acknowledgments ITS Meta data Centraalbureau voor Schimmelcultures (CBS) MOLE-BLAST Grzegorz Boratyn Tom Madden Taxonomy type updates Scott Federhen

OVERVIEW OF CURRENT IDENTIFICATION SYSTEMS AND DATABASES

OVERVIEW OF CURRENT IDENTIFICATION SYSTEMS AND DATABASES OVERVIEW OF CURRENT IDENTIFICATION SYSTEMS AND DATABASES EVERY STEP OF THE WAY 1 EVERY STEP OF THE WAY MICROBIAL IDENTIFICATION METHODS DNA RNA Genotypic Sequencing of ribosomal RNA regions of bacteria

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Annotation of Drosophila mojavensis fosmid 8 Priya Srikanth Bio 434W

Annotation of Drosophila mojavensis fosmid 8 Priya Srikanth Bio 434W Annotation of Drosophila mojavensis fosmid 8 Priya Srikanth Bio 434W 5.1.2007 Overview High-quality finished sequence is much more useful for research once it is annotated. Annotation is a fundamental

More information

Bioinformatics Laboratory Exercise

Bioinformatics Laboratory Exercise Bioinformatics Laboratory Exercise Biology is in the midst of the genomics revolution, the application of robotic technology to generate huge amounts of molecular biology data. Genomics has led to an explosion

More information

Student Handout Bioinformatics

Student Handout Bioinformatics Student Handout Bioinformatics Introduction HIV-1 mutates very rapidly. Because of its high mutation rate, the virus will continue to change (evolve) after a person is infected. Thus, within an infected

More information

Appendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations

Appendix 81. From OPENFLU to OPENFMD. Open Session of the EuFMD: 2012, Jerez de la Frontera, Spain 1. Conclusions and recommendations From OPENFLU to OPENFMD a resource for automatic and curated nomenclature and tools for the FMD community Filip Claes, Philippe Le Mercier, Dmitry Kuznetsov, Robin Liechti, Anne Gleizes, Ioannis Xenarios,

More information

Annotation of Chimp Chunk 2-10 Jerome M Molleston 5/4/2009

Annotation of Chimp Chunk 2-10 Jerome M Molleston 5/4/2009 Annotation of Chimp Chunk 2-10 Jerome M Molleston 5/4/2009 1 Abstract A stretch of chimpanzee DNA was annotated using tools including BLAST, BLAT, and Genscan. Analysis of Genscan predicted genes revealed

More information

AutoOrthoGen. Multiple Genome Alignment and Comparison

AutoOrthoGen. Multiple Genome Alignment and Comparison AutoOrthoGen Multiple Genome Alignment and Comparison Orion Buske, Yogesh Saletore, Kris Weber CSE 428 Spring 2009 Martin Tompa Computer Science and Engineering University of Washington, Seattle, WA Abstract

More information

Evidence of a Pathway of Reduction in Bacteria: Reduced Quantities of Restriction Sites Impact trna Activity in a Trial Set

Evidence of a Pathway of Reduction in Bacteria: Reduced Quantities of Restriction Sites Impact trna Activity in a Trial Set Evidence of a of Reduction in Bacteria: Reduced Quantities of Restriction Sites Impact trna Activity in a Trial Set Oliver Bonham-Carter, Lotfollah Najjar, Dhundy Bastola School of Interdisciplinary Informatics

More information

NCBI will no longer make taxonomy identifiers for individual influenza strains on January 15, 2018

NCBI will no longer make taxonomy identifiers for individual influenza strains on January 15, 2018 Page 1 of 10 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 NCBI will no longer make taxonomy identifiers for individual influenza

More information

The Blueprint of Life: DNA to Protein. What is genetics? DNA Structure 4/27/2011. Chapter 7

The Blueprint of Life: DNA to Protein. What is genetics? DNA Structure 4/27/2011. Chapter 7 The Blueprint of Life: NA to Protein Chapter 7 What is genetics? The science of heredity; includes the study of genes, how they carry information, how they are replicated, how they are expressed NA Structure

More information

The Blueprint of Life: DNA to Protein

The Blueprint of Life: DNA to Protein The Blueprint of Life: NA to Protein Chapter 7 What is genetics? The science of heredity; includes the y; study of genes, how they carry information, how they are replicated, how they are expressed 1 NA

More information

SMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination

SMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination Final Examination You have a choice between A, B, or C. Please email your solutions, as a pdf attachment, by May 13, 2015. In the subject of the email, please use the following format: firstname_lastname_x

More information

DNA codes for RNA, which guides protein synthesis.

DNA codes for RNA, which guides protein synthesis. Section 3: DNA codes for RNA, which guides protein synthesis. K What I Know W What I Want to Find Out L What I Learned Vocabulary Review synthesis New RNA messenger RNA ribosomal RNA transfer RNA transcription

More information

Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016

Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016 Rotavirus Genotyping and Enhanced Annotation in the Virus Pathogen Resource (ViPR) Yun Zhang J. Craig Venter Institute ASV 2016 June 19, 2016 Loading Virus Pathogen Database and Analysis About Resource

More information

PAirwise Sequence Comparison (PASC) and Its Application in the Classification of Filoviruses

PAirwise Sequence Comparison (PASC) and Its Application in the Classification of Filoviruses Viruses 2012, 4, 1318-1327; doi:10.3390/v4081318 Article OPEN ACCESS viruses ISSN 1999-4915 www.mdpi.com/journal/viruses PAirwise Sequence Comparison (PASC) and Its Application in the Classification of

More information

PROTOCOL FOR INFLUENZA A VIRUS GLOBAL SWINE H1 CLADE CLASSIFICATION

PROTOCOL FOR INFLUENZA A VIRUS GLOBAL SWINE H1 CLADE CLASSIFICATION PROTOCOL FOR INFLUENZA A VIRUS GLOBAL SWINE H1 CLADE CLASSIFICATION January 23, 2017 1. Background Swine H1 viruses have diversified into three major genetic lineages over time. Recently, Anderson et al.

More information

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences.

a. From the grey navigation bar, mouse over Analyze & Visualize and click Annotate Nucleotide Sequences. Section D. Custom sequence annotation After this exercise you should be able to use the annotation pipelines provided by the Influenza Research Database (IRD) and Virus Pathogen Resource (ViPR) to annotate

More information

Phylogenomics. Antonis Rokas Department of Biological Sciences Vanderbilt University.

Phylogenomics. Antonis Rokas Department of Biological Sciences Vanderbilt University. Phylogenomics Antonis Rokas Department of Biological Sciences Vanderbilt University http://as.vanderbilt.edu/rokaslab High-Throughput DNA Sequencing Technologies 454 / Roche 450 bp 1.5 Gbp / day Illumina

More information

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise!

Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! Name: Due on Wensday, December 7th Bioinformatics Take Home Exam #9 Pick one most correct answer, unless stated otherwise! 1. What process brought 2 divergent chlorophylls into the ancestor of the cyanobacteria,

More information

Following virus recombination and evolution

Following virus recombination and evolution Following virus recombination and evolution Coping with the avalanche of sequencing data Dmitry Kusnetsov, Anne Gleizes, Robin Liechti, Ioannis Xenarios, Philippe Le Mercier Vital-IT/Swiss-Prot group SIB

More information

Module 3. Genomic data and annotations in public databases Exercises Custom sequence annotation

Module 3. Genomic data and annotations in public databases Exercises Custom sequence annotation Module 3. Genomic data and annotations in public databases Exercises Custom sequence annotation Objectives Upon completion of this exercise, you will be able to use the annotation pipelines provided by

More information

a) SSR with core motif > 2 and repeats number >3. b) MNR with repeats number>5.

a) SSR with core motif > 2 and repeats number >3. b) MNR with repeats number>5. 1 2 APPENDIX Legends to figures 3 4 5 Figure A1: Distribution of perfect SSR along chromosome 1 of V. cholerae (El-Tor N191). a) SSR with core motif > 2 and repeats number >3. b) MNR with repeats number>5.

More information

Molecular phylogeny of Australian isolates of Sporothrix schenckii sensu lato. David New Microbiology Registrar, PathWest

Molecular phylogeny of Australian isolates of Sporothrix schenckii sensu lato. David New Microbiology Registrar, PathWest Molecular phylogeny of Australian isolates of Sporothrix schenckii sensu lato David New Microbiology Registrar, PathWest Background Sporothrix schenckii causes a cutaneous mycosis Sporotrichosis (aka rose

More information

VirusDetect pipeline - virus detection with small RNA sequencing

VirusDetect pipeline - virus detection with small RNA sequencing VirusDetect pipeline - virus detection with small RNA sequencing CSC webinar 16.1.2018 Eija Korpelainen, Kimmo Mattila, Maria Lehtivaara Big thanks to Jan Kreuze and Jari Valkonen! Outline Small interfering

More information

Long non-coding RNAs

Long non-coding RNAs Long non-coding RNAs Dominic Rose Bioinformatics Group, University of Freiburg Bled, Feb. 2011 Outline De novo prediction of long non-coding RNAs (lncrnas) Genome-wide RNA gene-finding Intrinsic properties

More information

PROTEIN SYNTHESIS. It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms.

PROTEIN SYNTHESIS. It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms. PROTEIN SYNTHESIS It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms.» GENES = a sequence of nucleotides in DNA that performs

More information

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets 02-716 Cross species analysis of genomics data Computational Prediction of mirnas and their targets Outline Introduction Brief history mirna Biogenesis Why Computational Methods? Computational Methods

More information

High-throughput transcriptome sequencing

High-throughput transcriptome sequencing High-throughput transcriptome sequencing Erik Kristiansson (erik.kristiansson@zool.gu.se) Department of Zoology Department of Neuroscience and Physiology University of Gothenburg, Sweden Outline Genome

More information

SEQUENCE FEATURE VARIANT TYPES

SEQUENCE FEATURE VARIANT TYPES SEQUENCE FEATURE VARIANT TYPES DEFINITION OF SFVT: The Sequence Feature Variant Type (SFVT) component in IRD (http://www.fludb.org) is a relatively novel approach that delineates specific regions, called

More information

Bioinformation by Biomedical Informatics Publishing Group

Bioinformation by Biomedical Informatics Publishing Group Predicted RNA secondary structures for the conserved regions in dengue virus Pallavi Somvanshi*, Prahlad Kishore Seth Bioinformatics Centre, Biotech Park, Sector G, Jankipuram, Lucknow 226021, Uttar Pradesh,

More information

GENOME-WIDE COMPUTATIONAL ANALYSIS OF SMALL NUCLEAR RNA GENES OF ORYZA SATIVA (INDICA AND JAPONICA)

GENOME-WIDE COMPUTATIONAL ANALYSIS OF SMALL NUCLEAR RNA GENES OF ORYZA SATIVA (INDICA AND JAPONICA) GENOME-WIDE COMPUTATIONAL ANALYSIS OF SMALL NUCLEAR RNA GENES OF ORYZA SATIVA (INDICA AND JAPONICA) M.SHASHIKANTH, A.SNEHALATHARANI, SK. MUBARAK AND K.ULAGANATHAN Center for Plant Molecular Biology, Osmania

More information

Molecular Identification of Lipase Producing Bacteria based on 16S rdna Sequencing

Molecular Identification of Lipase Producing Bacteria based on 16S rdna Sequencing International Journal of Current Microbiology and Applied Sciences ISSN: 2319-7706 Volume 6 Number 5 (2017) pp. 2067-2071 Journal homepage: http://www.ijcmas.com Original Research Article https://doi.org/10.20546/ijcmas.2017.605.230

More information

RNA and Protein Synthesis Guided Notes

RNA and Protein Synthesis Guided Notes RNA and Protein Synthesis Guided Notes is responsible for controlling the production of in the cell, which is essential to life! o DNARNAProteins contain several thousand, each with directions to make

More information

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods International Journal of Pharmaceutical Science Invention ISSN (Online): 2319 6718, ISSN (Print): 2319 670X Volume 2 Issue 5 May 2013 PP.70-74 Identification of mirnas in Eucalyptus globulus Plant by Computational

More information

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq) RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome

More information

CHAPTER 6 METABOLIC PATHWAY RECONSTRUCTION

CHAPTER 6 METABOLIC PATHWAY RECONSTRUCTION 66 CHAPTER 6 METABOLIC PATHWAY RECONSTRUCTION 6.1 GENERAL A metabolic pathway is a series of biochemical reactions that occur within a cell and these chemical reactions are catalyzed by certain enzymes.

More information

FINAL ANNOTATION REPORT: Drosophila virilis Fosmid 11 (48P14) Robert Carrasquillo Bio 4342

FINAL ANNOTATION REPORT: Drosophila virilis Fosmid 11 (48P14) Robert Carrasquillo Bio 4342 FINAL ANNOTATION REPORT: Drosophila virilis Fosmid 11 (48P14) Robert Carrasquillo Bio 4342 2006 TABLE OF CONTENTS I. Overview... 3 II. Genes... 4 III. Clustal Analysis... 15 IV. Repeat Analysis... 17 V.

More information

Association for Molecular Pathology Promoting Clinical Practice, Basic Research, and Education in Molecular Pathology

Association for Molecular Pathology Promoting Clinical Practice, Basic Research, and Education in Molecular Pathology Association for Molecular Pathology Promoting Clinical Practice, Basic Research, and Education in Molecular Pathology 9650 Rockville Pike, Bethesda, Maryland 20814 Tel: 301-634-7939 Fax: 301-634-7990 Email:

More information

HBV. Next Generation Sequencing, data analysis and reporting. Presenter Leen-Jan van Doorn

HBV. Next Generation Sequencing, data analysis and reporting. Presenter Leen-Jan van Doorn HBV Next Generation Sequencing, data analysis and reporting Presenter Leen-Jan van Doorn HBV Forum 3 October 24 th, 2017 Marriott Marquis, Washington DC www.forumresearch.org HBV Biomarkers HBV biomarkers:

More information

First Report of Penicillium adametzioides from Decayed Grapes (Vitis vinifera) in Pakistan

First Report of Penicillium adametzioides from Decayed Grapes (Vitis vinifera) in Pakistan International Journal of Current Microbiology and Applied Sciences ISSN: 2319-7706 Volume 5 Number 12 (2016) pp. 316-320 Journal homepage: http://www.ijcmas.com Original Research Article http://dx.doi.org/10.20546/ijcmas.2016.512.034

More information

Data mining with Ensembl Biomart. Stéphanie Le Gras

Data mining with Ensembl Biomart. Stéphanie Le Gras Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:

More information

High Throughput Sequence (HTS) data analysis. Lei Zhou

High Throughput Sequence (HTS) data analysis. Lei Zhou High Throughput Sequence (HTS) data analysis Lei Zhou (leizhou@ufl.edu) High Throughput Sequence (HTS) data analysis 1. Representation of HTS data. 2. Visualization of HTS data. 3. Discovering genomic

More information

The BLAST search on NCBI ( and GISAID

The BLAST search on NCBI (    and GISAID Supplemental materials and methods The BLAST search on NCBI (http:// www.ncbi.nlm.nih.gov) and GISAID (http://www.platform.gisaid.org) showed that hemagglutinin (HA) gene of North American H5N1, H5N2 and

More information

The RNA Virus Database

The RNA Virus Database Nucleic Acs Research Advance Access published October 23, 2008 The RNA Virus Database Nucleic Acs Research, 2008, 1 5 doi:10.1093/nar/gkn729 Robert Belshaw 1, *, Tulio de Oliveira 1,2, Sney Markowitz 3

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

Prokaryotes and eukaryotes alter gene expression in response to their changing environment

Prokaryotes and eukaryotes alter gene expression in response to their changing environment Chapter 18 Prokaryotes and eukaryotes alter gene expression in response to their changing environment In multicellular eukaryotes, gene expression regulates development and is responsible for differences

More information

Quality control of Saccharomyces yeasts: differentiation of species level and strain grouping using COX 2 gene analysis and MALDI-TOF-MS analysis

Quality control of Saccharomyces yeasts: differentiation of species level and strain grouping using COX 2 gene analysis and MALDI-TOF-MS analysis Quality control of Saccharomyces yeasts: differentiation of species level and strain grouping using COX 2 gene analysis and MALDI-TOF-MS analysis Hiroko Kawasaki 1, Shinya Ninomiya 1, Kanae Teramoto 2,

More information

David M. Underhill, Ph.D.

David M. Underhill, Ph.D. The 3 rd Microbiome R&D and Business Collaboration Forum: USA San Diego, CA Sept. 2015 David M. Underhill, Ph.D. F. Widjaja Foundation Inflammatory Bowel & Immunobiology Research Institute Division of

More information

Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma

Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma www.bioinformation.net Volume 13(6) Hypothesis Deciphering the Role of micrornas in BRD4-NUT Fusion Gene Induced NUT Midline Carcinoma Ekta Pathak 1, Bhavya 1, Divya Mishra 1, Neelam Atri 1, 2, Rajeev

More information

Bioinformatics. Sequence Analysis: Part III. Pattern Searching and Gene Finding. Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute

Bioinformatics. Sequence Analysis: Part III. Pattern Searching and Gene Finding. Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute Bioinformatics Sequence Analysis: Part III. Pattern Searching and Gene Finding Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute Course Syllabus Jan 7 Jan 14 Jan 21 Jan 28 Feb 4 Feb 11 Feb 18

More information

Sections 12.3, 13.1, 13.2

Sections 12.3, 13.1, 13.2 Sections 12.3, 13.1, 13.2 Now that the DNA has been copied, it needs to send its genetic message to the ribosomes so proteins can be made Transcription: synthesis (making of) an RNA molecule from a DNA

More information

For all of the following, you will have to use this website to determine the answers:

For all of the following, you will have to use this website to determine the answers: For all of the following, you will have to use this website to determine the answers: http://blast.ncbi.nlm.nih.gov/blast.cgi We are going to be using the programs under this heading: Answer the following

More information

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide.

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide. 1 of 17 12/10/2003 11:33 AM Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide. When and Where---Wednesdays 1-2pm, Room 438 Library Admin Building

More information

High AU content: a signature of upregulated mirna in cardiac diseases

High AU content: a signature of upregulated mirna in cardiac diseases https://helda.helsinki.fi High AU content: a signature of upregulated mirna in cardiac diseases Gupta, Richa 2010-09-20 Gupta, R, Soni, N, Patnaik, P, Sood, I, Singh, R, Rawal, K & Rani, V 2010, ' High

More information

Basic Local Alignment Search Tool

Basic Local Alignment Search Tool NCBI Blast:MARVRSTARVEREGDEAEGAETVPISEAMQRSGLVTSERIPTAETDA... Page 1 of 24 BLAST Basic Local Alignment Search Tool Edit and Resubmit Save Search Strategies Formatting options Download MARVRSTARVEREGDEAEGAETVPISEAMQRSGLVTSERIPTAETDAETEQAAAEAEQGD

More information

Supplementary Figure 1. SC35M polymerase activity in the presence of Bat or SC35M NP encoded from the phw2000 rescue plasmid.

Supplementary Figure 1. SC35M polymerase activity in the presence of Bat or SC35M NP encoded from the phw2000 rescue plasmid. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 Supplementary Figure 1. SC35M polymerase activity in the presence of Bat or SC35M NP encoded from the phw2000 rescue plasmid. HEK293T

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

Exemplar for Internal Assessment Resource Mathematics and Statistics Level 1 Resource title: Carbon Credits

Exemplar for Internal Assessment Resource Mathematics and Statistics Level 1 Resource title: Carbon Credits Exemplar for internal assessment resource Mathematics and Statistics 1.1B for Achievement Standard 91026! Exemplar for Internal Assessment Resource Mathematics and Statistics Level 1 Resource title: Carbon

More information

Working with gene features and genomes

Working with gene features and genomes Working with gene features and genomes Typical workflow when working with sequence data (e.g., genomes) 1. Download sequence data from NCBI 2. Save on local harddrive 3. Do analysis on locally stored sequence

More information

Transcriptome Analysis

Transcriptome Analysis Transcriptome Analysis Data Preprocessing Sample Preparation Illumina Sequencing Demultiplexing Raw FastQ Reference Genome (fasta) Reference Annotation (GTF) Reference Genome Analysis Tophat Accepted hits

More information

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and Worldwide.

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and Worldwide. Page 1 of 32 Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and Worldwide. When and Where---Wednesdays 1-2pm Room 438 Library Admin Building Beginning September

More information

Influenza Virus HA Subtype Numbering Conversion Tool and the Identification of Candidate Cross-Reactive Immune Epitopes

Influenza Virus HA Subtype Numbering Conversion Tool and the Identification of Candidate Cross-Reactive Immune Epitopes Influenza Virus HA Subtype Numbering Conversion Tool and the Identification of Candidate Cross-Reactive Immune Epitopes Brian J. Reardon, Ph.D. J. Craig Venter Institute breardon@jcvi.org Introduction:

More information

Sebastian Jaenicke. trnascan-se. Improved detection of trna genes in genomic sequences

Sebastian Jaenicke. trnascan-se. Improved detection of trna genes in genomic sequences Sebastian Jaenicke trnascan-se Improved detection of trna genes in genomic sequences trnascan-se Improved detection of trna genes in genomic sequences 1/15 Overview 1. trnas 2. Existing approaches 3. trnascan-se

More information

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins.

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins. Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins. The RNA transcribed from a complex transcription unit

More information

IDENTIFICATION OF IN SILICO MIRNAS IN FOUR PLANT SPECIES FROM FABACEAE FAMILY

IDENTIFICATION OF IN SILICO MIRNAS IN FOUR PLANT SPECIES FROM FABACEAE FAMILY Original scientific paper 10.7251/AGRENG1803122A UDC633:34+582.736.3]:577.2 IDENTIFICATION OF IN SILICO MIRNAS IN FOUR PLANT SPECIES FROM FABACEAE FAMILY Bihter AVSAR 1*, Danial ESMAEILI ALIABADI 2 1 Sabanci

More information

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend

Table of content. -Supplementary methods. -Figure S1. -Figure S2. -Figure S3. -Table legend Table of content -Supplementary methods -Figure S1 -Figure S2 -Figure S3 -Table legend Supplementary methods Yeast two-hybrid bait basal transactivation test Because bait constructs sometimes self-transactivate

More information

RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang

RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang RNA Secondary Structures: A Case Study on Viruses Bioinformatics Senior Project John Acampado Under the guidance of Dr. Jason Wang Table of Contents Overview RSpredict JAVA RSpredict WebServer RNAstructure

More information

MODULE 3: TRANSCRIPTION PART II

MODULE 3: TRANSCRIPTION PART II MODULE 3: TRANSCRIPTION PART II Lesson Plan: Title S. CATHERINE SILVER KEY, CHIYEDZA SMALL Transcription Part II: What happens to the initial (premrna) transcript made by RNA pol II? Objectives Explain

More information

Studying Alternative Splicing

Studying Alternative Splicing Studying Alternative Splicing Meelis Kull PhD student in the University of Tartu supervisor: Jaak Vilo CS Theory Days Rõuge 27 Overview Alternative splicing Its biological function Studying splicing Technology

More information

Comparing Amino Acid Sequences Abstract

Comparing Amino Acid Sequences Abstract Teacher Guide Comparing Amino Acid Sequences Abstract In this hands-on activity, students work in small groups to compare amino acid sequences for a particular protein from a to the same protein from 5

More information

The Open Bioinformatics Journal, 2014, 8, 1-5 1

The Open Bioinformatics Journal, 2014, 8, 1-5 1 Send Orders for Reprints to reprints@benthamscience.net The Open Bioinformatics Journal, 214, 8, 1-5 1 Open Access Performances of Bioinformatics Pipelines for the Identification of Pathogens in Clinical

More information

Pre-mRNA Secondary Structure Prediction Aids Splice Site Recognition

Pre-mRNA Secondary Structure Prediction Aids Splice Site Recognition Pre-mRNA Secondary Structure Prediction Aids Splice Site Recognition Donald J. Patterson, Ken Yasuhara, Walter L. Ruzzo January 3-7, 2002 Pacific Symposium on Biocomputing University of Washington Computational

More information

Molecular Cell Biology - Problem Drill 10: Gene Expression in Eukaryotes

Molecular Cell Biology - Problem Drill 10: Gene Expression in Eukaryotes Molecular Cell Biology - Problem Drill 10: Gene Expression in Eukaryotes Question No. 1 of 10 1. Which of the following statements about gene expression control in eukaryotes is correct? Question #1 (A)

More information

Principles of phylogenetic analysis

Principles of phylogenetic analysis Principles of phylogenetic analysis Arne Holst-Jensen, NVI, Norway. Fusarium course, Ås, Norway, June 22 nd 2008 Distance based methods Compare C OTUs and characters X A + D = Pairwise: A and B; X characters

More information

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Supplemental Figure 1. Small RNA size distribution from different soybean tissues. Supplemental Figure 1. Small RNA size distribution from different soybean tissues. The size of small RNAs was plotted versus frequency (percentage) among total sequences (A, C, E and G) or distinct sequences

More information

Hepatitis A Outbreaks In Australia Molecular Epidemiology

Hepatitis A Outbreaks In Australia Molecular Epidemiology Hepatitis A Outbreaks In Australia Molecular Epidemiology Scott Bowden, Lilly Tracy, Sara Bonanzinga VIDRL, The Doherty Institute, VIC Joy Gregory, Marion Easton Dept of Health & Human Services, Melbourne,

More information

ELF3and. FOXA2Loci, Probable Putative Trans-QTL for Abnormal Sperm Percentage in Cattle: An In- silicoanalysis

ELF3and. FOXA2Loci, Probable Putative Trans-QTL for Abnormal Sperm Percentage in Cattle: An In- silicoanalysis ELF3and FOXA2Loci, Probable Putative Trans-QTL for Abnormal Sperm Percentage in Cattle: An In- silicoanalysis Suneel Kumar Onteru, Varij Nayan & Dheer Singh Animal Biochemistry Division National Dairy

More information

DEVELOPING BIOINFORMATICS TOOLS FOR THE STUDY OF ALTERNATIVE SPLICING IN EUKARYOTIC GENES LIM YUN PING

DEVELOPING BIOINFORMATICS TOOLS FOR THE STUDY OF ALTERNATIVE SPLICING IN EUKARYOTIC GENES LIM YUN PING DEVELOPING BIOINFORMATICS TOOLS FOR THE STUDY OF ALTERNATIVE SPLICING IN EUKARYOTIC GENES LIM YUN PING SCHOOL OF MECHANICAL & PRODUCTION ENGINEERING 2005 TOOLS FOR STUDY OF ALTERNATIVE SPLICING LIM YUN

More information

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012

Bjoern Peters La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 www.iedb.org Bjoern Peters bpeters@liai.org La Jolla Institute for Allergy and Immunology Buenos Aires, Oct 31, 2012 Overview 1. Introduction to the IEDB 2. Application: 2009 Swine-origin influenza virus

More information

Protein Synthesis and Mutation Review

Protein Synthesis and Mutation Review Protein Synthesis and Mutation Review 1. Using the diagram of RNA below, identify at least three things different from a DNA molecule. Additionally, circle a nucleotide. 1) RNA is single stranded; DNA

More information

AP Biology Reading Guide. Concept 19.1 A virus consists of a nucleic acid surrounded by a protein coat

AP Biology Reading Guide. Concept 19.1 A virus consists of a nucleic acid surrounded by a protein coat AP Biology Reading Guide Name Chapter 19: Viruses Overview Experimental work with viruses has provided important evidence that genes are made of nucleic acids. Viruses were also important in working out

More information

Bioinformatic analyses: methodology for allergen similarity search. Zoltán Divéki, Ana Gomes EFSA GMO Unit

Bioinformatic analyses: methodology for allergen similarity search. Zoltán Divéki, Ana Gomes EFSA GMO Unit Bioinformatic analyses: methodology for allergen similarity search Zoltán Divéki, Ana Gomes EFSA GMO Unit EFSA info session on applications - GMO Parma, Italy 28 October 2014 BIOINFORMATIC ANALYSES Analysis

More information

Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses. Objective

Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses. Objective Section B. Comparative Genomics Analysis of Influenza H5N2 Viruses Objective Upon completion of this exercise, you will be able to use the Influenza Research Database (IRD; http://www.fludb.org/) to: Search

More information

Genetics. Instructor: Dr. Jihad Abdallah Transcription of DNA

Genetics. Instructor: Dr. Jihad Abdallah Transcription of DNA Genetics Instructor: Dr. Jihad Abdallah Transcription of DNA 1 3.4 A 2 Expression of Genetic information DNA Double stranded In the nucleus Transcription mrna Single stranded Translation In the cytoplasm

More information

Supplemental Information For: The genetics of splicing in neuroblastoma

Supplemental Information For: The genetics of splicing in neuroblastoma Supplemental Information For: The genetics of splicing in neuroblastoma Justin Chen, Christopher S. Hackett, Shile Zhang, Young K. Song, Robert J.A. Bell, Annette M. Molinaro, David A. Quigley, Allan Balmain,

More information

Overview: Chapter 19 Viruses: A Borrowed Life

Overview: Chapter 19 Viruses: A Borrowed Life Overview: Chapter 19 Viruses: A Borrowed Life Viruses called bacteriophages can infect and set in motion a genetic takeover of bacteria, such as Escherichia coli Viruses lead a kind of borrowed life between

More information

Exploring HIV Evolution: An Opportunity for Research Sam Donovan and Anton E. Weisstein

Exploring HIV Evolution: An Opportunity for Research Sam Donovan and Anton E. Weisstein Microbes Count! 137 Video IV: Reading the Code of Life Human Immunodeficiency Virus (HIV), like other retroviruses, has a much higher mutation rate than is typically found in organisms that do not go through

More information

Gene Expression. From a gene to a protein

Gene Expression. From a gene to a protein Gene Expression From a gene to a protein Central Dogma (Crick 1958) Determines the genetic flow of information Central Dogma First step in decoding a genetic message from DNA is to copy (transcribe) it

More information

Non-messenger RNAs. Karin Lagesen

Non-messenger RNAs. Karin Lagesen Non-messenger RNAs Karin Lagesen Outline RNA is not DNA Kinds of non-messenger RNAs sirna, mirna trnas, rrnas ncrnas: Finding non-messenger RNA trnascan-se, RNammer RFAM RNA basepairing RNA is not DNA

More information

Part III: Basic Immunology

Part III: Basic Immunology Part III: Basic Immunology Introduction: This is an introductory unit on immunology. Important topics addressed include bacterial pathogens, vaccines, antibiotics, and cells of the immune system. After

More information

Fondation Merieux J Craig Venter Institute Bioinformatics Workshop. December 5 8, 2017

Fondation Merieux J Craig Venter Institute Bioinformatics Workshop. December 5 8, 2017 Fondation Merieux J Craig Venter Institute Bioinformatics Workshop December 5 8, 2017 Module 5: Comparative Genomics Analysis Outline Definition of comparative genomics Applications in studying human pathogens

More information

In Memoriam July 2006 Elisa Santry, 16, Boston, MA Dave Bushow, 29, River Vale, NJ /27/2006 1of 33

In Memoriam July 2006 Elisa Santry, 16, Boston, MA Dave Bushow, 29, River Vale, NJ /27/2006 1of 33 In Memoriam July 2006 Elisa Santry, 16, Boston, MA Dave Bushow, 29, River Vale, NJ www.sltrib.com 7-20-06 7/27/2006 1of 33 A Genetic Element Common to Tumor Viruses and Human Angelo Kontgas BBSI Intern

More information

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo

Micro-RNA web tools. Introduction. UBio Training Courses. mirnas, target prediction, biology. Gonzalo Micro-RNA web tools UBio Training Courses Gonzalo Gómez//ggomez@cnio.es Introduction mirnas, target prediction, biology Experimental data Network Filtering Pathway interpretation mirs-pathways network

More information

Characterizing the Respiratory Microbiome of Commercial Broilers on the Delmarva Peninsula

Characterizing the Respiratory Microbiome of Commercial Broilers on the Delmarva Peninsula Characterizing the Respiratory Microbiome of Commercial Broilers on the Delmarva Peninsula Kelly Mulholland 1,2, Calvin Keeler, Jr. 2, Sharon J. Keeler 2 and Monique Robinson 2 1 Center for Bioinformatics

More information

RESEARCH PROJECT. Comparison of searching tools and outcomes of different providers of the Medline database (OVID and PubMed).

RESEARCH PROJECT. Comparison of searching tools and outcomes of different providers of the Medline database (OVID and PubMed). RESEARCH PROJECT Comparison of searching tools and outcomes of different providers of the Medline database (OVID and PubMed). Karolina Kucerova Charles University in Prague Evidence-Based Medicine Course

More information

Protein Synthesis

Protein Synthesis Protein Synthesis 10.6-10.16 Objectives - To explain the central dogma - To understand the steps of transcription and translation in order to explain how our genes create proteins necessary for survival.

More information

Next Generation Surveillance Systems integrating whole genome sequencing data into real-time detection and control (of TADs / EADs)

Next Generation Surveillance Systems integrating whole genome sequencing data into real-time detection and control (of TADs / EADs) Next Generation Surveillance Systems integrating whole genome sequencing data into real-time detection and control (of TADs / EADs) Peter Durr Ivano Broz Kerryne Graham Jianning Wang Frank Wong Sam McCullough

More information

CS 312: Algorithms Analysis. Gene Sequence Alignment. Overview: Objectives: Code:

CS 312: Algorithms Analysis. Gene Sequence Alignment. Overview: Objectives: Code: CS 312: Algorithms Analysis Gene Sequence Alignment Overview: In this project, you will implement dynamic programming algorithms for computing the minimal cost of aligning gene sequences and for extracting

More information

aM (modules 1 and 10 are required)

aM (modules 1 and 10 are required) This form should be used for all taxonomic proposals. Please complete all those modules that are applicable (and then delete the unwanted sections). For guidance, see the notes written in blue and the

More information