De novo iden)fica)on of SNPs from RNA- seq data in non- model species
|
|
- Doris Stewart
- 5 years ago
- Views:
Transcription
1 De novo iden)fica)on of SNPs from RNA- seq data in non- model species Hélène Lopez- Maestre 8th Novembre 2016
2 Why work with RNAseq? Lower cost SNPs from expressed regions SNPs with a more direct func:onal impact
3 How to detect SNPs in RNAseq data? for non model species?
4 reads from RNAseq SNP iden:fica:on from RNAseq with a reference genome
5 reads from RNAseq SNP iden:fica:on from RNAseq 1) with a reference genome reference genome
6 reads from RNAseq SNP iden:fica:on from RNAseq 1) with a reference genome reference genome Several tools : SNPiR GATK SAMtools mpileup etc.
7 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : Trinity Oases
8 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
9 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
10 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
11 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
12 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
13 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
14 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
15 reads from RNAseq SNP iden:fica:on from RNAseq 2) with an assembled transcriptome transcriptome assembly : { two assembled transcripts from one gene due to alterna)ve splicing A A C B D D
16 SNP iden:fica:on from RNAseq 3) from a De Bruijn graph reads from RNAseq local assembly
17 SNP iden:fica:on from RNAseq 3) from a De Bruijn graph ATTCCTGCAATAC ATTCCTACAATAC GGTAGGTACATCGTGT GTTGTGCATGAATTCTG TGCATGAATTCTACAATA ATGAGGTACATGCAT GCATGAATTCTGCAATAC GTTGTGCATGAATTCTA KisSplice DiscoSNP Bubble of size = 2k+1
18 Kissplice Filtres Sequencing errors Inexacts repeats
19 Sequencing errors low expression 39G 1A high expression 390G 10A ) absolute cutoff rela)ve cutoff
20 Inexacts repeats...attgctgcaattc...attgctacaattc......attgctgcaattc...attgctacaattc...
21 Inexacts repeats...attgctgcaattc...attgctacaattc......attgctgcaattc...attgctacaattc......attgctgcacttc...attgctgcaattc...attgctacaattc......attgctgcacttc...attgctgcaattc...attgctacaattc...
22 Inexacts repeats...attgctgcaattc...attgctacaattc......attgctgcaattc...attgctacaattc......attgctgcacttc...attgctgcaattc...attgctacaattc......attgctgcacttc...attgctgcaattc...attgctacaattc... X10 copies
23 SNP iden:fica:on from RNAseq 1) with a reference genome 2) with an assembled transcriptome 3) directly from the De Bruijn graph
24 Valida:on of the method on real data Human dataset genome and annota)on available 1000G project : genotype of 1000 individuals Geuvadis : RNAseq of these individuals RNAseq from 2 x 5 central europeans and 2 x 5 toscans
25 Valida:on of the method on real data Recall : Out of all the «True» SNPs, the propor)on predicted by the method Precision : Out of all the predicted SNPs, the propor)on that is correctly predicted
26 True SNPs All the SNPs according to the genotype of the 20 indivuals (1000G project) locus covered by a minumum of 5 reads
27 Recall and precision Recall GATK genome MP transcriptome MPlong transcript. Kp default Kp C= True SNPs minimum Coverage
28 Recall and precision Recall GATK genome MP transcriptome MPlong transcript. Kp default Kp C=0.02 Precision True SNPs minimum Coverage Predicted SNPs minimum coverage
29 Recall and precision Recall GATK genome MP transcriptome MPlong transcript. Kp default Kp C=0.02 Precision True SNPs minimum Coverage Predicted SNPs minimum coverage
30 Recall and precision Recall GATK genome MP transcriptome MPlong transcript. Kp default Kp C=0.02 Precision True SNPs minimum Coverage Predicted SNPs minimum coverage
31 Recall and precision Recall GATK genome MP transcriptome MPlong transcript. Kp default Kp C=0.02 Precision True SNPs minimum Coverage Predicted SNPs minimum coverage
32 Recall and precision Recall GATK genome MP transcriptome MPlong transcript. Kp default Kp C=0.02 Precision True SNPs minimum Coverage Predicted SNPs minimum coverage
33 Recall and Allele Frequency MAF : Minor Allele Frequency All MAF MAF > 0.05 MAF > 0.15 Recall GATK genome MP transcriptome MPlong transcript. MPcdhit transcript. Kp default Kp C=0.02 Recall Recall True SNPs minimum Coverage True SNPs minimum Coverage True SNPs minimum Coverage
34 Downstream analyses
35 Downstream analyses i) Variant impact on the amino- acid? ii) Variant specific to a biological condi)on?
36 Predic:on of the amino acid changes reads from RNAseq transcriptome assembly
37 Predic:on of the amino acid changes KisSplice2refTranscriptome transcriptome assembly
38 Predic:on of the amino acid changes KisSplice2refTranscriptome transcriptome assembly
39 Predic:on of the amino acid changes KisSplice2refTranscriptome transcriptome assembly
40 Predic:on of the amino acid changes KisSplice2refTranscriptome C G A T transcriptome assembly
41 Predic:on of the amino acid changes KisSplice2refTranscriptome Arginin C G A T His)din transcriptome assembly
42 Quan:fica:on
43 Quan:fica:on 25 22
44 Condi:on specific variant Condi)on 1 Condi)on 2 Condi)on 1 23 Condi)on Condi)on 1 18 Condi)on 2
45 Condi:on specific variant Condi)on 1 Condi)on 2 Condi:on 1 23 Condi)on 2 2 KissDE 4 Condi)on 1 18 Condi:on 2
46 Pipeline : Input fastq files D00393:39:C3PHDACXX:2:1101:8716:2263 1:N:0:CTATAC TCAGTCTTTTTCTCTCTCCTAGTCCCATAATGCCTTGACGAACATCAGTCTGTCCCATAATGCTCTCCTATTA + D00393:39:C3PHDACXX:2:1101:11867:2372 1:N:0:CTATAC TACGTCGGAAATATATAAAAAAATGGCACTGGAAGAATTCACTGGCACTGGAAGAATTCATGGCACTGGA + D00393:39:C3PHDACXX:2:1101:3047:2629 1:N:0:CTATAA CCGTGGTCCCATAATGCCTTGACACGGTCTTGGCACTGGAAGAATTCAAGAATTCATGGCACTGGAAGAA + D00393:39:C3PHDACXX:2:1101:8009:2719 1:N:0:CTATAC CCCGACTGAAGGTGTACTCATGGCCAGGTCCCATAATGCCTTGACCTCTCCTATTACTGAAGGGGTGTACT + CCCCCCCCCCCCCCCCCBCCCCCCCCCCBC?CBBBBBBBBBBBBBCFFFFFGGHHFHHHHHHHHJJIIJIIJJJI
47 Pipeline : Output KisSplice + KissDE + Kissplice2refTranscriptome output : c54770_g1_i bcc_487 Cycle_259 CAA CCA Q P NonSyn. No
48 Pipeline : Output KisSplice + KissDE + Kissplice2refTranscriptome output : c54770_g1_i bcc_487 Cycle_259 CAA CCA Q P NonSyn. No c54495_g1_i bcc_6275 Cycle_1 GCA GCC A A Syn. No
49 Pipeline : Output KisSplice + KissDE + Kissplice2refTranscriptome output : c54770_g1_i bcc_487 Cycle_259 CAA CCA Q P NonSyn. No c54495_g1_i bcc_6275 Cycle_1 GCA GCC A A Syn. No c59969_g1_i bcc_9888 Cycle_0 AGC TGC S C NonSyn. Yes
50 Pipeline : Output KisSplice + KissDE + Kissplice2refTranscriptome output : c54770_g1_i bcc_487 Cycle_259 CAA CCA Q P NonSyn. No c54495_g1_i bcc_6275 Cycle_1 GCA GCC A A Syn. No c59969_g1_i bcc_9888 Cycle_0 AGC TGC S C NonSyn. Yes c65293_g5_i2 847 bcc_6446 Cycle_0 N/A N/A N/A N/A NonCod. Yes
51 Experimental Valida:ons
52 Asobara tabida ovaries infected with Wolbachia
53 Asobara tabida ovaries infected with Wolbachia 2 lines with variable dependency
54 Asobara tabida ovaries infected with Wolbachia 2 lines with variable dependency RNAseq (ovaries) pool of 30 individuals
55 Asobara tabida ovaries infected with Wolbachia 2 lines with variable dependency RNAseq (ovaries) pool of 30 individuals SNPs of which condi)on specific 36/36 SNPs validated (rt- PCR + sequencing) of which 7 predicted by Kissplice only
56 Take- home message Detec)ng SNPs in non model species from RNAseq Similar performances (precision and recall) compared to methods using a ref. genome Precision for SNP discovery : >80% Test for condi)on specific variant Precision : 96% Experimental valida)ons Lopez- Maestre et al. NAR h^p://kissplice.prabi.fr/twas/
57 Perspec:ves Improve the detec)on of close SNPs and SNPs in highly polymorphic regions : increase KisSplice recall «True» SNPs hard to detect for all methods
58 Acknowledgments ARN Colibread L. Brinza C. Marchet J. Kielbassa S. Bas)en M. Bou)gny D. Monin C. Vieira F. Picard N. Kremer F. Vavre M. Sagot V. Lacroix G. Sacomoto
59 Experimental valida:on * * * SFR2 SFR3 Nitric oxide synthase SFR2: 0% / SFR3: 100% Protein ovarian tumor SFR2: 0% / SFR3: 33% Pyrimidodiazepine synthase SFR2: 100% / SFR3: 67%
60 Mul:ple SNPs Exclus de l étude : Fort taux de faux posi)fs Probablement des répé))ons inexactes
61 Close SNPs : phased distance <k Bubble of size >2k+1
62 Close SNPs : non- phased
63 Recall vs MAF All MAF MAF > 0.05 MAF > 0.15 Recall GATK genome MP transcriptome MPlong transcript. MPcdhit transcript. Kp default Kp C=0.02 Recall Recall True SNPs minimum Coverage True SNPs minimum Coverage True SNPs minimum Coverage
64 Kissplice parameters Recall MPlong Kp default Kp k=31 Kp k=51 Kp b=10 Kp C=0.10 Kp C=0.02 Precision True SNPs minimum Coverage Predicted SNPs minimum coverage
65 Impact on amino acids
66 Impact on amino acids
67 Impact on amino acids Precision of impact on aa : 96%
68 Impact on amino acids Precision of impact on aa : 96% Precision of the pipeline > 77%
69 KissDE C1 C2 Var Var % 0%
70 KissDE C1 C2 C1 C2 Var Var Var Var % 0% 100% 30%
71 KissDE C1 C2 C1 C2 Var Var Var Var % 0% 100% 30% C1 C2 Var Var % 30%
72 Difference in allele frquency
73 Difference in allele frquency
74 Difference in allele frquency Pvalue 0.05 #reads 100
75 KissDE model GLM : log(λ ijk )= μ + α i + β j log(λ ijk )= μ + α i + β j + (αβ) ij
Analyse de données de séquençage haut débit
Analyse de données de séquençage haut débit Vincent Lacroix Laboratoire de Biométrie et Biologie Évolutive INRIA ERABLE 9ème journée ITS 21 & 22 novembre 2017 Lyon https://its.aviesan.fr Sequencing is
More informationTranscript reconstruction
Transcript reconstruction Summary I Data types, file formats and utilities Annotation: Genomic regions Genes Peaks bedtools Alignment: Map reads BAM/SAM Samtools Aggregation: Summary files Wig (UCSC) TDF
More informationAnalysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers
Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies
More informationVariant Classification. Author: Mike Thiesen, Golden Helix, Inc.
Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets
More informationDr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.
Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag
More informationTranscriptome Analysis
Transcriptome Analysis Data Preprocessing Sample Preparation Illumina Sequencing Demultiplexing Raw FastQ Reference Genome (fasta) Reference Annotation (GTF) Reference Genome Analysis Tophat Accepted hits
More informationTranscriptome and isoform reconstruc1on with short reads. Tangled up in reads
Transcriptome and isoform reconstruc1on with short reads Tangled up in reads Topics of this lecture Mapping- based reconstruc1on methods Case study: The domes1c dog De- novo reconstruc1on method Trinity
More informationData mining with Ensembl Biomart. Stéphanie Le Gras
Data mining with Ensembl Biomart Stéphanie Le Gras (slegras@igbmc.fr) Guidelines Genome data Genome browsers Getting access to genomic data: Ensembl/BioMart 2 Genome Sequencing Example: Human genome 2000:
More informationRNA- seq Introduc1on. Promises and pi7alls
RNA- seq Introduc1on Promises and pi7alls DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different func1onal RNAs Which RNAs (and some1mes
More informationBreast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS
Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS dr sc. Ana Krivokuća Laboratory for molecular genetics Institute for Oncology and
More informationAnalysis with SureCall 2.1
Analysis with SureCall 2.1 Danielle Fletcher Field Application Scientist July 2014 1 Stages of NGS Analysis Primary analysis, base calling Control Software FASTQ file reads + quality 2 Stages of NGS Analysis
More informationHBV. Next Generation Sequencing, data analysis and reporting. Presenter Leen-Jan van Doorn
HBV Next Generation Sequencing, data analysis and reporting Presenter Leen-Jan van Doorn HBV Forum 3 October 24 th, 2017 Marriott Marquis, Washington DC www.forumresearch.org HBV Biomarkers HBV biomarkers:
More informationCanadian Bioinforma1cs Workshops
5/12/16 Canadian Bioinforma1cs Workshops www.bioinforma1cs.ca Module #: Title of Module 2 1 Module 3 Introduc1on to WGBS and analysis Guillaume Bourque Learning Objec/ves of Module Know the different technologies
More informationgenomics for systems biology / ISB2020 RNA sequencing (RNA-seq)
RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome
More informationA Quick-Start Guide for rseqdiff
A Quick-Start Guide for rseqdiff Yang Shi (email: shyboy@umich.edu) and Hui Jiang (email: jianghui@umich.edu) 09/05/2013 Introduction rseqdiff is an R package that can detect differential gene and isoform
More informationP. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University.
Databases and Tools for High Throughput Sequencing Analysis P. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University. HTseq Platforms Applications on Biomedical Sciences
More informationHands-On Ten The BRCA1 Gene and Protein
Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such
More informationRare Variant Burden Tests. Biostatistics 666
Rare Variant Burden Tests Biostatistics 666 Last Lecture Analysis of Short Read Sequence Data Low pass sequencing approaches Modeling haplotype sharing between individuals allows accurate variant calls
More informationInference of Isoforms from Short Sequence Reads
Inference of Isoforms from Short Sequence Reads Tao Jiang Department of Computer Science and Engineering University of California, Riverside Tsinghua University Joint work with Jianxing Feng and Wei Li
More informationRNA-seq Introduction
RNA-seq Introduction DNA is the same in all cells but which RNAs that is present is different in all cells There is a wide variety of different functional RNAs Which RNAs (and sometimes then translated
More informationIdentification of heritable genetic risk factors for bladder cancer through genome-wide association studies (GWAS)
BCAN 2014 August 9, 2014 Identification of heritable genetic risk factors for bladder cancer through genome-wide association studies (GWAS) Ludmila Prokunina-Olsson, PhD Investigator Laboratory of Translational
More informationIntroduction. Introduction
Introduction We are leveraging genome sequencing data from The Cancer Genome Atlas (TCGA) to more accurately define mutated and stable genes and dysregulated metabolic pathways in solid tumors. These efforts
More informationStatistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012
Statistical Tests for X Chromosome Association Study with Simulations Jian Wang July 10, 2012 Statistical Tests Zheng G, et al. 2007. Testing association for markers on the X chromosome. Genetic Epidemiology
More informationSUPPLEMENTARY INFORMATION
Supplementary text Collectively, we were able to detect ~14,000 expressed genes with RPKM (reads per kilobase per million) > 1 or ~16,000 with RPKM > 0.1 in at least one cell type from oocyte to the morula
More informationRNA SEQUENCING AND DATA ANALYSIS
RNA SEQUENCING AND DATA ANALYSIS Download slides and package http://odin.mdacc.tmc.edu/~rverhaak/package.zip http://odin.mdacc.tmc.edu/~rverhaak/rna-seqlecture.zip Overview Introduction into the topic
More informationCharacterizing intra-host influenza virus populations to predict emergence
Characterizing intra-host influenza virus populations to predict emergence June 12, 2012 Forum on Microbial Threats Washington, DC Elodie Ghedin Center for Vaccine Research Dept. Computational & Systems
More informationMissing Heritablility How to Analyze Your Own Genome Fall 2013
Missing Heritablility 02-223 How to Analyze Your Own Genome Fall 2013 Heritability Heritability: the propor>on of observed varia>on in a par>cular trait (as height) that can be agributed to inherited gene>c
More informationNature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1
Supplementary Figure 1 U1 inhibition causes a shift of RNA-seq reads from exons to introns. (a) Evidence for the high purity of 4-shU-labeled RNAs used for RNA-seq. HeLa cells transfected with control
More informationRNA SEQUENCING AND DATA ANALYSIS
RNA SEQUENCING AND DATA ANALYSIS Length of mrna transcripts in the human genome 5,000 5,000 4,000 3,000 2,000 4,000 1,000 0 0 200 400 600 800 3,000 2,000 1,000 0 0 2,000 4,000 6,000 8,000 10,000 Length
More informationResearch Strategy: 1. Background and Significance
Research Strategy: 1. Background and Significance 1.1. Heterogeneity is a common feature of cancer. A better understanding of this heterogeneity may present therapeutic opportunities: Intratumor heterogeneity
More informationDNA-seq Bioinformatics Analysis: Copy Number Variation
DNA-seq Bioinformatics Analysis: Copy Number Variation Elodie Girard elodie.girard@curie.fr U900 institut Curie, INSERM, Mines ParisTech, PSL Research University Paris, France NGS Applications 5C HiC DNA-seq
More informationCopy Number Varia/on Detec/on. Alex Mawla UCD Genome Center Bioinforma5cs Core Tuesday June 16, 2015
Copy Number Varia/on Detec/on Alex Mawla UCD Genome Center Bioinforma5cs Core Tuesday June 16, 2015 Today s Goals Understand the applica5on and capabili5es of using targe5ng sequencing and CNV calling
More informationBreast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data
Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing
More informationncounter Assay Automated Process Immobilize and align reporter for image collecting and barcode counting ncounter Prep Station
ncounter Assay ncounter Prep Station Automated Process Hybridize Reporter to RNA Remove excess reporters Bind reporter to surface Immobilize and align reporter Image surface Count codes Immobilize and
More informationCopy Number Variations and Association Mapping Advanced Topics in Computa8onal Genomics
Copy Number Variations and Association Mapping 02-715 Advanced Topics in Computa8onal Genomics SNP and CNV Genotyping SNP genotyping assumes two copy numbers at each locus (i.e., no CNVs) CNV genotyping
More informationIlluminating the genetics of complex human diseases
Illuminating the genetics of complex human diseases Michael Schatz Sept 27, 2012 Beyond the Genome @mike_schatz / #BTG2012 Outline 1. De novo mutations in human diseases 1. Autism Spectrum Disorder 2.
More informationPALB2 c g>c is. VARIANT OF UNCERTAIN SIGNIFICANCE (VUS) CGI s summary of the available evidence is in Appendices A-C.
Consultation sponsor (may not be the patient): First LastName [Patient identity withheld] Date received by CGI: 2 Sept 2017 Variant Fact Checker Report ID: 0000001.5 Date Variant Fact Checker issued: 12
More informationMODULE 4: SPLICING. Removal of introns from messenger RNA by splicing
Last update: 05/10/2017 MODULE 4: SPLICING Lesson Plan: Title MEG LAAKSO Removal of introns from messenger RNA by splicing Objectives Identify splice donor and acceptor sites that are best supported by
More informationVIP: an integrated pipeline for metagenomics of virus
VIP: an integrated pipeline for metagenomics of virus identification and discovery Yang Li 1, Hao Wang 2, Kai Nie 1, Chen Zhang 1, Yi Zhang 1, Ji Wang 1, Peihua Niu 1 and Xuejun Ma 1 * 1. Key Laboratory
More informationSmall RNAs and how to analyze them using sequencing
Small RNAs and how to analyze them using sequencing Jakub Orzechowski Westholm (1) Long- term bioinforma=cs support, Science For Life Laboratory Stockholm (2) Department of Biophysics and Biochemistry,
More informationChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs
ChIP-seq hands-on Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs Main goals Becoming familiar with essential tools and formats Visualizing and contextualizing raw data Understand
More informationIso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing
Iso-Seq Method Updates and Target Enrichment Without Amplification for SMRT Sequencing PacBio Americas User Group Meeting Sample Prep Workshop June.27.2017 Tyson Clark, Ph.D. For Research Use Only. Not
More informationGolden Helix s End-to-End Solution for Clinical Labs
Golden Helix s End-to-End Solution for Clinical Labs Steven Hystad - Field Application Scientist Nathan Fortier Senior Software Engineer 20 most promising Biotech Technology Providers Top 10 Analytics
More informationNGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation
NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation Michael R. Rossi, PhD, FACMG Assistant Professor Division of Cancer Biology, Department of Radiation Oncology Department
More informationCOMPARISON OF HIV DRUG-RESISTANT MUTANT DETECTION BY NGS WITH AND WITHOUT UNIQUE MOLECULAR IDENTIFIERS (UMI)
COMPARISON OF HIV DRUG-RESISTANT MUTANT DETECTION BY NGS WITH AND WITHOUT UNIQUE MOLECULAR IDENTIFIERS (UMI) Kevin D McCormick; Kerri J Penrose; Rahil Sethi; Jacob Waldman; William Schwarzmann; Uma Chandran;
More informationSupplementary Figure 1 MicroRNA expression in human synovial fibroblasts from different locations. MicroRNA, which were identified by RNAseq as most
Supplementary Figure 1 MicroRNA expression in human synovial fibroblasts from different locations. MicroRNA, which were identified by RNAseq as most differentially expressed between human synovial fibroblasts
More informationSupplementary Note. Nature Genetics: doi: /ng.2928
Supplementary Note Loss of heterozygosity analysis (LOH). We used VCFtools v0.1.11 to extract only singlenucleotide variants with minimum depth of 15X and minimum mapping quality of 20 to create a ped
More informationSimple, rapid, and reliable RNA sequencing
Simple, rapid, and reliable RNA sequencing RNA sequencing applications RNA sequencing provides fundamental insights into how genomes are organized and regulated, giving us valuable information about the
More informationPREPARED FOR: U.S. Army Medical Research and Materiel Command Fort Detrick, Maryland
AD Award Number: W81XWH-12-1-0298 TITLE: MTHFR Functional Polymorphism C677T and Genomic Instability in the Etiology of Idiopathic Autism in Simplex Families PRINCIPAL INVESTIGATOR: Xudong Liu, PhD CONTRACTING
More informationCITATION FILE CONTENT/FORMAT
CITATION For any resultant publications using please cite: Matthew A. Field, Vicky Cho, T. Daniel Andrews, and Chris C. Goodnow (2015). "Reliably detecting clinically important variants requires both combined
More informationStructured Association Advanced Topics in Computa8onal Genomics
Structured Association 02-715 Advanced Topics in Computa8onal Genomics Structured Association Lasso ACGTTTTACTGTACAATT Gflasso (Kim & Xing, 2009) ACGTTTTACTGTACAATT Greater power Fewer false posi2ves Phenome
More informationAmbient temperature regulated flowering time
Ambient temperature regulated flowering time Applications of RNAseq RNA- seq course: The power of RNA-seq June 7 th, 2013; Richard Immink Overview Introduction: Biological research question/hypothesis
More informationSupplementary note: Comparison of deletion variants identified in this study and four earlier studies
Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Here we compare the results of this study to potentially overlapping results from four earlier studies
More informationSupplementary Document
Supplementary Document 1. Supplementary Table legends 2. Supplementary Figure legends 3. Supplementary Tables 4. Supplementary Figures 5. Supplementary References 1. Supplementary Table legends Suppl.
More informationSupplementary Materials and Methods
DD2 suppresses tumorigenicity of ovarian cancer cells by limiting cancer stem cell population Chunhua Han et al. Supplementary Materials and Methods Analysis of publicly available datasets: To analyze
More informationAn Analysis of MDM4 Alternative Splicing and Effects Across Cancer Cell Lines
An Analysis of MDM4 Alternative Splicing and Effects Across Cancer Cell Lines Kevin Hu Mentor: Dr. Mahmoud Ghandi 7th Annual MIT PRIMES Conference May 2021, 2017 Outline Introduction MDM4 Isoforms Methodology
More informationTrinity: Transcriptome Assembly for Genetic and Functional Analysis of Cancer [U24]
Trinity: Transcriptome Assembly for Genetic and Functional Analysis of Cancer [U24] ITCR meeting, June 2016 The Cancer Transcriptome A window into the (expressed) genetic and epigenetic state of a tumor
More informationSMPD 287 Spring 2015 Bioinformatics in Medical Product Development. Final Examination
Final Examination You have a choice between A, B, or C. Please email your solutions, as a pdf attachment, by May 13, 2015. In the subject of the email, please use the following format: firstname_lastname_x
More informationSupplementary Material for IPred - Integrating Ab Initio and Evidence Based Gene Predictions to Improve Prediction Accuracy
1 SYSTEM REQUIREMENTS 1 Supplementary Material for IPred - Integrating Ab Initio and Evidence Based Gene Predictions to Improve Prediction Accuracy Franziska Zickmann and Bernhard Y. Renard Research Group
More informationCV and publications of Alexey Larionov
CV and publications of Alexey Larionov PhD in Molecular Oncology, MSc in Bioinformatics, MBChB in Medicine www.larionov.co.uk Employment 2013-present: Research Associate (Bioinformatics) Acad. Lab. of
More informationACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics
ACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics Precision Genomics for Immuno-Oncology Personalis, Inc. ACE ImmunoID When one biomarker doesn t tell the whole
More informationSupplemental Information For: The genetics of splicing in neuroblastoma
Supplemental Information For: The genetics of splicing in neuroblastoma Justin Chen, Christopher S. Hackett, Shile Zhang, Young K. Song, Robert J.A. Bell, Annette M. Molinaro, David A. Quigley, Allan Balmain,
More informationGenetics/Genomics: role of genes in diagnosis and/risk and in personalised medicine
Genetics/Genomics: role of genes in diagnosis and/risk and in personalised medicine Lynn Greenhalgh, Macmillan Cancer and General Clinical Geneticist Cancer Genetics Service Cancer is common 1 in
More informationObstacles and challenges in the analysis of microrna sequencing data
Obstacles and challenges in the analysis of microrna sequencing data (mirna-seq) David Humphreys Genomics core Dr Victor Chang AC 1936-1991, Pioneering Cardiothoracic Surgeon and Humanitarian The ABCs
More informationBIOMEDICAL SCIENCES GRADUATE PROGRAM SPRING 2015
THE OHIO STATE UNIVERSITY BIOMEDICAL SCIENCES GRADUATE PROGRAM SPRING 2015 Adam Suhy PhD Candidate Regulation of Cholesteryl Ester Transfer Protein and Expression of Transporters in the Blood Brain Barrier
More informationRNA- seq Introduc1on. Promises and pi7alls
RNA- seq Introduc1on Promises and pi7alls RNA gives informa1on on which genes that are expressed How DNA get transcribed to RNA (and some1mes then translated to proteins) varies between e. g. - Tissues
More informationSingle Nucleotide Polymorphisms in the NOS Genes as Predictors of the Levels of Exhaled Nitric Oxide
Single Nucleotide Polymorphisms in the NOS Genes as Predictors of the Levels of Exhaled Nitric Oxide Santosh Dahgam, Anna-Carin Olin, Lars Modig, Åsa Torinsson Naluai, Fredrik Nyberg Occupational and Environmental
More informationConsidera*ons when undergoing personal genotyping
Considera*ons when undergoing personal genotyping Kelly Ormond, MS, CGC Louanne Hudgins, MD, FACMG January 19, 2011 GENE 210 Disclosures and introduc*ons Professor Ormond provided paid consulta*on for
More informationReporting TP53 gene analysis results in CLL
Reporting TP53 gene analysis results in CLL Mutations in TP53 - From discovery to clinical practice in CLL Discovery Validation Clinical practice Variant diversity *Leroy at al, Cancer Research Review
More informationMODULE 3: TRANSCRIPTION PART II
MODULE 3: TRANSCRIPTION PART II Lesson Plan: Title S. CATHERINE SILVER KEY, CHIYEDZA SMALL Transcription Part II: What happens to the initial (premrna) transcript made by RNA pol II? Objectives Explain
More informationNGS panels in clinical diagnostics: Utrecht experience. Van Gijn ME PhD Genome Diagnostics UMCUtrecht
NGS panels in clinical diagnostics: Utrecht experience Van Gijn ME PhD Genome Diagnostics UMCUtrecht 93 Gene panels UMC Utrecht Cardiovascular disease (CAR) (5 panels) Epilepsy (EPI) (11 panels) Hereditary
More informationDNA Sequence Bioinformatics Analysis with the Galaxy Platform
DNA Sequence Bioinformatics Analysis with the Galaxy Platform University of São Paulo, Brazil 28 July - 1 August 2014 Dave Clements Johns Hopkins University Robson Francisco de Souza University of São
More informationDeconRNASeq: A Statistical Framework for Deconvolution of Heterogeneous Tissue Samples Based on mrna-seq data
DeconRNASeq: A Statistical Framework for Deconvolution of Heterogeneous Tissue Samples Based on mrna-seq data Ting Gong, Joseph D. Szustakowski April 30, 2018 1 Introduction Heterogeneous tissues are frequently
More informationNot IN Our Genes - A Different Kind of Inheritance.! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014
Not IN Our Genes - A Different Kind of Inheritance! Christopher Phiel, Ph.D. University of Colorado Denver Mini-STEM School February 4, 2014 Epigenetics in Mainstream Media Epigenetics *Current definition:
More informationLectures 13: High throughput sequencing: Beyond the genome. Spring 2017 March 28, 2017
Lectures 13: High throughput sequencing: Beyond the genome Spring 2017 March 28, 2017 h@p://www.fejes.ca/2009/06/science- cartoons- 5- rna- seq.html Omics Transcriptome - the set of all mrnas present in
More informationVirusDetect pipeline - virus detection with small RNA sequencing
VirusDetect pipeline - virus detection with small RNA sequencing CSC webinar 16.1.2018 Eija Korpelainen, Kimmo Mattila, Maria Lehtivaara Big thanks to Jan Kreuze and Jari Valkonen! Outline Small interfering
More informationncounter Assay Automated Process Capture & Reporter Probes Bind reporter to surface Remove excess reporters Hybridize CodeSet to RNA
ncounter Assay Automated Process Hybridize CodeSet to RNA Remove excess reporters Bind reporter to surface Immobilize and align reporter Image surface Count codes mrna Capture & Reporter Probes slides
More informationAssignment 5: Integrative epigenomics analysis
Assignment 5: Integrative epigenomics analysis Due date: Friday, 2/24 10am. Note: no late assignments will be accepted. Introduction CpG islands (CGIs) are important regulatory regions in the genome. What
More informationWhole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute
Whole Genome and Transcriptome Analysis of Anaplastic Meningioma Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Outline Anaplastic meningioma compared to other cancers Whole genomes
More informationCRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery
CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery PacBio CoLab Session October 20, 2017 For Research Use Only. Not for use in diagnostics procedures. Copyright 2017 by Pacific Biosciences
More informationTITLE: Exploring the pathogenic and therapeutic implications of aberrant splicing in breast cancer
AD Award Number: W81XWH-08-1-0403 TITLE: Exploring the pathogenic and therapeutic implications of aberrant splicing in breast cancer PRINCIPAL INVESTIGATOR: Dr. Sandeep Dave CONTRACTING ORGANIZATION: Duke
More informationClassifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/
CSCI1950 Z Computa4onal Methods for Biology Lecture 18 Ben Raphael April 8, 2009 hip://cs.brown.edu/courses/csci1950 z/ Binary classifica,on Given a set of examples (x i, y i ), where y i = + 1, from unknown
More informationEpigenetics. Jenny van Dongen Vrije Universiteit (VU) Amsterdam Boulder, Friday march 10, 2017
Epigenetics Jenny van Dongen Vrije Universiteit (VU) Amsterdam j.van.dongen@vu.nl Boulder, Friday march 10, 2017 Epigenetics Epigenetics= The study of molecular mechanisms that influence the activity of
More informationDetection of copy number variations in PCR-enriched targeted sequencing data
Detection of copy number variations in PCR-enriched targeted sequencing data German Demidov Parseq Lab, Saint-Petersburg University of Russian Academy of Sciences, current: Center for Genomic Regulation
More informationClinical Genetics. Functional validation in a diagnostic context. Robert Hofstra. Leading the way in genetic issues
Clinical Genetics Leading the way in genetic issues Functional validation in a diagnostic context Robert Hofstra Clinical Genetics Leading the way in genetic issues Future application of exome sequencing
More informationCharacteriza*on of Soma*c Muta*ons in Cancer Genomes
Characteriza*on of Soma*c Muta*ons in Cancer Genomes Ben Raphael Department of Computer Science Center for Computa*onal Molecular Biology Soma*c Muta*ons and Cancer Clonal Theory (Nowell 1976) Passenger
More informationTOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY. Giuseppe Narzisi, PhD Bioinformatics Scientist
TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY Giuseppe Narzisi, PhD Bioinformatics Scientist July 29, 2014 Micro-Assembly Approach to detect INDELs 2 Outline 1 Detecting INDELs:
More informationMapping evolutionary pathways of HIV-1 drug resistance using conditional selection pressure. Christopher Lee, UCLA
Mapping evolutionary pathways of HIV-1 drug resistance using conditional selection pressure Christopher Lee, UCLA HIV-1 Protease and RT: anti-retroviral drug targets protease RT Protease: responsible for
More informationSupplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first
Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first intron IGLL5 mutation depicting biallelic mutations. Red arrows highlight the presence of out of phase
More informationGenome. Institute. GenomeVIP: A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud. R. Jay Mashl.
GenomeVIP: the Genome Institute at Washington University A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud R. Jay Mashl October 20, 2014 Turnkey Variant
More informationPan-cancer analysis of expressed somatic nucleotide variants in long intergenic non-coding RNA
Pan-cancer analysis of expressed somatic nucleotide variants in long intergenic non-coding RNA Travers Ching 1,2, Lana X. Garmire 1,2 1 Molecular Biosciences and Bioengineering Graduate Program, University
More informationGenetic Tests and Genetic Counseling How to Analyze Your Own Genome
Genetic Tests and Genetic Counseling 02-223 How to Analyze Your Own Genome Genetic Tests for Huntington Disease Hun7ngton Disease Incurable brain disorder that runs in families Movement, cogni7ve, and
More informationSupplementary Figure 1. Schematic diagram of o2n-seq. Double-stranded DNA was sheared, end-repaired, and underwent A-tailing by standard protocols.
Supplementary Figure 1. Schematic diagram of o2n-seq. Double-stranded DNA was sheared, end-repaired, and underwent A-tailing by standard protocols. A-tailed DNA was ligated to T-tailed dutp adapters, circularized
More informationGenomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology Agnese Viļuma, Sofia Mikko, Daniela Hahn, Loren Skow, Göran Andersson,
More informationSupplementary Figures
Supplementary Figures Supplementary Figure 1. Heatmap of GO terms for differentially expressed genes. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, higher positive
More informationClinical utility of NGS for the detection of HIV and HCV resistance
18 th Annual Resistance and Antiviral Therapy Meeting v Professor Janke Schinkel Academic Medical Centre, Amsterdam, The Netherlands Thursday 18 September 2014, Royal College of Physicians, London Clinical
More informationHistone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells
Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells Ondrej Podlaha 1, Subhajyoti De 2,3,4, Mithat Gonen 5, Franziska Michor 1 * 1 Department of Biostatistics
More informationCancer Informatics Lecture
Cancer Informatics Lecture Mayo-UIUC Computational Genomics Course June 22, 2018 Krishna Rani Kalari Ph.D. Associate Professor 2017 MFMER 3702274-1 Outline The Cancer Genome Atlas (TCGA) Genomic Data Commons
More informationThe Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0
The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0 Introduction Loss of erozygosity (LOH) represents the loss of allelic differences. The SNP markers on the SNP Array 6.0 can be used
More informationACE ImmunoID. ACE ImmunoID. Precision immunogenomics. Precision Genomics for Immuno-Oncology
ACE ImmunoID ACE ImmunoID Precision immunogenomics Precision Genomics for Immuno-Oncology Personalis, Inc. A universal biomarker platform for immuno-oncology Patient response to cancer immunotherapies
More information