TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY. Giuseppe Narzisi, PhD Bioinformatics Scientist
|
|
- Neil Wiggins
- 5 years ago
- Views:
Transcription
1 TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY Giuseppe Narzisi, PhD Bioinformatics Scientist
2 July 29, 2014 Micro-Assembly Approach to detect INDELs 2 Outline 1 Detecting INDELs: issues and challenges 2 Scalpel micro-assembly pipeline 3 Applications 1 Large-scale validation experiment 2 De novo/transmitted mutations in Autism
3 July 29, 2014 Micro-Assembly Approach to detect INDELs 3 DETECTING INDELS Issues and challenges of variation discovery
4 July 29, 2014 Micro-Assembly Approach to detect INDELs 4 Why do we care? 1. Indels: insertion or deletion of DNA sequences. [The 1000 Genomes Project, Nature 2010] Second most common sources of variation in human genomes and the most common structural variant [Mullane et al. Hum. Mol. Genet. 2010] 2. Microsatellites: short tandem repeats (STRs), repeating sequences of 2-6 base pairs. Length changes are linked to more than 40 neurological diseases in humans. [Pearson et al., Nat. Rev. Gen. 2005] 3. De novo mutations: Autism: de novo mutations that are likely to severely disrupt the encoded protein are significantly more abundant in affected children than in unaffected siblings. [Ivan Iossifov et al., Neuron, 2012]
5 July 29, 2014 Micro-Assembly Approach to detect INDELs 5 The detection challenge Repeats Mapping errors Coverage Coverage Father Mother Self Sibling Genome location Irregularity in capture efficiency near the edges of the coding region SNP or Indel? Long insertion..ttgaattagccttggtgaattgagcctt...tttag agtgc..!! GAATGAGCC GAAT-GAGCC TTTAGAATAGGC! ATAGGCGAGTGC R SNP R
6 July 29, 2014 Micro-Assembly Approach to detect INDELs 6 Human Exome Capture technology: ~200,000 target regions Mb (~2% of HG) 3-4 Gb 95% of exon-targets 400bp
7 July 29, 2014 Micro-Assembly Approach to detect INDELs 7 Localized Repeats in Human Exome Specificity challenge: 30% of exons have a perfect 10bp or larger repeat
8 July 29, 2014 Micro-Assembly Approach to detect INDELs 8 Tools sensitivity and performance Standard mapping and scanning algorithms (BWA, GATK, SAMTools, SOAP): 1 show low concordance on the same datasets. 2 suitable for detecting mutations only of a few nucleotides. SNPs INDELs Ti/Tv = ratio of the number of transition to transversion substitutions. When there is no bias there are twice as many possible transversions as transitions. Jason O'Rawe et al., Genome Medicine 2013, 5:28
9 July 29, 2014 Micro-Assembly Approach to detect INDELs 9 SCALPEL Micro-assembly pipeline
10 July 29, 2014 Micro-Assembly Approach to detect INDELs 10 Scalpel Novel DNA sequence micro-assembly pipeline to detect mutations within exome-capture data. Whole-Genome assembly Large scale genome structure Genotypic Heuristics to optimize resources (Time and Space) Micro-assembly Detect genome variations Haplotypic (Hom/Het state) Feasible to perform exhaustive search Features: 1. Self-tuning k-mer. 2. On-the-fly repeat composition analysis. 3. Exhaustive search of haplotypes. 4. Family pedigree: joint analysis of family members to detect de novo and transmitted mutations. NRXN1 de novo mutation
11 July 29, 2014 Micro-Assembly Approach to detect INDELs 11 Extract reads reference Build de Bruijn graph K = K+1 Remove low coverage nodes, dead-ends and compress Mark Source and Sink source yes If cycle or near-perfect repeat in any path source sink no sink Traverse graph and enumerate haplotype paths deletion insertion Align to reference
12 July 29, 2014 Micro-Assembly Approach to detect INDELs 12 Sliding window approach Extraction, assembly, alignment and INDEL detection performed in overlapping windows along the genome. 1. Localized assembly (smaller graph). 2. Minimize problem with coverage drops. 3. Distributed approach Father Mother Self Sibling Coverage Genome location
13 July 29, 2014 Micro-Assembly Approach to detect INDELs Scalpel modes of operation Single exome - detects INDELs in one single dataset (e.g., one individual exome) De novo - detects de novo INDELs in one family of four individuals (mom, dad, aff, sib). Somatic: detects somatic INDELs in a tumor/sample pair given in input 13
14 July 29, 2014 Micro-Assembly Approach to detect INDELs 14 LARGE SCALE EXPERIMENT Re-sequencing of 1000 INDELs In collaboration with Lyon Lab
15 July 29, 2014 Micro-Assembly Approach to detect INDELs 15 Simulated Exome Sequencing ~200 thousand exons One INDEL per exon INDEL size: truncated lognormal distribution (μ=1,σ=10) in [1,100]. 100bp perfect reads Coverage distribution based on probe patterns (~30X). SAMtools, UnifiedGenotyper, FreeBayes and Platypus limited power to discover deletions larger than 40bp Scalpel, SOAPindel, and HaplotypeCaller perform the best.
16 July 29, 2014 Micro-Assembly Approach to detect INDELs 16 INDELs in one Exome Individual affected by Obsessive Compulsive Disorder (OCD), Tourette Syndrome, and Psychoses Captured using Agilent SureSelect v.2 and sequenced on the Illumina platform. 80% of the target at >20x coverage INDELs for validation: 200 Scalpel 200 Haplotype Caller 200 SOAPindel 200 within intersection 200 long INDELs (>30bp) Hard to judge the quality of INDELs specific to each pipeline. Superior sensitivity or poor specificity??
17 July 29, 2014 Micro-Assembly Approach to detect INDELs 17 MiSeq validation PCR primers design: Primer 3 used to produce amplicons ranging in size from 200 to 350 bps centered around INDELs. Library construction: TruSeq DNA Sample Prep LS protocol from Illumina. MiSeq sequencing: 250 paired reads. avg cov: 10,000x count HaplotypeCaller Intersected pipelines Large INDELs Scalpel Soap 100,000x depth
18 July 29, 2014 Micro-Assembly Approach to detect INDELs 18 Focus on size distribution Bias towards deletions (for HaplotypeCaller 2.4.3) or insertion (for SOAPindel). Scalpel and HaplotypeCalle v3.0 instead show a well-balanced distribution between insertions and deletions
19 July 29, 2014 Micro-Assembly Approach to detect INDELs 19 Validated INDELs log total 10 Scalpel 77% PPV Invalid Valid size log total 10 SOAPindel 50% PPV Invalid Valid size log total 10 HaplotypeCaller 22% PPV Invalid Valid size INDELs not passing validation correlate well with size bias.
20 July 29, 2014 Micro-Assembly Approach to detect INDELs 20 Validated INDELs log total 10 Scalpel 77% PPV Invalid Valid size log total 10 SOAPindel 50% PPV Invalid Valid size log total 10 HaplotypeCaller 22% PPV Invalid Valid size INDELs not passing validation correlate well with size bias.
21 July 29, 2014 Micro-Assembly Approach to detect INDELs 21 HaplotypeCaller GATK 2.8 Additional validation experiments Frequency GATK v Size (base pairs) Raw Hard-filter Frequency GATK v Size (base pairs) Frequency GATK v Size (base pairs) Scalpel shows a significantly higher validation rate for longer indels (size >5bp)
22 July 29, 2014 Micro-Assembly Approach to detect INDELs 22 Microsatellites instability Position based match Exact match (position+sequence) Due to instability and higher error rate (e.g., homopolymers): Multiple candidates at microsatellite loci. Ref:..TAGCC- AAAAAAATGGTGC..! Alt:..TAGCC-AAAAAAAATGGTGC.. Alt:..TAGCCAAAAAAAAATGGTGC.. Different tools report different signatures.
23 July 29, 2014 Micro-Assembly Approach to detect INDELs 23 Characterization of False Discovery Rate (1) Outside of microsatellites Within microsatellites 614 indels detected by Scalpel and validated by re-sequencing. Higher FDR for mutations within microsatellites. Recommendation on how to select a chi-square score cutoff to achieve a given FDR.
24 July 29, 2014 Micro-Assembly Approach to detect INDELs 24 Characterization of False Discovery Rate (2) Valid Invalid Coverage Chi-Square Score Mutations that do not pass validation (red dots) are generally characterized by lower coverage.
25 July 29, 2014 Micro-Assembly Approach to detect INDELs 25 POPULATION SCALE INDELS ANALYSIS Simons Simplex Collection In collaboration with Wigler Lab
26 July 29, 2014 Micro-Assembly Approach to detect INDELs 26 Simons Simplex Collection ~2700 families. Quad: two parents, one affected child and one unaffected child. NimbleGen SeqCap EZ Exome v2.0 (36 Mb). Illumina HiSeq: ~93bp reads after removing barcodes. Three major studies reporting strong enrichment for de novo gene desrupting mutations in autistic kids: 1 CSHL: Iossifov et al. (2012) Neuron. 74: Yale: Sanders et al. (2012) Nature. 485, WashU: O Roak et al. (2012) Nature. 485,
27 July 29, 2014 Micro-Assembly Approach to detect INDELs 27 Transmitted INDELs 1303 families (5212 individuals) Database with > 12 million INDELs Increased power to detect insertions. Subdivide by DNA context. Goal: discover significant biology that was impossible to measure a few year ago.
28 July 29, 2014 Micro-Assembly Approach to detect INDELs 28 INDEL counts
29 July 29, 2014 Micro-Assembly Approach to detect INDELs 29 Frame-preserving INDELs are more abundant within coding sequences (CDS)
30 July 29, 2014 Micro-Assembly Approach to detect INDELs 30 Deletion-to-Insertion ratio! Category Insertions Deletions DI-ratio Total Intron Intergenic Frame-shift Non-frame-shift UTR Splice site All Twice as many deletion than insertions across all annotation category. In agreement with other recent studies: Montgomery et al., Genome Res. Published in Advance March 11, Sjodin et al. Plos ONE, 2010, Volume 5, Issue 1.
31 July 29, 2014 Micro-Assembly Approach to detect INDELs 31 Transmitted Frame-shifts at lower frequency In agreement with MacArthur et al., Loss-of-function variants in the genomes of healthy humans. Hum. Mol. Genet. 19 (R2): R125-R130
32 July 29, 2014 Micro-Assembly Approach to detect INDELs 32 DE NOVO MUTATIONS IN AUTISM Simons Simplex Collection In collaboration with Wigler Lab
33 July 29, 2014 Micro-Assembly Approach to detect INDELs 33 Unified Genetic Model of Autism Sporadic Autism Sporadic autism: spontaneous mutation with high penetrance in males and relatively poor penetrance in females Family autism: offspring, most often females, who carry a new causative mutation but are unaffected and in turn transmit the mutation in dominant fashion to their offspring. Familial Autism Legend Sporadic muta-on Fails to procreate Michael Wigler et al. PNAS 2007
34 July 29, 2014 Micro-Assembly Approach to detect INDELs 34 Risk by kind of rare genetic variants Population-level enrichment in ASD versus controls of each form of rare mutation studied to date INDELs? Neuron, 2013, volume 77, Issue 2, 23
35 July 29, 2014 Micro-Assembly Approach to detect INDELs 35 De novo INDELs in Autism 1303 families: 686 CSHL, 308 State Lab, and 309 Eichler Lab # INDEL&effect& Aut& Sib& Aut&M& Aut&F& Sib&M& Sib&F& Total& Frame&shift& 85# 47# 66# 19# 31# 16# 132# Intron& 29# 32# 23# 6# 13# 19# 61# Intergenic& 4# 0# 4# 0# 0# 0# 4# No&frame&shift& 9# 9# 9# 0# 3# 6# 18# Splice=site& 3# 1# 3# 0# 1# 0# 4# UTR& 5# 2# 5# 0# 0# 2# 7# Total& 135# 91# 110# 25# 48# 43# 226# De novo INDELs that are likely to severely disrupt the encoded protein are significantly more abundant in affected children than in unaffected siblings Significant overlap (25 out of 132) between the LGD target genes and the set of 842 FMRP-associated genes (Darnell, J.C. et al. Cell, 146: , 2011).
36 July 29, 2014 Micro-Assembly Approach to detect INDELs 36 De novo filters 1 Population filter: Eliminate candidate loci that are common in the population (focus on ultra rare events). 2 Re-assembly each region around candidate INDELs using more sensitive parameters (max sensitivity): Reduce starting k-mer value to 10. turn off removal of low coverage nodes. 3 Final filters: Parents coverage 15x. chi-square (Χ 2 ) coverage score (corresponding to a p-value of 0.001). Χ! =!!!!!!!!!!!!!! +!!!!!!!
37 July 29, 2014 Micro-Assembly Approach to detect INDELs 37 Some statistics Most de novo mutations are small [1bp-6bp] and could be found by the GATK-based pipeline. For a few loci, the sizes of deletions were corrected by micro-assembly compared to BWA+GATK protocol. Validation rate re-sequencing of 102 candidate INDELs; 84 were confirmed as de novo mutations, 11 were invalid and 7 failed, giving an 82% positive predictive rate for Scalpel.
38 July 29, 2014 Micro-Assembly Approach to detect INDELs 38 Suspicious deletion aussc12252, chr16: Suspicious set of reads starting at the same position
39 July 29, 2014 Micro-Assembly Approach to detect INDELs 39 Suspicious deletion Soft-clipped bases Suspicious high number of soft-clipped bases
40 July 29, 2014 Micro-Assembly Approach to detect INDELs 40 Large 33bp de novo deletion MiSeq validation 33bp deletion!
41 July 29, 2014 Micro-Assembly Approach to detect INDELs 41 IGV snapshot aussc13578, 13: bp del 5bp del
42 July 29, 2014 Micro-Assembly Approach to detect INDELs 42 Complex de novo event Deletion of 6 bp together with two adjacent SNPs detected in only one of the two haplotypes of the autistic child (aussc13578, 13: ). Detected as two different deletions (1bp and 5bp) by the GATK-based sequencing pipeline.
43 July 29, 2014 Micro-Assembly Approach to detect INDELs 43 CONCLUSION
44 July 29, 2014 Micro-Assembly Approach to detect INDELs 44 Conclusion Assembly is the missing link towards high accuracy and increased power for INDEL mutation discovery: Allows the algorithm to break free from the expectations of the reference. Extended power crucial for analysis of inherited and somatic mutations. Scalpel: highly accurate tool to detect de novo, transmitted, and somatic INDELs. Errors of current detection software explained by a largescale re-sequencing experiment. Population wide analysis: de novo INDELs in Autism.
45 July 29, 2014 Micro-Assembly Approach to detect INDELs 45 Software & reference Open source project at: Paper preprint: Narzisi G., O'Rawe J.A., Iossifov I., Lee Y., Wang Z., Wu Y., Lyon G.J., Wigler M., Schatz M.C. Accurate detection of de novo and transmitted INDELs within exome-capture data using micro-assembly. CSHL BioRxiv (DOI: /001370). Manuscript accepted for publication in Nature Methods.
46 July 29, 2014 Micro-Assembly Approach to detect INDELs 46 Acknowledgment Michael C. Schatz Michael Wigler Gholson J. Lyon Ivan Iossifov ADHD project Jason O Rawe Han Fang Yiyang Wu Autism project Dan Levy Michael Ronemus Yoonha Lee Zihua Wang Ewa Grabowska Peter Andrews Mitchell Bekritsky Jude Kendall
47 July 29, 2014 Micro-Assembly Approach to detect INDELs 47 THANK YOU
SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA. Giuseppe Narzisi, PhD Schatz Lab
SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA Giuseppe Narzisi, PhD Schatz Lab November 14, 2013 Micro-Assembly Approach to detect INDELs 2 Outline Scalpel micro-assembly pipeline
More informationIlluminating the genetics of complex human diseases
Illuminating the genetics of complex human diseases Michael Schatz Sept 27, 2012 Beyond the Genome @mike_schatz / #BTG2012 Outline 1. De novo mutations in human diseases 1. Autism Spectrum Disorder 2.
More informationReducing INDEL calling errors in whole genome and exome sequencing data.
Reducing INDEL calling errors in whole genome and exome sequencing data. Han Fang November 8, 2014 CSHL Biological Data Science Meeting Acknowledgments Lyon Lab Yiyang Wu Jason O Rawe Laura J Barron Max
More informationThe next 10 years of quantitative biology
The next 10 years of quantitative biology Michael Schatz March 25, 2014 Keystone Meeting on Big Data in Biology @mike_schatz / #KSBigData Unsolved Questions in Biology What is your genome sequence? How
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature13908 Supplementary Tables Supplementary Table 1: Families in this study (.xlsx) All families included in the study are listed. For each family, we show: the genders of the probands and
More informationAdvance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library
Advance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library Marilou Wijdicks International Product Manager Research For Life Science Research Only. Not for Use in Diagnostic Procedures.
More informationAlgorithms for studying the structure and function of genomes
Algorithms for studying the structure and function of genomes Michael Schatz Feb 5, 2015 JHU Dept. of Biology Schatzlab Overview Human Genetics Role of mutations in disease Narzisi et al. (2014) Iossifov
More informationLecture 20. Disease Genetics
Lecture 20. Disease Genetics Michael Schatz April 12 2018 JHU 600.749: Applied Comparative Genomics Part 1: Pre-genome Era Sickle Cell Anaemia Sickle-cell anaemia (SCA) is an abnormality in the oxygen-carrying
More informationHuman Genetics and Plant Genomics: The long and the short of it
Human Genetics and Plant Genomics: The long and the short of it Michael Schatz Simons Center for Quantitative Biology CSHL In-House Symposium XXVI November 20, 2012 Schatz Lab Overview Human Genetics Computation
More informationIdentifying Mutations Responsible for Rare Disorders Using New Technologies
Identifying Mutations Responsible for Rare Disorders Using New Technologies Jacek Majewski, Department of Human Genetics, McGill University, Montreal, QC Canada Mendelian Diseases Clear mode of inheritance
More informationBig Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs
Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs Michael Schatz April 8, 2014 IEEE Fellows Night Syracuse @mike_schatz The secret of life Your DNA, along
More informationCalling DNA variants SNVs, CNVs, and SVs. Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017
1 Calling DNA variants SNVs, CNVs, and SVs Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017 Calling DNA variants SNVs, CNVs, SVs 2 1. What is a variant? 2. Paired End read
More informationMutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research
Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research Application Note Authors John McGuigan, Megan Manion,
More informationBelow, we included the point-to-point response to the comments of both reviewers.
To the Editor and Reviewers: We would like to thank the editor and reviewers for careful reading, and constructive suggestions for our manuscript. According to comments from both reviewers, we have comprehensively
More informationNo mutations were identified.
Hereditary High Cholesterol Test ORDERING PHYSICIAN PRIMARY CONTACT SPECIMEN Report date: Aug 1, 2017 Dr. Jenny Jones Sample Medical Group 123 Main St. Sample, CA Kelly Peters Sample Medical Group 123
More informationAccel-Amplicon Panels
Accel-Amplicon Panels Amplicon sequencing has emerged as a reliable, cost-effective method for ultra-deep targeted sequencing. This highly adaptable approach is especially applicable for in-depth interrogation
More informationUsing the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep
Using the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep Tom Walsh, PhD Division of Medical Genetics University of Washington Next generation sequencing Sanger sequencing gold
More informationMultiplex target enrichment using DNA indexing for ultra-high throughput variant detection
Multiplex target enrichment using DNA indexing for ultra-high throughput variant detection Dr Elaine Kenny Neuropsychiatric Genetics Research Group Institute of Molecular Medicine Trinity College Dublin
More informationInvestigating rare diseases with Agilent NGS solutions
Investigating rare diseases with Agilent NGS solutions Chitra Kotwaliwale, Ph.D. 1 Rare diseases affect 350 million people worldwide 7,000 rare diseases 80% are genetic 60 million affected in the US, Europe
More informationDe Novo Gene Disruptions in Children on the Autistic Spectrum
Article De Novo Gene Disruptions in Children on the Autistic Spectrum Ivan Iossifov, 1,6 Michael Ronemus, 1,6 Dan Levy, 1 Zihua Wang, 1 Inessa Hakker, 1 Julie Rosenbaum, 1 Boris Yamrom, 1 Yoon-ha Lee,
More informationGenome Wide Variant Analysis of Simplex Autism Families with an Integrative Clinical-Bioinformatics Pipeline
Genome Wide Variant Analysis of Simplex Autism Families with an Integrative Clinical-Bioinformatics Pipeline Laura T. Jiménez-Barrón 1,5, Jason A. O Rawe 1,2, Yiyang Wu 1,2, Margaret Yoon 1, Han Fang 1,
More informationAnalysis with SureCall 2.1
Analysis with SureCall 2.1 Danielle Fletcher Field Application Scientist July 2014 1 Stages of NGS Analysis Primary analysis, base calling Control Software FASTQ file reads + quality 2 Stages of NGS Analysis
More informationBreast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS
Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS dr sc. Ana Krivokuća Laboratory for molecular genetics Institute for Oncology and
More informationVariant Classification. Author: Mike Thiesen, Golden Helix, Inc.
Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets
More informationBWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space
Whole genome sequencing Whole exome sequencing BWA alignment to reference transcriptome and genome Convert transcriptome mappings back to genome space genomes Filter on MQ, distance, Cigar string Annotate
More informationWHOLE EXOME SEQUENCING PIPELINE EVALUATION AND MUTATION DETECTION IN ESOPHAGEAL CANCER PATIENTS
WHOLE EXOME SEQUENCING PIPELINE EVALUATION AND MUTATION DETECTION IN ESOPHAGEAL CANCER PATIENTS SUMMARY Tran Thi Bich Ngoc 1 ; Ho Viet Hoanh 2 ; Vu Phuong Nhung 1 ; Nguyen Hai Ha 1 Nguyen Van Ba 2 ; Nguyen
More informationAlgorithms for the analysis of complex genomes
Algorithms for the analysis of complex genomes Michael Schatz Oct 18, 2013 CSHL In House Introductions Srividya Sri Ramakrishnan DOE Systems Biology Knowledgebase Worlds fastest genomics pipelines Tyler
More informationVictor Guryev. European Research Institute for the Biology of Ageing
Victor Guryev European Research Institute for the Biology of Ageing September 29, 2014 Genomic resequencing in Medical diagnostics course Erasmus MC, Rotterdam /a /g Low coverage whole genome and deep
More informationNature Genetics: doi: /ng Supplementary Figure 1
Supplementary Figure 1 Illustrative example of ptdt using height The expected value of a child s polygenic risk score (PRS) for a trait is the average of maternal and paternal PRS values. For example,
More informationMEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)
Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Ordering Information Acceptable specimen types: Fresh blood sample (3-6 ml EDTA; no time limitations associated with receipt)
More informationGenomic structural variation
Genomic structural variation Mario Cáceres The new genomic variation DNA sequence differs across individuals much more than researchers had suspected through structural changes A huge amount of structural
More informationVARIANT PRIORIZATION AND ANALYSIS INCORPORATING PROBLEMATIC REGIONS OF THE GENOME ANIL PATWARDHAN
VARIANT PRIORIZATION AND ANALYSIS INCORPORATING PROBLEMATIC REGIONS OF THE GENOME ANIL PATWARDHAN Email: apatwardhan@personalis.com MICHAEL CLARK Email: michael.clark@personalis.com ALEX MORGAN Email:
More informationNature Biotechnology: doi: /nbt.1904
Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is
More informationColorspace & Matching
Colorspace & Matching Outline Color space and 2-base-encoding Quality Values and filtering Mapping algorithm and considerations Estimate accuracy Coverage 2 2008 Applied Biosystems Color Space Properties
More informationAnalysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers
Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies
More informationDNA-seq Bioinformatics Analysis: Copy Number Variation
DNA-seq Bioinformatics Analysis: Copy Number Variation Elodie Girard elodie.girard@curie.fr U900 institut Curie, INSERM, Mines ParisTech, PSL Research University Paris, France NGS Applications 5C HiC DNA-seq
More informationFluxion Biosciences and Swift Biosciences Somatic variant detection from liquid biopsy samples using targeted NGS
APPLICATION NOTE Fluxion Biosciences and Swift Biosciences OVERVIEW This application note describes a robust method for detecting somatic mutations from liquid biopsy samples by combining circulating tumor
More informationCRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery
CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery PacBio CoLab Session October 20, 2017 For Research Use Only. Not for use in diagnostics procedures. Copyright 2017 by Pacific Biosciences
More informationNGS in tissue and liquid biopsy
NGS in tissue and liquid biopsy Ana Vivancos, PhD Referencias So, why NGS in the clinics? 2000 Sanger Sequencing (1977-) 2016 NGS (2006-) ABIPrism (Applied Biosystems) Up to 2304 per day (96 sequences
More informationChIP-seq data analysis
ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional
More informationBig Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs
Big Data Meets DNA How Biological Data Science is improving our health, foods, and energy needs Michael Schatz June 18, 2014 CSHL Public Lecture Series DNA: The secret of life Your DNA, along with your
More informationAVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits
AVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits Accelerating clinical research Next-generation sequencing (NGS) has the ability to interrogate many different genes and detect
More informationGolden Helix s End-to-End Solution for Clinical Labs
Golden Helix s End-to-End Solution for Clinical Labs Steven Hystad - Field Application Scientist Nathan Fortier Senior Software Engineer 20 most promising Biotech Technology Providers Top 10 Analytics
More informationWelcome to the Genetic Code: An Overview of Basic Genetics. October 24, :00pm 3:00pm
Welcome to the Genetic Code: An Overview of Basic Genetics October 24, 2016 12:00pm 3:00pm Course Schedule 12:00 pm 2:00 pm Principles of Mendelian Genetics Introduction to Genetics of Complex Disease
More informationDr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.
Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag
More informationDe Novo Viral Quasispecies Assembly using Overlap Graphs
De Novo Viral Quasispecies Assembly using Overlap Graphs Alexander Schönhuth joint with Jasmijn Baaijens, Amal Zine El Aabidine, Eric Rivals Milano 18th of November 2016 Viral Quasispecies Assembly: HaploClique
More informationDetection of copy number variations in PCR-enriched targeted sequencing data
Detection of copy number variations in PCR-enriched targeted sequencing data German Demidov Parseq Lab, Saint-Petersburg University of Russian Academy of Sciences, current: Center for Genomic Regulation
More informationGenetics and Genomics in Medicine Chapter 8 Questions
Genetics and Genomics in Medicine Chapter 8 Questions Linkage Analysis Question Question 8.1 Affected members of the pedigree above have an autosomal dominant disorder, and cytogenetic analyses using conventional
More informationCITATION FILE CONTENT/FORMAT
CITATION For any resultant publications using please cite: Matthew A. Field, Vicky Cho, T. Daniel Andrews, and Chris C. Goodnow (2015). "Reliably detecting clinically important variants requires both combined
More informationSupplementary Figure 1. Estimation of tumour content
Supplementary Figure 1. Estimation of tumour content a, Approach used to estimate the tumour content in S13T1/T2, S6T1/T2, S3T1/T2 and S12T1/T2. Tissue and tumour areas were evaluated by two independent
More informationAVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB
Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits Next-generation performance in liquid biopsies 2 Accelerating clinical research From liquid biopsy to next-generation
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature11396 A total of 2078 samples from a large sequencing project at decode were used in this study, 219 samples from 78 trios with two grandchildren who were not also members of other trios,
More informationPERSONALIZED GENETIC REPORT CLIENT-REPORTED DATA PURPOSE OF THE X-SCREEN TEST
INCLUDED IN THIS REPORT: REVIEW OF YOUR GENETIC INFORMATION RELEVANT TO ENDOMETRIOSIS PERSONAL EDUCATIONAL INFORMATION RELEVANT TO YOUR GENES INFORMATION FOR OBTAINING YOUR ENTIRE X-SCREEN DATA FILE PERSONALIZED
More informationgenomics for systems biology / ISB2020 RNA sequencing (RNA-seq)
RNA sequencing (RNA-seq) Module Outline MO 13-Mar-2017 RNA sequencing: Introduction 1 WE 15-Mar-2017 RNA sequencing: Introduction 2 MO 20-Mar-2017 Paper: PMID 25954002: Human genomics. The human transcriptome
More informationHands-On Ten The BRCA1 Gene and Protein
Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such
More informationFrequency(%) KRAS G12 KRAS G13 KRAS A146 KRAS Q61 KRAS K117N PIK3CA H1047 PIK3CA E545 PIK3CA E542K PIK3CA Q546. EGFR exon19 NFS-indel EGFR L858R
Frequency(%) 1 a b ALK FS-indel ALK R1Q HRAS Q61R HRAS G13R IDH R17K IDH R14Q MET exon14 SS-indel KIT D8Y KIT L76P KIT exon11 NFS-indel SMAD4 R361 IDH1 R13 CTNNB1 S37 CTNNB1 S4 AKT1 E17K ERBB D769H ERBB
More informationCharacterisation of structural variation in breast. cancer genomes using paired-end sequencing on. the Illumina Genome Analyser
Characterisation of structural variation in breast cancer genomes using paired-end sequencing on the Illumina Genome Analyser Phil Stephens Cancer Genome Project Why is it important to study cancer? Why
More informationJULY 21, Genetics 101: SCN1A. Katie Angione, MS CGC Certified Genetic Counselor CHCO Neurology
JULY 21, 2018 Genetics 101: SCN1A Katie Angione, MS CGC Certified Genetic Counselor CHCO Neurology Disclosures: I have no financial interests or relationships to disclose. Objectives 1. Review genetic
More informationAlgorithms for de novo genome assembly and disease analytics
Algorithms for de novo genome assembly and disease analytics Michael Schatz April 7, 2014 Hamilton College Schatz Lab Overview Computation Human Genetics Sequencing Modeling Plant Genomics Introductions
More informationAlgorithms for de novo genome assembly and disease analytics
Algorithms for de novo genome assembly and disease analytics Michael Schatz Feb 11, 2014 IDIES Seminar, Johns Hopkins University Outline 1. Biological Data Science 2. De novo genome assembly 3. Disease
More informationNGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation
NGS in Cancer Pathology After the Microscope: From Nucleic Acid to Interpretation Michael R. Rossi, PhD, FACMG Assistant Professor Division of Cancer Biology, Department of Radiation Oncology Department
More informationSingle-strand DNA library preparation improves sequencing of formalin-fixed and paraffin-embedded (FFPE) cancer DNA
www.impactjournals.com/oncotarget/ Oncotarget, Supplementary Materials 2016 Single-strand DNA library preparation improves sequencing of formalin-fixed and paraffin-embedded (FFPE) DNA Supplementary Materials
More informationNature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.
Supplementary Figure 1 PCA for ancestry in SNV data. (a) EIGENSTRAT principal-component analysis (PCA) of SNV genotype data on all samples. (b) PCA of only proband SNV genotype data. (c) PCA of SNV genotype
More informationMSI positive MSI negative
Pritchard et al. 2014 Supplementary Figure 1 MSI positive MSI negative Hypermutated Median: 673 Average: 659.2 Non-Hypermutated Median: 37.5 Average: 43.6 Supplementary Figure 1: Somatic Mutation Burden
More informationP. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University.
Databases and Tools for High Throughput Sequencing Analysis P. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University. HTseq Platforms Applications on Biomedical Sciences
More informationSimple, rapid, and reliable RNA sequencing
Simple, rapid, and reliable RNA sequencing RNA sequencing applications RNA sequencing provides fundamental insights into how genomes are organized and regulated, giving us valuable information about the
More informationWhole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute
Whole Genome and Transcriptome Analysis of Anaplastic Meningioma Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Outline Anaplastic meningioma compared to other cancers Whole genomes
More informationDOES THE BRCAX GENE EXIST? FUTURE OUTLOOK
CHAPTER 6 DOES THE BRCAX GENE EXIST? FUTURE OUTLOOK Genetic research aimed at the identification of new breast cancer susceptibility genes is at an interesting crossroad. On the one hand, the existence
More informationNGS ONCOPANELS: FDA S PERSPECTIVE
NGS ONCOPANELS: FDA S PERSPECTIVE CBA Workshop: Biomarker and Application in Drug Development August 11, 2018 Rockville, MD You Li, Ph.D. Division of Molecular Genetics and Pathology Food and Drug Administration
More informationPersonalis ACE Clinical Exome The First Test to Combine an Enhanced Clinical Exome with Genome- Scale Structural Variant Detection
Personalis ACE Clinical Exome The First Test to Combine an Enhanced Clinical Exome with Genome- Scale Structural Variant Detection Personalis, Inc. 1350 Willow Road, Suite 202, Menlo Park, California 94025
More informationRaymond Auerbach PhD Candidate, Yale University Gerstein and Snyder Labs August 30, 2012
Elucidating Transcriptional Regulation at Multiple Scales Using High-Throughput Sequencing, Data Integration, and Computational Methods Raymond Auerbach PhD Candidate, Yale University Gerstein and Snyder
More informationCentoXome FUTURE'S KNOWLEDGE APPLIED TODAY
CentoXome FUTURE'S KNOWLEDGE APPLIED TODAY More genetic information requires cutting-edge interpretation techniques Whole Exome Sequencing For certain patients the combination of symptoms does not allow
More informationThe feasibility of circulating tumour DNA as an alternative to biopsy for mutational characterization in Stage III melanoma patients
The feasibility of circulating tumour DNA as an alternative to biopsy for mutational characterization in Stage III melanoma patients ASSC Scientific Meeting 13 th October 2016 Prof Andrew Barbour UQ SOM
More informationPerformance Characteristics BRCA MASTR Plus Dx
Performance Characteristics BRCA MASTR Plus Dx with drmid Dx for Illumina NGS systems Manufacturer Multiplicom N.V. Galileïlaan 18 2845 Niel Belgium Table of Contents 1. Workflow... 4 2. Performance Characteristics
More informationAlgorithms for de novo genome assembly and disease analytics
Algorithms for de novo genome assembly and disease analytics Michael Schatz Feb 18, 2014 CCMB, Brown University Outline 1. De novo assembly by analogy 2. Long Read Assembly 3. Disease Analytics Shredded
More informationTitle:Exome sequencing helped the fine diagnosis of two siblings afflicted with atypical Timothy syndrome (TS2)
Author's response to reviews Title:Exome sequencing helped the fine diagnosis of two siblings afflicted with atypical Timothy syndrome (TS2) Authors: Sebastian Fröhler (Sebastian.Froehler@mdc-berlin.de)
More information!"##"$%#"&!'&$'()$(%&'*& Terapia Pediatrica e Farmacologia dello Sviluppo +,-./&01,23&34,53& :&;.<&2-.=;:3&;.;2>6-6&-.&;&
!!! "#$%&'($)*!+&,-$!.)/+$!+$!01,-$1'$!!"##"$%#"&!'&$'()$(%&'*& Terapia Pediatrica e Farmacologia dello Sviluppo 0$2-3!44566!! +,-./&01,23&34,53&63783.9-.:&;.6-6&-.&;& 582/-:3.3?;/-,.;2&@;5-2>&63:?3:;/-.:&#>A3&B&!-;C3/36&!.&))3'&!(2$&#)$7$23!+$(2$8-$#1'&!+$!177&'[!!
More informationThe Importance of Coverage Uniformity Over On-Target Rate for Efficient Targeted NGS
WHITE PAPER The Importance of Coverage Uniformity Over On-Target Rate for Efficient Targeted NGS Yehudit Hasin-Brumshtein, Ph.D., Maria Celeste M. Ramirez, Ph.D., Leonardo Arbiza, Ph.D., Ramsey Zeitoun,
More informationIntroduction to genetic variation. He Zhang Bioinformatics Core Facility 6/22/2016
Introduction to genetic variation He Zhang Bioinformatics Core Facility 6/22/2016 Outline Basic concepts of genetic variation Genetic variation in human populations Variation and genetic disorders Databases
More informationCentoXome FUTURE'S KNOWLEDGE APPLIED TODAY
CentoXome FUTURE'S KNOWLEDGE APPLIED TODAY More genetic information requires cutting-edge interpretation techniques Whole Exome Sequencing For some patients, the combination of symptoms does not allow
More informationProblem 3: Simulated Rheumatoid Arthritis Data
Problem 3: Simulated Rheumatoid Arthritis Data Michael B Miller Michael Li Gregg Lind Soon-Young Jang The plan
More informationAnalysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing
www.sciencemag.org/cgi/content/full/science.1186802/dc1 Supporting Online Material for Analysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing Jared C. Roach, Gustavo Glusman, Arian
More informationStructural Variation and Medical Genomics
Structural Variation and Medical Genomics Andrew King Department of Biomedical Informatics July 8, 2014 You already know about small scale genetic mutations Single nucleotide polymorphism (SNPs) Deletions,
More informationAbstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction
Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant
More informationNew Enhancements: GWAS Workflows with SVS
New Enhancements: GWAS Workflows with SVS August 9 th, 2017 Gabe Rudy VP Product & Engineering 20 most promising Biotech Technology Providers Top 10 Analytics Solution Providers Hype Cycle for Life sciences
More informationGenetic Approaches to Alcoholism, Alcohol Abuse Susceptibility, and Therapeutic Response. Ray White Verona March
Genetic Approaches to Alcoholism, Alcohol Abuse Susceptibility, and Therapeutic Response Ray White Verona March 18. 2008 Craig Venter Genome 2,810,000,000 bases, 7.5-fold coverage 3,213,401 SNPs 85% in
More informationIntegration of Cancer Genome into GECCO- Genetics and Epidemiology of Colorectal Cancer Consortium
Integration of Cancer Genome into GECCO- Genetics and Epidemiology of Colorectal Cancer Consortium Ulrike Peters Fred Hutchinson Cancer Research Center University of Washington U01-CA137088-05, PI: Peters
More informationA Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis
A Practical Guide to Integrative Genomics by RNA-seq and ChIP-seq Analysis Jian Xu, Ph.D. Children s Research Institute, UTSW Introduction Outline Overview of genomic and next-gen sequencing technologies
More informationHome Brewed Personalized Genomics
Home Brewed Personalized Genomics The Quest for Meaningful Analysis Results of a 23andMe Exome Pilot Trio of Myself, Wife, and Son February 22, 2013 Gabe Rudy, Vice President of Product Development Exome
More informationNGS for Cancer Predisposition
NGS for Cancer Predisposition Colin Pritchard MD, PhD University of Washington Dept. of Lab Medicine AMP Companion Society Meeting USCAP Boston March 22, 2015 Disclosures I am an employee of the University
More informationNature Neuroscience: doi: /nn Supplementary Figure 1
Supplementary Figure 1 Illustration of the working of network-based SVM to confidently predict a new (and now confirmed) ASD gene. Gene CTNND2 s brain network neighborhood that enabled its prediction by
More informationInteractive analysis and quality assessment of single-cell copy-number variations
Interactive analysis and quality assessment of single-cell copy-number variations Tyler Garvin, Robert Aboukhalil, Jude Kendall, Timour Baslan, Gurinder S. Atwal, James Hicks, Michael Wigler, Michael C.
More informationDe novo assembly of complex genomes
De novo assembly of complex genomes Michael Schatz April 10, 2013 CPHG, University of Virginia Schatz Lab Overview Computation Human Genetics Sequencing Modeling Plant Genomics Outline 1. Genome assembly
More informationTo test the possible source of the HBV infection outside the study family, we searched the Genbank
Supplementary Discussion The source of hepatitis B virus infection To test the possible source of the HBV infection outside the study family, we searched the Genbank and HBV Database (http://hbvdb.ibcp.fr),
More informationMEDICAL GENOMICS LABORATORY. Peripheral Nerve Sheath Tumor Panel by Next-Gen Sequencing (PNT-NG)
Peripheral Nerve Sheath Tumor Panel by Next-Gen Sequencing (PNT-NG) Ordering Information Acceptable specimen types: Blood (3-6ml EDTA; no time limitations associated with receipt) Saliva (OGR-575 DNA Genotek;
More informationIdentification of genomic alterations in cervical cancer biopsies by exome sequencing
Chapter- 4 Identification of genomic alterations in cervical cancer biopsies by exome sequencing 105 4.1 INTRODUCTION Athough HPV has been identified as the prime etiological factor for cervical cancer,
More informationSupplementary Methods
Supplementary Methods Short Read Preprocessing Reads are preprocessed differently according to how they will be used: detection of the variant in the tumor, discovery of an artifact in the normal or for
More informationEvaluation of MIA FORA NGS HLA test and software. Lisa Creary, PhD Department of Pathology Stanford Blood Center Research & Development Group
Evaluation of MIA FORA NGS HLA test and software Lisa Creary, PhD Department of Pathology Stanford Blood Center Research & Development Group Disclosure Alpha and Beta Studies Sirona Genomics Reagents,
More information2/10/2016. Evaluation of MIA FORA NGS HLA test and software. Disclosure. NGS-HLA typing requirements for the Stanford Blood Center
Evaluation of MIA FORA NGS HLA test and software Lisa Creary, PhD Department of Pathology Stanford Blood Center Research & Development Group Disclosure Alpha and Beta Studies Sirona Genomics Reagents,
More information