Recherche de variants génomiques en oncologie clinique. Avec des diapos, données & scripts R de: Yannick Boursin, IGR Bastien Job, IGR
|
|
- Claude Curtis
- 5 years ago
- Views:
Transcription
1 Recherche de variants génomiques en oncologie clinique Avec des diapos, données & scripts R de: Yannick Boursin, IGR Bastien Job, IGR
2 Génétique constitutionnelle At hospital Blood sample Sequence gene panel Look for specific alteration (BRCA) Research Genotype or Sequence Compare disease and healthy cohorts GWAS studies, 1000 Genome Project
3 NGS dans le diagnostic de génétique familiale BRCA1/2 (breast/ovary cancer) XPC, XPV.. (melanoma) ERCC1 (colorectal cancer) 10/19/16 Yannick Boursin 3
4 Génétique somatique Finding somatic mutations in the tumor genome Normal tissue (blood) Tumor tissue Extract and sequence DNA Look for mutations, structural variations Repeat for several patients Look for recurrent events Warning: ,000 SNVs between a tumor and normal tissue from the same person (genome-wide)
5 NGS in precision medicine trials Clinical trials: MOSCATO (GR), SAFIR (GR), SHIVA (Curie), Ipilimumab (anti-ctla4), Nivolumab (anti-pd1), Trastuzumab (anti-her2), Cetuximab (anti-egfr) 10/19/16 Yannick Boursin 5
6 Sequencer quoi? Panel de gènes Une série d exons d intérêt (gènes de cancer= 100kb) Exome Tous les exons du génome (30 Mb) Whole genome Le génome complet (3 Gb)
7 Un pipeline «Variants»
8 Un vrai pipeline «variants» 10 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration
9 FastQC Metrics Look at the different metrics for both reads Problem: the per base sequence quality of the Read2 are quite lowtowards the end Solution: Trim the 25bp from the 3 end of the reads Ø Higher confidence in the sequenced information (FastqTrimmer)
10 Illumina Illumina run C Illumina run C Illumina run D Illumina run D
11 PGM PGM run A PGM run A PGM run B PGM run B
12 Trimming and discarding low quality reads A first Quality Control of raw reads is mandatory and can be established according to the application ('N', adapter sequences, barcode, contamination, etc.) ACTGATTAGTCTGAATTAGANNGATAGGAT GATCGATGCATAGCGATCAGCATCGATACG CGGCGCTCCGCTCTCGAAACTAGCATCGAC ACTGAC AGCATCAGGATCTACGATCTAGCGAACTGAC ACTGAC ACTACTTACGACATCGAGGTTAGGAGCATCA ACTAGGCATCGGCATCACGGACNNNNNNNN ACTAGCTATCGAGCTATCAGCGAGCATCTATC ACTAGCTACTATCGAGCGAGCGATCATCGAC CTGACTACTATCGAGCGAGCTACTAACTGAC ACTGAC ACTATCAGCTAGCGCTTCAGCATTACCGT ACTANNGACTAGGAATTAGCTACTGAGCTAC ACTAGCAGCTATATGAGCTACTAGCACTGAC ACTGAC NNNNNNNNNNNNNNNNNNNNNNNNNNNNN 10/19/16 Yannick Boursin Processed reads: blue parts are to be kept, green and red parts to be 12 removed
13 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration 7-9 AVRIL 2014
14 Alignement des reads sur le génome de référence A G T C Programmes: Bowtie, BWA, Tophat, STAR, HiSat
15 Alignment key parameters - Repeats Approximately 50% of the human genome is comprised of repeats Treangen T.J. and Salzberg S.L Nature review Genetics 13, /19/16 NGS and Bioinformatics Yannick Boursin 15
16 Alignment key parameters Repeats 3 strategies -1- Report only unique alignment -2- Report best alignments and randomly assign reads across equaly good loci -3- Report all (best) alignments A B A B A B Treangen T.J. and Salzberg S.L Nature review Genetics 13, /19/16 Yannick Boursin 16
17 Alignment key parameters Using single or paired-end reads? The type of sequencing (i.e. single or paired-end reads) is often driven by the application. Exemple : Finding large indels, genomic rearrangements,... However, in most cases, the pair information can improve mapping specificity - Single-end alignment repeated sequence A C G A C T C A C G A C T C Reference Genome Sequence A C T A C G A C T C T A C G A G C A T C T A C G A G C T A C T A G C G A T C T A C G A G C T G C G A G C A A C G GC C A A C - Paired-end alignment unique sequence A C G A C T C G G C C A A C A C G A C T C G G C C A A C Reference Genome Sequence A C T A C G A C T C T A C G A G C A T C T A C G A G C T A C T A G C G A T C T A C G A G C T G C G A G C A A C G GC C A A C Yannick Boursin 17
18 Most popular aligners for variant analysis (support mismatched, gapped, paired-end alignment) BWA Li H. and Durbin R. (2009) Bowtie2 Langmead B, Salzberg S (2012)
19 Reads alignés sur le génome
20 Format SAM Contient les séquences alignées sur le génome Dans l idéal : chr ACGTGCGTTTGCGT chr GCGTGATGCGTAAG chr GTATGTTATATGTA
21 Dans la réalité: Format SAM
22 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration 7-9 AVRIL 2014
23 Target intersection Comparer l alignement obtenu à la liste des positions visées par le protocole de capture Exon capturé Alignement non fiable Exon capturé
24 Le variant calling
25 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration 7-9 AVRIL 2014
26 Why realign around indels? Small Insertion/deletion (Indels) in reads (especially near the ends) can trick the mappers into wrong alignments Ø Alignment scoring cheaper to introduce multiple Single Nucleotide Variants (SNVs) than an indel: induce a lot of false positive SNVs èartifactual mismatches Realignment around indels helps improve the downstream processing steps Formation NGS & Cancer - Analyses Exome
27 Local realignment identifies most parsimonious alignment along all reads at problematic loci 1. Find the best alternate consensus sequence that, together with the reference, best fits the reads in a pile 3 adjacent SNPs Realigning determines which is better consistent with the reference consistent with a 3bp insertion 2. The score for an alternate consensus is the total sum of the quality scores of mismatched bases 3. If the score of the best alternate consensus is sufficiently better than the original alignment, then we accept the proposed realignment of the reads Formation NGS & Cancer - Analyses Exome
28 Local realignment around indels Before SNVs After SNVs Deletion Deletion Formation NGS & Cancer - Analyses Exome
29 Local realignment around indels Formation NGS & Cancer - Analyses Exome
30 Local realignment around indels Formation NGS & Cancer - Analyses Exome
31 Three types of realignment targets Indels seen in original alignments (in CIGAR, indicated by I for Insertion or D for Deletion) Sites where evidences suggest a hidden indel (SNV abundance) Known sites: Ø Common polymorphisms: dbsnp, 1000Genomes Formation NGS & Cancer - Analyses Exome
32 Known sites Why are they important? Each tool uses known sites differently, but what is common to all is that they use them to help distinguish true variants from false positives, which is very important to how these tools work. If you don't provide known sites, the statistical analysis of the data will be skewed, which can dramatically affect the sensitivity and reliability of the results. 2. Recommended sets of known sites per tool Formation NGS & Cancer - Analyses Exome
33 Indel realignment steps/tools 1. Identify what regions need to be realigned Ø RealignerTargetCreator + known sites Intervals 2. Perform the actual realignment (BAM output) Ø IndelRealigner Formation NGS & Cancer - Analyses Exome
34 The quality scores issued by sequencers are biased Quality scores are critical for all downstream analysis Systematic biases are a major contributor to bad calls Example of sequence context bias in the reported qualities: before Formation NGS & Cancer - Analyses Exome after
35 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration 7-9 AVRIL 2014
36 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration 7-9 AVRIL 2014
37 Pileup format Describes base-pair information at each position Reference base chr A 108.$,$.,,.,,..,,.,.,..,.,,.,.,,.,..,,..,,.,.,.,.,,.,.,.,,,.,.,.,.,,,.,.,.,.,.,,.,.,.,.,,,,.,,.,.,..,,,,,.,.,,,,^F, Base qualities A 00 chr C 108 Number of reads covering the site (total depth).$t,.,,.t,tt,.,t.,.,t.tttt.ttt,t.t,ttt.tt,t,,.t TtTttt.,.,Tt.ttt.,T,.,.tT,,T,T,.tT,,t,TttTtT,T.,tt,,T,T,tttt^F.^F, Read bases:. /, = match on forward/reverse strand ??6??6 66=????? 6=7??=???<8@7=7=? 77?7?8??78?7????? 8 <8??88 9?8?0048 ACGTN / acgtn = mismatch on forward/reverse strand `-\+[0-9]+[ACGTNacgtn]+ indicates an indel Formation NGS & Cancer - Analyses Exome
38 Analyse du PileUp
39 Analyse du PileUp
40 Analyse du PileUp
41 Depth of Coverage Depth of Coverage = number of reads supporting one position ex: 1X, 5X, 100X... >1000X Reference Base SNV Sequencing Error Reference Genome Aligned Reads 2X 7X 5X brin + 2X brin - 17X 9X brin + 8X brin Calling Confidence +++ NGS - Applications and Analysis 100% = SNV Homozygote 50% 50% = SNV Heterozygote SNV and sequence context (errors) Formation NGS & Cancer - Analyses Exome
42 Variant Calling Factors to consider when calling a SNVs: Base call qualities of each supporting base (base quality) Proximity to small indels, or homopolymer run Mapping qualities of the reads supporting the SNP Sequencing depth: >=30x for constit ; >=100 for tumor SNVs position within the reads: Higher error rate at the reads ends Look at strand bias (SNVs supported by only one strand are more likely to be artifactual) Allelic frequency: Tumor cellularity will reduce the % of an heterozygous variant Higher stringency when calling indels (and sanger validation often needed) Formation NGS & Cancer - Analyses Exome
43 VarScan2 Mutation caller written in Java (no installation required) working with Pileup files of Targeted, Exome, and Whole-Genome sequencing data (DNAseq or RNAseq) Multi-platforms: Illumina, SOLiD, Life/PGM, Roche/454 Detection of different kinds of Germline SNVs/Indels (classical mode): Ø Variants in individual samples Ø Multi-sample variants shared or private in multi-sample datasets VarScan specificity is to be able to work with Tumor/Normal pairs (somatic mode): Ø Somatic and germline mutation, LOH events in tumor-normal pairs Ø Somaticcopy number alterations (CNAs) in tumor-normal exome data Formation NGS & Cancer - Analyses Exome
44 VarScan2 Most published variant callers use Bayesian statistics (a probabilistic framework) to detect variants and assess confidence in them (e.g.: GATK) VarScan uses a robust heuristic/statistic approach to call variants that meet desired thresholds for read depth, base quality, variant allele frequency, and statistical significance Stead et al. (2013) compared 3 different somatic callers : MuTect, Strelka, VarScan2 Ø VarScan2 performed best overall with sequencing depths of 100x, 250x, 500x and 1000x required to accuratelyidentify variants present at 10%, 5%, 2.5% and 1% respectively Formation NGS & Cancer - Analyses Exome
45 Somatic calling
46 Un variant tumoral: polymorphisme ou mutation?
47 Un polymorphisme «normal» (SNP) GERMLINE TISSUE TUMORAL TISSUE
48 Une mutation somatique GERMLINE TISSUE TUMORAL TISSUE
49 Varscan s Somatic P-value Variant Calling and Comparison At every position where both normal and tumor have sufficient coverage, a comparison is made. First, normal and tumor are called independently using the germline consensus calling functionality. Then, their genotypes are compared by the following algorithm: If tumor does not match normal: Calculate significance of allele frequency difference by Fisher's Exact Test If difference is significant (p-value < threshold): If normal matches reference ==> Call Somatic Else If normal is heterozygous ==> Call LOH Else normal and tumor are variant, but different ==> Call IndelFilter or Unknown If difference is not significant: Combined tumor and normal read counts for each allele. Recalculate p-value. ==> Call Germline
50 Format VCF
51 VarScan Tabulated Format Chrom Position Ref Cons Reads1 Reads2 VarFreq Strands Strands 1 2 Qual1 Qual2 Pvalue Map Qual1 Map Qual2 R1 + R1 - R2 + Rs2 - chr C Y % T chr G R % A chr G A % A chr G A % A Alt Cons : Consensus Genotype of Variant Called (IUPAC code): M -> A or C Y -> C or T D -> A or G or T W -> A or T V -> A or C or G R -> A or G K -> G or T B -> C or G or T S -> C or G H -> A or C or T Formation NGS & Cancer - Analyses Exome
52 Reads (Fastq) Reference genome (Fasta) Variant Selection Manual Variant Annotation Anovar Quality Control FastQC Mapping Bowtie2 Variant calling Varscan Somatic Index Samtools Mpileup Target Regions (Bed) Target Intersection Intersect Bam Preprocess #1 GATK local realignment Preprocess #2 GATK QC recalibration 7-9 AVRIL 2014
53 Different types of SNVs SNVs and short indels are the most frequent events: Ø Intergenic Ø Intronic Ø cis-regulatory Ø splice sites Ø frameshift or not Ø synonymous or not Ø benign or damaging etc... Example of SNV one want to pinpoint: Ø non-synonymous + highly deleterious + somatically acquired Formation NGS & Cancer - Analyses Exome
54 Resources dedicated to human genetic variations dbsnp and 1000-genomes Ø Population-scale DNA polymorphisms COSMIC Ø Catalogue Of Somatic Mutations In Cancer Non synonymous SNVs predictions Ø SIFT, Polyphen2 (damaging impact)... PhyloP, GERP++ (conservation) ANNOVAR Tools to annotate genetic variations Formation NGS & Cancer - Analyses Exome
55 Annovar «Annovar» annotates SNVs and Indels Takes Multi sample VCF (Tumor &+normal samples) Ø RefGene: Gene & Function & AminoAcid Change (HGVS format: c.a155g ; p.lys45arg) Ø 1000g2012apr_all: Minor Allele Frequency for all ethnies Ø ESP6500: Exome Sequencing Project Ø Ljb_all : predictions (SIFT, Polyphen2, LRT, MutationTaster, PhyloP, GERP++) ² Tabulated file Formation NGS & Cancer - Analyses Exome
56 Ng & Henikoff, Genome Res Utilise la conservation des domaines protéiques comme indication du caractère délétère d une substitution Classe en D(eleterious), T(olerant),.(unknown) effet réél: LacI
57 PolyPhen2 Adzhubei et al. Nature Methods Probabilistic classifier: Estimates the probability of the missense mutation being damaging based on a combination of seq+struct properties. Classe en: Benign, Possibly damaging, or probably Damaging
58 Coverage & Allelic Frequencies For CNV detection
59 Detection of copy-number variations Are there any copy-numberalteration (gain or lossof chomosomal regions, amplifications ) that could explain tumorigenesis? Yannick Boursin 59
60 La fraction allélique Vocabulaire: Germline/population: Allelic frequency, MAF (minor allele frequency). Par ex. dans données 1000Genomes. Somatic: Allelic fraction (mais souvent on utilise VAF or BAF: variant allele frequency) Où trouver l info? Colonne info#af dans VCF 10/19/16 Yannick Boursin 60
61 BAF Cellularité et Fréquence Allelique
62 Segmentation et fréquence allélique BAF Rratio R ratio=utilisé en CGH, =couverture en NGS Scott et al. Gene 2014
63 Les «signatures d Alexandrov» Yannick Boursin L. Alexandrov M Stratton, Nature
64 Signatures et origine des tumeurs 10/19/16 Yannick Boursin 64 L. Alexandrov M Stratton, Nature 201
65 Données exercice Données de cancer ovarien WES Normal + tumeur Original BAM: 10Go 2 fichiers tabulés: tumor-xxx : Tous les variants de la tumeur (somatique ou non) som-xxx : Les variants déclarés somatiques (par comparaison avec normal) et annotés Pipeline: Alignement bwa Post traitement bam picardtools sortsam GATK indelrealigner GATK variantrecalibration Variant Calling samtools mpileup varscan2 somatic Filtering variants somatiques ou LOH & Somatic P-value < 10^-3 Annotation SNPeff, SNPSIFT Script VCF>Tabulé
66 Fichier «questions» Atelier pratique
DNA-seq Bioinformatics Analysis: Copy Number Variation
DNA-seq Bioinformatics Analysis: Copy Number Variation Elodie Girard elodie.girard@curie.fr U900 institut Curie, INSERM, Mines ParisTech, PSL Research University Paris, France NGS Applications 5C HiC DNA-seq
More informationMutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research
Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research Application Note Authors John McGuigan, Megan Manion,
More informationNGS, Cancer and Bioinforma;cs. 20/10/15 Yannick Boursin
NGS, Cancer and Bioinforma;cs 1 NGS and Clinical Oncology NGS in hereditary cancer genome tes;ng BRCA1/2 (breast/ovary cancer) XPC (melanoma) ERCC1 (colorectal cancer) NGS for personalized cancer treatment
More informationGolden Helix s End-to-End Solution for Clinical Labs
Golden Helix s End-to-End Solution for Clinical Labs Steven Hystad - Field Application Scientist Nathan Fortier Senior Software Engineer 20 most promising Biotech Technology Providers Top 10 Analytics
More informationSupplementary Methods
Supplementary Methods Short Read Preprocessing Reads are preprocessed differently according to how they will be used: detection of the variant in the tumor, discovery of an artifact in the normal or for
More informationAnalysis with SureCall 2.1
Analysis with SureCall 2.1 Danielle Fletcher Field Application Scientist July 2014 1 Stages of NGS Analysis Primary analysis, base calling Control Software FASTQ file reads + quality 2 Stages of NGS Analysis
More informationSupplementary Figure 1. Estimation of tumour content
Supplementary Figure 1. Estimation of tumour content a, Approach used to estimate the tumour content in S13T1/T2, S6T1/T2, S3T1/T2 and S12T1/T2. Tissue and tumour areas were evaluated by two independent
More informationAbstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction
Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant
More informationIdentifying Mutations Responsible for Rare Disorders Using New Technologies
Identifying Mutations Responsible for Rare Disorders Using New Technologies Jacek Majewski, Department of Human Genetics, McGill University, Montreal, QC Canada Mendelian Diseases Clear mode of inheritance
More informationAnalysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers
Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies
More informationWhole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute
Whole Genome and Transcriptome Analysis of Anaplastic Meningioma Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute Outline Anaplastic meningioma compared to other cancers Whole genomes
More information6/12/2018. Disclosures. Clinical Genomics The CLIA Lab Perspective. Outline. COH HopeSeq Heme Panels
Clinical Genomics The CLIA Lab Perspective Disclosures Raju K. Pillai, M.D. Hematopathologist / Molecular Pathologist Director, Pathology Bioinformatics City of Hope National Medical Center, Duarte, CA
More informationWHOLE EXOME SEQUENCING PIPELINE EVALUATION AND MUTATION DETECTION IN ESOPHAGEAL CANCER PATIENTS
WHOLE EXOME SEQUENCING PIPELINE EVALUATION AND MUTATION DETECTION IN ESOPHAGEAL CANCER PATIENTS SUMMARY Tran Thi Bich Ngoc 1 ; Ho Viet Hoanh 2 ; Vu Phuong Nhung 1 ; Nguyen Hai Ha 1 Nguyen Van Ba 2 ; Nguyen
More informationBreast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS
Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS dr sc. Ana Krivokuća Laboratory for molecular genetics Institute for Oncology and
More informationSupplementary Information
1 Supplementary Information MATERIALS AND METHODS Study subjects Exome-sequenced HBOC patients Clinical characteristics of 24 exome-sequenced HBOC patients are presented in Table 1. Table 1. Clinical characteristics
More informationFrequency(%) KRAS G12 KRAS G13 KRAS A146 KRAS Q61 KRAS K117N PIK3CA H1047 PIK3CA E545 PIK3CA E542K PIK3CA Q546. EGFR exon19 NFS-indel EGFR L858R
Frequency(%) 1 a b ALK FS-indel ALK R1Q HRAS Q61R HRAS G13R IDH R17K IDH R14Q MET exon14 SS-indel KIT D8Y KIT L76P KIT exon11 NFS-indel SMAD4 R361 IDH1 R13 CTNNB1 S37 CTNNB1 S4 AKT1 E17K ERBB D769H ERBB
More informationCHR POS REF OBS ALLELE BUILD CLINICAL_SIGNIFICANCE
CHR POS REF OBS ALLELE BUILD CLINICAL_SIGNIFICANCE is_clinical dbsnp MITO GENE chr1 13273 G C heterozygous - - -. - DDX11L1 chr1 949654 A G Homozygous 52 - - rs8997 - ISG15 chr1 1021346 A G heterozygous
More informationPSSV User Manual (V1.0)
PSSV User Manual (V1.0) 1. Introduction A novel pattern-based probabilistic approach, PSSV, is developed to identify somatic structural variations from WGS data. Specifically, discordant and concordant
More informationAACR GENIE Data Guide
AACR GENIE Data Guide About this Document Version of Data Data Access Terms of Access Introduction to AACR GENIE Human Subjects Protection and Privacy Summary of Data by Center Genomic Profiling at Each
More informationAVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits
AVENIO family of NGS oncology assays ctdna and Tumor Tissue Analysis Kits Accelerating clinical research Next-generation sequencing (NGS) has the ability to interrogate many different genes and detect
More informationDr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.
Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag
More informationAdvance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library
Advance Your Genomic Research Using Targeted Resequencing with SeqCap EZ Library Marilou Wijdicks International Product Manager Research For Life Science Research Only. Not for Use in Diagnostic Procedures.
More informationCITATION FILE CONTENT/FORMAT
CITATION For any resultant publications using please cite: Matthew A. Field, Vicky Cho, T. Daniel Andrews, and Chris C. Goodnow (2015). "Reliably detecting clinically important variants requires both combined
More informationReporting TP53 gene analysis results in CLL
Reporting TP53 gene analysis results in CLL Mutations in TP53 - From discovery to clinical practice in CLL Discovery Validation Clinical practice Variant diversity *Leroy at al, Cancer Research Review
More informationACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics
ACE ImmunoID Biomarker Discovery Solutions ACE ImmunoID Platform for Tumor Immunogenomics Precision Genomics for Immuno-Oncology Personalis, Inc. ACE ImmunoID When one biomarker doesn t tell the whole
More informationGenome. Institute. GenomeVIP: A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud. R. Jay Mashl.
GenomeVIP: the Genome Institute at Washington University A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud R. Jay Mashl October 20, 2014 Turnkey Variant
More informationVariant Classification. Author: Mike Thiesen, Golden Helix, Inc.
Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets
More informationIdentification of genomic alterations in cervical cancer biopsies by exome sequencing
Chapter- 4 Identification of genomic alterations in cervical cancer biopsies by exome sequencing 105 4.1 INTRODUCTION Athough HPV has been identified as the prime etiological factor for cervical cancer,
More informationISOWN: accurate somatic mutation identification in the absence of normal tissue controls
Kalatskaya et al. Genome Medicine (2017) 9:59 DOI 10.1186/s13073-017-0446-9 SOFTWARE ISOWN: accurate somatic mutation identification in the absence of normal tissue controls Irina Kalatskaya 1*, Quang
More informationP. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University.
Databases and Tools for High Throughput Sequencing Analysis P. Tang ( 鄧致剛 ); PJ Huang ( 黄栢榕 ) g( ); g ( ) Bioinformatics Center, Chang Gung University. HTseq Platforms Applications on Biomedical Sciences
More informationVARIANT PRIORIZATION AND ANALYSIS INCORPORATING PROBLEMATIC REGIONS OF THE GENOME ANIL PATWARDHAN
VARIANT PRIORIZATION AND ANALYSIS INCORPORATING PROBLEMATIC REGIONS OF THE GENOME ANIL PATWARDHAN Email: apatwardhan@personalis.com MICHAEL CLARK Email: michael.clark@personalis.com ALEX MORGAN Email:
More informationFigure S4. 15 Mets Whole Exome. 5 Primary Tumors Cancer Panel and WES. Next Generation Sequencing
Figure S4 Next Generation Sequencing 15 Mets Whole Exome 5 Primary Tumors Cancer Panel and WES Get coverage of all variant loci for all three Mets Variant Filtering Sequence Alignments Index and align
More informationAACR GENIE Data Guide
AACR GENIE Data Guide About this Document Version of Data Data Access Terms of Access Introduction to AACR GENIE Human Subjects Protection and Privacy Summary of Data by Center Genomic Profiling at Each
More informationPerformance Characteristics BRCA MASTR Plus Dx
Performance Characteristics BRCA MASTR Plus Dx with drmid Dx for Illumina NGS systems Manufacturer Multiplicom N.V. Galileïlaan 18 2845 Niel Belgium Table of Contents 1. Workflow... 4 2. Performance Characteristics
More informationACE ImmunoID. ACE ImmunoID. Precision immunogenomics. Precision Genomics for Immuno-Oncology
ACE ImmunoID ACE ImmunoID Precision immunogenomics Precision Genomics for Immuno-Oncology Personalis, Inc. A universal biomarker platform for immuno-oncology Patient response to cancer immunotherapies
More informationIlluminating the genetics of complex human diseases
Illuminating the genetics of complex human diseases Michael Schatz Sept 27, 2012 Beyond the Genome @mike_schatz / #BTG2012 Outline 1. De novo mutations in human diseases 1. Autism Spectrum Disorder 2.
More informationBelow, we included the point-to-point response to the comments of both reviewers.
To the Editor and Reviewers: We would like to thank the editor and reviewers for careful reading, and constructive suggestions for our manuscript. According to comments from both reviewers, we have comprehensively
More informationVariant Annotation and Functional Prediction
Variant Annotation and Functional Prediction Copyrighted 2018 Isabelle Schrauwen and Suzanne M. Leal This exercise touches on several functionalities of the program ANNOVAR to annotate and interpret candidate
More informationTumor mutational burden and its transition towards the clinic
Tumor mutational burden and its transition towards the clinic G C C A T C A C Wolfram Jochum Institute of Pathology Kantonsspital St.Gallen CH-9007 St.Gallen wolfram.jochum@kssg.ch 30th European Congress
More informationAVENIO ctdna Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB
Analysis Kits The complete NGS liquid biopsy solution EMPOWER YOUR LAB Analysis Kits Next-generation performance in liquid biopsies 2 Accelerating clinical research From liquid biopsy to next-generation
More informationBWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space
Whole genome sequencing Whole exome sequencing BWA alignment to reference transcriptome and genome Convert transcriptome mappings back to genome space genomes Filter on MQ, distance, Cigar string Annotate
More informationNature Biotechnology: doi: /nbt.1904
Supplementary Information Comparison between assembly-based SV calls and array CGH results Genome-wide array assessment of copy number changes, such as array comparative genomic hybridization (acgh), is
More informationNGS in tissue and liquid biopsy
NGS in tissue and liquid biopsy Ana Vivancos, PhD Referencias So, why NGS in the clinics? 2000 Sanger Sequencing (1977-) 2016 NGS (2006-) ABIPrism (Applied Biosystems) Up to 2304 per day (96 sequences
More informationTOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY. Giuseppe Narzisi, PhD Bioinformatics Scientist
TOWARDS ACCURATE GERMLINE AND SOMATIC INDEL DISCOVERY WITH MICRO-ASSEMBLY Giuseppe Narzisi, PhD Bioinformatics Scientist July 29, 2014 Micro-Assembly Approach to detect INDELs 2 Outline 1 Detecting INDELs:
More informationInvestigating rare diseases with Agilent NGS solutions
Investigating rare diseases with Agilent NGS solutions Chitra Kotwaliwale, Ph.D. 1 Rare diseases affect 350 million people worldwide 7,000 rare diseases 80% are genetic 60 million affected in the US, Europe
More informationTransform genomic data into real-life results
CLINICAL SUMMARY Transform genomic data into real-life results Biomarker testing and targeted therapies can drive improved outcomes in clinical practice New FDA-Approved Broad Companion Diagnostic for
More informationAssessing Laboratory Performance for Next Generation Sequencing Based Detection of Germline Variants through Proficiency Testing
Assessing Laboratory Performance for Next Generation Sequencing Based Detection of Germline Variants through Proficiency Testing Karl V. Voelkerding, MD Professor of Pathology University of Utah Medical
More informationMEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)
Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG) Ordering Information Acceptable specimen types: Fresh blood sample (3-6 ml EDTA; no time limitations associated with receipt)
More informationHome Brewed Personalized Genomics
Home Brewed Personalized Genomics The Quest for Meaningful Analysis Results of a 23andMe Exome Pilot Trio of Myself, Wife, and Son February 22, 2013 Gabe Rudy, Vice President of Product Development Exome
More informationChIP-seq hands-on. Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs
ChIP-seq hands-on Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs Main goals Becoming familiar with essential tools and formats Visualizing and contextualizing raw data Understand
More informationCancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer
Cancer Gene Panels Dr. Andreas Scherer Dr. Andreas Scherer President and CEO Golden Helix, Inc. scherer@goldenhelix.com Twitter: andreasscherer About Golden Helix - Founded in 1998 - Main outside investor:
More informationAn integrated map of genetic variation from 1092 human genomes
SUPPLEMENTAL METHODS AND MATERIALS Whole genome sequencing Alignment: Short insert paired-end reads were aligned to the GRCh37 reference human genome with 1000 genomes decoy contigs using BWA-mem(1). Somatic
More informationComprehensive genomic profiling for various solid tumors
Content Highlight Test Specification Test Content Performance Validation Test Report I II III IV V The NovoPM TM comprehensive cancer genomic profiling test Comprehensive genomic profiling for various
More informationNGS ONCOPANELS: FDA S PERSPECTIVE
NGS ONCOPANELS: FDA S PERSPECTIVE CBA Workshop: Biomarker and Application in Drug Development August 11, 2018 Rockville, MD You Li, Ph.D. Division of Molecular Genetics and Pathology Food and Drug Administration
More informationTP53 mutational profile in CLL : A retrospective study of the FILO group.
TP53 mutational profile in CLL : A retrospective study of the FILO group. Fanny Baran-Marszak Hopital Avicenne Bobigny France 2nd ERIC workshop on TP53 analysis in CLL, Stresa 2017 TP53 abnormalities :
More informationFluxion Biosciences and Swift Biosciences Somatic variant detection from liquid biopsy samples using targeted NGS
APPLICATION NOTE Fluxion Biosciences and Swift Biosciences OVERVIEW This application note describes a robust method for detecting somatic mutations from liquid biopsy samples by combining circulating tumor
More informationCalling DNA variants SNVs, CNVs, and SVs. Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017
1 Calling DNA variants SNVs, CNVs, and SVs Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017 Calling DNA variants SNVs, CNVs, SVs 2 1. What is a variant? 2. Paired End read
More informationGlobal assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
RESEARCH ARTICLE Open Access Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping Bujie Zhan 1, João Fadista 1, Bo Thomsen 1, Jakob Hedegaard 1,2, Frank
More informationDNA Sequence Bioinformatics Analysis with the Galaxy Platform
DNA Sequence Bioinformatics Analysis with the Galaxy Platform University of São Paulo, Brazil 28 July - 1 August 2014 Dave Clements Johns Hopkins University Robson Francisco de Souza University of São
More informationGenetic Testing and Analysis. (858) MRN: Specimen: Saliva Received: 07/26/2016 GENETIC ANALYSIS REPORT
GBinsight Sample Name: GB4411 Race: Gender: Female Reason for Testing: Type 2 diabetes, early onset MRN: 0123456789 Specimen: Saliva Received: 07/26/2016 Test ID: 113-1487118782-4 Test: Type 2 Diabetes
More informationUsing the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep
Using the Bravo Liquid-Handling System for Next Generation Sequencing Sample Prep Tom Walsh, PhD Division of Medical Genetics University of Washington Next generation sequencing Sanger sequencing gold
More informationSUPPLEMENTARY INFORMATION
SUPPLEMENTARY INFORMATION Systematic investigation of cancer-associated somatic point mutations in SNP databases HyunChul Jung 1,2, Thomas Bleazard 3, Jongkeun Lee 1 and Dongwan Hong 1 1. Cancer Genomics
More informationNew Enhancements: GWAS Workflows with SVS
New Enhancements: GWAS Workflows with SVS August 9 th, 2017 Gabe Rudy VP Product & Engineering 20 most promising Biotech Technology Providers Top 10 Analytics Solution Providers Hype Cycle for Life sciences
More informationResearch Strategy: 1. Background and Significance
Research Strategy: 1. Background and Significance 1.1. Heterogeneity is a common feature of cancer. A better understanding of this heterogeneity may present therapeutic opportunities: Intratumor heterogeneity
More informationIdentification of Genetic Determinants of Metastasis and Clonal Relationships Between Primary and Metastatic Tumors
Washington University in St. Louis Washington University Open Scholarship All Theses and Dissertations (ETDs) Spring 5-1-2013 Identification of Genetic Determinants of Metastasis and Clonal Relationships
More informationCopy Number Varia/on Detec/on. Alex Mawla UCD Genome Center Bioinforma5cs Core Tuesday June 16, 2015
Copy Number Varia/on Detec/on Alex Mawla UCD Genome Center Bioinforma5cs Core Tuesday June 16, 2015 Today s Goals Understand the applica5on and capabili5es of using targe5ng sequencing and CNV calling
More informationMultiplex target enrichment using DNA indexing for ultra-high throughput variant detection
Multiplex target enrichment using DNA indexing for ultra-high throughput variant detection Dr Elaine Kenny Neuropsychiatric Genetics Research Group Institute of Molecular Medicine Trinity College Dublin
More informationGenomic structural variation
Genomic structural variation Mario Cáceres The new genomic variation DNA sequence differs across individuals much more than researchers had suspected through structural changes A huge amount of structural
More informationSupplementary Figure 1: Classification scheme for non-synonymous and nonsense germline MC1R variants. The common variants with previously established
Supplementary Figure 1: Classification scheme for nonsynonymous and nonsense germline MC1R variants. The common variants with previously established classifications 1 3 are shown. The effect of novel missense
More informationPSSV User Manual (V2.1)
PSSV User Manual (V2.1) 1. Introduction A novel pattern-based probabilistic approach, PSSV, is developed to identify somatic structural variations from WGS data. Specifically, discordant and concordant
More informationMolecular Characterization of Tumors Using Next-Generation Sequencing
Molecular Characterization of Tumors Using Next-Generation Sequencing Using BaseSpace to visualize molecular changes in cancer. Tumor-Normal Sequencing Data in BaseSpace To enable researchers new to next-generation
More informationTranscriptome Analysis
Transcriptome Analysis Data Preprocessing Sample Preparation Illumina Sequencing Demultiplexing Raw FastQ Reference Genome (fasta) Reference Annotation (GTF) Reference Genome Analysis Tophat Accepted hits
More informationThe$mitochondrial$and$autosomal$mutation$landscapes$of$ prostate$cancer$ $supplementary$material$
The$mitochondrial$and$autosomal$mutation$landscapes$of$ prostate$cancer$ $supplementary$material$ Johan Lindberg 1, Ian G. Mills 2-4, Daniel Klevebring 1, Wennuan Liu 5, Mårten Neiman 1, Jianfeng Xu 5,
More informationSCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA. Giuseppe Narzisi, PhD Schatz Lab
SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA Giuseppe Narzisi, PhD Schatz Lab November 14, 2013 Micro-Assembly Approach to detect INDELs 2 Outline Scalpel micro-assembly pipeline
More informationAD (Leave blank) TITLE: Genomic Characterization of Brain Metastasis in Non-Small Cell Lung Cancer Patients
AD (Leave blank) Award Number: W81XWH-12-1-0444 TITLE: Genomic Characterization of Brain Metastasis in Non-Small Cell Lung Cancer Patients PRINCIPAL INVESTIGATOR: Mark A. Watson, MD PhD CONTRACTING ORGANIZATION:
More informationFunctional analysis of DNA variants
Functional analysis of DNA variants GS011143, Introduction to Bioinformatics The University of Texas GSBS program, Fall 2012 Ken Chen, Ph.D. Department of Bioinformatics and Computational Biology UT MD
More informationVariant Detection & Interpretation in a diagnostic context. Christian Gilissen
Variant Detection & Interpretation in a diagnostic context Christian Gilissen c.gilissen@gen.umcn.nl 28-05-2013 So far Sequencing Johan den Dunnen Marja Jakobs Ewart de Bruijn Mapping Victor Guryev Variant
More informationNo mutations were identified.
Hereditary High Cholesterol Test ORDERING PHYSICIAN PRIMARY CONTACT SPECIMEN Report date: Aug 1, 2017 Dr. Jenny Jones Sample Medical Group 123 Main St. Sample, CA Kelly Peters Sample Medical Group 123
More informationRNA SEQUENCING AND DATA ANALYSIS
RNA SEQUENCING AND DATA ANALYSIS Length of mrna transcripts in the human genome 5,000 5,000 4,000 3,000 2,000 4,000 1,000 0 0 200 400 600 800 3,000 2,000 1,000 0 0 2,000 4,000 6,000 8,000 10,000 Length
More informationEXAMPLE. - Potentially responsive to PI3K/mTOR and MEK combination therapy or mtor/mek and PKC combination therapy. ratio (%)
Dr Kate Goodhealth Goodhealth Medical Clinic 123 Address Road SUBURBTOWN NSW 2000 Melanie Citizen Referring Doctor Your ref Address Dr John Medico 123 Main Street, SUBURBTOWN NSW 2000 Phone 02 9999 9999
More informationPERSONALIZED GENETIC REPORT CLIENT-REPORTED DATA PURPOSE OF THE X-SCREEN TEST
INCLUDED IN THIS REPORT: REVIEW OF YOUR GENETIC INFORMATION RELEVANT TO ENDOMETRIOSIS PERSONAL EDUCATIONAL INFORMATION RELEVANT TO YOUR GENES INFORMATION FOR OBTAINING YOUR ENTIRE X-SCREEN DATA FILE PERSONALIZED
More informationCytogenetics 101: Clinical Research and Molecular Genetic Technologies
Cytogenetics 101: Clinical Research and Molecular Genetic Technologies Topics for Today s Presentation 1 Classical vs Molecular Cytogenetics 2 What acgh? 3 What is FISH? 4 What is NGS? 5 How can these
More informationCharacterisation of structural variation in breast. cancer genomes using paired-end sequencing on. the Illumina Genome Analyser
Characterisation of structural variation in breast cancer genomes using paired-end sequencing on the Illumina Genome Analyser Phil Stephens Cancer Genome Project Why is it important to study cancer? Why
More informationNGS IN ONCOLOGY: FDA S PERSPECTIVE
NGS IN ONCOLOGY: FDA S PERSPECTIVE ASQ Biomed/Biotech SIG Event April 26, 2018 Gaithersburg, MD You Li, Ph.D. Division of Molecular Genetics and Pathology Food and Drug Administration (FDA) Center for
More informationCRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery
CRISPR/Cas9 Enrichment and Long-read WGS for Structural Variant Discovery PacBio CoLab Session October 20, 2017 For Research Use Only. Not for use in diagnostics procedures. Copyright 2017 by Pacific Biosciences
More informationChIP-seq data analysis
ChIP-seq data analysis Harri Lähdesmäki Department of Computer Science Aalto University November 24, 2017 Contents Background ChIP-seq protocol ChIP-seq data analysis Transcriptional regulation Transcriptional
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature11396 A total of 2078 samples from a large sequencing project at decode were used in this study, 219 samples from 78 trios with two grandchildren who were not also members of other trios,
More informationLEIDEN, THE NETHERLANDS
Full-length CYP2D6 diplotyping for better drug dosage and response management Henk Buermans, PhD Leiden University Medical Center Human Genetics, LGTC LEIDEN, THE NETHERLANDS CYP2D6 Function Metabolism
More informationNature Genetics: doi: /ng Supplementary Figure 1. Rates of different mutation types in CRC.
Supplementary Figure 1 Rates of different mutation types in CRC. (a) Stratification by mutation type indicates that C>T mutations occur at a significantly greater rate than other types. (b) As for the
More informationSVIM: Structural variant identification with long reads DAVID HELLER MAX PLANCK INSTITUTE FOR MOLECULAR GENETICS, BERLIN JUNE 2O18, SMRT LEIDEN
SVIM: Structural variant identification with long reads DAVID HELLER MAX PLANCK INSTITUTE FOR MOLECULAR GENETICS, BERLIN JUNE 2O18, SMRT LEIDEN Structural variation (SV) Variants larger than 50bps Affect
More informationAndrew Parrish, Richard Caswell, Garan Jones, Christopher M. Watson, Laura A. Crinnion 3,4, Sian Ellard 1,2
METHOD ARTICLE An enhanced method for targeted next generation sequencing copy number variant detection using ExomeDepth [version 1; referees: 1 approved, 1 approved with reservations] 1 2 1 3,4 Andrew
More informationSupplementary Material for IPred - Integrating Ab Initio and Evidence Based Gene Predictions to Improve Prediction Accuracy
1 SYSTEM REQUIREMENTS 1 Supplementary Material for IPred - Integrating Ab Initio and Evidence Based Gene Predictions to Improve Prediction Accuracy Franziska Zickmann and Bernhard Y. Renard Research Group
More informationMultiple gene sequencing for risk assessment in patients with early-onset or familial breast cancer
www.impactjournals.com/oncotarget/ Oncotarget, Supplementary Materials 2016 Multiple gene sequencing for risk assessment in patients with early-onset or familial breast cancer Supplementary Materials Supplementary
More informationSupplementary Materials for
www.sciencetranslationalmedicine.org/cgi/content/full/7/283/283ra54/dc1 Supplementary Materials for Clonal status of actionable driver events and the timing of mutational processes in cancer evolution
More informationRare Variant Burden Tests. Biostatistics 666
Rare Variant Burden Tests Biostatistics 666 Last Lecture Analysis of Short Read Sequence Data Low pass sequencing approaches Modeling haplotype sharing between individuals allows accurate variant calls
More informationSupplementary Tables. Supplementary Figures
Supplementary Files for Zehir, Benayed et al. Mutational Landscape of Metastatic Cancer Revealed from Prospective Clinical Sequencing of 10,000 Patients Supplementary Tables Supplementary Table 1: Sample
More informationCentoXome FUTURE'S KNOWLEDGE APPLIED TODAY
CentoXome FUTURE'S KNOWLEDGE APPLIED TODAY More genetic information requires cutting-edge interpretation techniques Whole Exome Sequencing For certain patients the combination of symptoms does not allow
More information!"##"$%#"&!'&$'()$(%&'*& Terapia Pediatrica e Farmacologia dello Sviluppo +,-./&01,23&34,53& :&;.<&2-.=;:3&;.;2>6-6&-.&;&
!!! "#$%&'($)*!+&,-$!.)/+$!+$!01,-$1'$!!"##"$%#"&!'&$'()$(%&'*& Terapia Pediatrica e Farmacologia dello Sviluppo 0$2-3!44566!! +,-./&01,23&34,53&63783.9-.:&;.6-6&-.&;& 582/-:3.3?;/-,.;2&@;5-2>&63:?3:;/-.:&#>A3&B&!-;C3/36&!.&))3'&!(2$&#)$7$23!+$(2$8-$#1'&!+$!177&'[!!
More informationSession 4 Rebecca Poulos
The Cancer Genome Atlas (TCGA) & International Cancer Genome Consortium (ICGC) Session 4 Rebecca Poulos Prince of Wales Clinical School Introductory bioinformatics for human genomics workshop, UNSW 20
More informationCentoXome FUTURE'S KNOWLEDGE APPLIED TODAY
CentoXome FUTURE'S KNOWLEDGE APPLIED TODAY More genetic information requires cutting-edge interpretation techniques Whole Exome Sequencing For some patients, the combination of symptoms does not allow
More information