Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.

Similar documents
Nature Genetics: doi: /ng Supplementary Figure 1

SUPPLEMENTARY INFORMATION

November 9, Johns Hopkins School of Medicine, Baltimore, MD,

Nature Neuroscience: doi: /nn Supplementary Figure 1

Analysis with SureCall 2.1

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies

RACP Congress 2017 Genetics of Intellectual Disability and Autism: Past Present and Future 9 th May 2017

CURRENT GENETIC TESTING TOOLS IN NEONATAL MEDICINE. Dr. Bahar Naghavi

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

Nature Biotechnology: doi: /nbt.1904

Clinical evaluation of microarray data

Neuropsychiatric Disease Working Group Plan June 14, 2016 NHGRI Centers for Common Disease Genomics Program

Strength of functional signature correlates with effect size in autism

SNP Array NOTE: THIS IS A SAMPLE REPORT AND MAY NOT REFLECT ACTUAL PATIENT DATA. FORMAT AND/OR CONTENT MAY BE UPDATED PERIODICALLY.

Recurrent de novo mutations in neurodevelopmental disorders: properties and clinical implications

SNP Array NOTE: THIS IS A SAMPLE REPORT AND MAY NOT REFLECT ACTUAL PATIENT DATA. FORMAT AND/OR CONTENT MAY BE UPDATED PERIODICALLY.

Genetics All somatic cells contain 23 pairs of chromosomes 22 pairs of autosomes 1 pair of sex chromosomes Genes contained in each pair of chromosomes

Global variation in copy number in the human genome

Genomic structural variation

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

Approach to Mental Retardation and Developmental Delay. SR Ghaffari MSc MD PhD

Update on DSM 5 and the Genomics of ASD

What s the Human Genome Project Got to Do with Developmental Disabilities?

Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre

What can genetic studies tell us about ADHD? Dr Joanna Martin, Cardiff University

Genetic Testing for the Developmental Delay/Intellectual Disability, Autism Spectrum Disorder, and Congenital Anomalies

Challenges of CGH array testing in children with developmental delay. Dr Sally Davies 17 th September 2014

Lecture 20. Disease Genetics

DNA-seq Bioinformatics Analysis: Copy Number Variation

SFARI Gene 2.0 User Guide

Faravareh Khordadpoor (PhD in molecular genetics) 1- Tehran Medical Genetics Laboratory 2- Science and research branch, Islamic Azad University

Title:Exome sequencing helped the fine diagnosis of two siblings afflicted with atypical Timothy syndrome (TS2)

Illuminating the genetics of complex human diseases

Applications of Chromosomal Microarray Analysis (CMA) in pre- and postnatal Diagnostic: advantages, limitations and concerns

Whole Genome and Transcriptome Analysis of Anaplastic Meningioma. Patrick Tarpey Cancer Genome Project Wellcome Trust Sanger Institute

Investigating rare diseases with Agilent NGS solutions

Understanding DNA Copy Number Data

Nature Genetics: doi: /ng Supplementary Figure 1. Somatic coding mutations identified by WES/WGS for 83 ATL cases.

Sequencing studies implicate inherited mutations in autism

LTA Analysis of HapMap Genotype Data

Supplementary Appendix

Structural Variation and Medical Genomics

Nature Neuroscience: doi: /nn Supplementary Figure 1. Missense damaging predictions as a function of allele frequency

CentoXome FUTURE'S KNOWLEDGE APPLIED TODAY

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

22q11.2 DELETION SYNDROME. Anna Mª Cueto González Clinical Geneticist Programa de Medicina Molecular y Genética Hospital Vall d Hebrón (Barcelona)

5/2/18. After this class students should be able to: Stephanie Moon, Ph.D. - GWAS. How do we distinguish Mendelian from non-mendelian traits?

CentoXome FUTURE'S KNOWLEDGE APPLIED TODAY

Multiple Copy Number Variations in a Patient with Developmental Delay ASCLS- March 31, 2016

The Amazing Brain Webinar Series: Select Topics in Neuroscience and Child Development for the Clinician

Identifying Mutations Responsible for Rare Disorders Using New Technologies

Mutation Detection and CNV Analysis for Illumina Sequencing data from HaloPlex Target Enrichment Panels using NextGENe Software for Clinical Research

Nature Genetics: doi: /ng Supplementary Figure 1. Rates of different mutation types in CRC.

Nature Methods: doi: /nmeth.3115

Genetics and Genomics: Applications to Developmental Disability

Congenital Heart Disease How much of it is genetic?

1 in 68 in US. Autism Update: New research, evidence-based intervention. 1 in 45 in NJ. Selected New References. Autism Prevalence CDC 2014

SCALPEL MICRO-ASSEMBLY APPROACH TO DETECT INDELS WITHIN EXOME-CAPTURE DATA. Giuseppe Narzisi, PhD Schatz Lab

Genome Wide Variant Analysis of Simplex Autism Families with an Integrative Clinical-Bioinformatics Pipeline

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.

Cancer Informatics Lecture

CNV detection. Introduction and detection in NGS data. G. Demidov 1,2. NGSchool2016. Centre for Genomic Regulation. CNV detection. G.

Implementation of the DDD/ClinGen OGT (CytoSure v3) Microarray

Supplementary Figure 1. Estimation of tumour content

Identification of regions with common copy-number variations using SNP array

Children, Toronto, Ontario, Canada. Department of Laboratory Medicine and Pathobiology Hospital for Sick Children, Toronto, Ontario, Canada, M5G 1X8

Supplementary Figure 1: Classification scheme for non-synonymous and nonsense germline MC1R variants. The common variants with previously established

Practical challenges that copy number variation and whole genome sequencing create for genetic diagnostic labs

Golden Helix s End-to-End Solution for Clinical Labs

Supplementary Figures

CNV Detection and Interpretation in Genomic Data

Introduction to genetic variation. He Zhang Bioinformatics Core Facility 6/22/2016

Quality Control Analysis of Add Health GWAS Data

Merging single gene-level CNV with sequence variant interpretation following the ACMGG/AMP sequence variant guidelines

NGS panels in clinical diagnostics: Utrecht experience. Van Gijn ME PhD Genome Diagnostics UMCUtrecht

CHROMOSOMAL MICROARRAY (CGH+SNP)

The contribution of rare variants to risk of schizophrenia in individuals with and without intellectual disability

GENETIC TESTING FOR PREDICTING RISK OF NONFAMILIAL BREAST CANCER

Introduction to the Genetics of Complex Disease

Copy number variation detection and genotyping from exome sequence data

The Deciphering Development Disorders (DDD) project: What a genomic approach can achieve

Introduction. Introduction

Journal: Nature Methods

The role of sex-differential biology in risk for autism spectrum disorder

Genetics and Genetic Testing for Autism:

Macmillan Publishers Limited. All rights reserved

Personalis ACE Clinical Exome The First Test to Combine an Enhanced Clinical Exome with Genome- Scale Structural Variant Detection

p e r s p e c t i v e

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University

Corporate Medical Policy

MEDICAL GENOMICS LABORATORY. Next-Gen Sequencing and Deletion/Duplication Analysis of NF1 Only (NF1-NG)

Systematizing in vivo modeling of pediatric disorders

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

National Disease Research Interchange Annual Progress Report: 2010 Formula Grant

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

Proposal form for the evaluation of a genetic test for NHS Service Gene Dossier

Guided Notes: Simple Genetics

Mendelian Genetics. Activity. Part I: Introduction. Instructions

TADA: Analyzing De Novo, Transmission and Case-Control Sequencing Data

Supplementary Note. Patient #1 Additional Details

Transcription:

Supplementary Figure 1 PCA for ancestry in SNV data. (a) EIGENSTRAT principal-component analysis (PCA) of SNV genotype data on all samples. (b) PCA of only proband SNV genotype data. (c) PCA of SNV genotype data with colors indicating ethnicity specified in the Simons Foundation Autism Research Initiative (SFARI) Base. (d) PCA of SNV genotype data with colors indicating calculated ethnicity.

Supplementary Figure 2 Allele balance in probands at identified de novo variant sites. Histogram of proband s allele balance at 141 sites that we attempted to validate. Green, Sanger validation was successful (the site is a true de novo variant); black outline, the site failed to validate as a de novo variant.

Supplementary Figure 3 Distribution of IQ for autism, PDD and Asperger s clinical impressions. Pearson correlation of IQ and all three categories: r 2 = 0.18, P < 1 10 10.

Supplementary Figure 4 CNV validation rates for CoNIFER and XHMM. Shown are Venn diagrams with blue indicating events (deletions/duplications), at varying size ranges, called by CoNIFER and yellow indicating events called by XHMM. These are events validated by SNP microarray data.

Supplementary Figure 5 Head size distribution for maternal CNV carriers. The top portion of the diagram shows density distributions of normalized head size per individual for maternally inherited deletions (red) and duplications (blue). Below are jitter plots with each point indicating an individual sample and the respective normalized head size. These are shown for individuals with deletions and duplications carrying a CHD8 target gene and also those who have deletions and duplications that do not contain a CHD8 target gene. A normalized head size of less than or equal to 2 is considered microcephaly and greater than or equal to 2 is macrocephaly.

a CSMD1 CSMD1 gene model 3 de novo missense 4 inherited LGD SNVs (one shared with sibling) 12906.p1 14695.p1 14165.p1 14621.p1* 13638.p1 13913.p1 13619.p1 13619.s1 5 inherited CNVs (4 focal) (one shared with sibling) 13700.p1 * 11336.p1 11336.s1 14152.p1 12661.p1 13503.p1 CSMD1 Expression pattern (GTEx portal data) RPKM 4-3 - 2-1 - Brain tissues Testis b RIMS1 RIMS1 gene model 2 de novo LGD 13162.p1 13497.p1 Probands Siblings 2 private LGD SNVs 6 rare LGD SNVs 3 rare LGD SNVs (2 shared with probands) 14608.p1 12929.p1 12929.s1 14336.p1 13965.p1 13965.s1 14550.p1 13368.p1 * 12690.p1 12673.s1 14448.p1 RIMS1 Expression pattern (GTEx portal data) RPKM 60-40 - 20 - Brain tissues Esophagus (muscular) c CUL7 CUL7 gene model 2 de novo missense SNVs 2 inherited LGD SNVs 14698.p1 11888.p1 14597.p1 13271.p1

Supplementary Figure 6 Convergence of de novo and inherited mutations. Examples of genes with an excess of disruptive mutations in probands include (a) CSMD1, (b) RIMS1 and (c) CUL7. Displayed for each gene is a RefSeq gene model (larger ticks are exons), validated de novo and private LGD mutations (red arrows) and missense (yellow arrows); disruptive CNV deletions (red) and duplications (blue) are compared for proband (p1) and sibling (s1). CSMD1 and RIMS1 show highly brain-specific expression patterns (adapted from the GTEx Consortium online portal: http://gtexportal.org/). Proband IDs with asterisks are part of trios, not quads.

Supplementary Figure 7

Population attributable risk estimates. Epidemiological features of variant data were assessed for SNVs in 1,786 quad families (parents plus 1 affected proband and 1 unaffected sibling). We subsequently looked at subsets of these families (1,785 families where both proband and sibling sex were known) stratified by (a) proband sex and ultimately by (b) proband-sibling pairing by sex (male proband/male sibling, male proband/female sibling, female proband/male sibling, female proband/female sibling) in families where the sex was known for both the proband and the sibling.

Supplementary Figure 8 Detailed view of known CNVs affecting the CSMD1 locus. Of note is the depletion of CNV events at the 3 end of the gene in the Database of Genomic Variants as compared to the 5 portion of the gene.

Supplementary Figure 9 Sampling results from overlap analysis of genes enriched for de novo and inherited events. The blue distribution shows the sum of inherited CNVs and SNVs in probands for random 100-gene samplings from Supplementary Table 8 (10,000 iterations). The red vertical line represents the actual (observed) count of CNVs and SNVs in probands for the 100 top enriched genes by de novo events.

Supplementary Figure 10 Detail of de novo and inherited mutations in interacting proteins found in a single family. A proband with a compound mutation in synaptic receptor ligand pair; namely, a de novo non-conservative mutation in neuroligin and an inherited two-exon deletion in neurexin. Phenotypic details of this proband include nonverbal IQ = 57, verbal IQ = 46, adaptive IQ = 77, mixed expressive-receptive language disorder, reported autism severity = 8 (of 10), elevated externalizing symptoms, delays in phrase speech, diagnosed heart murmurs, abnormal EEG, elevated BMI (z = 3.69) and macrocephaly (z = 3.08).

Supplementary Figure 11 Sequence file workflow and SNV identification.

Supplementary Figure 12 CNV calling workflow. 1. BAM data were assembled from all previous SSC studies. 2. All reads were realigned using mrsfast for CoNIFER and using BWAmem for XHMM. For calling CNVs as part of CoNIFER, a pipeline using DNAcopy and CGHcall was used for segmentation. 3. Postcalling, CNVs were combined into one data set. 4. To validate exome calls, we utilized both available SNP microarray and array comparative genomic hybridization (acgh) data. For SNP microarray analysis, probe-level copy number estimates were determined by

CRLMM, and validation of events was performed by permutation testing. 5. To identify true de novo events, parents were re-genotyped using CRLMM. 6. Post-validation, there were 291 de novo and 6,319 inherited events.

Supplementary Figure 13 RVIS values in the known autism network. Histogram of RVIS values for all genes in the autism network described in Krumm et al. 53. Also indicated is P value based on 1,000,000 permutations of a randomly selected gene set of the same size from the whole genome.