Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

Similar documents
CNV PCA Search Tutorial

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.

Nature Genetics: doi: /ng Supplementary Figure 1. SEER data for male and female cancer incidence from

November 9, Johns Hopkins School of Medicine, Baltimore, MD,

Nature Biotechnology: doi: /nbt.1904

Global variation in copy number in the human genome

Supplementary Figures

Take a look at the three adult bears shown in these photographs:

Nature Methods: doi: /nmeth.3115

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

SUPPLEMENTARY INFORMATION

Mendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp , Chapter 5 pp

DNA-seq Bioinformatics Analysis: Copy Number Variation

Supplementary Information. Supplementary Figures

Nature Neuroscience: doi: /nn Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb.2419

Name Period. Keystone Vocabulary: genetics fertilization trait hybrid gene allele Principle of dominance segregation gamete probability

Tutorial: RNA-Seq Analysis Part II: Non-Specific Matches and Expression Measures

Biology. Chapter 13. Observing Patterns in Inherited Traits. Concepts and Applications 9e Starr Evers Starr. Cengage Learning 2015

Passing It On. QUESTION: How are inherited characteristics passed from parent to offspring? toothpicks - red and green

Introduction to Genetics and Heredity

Vega: Variational Segmentation for Copy Number Detection

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Identification of both copy number variation-type and constant-type core elements in a large segmental duplication region of the mouse genome

Genetics and Diversity Punnett Squares

Mosaic loss of chromosome Y in peripheral blood is associated with shorter survival and higher risk of cancer

Relationship between genomic features and distributions of RS1 and RS3 rearrangements in breast cancer genomes.

Nature Immunology: doi: /ni Supplementary Figure 1. RNA-Seq analysis of CD8 + TILs and N-TILs.

Supplemental Figure legends

Chapter 11 Introduction to Genetics

UNIT 6 GENETICS 12/30/16

Changes in genome content generated via segregation of non-allelic homologs

The laws of Heredity. Allele: is the copy (or a version) of the gene that control the same characteristics.

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Children, Toronto, Ontario, Canada. Department of Laboratory Medicine and Pathobiology Hospital for Sick Children, Toronto, Ontario, Canada, M5G 1X8

Exercises: Differential Methylation

LTA Analysis of HapMap Genotype Data

Journal: Nature Methods

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

Mendel and Heredity. Chapter 12

Name Class Date. Complete each of the following sentences by choosing the correct term from the word bank. sex cells genotype sex chromosomes

Chapter 12 Multiple Choice

Introduction to Genetics & Heredity Gregor Mendel Mendel s Pea Plant Experiments self-pollination cross-pollinated Principle of Dominance

MENDELIAN GENETICS. Punnet Squares and Pea Plants

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

Genomic structural variation

Supplementary Materials for

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

B-4.7 Summarize the chromosome theory of inheritance and relate that theory to Gregor Mendel s principles of genetics

Challenges of CGH array testing in children with developmental delay. Dr Sally Davies 17 th September 2014

fl/+ KRas;Atg5 fl/+ KRas;Atg5 fl/fl KRas;Atg5 fl/fl KRas;Atg5 Supplementary Figure 1. Gene set enrichment analyses. (a) (b)

UNIT 3 GENETICS LESSON #30: TRAITS, GENES, & ALLELES. Many things come in many forms. Give me an example of something that comes in many forms.

Numerous hypothesis tests were performed in this study. To reduce the false positive due to

High Throughput Sequence (HTS) data analysis. Lei Zhou

Table S1. Trait summary for Northern Leaf Blight resistance in the maize nested association mapping (NAM) population and individual NAM families

Nature Genetics: doi: /ng Supplementary Figure 1. Phenotypic characterization of MES- and ADRN-type cells.

DRAGON GENETICS Understanding Inheritance 1

Assignment 5: Integrative epigenomics analysis

PBZ FT01_PBZ FT01_TZ FT01_NZ. interface zone (I) tumor zone (TZ) necrotic zone (NZ)

Introduction to LOH and Allele Specific Copy Number User Forum

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

EXPression ANalyzer and DisplayER

SUPPLEMENTARY APPENDIX

Supplementary Figure 1

New Enhancements: GWAS Workflows with SVS

Supplemental Data. Genome-wide Association of Copy-Number Variation. Reveals an Association between Short Stature

Cancer Informatics Lecture

Lab 5: Testing Hypotheses about Patterns of Inheritance

Mendel and Heredity. Chapter 12

Genetics of extreme body size evolution in mice from Gough Island

Unit 5 Review Name: Period:

Objectives. ! Describe the contributions of Gregor Mendel to the science of genetics. ! Explain the Law of Segregation.

Genetics: field of biology that studies heredity, or the passing of traits from parents to offspring Trait: an inherited characteristic, such as eye

14 1 Human Heredity. Week 8 vocab Chapter 14

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Supplementary Materials for

For a long time, people have observed that offspring look like their parents.

Name: PS#: Biol 3301 Midterm 1 Spring 2012

Understanding DNA Copy Number Data

Dragon Genetics, pt. VI: Making a dragon

SSM signature genes are highly expressed in residual scar tissues after preoperative radiotherapy of rectal cancer.

Large-scale identity-by-descent mapping discovers rare haplotypes of large effect. Suyash Shringarpure 23andMe, Inc. ASHG 2017

7SK ChIRP-seq is specifically RNA dependent and conserved between mice and humans.

Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

Supporting Online Material for

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.

UNIT IV. Chapter 14 The Human Genome

Chapter 15: The Chromosomal Basis of Inheritance

DRAGON GENETICS LAB -- Principles of Mendelian Genetics

Introduction to Genetics

Nature Genetics: doi: /ng.3731

Strength of functional signature correlates with effect size in autism

ddm1a (PFG_3A-51065) ATG ddm1b (PFG_2B-60109) ATG osdrm2 (PFG_3A-04110) osdrm2 osdrm2 osdrm2

Chapter 10 Notes Patterns of Inheritance, Part 1

11.1 The Work of Mendel

Potato Head Genetics Gina Ford & Jennifer Hladun Twelve Bridges Middle School Lincoln California

Supplementary Materials

Laws of Inheritance. Bởi: OpenStaxCollege

Transcription:

96 APPENDIX B. Supporting Information for chapter 4 "changes in genome content generated via segregation of non-allelic homologs" Figure S1. Potential de novo CNV probes and sizes of apparently de novo CNV segments Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022 M0023 Figure S3. Distribution of apparently de novo CNV segments on nine chromosomes for the RIL Figure S4. Validation of apparently de novo CNV segments using exome-seq data. Table S1: Detailed information of acgh probes Table S2: Apparently de novo CNV segments in the RILs Table S3: PCR detection of copy number loss in some apparently de novo CNV segments Table S4. Mapping Mo17 mate-pair reads associated with apparently de novo CNV segments on the B73 reference genome Table S5: Mapping result of Mo17 5-kb mate-pair reads derived from apparently de novo CNV segments to the B73 reference genome Table S6: Genes in apparently de novo CNV segments Table S7: Genotype/phenotype associations between apparently de novo CNV segments and traits Figure S1. Potential de novo CNV probes and sizes of apparently de novo CNV segments a-b, Scatter plots of log2 signal ratios of the RIL vs. B73 (y-axis) and log2 signal ratios of Mo17 vs. B73 (xaxis) for potential de novo CNV probes that exhibited unique hybridization signals relative to both B73 and Mo17. Purple dots represent probes with signal gain in the RIL relative to B73; blue dots represent probes with signal loss in the RIL relative to B73. c-d, Histograms of log2 sizes of apparently de novo CNV segments. Red lines indicate means.

97 a b c

98 d e

99 f g h

100 i j

101 Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022. For each acgh probe the log2 signal ratio of the RIL to each of the parents, B73 and Mo17, was determined. The ratio with smaller absolute log2 value (y-axis) was plotted against the physical position of the corresponding probe on the chromosome (x-axis) in each of panel (a-j). Each dot in each panel represents a probe. Probes that exhibited statistically significant signal losses or gains relative to both parents (potential de novo CNV probes) were highlighted in red. All other probes are plotted in grey. Chromosomal regions in the RIL that are derived from B73 and Mo17 are plotted in blue and green, respectively. These assignments were inferred via segmentation of all probes that could be reliably classified as having B73-like or Mo17-like signals. Subsequently, a second segmentation was conducted to determine the apparently de novo CNV segments (Methods) that are indicated in red. Alternative mechanisms other than SNH might be involved to result in those segments with high number of copy gains in the RILs relative to the parental lines based on acgh signal.

102 a b c

103 d e

104 f g h

105 i Figure S3. Distribution of apparently de novo CNV segments on nine chromosomes for the RIL M0023. For each acgh probe the log2 signal ratio of the RIL to each of the parents, B73 and Mo17, was determined. The ratio with smaller absolute log2 value (y-axis) was plotted against the physical position of the corresponding probe on the chromosome (x-axis) in each of panel (a-i). Each dot in each panel represents a probe. Probes that exhibited statistically significant signal losses or gains relative to both parents (potential de novo CNV probes) were highlighted in red. All other probes are plotted in grey. Chromosomal regions in the RIL that are derived from B73 and Mo17 are plotted in blue and green, respectively. These assignments were inferred via segmentation of all probes that could be reliably classified as having B73-like or Mo17-like signals. Subsequently, a second segmentation was conducted to determine the apparently de novo CNV segments (Methods) that are indicated in red.

106 Figure S4. Validation of apparently de novo CNV segments using exome-seq data. a, Scatter plot of normalized read counts from testable apparently de novo CNV segments in exome-seq data from the RIL plotted against read counts in B73 exome-seq data. Testable segments are those having 30 B73 reads (a-b). Green triangles or circles represent apparently de novo CNV segments with signal gain in the RIL (M0022: triangle and M0023: circle) when compared to both B73 and Mo17 from the acgh experiment; red triangles or circles represent apparently de novo CNV segments with signal loss in the RIL (M0022: triangle and M0023: circle) when compared to both B73 and Mo17 from the acgh experiment. The same color-codes are used in panels b-d. b, Scatter plot of normalized read counts per kb of segment size in the RIL of testable apparently de novo CNV segments against the corresponding B73 read counts per kb of segment size. c, Scatter plot of normalized read counts in the RIL of testable apparently de novo CNV segments against the corresponding Mo17 read counts. Testable segments require 30 supporting Mo17 reads (c-d). d, Scatter plot of normalized read counts per kb of segment size in the RIL of the testable apparently de

107 novo CNV segments against the corresponding Mo17 read counts per kb of segment size. Tables Table S1: Detailed information of acgh probes A comma separated file (file size = 194 MB) providing the chromosomal location of each of 1,780,475 acgh probes and the detailed result of statistical contrasts between B73 vs. Mo17, B73 vs. RIL and Mo17 vs. RIL. Since this is a large data set, we only showed 100 probes in the Table S1 for now. The rest of the probe list is available upon request. Description of each column is: 1. ID: probe ID 2. Chr: chromosome on which the probe is located 3. Start: start position of the probe on the chromosome 4. End: end position of the probe on the chromosome 5. LR.M0022vsB: log2 of signal ratio of M0022 to B73 in acgh 6. LR.M0023vsB: log2 of signal ratio of M0023 to B73 in acgh 7. LR.MvsB: log2 of signal ratio of Mo17 to B73 in acgh 8. Q.M0022vsB: q-value of the contrast of M0022 vs. B73 for a given probe in acgh 9. Q.M0023vsB: q-value of the contrast of M0023 vs. B73 for a given probe in acgh 10. Q.MvsB: q-value of the contrast of Mo17 vs. B73 for a given probe in acgh 11. LR.M0022vsM: log2 of signal ratio of M0022 to Mo17 in acgh 12. LR.M0023vsM: log2 of signal ratio of M0023 to Mo17 in acgh 13. Q.M0022vsM: q-value of the contrast of M0022 vs. Mo17 for a given probe in acgh 14. Q.M0023vsM: q-value of the contrast of M0023 vs. Mo17 for a given probe in acgh 15. M0022.non_parental: "1" denotes potential de novo CNV probes in M0022; other probes were denoted with "0" 16. M0023.non_parental: "1" denotes potential de novo CNV probes in M0023; other probes were denoted with "0"

108 Table S2: Apparently de novo CNV segments in the RILs An Excel file providing the detailed information for each of the identified apparently de novo CNV segments. Description of each column is: 1. SegID: segment ID 2. Chr: chromosome 3. Start: start position of the segment 4. End: end position of the segment 5. Feature: gain/loss feature of the RIL compared to the parents 6. NumCGHmarker: the number of acgh probes in the segment 7. MvsBmean: mean of log2(mo17/b73) across all acgh probes in the segment 8. M0022vsBmean: mean of log2(m0022/b73) across all acgh probes in the segment 9. M0023vsBmean: mean of log2(m0023/b73) across all acgh probes in the segment 10. M0022vsMmean: mean of log2(m0022/mo17) across all acgh probes in the segment 11. M0023vsMmean: mean of log2(m0023/mo17) across all acgh probes in the segment 12. Gene_list(Gene,size,percentage): all genes from the ZmB73_4a.53 filtered gene set in the segment, size and percentage represent the overlapping size between the gene and the segment and the overlapping percentage of the gene 13. BM.seg: the source of the segment, B73(B) or Mo17(M) 14ExomeB73: B73 read counts in the segment from Exome-seq 15. ExomeMo17_norm: normalized read count for Mo17 in segment. The total number of uniquely mapped reads was used to normalize Mo17 counts to B73 level (B73 uniquely mapped reads: 61,547,945; Mo17 uniquely mapped reads: 57,028,009) 16. ExomeM0022_norm: normalized read count for M0022 in segment. The total number of uniquely mapped reads was used to normalize Mo17 counts to B73 level (B73 uniquely mapped reads: 61,547,945; M0022 uniquely mapped reads: 58,842,192) 17. ExomeM0023_norm: normalized read count for M0023 in segment. The total number of uniquely mapped reads was used to normalize Mo17 counts to B73 level (B73 totally uniquely mapped reads: 61,547,945; M0023 uniquely mapped reads: 50,612,131) 18. Overlap(ID,bp): whether or not the segment overlaps with apparently de novo CNV segments in the other RIL and which segment is overlapped and its size

109 Table S3: PCR detection of copy number loss in some apparently de novo CNV segments Order SegID No. RILs with expected PCR band No. RILs without the expected PCR band p-value of Chi-sq test 3:1 p-value of Chi-sq test 7:1 1 M0023_seg3 262 29 3.2E-09* 1.90E-01 2 M0022_seg19 259 18 1.1E-12* 2.50E-03* 3 M0022_seg20/M 0023_seg39 278 42 9.3E-07* 7.40E-01 4 M0023_seg64 208 59 2.7E-01 2.10E-06* 5 M0022_seg30 285 44 1.1E-06* 6.30E-01 6 M0022_seg31/M 0023_seg66 194 67 8.0E-01 1.40E-09* 9 M0023_seg120 262 87 9.8E-01 2.20E-12* 10 M0023_seg16 210 65 6.0E-01 2.30E-08* 11 M0023_seg92 252 70 1.8E-01 5.40E-07* 12 M0022_seg15/M 0023_seg22 194 72 4.4E-01 6.80E-13* 13 M0023_seg29 184 76 1.2E-01 3.40E-16* 14 M0023_seg125 239 27 2.2E-08* 2.50E-01 15 M0023_seg130 202 63 6.4E-01 2.90E-08* 16 M0023_seg103 173 89 8.0E-04* 8.00E-26* *Significantly different at p<0.05

110 Table S4. Mapping Mo17 mate-pair reads associated with apparently de novo CNV segments on the B73 reference genome (Methods) M0022 Random (M0022 control) M0023 Random (M0023 control) Number of read pairs locally mapped (<1Mb) Number of read pairs distally mapped 188 (10%) 520 (67%) 460 (16%) 1,466 (73%) 1,712 (90%) 256 (33%) 2,490 (84%) 539 (27%) Total read pairs 1,900 776 2,950 2,005 Percentage of segments that 60% (40/67)* 3% (2/67) 52% (68/130)* 5% (6/130) could be distally mapped (Number of segments with reads clustered at non-allelic regions / Total number of observed segments) * Statistical significance with the chi-square test to the null hypothesis of no enrichment of non-allelic mapped segments compared to the random control

111 Table S5: Mapping result of Mo17 5-kb mate-pair reads derived from apparently de novo CNV segments to the B73 reference genome An Excel file providing the detailed information of the mapping result of Mo17 5-kb mate-pair reads

112 Table S6: Genes in apparently de novo CNV segments An Excel file providing the detailed information of genes that overlap with de novo CNV segments. Description of each column is: 1. GeneID: Order of genes 2. Possible phenotype association: whether or not the segment at which the gene is located is associated with phenotypic variation 3. Gene: Gene name 4. Chr: Chromosome 5. Ori: Orientation of gene on the B73 reference genome 6. Start: Start position of gene 7. End: End position of gene 8. M0022.Gene.cov: Percentage of gene covered by M0022 apparently de novo CNV segment 9. M0022.SegID: Corresponding M0022 apparently de novo CNV segment 10. M0022.Feature: Gain or loss feature of corresponding M0022 apparently de novo CNV segment 11. M0023.Gene.cov: Percentage of gene covered by M0023 apparently de novo CNV segment 12. M0023.SegID: Corresponding M0023 apparently de novo CNV segment 13. M0023.Feature: Gain or loss feature of corresponding M0023 apparently de novo CNV segment 14. B73_Apex: Number of B73 Apex RNA-Seq reads from the corresponding gene 15. Mo17_Apex: Number of Mo17 Apex RNA-Seq reads from the corresponding gene 16. B73_Apex_RPM: Number of B73 RNA-Seq reads per million total Apex reads from for the corresponding gene 17. Mo17_Apex_RPM: Number of Mo17 RNA-Seq reads per million of total Apex reads for the corresponding gene 18. B73_exome_cap: Number of B73 exome-seq reads for the corresponding gene 19. Mo17_norm_exome_cap: Normalized number of Mo17 exome-seq reads for the corresponding gene 20. M0022_norm_exome_cap: Normalized number of M0022 exome-seq reads for the corresponding gene 21. M0023_norm_exome_cap: Normalized number of M0023 exome-seq reads for the corresponding gene 22. Annotation: Gene annotation (4a.53) from maizesequence.org 23. ZmHomo: Number of homologous genes in maize (CoGE) 24. SgHomo: Number of homologous genes in sorghum (CoGE) 25. OsHomo: Number of homologous genes in rice (CoGE)

113 Table S7: Genotype/phenotype associations between apparently de novo CNV segments and traits An Excel file providing summarized data about genotyping of de novo CNV segments and associations with phenotypic data. Description of each column is: 1. SegID: name of apparently de novo CNV segments from both RILs; if both apparently de novo CNV segments have overlap, both were written down with a slash separator 2. No. RILs with expected PCR band: number of RILs without the expected PCR band 3. No. RILs without expected PCR band: number of RILs without the expected PCR band after removing RILs with potentially false negative PCR results 4. phenotype: measured traits, including average kernel weight (AKW), cob diameter (CD), cob length (CL), cob weight (CW), kernel count (KC), kernel row number (KRN), total kernel weight (TKW), seedling dry weight (SDW), tiller number (TN), ferulic acid (FA), p-coumaric acid (PCA), brace node number (BN), node number above primary ear (NA), node number below primary ear (NB), total node number (NT) 5. phenotypic mean of RILs without expected PCR band: Mean phenotypic values of the corresponding trait of those RILs without the expected PCR band 6. phenotypic mean of RILs with expected PCR band: Mean phenotype values of the corresponding trait of those RILs with the expected PCR band 7. t-test value: t statistic from a pair-wise t-test of phenotypic values for a given trait between groups of RIL with and without the expected PCR band; "NA", no test performed 8. t-test p_value: p-value of the pair-wise t-test; "NA", no tests preformed 9. adjust.p: adjusted p-value with the permutation test; "NA", no test performed 10. significant: significance if adjusted p-value is less than 0.05; "NA", no test performed