STATISTICAL METHODS IN BIOLOGY

Size: px
Start display at page:

Download "STATISTICAL METHODS IN BIOLOGY"

Transcription

1 STATISTICAL METHODS IN BIOLOGY JOANNA SZYDA MAGDALENA FRĄSZCZAK

2 INTRODUCTION 1. Statistical methods in biology??? 2. The Biostatistic Group current projects 3. Course contents 4. Contact 5. Literature Copyright 2017 Joanna Szyda

3 STATISTICAL METHODS IN BIOLOGY??? science is not data. Data are the raw material of science. It is what you do with the data that is science the interpretation you make, the story you tell. ASHG 2011 Writing Workshop; Albertine 2011 /

4 STATISTICAL METHODS IN BIOLOGY - SNP [Header] BSGT Version Processing Date 11/24/ :14 AM Content BovineSNP50_A.bpm Num SNPs Total SNPs Num Samples 32 Total Samples 2636 [Data] SNP Name Sample ID SNP GC Score Index Allele1 - AB Allele2 - AB Chr Position GT Score ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K B B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K A B ARS-BFGL-BAC _K A A N = Copyright 2017 Joanna Szyda

5 STATISTICAL METHODS IN BIOLOGY - SNP ##FORMAT=<ID=SP,Number=1,Type=Integer,Description="Phred-scaled strand bias P-value"> ##FORMAT=<ID=PL,Number=G,Type=Integer,Description="List of Phred-scaled genotype likelihoods"> ##INFO=<ID=PR,Number=1,Type=Integer,Description="# permutations yielding a smaller PCHI2."> #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT BSWCHEM Chr1 182 C T 30.8 DP=2;VDB= e-02;AF1=1;AC1=2;DP4=0,0,2,0;MQ=34;FQ=-33 GT:PL:GQ 1/1:62,6,0:10 Chr1 300 A G 87 DP=6;VDB= e-02;RPB= e+00;AF1=0.5;AC1=1;DP4 GT:PL:GQ 0/1:117,0,52:5 Chr1 324 A G 34 DP=9;VDB= e-02;RPB= e+00;AF1=0.5;AC1=1;DP4= GT:PL:GQ 0/1:64,0,160:6 Chr1 340 G A 90 DP=14;VDB= e-02;RPB= e-01;AF1=0.5;AC1=1;DP4= GT:PL:GQ 0/1:120,0,209:9 Chr1 353 T A 136 DP=14;VDB= e-01;RPB= e+00;AF1=0.5;AC1=1;DP GT:PL:GQ 0/1:166,0,49:5 Chr1 355 T A 141 DP=14;VDB= e-02;RPB= e+00;AF1=0.5;AC1=1;DP4= GT:PL:GQ 0/1:171,0,50:53 Chr1 380 G T 103 DP=18;VDB= e-01;RPB= e-01;AF1=0.5;AC1=1;DP GT:PL:GQ 0/1:133,0,241:9 Chr1 420 T A 211 DP=19;VDB= e-01;RPB= e-01;AF1=0.5;AC1=1;DP GT:PL:GQ 0/1:241,0,81: polymorphic variants for 1 individual

6 STATISTICAL METHODS IN BIOLOGY - CNV duplication chr1: e e-49 1 deletion chr1: e e+06 1 duplication chr1: e e+09 1 duplication chr1: e e+09 1 duplication chr1: e e-13 1 deletion chr1: e e deletion chr1: e deletion chr1: e deletion chr1: e e deletion chr1: e deletion chr1: e deletion chr1: e e+06 1 deletion chr1: e deletion chr1: e e deletions duplications for 1 individual Copyright 2017 Joanna Szyda

7 STATISTICAL METHODS IN BIOLOGY GENE EXPRESSION Row _ _ _ _ _ _ _ genes 28 individuals 14 comparisons Copyright 2017 Joanna Szyda

8 THE BIOSTATISTIC GROUP PROJECT Copy number variations analysis among diverse cattle breeds Magda Mielczarek Joanna Szyda Magdalena Frąszczak Giulietta Minozzi Ezequiel L. Nicolazzi John Williams Katarzyna Wojdak-Maksymiec

9 THE BIOSTATISTIC GROUP PROJECT CNV in whole genome sequence of 155 bulls Various breeds: Brown Swiss 48 Guernsey 20 Fleckvieh 31 Simmental 16 Norwegian Red 26 Parda de la Montaña 4 Pezzata Rossa Italiana 3 Bruna Italiana 1 Avileña 2 Albera 1 Rubia Gallega 1 Toro de Lidia 1 Pirenaica 1

10 THE BIOSTATISTIC GROUP PROJECT Bioinformatics pipeline

11 THE BIOSTATISTIC GROUP PROJECT # CNV

12 THE BIOSTATISTIC GROUP PROJECT CNV length

13 THE BIOSTATISTIC GROUP PROJECT Genomic distribution of CNV duplications

14 THE BIOSTATISTIC GROUP PROJECT Genomic distribution of CNV deletions

15 THE BIOSTATISTIC GROUP PROJECT # of breed-specific CNVs

16 LECTURE CONTENTS 1. Ability to use biological data of various structures 2. Principles of statistical data analysis 3. Interpretation of results 4. Presence 5. Questions

17 LECTURE CONTENTS Principles of statistical data analysis 1. Introductory lecture 2. Populations and samples 3. Hypotheses testing and parameter estimation 4. Experimental design for biological data 5. Most widely used statistical tests I 6. Most widely used statistical tests II

18 LECTURE CONTENTS Elements of statistical modelling of data 7. Linear regression 8. Nonlinear regression 9. Regression model fit 10. Correlation 11. Elements of statistical data modelling 12. Model comparison 13. Variance analysis 14. Covariance analysis 15. Summary of the material, analysis of examples, discussion

19 LAB CONTENTS 1. Presence 2. Final grade average of particular grades 3. Grading: Written exams - lectures + labs Presentations 4. Computer labs

20 LAB CONTENTS Principles of statistical data analysis 1. Introductory lab 2. Populations and samples 3. Parameter estimation 4. Hypotheses testing I 5. Hypotheses testing II 6. Exam I

21 LAB CONTENTS Elements of statistical modelling of data 7. Correlation 8. Linear regression 9. Nonlinear regression 10. Interpreting results from various models 11. Exam II 12. Model comparison 13. Variance analysis 14. Presentations 15. Presentations

22 CONTACT Statistical mmethods in biology Copyright 2017 Joanna Szyda

23 CONTACT address: Institute of Genetics Kożuchowska 7 consultation: time scheduled individually

24 LITERATURE 1. Lectures 2. Statistical books e.g. Collett, D. (1991) Modelling Binary Data, Chapmann and Hall Draper, N.R., Smith, H. (1998) Applied Regression Analysis, Wiley Hawkins, D. (2005) Biomeasurement. Understanding, analysing, and communicating data in the biosciences. Oxford University Press Ruxton and Colegrave (2003) Experimental design for the life sciences

25 grading STATISTICAL METHODS

CHR POS REF OBS ALLELE BUILD CLINICAL_SIGNIFICANCE

CHR POS REF OBS ALLELE BUILD CLINICAL_SIGNIFICANCE CHR POS REF OBS ALLELE BUILD CLINICAL_SIGNIFICANCE is_clinical dbsnp MITO GENE chr1 13273 G C heterozygous - - -. - DDX11L1 chr1 949654 A G Homozygous 52 - - rs8997 - ISG15 chr1 1021346 A G heterozygous

More information

New Enhancements: GWAS Workflows with SVS

New Enhancements: GWAS Workflows with SVS New Enhancements: GWAS Workflows with SVS August 9 th, 2017 Gabe Rudy VP Product & Engineering 20 most promising Biotech Technology Providers Top 10 Analytics Solution Providers Hype Cycle for Life sciences

More information

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc.

Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Complete Genomics, Inc. Dr Rick Tearle Senior Applications Specialist, EMEA Complete Genomics Topics Overview of Data Processing Pipeline Overview of Data Files 2 DNA Nano-Ball (DNB) Read Structure Genome : acgtacatgcattcacacatgcttagctatctctcgccag

More information

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies

Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Supplementary note: Comparison of deletion variants identified in this study and four earlier studies Here we compare the results of this study to potentially overlapping results from four earlier studies

More information

Global variation in copy number in the human genome

Global variation in copy number in the human genome Global variation in copy number in the human genome Redon et. al. Nature 444:444-454 (2006) 12.03.2007 Tarmo Puurand Study 270 individuals (HapMap collection) Affymetrix 500K Whole Genome TilePath (WGTP)

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Illustrative example of ptdt using height The expected value of a child s polygenic risk score (PRS) for a trait is the average of maternal and paternal PRS values. For example,

More information

MBG* Animal Breeding Methods Fall Final Exam

MBG* Animal Breeding Methods Fall Final Exam MBG*4030 - Animal Breeding Methods Fall 2007 - Final Exam 1 Problem Questions Mick Dundee used his financial resources to purchase the Now That s A Croc crocodile farm that had been operating for a number

More information

Understanding DNA Copy Number Data

Understanding DNA Copy Number Data Understanding DNA Copy Number Data Adam B. Olshen Department of Epidemiology and Biostatistics Helen Diller Family Comprehensive Cancer Center University of California, San Francisco http://cc.ucsf.edu/people/olshena_adam.php

More information

Colorspace & Matching

Colorspace & Matching Colorspace & Matching Outline Color space and 2-base-encoding Quality Values and filtering Mapping algorithm and considerations Estimate accuracy Coverage 2 2008 Applied Biosystems Color Space Properties

More information

Genomic structural variation

Genomic structural variation Genomic structural variation Mario Cáceres The new genomic variation DNA sequence differs across individuals much more than researchers had suspected through structural changes A huge amount of structural

More information

Practical challenges that copy number variation and whole genome sequencing create for genetic diagnostic labs

Practical challenges that copy number variation and whole genome sequencing create for genetic diagnostic labs Practical challenges that copy number variation and whole genome sequencing create for genetic diagnostic labs Joris Vermeesch, Center for Human Genetics K.U.Leuven, Belgium ESHG June 11, 2010 When and

More information

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012 Statistical Tests for X Chromosome Association Study with Simulations Jian Wang July 10, 2012 Statistical Tests Zheng G, et al. 2007. Testing association for markers on the X chromosome. Genetic Epidemiology

More information

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data.

Nature Genetics: doi: /ng Supplementary Figure 1. PCA for ancestry in SNV data. Supplementary Figure 1 PCA for ancestry in SNV data. (a) EIGENSTRAT principal-component analysis (PCA) of SNV genotype data on all samples. (b) PCA of only proband SNV genotype data. (c) PCA of SNV genotype

More information

Association of a nicotine receptor polymorphism with reduced ability to quit smoking in pregnancy

Association of a nicotine receptor polymorphism with reduced ability to quit smoking in pregnancy Research Symposium, MRC CAiTE & Department of Social Medicine, University of Bristol, 3 rd March 2009. Association of a nicotine receptor polymorphism with reduced ability to quit smoking in pregnancy

More information

Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre

Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre Structural variation (SVs) Copy-number variations C Deletion A B C Balanced rearrangements A B A B C B A C Duplication Inversion Causes

More information

On Missing Data and Genotyping Errors in Association Studies

On Missing Data and Genotyping Errors in Association Studies On Missing Data and Genotyping Errors in Association Studies Department of Biostatistics Johns Hopkins Bloomberg School of Public Health May 16, 2008 Specific Aims of our R01 1 Develop and evaluate new

More information

Rare Variant Burden Tests. Biostatistics 666

Rare Variant Burden Tests. Biostatistics 666 Rare Variant Burden Tests Biostatistics 666 Last Lecture Analysis of Short Read Sequence Data Low pass sequencing approaches Modeling haplotype sharing between individuals allows accurate variant calls

More information

Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS

Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS Breast and ovarian cancer in Serbia: the importance of mutation detection in hereditary predisposition genes using NGS dr sc. Ana Krivokuća Laboratory for molecular genetics Institute for Oncology and

More information

Combining Different Marker Densities in Genomic Evaluation

Combining Different Marker Densities in Genomic Evaluation Combining Different Marker Densities in Genomic Evaluation 1, Jeff O Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland School

More information

Cancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer

Cancer Gene Panels. Dr. Andreas Scherer. Dr. Andreas Scherer President and CEO Golden Helix, Inc. Twitter: andreasscherer Cancer Gene Panels Dr. Andreas Scherer Dr. Andreas Scherer President and CEO Golden Helix, Inc. scherer@goldenhelix.com Twitter: andreasscherer About Golden Helix - Founded in 1998 - Main outside investor:

More information

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant

More information

Investigating causality in the association between 25(OH)D and schizophrenia

Investigating causality in the association between 25(OH)D and schizophrenia Investigating causality in the association between 25(OH)D and schizophrenia Amy E. Taylor PhD 1,2,3, Stephen Burgess PhD 1,4, Jennifer J. Ware PhD 1,2,5, Suzanne H. Gage PhD 1,2,3, SUNLIGHT consortium,

More information

Econometrics II - Time Series Analysis

Econometrics II - Time Series Analysis University of Pennsylvania Economics 706, Spring 2008 Econometrics II - Time Series Analysis Instructor: Frank Schorfheide; Room 525, McNeil Building E-mail: schorf@ssc.upenn.edu URL: http://www.econ.upenn.edu/

More information

Supplementary Figure 1. Estimation of tumour content

Supplementary Figure 1. Estimation of tumour content Supplementary Figure 1. Estimation of tumour content a, Approach used to estimate the tumour content in S13T1/T2, S6T1/T2, S3T1/T2 and S12T1/T2. Tissue and tumour areas were evaluated by two independent

More information

DNA is the genetic material that provides instructions for what our bodies look like and how they function. DNA is packaged into structures called

DNA is the genetic material that provides instructions for what our bodies look like and how they function. DNA is packaged into structures called DNA is the genetic material that provides instructions for what our bodies look like and how they function. DNA is packaged into structures called chromosomes. We have 23 pairs of chromosomes (for a total

More information

Calling DNA variants SNVs, CNVs, and SVs. Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017

Calling DNA variants SNVs, CNVs, and SVs. Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017 1 Calling DNA variants SNVs, CNVs, and SVs Steve Laurie Variant Effect Predictor Training Course Prague, 6 th November 2017 Calling DNA variants SNVs, CNVs, SVs 2 1. What is a variant? 2. Paired End read

More information

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc. Variant Classification Author: Mike Thiesen, Golden Helix, Inc. Overview Sequencing pipelines are able to identify rare variants not found in catalogs such as dbsnp. As a result, variants in these datasets

More information

Supplementary Figure 1. Quantile-quantile (Q-Q) plot of the log 10 p-value association results from logistic regression models for prostate cancer

Supplementary Figure 1. Quantile-quantile (Q-Q) plot of the log 10 p-value association results from logistic regression models for prostate cancer Supplementary Figure 1. Quantile-quantile (Q-Q) plot of the log 10 p-value association results from logistic regression models for prostate cancer risk in stage 1 (red) and after removing any SNPs within

More information

EECS 433 Statistical Pattern Recognition

EECS 433 Statistical Pattern Recognition EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern

More information

CLINICAL BIOSTATISTICS

CLINICAL BIOSTATISTICS 09/06/17 1 Overview and Descriptive Statistics a. Application of statistics in biomedical research b. Type of data c. Graphic representation of data d. Summary statistics: central tendency and dispersion

More information

LTA Analysis of HapMap Genotype Data

LTA Analysis of HapMap Genotype Data LTA Analysis of HapMap Genotype Data Introduction. This supplement to Global variation in copy number in the human genome, by Redon et al., describes the details of the LTA analysis used to screen HapMap

More information

Importance of Attention. The Attention System 7/16/2013

Importance of Attention. The Attention System 7/16/2013 Importance of Attention Preliminary Evidence of an Association Between an IL6 Promoter Polymorphism and Self-Reported Attentional Function in Oncology Patients and Their Family Caregivers John Merriman,

More information

DNA-seq Bioinformatics Analysis: Copy Number Variation

DNA-seq Bioinformatics Analysis: Copy Number Variation DNA-seq Bioinformatics Analysis: Copy Number Variation Elodie Girard elodie.girard@curie.fr U900 institut Curie, INSERM, Mines ParisTech, PSL Research University Paris, France NGS Applications 5C HiC DNA-seq

More information

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA BIOSTATISTICAL METHODS AND RESEARCH DESIGNS Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA Keywords: Case-control study, Cohort study, Cross-Sectional Study, Generalized

More information

Dan Koller, Ph.D. Medical and Molecular Genetics

Dan Koller, Ph.D. Medical and Molecular Genetics Design of Genetic Studies Dan Koller, Ph.D. Research Assistant Professor Medical and Molecular Genetics Genetics and Medicine Over the past decade, advances from genetics have permeated medicine Identification

More information

Session 4 Rebecca Poulos

Session 4 Rebecca Poulos The Cancer Genome Atlas (TCGA) & International Cancer Genome Consortium (ICGC) Session 4 Rebecca Poulos Prince of Wales Clinical School Introductory bioinformatics for human genomics workshop, UNSW 28

More information

Genetic Recessives and Carrier Codes

Genetic Recessives and Carrier Codes Genetic Recessives and Carrier Codes Genetic Recessives Explained Definitions: Haplotype is defined as a group of SNPs located close to each other on the chromosome and that are usually inherited together.

More information

Ginkgo Interactive analysis and quality assessment of single-cell CNV data

Ginkgo Interactive analysis and quality assessment of single-cell CNV data Ginkgo Interactive analysis and quality assessment of single-cell CNV data @RobAboukhalil Robert Aboukhalil, Tyler Garvin, Jude Kendall, Timour Baslan, Gurinder S. Atwal, Jim Hicks, Michael Wigler, Michael

More information

Big Data Training for Translational Omics Research. Session 1, Day 3, Liu. Case Study #2. PLOS Genetics DOI: /journal.pgen.

Big Data Training for Translational Omics Research. Session 1, Day 3, Liu. Case Study #2. PLOS Genetics DOI: /journal.pgen. Session 1, Day 3, Liu Case Study #2 PLOS Genetics DOI:10.1371/journal.pgen.1005910 Enantiomer Mirror image Methadone Methadone Kreek, 1973, 1976 Methadone Maintenance Therapy Long-term use of Methadone

More information

Genome. Institute. GenomeVIP: A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud. R. Jay Mashl.

Genome. Institute. GenomeVIP: A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud. R. Jay Mashl. GenomeVIP: the Genome Institute at Washington University A Genomics Analysis Pipeline for Cloud Computing with Germline and Somatic Calling on Amazon s Cloud R. Jay Mashl October 20, 2014 Turnkey Variant

More information

Supplementary information to:

Supplementary information to: Supplementary information to: Digital Sorting of Pure Cell Populations Enables Unambiguous Genetic Analysis of Heterogeneous Formalin-Fixed Paraffin Embedded Tumors by Next Generation Sequencing Authors

More information

Georgetown University ECON-616, Fall Macroeconometrics. URL: Office Hours: by appointment

Georgetown University ECON-616, Fall Macroeconometrics.   URL:  Office Hours: by appointment Georgetown University ECON-616, Fall 2016 Macroeconometrics Instructor: Ed Herbst E-mail: ed.herbst@gmail.com URL: http://edherbst.net/ Office Hours: by appointment Scheduled Class Time and Organization:

More information

Challenges of CGH array testing in children with developmental delay. Dr Sally Davies 17 th September 2014

Challenges of CGH array testing in children with developmental delay. Dr Sally Davies 17 th September 2014 Challenges of CGH array testing in children with developmental delay Dr Sally Davies 17 th September 2014 CGH array What is CGH array? Understanding the test Benefits Results to expect Consent issues Ethical

More information

EVOLUTION. Reading. Research in my Lab. Who am I? The Unifying Concept in Biology. Professor Carol Lee. On your Notecards please write the following:

EVOLUTION. Reading. Research in my Lab. Who am I? The Unifying Concept in Biology. Professor Carol Lee. On your Notecards please write the following: Evolution 410 9/5/18 On your Notecards please write the following: EVOLUTION (1) Name (2) Year (3) Major (4) Courses taken in Biology (4) Career goals (5) Email address (6) Why am I taking this class?

More information

STATISTICS IN CLINICAL AND TRANSLATIONAL RESEARCH

STATISTICS IN CLINICAL AND TRANSLATIONAL RESEARCH 09/07/11 1 Overview and Descriptive Statistics a. Application of statistics in biomedical research b. Type of data c. Graphic representation of data d. Summary statistics: central tendency and dispersion

More information

Session 4 Rebecca Poulos

Session 4 Rebecca Poulos The Cancer Genome Atlas (TCGA) & International Cancer Genome Consortium (ICGC) Session 4 Rebecca Poulos Prince of Wales Clinical School Introductory bioinformatics for human genomics workshop, UNSW 20

More information

Estimation of the heritability of a newly developed ketosis risk indicator and the genetic correlations to other traits in three German cattle breeds

Estimation of the heritability of a newly developed ketosis risk indicator and the genetic correlations to other traits in three German cattle breeds Estimation of the heritability of a newly developed ketosis risk indicator and the genetic correlations to other traits in three German cattle breeds H. Hamann 1, A. Werner 2, L. Dale 2, P. Herold 1 1

More information

Supplementary Material to. Genome-wide association study identifies new HLA Class II haplotypes strongly protective against narcolepsy

Supplementary Material to. Genome-wide association study identifies new HLA Class II haplotypes strongly protective against narcolepsy Supplementary Material to Genome-wide association study identifies new HLA Class II haplotypes strongly protective against narcolepsy Hyun Hor, 1,2, Zoltán Kutalik, 3,4, Yves Dauvilliers, 2,5 Armand Valsesia,

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Appendix 1. Sensitivity analysis for ACQ: missing value analysis by multiple imputation

Appendix 1. Sensitivity analysis for ACQ: missing value analysis by multiple imputation Appendix 1 Sensitivity analysis for ACQ: missing value analysis by multiple imputation A sensitivity analysis was carried out on the primary outcome measure (ACQ) using multiple imputation (MI). MI is

More information

Illumina Trusight Myeloid Panel validation A R FHAN R A FIQ

Illumina Trusight Myeloid Panel validation A R FHAN R A FIQ Illumina Trusight Myeloid Panel validation A R FHAN R A FIQ G E NETIC T E CHNOLOGIST MEDICAL G E NETICS, CARDIFF To Cover Background to the project Choice of panel Validation process Genes on panel, Protocol

More information

Friday, September 9, :00-11:00 am Warwick Evans Conference Room, Building D Refreshments will be provided at 9:45am

Friday, September 9, :00-11:00 am Warwick Evans Conference Room, Building D Refreshments will be provided at 9:45am The Role of the Biostatistician in Cancer Research Edmund A. Gehan, PhD Professor Emeritus, Department of Biostatistics, Bioinformatics and Biomathematics Lombardi Comprehensive Cancer Center Georgetown

More information

Analysis with SureCall 2.1

Analysis with SureCall 2.1 Analysis with SureCall 2.1 Danielle Fletcher Field Application Scientist July 2014 1 Stages of NGS Analysis Primary analysis, base calling Control Software FASTQ file reads + quality 2 Stages of NGS Analysis

More information

Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing

Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing GA N 668353 H2020 Research and Innovation Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing WP N and Title: WP2 - Towards shared European Guidelines for PGx Lead beneficiary:

More information

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013 Introduction of Genome wide Complex Trait Analysis (GCTA) resenter: ue Ming Chen Location: Stat Gen Workshop Date: 6/7/013 Outline Brief review of quantitative genetics Overview of GCTA Ideas Main functions

More information

Illuminating the genetics of complex human diseases

Illuminating the genetics of complex human diseases Illuminating the genetics of complex human diseases Michael Schatz Sept 27, 2012 Beyond the Genome @mike_schatz / #BTG2012 Outline 1. De novo mutations in human diseases 1. Autism Spectrum Disorder 2.

More information

LEIDEN, THE NETHERLANDS

LEIDEN, THE NETHERLANDS Full-length CYP2D6 diplotyping for better drug dosage and response management Henk Buermans, PhD Leiden University Medical Center Human Genetics, LGTC LEIDEN, THE NETHERLANDS CYP2D6 Function Metabolism

More information

QA 605 WINTER QUARTER ACADEMIC YEAR

QA 605 WINTER QUARTER ACADEMIC YEAR Instructor: Office: James J. Cochran 117A CAB Telephone: (318) 257-3445 Hours: e-mail: URL: QA 605 WINTER QUARTER 2006-2007 ACADEMIC YEAR Tuesday & Thursday 8:00 a.m. 10:00 a.m. Wednesday 8:00 a.m. noon

More information

Human Genetics 542 Winter 2018 Syllabus

Human Genetics 542 Winter 2018 Syllabus Human Genetics 542 Winter 2018 Syllabus Monday, Wednesday, and Friday 9 10 a.m. 5915 Buhl Course Director: Tony Antonellis Jan 3 rd Wed Mapping disease genes I: inheritance patterns and linkage analysis

More information

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies

Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley

More information

Complex Traits Activity INSTRUCTION MANUAL. ANT 2110 Introduction to Physical Anthropology Professor Julie J. Lesnik

Complex Traits Activity INSTRUCTION MANUAL. ANT 2110 Introduction to Physical Anthropology Professor Julie J. Lesnik Complex Traits Activity INSTRUCTION MANUAL ANT 2110 Introduction to Physical Anthropology Professor Julie J. Lesnik Introduction Human variation is complex. The simplest form of variation in a population

More information

Identification of regions with common copy-number variations using SNP array

Identification of regions with common copy-number variations using SNP array Identification of regions with common copy-number variations using SNP array Agus Salim Epidemiology and Public Health National University of Singapore Copy Number Variation (CNV) Copy number alteration

More information

Hands-On Ten The BRCA1 Gene and Protein

Hands-On Ten The BRCA1 Gene and Protein Hands-On Ten The BRCA1 Gene and Protein Objective: To review transcription, translation, reading frames, mutations, and reading files from GenBank, and to review some of the bioinformatics tools, such

More information

Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping RESEARCH ARTICLE Open Access Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping Bujie Zhan 1, João Fadista 1, Bo Thomsen 1, Jakob Hedegaard 1,2, Frank

More information

Human Genetics 542 Winter 2017 Syllabus

Human Genetics 542 Winter 2017 Syllabus Human Genetics 542 Winter 2017 Syllabus Monday, Wednesday, and Friday 9 10 a.m. 5915 Buhl Course Director: Tony Antonellis Module I: Mapping and characterizing simple genetic diseases Jan 4 th Wed Mapping

More information

Internal structure evidence of validity

Internal structure evidence of validity Internal structure evidence of validity Dr Wan Nor Arifin Lecturer, Unit of Biostatistics and Research Methodology, Universiti Sains Malaysia. E-mail: wnarifin@usm.my Wan Nor Arifin, 2017. Internal structure

More information

Heritability. The concept

Heritability. The concept Heritability The concept What is the Point of Heritability? Is a trait due to nature or nurture? (Genes or environment?) You and I think this is a good point to address, but it is not addressed! What is

More information

To open a CMA file > Download and Save file Start CMA Open file from within CMA

To open a CMA file > Download and Save file Start CMA Open file from within CMA Example name Effect size Analysis type Level Tamiflu Hospitalized Risk ratio Basic Basic Synopsis The US government has spent 1.4 billion dollars to stockpile Tamiflu, in anticipation of a possible flu

More information

Introduction to genetic variation. He Zhang Bioinformatics Core Facility 6/22/2016

Introduction to genetic variation. He Zhang Bioinformatics Core Facility 6/22/2016 Introduction to genetic variation He Zhang Bioinformatics Core Facility 6/22/2016 Outline Basic concepts of genetic variation Genetic variation in human populations Variation and genetic disorders Databases

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Estimates of Genetic Parameters for the Canadian Test Day Model with Legendre Polynomials for Holsteins Based on More Recent Data

Estimates of Genetic Parameters for the Canadian Test Day Model with Legendre Polynomials for Holsteins Based on More Recent Data Estimates of Genetic Parameters for the Canadian Test Day Model with Legendre Polynomials for Holsteins Based on More Recent Data Bethany Muir, Gerrit Kistemaker and Brian Van Doormaal Canadian Dairy Network

More information

Introduction to LOH and Allele Specific Copy Number User Forum

Introduction to LOH and Allele Specific Copy Number User Forum Introduction to LOH and Allele Specific Copy Number User Forum Jonathan Gerstenhaber Introduction to LOH and ASCN User Forum Contents 1. Loss of heterozygosity Analysis procedure Types of baselines 2.

More information

Copy Number Variations and Association Mapping Advanced Topics in Computa8onal Genomics

Copy Number Variations and Association Mapping Advanced Topics in Computa8onal Genomics Copy Number Variations and Association Mapping 02-715 Advanced Topics in Computa8onal Genomics SNP and CNV Genotyping SNP genotyping assumes two copy numbers at each locus (i.e., no CNVs) CNV genotyping

More information

MATH : Design and Analysis of Clinical Trials

MATH : Design and Analysis of Clinical Trials MATH 654-102: Design and Analysis of Clinical Trials MID-TERM EXAM Spring, 2012 (Time allowed: TWO AND HALF HOURS) INSTRUCTIONS TO STUDENTS: 1. This test contains FIVE questions and comprises SIX printed

More information

Lecture 7: Introduction to Selection. September 14, 2012

Lecture 7: Introduction to Selection. September 14, 2012 Lecture 7: Introduction to Selection September 14, 2012 Announcements Schedule of open computer lab hours on lab website No office hours for me week. Feel free to make an appointment for M-W. Guest lecture

More information

GENOME-WIDE ASSOCIATION STUDIES

GENOME-WIDE ASSOCIATION STUDIES GENOME-WIDE ASSOCIATION STUDIES SUCCESSES AND PITFALLS IBT 2012 Human Genetics & Molecular Medicine Zané Lombard IDENTIFYING DISEASE GENES??? Nature, 15 Feb 2001 Science, 16 Feb 2001 IDENTIFYING DISEASE

More information

Children, Toronto, Ontario, Canada. Department of Laboratory Medicine and Pathobiology Hospital for Sick Children, Toronto, Ontario, Canada, M5G 1X8

Children, Toronto, Ontario, Canada. Department of Laboratory Medicine and Pathobiology Hospital for Sick Children, Toronto, Ontario, Canada, M5G 1X8 Supplementary Information for Clinically Relevant Copy Number Variations Detected In Cerebral Palsy Maryam Oskoui 1, *, Matthew J. Gazzellone 2,3, *, Bhooma Thiruvahindrapuram 2,3, Mehdi Zarrei 2,3, John

More information

Chromosomal regions underlying noncoagulation of milk in Finnish Ayrshire cows

Chromosomal regions underlying noncoagulation of milk in Finnish Ayrshire cows EAAP session 35, abstract 3331, 27 August 2008 Chromosomal regions underlying noncoagulation of milk in Finnish Ayrshire cows Anna-Maria Tyrisevä, Kari Elo, Arja Kuusipuro, Veijo Vilva, Isto Jänönen, Heidi

More information

Detection of copy number variations in PCR-enriched targeted sequencing data

Detection of copy number variations in PCR-enriched targeted sequencing data Detection of copy number variations in PCR-enriched targeted sequencing data German Demidov Parseq Lab, Saint-Petersburg University of Russian Academy of Sciences, current: Center for Genomic Regulation

More information

Investigating rare diseases with Agilent NGS solutions

Investigating rare diseases with Agilent NGS solutions Investigating rare diseases with Agilent NGS solutions Chitra Kotwaliwale, Ph.D. 1 Rare diseases affect 350 million people worldwide 7,000 rare diseases 80% are genetic 60 million affected in the US, Europe

More information

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22. Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.32 PCOS locus after conditioning for the lead SNP rs10993397;

More information

Large-scale identity-by-descent mapping discovers rare haplotypes of large effect. Suyash Shringarpure 23andMe, Inc. ASHG 2017

Large-scale identity-by-descent mapping discovers rare haplotypes of large effect. Suyash Shringarpure 23andMe, Inc. ASHG 2017 Large-scale identity-by-descent mapping discovers rare haplotypes of large effect Suyash Shringarpure 23andMe, Inc. ASHG 2017 1 Why care about rare variants of large effect? Months from randomization 2

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Fig 1. Comparison of sub-samples on the first two principal components of genetic variation. TheBritishsampleisplottedwithredpoints.The sub-samples of the diverse sample

More information

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022 96 APPENDIX B. Supporting Information for chapter 4 "changes in genome content generated via segregation of non-allelic homologs" Figure S1. Potential de novo CNV probes and sizes of apparently de novo

More information

Cancer Informatics Lecture

Cancer Informatics Lecture Cancer Informatics Lecture Mayo-UIUC Computational Genomics Course June 22, 2018 Krishna Rani Kalari Ph.D. Associate Professor 2017 MFMER 3702274-1 Outline The Cancer Genome Atlas (TCGA) Genomic Data Commons

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Below, we included the point-to-point response to the comments of both reviewers.

Below, we included the point-to-point response to the comments of both reviewers. To the Editor and Reviewers: We would like to thank the editor and reviewers for careful reading, and constructive suggestions for our manuscript. According to comments from both reviewers, we have comprehensively

More information

PSSV User Manual (V2.1)

PSSV User Manual (V2.1) PSSV User Manual (V2.1) 1. Introduction A novel pattern-based probabilistic approach, PSSV, is developed to identify somatic structural variations from WGS data. Specifically, discordant and concordant

More information

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University

Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University Role of Chemical lexposure in Generating Spontaneous Copy Number Variants (CNVs) Jennifer Freeman Assistant Professor of Toxicology School of Health Sciences Purdue University CNV Discovery Reference Genetic

More information

NGS panels in clinical diagnostics: Utrecht experience. Van Gijn ME PhD Genome Diagnostics UMCUtrecht

NGS panels in clinical diagnostics: Utrecht experience. Van Gijn ME PhD Genome Diagnostics UMCUtrecht NGS panels in clinical diagnostics: Utrecht experience Van Gijn ME PhD Genome Diagnostics UMCUtrecht 93 Gene panels UMC Utrecht Cardiovascular disease (CAR) (5 panels) Epilepsy (EPI) (11 panels) Hereditary

More information

Final Exam Version A

Final Exam Version A Final Exam Version A Open Book and Notes your 4-digit code: Staple the question sheets to your answers Write your name only once on the back of this sheet. Problem 1: (10 points) A popular method to isolate

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is

More information

Overview of Animal Breeding

Overview of Animal Breeding Overview of Animal Breeding 1 Required Information Successful animal breeding requires 1. the collection and storage of data on individually identified animals; 2. complete pedigree information about the

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

QTL detection for traits of interest for the dairy goat industry

QTL detection for traits of interest for the dairy goat industry QTL detection for traits of interest for the dairy goat industry 64 th Annual Meeting EAAP 2013 26 th -30 th august Nantes, France C. Maroteau, I. Palhière, H. Larroque, V. Clément, G. Tosser-Klopp, R.

More information

Complex Trait Genetics in Animal Models. Will Valdar Oxford University

Complex Trait Genetics in Animal Models. Will Valdar Oxford University Complex Trait Genetics in Animal Models Will Valdar Oxford University Mapping Genes for Quantitative Traits in Outbred Mice Will Valdar Oxford University What s so great about mice? Share ~99% of genes

More information

Cognitive, affective, & social neuroscience

Cognitive, affective, & social neuroscience Cognitive, affective, & social neuroscience Time: Wed, 10:15 to 11:45 Prof. Dr. Björn Rasch, Division of Cognitive Biopsychology University of Fribourg 1 Content } 5.11. Introduction to imaging genetics

More information

Chapter 1 : Genetics 101

Chapter 1 : Genetics 101 Chapter 1 : Genetics 101 Understanding the underlying concepts of human genetics and the role of genes, behavior, and the environment will be important to appropriately collecting and applying genetic

More information

Golden Helix s End-to-End Solution for Clinical Labs

Golden Helix s End-to-End Solution for Clinical Labs Golden Helix s End-to-End Solution for Clinical Labs Steven Hystad - Field Application Scientist Nathan Fortier Senior Software Engineer 20 most promising Biotech Technology Providers Top 10 Analytics

More information