Mendel Short IGES 2003 Data Preparation. Eric Sobel. Department of of Human Genetics UCLA School of of Medicine

Similar documents
Introduction to linkage and family based designs to study the genetic epidemiology of complex traits. Harold Snieder

Exam #2 BSC Fall. NAME_Key correct answers in BOLD FORM A

MULTIFACTORIAL DISEASES. MG L-10 July 7 th 2014

Non-parametric methods for linkage analysis

Performing. linkage analysis using MERLIN

Pedigree Analysis Why do Pedigrees? Goals of Pedigree Analysis Basic Symbols More Symbols Y-Linked Inheritance

BST227 Introduction to Statistical Genetics. Lecture 4: Introduction to linkage and association analysis

Problem 3: Simulated Rheumatoid Arthritis Data

Name Class Date. KEY CONCEPT The chromosomes on which genes are located can affect the expression of traits.

B-4.7 Summarize the chromosome theory of inheritance and relate that theory to Gregor Mendel s principles of genetics

Pedigree Analysis. A = the trait (a genetic disease or abnormality, dominant) a = normal (recessive)

Association mapping (qualitative) Association scan, quantitative. Office hours Wednesday 3-4pm 304A Stanley Hall. Association scan, qualitative

Ch 8 Practice Questions

Dan Koller, Ph.D. Medical and Molecular Genetics

By Mir Mohammed Abbas II PCMB 'A' CHAPTER CONCEPT NOTES

Statistical Tests for X Chromosome Association Study. with Simulations. Jian Wang July 10, 2012

IB BIO I Genetics Test Madden

Stat 531 Statistical Genetics I Homework 4

Literature databases OMIM

Genetics and Genomics in Medicine Chapter 8 Questions

14.1 Human Chromosomes pg

A gene is a sequence of DNA that resides at a particular site on a chromosome the locus (plural loci). Genetic linkage of genes on a single

UNIT 2: GENETICS Chapter 7: Extending Medelian Genetics

Biology 2C03: Genetics What is a Gene?

Downloaded from Chapter 5 Principles of Inheritance and Variation

Tutorial on Genome-Wide Association Studies

Genetics Review. Alleles. The Punnett Square. Genotype and Phenotype. Codominance. Incomplete Dominance

GENETICS - NOTES-

An Introduction to Quantitative Genetics I. Heather A Lawson Advanced Genetics Spring2018

Problem set questions from Final Exam Human Genetics, Nondisjunction, and Cancer

The laws of Heredity. Allele: is the copy (or a version) of the gene that control the same characteristics.

Genetics & The Work of Mendel. AP Biology

Chapter 2. Linkage Analysis. JenniferH.BarrettandM.DawnTeare. Abstract. 1. Introduction

Lecture 6 Practice of Linkage Analysis

For a long time, people have observed that offspring look like their parents.

Does Mendel s work suggest that this is the only gene in the pea genome that can affect this particular trait?

For more information about how to cite these materials visit

Mendel. The pea plant was ideal to work with and Mendel s results were so accurate because: 1) Many. Purple versus flowers, yellow versus seeds, etc.

Genetics. The study of heredity. Father of Genetics: Gregor Mendel (mid 1800 s) Developed set of laws that explain how heredity works

Psych 3102 Lecture 3. Mendelian Genetics

MULTIPLE ALLELES. Ms. Gunjan M. Chaudhari

WHAT S IN THIS LECTURE?

Alzheimer Disease and Complex Segregation Analysis p.1/29

GENETIC LINKAGE ANALYSIS

Genetics All somatic cells contain 23 pairs of chromosomes 22 pairs of autosomes 1 pair of sex chromosomes Genes contained in each pair of chromosomes

QTs IV: miraculous and missing heritability

Genomewide Linkage of Forced Mid-Expiratory Flow in Chronic Obstructive Pulmonary Disease

Statistical Genetics : Gene Mappin g through Linkag e and Associatio n

2. A normal human germ cell before meiosis has how many nuclear chromosomes?

Chapter 7: Pedigree Analysis B I O L O G Y

(b) What is the allele frequency of the b allele in the new merged population on the island?

Mendel s Methods: Monohybrid Cross

Mendelian Genetics. 7.3 Gene Linkage and Mapping Genes can be mapped to specific locations on chromosomes.

Genetics Unit Exam. Number of progeny with following phenotype Experiment Red White #1: Fish 2 (red) with Fish 3 (red) 100 0

Lab 5: Testing Hypotheses about Patterns of Inheritance

During the hyperinsulinemic-euglycemic clamp [1], a priming dose of human insulin (Novolin,

Genetics & The Work of Mendel

Mendelian Genetics. KEY CONCEPT Mendel s research showed that traits are inherited as discrete units.

Pedigree Construction Notes

Genome-wide Association Analysis Applied to Asthma-Susceptibility Gene. McCaw, Z., Wu, W., Hsiao, S., McKhann, A., Tracy, S.

PRINCIPLE OF INHERITANCE AND

Mendelian Inheritance. Jurg Ott Columbia and Rockefeller Universities New York

Patterns of Inheritance. Game Plan. Gregor Mendel ( ) Overview of patterns of inheritance Determine how some genetic disorders are inherited

BIOLOGY - CLUTCH CH.15 - CHROMOSOMAL THEORY OF INHERITANCE

Lecture 1 Mendelian Inheritance

CHAPTER- 05 PRINCIPLES OF INHERITANCE AND VARIATION

Unit 7 Section 2 and 3

8.1 Genes Are Particulate and Are Inherited According to Mendel s Laws 8.2 Alleles and Genes Interact to Produce Phenotypes 8.3 Genes Are Carried on

Chromosomes, Mapping, and the Meiosis-Inheritance Connection. Chapter 13

Chapter 17 Genetics Crosses:

HST.161 Molecular Biology and Genetics in Modern Medicine Fall 2007

Figure 1: Transmission of Wing Shape & Body Color Alleles: F0 Mating. Figure 1.1: Transmission of Wing Shape & Body Color Alleles: Expected F1 Outcome

Multifactorial Inheritance. Prof. Dr. Nedime Serakinci

Interaction of Genes and the Environment

Mass Modification User Guide for Service Providers and Service Provider Consultants

draw and interpret pedigree charts from data on human single allele and multiple allele inheritance patterns; e.g., hemophilia, blood types

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 6 Patterns of Inheritance

ncounter Data Analysis Guidelines for Copy Number Variation (CNV) Molecules That Count NanoString Technologies, Inc.

Patterns of Inheritance

Mendelian Genetics and Beyond Chapter 4 Study Prompts

Solutions to Genetics Unit Exam

9/25/ Some traits are controlled by a single gene. Selective Breeding: Observing Heredity

Genetics. Genetics. True or False. Genetics Vocabulary. Chapter 5. Objectives. Heredity

Chapter 9. Patterns of Inheritance. Lectures by Chris C. Romero, updated by Edward J. Zalisko

STATISTICAL ANALYSIS FOR GENETIC EPIDEMIOLOGY (S.A.G.E.) INTRODUCTION

The Determination of the Genetic Order and Genetic Map for the Eye Color, Wing Size, and Bristle Morphology in Drosophila melanogaster

Downloaded from

Overview of Animal Breeding

Biology: Life on Earth

Sexual Reproduction & Inheritance

Genetics & The Work of Mendel

Genetics Practice Test

11.1 The Work of Mendel

New Enhancements: GWAS Workflows with SVS

Mendelian Genetics. Gregor Mendel. Father of modern genetics

Lecture 13: May 24, 2004

Genetics Mutations 2 Teacher s Guide

VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous chromosome sexual reproduction meiosis

Introduction of Genome wide Complex Trait Analysis (GCTA) Presenter: Yue Ming Chen Location: Stat Gen Workshop Date: 6/7/2013

Transcription:

Mendel Short Course @ IGES 2003 Data Preparation Eric Sobel Department of of Human Genetics UCLA School of of Medicine 02 November 2003 Mendel Short Course @ IGES Slide 1

Web Sites Mendel5: www.genetics.ucla.edu/software SimWalk3: www.genetics.ucla.edu/software FBAT: biosun1.harvard.edu/~fbat/default.html Mega2: watson.hgen.pitt.edu/mega2.html 02 November 2003 Mendel Short Course @ IGES Slide 2

Before Statistical Analysis: Data Preparation (Overview) Types of data communicating with the software Utilities that assist in creating the data files Gregor SimRun Mega2 02 November 2003 Mendel Short Course @ IGES Slide 3

Before Statistical Analysis: Data Preparation (Overview) Analysis robustness to small perturbations in the data Mistyping Analysis Making the data more useful Allele consolidation Locus consolidation 02 November 2003 Mendel Short Course @ IGES Slide 4

Data File Manipulations Data translations are tedious and errorprone (although less so when done by a program, e.g., Mega2) There are many statistical genetics programs but few methods! Settle on a few programs you know well, both how to use it it and its assumptions, and that you trust 02 November 2003 Mendel Short Course @ IGES Slide 5

Types of Data Control data: which analysis should the software perform? Locus data: which loci are the data from? Qualitative genetic loci, e.g., traits and markers Qualitative non-genetic factors, e.g., smoker Quantitative variables, e.g., birthyear, BMI, ACE Map data: genomic layout of genetic loci? 02 November 2003 Mendel Short Course @ IGES Slide 6

More Types of Data Pedigree data: each individual s data Parents, sex and twin-status Phenotypes at the loci, factors, and variables Penetrance data (for parametric analyses): how does genotype affect phenotype at the trait loci? SNP data (Mendel-specific option): which loci should be consolidated? 02 November 2003 Mendel Short Course @ IGES Slide 7

Control Data Here one sets all the parameters necessary to to specify the type of of analysis to to run. For example, the following is is a Mendel Control.in File: OUTPUT_FILE = Mendel.out!The name of of output file LOCUS_FILE = Locus0.in!The name of of the locus file MAP_FILE = Map0.in!The name of of the map file PEDIGREE_FILE = Ped0.in!The name of of the pedigree file VARIABLE_FILE = Variable0.in!The name of of the variable file ANALYSIS_OPTION = Mistyping!Analysis option MODEL = 1!Sub-option for the analysis MALE = M!Symbol for male FEMALE = F!Symbol for female Allele_separator = -!Symbol used within genotypes 02 November 2003 Mendel Short Course @ IGES Slide 8

Comments on Flexible, Mendel Format Data Files Comma-delimited files are shown Column-specific files are also permitted This is is particularly useful for the pedigree data, since the software can be told how to to read almost any consistently formatted data set Missing values are blanks All objects can be named using words (eight or fewer characters) not just integers LINKAGE format pedigree files are accepted Many more Mendel features listed in the manual 02 November 2003 Mendel Short Course @ IGES Slide 9

Locus Data Qualitative Genetic Loci Name of of loci Chromosomal region, if if known; X-linked or Autosomal is is allowed Number and name of of alleles (optional) Allele frequencies (optional) Number and name of of phenotypes and with which genotypes they are compatible (only required if if you use phenotypes at at that locus) 02 November 2003 Mendel Short Course @ IGES Slide 10

Qualitative Genetic Locus Data File Example Egomania,1q,2,2, a,0.99, b,0.01, NORMAL,3, a-a, a-a, a-b, a-b, b-b, b-b, AFFECTED,3, a-a, a-a, a-b, a-b, b-b, b-b, Marker1,1q,2,0, 213,0.445, 217,0.555, Marker2,autosome,0,0, 02 November 2003 Mendel Short Course @ IGES Slide 11

More Locus Data Qualitative Non-Genetic Factors Name of factor Number and name of categories Quantitative Variables Name of variable Minimum and maximum values allowed 02 November 2003 Mendel Short Course @ IGES Slide 12

More Example Locus Files Factors are listed at at the end of of the Mendel Locus File: HEALTH,FACTOR,2,0, Good, Poor, PROBAND,FACTOR,1,0, PROBAND, Quantitative variables are listed in in the Mendel Variable File, one per line: YearBorn,1900,2003, 02 November 2003 Mendel Short Course @ IGES Slide 13

Map Data Contains the relative position of the qualitative genetic loci in the genome For Mendel (& SimWalk), only those loci in both the Locus and Map files will be analyzed! Sex-specific recombination fractions (and thus genetic distances) are allowed One can also specify the number of analysis points within each interval 02 November 2003 Mendel Short Course @ IGES Slide 14

Example Map Data File For example, the following is is a Mendel Map File: Egomania, 0.10,0.05,, Marker1, 0.01,,4, Marker2,,,,, Marker3, 02 November 2003 Mendel Short Course @ IGES Slide 15

Pedigree Data For each individual, one lists: Pedigree name Person name Parental names Either both parents in in pedigree or or none (none Founder) Sex Name of of twin set Phenotypes listed for each of of the loci, factors and quantitative variables in in the Locus and Variable Files, and in in the same order! (Blanks imply a missing value.) 02 November 2003 Mendel Short Course @ IGES Slide 16

Example Pedigree File For example, the following is is a Mendel Pedigree File: Bush, George,,,M,,AFFECTED,213-217,1946, Bush, Laura,,,F,,NORMAL, 213-213,1946, Bush, Barbara,George,Laura,F,,NORMAL, 213-213,1981, Bush, Jenna, George,Laura,F,,AFFECTED,,1981, Clinton,Bill,,,M,,AFFECTED,213-217,1946, Clinton,Hillary,,,F,,AFFECTED,213-217,1947, Clinton,Chelsea,Hillary,Bill,F,,NORMAL, 213-213,1980, 02 November 2003 Mendel Short Course @ IGES Slide 17

Penetrance Data For a few analyses only, e.g., Parametric Linkage and Genetic Counseling Contains the model specifying how genotype influences phenotype at a trait locus. For each phenotype, set the values: Pr( phenotype 1/1 ) Pr( phenotype 1/2 ) Pr( phenotype 2/2 ) 02 November 2003 Mendel Short Course @ IGES Slide 18

Example Penetrance File For example, the following is is a Mendel Penetrance File: Egomania,PROB,,,2, NORMAL, 0.90, 0.05, 0.05, AFFECTED,0.10, 0.95, 0.95, 02 November 2003 Mendel Short Course @ IGES Slide 19

SNP File Only used for Locus Consolidation Utility Can consolidate up to four loci into one super-locus Each locus can have up to nine alleles 02 November 2003 Mendel Short Course @ IGES Slide 20

Example SNP File For example, the following is is a Mendel SNP File: 2, 2, SNP1,SNP2, 3, 3, SNP3,SNP4,SNP5, 02 November 2003 Mendel Short Course @ IGES Slide 21

Constructing the Data Files The Gregor program eases construction of the Mendel Control.in File and running Mendel itself SimRun does the same for SimWalk and its control file called BATCH3.DAT Many pedigree formats are supported by Mendel and the other files are small and easily constructed! SimWalk will copy Mendel s file formatting flexibility by 2004. 02 November 2003 Mendel Short Course @ IGES Slide 22

Constructing the Data Files More and more databases will generate analysis input files directly Mega2 is a useful utility that converts from LINKAGE format data and pedigree files to the input files for many other packages, including Mendel and SimWalk2 Next major version of of Mega2 may better support Mendel 5 and SimWalk 3 02 November 2003 Mendel Short Course @ IGES Slide 23