Quantitative Biology Lecture 1 (Introduction + Probability)

Size: px
Start display at page:

Download "Quantitative Biology Lecture 1 (Introduction + Probability)"

Transcription

1 21 st Sep 2015 Quantitative Biology Lecture 1 (Introduction + Probability) Gurinder Singh Mickey Atwal Center for Quantitative Biology

2 Why Quantitative Biology? (1. Models) Galileo Galilei The Book of Nature is written in the language of mathematics. Charles Darwin "I have deeply regretted that I did not proceed far enough at least to understand something of the great leading principles of mathematics; for men thus endowed seem to have an extra sense.

3 Why Quantitative Biology? (2. Data Analysis) Explosion of biological data by recent developments in high throughput biotechnology. Moore s Law: doubling of transistors on a chip every 18 months.

4 Quantitative Biology at CSHL Systems Neuroscience Developmental Biology QB Genetics and Genomics

5 Why Statistics? Molecular biology is noisy Biological molecules diffuse Small number fluctuations Genetics is stochastic Genetic variation mainly due to random events (drift) Large data analysis Experimental measurements are noisy Require probabilistic models

6 Key Numbers in Biology See handout Database of useful biological numbers ( Quantified biological properties can Test our understanding of the biology Aid in experimental design

7 Key Numbers Example 1 (Class Exercise) Q: Is there enough time to replicate the E. coli genome? Bionumbers

8 Key Numbers Example 2 (Class Exercise) Q: How far can macromolecules move by diffusion? It takes about 10 seconds on average for a protein to traverse a HeLa cell. An axon 1 mm long is about 100 times longer than a HeLa cell, and as the diffusion time scales as the square of the distance it would take 10 5 seconds or 2 days for a molecule to travel this distance by diffusion. This demonstrates the necessity of mechanisms other than diffusion for moving molecules long distances. A molecular motor moving at a rate of 1 µm/s will take a reasonable time (15 min) to traverse an axon 1 mm long.

9 Classic Examples of Quantitative Reasoning in Biology Two-hit Tumor Suppressor Hypothesis Tumor Subtyping from Expression Profiles Delbruck-Luria Experiment (next lecture)

10 Case study 1 Tumor suppressor gene hypothesis

11 Case Study 1 Incidence of childhood retinoblastoma Retinoblastoma : childhood form of retinal cancer Alfred Knudson (geneticist) studied 48 patients and tabulated age of diagnoses Tumor occurred either in one eye (unilateral) or both eyes (bilateral) Knudson 1971

12 Case Study 1 Incidence of childhood retinoblastoma Probability distribution of retinoblastoma cases by type and laterality Bilateral Unilateral Total Hereditary Nonhereditary Total Knudson observed that bilateral hereditary cases of retinoblastoma occurred at a younger age and showed a curve (previous slide) consistent with a single-mutation process.

13 Case Study 1 Tumour Suppressor Genes Statistical analyses by Knudson (1971) lead to the concept of tumor suppressor genes and the two-hit hypothesis The gene s normal function is to regulate cell division. Both alleles need to be mutated or removed in order to lose the gene activity. The first mutation may be inherited or somatic. The second mutation will often be a gross event leading to loss of heterozygosity in the surrounding area.

14 Case Study 1 Knudsen s two hit hypothesis

15 Case Study 2 Cancer subtyping by genome-wide expression analysis

16 Case Study 2 Cancer Classification Leukemia: AML, ALL Unusual morphology of lymphoblasts in ALL

17 Case Study 2 Microarray Clustering Clustering of genomewide expression date revealed a new subtype of ALL with poor prognosis from conventional therapy Armstrong et al, Nature Genetics ALL and MLL are indistinguishable from a pathologist s perspective

18 Case Study 2 MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia a, Principal component analysis (PCA) plot of ALL (red), MLL (blue) and AML (yellow) carried out using 8,700 genes that passed filtering. b, PCA plot comparing ALL (red), MLL (blue) and AML (yellow) using the 500 genes that best distinguished ALL from AML.

19 Now let s dive into some mathematics: Probability theory

20 Probabilities and Ensembles An ensemble is set of of probabilities describing a particular random process. An ensemble is defined by x, value of a random variable, taken from the alphabet A x ={a 1,a 2,a 3, a I }, alphabet P X ={p 1,p 2,p 3,..p I }, probabilities All the probabilities have to be positive and add up to one. P(x=a i )=p i p i 0 # a i "A x p(x = a i ) =1 Example: Probability of a single nucleotide in the human genome p A 0.29 p C 0.21 p G 0.21 p T 0.29

21 Joint Probabilities JOINT PROBABILITY: P(x, y) P(x, y) = P(x)P(y) if X and Y are INDEPENDENT MARGINAL PROBABILITY: " y P(x) = P(x, y)

22 Joint Probability: Dinucleotides Single-strand frequencies of human genes A 2 nd position A C G T MARGINAL st position C G T

23 Conditional Probability P(x y) = P(x, y) P(y) Probability of x given y

24 Probability Rules Product Rule (Chain Rule) Sum Rule P(x, y) = P(x y)p(y) P(x) = " y " y P(x,y) = P(x y)p(y)

25 Thomas Bayes ( ), English Presbyterian minister Bayes Theorem P(y x) = P(x y)p(y) P(x) P(y x) = " y P(x y)p(y) P(x y)p(y) Bayes Theorem tells you how to switch the order of variables in a condition probability, i.e., how to convert P(x y) into P(y x).

26 Class Exercise: Am I diseased? There is a rare disease that affects 0.1% of the population. You take a test, which is marketed to be 99% correct, i.e., in 99/100 cases test shows positive when user actually has disease, and in 99/100 cases test shows negative when user actually does not have disease. Test shows positive for you. What is the probability that you have the disease given the test result?

27 Class Exercise continued Write down all the probabilities i) P(disease)=0.001 ii) P(test positive disease)=0.99 iii) P(test negative no disease)=0.99 We want to evaluate P(disease test positive). Use Bayes Theorem (disease test positive) = P (test positive disease)p (disease) P (test positive)

28 Class Exercise continued Use sum rule to obtain P(test positive): (test positive) =P (test positive disease)p (disease)+ P (test positive no disease)p (no disease) =( ) + ( ) = Plugging this into Bayes theorem: P (disease test positive) = =0.09

29 Class Exercise continued Therefore the actual probability of you having the disease given the test result is only 9%, i.e. unlikely you are diseased. Note that the p-value of getting the test result, with the null hypothesis that you are diseasefree, is Thus a frequentist statistician would conclude that the null hypothesis can be safely rejected, and you likely are diseased. This is wrong!

30 Class Exercise continued This picture may help in understanding the counterintuitive Bayesian result Sample of 1000 typical individuals Individual that truly has disease and tests positive (true positive) Individual that does not have disease and tests positive (false positive) Individual that does not have disease and tests negative (true negative)

31 Class Exercise continued In a sample of 1000 typical individuals you may have one individual who likely has the disease [P(disease)=0.01] and likely tests positive [P(positive disease)=0.99]. Of the remaining 999 disease-free individuals, approximately ten will likely test as a false positive [P(positive no disease)=0.01]. Therefore, since there are only 11 individuals who likely test positive P (disease test positive) %

Introduction to Cancer Biology

Introduction to Cancer Biology Introduction to Cancer Biology Robin Hesketh Multiple choice questions (choose the one correct answer from the five choices) Which ONE of the following is a tumour suppressor? a. AKT b. APC c. BCL2 d.

More information

Cancer Genetics. What is Cancer? Cancer Classification. Medical Genetics. Uncontrolled growth of cells. Not all tumors are cancerous

Cancer Genetics. What is Cancer? Cancer Classification. Medical Genetics. Uncontrolled growth of cells. Not all tumors are cancerous Session8 Medical Genetics Cancer Genetics J avad Jamshidi F a s a U n i v e r s i t y o f M e d i c a l S c i e n c e s, N o v e m b e r 2 0 1 7 What is Cancer? Uncontrolled growth of cells Not all tumors

More information

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD

Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review

More information

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1 Lecture 27: Systems Biology and Bayesian Networks Systems Biology and Regulatory Networks o Definitions o Network motifs o Examples

More information

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER 2016 DELIVERING VALUE WITH DATA SCIENCE BAYES APPROACH - MAKING DATA WORK HARDER The Ipsos MORI Data Science team increasingly

More information

The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit. Uncertainty, Risk and Probability: Fundamental Definitions and Concepts

The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit. Uncertainty, Risk and Probability: Fundamental Definitions and Concepts The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit Uncertainty, Risk and Probability: Fundamental Definitions and Concepts What Is Uncertainty? A state of having limited knowledge

More information

Early detection of Retinoblastoma in children. Max Mantik

Early detection of Retinoblastoma in children. Max Mantik Early detection of Retinoblastoma in children Max Mantik Introduction The most common primary intraocular malignancy of childhood 10 to 15 % of cancers that occur within the first year of life Typical

More information

Introduction to Genetics

Introduction to Genetics Introduction to Genetics Table of contents Chromosome DNA Protein synthesis Mutation Genetic disorder Relationship between genes and cancer Genetic testing Technical concern 2 All living organisms consist

More information

Neurophysiology and Information: Theory of Brain Function

Neurophysiology and Information: Theory of Brain Function Neurophysiology and Information: Theory of Brain Function Christopher Fiorillo BiS 527, Spring 2012 042 350 4326, fiorillo@kaist.ac.kr Part 1: Inference in Perception, Cognition, and Motor Control Reading:

More information

incidence of retinoblastoma (germinal mutation/somatic mutation)

incidence of retinoblastoma (germinal mutation/somatic mutation) Proc. Nat. Acad. Sci. USA Vol. 72, No. 12, pp. 5116-5120, December 1975 Medical Sciences Mutation and childhood cancer: A probabilistic model for the incidence of retinoblastoma (germinal mutation/somatic

More information

Mendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp , Chapter 5 pp

Mendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp , Chapter 5 pp 1 Biology 423L Sept. 1/2 Mendelian Genetics using Fast Plants Report due Sept. 15/16. Readings: Mendelian genetics: Hartwell Chapter 2 pp. 13-27, Chapter 5 pp. 127-130. FASTPLANTS: Williams et al. (1986)

More information

Lecture 9 Internal Validity

Lecture 9 Internal Validity Lecture 9 Internal Validity Objectives Internal Validity Threats to Internal Validity Causality Bayesian Networks Internal validity The extent to which the hypothesized relationship between 2 or more variables

More information

T. R. Golub, D. K. Slonim & Others 1999

T. R. Golub, D. K. Slonim & Others 1999 T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have

More information

Bayes theorem describes the probability of an event, based on prior knowledge of conditions that might be related to the event.

Bayes theorem describes the probability of an event, based on prior knowledge of conditions that might be related to the event. Bayes theorem Bayes' Theorem is a theorem of probability theory originally stated by the Reverend Thomas Bayes. It can be seen as a way of understanding how the probability that a theory is true is affected

More information

1) Describe the difference between a theory and a hypothesis.

1) Describe the difference between a theory and a hypothesis. 1 st Semester Review 1) Describe the difference between a theory and a hypothesis. 2) Define: a. Independent variable b. Dependent variable c. Control 3) List the characteristics of living organisms. 4)

More information

Psychology Research Process

Psychology Research Process Psychology Research Process Logical Processes Induction Observation/Association/Using Correlation Trying to assess, through observation of a large group/sample, what is associated with what? Examples:

More information

Microarray Comparative Genomic Hybridisation (array CGH)

Microarray Comparative Genomic Hybridisation (array CGH) Saint Mary s Hospital Manchester Centre for Genomic Medicine Information for Patients Microarray Comparative Genomic Hybridisation (array CGH) An array CGH test looks for small changes in a person s chromosomes,

More information

contact info. Course coordinator can be reached at

contact info. Course coordinator can be reached at Course Outline BGEN3020 Introduction to Human Genetics Course Coordinator: Dr. David Merz Department of Biochemistry and Medical Genetics Faculty of Health Sciences Rm 324 Basic Medical Sciences Building

More information

Constructing a Bivariate Table:

Constructing a Bivariate Table: Introduction Bivariate Analysis: A statistical method designed to detect and describe the relationship between two nominal or ordinal variables (typically independent and dependent variables). Cross-Tabulation:

More information

DATA TO INSIGHT: AN INTRODUCTION TO DATA ANALYSIS THE UNIVERSITY OF AUCKLAND

DATA TO INSIGHT: AN INTRODUCTION TO DATA ANALYSIS THE UNIVERSITY OF AUCKLAND DATA TO INSIGHT: AN INTRODUCTION TO DATA ANALYSIS THE UNIVERSITY OF AUCKLAND WEEK 3 3.2 RELATIONSHIPS BETWEEN CATEGORICAL VARIABLES Hi again. In this video, you'll learn how to plot data on two categorical

More information

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China A Vision-based Affective Computing System Jieyu Zhao Ningbo University, China Outline Affective Computing A Dynamic 3D Morphable Model Facial Expression Recognition Probabilistic Graphical Models Some

More information

MTAT Bayesian Networks. Introductory Lecture. Sven Laur University of Tartu

MTAT Bayesian Networks. Introductory Lecture. Sven Laur University of Tartu MTAT.05.113 Bayesian Networks Introductory Lecture Sven Laur University of Tartu Motivation Probability calculus can be viewed as an extension of classical logic. We use many imprecise and heuristic rules

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

Probability. Esra Akdeniz. February 26, 2016

Probability. Esra Akdeniz. February 26, 2016 Probability Esra Akdeniz February 26, 2016 Terminology An experiment is any action or process whose outcome is subject to uncertainty. Example: Toss a coin, roll a die. The sample space of an experiment

More information

Genome 371, Autumn 2018 Quiz Section 9: Genetics of Cancer Worksheet

Genome 371, Autumn 2018 Quiz Section 9: Genetics of Cancer Worksheet Genome 371, Autumn 2018 Quiz Section 9: Genetics of Cancer Worksheet All cancer is due to genetic mutations. However, in cancer that clusters in families (familial cancer) at least one of these mutations

More information

Statistics: Making Sense of the Numbers

Statistics: Making Sense of the Numbers Statistics: Making Sense of the Numbers Chapter 9 This multimedia product and its contents are protected under copyright law. The following are prohibited by law: any public performance or display, including

More information

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T. Diagnostic Tests 1 Introduction Suppose we have a quantitative measurement X i on experimental or observed units i = 1,..., n, and a characteristic Y i = 0 or Y i = 1 (e.g. case/control status). The measurement

More information

Undesirable Optimality Results in Multiple Testing? Charles Lewis Dorothy T. Thayer

Undesirable Optimality Results in Multiple Testing? Charles Lewis Dorothy T. Thayer Undesirable Optimality Results in Multiple Testing? Charles Lewis Dorothy T. Thayer 1 Intuitions about multiple testing: - Multiple tests should be more conservative than individual tests. - Controlling

More information

CPS331 Lecture: Coping with Uncertainty; Discussion of Dreyfus Reading

CPS331 Lecture: Coping with Uncertainty; Discussion of Dreyfus Reading CPS331 Lecture: Coping with Uncertainty; Discussion of Dreyfus Reading Objectives: 1. To discuss ways of handling uncertainty (probability; Mycin CF) 2. To discuss Dreyfus views on expert systems Materials:

More information

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference Lecture Outline Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Statistical Inference Role of Statistical Inference Hierarchy of Experimental

More information

Supplementary notes for lecture 8: Computational modeling of cognitive development

Supplementary notes for lecture 8: Computational modeling of cognitive development Supplementary notes for lecture 8: Computational modeling of cognitive development Slide 1 Why computational modeling is important for studying cognitive development. Let s think about how to study the

More information

Doing Thousands of Hypothesis Tests at the Same Time. Bradley Efron Stanford University

Doing Thousands of Hypothesis Tests at the Same Time. Bradley Efron Stanford University Doing Thousands of Hypothesis Tests at the Same Time Bradley Efron Stanford University 1 Simultaneous Hypothesis Testing 1980: Simultaneous Statistical Inference (Rupert Miller) 2, 3,, 20 simultaneous

More information

Statistical Tests Using Experimental Data

Statistical Tests Using Experimental Data Statistical Tests Using Experimental Data Alec Brandon July 15, 2015 Alternative title So you ve worked your tail off and have some experimental data. Now what? Why are we even talking about statistics?

More information

Reasoning with Uncertainty. Reasoning with Uncertainty. Bayes Rule. Often, we want to reason from observable information to unobservable information

Reasoning with Uncertainty. Reasoning with Uncertainty. Bayes Rule. Often, we want to reason from observable information to unobservable information Reasoning with Uncertainty Reasoning with Uncertainty Often, we want to reason from observable information to unobservable information We want to calculate how our prior beliefs change given new available

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

CANCER. Inherited Cancer Syndromes. Affects 25% of US population. Kills 19% of US population (2nd largest killer after heart disease)

CANCER. Inherited Cancer Syndromes. Affects 25% of US population. Kills 19% of US population (2nd largest killer after heart disease) CANCER Affects 25% of US population Kills 19% of US population (2nd largest killer after heart disease) NOT one disease but 200-300 different defects Etiologic Factors In Cancer: Relative contributions

More information

HS Exam 1 -- March 9, 2006

HS Exam 1 -- March 9, 2006 Please write your name on the back. Don t forget! Part A: Short answer, multiple choice, and true or false questions. No use of calculators, notes, lab workbooks, cell phones, neighbors, brain implants,

More information

Bayesian Inference. Review. Breast Cancer Screening. Breast Cancer Screening. Breast Cancer Screening

Bayesian Inference. Review. Breast Cancer Screening. Breast Cancer Screening. Breast Cancer Screening STAT 101 Dr. Kari Lock Morgan Review What is the deinition of the p- value? a) P(statistic as extreme as that observed if H 0 is true) b) P(H 0 is true if statistic as extreme as that observed) SETION

More information

EECS 433 Statistical Pattern Recognition

EECS 433 Statistical Pattern Recognition EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern

More information

Bayes theorem (1812) Thomas Bayes ( ) English statistician, philosopher, and Presbyterian minister

Bayes theorem (1812) Thomas Bayes ( ) English statistician, philosopher, and Presbyterian minister Bayes theorem (1812) Thomas Bayes (1701 1761) English statistician, philosopher, and Presbyterian minister Bayes theorem (simple) Definitions: P(B A) =P(B A)/P(A); P(A B) =P(A B)/P(B) In Science we often

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc. Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able

More information

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA

MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION TO BREAST CANCER DATA International Journal of Software Engineering and Knowledge Engineering Vol. 13, No. 6 (2003) 579 592 c World Scientific Publishing Company MODEL-BASED CLUSTERING IN GENE EXPRESSION MICROARRAYS: AN APPLICATION

More information

4/20/11 More complications to Mendel

4/20/11 More complications to Mendel 4/20/11 More complications to Mendel Complications to the relationship between genotype to phenotype Commentary written in response to the release of the first draft of the human genome sequence From Science

More information

Name Student Response sheet Period. Cell Division Mitosis Lab

Name Student Response sheet Period. Cell Division Mitosis Lab ame Student Response sheet Period Cell Division Mitosis Lab Modeling Mitosis (Refer to your summary sheets from Mitosis & Meiosis on the Table ) Part 1: Effects of Environment on Mitosis Questions: 1.

More information

How_Scientific_Teams_Develop_New_Anti- Cancer_Drugs English mp4_ [00:00:00.00]

How_Scientific_Teams_Develop_New_Anti- Cancer_Drugs English mp4_ [00:00:00.00] How_Scientific_Teams_Develop_New_Anti- Cancer_Drugs English mp4_ [00:00:00.00] [00:00:20.09] KRISTINA MASSON: Welcome to our lesson on drug development and cancer. My name is Kristina, and I work here

More information

Family Trees for all grades. Learning Objectives. Materials, Resources, and Preparation

Family Trees for all grades. Learning Objectives. Materials, Resources, and Preparation page 2 Page 2 2 Introduction Family Trees for all grades Goals Discover Darwin all over Pittsburgh in 2009 with Darwin 2009: Exploration is Never Extinct. Lesson plans, including this one, are available

More information

Biological Information Neuroscience

Biological Information Neuroscience Biological Information Neuroscience The Nervous System Sensory neurons Motor neurons Interneurons Gathers, stores, process, and communicate vast amounts of information and uses it successfully. 10/25/2012

More information

Standing Between a Bayesian and a Frequentist: An Emperical Bayes Exploration of Movies, Baseball, and Williams College.

Standing Between a Bayesian and a Frequentist: An Emperical Bayes Exploration of Movies, Baseball, and Williams College. Standing Between a Bayesian and a Frequentist: An Emperical Bayes Exploration of Movies, Baseball, and Williams College Arthur Berg Pennsylvania State University Bayesian and Frequentist Representatives

More information

Chapter 15 Notes 15.1: Mendelian inheritance chromosome theory of inheritance wild type 15.2: Sex-linked genes

Chapter 15 Notes 15.1: Mendelian inheritance chromosome theory of inheritance wild type 15.2: Sex-linked genes Chapter 15 Notes The Chromosomal Basis of Inheritance Mendel s hereditary factors were genes, though this wasn t known at the time Now we know that genes are located on The location of a particular gene

More information

Types of questions. You need to know. Short question. Short question. Measurement Scale: Ordinal Scale

Types of questions. You need to know. Short question. Short question. Measurement Scale: Ordinal Scale You need to know Materials in the slides Materials in the 5 coglab presented in class Textbooks chapters Information/explanation given in class you can have all these documents with you + your notes during

More information

Inheritance of Aldehyde Oxidase in Drosophila melanogaster

Inheritance of Aldehyde Oxidase in Drosophila melanogaster Inheritance of Aldehyde Oxidase in Drosophila melanogaster (adapted from Morgan, J. G. and V. Finnerty. 1991. Inheritance of aldehyde oxidase in Drosophilia melanogaster. Pages 33-47, in Tested studies

More information

Ch. 23 The Evolution of Populations

Ch. 23 The Evolution of Populations Ch. 23 The Evolution of Populations 1 Essential question: Do populations evolve? 2 Mutation and Sexual reproduction produce genetic variation that makes evolution possible What is the smallest unit of

More information

Biomedical Machine Learning

Biomedical Machine Learning Marco F. Ramoni, PhD Children s Hospital Informatics Program Harvard Medical School Dental Informatics and Dental Research Making The Connection NIH, Bethesda, MD June 12-13, 2003 Machine Learning Artificial

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#4:(October-0-4-2010) Cancer and Signals 1 2 1 2 Evidence in Favor Somatic mutations, Aneuploidy, Copy-number changes and LOH

More information

Lab 5: Testing Hypotheses about Patterns of Inheritance

Lab 5: Testing Hypotheses about Patterns of Inheritance Lab 5: Testing Hypotheses about Patterns of Inheritance How do we talk about genetic information? Each cell in living organisms contains DNA. DNA is made of nucleotide subunits arranged in very long strands.

More information

Common Statistical Issues in Biomedical Research

Common Statistical Issues in Biomedical Research Common Statistical Issues in Biomedical Research Howard Cabral, Ph.D., M.P.H. Boston University CTSI Boston University School of Public Health Department of Biostatistics May 15, 2013 1 Overview of Basic

More information

Statistical sciences. Schools of thought. Resources for the course. Bayesian Methods - Introduction

Statistical sciences. Schools of thought. Resources for the course. Bayesian Methods - Introduction Bayesian Methods - Introduction Dr. David Lucy d.lucy@lancaster.ac.uk Lancaster University Bayesian methods p.1/38 Resources for the course All resources for this course can be found at: http://www.maths.lancs.ac.uk/

More information

Introduction to Computational Neuroscience

Introduction to Computational Neuroscience Introduction to Computational Neuroscience Lecture 5: Data analysis II Lesson Title 1 Introduction 2 Structure and Function of the NS 3 Windows to the Brain 4 Data analysis 5 Data analysis II 6 Single

More information

Multistep nature of cancer development. Cancer genes

Multistep nature of cancer development. Cancer genes Multistep nature of cancer development Phenotypic progression loss of control over cell growth/death (neoplasm) invasiveness (carcinoma) distal spread (metastatic tumor) Genetic progression multiple genetic

More information

Agro/ANSC/Biol/Gene/Hort 305 Fall, 2017 MENDELIAN INHERITANCE Chapter 2, Genetics by Brooker (Lecture outline) #2

Agro/ANSC/Biol/Gene/Hort 305 Fall, 2017 MENDELIAN INHERITANCE Chapter 2, Genetics by Brooker (Lecture outline) #2 Agro/ANSC/Biol/Gene/Hort 305 Fall, 2017 MENDELIAN INHERITANCE Chapter 2, Genetics by Brooker (Lecture outline) #2 MENDEL S LAWS OF INHERITANCE Gregor Johann Mendel (1822-1884) is considered the father

More information

Classification of cancer profiles. ABDBM Ron Shamir

Classification of cancer profiles. ABDBM Ron Shamir Classification of cancer profiles 1 Background: Cancer Classification Cancer classification is central to cancer treatment; Traditional cancer classification methods: location; morphology, cytogenesis;

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Introduction to Quantitative Genetics

Introduction to Quantitative Genetics Introduction to Quantitative Genetics 1 / 17 Historical Background Quantitative genetics is the study of continuous or quantitative traits and their underlying mechanisms. The main principals of quantitative

More information

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of

More information

STAT 200. Guided Exercise 4

STAT 200. Guided Exercise 4 STAT 200 Guided Exercise 4 1. Let s Revisit this Problem. Fill in the table again. Diagnostic tests are not infallible. We often express a fale positive and a false negative with any test. There are further

More information

Development of Carcinoma Pathways

Development of Carcinoma Pathways The Construction of Genetic Pathway to Colorectal Cancer Moriah Wright, MD Clinical Fellow in Colorectal Surgery Creighton University School of Medicine Management of Colon and Diseases February 23, 2019

More information

Molecular Pathology of Lymphoma (Part 1) Rex K.H. Au-Yeung Department of Pathology, HKU

Molecular Pathology of Lymphoma (Part 1) Rex K.H. Au-Yeung Department of Pathology, HKU Molecular Pathology of Lymphoma (Part 1) Rex K.H. Au-Yeung Department of Pathology, HKU Lecture outline Time 10:00 11:00 11:15 12:10 12:20 13:15 Content Introduction to lymphoma Review of lymphocyte biology

More information

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA 15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA Statistics does all kinds of stuff to describe data Talk about baseball, other useful stuff We can calculate the probability.

More information

Tumor suppressor genes D R. S H O S S E I N I - A S L

Tumor suppressor genes D R. S H O S S E I N I - A S L Tumor suppressor genes 1 D R. S H O S S E I N I - A S L What is a Tumor Suppressor Gene? 2 A tumor suppressor gene is a type of cancer gene that is created by loss-of function mutations. In contrast to

More information

GENETIC DRIFT & EFFECTIVE POPULATION SIZE

GENETIC DRIFT & EFFECTIVE POPULATION SIZE Instructor: Dr. Martha B. Reiskind AEC 450/550: Conservation Genetics Spring 2018 Lecture Notes for Lectures 3a & b: In the past students have expressed concern about the inbreeding coefficient, so please

More information

Cancer Genomics 101. BCCCP 2015 Annual Meeting

Cancer Genomics 101. BCCCP 2015 Annual Meeting Cancer Genomics 101 BCCCP 2015 Annual Meeting Objectives Identify red flags in a person s personal and family medical history that indicate a potential inherited susceptibility to cancer Develop a systematic

More information

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction

Abstract. Optimization strategy of Copy Number Variant calling using Multiplicom solutions APPLICATION NOTE. Introduction Optimization strategy of Copy Number Variant calling using Multiplicom solutions Michael Vyverman, PhD; Laura Standaert, PhD and Wouter Bossuyt, PhD Abstract Copy number variations (CNVs) represent a significant

More information

Making Inferences from Experiments

Making Inferences from Experiments 11.6 Making Inferences from Experiments Essential Question How can you test a hypothesis about an experiment? Resampling Data Yield (kilograms) Control Group Treatment Group 1. 1.1 1.2 1. 1.5 1.4.9 1.2

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature13908 Supplementary Tables Supplementary Table 1: Families in this study (.xlsx) All families included in the study are listed. For each family, we show: the genders of the probands and

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

Comparing multiple proportions

Comparing multiple proportions Comparing multiple proportions February 24, 2017 psych10.stanford.edu Announcements / Action Items Practice and assessment problem sets will be posted today, might be after 5 PM Reminder of OH switch today

More information

Math HL Chapter 12 Probability

Math HL Chapter 12 Probability Math HL Chapter 12 Probability Name: Read the notes and fill in any blanks. Work through the ALL of the examples. Self-Check your own progress by rating where you are. # Learning Targets Lesson I have

More information

Nerve Cell Communication

Nerve Cell Communication Nerve Cell Communication Core Concept: Nerve cells communicate using electrical and chemical signals. Class time required: Approximately 40 minutes if Part 4 is completed for homework. Teacher Provides:

More information

Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017

Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 A.K.A. Artificial Intelligence Unsupervised learning! Cluster analysis Patterns, Clumps, and Joining

More information

Signal Detection Theory and Bayesian Modeling

Signal Detection Theory and Bayesian Modeling Signal Detection Theory and Bayesian Modeling COGS 202: Computational Modeling of Cognition Omar Shanta, Shuai Tang, Gautam Reddy, Reina Mizrahi, Mehul Shah Detection Theory and Psychophysics: A Review

More information

Collated questions Demonstrate understanding of biological ideas relating to genetic variation DNA STRUCTURE

Collated questions Demonstrate understanding of biological ideas relating to genetic variation DNA STRUCTURE Collated questions Demonstrate understanding of biological ideas relating to genetic variation DNA STRUCTURE THE ROLE OF DNA IN INHERITANCE (2013:2) Use the diagram above to help you explain the relationship

More information

Agro/Ansc/Bio/Gene/Hort 305 Fall, 2017 MEDICAL GENETICS AND CANCER Chpt 24, Genetics by Brooker (lecture outline) #17

Agro/Ansc/Bio/Gene/Hort 305 Fall, 2017 MEDICAL GENETICS AND CANCER Chpt 24, Genetics by Brooker (lecture outline) #17 Agro/Ansc/Bio/Gene/Hort 305 Fall, 2017 MEDICAL GENETICS AND CANCER Chpt 24, Genetics by Brooker (lecture outline) #17 INTRODUCTION - Our genes underlie every aspect of human health, both in function and

More information

Probabilistic Reasoning with Bayesian Networks and BayesiaLab

Probabilistic Reasoning with Bayesian Networks and BayesiaLab The presentation will start at: 13:00:00 The current time is: 13:00:42 Central Standard Time, UTC-6 Probabilistic Reasoning with Bayesian Networks and BayesiaLab Introduction Your Hosts Today Stefan Conrady

More information

Structural Variation and Medical Genomics

Structural Variation and Medical Genomics Structural Variation and Medical Genomics Andrew King Department of Biomedical Informatics July 8, 2014 You already know about small scale genetic mutations Single nucleotide polymorphism (SNPs) Deletions,

More information

Section D: The Molecular Biology of Cancer

Section D: The Molecular Biology of Cancer CHAPTER 19 THE ORGANIZATION AND CONTROL OF EUKARYOTIC GENOMES Section D: The Molecular Biology of Cancer 1. Cancer results from genetic changes that affect the cell cycle 2. Oncogene proteins and faulty

More information

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data Breast cancer Inferring Transcriptional Module from Breast Cancer Profile Data Breast Cancer and Targeted Therapy Microarray Profile Data Inferring Transcriptional Module Methods CSC 177 Data Warehousing

More information

Cancer develops after somatic mutations overcome the multiple

Cancer develops after somatic mutations overcome the multiple Genetic variation in cancer predisposition: Mutational decay of a robust genetic control network Steven A. Frank* Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697-2525

More information

Introduction. Table of Contents. Background Information. 2 VISIT for complete experiment details & free student protocols.

Introduction. Table of Contents. Background Information. 2 VISIT   for complete experiment details & free student protocols. Introduction Cancer contributes to almost one in every four deaths in the United States. Fortunately, innovations in biomedical research have improved our understanding of the differences between normal

More information

Designing Psychology Experiments: Data Analysis and Presentation

Designing Psychology Experiments: Data Analysis and Presentation Data Analysis and Presentation Review of Chapter 4: Designing Experiments Develop Hypothesis (or Hypotheses) from Theory Independent Variable(s) and Dependent Variable(s) Operational Definitions of each

More information

Genetics. by their offspring. The study of the inheritance of traits is called.

Genetics. by their offspring. The study of the inheritance of traits is called. Genetics DNA contains the genetic code for the production of. A gene is a part of DNA, which has enough bases to make for many different proteins. These specific proteins made by a gene decide the of an

More information

4 Diagnostic Tests and Measures of Agreement

4 Diagnostic Tests and Measures of Agreement 4 Diagnostic Tests and Measures of Agreement Diagnostic tests may be used for diagnosis of disease or for screening purposes. Some tests are more effective than others, so we need to be able to measure

More information

Unit 5 Review Name: Period:

Unit 5 Review Name: Period: Unit 5 Review Name: Period: 1 4 5 6 7 & give an example of the following. Be able to apply their meanings: Homozygous Heterozygous Dominant Recessive Genotype Phenotype Haploid Diploid Sex chromosomes

More information

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers

Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Analysis of Massively Parallel Sequencing Data Application of Illumina Sequencing to the Genetics of Human Cancers Gordon Blackshields Senior Bioinformatician Source BioScience 1 To Cancer Genetics Studies

More information

Analysis of in-vivo extracellular recordings. Ryan Morrill Bootcamp 9/10/2014

Analysis of in-vivo extracellular recordings. Ryan Morrill Bootcamp 9/10/2014 Analysis of in-vivo extracellular recordings Ryan Morrill Bootcamp 9/10/2014 Goals for the lecture Be able to: Conceptually understand some of the analysis and jargon encountered in a typical (sensory)

More information

SYMSYS 130: Research Methods in the Cognitive and Information Sciences (Spring 2013)

SYMSYS 130: Research Methods in the Cognitive and Information Sciences (Spring 2013) SYMSYS 130: Research Methods in the Cognitive and Information Sciences (Spring 2013) Take Home Final June 20, 2013 Instructor' s Responses Please respond to the following questions with short essays (300-500

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017 RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science

More information

GENOME-WIDE ASSOCIATION STUDIES

GENOME-WIDE ASSOCIATION STUDIES GENOME-WIDE ASSOCIATION STUDIES SUCCESSES AND PITFALLS IBT 2012 Human Genetics & Molecular Medicine Zané Lombard IDENTIFYING DISEASE GENES??? Nature, 15 Feb 2001 Science, 16 Feb 2001 IDENTIFYING DISEASE

More information

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS Sara Garofalo Department of Psychiatry, University of Cambridge BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS Overview Bayesian VS classical (NHST or Frequentist) statistical approaches Theoretical issues

More information