Ensemble size: How suboptimal is less than infinity?

Size: px
Start display at page:

Download "Ensemble size: How suboptimal is less than infinity?"

Transcription

1 Ensemble size: How suboptimal is less than infinity? Martin Leutbecher 13 September 2017 Acknowledgements: Zied Ben Bouallègue, Chris Ferro, Sarah-Jane Lock, David Richardson M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

2 Ensemble size at ECMWF M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

3 Ensemble size at ECMWF M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

4 Ensemble size at ECMWF 50 member since Dec 1996 Why 50? M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

5 Ensemble size at ECMWF 50 member since Dec 1996 Why 50? Are the benefits of more than 50 members marginal? M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

6 Talagrand, Vautard and Strauss (1997) (adapted from their Fig. 4.5) M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

7 Talagrand, Vautard and Strauss (1997) According to Talagrand et al (1997) not more than 30 members are needed. (adapted from their Fig. 4.5) M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

8 Buizza and Palmer (1998) Comparison of 2, 4, 8, 16, 32 members Verified against analysis Verified against member (adapted from their Fig. 11) Z500 in NH; T63L19 model, initial uncertainty represented with singular vectors, no representation of model uncertainty Careful conclusions that do not rule out increases in skill beyond 32 members. M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

9 Buizza et al. (1998) Impact of resolution and ensemble size In December 1996, resolution was increased from T63 (quadratic grid) to TL159 (linear grid) and ensemble size was increased from 32 to 50 members. M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

10 Miyoshi et al (2014) identical twin EnKF using SPEEDY model Horizontal correlations of mid-tropospheric specific humidity with yellow star location (from their Fig. 3) M. Leutbecher (from their Fig. 4) Ensemble size: How suboptimal is less than infinity? 13 September

11 Chua s circuit Machete and Smith (2016) ensemble forecasts of electronic circuit (from Wikipedia) (from their Fig. 12; colour corresponds to lead time; competitive advantage is similar to a probabilistic skill score) M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

12 Guess the ensemble size i.i.d. members; pdfs for 20 realisations; ensemble size fixed in each panel M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

13 Guess the ensemble size i.i.d. members; pdfs for 20 realisations; ensemble size fixed in each panel k 4k 16k M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

14 Ensemble size at ECMWF M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

15 Experiments with IFS ensembles IFS cycle 41r2 as operational ensemble but lower resolution: TCo members June-July-August 2016 (92 cases) probabilistic skill evaluated with continuous ranked probability score (CRPS): mean squared error of cumulative distribution M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

16 CRPS Impact of ensemble size on CRPS M=8 M=20 M=50 M=100 M= hpa zonal wind in northern extratropics M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

17 CRPS Impact of ensemble size on CRPS Predictions of CRPS for infinite ensemble size M=8 M=20 M=50 M=100 M= hpa zonal wind in northern extratropics M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

18 Where does increased skill come from? Sampling uncertainty of Z500 ensemble mean at D7 8 member 50 member M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

19 CRPS and ensemble size: What to expect? Kernel representation of CRPS kernel representation of CRPS CRPS(x j, y) = 1 M x j y 1 M 2M 2 j=1 M j=1 k=1 M x j x k With exchangeability of members, the expected CRPS is E x CRPS(x j, y) = E x x y M 1 2M E x,x x x For an infinite size ensemble we get E x CRPS(x j, y) = E x x y 1 2 E x,x x x M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

20 How can CRPS for infinite ensemble size be predicted with a finite ensemble? The fair CRPS is a modified version of the CRPS that removes the bias in the score due to the finite ensemble size (see Chris Ferro s talk) From the kernel representation, one can see easily that the CRPS for infinite ensemble size is obtained by the estimator CRPS (x j, y) = CRPS(x j, y) 1 2M 2 (M 1) The correction term is a measure of ensemble spread. M j=1 k=1 M x j x k M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

21 Analytic result for statistically consistent ensembles When members are statistically consistent (iid) draws from same distribution as observation (perfectly reliable ensemble), the CRPS for an m-member ensemble satisfies ( CRPS M = 1 M 1 ) E x x ( = ) CRPS 2M M Eqns. (8) and (9) in Richardson (2001) show that the Brier score also satisfies BS M = (1 + M 1 ) BS. M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

22 Analytic result for statistically consistent ensembles When members are statistically consistent (iid) draws from same distribution as observation (perfectly reliable ensemble), the CRPS for an m-member ensemble satisfies ( CRPS M = 1 M 1 ) E x x ( = ) CRPS 2M M Eqns. (8) and (9) in Richardson (2001) show that the Brier score also satisfies BS M = (1 + M 1 ) BS. Extreme events? Relationship for BS implies that for any weighting in the twcrps (Gneiting and Ranjan, 2011) we also have ( twcrps M = ) twcrps M M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

23 Actual convergence with ensemble size from right to left 2, 4, 8, 20, 50, 100 and 200 members 1.5 CRPS(M)/fairCRPS(M) /M Data from 200 member TCo399 IFS experiment, JJA data points for each ensemble size 15 lead times 4 variables (z500, T850, u850, u200) 2 regions (NH and SH extratropics) 50 and 200 members are 2% and 0.5% worse than, respectively M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

24 from right to left 2, 4, 8, 20, 50, 100 and 200 members Actual convergence with ensemble size zoom 20, 50, 100 and 200 members CRPS(M)/fairCRPS(M) CRPS(M)/fairCRPS(M) /M /M Data from 200 member TCo399 IFS experiment, JJA data points for each ensemble size 15 lead times 4 variables (z500, T850, u850, u200) 2 regions (NH and SH extratropics) 50 and 200 members are 2% and 0.5% worse than, respectively M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

25 Quantile score and CRPS QS α α = 0.5 α = 0.1 QS α (q, y) = 2 (I {y < q} α) (q y) y α = 0.9 q with indicator function I(true) = 1 and I(false) = 0, quantile q and observation y; α (0, 1) denotes the probability level CRPS(F, y) = 1 0 QS α ( F 1 (α), y ) dα where the quantile q for cumulative distribution F is F 1 (α) M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

26 QS M / QS Quantile score for a standard Gaussian Simulations with M = 20 to 1000 members quantile probability level For QS of q.98, 50 and 200 members are 7% and 2% worse than, respectively M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

27 Ensemble size at ECMWF M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

28 Research and development What is a good ensemble size? Large ensemble size can delay progress in R&D It would be most efficient to use the smallest ensemble size that is sufficient to estimate impact for operational ensemble size Using proper scores with small ensembles can mislead though M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

29 0 Ensemble configurations R, O and N CRPS for 850 hpa temperature in northern extratropics 50 member 4 member CRPS CRPS N R, M= R, M= O, M=50 N, M= O, M=4 N, M=4 M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

30 0 Ensemble configurations R, O and N CRPS for 850 hpa temperature in northern extratropics 50 member 4 member CRPS CRPS N R, M= R, M= O, M=50 N, M= O, M=4 N, M= R, M= R, M=4 fair CRPS O, M= O, M= N, M= N, M=4 M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

31 Ensemble configurations R, O and N CRPS for 850 hpa zonal wind in tropics 50 member 4 member R, M=50 O, M=50 N, M= CRPS CRPS N R, M=4 O, M=4 N, M=4 M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

32 Ensemble configurations R, O and N CRPS for 850 hpa zonal wind in tropics 50 member 4 member 0 R, M=50 O, M=50 N, M= CRPS CRPS N R, M=4 O, M=4 N, M=4-0.1 R, M= R, M=4 fair CRPS -0.2 O, M=50 O, M=4 N, M=50 N, M=4 M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

33 0 Ensemble configurations R, O and N CRPS for 850 hpa zonal wind in tropics 50 member 2 member (1,3) CRPS CRPS N R, M= R, M=2-0.2 O, M= O, M=2 0 N, M= N, M=2-0.1 R, M= R, M=2 fair CRPS -0.2 O, M=50 O, M=2 N, M=50 N, M=2 M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

34 0 Ensemble configurations R, O and N CRPS for 850 hpa zonal wind in tropics 50 member 2 member (1,2) CRPS CRPS N R, M= R, M=2-0.2 O, M= O, M=2 0 N, M= N, M=2-0.1 R, M= R, M=2 fair CRPS -0.2 O, M=50 O, M=2 N, M= N, M=2 M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

35 Small ensemble sizes Can be used for R&D if evaluation uses fair scores Can be used in reforecasts for estimating skill Applicability of fair scores is linked to ensemble generation Current ensemble generation at ECMWF not fully consistent with exchangeability required for fair scores Benefits for R&D faster turnaround time more configurations can be explored scope for increasing statistical significance by using less members but more start dates M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

36 Conclusions How suboptimal is less than infinity? Three possible answers: M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

37 Conclusions How suboptimal is less than infinity? Three possible answers: A bit or maybe a lot, tell me the score and your ensemble size... M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

38 Conclusions How suboptimal is less than infinity? Three possible answers: A bit or maybe a lot, tell me the score and your ensemble size... Operational ensemble forecasts: 50 members are too few let s increase the ensemble size to... M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

39 Conclusions How suboptimal is less than infinity? Three possible answers: A bit or maybe a lot, tell me the score and your ensemble size... Operational ensemble forecasts: 50 members are too few let s increase the ensemble size to... Research & Development: Small ensembles are highly efficient. Two to four members may be enough for standard evaluations (provided exchangeability in the ensemble generation and use of fair scores) M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

40 Discussion How can we increase ensemble size when we need to increase resolution too? Different users will have different needs, how to obtain a good compromise for all of them? How to increase ensemble size in a computationally efficient way for all forecast ranges from medium-range to extended-range? What is an adequate ensemble size for the reforecasts? Which other proper scores permit the construction of an associated fair score? M. Leutbecher Ensemble size: How suboptimal is less than infinity? 13 September

The Multiensemble Approach: The NAEFS Example

The Multiensemble Approach: The NAEFS Example MAY 2009 C A N D I L L E 1655 The Multiensemble Approach: The NAEFS Example GUILLEM CANDILLE Université de Québec à Montréal, Montréal, Québec, Canada (Manuscript received 13 June 2008, in final form 14

More information

New scoring methods for weather forecasts. Isabelle Sanchez, Fabien Stoop, Michaël Zamo, Francis Pouponneau, Nicole Girardot

New scoring methods for weather forecasts. Isabelle Sanchez, Fabien Stoop, Michaël Zamo, Francis Pouponneau, Nicole Girardot New scoring methods for weather forecasts Isabelle Sanchez, Fabien Stoop, Michaël Zamo, Francis Pouponneau, Nicole Girardot Using ECMWF Forecasts 6-9 june 2016 Outline 1 - Verification of sensible weather

More information

Information Content and Optimal Channel Selection for AIRS

Information Content and Optimal Channel Selection for AIRS Information Content and Optimal Channel Selection for AIRS Nadia Fourrié*, Jean-Noël Thépaut European Centre for Medium-range Weather Forecasts * NWP SAF visiting scientist program Presentation layout

More information

Luci Karbonini. Pmf St. SWAP May 27,2011

Luci Karbonini. Pmf St. SWAP May 27,2011 Seasonal dependence of low frequency variability in an idealised GCM Luci Karbonini Pmf St SWAP May 27,2011 Motivation Planetary-Scale Baroclinic Instability and MJO, Straus and Lindzen (observations)

More information

Ensemble based probabilistic forecasting of meteorology and air quality in Oslo, Norway

Ensemble based probabilistic forecasting of meteorology and air quality in Oslo, Norway Ensemble based probabilistic forecasting of meteorology and air quality in Oslo, Norway Sam Erik Walker, Bruce Rolstad Denby, Núria Castell NILU Norwegian Institute for Air Research 21 August 2014 World

More information

Simulation of Transient Pressure Behavior for a Well With a Finite- Conductivity Vertical Fracture

Simulation of Transient Pressure Behavior for a Well With a Finite- Conductivity Vertical Fracture Simulation of Transient Pressure Behavior for a Well With a Finite- Conductivity Vertical Fracture Paulo Dore Fernandes, Thiago Judson L. de Oliveira, Sylvio Dermeval de Souza, Raphael David and Rodrigo

More information

Graphical display of population data

Graphical display of population data Graphical display of population data E. Niclas Jonsson Division of Pharmacokinetics and Drug Therapy Department of Pharmaceutical Biosciences Uppsala University Nature of population data Hierarchical Variable

More information

Some Similarities between the Spread of Infectious Disease and Population Growth 1

Some Similarities between the Spread of Infectious Disease and Population Growth 1 Some Similarities between the Spread of Infectious Disease and Population Growth 1 The Spread of Infectious Disease 1. An infectious disease is any disease caused by germs such as viruses or bacteria.

More information

Expert judgements in risk and reliability analysis

Expert judgements in risk and reliability analysis Expert judgements in risk and reliability analysis Jan-Erik Holmberg M2-E2117 Risk analysis, April 17, 2016 Aalto university Contents Definitions Applications Bayesian approach Examples 2 Background Objective,

More information

Lecture 2: Foundations of Concept Learning

Lecture 2: Foundations of Concept Learning Lecture 2: Foundations of Concept Learning Cognitive Systems - Machine Learning Part I: Basic Approaches to Concept Learning Version Space, Candidate Elimination, Inductive Bias last change October 18,

More information

Sample size calculation a quick guide. Ronán Conroy

Sample size calculation a quick guide. Ronán Conroy Sample size calculation a quick guide Thursday 28 October 2004 Ronán Conroy rconroy@rcsi.ie How to use this guide This guide has sample size ready-reckoners for a number of common research designs. Each

More information

Tuning and Validating HYCOM s KPP Mixed-Layer. Alan J. Wallcraft and E. Joseph Metzger Naval Research Laboratory

Tuning and Validating HYCOM s KPP Mixed-Layer. Alan J. Wallcraft and E. Joseph Metzger Naval Research Laboratory Tuning and Validating HYCOM s KPP Mixed-Layer Alan J. Wallcraft and E. Joseph Metzger Naval Research Laboratory A. Birol Kara COAPS, Florida State University 2003 Layered Ocean Model Users Workshop February

More information

IEEE SIGNAL PROCESSING LETTERS, VOL. 13, NO. 3, MARCH A Self-Structured Adaptive Decision Feedback Equalizer

IEEE SIGNAL PROCESSING LETTERS, VOL. 13, NO. 3, MARCH A Self-Structured Adaptive Decision Feedback Equalizer SIGNAL PROCESSING LETTERS, VOL 13, NO 3, MARCH 2006 1 A Self-Structured Adaptive Decision Feedback Equalizer Yu Gong and Colin F N Cowan, Senior Member, Abstract In a decision feedback equalizer (DFE),

More information

A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models

A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models Jonathan Y. Ito and David V. Pynadath and Stacy C. Marsella Information Sciences Institute, University of Southern California

More information

Assimilating New Passive Microwave Data in the NOAA GDAS

Assimilating New Passive Microwave Data in the NOAA GDAS Assimilating New Passive Microwave Data in the NOAA GDAS Erin Jones 1, Kevin Garrett 1, Eric Maddy 1, Krishna Kumar 1, Sid Boukabara 2 1: RTi @ NOAA/NESDIS/STAR/JCSDA, 2: NOAA/NESDIS/STAR/JCSDA 1 Outline

More information

Nonlinear, Nongaussian Ensemble Data Assimilation with Rank Regression and a Rank Histogram Filter

Nonlinear, Nongaussian Ensemble Data Assimilation with Rank Regression and a Rank Histogram Filter Nonlinear, Nongaussian Ensemble Data Assimilation with Rank Regression and a Rank Histogram Filter Jeff Anderson, NCAR Data Assimilation Research Section pg 1 Schematic of a Sequential Ensemble Filter

More information

Introduction to Computational Neuroscience

Introduction to Computational Neuroscience Introduction to Computational Neuroscience Lecture 10: Brain-Computer Interfaces Ilya Kuzovkin So Far Stimulus So Far So Far Stimulus What are the neuroimaging techniques you know about? Stimulus So Far

More information

Various Approaches to Szroeter s Test for Regression Quantiles

Various Approaches to Szroeter s Test for Regression Quantiles The International Scientific Conference INPROFORUM 2017, November 9, 2017, České Budějovice, 361-365, ISBN 978-80-7394-667-8. Various Approaches to Szroeter s Test for Regression Quantiles Jan Kalina,

More information

Spatial and Temporal Data Fusion for Biosurveillance

Spatial and Temporal Data Fusion for Biosurveillance Spatial and Temporal Data Fusion for Biosurveillance Karen Cheng, David Crary Applied Research Associates, Inc. Jaideep Ray, Cosmin Safta, Mahmudul Hasan Sandia National Laboratories Contact: Ms. Karen

More information

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger

Intelligent Systems. Discriminative Learning. Parts marked by * are optional. WS2013/2014 Carsten Rother, Dmitrij Schlesinger Intelligent Systems Discriminative Learning Parts marked by * are optional 30/12/2013 WS2013/2014 Carsten Rother, Dmitrij Schlesinger Discriminative models There exists a joint probability distribution

More information

A Race Model of Perceptual Forced Choice Reaction Time

A Race Model of Perceptual Forced Choice Reaction Time A Race Model of Perceptual Forced Choice Reaction Time David E. Huber (dhuber@psyc.umd.edu) Department of Psychology, 1147 Biology/Psychology Building College Park, MD 2742 USA Denis Cousineau (Denis.Cousineau@UMontreal.CA)

More information

Model calibration and Bayesian methods for probabilistic projections

Model calibration and Bayesian methods for probabilistic projections ETH Zurich Reto Knutti Model calibration and Bayesian methods for probabilistic projections Reto Knutti, IAC ETH Toy model Model: obs = linear trend + noise(variance, spectrum) 1) Short term predictability,

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL 1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

Mathematical Microbiologists: Why we have to return to our square roots to uncover uncertainty (of measurement) in Quantitative PCR (qpcr)

Mathematical Microbiologists: Why we have to return to our square roots to uncover uncertainty (of measurement) in Quantitative PCR (qpcr) Mathematical Microbiologists: Why we have to return to our square roots to uncover uncertainty (of measurement) in Quantitative PCR (qpcr) J. Ian Stuart George Zahariadis Marina Salvadori Objectives 1.

More information

Tuning and Validating HYCOM s KPP Mixed-Layer. Alan J. Wallcraft and E. Joseph Metzger Naval Research Laboratory

Tuning and Validating HYCOM s KPP Mixed-Layer. Alan J. Wallcraft and E. Joseph Metzger Naval Research Laboratory Tuning and Validating HYCOM s KPP Mixed-Layer Alan J. Wallcraft and E. Joseph Metzger Naval Research Laboratory A. Birol Kara COAPS, Florida State University 2003 Layered Ocean Model Users Workshop February

More information

A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value

A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value SPORTSCIENCE Perspectives / Research Resources A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value Will G Hopkins sportsci.org Sportscience 11,

More information

Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012

Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012 Practical Bayesian Optimization of Machine Learning Algorithms Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012 ... (Gaussian Processes) are inadequate for doing speech and vision. I still think they're

More information

IEEE Proof Web Version

IEEE Proof Web Version IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 4, NO. 4, DECEMBER 2009 1 Securing Rating Aggregation Systems Using Statistical Detectors and Trust Yafei Yang, Member, IEEE, Yan (Lindsay)

More information

Bayesian Nonparametric Methods for Precision Medicine

Bayesian Nonparametric Methods for Precision Medicine Bayesian Nonparametric Methods for Precision Medicine Brian Reich, NC State Collaborators: Qian Guan (NCSU), Eric Laber (NCSU) and Dipankar Bandyopadhyay (VCU) University of Illinois at Urbana-Champaign

More information

Data Mining in Bioinformatics Day 4: Text Mining

Data Mining in Bioinformatics Day 4: Text Mining Data Mining in Bioinformatics Day 4: Text Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen Karsten Borgwardt: Data Mining in Bioinformatics, Page 1 What is text mining?

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

A Race Model of Perceptual Forced Choice Reaction Time

A Race Model of Perceptual Forced Choice Reaction Time A Race Model of Perceptual Forced Choice Reaction Time David E. Huber (dhuber@psych.colorado.edu) Department of Psychology, 1147 Biology/Psychology Building College Park, MD 2742 USA Denis Cousineau (Denis.Cousineau@UMontreal.CA)

More information

Neural circuits PSY 310 Greg Francis. Lecture 05. Rods and cones

Neural circuits PSY 310 Greg Francis. Lecture 05. Rods and cones Neural circuits PSY 310 Greg Francis Lecture 05 Why do you need bright light to read? Rods and cones Photoreceptors are not evenly distributed across the retina 1 Rods and cones Cones are most dense in

More information

Regression Discontinuity Analysis

Regression Discontinuity Analysis Regression Discontinuity Analysis A researcher wants to determine whether tutoring underachieving middle school students improves their math grades. Another wonders whether providing financial aid to low-income

More information

The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit. Uncertainty, Risk and Probability: Fundamental Definitions and Concepts

The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit. Uncertainty, Risk and Probability: Fundamental Definitions and Concepts The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit Uncertainty, Risk and Probability: Fundamental Definitions and Concepts What Is Uncertainty? A state of having limited knowledge

More information

The impact of the Wind guess, the Tracer size and the Temporal gap between images in the extraction of AMVs

The impact of the Wind guess, the Tracer size and the Temporal gap between images in the extraction of AMVs The impact of the Wind guess, the Tracer size and the Temporal gap between images in the extraction of AMVs 18th June 2014 Twelfth International Winds Workshop Copenhagen, Denmark Javier García-Pereda

More information

Analysis of acgh data: statistical models and computational challenges

Analysis of acgh data: statistical models and computational challenges : statistical models and computational challenges Ramón Díaz-Uriarte 2007-02-13 Díaz-Uriarte, R. acgh analysis: models and computation 2007-02-13 1 / 38 Outline 1 Introduction Alternative approaches What

More information

Learning to Identify Irrelevant State Variables

Learning to Identify Irrelevant State Variables Learning to Identify Irrelevant State Variables Nicholas K. Jong Department of Computer Sciences University of Texas at Austin Austin, Texas 78712 nkj@cs.utexas.edu Peter Stone Department of Computer Sciences

More information

Remarks on Bayesian Control Charts

Remarks on Bayesian Control Charts Remarks on Bayesian Control Charts Amir Ahmadi-Javid * and Mohsen Ebadi Department of Industrial Engineering, Amirkabir University of Technology, Tehran, Iran * Corresponding author; email address: ahmadi_javid@aut.ac.ir

More information

INVESTIGATION OF ROUNDOFF NOISE IN IIR DIGITAL FILTERS USING MATLAB

INVESTIGATION OF ROUNDOFF NOISE IN IIR DIGITAL FILTERS USING MATLAB Clemson University TigerPrints All Theses Theses 5-2009 INVESTIGATION OF ROUNDOFF NOISE IN IIR DIGITAL FILTERS USING MATLAB Sierra Williams Clemson University, sierraw@clemson.edu Follow this and additional

More information

Update of NOAA (NCEP, Climate Test Bed) Seasonal Forecast Activities

Update of NOAA (NCEP, Climate Test Bed) Seasonal Forecast Activities Update of NOAA (NCEP, Climate Test Bed) Seasonal Forecast Activities Stephen J. Lord Director NCEP Environmental Modeling Center NCEP: where America s climate, weather, and ocean services begin 1 Overview

More information

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN Outline 1. Review sensitivity and specificity 2. Define an ROC curve 3. Define AUC 4. Non-parametric tests for whether or not the test is informative 5. Introduce the binormal ROC model 6. Discuss non-parametric

More information

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals

More information

Internal Model Calibration Using Overlapping Data

Internal Model Calibration Using Overlapping Data Internal Model Calibration Using Overlapping Data Ralph Frankland, James Sharpe, Gaurang Mehta and Rishi Bhatia members of the Extreme Events Working Party 23 November 2017 Agenda Problem Statement Overview

More information

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Arun Advani and Tymon Sªoczy«ski 13 November 2013 Background When interested in small-sample properties of estimators,

More information

Block-upscaling of transport in heterogeneous aquifers

Block-upscaling of transport in heterogeneous aquifers 158 Calibration and Reliability in Groundwater Modelling: From Uncertainty to Decision Making (Proceedings of ModelCARE 2005, The Hague, The Netherlands, June 2005). IAHS Publ. 304, 2006. Block-upscaling

More information

Bayesian approaches to handling missing data: Practical Exercises

Bayesian approaches to handling missing data: Practical Exercises Bayesian approaches to handling missing data: Practical Exercises 1 Practical A Thanks to James Carpenter and Jonathan Bartlett who developed the exercise on which this practical is based (funded by ESRC).

More information

Calculating volume scotomas for patients with central scotomas

Calculating volume scotomas for patients with central scotomas APPENDIX Calculating volume scotomas for patients with central scotomas This appendix provides formulas to derive the shape and extent of volume scotomas for some simple examples of bilateral central field

More information

Missy Wittenzellner Big Brother Big Sister Project

Missy Wittenzellner Big Brother Big Sister Project Missy Wittenzellner Big Brother Big Sister Project Evaluation of Normality: Before the analysis, we need to make sure that the data is normally distributed Based on the histogram, our match length data

More information

Search e Fall /18/15

Search e Fall /18/15 Sample Efficient Policy Click to edit Master title style Search Click to edit Emma Master Brunskill subtitle style 15-889e Fall 2015 11 Sample Efficient RL Objectives Probably Approximately Correct Minimizing

More information

Bayes Linear Statistics. Theory and Methods

Bayes Linear Statistics. Theory and Methods Bayes Linear Statistics Theory and Methods Michael Goldstein and David Wooff Durham University, UK BICENTENNI AL BICENTENNIAL Contents r Preface xvii 1 The Bayes linear approach 1 1.1 Combining beliefs

More information

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5 PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science Homework 5 Due: 21 Dec 2016 (late homeworks penalized 10% per day) See the course web site for submission details.

More information

Should a Normal Imputation Model Be Modified to Impute Skewed Variables?

Should a Normal Imputation Model Be Modified to Impute Skewed Variables? Sociological Methods and Research, 2013, 42(1), 105-138 Should a Normal Imputation Model Be Modified to Impute Skewed Variables? Paul T. von Hippel Abstract (169 words) Researchers often impute continuous

More information

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample

More information

HPV Testing What are EQAS and QC data telling us?

HPV Testing What are EQAS and QC data telling us? HPV Testing What are EQAS and QC data telling us? Vincini GA and Cabuang LM NRL, Melbourne, Australia Pathology Update 24 February 2019 HPV Screening 2017 Molecular testing for human papillomavirus (HPV)

More information

Using AUC and Accuracy in Evaluating Learning Algorithms

Using AUC and Accuracy in Evaluating Learning Algorithms 1 Using AUC and Accuracy in Evaluating Learning Algorithms Jin Huang Charles X. Ling Department of Computer Science The University of Western Ontario London, Ontario, Canada N6A 5B7 fjhuang, clingg@csd.uwo.ca

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

Simulating the Tumor Growth with Cellular Automata Models

Simulating the Tumor Growth with Cellular Automata Models Simulating the Tumor Growth with Cellular Automata Models S. Zouhri Université Hassan II- Mohammédia, Faculté des Sciences Ben M'sik Département de Mathématiques, B.7955, Sidi Othmane, Casablanca, Maroc

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Marianne (Marnie) Bertolet Department of Statistics Carnegie Mellon University Abstract Linear mixed-effects (LME)

More information

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model Nevitt & Tam A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model Jonathan Nevitt, University of Maryland, College Park Hak P. Tam, National Taiwan Normal University

More information

COPYRIGHTED MATERIAL. Auditing simple risk assessments

COPYRIGHTED MATERIAL. Auditing simple risk assessments A Pocket Guide to Risk Mathematics 11 Auditing simple risk assessments This chapter introduces the most basic ideas of probability and risk and shows how they can help us audit simple risk assessments.

More information

A validation study of after reconstitution stability of diabetes: level 1 and diabetes level 2 controls

A validation study of after reconstitution stability of diabetes: level 1 and diabetes level 2 controls A validation study of after reconstitution stability of diabetes: level 1 and diabetes level 2 controls Shyamali Pal R B Diagnostic Private Limited, Lake Town, Kolkata, India INFO ABSTRACT Corresponding

More information

Observational studies; descriptive statistics

Observational studies; descriptive statistics Observational studies; descriptive statistics Patrick Breheny August 30 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 38 Observational studies Association versus causation

More information

ISIR: Independent Sliced Inverse Regression

ISIR: Independent Sliced Inverse Regression ISIR: Independent Sliced Inverse Regression Kevin B. Li Beijing Jiaotong University Abstract In this paper we consider a semiparametric regression model involving a p-dimensional explanatory variable x

More information

Using the Patient Reported Outcome Measures Tool (PROMT)

Using the Patient Reported Outcome Measures Tool (PROMT) Using the Patient Reported Outcome Measures Tool (PROMT) Using the Patient Reported Outcome Measures Tool DH INFORMATION READER BOX Policy HR / Workforce Management Planning / Clinical Document Purpose

More information

Comparison of volume estimation methods for pancreatic islet cells

Comparison of volume estimation methods for pancreatic islet cells Comparison of volume estimation methods for pancreatic islet cells Jiří Dvořák a,b, Jan Švihlíkb,c, David Habart d, and Jan Kybic b a Department of Probability and Mathematical Statistics, Faculty of Mathematics

More information

Statistical Assessment of the Global Regulatory Role of Histone. Acetylation in Saccharomyces cerevisiae. (Support Information)

Statistical Assessment of the Global Regulatory Role of Histone. Acetylation in Saccharomyces cerevisiae. (Support Information) Statistical Assessment of the Global Regulatory Role of Histone Acetylation in Saccharomyces cerevisiae (Support Information) Authors: Guo-Cheng Yuan, Ping Ma, Wenxuan Zhong and Jun S. Liu Linear Relationship

More information

The Context of Detection and Attribution

The Context of Detection and Attribution 10.2.1 The Context of Detection and Attribution In IPCC Assessments, detection and attribution involve quantifying the evidence for a causal link between external drivers of climate change and observed

More information

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604

Measurement and Descriptive Statistics. Katie Rommel-Esham Education 604 Measurement and Descriptive Statistics Katie Rommel-Esham Education 604 Frequency Distributions Frequency table # grad courses taken f 3 or fewer 5 4-6 3 7-9 2 10 or more 4 Pictorial Representations Frequency

More information

The Good Judgment Project: A Large Scale Test of Different Methods of Combining Expert Predictions

The Good Judgment Project: A Large Scale Test of Different Methods of Combining Expert Predictions AAAI Technical Report FS-12-06 Machine Aggregation of Human Judgment The Good Judgment Project: A Large Scale Test of Different Methods of Combining Expert Predictions Lyle Ungar, Barb Mellors, Ville Satopää,

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Outlier Analysis. Lijun Zhang

Outlier Analysis. Lijun Zhang Outlier Analysis Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Extreme Value Analysis Probabilistic Models Clustering for Outlier Detection Distance-Based Outlier Detection Density-Based

More information

For general queries, contact

For general queries, contact Much of the work in Bayesian econometrics has focused on showing the value of Bayesian methods for parametric models (see, for example, Geweke (2005), Koop (2003), Li and Tobias (2011), and Rossi, Allenby,

More information

A Day in the Life of a Transactive Grid

A Day in the Life of a Transactive Grid A Day in the Life of a Transactive Grid GridWise Architecture Council Presented to: National Council of State Legislators Webinar March 5, 2015 Mark Knight and Tom Sloan GWAC Members Ron Melton GWAC Administrator

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:1.138/nature1216 Supplementary Methods M.1 Definition and properties of dimensionality Consider an experiment with a discrete set of c different experimental conditions, corresponding to all distinct

More information

Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50

Statistics: Interpreting Data and Making Predictions. Interpreting Data 1/50 Statistics: Interpreting Data and Making Predictions Interpreting Data 1/50 Last Time Last time we discussed central tendency; that is, notions of the middle of data. More specifically we discussed the

More information

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation Aldebaro Klautau - http://speech.ucsd.edu/aldebaro - 2/3/. Page. Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation ) Introduction Several speech processing algorithms assume the signal

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

Statistics and Probability

Statistics and Probability Statistics and a single count or measurement variable. S.ID.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). S.ID.2: Use statistics appropriate to the shape

More information

Correcting for the Sampling Bias Problem in Spike Train Information Measures

Correcting for the Sampling Bias Problem in Spike Train Information Measures J Neurophysiol 98: 1064 1072, 2007. First published July 5, 2007; doi:10.1152/jn.00559.2007. Correcting for the Sampling Bias Problem in Spike Train Information Measures Stefano Panzeri, Riccardo Senatore,

More information

Popper If data follows a trend that is not linear, we cannot make a prediction about it.

Popper If data follows a trend that is not linear, we cannot make a prediction about it. - Popper 12 1. If data follows a trend that is not linear, we cannot make a prediction about it. 5.5 Non-Linear Methods Many times a scatter-plot reveals a curved pattern instead of a linear pattern. We

More information

Lean Body Mass and Muscle Mass What's the Difference?

Lean Body Mass and Muscle Mass What's the Difference? Lean Body Mass and Muscle Mass What's the Difference? PUBLISHED ON SEPTEMBER 24, 2015 BY RYAN WALTERS Consider the following three statements: I m not trying to get huge; I just want to put on five pounds

More information

STA Module 9 Confidence Intervals for One Population Mean

STA Module 9 Confidence Intervals for One Population Mean STA 2023 Module 9 Confidence Intervals for One Population Mean Learning Objectives Upon completing this module, you should be able to: 1. Obtain a point estimate for a population mean. 2. Find and interpret

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

Health impact assessment in the framework of global and regional AQ modelling

Health impact assessment in the framework of global and regional AQ modelling Health impact assessment in the framework of global and regional AQ modelling Rita Van Dingenen European Commission, Joint Research Centre Air pollution and health Ambient air pollution (individual) risk

More information

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES Amit Teller 1, David M. Steinberg 2, Lina Teper 1, Rotem Rozenblum 2, Liran Mendel 2, and Mordechai Jaeger 2 1 RAFAEL, POB 2250, Haifa, 3102102, Israel

More information

Appendix III Individual-level analysis

Appendix III Individual-level analysis Appendix III Individual-level analysis Our user-friendly experimental interface makes it possible to present each subject with many choices in the course of a single experiment, yielding a rich individual-level

More information

Results Report. Welcome to Your ABT Report! Introduction to the ABT Report

Results Report. Welcome to Your ABT Report! Introduction to the ABT Report Results Report Athlete Name: SHEPPARD, JOSEPH Date of Blood Draw: Dec 08, 2017 Panel: ABT Gold Panel ABT Expert: Dr. Rock Welcome to Your ABT Report! Thank you for trusting AthleteBloodTest.com to be a

More information

Intelligent Edge Detector Based on Multiple Edge Maps. M. Qasim, W.L. Woon, Z. Aung. Technical Report DNA # May 2012

Intelligent Edge Detector Based on Multiple Edge Maps. M. Qasim, W.L. Woon, Z. Aung. Technical Report DNA # May 2012 Intelligent Edge Detector Based on Multiple Edge Maps M. Qasim, W.L. Woon, Z. Aung Technical Report DNA #2012-10 May 2012 Data & Network Analytics Research Group (DNA) Computing and Information Science

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still

More information

Handling Partial Preferences in the Belief AHP Method: Application to Life Cycle Assessment

Handling Partial Preferences in the Belief AHP Method: Application to Life Cycle Assessment Handling Partial Preferences in the Belief AHP Method: Application to Life Cycle Assessment Amel Ennaceur 1, Zied Elouedi 1, and Eric Lefevre 2 1 University of Tunis, Institut Supérieur de Gestion de Tunis,

More information

Econometric Game 2012: infants birthweight?

Econometric Game 2012: infants birthweight? Econometric Game 2012: How does maternal smoking during pregnancy affect infants birthweight? Case A April 18, 2012 1 Introduction Low birthweight is associated with adverse health related and economic

More information