Financial Risk Forecasting Chapter 8 Backtesting and stresstesting

Similar documents
MLE #8. Econ 674. Purdue University. Justin L. Tobias (Purdue) MLE #8 1 / 20

- Decide on an estimator for the parameter. - Calculate distribution of estimator; usually involves unknown parameter

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 1 Review Questions

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

The Economics of tobacco and other addictive goods Hurley, pp

THE BOND RATING EXPERIENCE OF SOVEREIGN ISSUERS : AN APPLICATION OF MARKOV CHAIN ANALYSIS

Pooling Subjective Confidence Intervals

EXERCISE: HOW TO DO POWER CALCULATIONS IN OPTIMAL DESIGN SOFTWARE

Sleeping Beauty is told the following:

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

IMPACT OF THE PRIMARY MARKET ON ECONOMIC GROWTH, PRODUCTIVITY AND ENTREPRENEURSHIP: A CROSS COUNTRY ANALYSIS

References. Christos A. Ioannou 2/37

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Russian Journal of Agricultural and Socio-Economic Sciences, 3(15)

Some Thoughts on the Principle of Revealed Preference 1

Placebo and Belief Effects: Optimal Design for Randomized Trials

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Multiple Regression Analysis

Fundamental Clinical Trial Design

Remarks on Bayesian Control Charts

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Tilburg University. Comparing predictions and outcomes Das, J.W.M.; Dominitz, J.; van Soest, Arthur

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

Sampling for Impact Evaluation. Maria Jones 24 June 2015 ieconnect Impact Evaluation Workshop Rio de Janeiro, Brazil June 22-25, 2015

Session 1: Dealing with Endogeneity

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lessons in biostatistics

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Various Approaches to Szroeter s Test for Regression Quantiles

CHAPTER 3 METHOD AND PROCEDURE

A Brief Introduction to Bayesian Statistics

A re-randomisation design for clinical trials

Two-sample Categorical data: Measuring association

Homework Assignment Section 2

An Understanding of Role of Heuristic on Investment Decisions

OLS Regression with Clustered Data

Fertility and Development: Evidence from North Africa

Example 7.2. Autocorrelation. Pilar González and Susan Orbe. Dpt. Applied Economics III (Econometrics and Statistics)

Score Tests of Normality in Bivariate Probit Models

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA

Doctors Fees in Ireland Following the Change in Reimbursement: Did They Jump?

REMARKS ON STATISTICAL INFERENCE FOR STATISTICAL DECISIONS

A Critique of Two Methods for Assessing the Nutrient Adequacy of Diets

Section 6: Analysing Relationships Between Variables

Macroeconometric Analysis. Chapter 1. Introduction

Bayesian and Frequentist Approaches

Session 3: Dealing with Reverse Causality

Econometric Analysis of RSG at the Midwest ISO Presentation to MISO RSG TF Carmel, IN July 8, 2009

Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note

I. Introduction and Data Collection B. Sampling. 1. Bias. In this section Bias Random Sampling Sampling Error

Reading and maths skills at age 10 and earnings in later life: a brief analysis using the British Cohort Study

Binary Diagnostic Tests Two Independent Samples

6. Unusual and Influential Data

Modeling optimal cervical cancer prevention strategies in Nigeria

26:010:557 / 26:620:557 Social Science Research Methods

(Please check against delivery) EMBARGOED FOR 20 APRIL 2017 AT 0900 EST/1500 CET. Ladies and gentlemen,

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

Cochrane Pregnancy and Childbirth Group Methodological Guidelines

Full title: A likelihood-based approach to early stopping in single arm phase II cancer clinical trials

RANDOMIZATION. Outline of Talk

DTI C TheUniversityof Georgia

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5

5 $3 billion per disease

A STUDY OF HIGHER MENTAL ABILITY IN SCIENCE AMONG JUNIOR COLLEGE STUDENTS OF JALNA DISTRICT

Power of a Clinical Study

On the diversity principle and local falsifiability

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Enumerative and Analytic Studies. Description versus prediction

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Sample Size Considerations. Todd Alonzo, PhD

One-Way Independent ANOVA

Statistics Mathematics 243

INVESTIGATION SLEEPLESS DRIVERS

ESTUDIOS SOBRE LA ECONOMIA ESPAÑOLA

Randomization as a Tool for Development Economists. Esther Duflo Sendhil Mullainathan BREAD-BIRS Summer school

Communication Research Practice Questions

Game Theoretic Statistics

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Statistical Power Sampling Design and sample Size Determination

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

Measurement and meaningfulness in Decision Modeling

Statistical inference provides methods for drawing conclusions about a population from sample data.

How was your experience working in a group on the Literature Review?

RAG Rating Indicator Values

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study

Getting started with Eviews 9 (Volume IV)

Stat Wk 9: Hypothesis Tests and Analysis

Estimating the number of components with defects post-release that showed no defects in testing

Effects of Overweight Samples and Rounding of Grade Percentages on Peanut Grades and Prices

STA Learning Objectives. What is Population Proportion? Module 7 Confidence Intervals for Proportions

Two Factor Analysis of Variance

Computerized Mastery Testing

STA Module 7 Confidence Intervals for Proportions

DRAFT (Final) Concept Paper On choosing appropriate estimands and defining sensitivity analyses in confirmatory clinical trials

Transcription:

Financial Risk Forecasting Chapter 8 Backtesting and stresstesting Jon Danielsson 2017 London School of Economics To accompany Financial Risk Forecasting www.financialriskforecasting.com Published by Wiley 2011 Version 3.1, November 2017 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 1 of 84

Financial Risk Forecasting 2011,2017 Jon Danielsson, page 2 of 84

Introduction When making a risk forecast (or any type of forecast) It is important to validate the forecast Ex post ideally this is done after we make them using operational criteria Ex ante but often we have to do it before VaRs are only observed infrequently, a long period of time would be required Backtesting evaluates VaR forecasts by checking how a VaR forecast model performs over a period in the past in-sample Financial Risk Forecasting 2011,2017 Jon Danielsson, page 3 of 84

The focus of this chapter is on Backtesting Application of backtesting Significance of backtests Bernoulli coverage test Testing the independence of violations Joint test Loss-function-based backtests Expected shortfall backtesting Problems with backtesting Stress testing Financial Risk Forecasting 2011,2017 Jon Danielsson, page 4 of 84

Notation W T W E Testing window size Estimation window size η Indicates whether a violation occurs υ Count of violations Financial Risk Forecasting 2011,2017 Jon Danielsson, page 5 of 84

Backtesting Financial Risk Forecasting 2011,2017 Jon Danielsson, page 6 of 84

What is backtesting? Procedure to compare various risk models, ex-ante (that is in-sample) Take ex-ante VaR forecasts from a particular model and compare them with ex post realized return (i.e., historical observations) Whenever losses exceed VaR, a VaR violation is said to have occurred Can analyze violations in various ways Financial Risk Forecasting 2011,2017 Jon Danielsson, page 7 of 84

Forecasting VaR - Example Imagine you have 10 years of data, from 2007 to 2016 And using the first 2 years of that Are trying to forecast risk for 1 January 2009 You will be using the 500 days in 2008 and 2006 to make the forecast Financial Risk Forecasting 2011,2017 Jon Danielsson, page 8 of 84

The 500 trading days in 2007 and 2008 constitute the first estimation window W E is then moved up by one day to obtain the risk forecast for the second day of 2009, etc. Start End VaR forecast 1/1/2007 31/12/2008 VaR(1/1/2009) 2/1/2007 1/1/2009 VaR(2/1/2009)... 31/12/2014 30/12/2016 VaR(31/12/2016) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 9 of 84

Usefulness of backtesting Identifying the weaknesses of risk forecasting methods Hence providing avenues for improvement Not very informative about the causes of weaknesses Models that perform poorly during backtesting should question 1. model assumptions 2. parameter estimates Backtesting can prevent underestimation and overestimation of risk Financial Risk Forecasting 2011,2017 Jon Danielsson, page 10 of 84

Estimation window (W E ): the number of observations used to forecast risk. If different procedures or assumptions are compared, the estimation window is set to whichever one needs the highest number of observations Testing window (W T ): the data sample over which risk is forecast (i.e., the days where we have made a VaR forecast) T = W E +W T Financial Risk Forecasting 2011,2017 Jon Danielsson, page 11 of 84

t = 1 Entire data sample t = T Financial Risk Forecasting 2011,2017 Jon Danielsson, page 12 of 84

t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 13 of 84

t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 14 of 84

t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) t = 3 Third estimation window t = W E +2 VaR(W E +3) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 15 of 84

t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) t = 3 Third estimation window t = W E +2 VaR(W E +3). Financial Risk Forecasting 2011,2017 Jon Danielsson, page 16 of 84

t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) t = 3 Third estimation window t = W E +2 VaR(W E +3). t = T W E Last estimation window t = T 1 VaR(T) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 17 of 84

VaR forecasts can be compared with the actual outcome: the daily 2007 to 2016 returns are already known Instead of referring to calendar dates (e.g., 1/1/2007), refer to days by indexing the returns, assuming 250 trading days per year: y 1 is the return on 1/1/2007 y 2,500 is the return on the last day, 31/12/2016 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 18 of 84

The estimation window W E is set at 500 days, and the testing window W T is therefore 2,000 days t t +W E 1 VaR(t +W E ) 1 500 VaR(501) 2 501 VaR(502)... 1,999 2,499 VaR(2,500) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 19 of 84

Violation ratios Financial Risk Forecasting 2011,2017 Jon Danielsson, page 20 of 84

VaR violation If a financial loss on a particular day exceeds the VaR forecast, then the VaR limit is said to have been violated VaR violation: an event such that: { 1, if y t VaR t η t = 0, if y t > VaR t. Financial Risk Forecasting 2011,2017 Jon Danielsson, page 21 of 84

Counting violations Count the violations and non-violations υ 1 υ 0 W T υ 1 = t=1 η t υ 0 =W T υ 1 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 22 of 84

Violation ratios The main tools used in backtesting are violation ratios The observed number of VaR violations are compared with the expected Violation ratio: VR = Observed number of violations Expected number of violations = υ 1 p W T If the violation ratio is greater than one the VaR model underforecasts risk If smaller than one the model overforecasts risk Financial Risk Forecasting 2011,2017 Jon Danielsson, page 23 of 84

Estimation window length W E determined by the choice of VaR model and probability level Different methods have different data requirements EWMA c. 30 days HS at least 300 days for VaR 1% GARCH 500 or more days Financial Risk Forecasting 2011,2017 Jon Danielsson, page 24 of 84

Picking W E The estimation window should be sufficiently large to accommodate the most stringent data criteria So if comparing EWMA and HS, use at least 300 for both Financial Risk Forecasting 2011,2017 Jon Danielsson, page 25 of 84

Even within the same method, it may be helpful to compare different window lengths Maybe compare HS with 300, 500 and 1000 days Or GARCH with 500 and 5000 days Financial Risk Forecasting 2011,2017 Jon Danielsson, page 26 of 84

Testing window length VaR violations are infrequent events With a 1% VaR, a violation is expected once every 100 days, so that 2.5 violations are expected per year So the actual sample size of violations is quite small Causing difficulties for statistical inference At least 10 violations for reliable statistical analysis, or 4 years of data Preferably more Financial Risk Forecasting 2011,2017 Jon Danielsson, page 27 of 84

Violation ratios VR=1 is expected, but how can we ascertain whether any other value is statistically significant? A useful rule of thumb If VR [0.8,1.2] the model is good If VR [0.5,0.8] or VR [1.2,1.5] the model is acceptable If VR [0.3,0.5] or VR [1.5,2] the model is bad If VR<0.5 or VR>2 the model is useless Both bounds narrow with increasing testing window lengths As a first attempt Plot the actual returns and VaR together And then do a statistical test Financial Risk Forecasting 2011,2017 Jon Danielsson, page 28 of 84

Application of backtesting Financial Risk Forecasting 2011,2017 Jon Danielsson, page 29 of 84

Volatility and VaR: extreme example 100% 60% Returns 20% 20% 60% 100% 0 500 1000 1500 2000 day Financial Risk Forecasting 2011,2017 Jon Danielsson, page 30 of 84

Volatility and VaR: extreme example 100% 60% EWMA $130 $110 $90 Returns 20% 20% 60% $70 $50 $30 VaR 100% $10 0 500 1000 1500 2000 day Financial Risk Forecasting 2011,2017 Jon Danielsson, page 31 of 84

Volatility and VaR: extreme example 100% 60% EWMA MA 500 $130 $110 $90 Returns 20% 20% 60% $70 $50 $30 VaR 100% $10 0 500 1000 1500 2000 day Financial Risk Forecasting 2011,2017 Jon Danielsson, page 32 of 84

Volatility and VaR: extreme example 100% 60% EWMA MA 500 GARCH $130 $110 $90 Returns 20% 20% 60% $70 $50 $30 VaR 100% $10 0 500 1000 1500 2000 day Financial Risk Forecasting 2011,2017 Jon Danielsson, page 33 of 84

Volatility and VaR: extreme example 100% 60% EWMA MA 500 GARCH HS 300 $130 $110 $90 Returns 20% 20% 60% $70 $50 $30 VaR 100% $10 0 500 1000 1500 2000 day Financial Risk Forecasting 2011,2017 Jon Danielsson, page 34 of 84

Volatility and VaR: extreme example 100% 60% HS 300 HS 500 $130 $110 $90 Returns 20% 20% 60% $70 $50 $30 VaR 100% $10 0 500 1000 1500 2000 day Financial Risk Forecasting 2011,2017 Jon Danielsson, page 35 of 84

SP-500 10% 5% Returns 0% 5% 2000 2005 2010 2015 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 36 of 84

SP-500 10% EWMA $120 $100 Returns 5% 0% $80 $60 VaR 5% $40 $20 2000 2005 2010 2015 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 37 of 84

SP-500 10% EWMA MA 1000 $120 $100 Returns 5% 0% $80 $60 VaR 5% $40 $20 2000 2005 2010 2015 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 38 of 84

SP-500 10% EWMA MA 1000 GARCH $140 $120 Returns 5% 0% $100 $80 $60 VaR 5% $40 $20 2000 2005 2010 2015 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 39 of 84

SP-500 EWMA MA 1000 GARCH HS 1000 $140 10% $120 Returns 5% 0% $100 $80 $60 VaR 5% $40 $20 2000 2005 2010 2015 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 40 of 84

SP-500 HS 1000 HS 300 10% $80 Returns 5% 0% $60 $40 VaR 5% $20 2000 2005 2010 2015 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 41 of 84

Zoom into 2003-2006 2% Returns 0% 2% 2003 2004 2005 2006 2007 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 42 of 84

Zoom into 2003-2006 EWMA $40 Returns 2% 0% 2% $35 $30 $25 $20 $15 VaR $10 2003 2004 2005 2006 2007 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 43 of 84

Zoom into 2003-2006 EWMA MA 1000 $40 Returns 2% 0% 2% $35 $30 $25 $20 $15 VaR $10 2003 2004 2005 2006 2007 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 44 of 84

Zoom into 2003-2006 EWMA MA 1000 GARCH 2% $40 Returns 0% $30 VaR 2% $20 $10 2003 2004 2005 2006 2007 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 45 of 84

Zoom into 2003-2006 EWMA MA 1000 GARCH HS 1000 2% $40 Returns 0% $30 VaR 2% $20 $10 2003 2004 2005 2006 2007 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 46 of 84

Zoom into 2003-2006 HS 1000 HS 300 $35 2% $30 Returns 0% $25 $20 VaR 2% $15 $10 2003 2004 2005 2006 2007 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 47 of 84

Zoom into crisis 10% 5% Returns 0% 5% 2007 2008 2009 2010 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 48 of 84

10% EWMA Zoom into crisis $120 $100 Returns 5% 0% $80 $60 VaR 5% $40 $20 2007 2008 2009 2010 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 49 of 84

Zoom into crisis 10% EWMA MA 1000 $120 $100 Returns 5% 0% $80 $60 VaR 5% $40 $20 2007 2008 2009 2010 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 50 of 84

Zoom into crisis 10% EWMA MA 1000 GARCH $140 $120 Returns 5% 0% $100 $80 $60 VaR 5% $40 $20 2007 2008 2009 2010 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 51 of 84

Zoom into crisis EWMA MA 1000 GARCH HS 1000 $140 10% $120 Returns 5% 0% $100 $80 $60 VaR 5% $40 $20 2007 2008 2009 2010 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 52 of 84

Zoom into crisis HS 1000 HS 300 10% $80 Returns 5% 0% $60 $40 VaR 5% $20 2007 2008 2009 2010 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 53 of 84

Significance of Backtests Financial Risk Forecasting 2011,2017 Jon Danielsson, page 54 of 84

Testing violations We can test whether we get the expected number of violations and if there are patterns in the violations enumerate 1. The number of violations (tested by the unconditional coverage) 2. Clustering (tested by independence tests) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 55 of 84

Distribution of violations We have a sequence of returns, VaR and violations η t η VaR y days 1 0 1 0 0 1 0 0 0 0 1.9 2.0 2.1 2.0 2.1 2.2 2.3 2.2 2.3 2.4-2.1 1.4-5.2 2.3 0.4-3.7 4.1 0.1 3.2-0.2 1 2 3 4 5 6 7 8 9 10 The {η t } T t=w E +1 is a sequence of 1 or 0 And hence follows the Bernoulli distribution Note that the sequence starts at W E +1 and ends at T and is hence W T long Financial Risk Forecasting 2011,2017 Jon Danielsson, page 56 of 84

Estimation The sample probability ˆp can be estimated by the average number of violations ˆp = υ 1 W t The Bernoulli density (on day t) is given by: (1 p) 1 ηt (p) ηt, η t = 0,1. Financial Risk Forecasting 2011,2017 Jon Danielsson, page 57 of 84

Bernoulli coverage test Financial Risk Forecasting 2011,2017 Jon Danielsson, page 58 of 84

Unconditional coverage Does the expected number of violations, as given by p match the observed number of violations For a VaR(1%) backtest, we would expect to observe a violation 1% of the time If, violations are observed more often the VaR model is underestimating risk And similarly if we observe too few violations Financial Risk Forecasting 2011,2017 Jon Danielsson, page 59 of 84

Bernoulli coverage test We can therefore test if the sequence {η t } T t=w E +1 expected number of 1 and 0 Use the Bernoulli coverage test The null hypothesis for VaR violations is: has the H 0 : η B(p), where B stands for the Bernoulli distribution Financial Risk Forecasting 2011,2017 Jon Danielsson, page 60 of 84

Recall from chapter 2 that the likelihood function is the product of the densities, and therefore The likelihood function is given by: L U (ˆp) = T t=w E +1 (1 ˆp) 1 ηt (ˆp) ηt = (1 ˆp) υ 0 (ˆp) υ 1 Denote this as the unrestricted likelihood function Because it uses estimated probability ˆp Under H 0, p = ˆp, so the restricted likelihood function is: L R (p) = T t=w E +1 = (1 p) υ 0 (p) υ 1 (1 p) 1 ηt (p) ηt Financial Risk Forecasting 2011,2017 Jon Danielsson, page 61 of 84

We can use a likelihood ratio (LR) test to see whether L R = L U or, equivalently, whether p = ˆp: LR = 2(logL U (p) logl R (ˆp)) = 2log (1 ˆp)υ 0 (ˆp) υ 1 (1 p) υ 0 (p) υ 1 asymptotic χ 2 (1) Choosing a 5% significance level for the test, the null hypothesis is rejected if LR > 3.84 Matlab chi2inv (1 0.05,1) 3. 8415 R qchisq (p=1 0.05,df=1) 3. 841459 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 62 of 84

Bernoulli coverage test Matlab function res=bern test (p,v) a=pˆ(sum(v)) (1 p)ˆ( length (v) sum(v )); b=(sum(v)/ length (v))ˆ(sum(v)) (1 (sum(v)/ length (v)))ˆ( length (v) sum(v )); res= 2 log (a/b); end R bern test=function (p,v){ lv=length (v) sv=sum(v) al=log (p) sv+log(1 p) ( lv sv) bl=log (sv/lv ) sv +log(1 sv/lv ) (lv sv) return( 2 (al bl )) } Financial Risk Forecasting 2011,2017 Jon Danielsson, page 63 of 84

Independence property Financial Risk Forecasting 2011,2017 Jon Danielsson, page 64 of 84

Distribution of violations Suppose the violations cluster η VaR y days 0 1 1 1 0 0 0 0 0 0 2.0 1.9 2.1 2.2 2.1 2.0 2.3 2.2 2.3 2.4 1.4-2.1-5.2-3.7 0.4 2.3 4.1 0.1 3.2-0.2 1 2 3 4 5 6 7 8 9 10 Then we are violating the independence property Financial Risk Forecasting 2011,2017 Jon Danielsson, page 65 of 84

Independence test Do two violations follow each other? They should not because If they do, we can predict a violation today if there was one yesterday And a good VaR model would have increased the VaR forecast following a violation Financial Risk Forecasting 2011,2017 Jon Danielsson, page 66 of 84

The probabilities of two consecutive violations is p 11 The probability of a violation if there was no violation on the previous day p 01 and and p 10 p 00 More generally, the probability that: p i,j = Pr(η t = i η t 1 = j) where i and j are either 0 or 1 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 67 of 84

The first-order transition probability matrix is defined as ( ) 1 p01 p Π 1 = 01 1 p 11 p 11 Show the likelihood of all 4 possible outcomes The restricted likelihood function comes from p L R (Π 1 ) = (1 p 01 ) υ 00 p υ 01 01 (1 p 11) υ 10 p υ 11 11 υ ij is the number of observations where j follows i Financial Risk Forecasting 2011,2017 Jon Danielsson, page 68 of 84

Under the null hypothesis of no clustering, the probability of a violation tomorrow does not depend on today seeing a violation Then, p 01 = p 11 = p and the estimated transition matrix is simply: ( ) 1 ˆp ˆp ˆΠ 0 = 1 ˆp ˆp where So ˆp = υ 01 +υ 11 υ 00 +υ 10 +υ 01 +υ 11 L U (ˆΠ 0 ) = (1 ˆp) υ 00+υ 10ˆp υ 01+υ 11 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 69 of 84

Likelihood ratio test The likelihood ratio test: ( LR = 2 logl U (ˆΠ 0 ) logl R (ˆΠ 1 ) ) asymptotic χ 2 (1) Financial Risk Forecasting 2011,2017 Jon Danielsson, page 70 of 84

The main problem with tests of this sort is that they must specify the particular way in which independence is breached However, there are many possible ways in which the independence property is not fulfilled: Is the violation on day 1,3,5,5? Test can t detect Financial Risk Forecasting 2011,2017 Jon Danielsson, page 71 of 84

Testing the SP-500 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 72 of 84

Testing S&P 500 1998 to 2009 Model Coverage test Independence test Test statistic p-value Test statistic p-value EWMA 18.1 0.00 0.00 0.96 MA 81.2 0.00 7.19 0.01 HS 24.9 0.00 4.11 0.04 GARCH 16.9 0.00 0.00 0.99 1998 to 2006 Model Coverage test Independence test Test statistic p-value Test statistic p-value EWMA 2.88 0.09 0.68 0.41 MA 6.15 0.01 2.62 0.11 HS 0.05 0.82 1.52 0.22 GARCH 1.17 0.28 0.99 0.32 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 73 of 84

Joint test We can jointly test LR(joint) = LR(coverage)+LR(independence) χ 2 (2) The joint test has less power to reject a VaR model which only satisfies one of the two properties Financial Risk Forecasting 2011,2017 Jon Danielsson, page 74 of 84

Expected Shortfall Backtesting Financial Risk Forecasting 2011,2017 Jon Danielsson, page 75 of 84

It is harder to backtest expected shortfall ES than VaR because we are testing an expectation rather than a single quantile We know if VaR is violated, but cannot know that for ES There exists a simple methodology for backtesting ES that is analogous to the use of violation ratios for VaR For days when VaR is violated, normalized shortfall NS is calculated as: NS t = y t ES t where ES t is the observed ES on day t Financial Risk Forecasting 2011,2017 Jon Danielsson, page 76 of 84

From the definition of ES, the expected Y t given VaR is violated, is: E[Y t Y t < VaR t ] ES t = 1 Therefore, average NS, NS, should be one H 0 : NS = 1 Financial Risk Forecasting 2011,2017 Jon Danielsson, page 77 of 84

The reliability of any ES backtest procedure is much lower than that of VaR With ES, we are testing whether the mean of returns on days when VaR is violated is the same as average ES on these days. Much harder to create formal tests to ascertain whether normalized ES equals one or not than the coverage tests developed above for VaR violations Hence, backtesting ES requires many more observations than backtesting VaR In instances where ES is obtained directly from VaR, and gives the same signal as VaR (i.e., when VaR is subadditive), it is better to simply use VaR Financial Risk Forecasting 2011,2017 Jon Danielsson, page 78 of 84

Problems with Backtesting Financial Risk Forecasting 2011,2017 Jon Danielsson, page 79 of 84

Backtesting assumes that there have been no structural breaks in the data throughout the sample period: But financial markets are continually evolving, and new technologies, assets, markets and institutions affect the statistical properties of market prices Unlikely that the statistical properties of market data in the 1990s are the same as today, implying that a risk model that worked well then might not work well today Financial Risk Forecasting 2011,2017 Jon Danielsson, page 80 of 84

Data mining and intellectual integrity: Backtesting is only statistically valid if we have no ex ante knowledge of the data in the testing window If we iterate the process, continually refining the risk model with the same test data and thus learning about the events in the testing window, the model will be fitted to those particular outcomes, violating underlying statistical assumptions Financial Risk Forecasting 2011,2017 Jon Danielsson, page 81 of 84

Stresstesting Financial Risk Forecasting 2011,2017 Jon Danielsson, page 82 of 84

Stresstesting Create artificial market outcomes To see how risk management systems and risk models cope with the artificial event Assess the ability of a bank to survive a large shock The main aim is to come up with scenarios that are not well represented in recent historical data but are nonetheless possible and detrimental to portfolio performance Financial Risk Forecasting 2011,2017 Jon Danielsson, page 83 of 84

Examples of historical scenarios Scenario Period Stock market crash October 1987 ERM crisis September 1992 Bond market crash April 1994 Asian currency crisis Summer 1997 LTCM and Russia crisis August 1998 Global crisis 2007-2009 Eurozone crisis Since 2010 Brexit 2017 Two types: Shocks that have never occurred or are more likely to occur than historical data suggest Shocks that reflect permanent or temporary structural breakswhere historical relationships do not hold Financial Risk Forecasting 2011,2017 Jon Danielsson, page 84 of 84