Journal of Operational Risk
ISSN:
1744-6740 (print)
1755-2710 (online)
Editor-in-chief: Marcelo Cruz
Need to know
- We use RMT to fit high-dimensional t-copulas
- We introduce bias into the Kendall canonical maximum likelihood estimator of the degree of freedom of a t-copula
- We use simulation studies and an example of operational risk modeling to show the necessity and the benefit of using RMT to fit high-dimensional t-copulas in risk modeling
Abstract
ABSTRACT
In risk management, t-copulas are used to model dependencies beyond Gaussian copulas, as they take into account tail dependencies. A t-copula has two parameters: the correlation matrix and the degree of freedom; they are usually estimated by maximizing the likelihood function of the observations. In risk modeling, the dimension of the copula is often high, making the maximization intractable, as the number of pairwise correlations to estimate is too high. McNeil, Frey and Embrechts suggested a procedure that consists of using a correlation matrix estimated through Kendall's rank correlation matrix estimate, likely transformed to be positive definite, as an input in the likelihood function to deduce a value for the degree of freedom. We show the bias of this degree of freedom's estimator due to the noise in the correlation matrix estimate. We address this problem using a random-matrix-theory-based denoising technique to improve the correlation estimate. On simulation studies, we show how this improved procedure gives an estimator of the degree of freedom of t-copulas with no bias and a smaller variance. Finally, we fit a t-copula on real operational risk data in order to illustrate the necessity and the benefit of this procedure.
Introduction
Risk modelers often want to model dependencies in order to link several risks together by taking their dependence relationships into account. Copulas are today widely used to do this. Of these, the Gaussian copula is often preferred for its simplicity. Nonetheless, it does not take into account the so-called tail dependencies, that is, the dependence on extreme events. -copulas, on the other hand, have strictly positive tail dependencies, and thus can be considered as an interesting substitute. These require the fitting of two parameters: the correlation matrix and the degree of freedom.
McNeil et al (2005) suggested a procedure to estimate these two parameters, which provides better results than a simple maximization of the likelihood function when the dimension of the copula is high. It consists of estimating the correlation matrix separately (using the rank-based measure Kendall’s and the properties of elliptical copulas) and using this estimate as an input in the likelihood function to estimate the degree of freedom by maximizing it.
In this paper, we introduce a procedure derived from McNeil et al’s, which gives better empirical results even when the number of observations is not high in comparison with the dimension of the copula. We recall theoretical results about -copulas in Section 2. In Section 3, we introduce the three-stage procedure suggested by McNeil et al, and show the problems that may arise when using it and how to circumvent these using the random matrix theory (RMT)’s denoising technique. Section 4 applies the improved procedure to simulated data sets to observe how the bias of the degree of freedom’s estimator is reduced and how it is less variant. In Section 5, we give an example of fitting -copulas in operational risk modeling. Section 6 concludes.
2 Copulas
Elliptical copulas are simply the copulas of elliptically contoured distributions, eg, the Gaussian and -distributions. A key advantage of elliptical copulas is that they can model heterogeneous levels of correlation between the marginal distributions. These two copulas are widely used in finance or risk modeling.
In this section, we will introduce some basic concepts related to copulas and properties of -copulas.
2.1 Basic concepts related to copulas
2.1.1 The copula
A copula is a multivariate distribution function with margins that are uniformly distributed on . It was defined by Sklar (1959) as
where is the copula; , with for all , is a vector of random variables; and are realizations of .
2.1.2 Sklar’s theorem
Theorem 2.1.
Let be a joint distribution function with margins . Then, there exists a copula such that, for all ,
If the margins are continuous, then is unique. We define the density of a copula, , as
and we have
According to the above theorem, the risk modeling process with copulas consists of two steps. First, we must determine the marginal distribution of each single risk component . Second, we need to model the dependence structure between these risk components via the copula function, .
2.2 -copulas
A classic copula often used in risk management is the -copula. This is defined by
where is the distribution function of a standard univariate -distribution with degree of freedom , and denotes the joint distribution function of a multivariate Student distribution .
The density of a -copula can be written as
3 Estimation of parameters and of a -copula
Given a -copula , we have to estimate parameters for the correlation matrix and one parameter for the degree of freedom . When the data is low dimensional, we may use the maximum likelihood method to estimate these parameters. However, the optimization complexity with high-dimensional data is very difficult to handle, as it is hard to simultaneously optimize parameters. McNeil et al (2005) proposed a semi-parametric estimation method based on Kendall’s for high-dimensional -copulas. In order to simplify the optimization complexity, they first estimate the correlation matrix of the -copula via Kendall’s and then maximize the likelihood function, taking as an input to estimate the degree of freedom, .
3.1 Kendall’s canonical maximum likelihood estimator
Fantazzini (2010) talks about the “three-stage procedure”, referring to the semi-parametric estimation method proposed by McNeil et al (2005), as follows.
Let be a continuous random vector with margins .
Stage 1: transform the data set
In order to compute the likelihood function of a -copula, we transform the data set ( independent copies of ) into uniform variables
using the empirical distribution function , which is defined as
Stage 2: PP-Kendall correlation matrix estimator for
requires not only the specification of the estimate of the correlation matrix , but also the degree of freedom, . In this case, we cannot build an estimator with , as we do not know the value of . McNeil et al (2005) introduced a robust estimator of named the “pseudo-Pearson Kendall (PP-Kendall) correlation matrix estimator”, using the relationship between Kendall’s and Pearson’s in the case of elliptical copulas. When the variables with marginal distribution functions are connected by an elliptical copula with correlation matrix , Lindskog et al (2002) demonstrated the following relationship between the Kendall and Pearson coefficients:
As Kendall’s is a rank correlation coefficient, the PP-Kendall correlation matrix estimator is invariant to monotonic transformations:
Let denote the PP-Kendall correlation matrix estimator of .
Stage 3: Kendall canonical maximum likelihood estimator for
We look for the canonical maximum likelihood estimator of the degree of freedom by maximizing the loglikelihood function of the -copula density:
We will call this the “Kendall CML” method.
3.2 The correlation matrix estimate may not be positive semi-definite
The formulas in the second stage of Section 3.1 connect all the Kendall’s of the sample pairwise to their corresponding Pearson’s . Strictly speaking, the relation between Kendall’s and Pearson’s correlation found by Lindskog et al (2002) is valid for the bivariate case, but in practice we can build a correlation matrix from Kendall’s bivariate estimates when the dimension of the matrix is not too high. However, in the case of high-dimensional data, is a matrix close to , whose entries contain some small residual “error”; this becomes relevant when such small errors are accumulated, making the deduced correlation matrix unreliable.
In addition, there is no guarantee that this componentwise transformation of the empirical Kendall’s matrix is positive semi-definite. Rousseeuw and Molenberghs (1993) proposed a simple technique to adjust to obtain a positive definite matrix. They first compute the eigenvalues and the eigenvectors of the correlation matrix calculated from Kendall’s . Then, they replace the negative eigenvalues by arbitrarily chosen slightly positive ones () and keep the eigenvectors unchanged. They rebuild the matrix with the new eigenvalues and the old eigenvectors. As the diagonal elements of the computed matrix will not necessarily be equal to , Rousseeuw and Molenberghs consider the computed matrix as a covariance matrix and transform it into its corresponding correlation matrix. They do not propose a slightly positive value for to use in the procedure. McNeil et al (2005) chose to estimate the -copula parameters. Considering the matrix itself, the choice of will not greatly affect the results of the PP-Kendall correlation matrix estimator. But, the spectrum of its matrix inverse, which appears in the density formula of a -copula, is much more affected by the choice of (indeed, is the highest eigenvalue of the inverse), and this choice induces bias when estimating the degree of freedom using the Kendall CML method.
An algorithm proposed by Higham (2002) can adjust to obtain its “nearest” positive definite matrix. Here, the distance between matrixes is measured by the Frobenius norm. Higham (2002) uses the modified alternating projections method to compute the solution: a point in the intersection of two convex sets,
and | ||||
This technique finds a matrix that minimizes the Frobenius norm, but the smallest eigenvalue of this matrix may differ from the true value of the correlation matrix. Thus, this method will also bias the parameter’s estimation of -copulas.
We now present two simple examples to illustrate the existence of noise and the disadvantages of the Rousseeuw–Molenberghs and Higham methods.
No denoising | |
---|---|
Bias of | 1.10 |
Variance of | 0.006 |
Example 3.1 (Existence of noise).
We simulate 500 data sets of -copulas with a uniform correlation matrix:
and we set the dimension of the data set to , the sample size of the data set to , the degree of freedom of the -copula to and the coefficient of the uniform correlation matrix to . Then, we estimate the value of the degree of freedom of the -copula by using the CML estimator. In Figure 1, we provide the results of the estimation for 500 simulated data sets. In Table 1, we give the values of the empirical bias and empirical variance of the estimator .
According to these results, the Kendall CML estimator is clearly not an unbiased estimator of the degree of freedom for a high-dimensional -copula. In this configuration ( is quite large), the PP-Kendall correlation matrix estimator of is almost always positive definite, so the bias of the estimation is due to the “noise” in the data set, and not due to an arbitrary technique to make a nonpositive definite correlation matrix positive definite.
Example 3.2 (Bias of estimation).
We again simulate 500 data sets of -copulas with a uniform correlation matrix
and this time we choose , , and .
In this configuration ( is not very large), the PP-Kendall correlation matrix estimator is almost never positive definite. Given a data set, the method we use to find a positive definite correlation matrix will greatly impact the results of the estimation.
3.3 Denoising the correlation matrix to ensure a better estimate of high-dimensional -copulas
Because of the two sources of noise we exhibited in the previous section, we decide to denoise the PP-Kendall correlation matrix estimator.
3.3.1 Principle of the denoising technique
Rousseeuw and | ||||
Molenberghs | ||||
Higham | ||||
Bias of | 9.10 | 2.37 | 0.67 | 1.01 |
Variance of | 0.215 | 0.018 | 0.004 | 0.009 |
Laloux et al (2000) introduced the denoising technique to improve the estimation of the correlation matrix. This technique is based on results proved by the Marčenko–Pastur theorem in RMT, which provides a criterion to differentiate estimated eigenvalues of an empirical correlation matrix that carry a signal from others that might only be noise. In practice, it is necessary to compare the law of eigenvalues of an empirical matrix to the law of eigenvalues of a correlation matrix obtained from purely random uncorrelated variables in the same configuration (dimension and sample size).
3.3.2 Marčenko–Pastur theorem and the denoising technique
We state the Marčenko–Pastur law as follows.
Let denote an random matrix
whose entries are independent and identically distributed random variables with mean and variance , ie,
Let
and let be the eigenvalues of . We define the random spectral measure of by
Finally, when and the ratio , we obtain
where is a deterministic measure whose density is given by
with
Note that the support of Marčenko–Pastur law is bounded. The interval is called the “bulk”, and this gets narrower as increases. In the case when , the bulk is limited to and the Marčenko–Pastur law is a Dirac mass in .
The support of Marčenko–Pastur law, ie, the bulk, appears numerically to be the support of estimated eigenvalues, even in a non-asymptotic framework (Crenin et al 2015). From this, we see that eigenvalues that are out of the bulk can be supposed to carry a signal when others may not. According to Laloux et al (2000) and Crenin et al (2015), for finite sample sizes and finite dimensions there is a small probability of finding eigenvalues above and below when data sets are uncorrelated, and this probability naturally tends to as grows.
To clean an empirical correlation matrix, we would do the following:
- (1)
estimate the correlation matrix and compute the eigenvalues and eigenvectors;
- (2)
if necessary, apply Rousseeuw and Molenberghs’s technique with a slightly positive value to make it positive definite;
- (3)
keep the estimated eigenvalues that are higher than the upper bound of the bulk unchanged, and substitute the others by their mean to keep the trace unchanged;
- (4)
renormalize the matrix so that it is a correlation matrix (with on the diagonal).
3.3.3 The benefit of using the denoising technique to fit -copulas
There are two advantages when using this technique to clean the correlation matrix estimator.
- (1)
It provides a criterion to differentiate signal and noise in the empirical correlation matrix, which depends only on the sample size and the dimension.
- (2)
It is not greatly affected by the arbitrary choice of , as we replace all the eigenvalues lower than the upper bound of the bulk by their mean, which does not differ a lot for different choices of .
In addition, Crenin et al (2015) shows that this theory can still be applied when the dimension of the data set and the sample size are both not very large ( and ).
4 Simulation studies
In this section, we present some numerical examples to illustrate that the denoising technique can improve the estimate of the parameters of a -copula using the Kendall CML method. The denoising technique performs well in many cases.
Configuration 1: ,
In this configuration, the dimension of the data set, , and the sample size, , are both large, and their ratio, , is also large. As and are very large, there are rarely negative eigenvalues in the PP-Kendall correlation matrix estimator , and each element of is close to the theoretical one.
Configuration 2: ,
In this configuration, the dimension of the data set, , and the sample size, , are both large, but the ratio is not so large. As is small, there are almost always negative eigenvalues in the PP-Kendall correlation matrix estimator .
Configurations 1 and 2 are often faced in portfolio management, market risk and credit risk.
Configuration 3: ,
In this configuration, the dimension of the data set, , the sample size, , and the ratio are all not large.
We want to illustrate that the denoising technique can simultaneously clean the noise and the artefact due to the arbitrary choice of in Rousseeuw and Molenberghs’s technique.
Configuration 3 is met in operational risk or many cases of risk aggregation.
4.1 Simulation studies with correlation matrix depending on factors
Before simulating the data set whose dependence structure is modeled by a -copula, we should first build a theoretical correlation matrix. In the financial field, we often use common factors to explain market phenomena and the equilibrium of asset prices. We decided to build such a correlation matrix with the multifactor model.
Assume that random variables are modeled by common and idiosyncratic factors. Each random variable , , is thus modeled as
(4.1) |
where , , , are the factors assumed to be independent, with and . is the sensitivity of factor to factor , and is the idiosyncratic factor, such that
In this modeling, the theoretical correlation matrix depends on the factor sensitivities and is given by the following:
for all , .
4.1.1 Numerical results
No denoising | Denoising | |
---|---|---|
Bias of | 1.05 | 0.01 |
Variance of | 0.005 | 0.002 |
Here, we only give the results of the case with a one-factor correlation matrix. To do so, we assume that
The correlation between and is thus equal to
In order to have appropriate correlation coefficients, we simulate for all , and we compute with the above formulas. In this case, we will get a theoretical one-factor correlation matrix.
In each configuration, we simulate 500 data sets with the dimension of the data set equal to , the sample size equal to of a -copula with theoretical correlation matrix and a degree of freedom of . Then, we estimate the value of the degree of freedom of a -copula using the CML estimator. We will compare the results of the estimation without the denoising technique with those of the estimation with the denoising technique.
Configuration 1: ,
As we saw in Section 3.2, the CML estimator is a biased estimator of the degree of freedom of the -copula. In this configuration, there is no arbitrary choice of a slight positive value, and the bias is just the accumulated effect of noise in the correlation matrix estimator. From Figure 3, we can see that the CML estimator observed after denoising is a robust estimator of the degree of freedom of the -copula. Table 3 shows the empirical bias and the empirical variance of estimated on the 500 simulated data sets. The denoising technique based on RMT reduces not only the bias but also the variance’s estimator.
Configuration 2: ,
In this configuration, the impact of the method chosen to adjust the empirical correlation matrix is huge. Higham’s method cannot give an unbiased estimation of the degree of freedom of the -copula, and we cannot find any slightly positive value that should be used in the Rousseeuw–Molenberghs method, as this would change in another configuration. From Figure 4 and Table 4, we can see that the denoising technique performs very well in this configuration.
Rousseeuw and | |||||
Molenberghs | |||||
Higham | Denoising | ||||
Bias of | 9.48 | 2.52 | 0.59 | 0.92 | 0.06 |
Variance of | 0.242 | 0.019 | 0.004 | 0.008 | 0.008 |
Configuration 3: ,
In this extreme configuration, which often appears in operational risk modeling, both the dimension and the number of observations are relatively small. We do the same test and always notice the benefits from using RMT to fit a -copula. In this case, the classical method cannot give a robust estimation of the degree of freedom for a -copula. Meanwhile, after applying the denoising technique, the noise in the estimation is much reduced (see Figure 5 and Table 5).
No denoising | Denoising | |
---|---|---|
Bias of | 5.60 | 0.22 |
Variance of | 12.839 | 0.651 |
4.2 Simulation studies with correlation matrix having specified eigenvalues
No denoising | Denoising | |
---|---|---|
Bias of | 1.23 | 0.19 |
Variance of | 0.006 | 0.003 |
In this section, we want the theoretical correlation matrix to have specified eigenvalues. We use the method proposed by Marsaglia et al (1984) and Jones (2010) to generate a theoretical correlation matrix of dimension with the following configuration:
Unlike the correlation matrix generated by several factors in the previous section, the structure of the above correlation matrix is more sophisticated, and has more information lying in the bulk. We then simulate the data set and conduct the experiment introduced in Section 4.1.
4.2.1 Numerical results
Rousseeuw and | ||||
Molenberghs | ||||
Denoising | ||||
Bias of | 8.97 | 2.48 | 0.56 | 0.30 |
Variance of | 0.145 | 0.014 | 0.003 | 0.009 |
No denoising | Denoising | |
---|---|---|
Bias of | 5.46 | 0.29 |
Variance of | 14.765 | 0.690 |
- •
- •
- •
As in the previous section, Figures 6–8 show that the denoising technique also performs very well in the case of a correlation matrix with specified eigenvalues. In each configuration of risk modeling that we have tested, the denoising technique reduced the bias and provided a more robust estimator of the degree of freedom of -copulas. Further, from Figures 6 and 7, we see that, after applying the denoising technique on simulation studies, the CML estimator seems to be slightly biased, and thus demonstrates the limit of the suggested technique, which might ignore some small true data lying in the bulk.
5 Empirical results in operational risk modeling
5.1 The data set
The consultative paper from European Banking Authority (2014) provides guidance on dependence modeling in operational risk and suggests that banks use -copulas. Thus, the estimation of the degree of freedom of -copulas has become the central issue in operational risk modeling. Our study is based on losses occurring in numerous financial institutions in Western Europe between 2002 and 2013 that were collected by the Operational Riskdata eXchange Association (ORX), a leading banking consortium. Actually, the localization proved to be discriminant in terms of risk profiles (see Cope and Labbi 2008), and thus we decided to reduce the perimeter provided by ORX. For that reason, we propose modeling operational risk on a bank that would perform only retail activities, as in Crenin et al (2015). Risks are consequently modeled on the following five business lines for forty-eight quarters:
- (1)
retail banking;
- (2)
commercial banking;
- (3)
payment and settlement;
- (4)
agency services;
- (5)
retail brokerage.
The main idea here is to illustrate the application of RMT in operational risk when fitting -copulas. Consequently, the following results rely on thirty-five units of measure ( risk types as defined by Basel III’s advanced measurement approach model) and forty-eight quarters, leading to .
5.2 RMT application
5.2.1 Estimation of the degree of freedom of -copulas without applying the denoising technique
Rousseeuw and | ||||
Molenberghs | ||||
Higham | ||||
(no denoising) | 5.51 | 3.08 | 1.93 | 2.34 |
As explained in Crenin et al (2015) and in Section 3.1, we first transform our observations into pseudo-observations in order to estimate the Kendall correlations. These are then transformed into Pearson correlations using the formula that links the two parameters, leading to what are called PP-Kendall correlations. When necessary, the estimated correlation matrix is made positive definite using the method suggested by Rousseeuw and Molenberghs (1993), with an arbitrarily chosen , or using the method of Higham (2002). Eventually, we estimate the degree of freedom of the -copula by maximization of the likelihood function. Table 9 shows the impact of the choice of method on the estimation of the degree of freedom of the -copula. The results of estimation of are very sensitive to the method or the value of used to make the estimated correlation matrix positive definite.
5.2.2 Estimation of the degree of freedom of -copulas after applying the denoising technique
In this section, we will show the result of the estimation of the degree of freedom of a -copula after applying the denoising technique. As explained in Section 3.3, we apply the denoising technique to clean the empirical correlation matrix, and then estimate the degree of freedom of the -copulas by maximizing the likelihood function.
Rousseeuw and | ||||
Molenberghs | ||||
Higham | ||||
(denoising) | 9.93 | 9.93 | 9.93 | 9.97 |
Table 10 shows that the results of the estimation of after applying the denoising technique are stable with the method used to make the estimated correlation matrix positive definite. The difference between Tables 9 and 10 suggests that the denoising technique is very necessary in order to get a robust estimation of the degree of freedom of -copulas.
6 Conclusion
We introduced bias into the Kendall canonical maximum likelihood estimator of the degree of freedom of a -copula and demonstrated a technique that may clean the noise from the estimated correlation matrix to give a more robust estimator of the degree of freedom.
We also showed, using simulation studies and an example of operational risk modeling, the necessity and the benefit of using RMT to fit high-dimensional -copulas in risk modeling, even when the number of observations is small. More studies are still to be conducted in order to further reduce the slight remaining bias, by identifying all the true information contained in sophisticated correlation matrixes.
Declaration of interest
The views expressed in this paper are solely the responsibility of the authors and should not be interpreted as reflecting the official positions of Société Générale.
Acknowledgements
We are grateful to Vivien Brunel, who gave us the idea of applying RMT to operational risk. We are also grateful to the anonymous reviewer, who provided very helpful comments. The usual disclaimer nonetheless applies, and all errors remain ours.
References
Cope, E., and Labbi, A. (2008). Operational loss scaling by exposure indicators: evidence from the ORX database. Working Paper, ORX Analytical Working Group.
Crenin, F., Cressey, D., Lavaud, S., Xu, J., and Clauss, P. (2015). Random matrix theory applied to correlations in operational risk. The Journal of Operational Risk 10(4), 45–71 (http://doi.org/brw3).
European Banking Authority (2014). Draft regulatory technical standards on assessment methodologies for the advanced measurement approaches for operational risk under Article 312 of Regulation (EU) No 575/2013. Consultation Paper, EBA.
Fantazzini, D. (2010). Three-stage semi-parametric estimation of -copulas: asymptotics, finite-samples, properties and computational aspects. Computational Statistics and Data Analysis 54(11), 2562–2579 (http://doi.org/bbqbq5).
Higham, N. J. (2005). Computing the nearest correlation matrix: a problem from finance. IMA Journal of Numerical Analysis 22, 329–343 (http://doi.org/d8gj5f).
Jones, J. A. (2010). GenCorr: an R routine to generate correlation matrices from a user-defined eigenvalue structure. Applied Psychological Measurement 34, 68–69 (http://doi.org/bhjb5s).
Laloux, L., Cizeau, P., Potters, M., and Bouchaud, J.-P. (2000). Random matrix theory and financial correlations. International Journal of Theoretical and Applied Finance 3(3), 391–397 (http://doi.org/dd7skx).
Lindskog, F., McNeil, A. J., and Schmock, U. (2002). Kendall’s tau for elliptical distributions. In Credit Risk: Measurement, Evaluation and Management, pp. 149–156. Contributions to Economics. Physica, Heidelberg.
Marsaglia, G., and Olkin, I. (1984). Generating correlation matrices. SIAM Journal on Scientific and Statistical Computing 5, 470–475 (http://doi.org/bgv7hw).
McNeil, A. J., Frey, R., and Embrechts, P. (2005). Quantitative Risk Management: Concepts, Techniques and Tools. Princeton University Press.
Rousseeuw, P., and Molenberghs, G. (1993). Transformation of non positive semidefinite correlation matrices. Communications in Statistics: Theory and Methods 22(4), 965–984 (http://doi.org/c5z7kh).
Sklar, A. (1959). Fonctions de répartition à dimensions et leurs marges. Publications de l’Institut de Statistique de l’Université de Paris 8, 229–231.
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@risk.net or view our subscription options here: http://subscriptions.risk.net/subscribe
You are currently unable to print this content. Please contact info@risk.net to find out more.
You are currently unable to copy this content. Please contact info@risk.net to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@risk.net
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@risk.net