Добро пожаловать в Scribd!
Академический Документы
Профессиональный Документы
Культура Документы
Хобби и ремесла Документы
Личностный рост Документы
1, that is, p>n/(n+1). Otherwise, for the case that 1 n 1 > rg. The probability b,(n,p) can be easily calculated by the recursive relation n-k p bear (n,p) = bx (np) eT T 7’ k=0,1,...,.n-1. In what follows, the cumulative probability and the survival probability of the binomial distribution B(n, p) are denoted, respectively, by k a Be(n,p) = S>6;(n,7), Bu(n,p) = > 4;(n,p); = 0,1,...,n. jms ahOTHER DISCRETE DISTRIBUTIONS 45 Since the binomial probability is symmetric in the sense that be(n,p) = br—n(n, 1 —p), we obtain the symmetry result Bi(n,p) = Br-e(nl—p), &=0,1,...,n. (3.4) In particular, because of this relationship, 64(n,1/2) is symmetric around k = n/2. The following approximation due to Peizer and Pratt (1968) is useful in practice: Bump) ~ Ov) =e [eae I where Kl — np + (1 2p)/6 [k’ — npl O44 2 Nog Bf oe 0. From Taylor’s expansion (1.17), that is, we obtain 37% P{X =n} = 1, hence (3.8) defines a probability distribution. The MGF of X ~ Poi(2) is given by m(t)=exp{A(e"—1)}, te R. (3.9) ‘The mean and variance of X ~ Poi(A) are obtained as E[X] = V[X] =. The proof is left to the reader. ‘The Poisson distribution is characterized as a limit of binomial distributions To see this, consider a binomial random variable X,, ~ B(n, \/n), where \ > 0 and n is sufficiently large. From (3.6), the MGF of X, is given by ma(t) = frve-v2]”, teR. Using (1.3), we then have lim, mn(t) = exp {A(e"- 1}, te R, which is the MGF (3.9) of a Poisson random variable X ~ Poi(A). It follows from Proposition 2.12 that X,, converges in law to the Poisson random variable X ~ Poi(d). Let X1 ~ Poi(A;) and X2 ~ Poi(Ag), and suppose that they are indepen- dent. Then, from (2.11) and the independence, we obtain P(X, = k}P{X2 = n— k} P(X + Xz =n} P(X, = |X, + Xp =n} = Since X; + Xz ~ Poi(A: + Az) from the result of Exercise 3.3, simple algebra shows that k me nce (= ) 5 ; Mt) aXe Hence, the random variable X; conditional on the event {X1+X2 =n} follows the binomial distribution B(n,1/(A1 + »2)) P(X, = bIXa + XpOTHER DISCRETE DISTRIBUTIONS 47 3.2.2 Geometric Distributions A discrete random variable X is said to follow a geometric distribution with parameter p (denoted by X ~ Geo(p) for short) if the set of realizations is {0,1,2,...} and the probability is given by P{X =n} =pll-p)", ‘The geometric distribution is interpreted as the number of failures until the first success, that is, the waiting time until the first success, in Bernoulli trials with success probability p. It is readily seen that =0,1,2,.... (3.10) i=p E[X] = oo vidd= = ‘The proof is left to the reader Geometric distributions satisfy the memoryless property. That is, let X ~ Geo(p). Then, P{X —n>m|X>n}=P{X>m}, mn 0,1,2,-.+ (3.11) ‘To see this, observe that the survival probability is given by P{X >m}=(1-p)™, m=0,1,2,... The left-hand side in (3.11) is equal to P{X >n+m} P(X Sn} ” hence the result. The memoryless property (3.11) states that, given X > n, the survival probability is independent of the past. This property is intuitively obvious because of the independence in the Bernoulli trials. That is, the event {X > n} in the Bernoulli trials is equivalent to saying that there is no success until the (n — 1)th trial. But, since each trial is performed independently, the first success after the nth trial should be independent of the past. It is noted that geometric distributions are the only discrete distributions that possess the memoryless property. P(X >n+ml|X >n} Example 3.2 In this example, we construct a default model of a corporate firm using Bernoulli trials. A general default model in the discrete-time setting will be discussed in Chapter 9. Let ¥, Y2,... be a sequence of IID Bernoulli random variables with parameter p. That is, PLY) =1}=1-P{¥%, =O} =p, n=1,2,.... Using these random variables, we define N = min{n: Yn =0}, and interpret NV as the default time epoch of the firm (see Figure 3.1). Note that N is an integer-valued random variable and satisfies the relationship {N>t}={Ya=l1forallnt}=p', t=1,2,..., hence (V ~1) follows a geometric distribution Geo(p). The default time epoch constructed in this way has the memoryless property, which is obviously coun- terintuitive in practice. However, by allowing the random variables Y, to be dependent in some ways, we can construct a more realistic default model. The details will be described in Chapter 9. 3.3 Normal and Log-Normal Distributions A central role in the finance literature is played by normal distributions. This section is devoted to describe a concise summary of normal distributions with emphasis on applications to finance. ‘A continuous random variable X is said to be normally distributed with mean p and variance a? if the density function of X is given by on {-S52"}, 2eR. (3.12) f( If this is the case, we denote X ~ N(,0?). In particular, N(0,1) is called the standard normal distribution, and its density function is given by 4 1 2, cer (3.13) Vin A significance of the normal density is the symmetry about the mean and its bell-like shape. The density functions of normal distributions with various variances are depicted in Figure 3.2. The distribution function of the standard normal distribution is given by aa)= fe PPat, «eR. (3.14) Note that the distribution function &(x) does not have any analytical expres- sion.NORMAL AND LOG-NORMAL DISTRIBUTIONS 49 Figure 3.2 The density functions of normal distribution (01 0, But, since X ~ N(, 07), the standardization (3.15) yields y= 0(PEE#), yo. o It follows that the density function is given by L _ (logy ~ 4)? } Fy) Yinoy ox { neg o7 alee) lick (3.18) ‘The proof is left to the reader. The nth moment of ¥ can be obtained from (3.17), since yr =e", n=1,2,. It follows that 2, EY” exp {nus 7S }. n=1,2,....OTHER CONTINUOUS DISTRIBUTIONS st It is noted that, although Y has the moments of any order, the MGF of a log-normal distribution does not exist. This is so, because for any h > 0 any? REY”) AY) = - 3 Ele 1-2/5 v ]-© ep (3.19) ‘The proof is left in Exercise 3.11. Next, suppose that X ~ N(j4,07), and let 6 # 0 be such that m(6) = E [e’*] = 1. It is easily seen that @ = —2u/c? in this particular case. Denote the density function of X by f(a), and define f'(x)=e* f(x), ce R. (3.20) ‘The MGF associated with f*(x) is obtained as m*(t) =/ ef f*(a)de -/ o(9* f(a)de = m(0 +t) for any t € R. Since ft(z) > 0 and m*(0) = m() = 1, the function f*(x) defines a density function. Also, when 0 = —2y/07, we have from (3.17) that m() =e", teR, which is the MGF of normal distribution N(—, 0). Since the transformation (3.20) is useful in the Black-Scholes formula (see, for example, Gerber and Shiu [1994] for details), we summarize the result as a theorem. Theorem 3.1 Suppose that X ~ N(—?/2, w), and denote its density func- tion by f(x). Then, the transformation f*(@) defines the density function of the normal distribution N (w?/2, ¥7). ef(a), teR, 3.4 Other Continuous Distributions This section introduces some useful, continuous distributions in financial en- gineering other than normal and log-normal distributions. See Exercises 3.13 3.15 for other important continuous distributions. 8.4.1 Chi-Square Distributions ‘A continuous random variable X is said to follow a chi-square distribution with n degrees of freedom (denoted by X ~ x2 for short) if its realization is in (0,00) and the density function is given by = B12, Pal) = seraRtm/ay len#/2, >,52 USEFUL DISTRIBUTIONS IN FINANCE, where P(«) denotes the gamma function defined by ut le“du, 2 > 0. (3.21) We note that T'(1) = 1, P(1/2) @, and P(a) = (a — L)P(a— 1). Hence, T(n) = (n — 1)! for positive integer n. The proof is left to the reader. The chi-square distribution x2 is interpreted as follows. Let X; ~ N(0,1), i=1,2,...,n, and suppose that they are independent. Then, it can be shown that XP4XP4--4+X2 ~ xh. (3.22) Hence, if X ~ x2, then E[X] =n by the linearity of expectation (see Propo- sition 2.3). Also, we have V[X] = 2n since, from Proposition 2.4, VIX] = V [9] +--+ V [XQ] =n (E [<9] —E? [X2]) and E[X4] = 3 from the result in Exercise 3.8. Note from the interpretation of (3.22) that if X ~ x2, ¥ ~ x2, and they are independent, then we have X+V~ xB am ‘The importance of chi-square distributions is apparent in statistical infer- ence. Let X,X2,...,X%n denote independent samples from a normal popula- tion NV(u,0?). The sample mean is defined as Xi+Xa+--+Xn X= , 7 while the sample variance is given by gee (% ~X)? + (Xo —- XP +--+ (Kn - KX)? n-1 ‘Then, it is well known that < o in ~ 1)S? X~N C9) , Gps Xho and they are independent. For independent random variables X; ~ N(u,0?), i= 1,2,...,n, define ‘The density function of X is given by fetes, , 662) =o PS FPN le), =P, 308) fea where ¥,(z) is the density function of x2. The distribution of X is called the non-central, chi-square distribution with n degrees of freedom and non- centrality 5. When 6 = 0, it coincides with the ordinary chi-square distribu- tion. The distribution is a Poisson mixture of chi-square distributions. The degree of freedom n. can be extended to a positive number.OTHER CONTINUOUS DISTRIBUTIONS, 53 3.4.2 Student t Distributions ‘A continuous random variable X is said to follow a ¢ distribution with n degrees of freedom (denoted by X ~ t,, for short) if its realization is in FR and the density function is given by _ U(r +1)/2) aay ern J) = ray (1+2) , 2eR ‘The density function is symmetric at x = 0. The ¢ distribution t, is interpreted as follows. Let X ~ N(0,1), Y ~ x3, and suppose that they are independent. Then, it can be shown that Z = x ~ ty. In general, the moment E(Z*] does not exist for k > n. Hence, V¥/n the moment generating function of ¢ distributions does not exist. However, it is well known that ¢ distributions with n > 30 are well approximated by the standard normal distribution. Also, ty is called a Cauchy distribution. That is, when X, ¥Y ~ N(0,1) and they are independent, both X/Y and Y/X follow the Cauchy distribution. The distribution function of the standard Cauchy distribution is given by 1 F(z) = + = arctan, ze R. Note that the mean does not exist for Cauchy distributions. Importance of ¢ distributions is apparent again in statistical inference. Let Xi,X2,...,Xn denote independent samples from a normal population N(u,0?). Then, we have where X is the sample mean and S? is the sample variance. It can be shown. that X and S? are mutually independent. Also, it is well known that ~ tnt Note that the statistic U does not depend on o”. 3.4.8 Raponential Distributions A continuous random variable X is said to follow an exponential distribution with parameter 4 > 0 (denoted by X ~ Bxp(A) for short) if its realization is on (0,00), and the density function is given by f(a) =de*, a 0. (3.24) As was used in Example 2.6, the survival probability is given by P{X>a}=e*, 220.54. USEFUL DISTRIBUTIONS IN FINANCE y=H() ° Figure 3.3 The construction of default time epoch N in continuous time. Note that E[X] = 1/A and V[X] = 1/? (see Exercise 3.13). Also, the MGF is given by a xe It is easily seen that the exponential distribution is a continuous limit of geo- metric distributions. Hence, exponential distributions share similar properties with geometric distributions. In particular, exponential distributions possess the memoryless property P{X>a+y|X>y}=P{X >a}, 2, y20, E [e*] t H(t)} Since H(z) is increasing in «, N is well defined (see Figure 3.3). As before, we interpret N as the default time epoch. Note that N’ is a positive random variable and satisfies the relation {N >t} S>H(t)}}, t20. It follows that P{N>tH}=P{S>H®}=e*, t20, hence NV is exponentially distributed with mean 1/. We shall return to a more realistic model using a general non-decreasing function H(x) (see Example 16.1 in Chapter 16). For IID random variables X; ~~ Exp(A), i= 1,2,..., define Sp = Xi t+Xot---+ Xe, K=1,2,...-OTHER CONTINUOUS DISTRIBUTIONS 85 ‘The density function of S}, is given by , 220, (3.25) f@=— which can be proved by the convolution formula. (see Exercise 3.9). This dis- tribution is called an Erlang distribution of order k with parameter \ > 0 (denoted by S_ ~ Ex(A) for short). The distribution function of Ej,(X) is given by 220. (3.26) ‘The Erlang distribution is related to a Poisson distribution through P{S, > A} = PAY k} is equivalent to the event {5x < t}, it follows from (3.26) that QU ae ae ‘That is, N(¢) follows a Poisson distribution with parameter At; see (3.8). The process {N(t), f > 0} counting the number of failures is called a Poisson process, which we shall discuss later (see Section 12.5 in Chapter 12) P{N(t) =k} = k=0,1,2,.... (3.28) 3.4.4. Uniform Distributions ‘The simplest continuous distributions are uniform distributions. A random variable X is said to follow a uniform distribution with interval (a,b) (denoted by X ~ U(a,b) for short) if the density function is given by f@)=ph, ace 3 En Onl On2 “* Onn * ‘Throughout this book, every vector is a column vector and the transpose T is used to denote a row vector.MULTIVARIATE NORMAL DISTRIBUTIONS, 87 ‘The symmetry of covariances implies that 91; = 0;:, so that the covariance matrix 5 is symmetric (in fact, it is positive semi-definite as proved below). The joint density function is given by 1 oop {Gow (m)"/2\/det(S) provided that © is positive definite, where x = (a1,22,-..,¢,)" and T de- notes the transpose. Note that the marginal distributions are normal and X;~ N(ui, 0x). If the covariance matrix is not positive definite, the random variables are linearly dependent and some special treatments are required. See, for example, Tong (1990) for details. FG = =H} ; (3.32) 2 Definition 3.1 A symmetric matrix A is said to be positive definite if, for any non-zero vector c, we have c’ Ac > 0. It is called positive semi-definite if, for any vector c, we have c’ Ac > 0. Recall that, even if X and Y are uncorrelated, they can be dependent on each other in general. However, in the case of normal distributions, uncorre- lated random variables are necessarily independent, which is one of the most striking properties of normal distributions. To prove this, suppose that the covariance matrix is diagonal, that is, o;; = 0 for i # j. Then, from (3.32), the joint density function becomes 2 { amy? } , meR, 2o% so that they are independent (see Definition 2.4). In particular, if X; ~ N(0,1), i = 1,2,...,7, and they are independent, then the joint density function is given by 2, a eR. (3.33) ryt f(x) Te iat V20 Such a random variable (X1,...,Xn) is said to follow the standard n-variate normal distribution. Example 3.4 (Gaussian Copula) Let X = (%1,.. random variable, and define USO MF (X), F=L2.00, X,) be an n-variate where &(z) is the standard normal cumulative distribution function (CDF) and F;(z) is the marginal CDF of X;. It is assumed for the sake of simplicity that F(2) is strictly increasing in z for all j. A Gaussian copula assumes that U = (U;,...,Un) follows an n-variate normal distribution with zero mean and correlation matrix ©), = (piy).t t Random variables modeled by the Gaussian copula are asymptotically independent for any correlation coefficient p, |p| < 1. See Section 2.8 in Chapter 2 for details.58 USEFUL DISTRIBUTIONS IN FINANCE, In this copula situation, the joint CDF F(#) for X is given by F(#) P(X, < a1, Xo S 82)--- Xn S On} P{U, < a1,U2 $ 2---1Un S On} where aj = ®7'{F)(0j)}. 7 = 1,2,...,2. Denote by dn), @ = (@1,--+52n)s the density function of the n-variate normal distribution with zero mean and correlation matrix Ep, (2) the density function of the univariate standard normal distribution, and f,(ss) the marginal density function of Xj. Then, the joint density function f(x) for X exists and is given by onl fe f(w) = since ®(aj) = Fj(2j)- For real numbers ¢1,¢2,-.-,n, consider the linear combination of normally distributed random variables Xi, X2,-- , Xp, that is, R= X, + e2X2t- + OnXn- (3.34) A significant property of normal distributions is that the linear combination Ris also normally distributed. Since a normal distribution is determined by the mean and the variance, it suffices to obtain the mean E(R] = +z Cob (3.35) a and the variance eae VIR) = > Yo cvouses- (3.36) Note that the variance V[R] must be non-negative, hence the covariance ma- trix 5 = (oi) is positive semi-definite by Definition 3.1. If the variance V[R] is positive for any non-zero, real numbers c;, then the covariance matrix 2 is positive definite. The positive definiteness implies that there are no redundant random variables. In other words, if the covariance matrix © is not positive definite, then there exists a linear combination such that R defined in (3.34) has zero variance, that is, R is a constant. 9.5.2 Value at Risk Consider a portfolio consisting of n securities, and denote the rate of return of security i by Xi. Let c; denote the weight of security i in the portfolio. Then, the rate of return of the portfolio is given by the linear combination (3.34), ‘To see this, let 2; be the current market value of security i, and let ¥; be its future value. The rate of return of security é is given by Xi = (Vi ~ ai)/a: Let Qo = Tit, €ts, where &; denotes the number of shares (not weight) of security , and let Q denote the random variable that represents the future portfolio value. Assuming that no rebalance will be made, the weight c: inMULTIVARIATE NORMAL DISTRIBUTIONS 59 Figure 3.4 The density function of Q- Qo and VaR. the portfolio is given by c: = &:xi/Qo. It follows that the rate of return, R= (Q—Qo)/Qo, of the portfolio is obtained as im H—) ax,, (3.37) as claimed. The mean E{R] and the variance V([R] are given by (3.35) and (3.36), respectively, no matter what the random variables X; are. Now, of interest is the potential for significant loss in the portfolio. Namely, given a probability level o, what would be the maximum loss in the portfolio? The maximum loss is called the value at risk (VaR) with confidence level 100a%, and VaR is a prominent tool for risk management. More specifically, VaR with confidence level 1000% is defined by zq > 0 that satisfies P{Q - Qo = -2a} = a and VaR’s popularity is based on aggregation of many components of market risk into the single number za. Figure 3.4 depicts the meaning of this equation. Since Q — Qo = QoR, we obtain P{QoR < ~2za} = 1 —@. (3.38) Note that the definition of VaR does not require normality. However, calcu- lation of VaR. becomes considerably simpler, if we assume that (X1,..., Xn) follows an n-variate normal distribution. Then, the rate of return R in (3.37) is normally distributed with mean (3.35) and variance (3.36). The value za satisfying (3.38) can be obtained using a table of the standard normal dis- tribution, In many standard textbooks of statistics, a table of the survival probability 12) = f 1L_ewvlay, 2>0, is given. Then, using the standardization (3.15) and symmetry of the density function (3.13) about 0, we can obtain the value zq with ease. Namely, letting60 USEFUL DISTRIBUTIONS IN FINANCE Table 3.1 Percentiles of the Standard Normal Distribution 100-a) 10 50 10 08 O1 Le 1.282 1.645 2.326 2.576 3.090 Ta = —Za/Qo, it follows from (3.38) that r-a=P{25# 7 where 4 = E[R] and o = \/V[R], hence ae (-==#) ‘Therefore, letting rq be such that L(q) = 1—«a, we obtain es Za = Qo(tar — H), as the VaR with confidence level 100a%.+ The value zq is the 100(1 — a)- percentile of the standard normal distribution (see Table 3.1). For example, if 4 = 0, then the 99% VaR is given by 2.326cQo. See Exercise 3.18 for an example of the VaR calculation. 3.6 Exercises Exercise 3.1 (Discrete Convolution) Let X and Y be non-negative, dis- crete random variables with probability distributions p¥ = P{X = i} and py = P{Y = i}, i =0,1,2,..., respectively, and suppose that they are inde- pendent. Prove that k PIX+Y =k} = Sop ph, k=01,2,... = Using this formula, prove that if X ~ B(n,p), Y ~ Be(p) and they are independent, then X + ¥Y ~ B(n +1,p). Exercise 3.2 Suppose that X ~ B(n,p), Y ~ B(m,p) and they are in- dependent. Using the MGF of binomial distributions, show that X + Y ~ B(n-+m,p). Interpret this result in terms of Bernoulli trials. Exercise 3.3 Suppose that X ~ Poi(X), ¥ ~ Poi() and they are indepen- dent. Using the MGF of Poisson distributions, show that X+Y ~ Poi(A-+11) Interpret this result in terms of Bernoulli trials. ¥ Since the risk horizon is very short, for example, 1 day or 1 week, in market risk manage- ‘ment, the mean rate of return pis often set to be zero. In this case, VaR. with confidence level 1000% is given by za0Qo.EXERCISES 61 Exercise 3.4 (Logarithmic Distribution) Suppose that a discrete ran- dom variable X has the probability distribution (l=p)" P{X =n} = » n=l... where 0 < p< 1 and a= —1/logp. Show that E [et] = leslt = ~ pet} t logp and E[X] = a(1 ~ p)/p. This distribution represents the situation that the failure probability decreases as the number of trials increases Bxercise 3.5 (Pascal Distribution) In Bernoulli trials with success prob- ability p, let T' represent the time epoch of the mth success. That is, denoting the time interval between the (i — 1)th success and the ith success by Tj, we have T = T, +T2+--++Tm. Prove that the probability function of T is given by P{T =k} = e-1Ce-mp™(1—p)™, k= m,m+1,. Exercise 3.6 (Negative Binomial Distribution) For the random vari- able T defined in Exercise 3.5, let X = T —m. Show that P{X =k} = mar-iCep™(1—p)*, k=0,1,.. Also, let p = 1 —A/n for some A > 0. Prove that X converges in law to a Poisson random variable as n + oo. Exercise 3.7 Suppose X ~ N(0,1). Then, using the MGF (3.17), show that ut+oX ~ N(u,0*). Exercise 3.8 Using the MGF (3.17), verify that E[Y] = y and V[Y] = 0? for Y ~ N(u,07). Also, show that E[X°] = 0 and E[X4] = 3 for X ~ N(0, 1). Exercise 3.9 (Convolution Formula) Let X and Y be continuous random variables with density functions fx(x) and fy(x), respectively, and suppose that they are independent. Prove that the density function of X +Y is given by feat) = [ txWivle-vey, weR Using this, prove that if X ~ N(ux,0%), ¥ ~ N(uy,o%) and they are independent, then X +Y¥ ~ N(ux + py, 7% + of). Also, prove (3.25) by an induction on k. Exercise 3.10 In contrast to Exercise 3.9, suppose that X ~ N(0,0?), but Y depends on the realization of X. Namely, suppose that Y ~ N(u(2), 02(2)) given X = 2, x € R, for some functions yu(e) and o?(z) > 0. Obtain the density function of Z =X + Y. Exercise 3.11 Equation (3.19) can be proved as follows. (1) Show that f"*" log ydy > logn! for any n = 1,2,...62 USEFUL DISTRIBUTIONS IN FINANCE (2) Let h > 0. For all sufficiently large n, show that nb nlogh+nptn?o? > f[ log y dy. 1 (3) Combining these inequalities, show that co. gnlog h4tnutn*o? eee =m. a nl Exercise 3.12 For a positive random variable X with mean ys and variance 0”, the quantity 7 = o/ is called the coefficient of variation of X. Show that n = 1 for any exponential distribution, while 7 < 1 for Exlang distributions given by (3.25). Also, prove that 1 > 1 for the distribution with the density function k F(a) = So pire", 2 30, 1 where pi, 1 > 0 and 7%, p; = 1. The mixture of exponential distributions is called a hyper-exponential distribution. Exercise 3-13 (Gamma Distribution) Suppose that a continuous random variable has the density function “te, > 0, iw=z where ['(e) is the gamma function (3.21). Prove that the MGF is given by m(t) = (au) t0. Exercise 3.15 (Extreme-Value Distribution) Consider a sequence of IID random variables X; ~ Ezxp(1/@), i =1,2,..., and define Yq = max{X1, X2,..., Xn} — Ologn. ‘The distribution function of Ya is denoted by Fy(z). Prove that 2/7" =|: Fala) = [1 er @tooer]” = [: -EXERCISES 63 so that lim Fy(2) = exp {- eve} This distribution is called a double exponential distribution and is known as one of the extreme-value distributions. Exercise 3.16 Let (X,Y) be any bivariate normal random variabie. For any function f(z) for which the following expectations exist, show that E[f(X)e-*] =E [e"Y] ELf(X - CX, YD], where C[X, Y] denotes the covariance between X and Y. Exercise 3.17 Let (X,Y) be any bivariate normal random variable. For any differentiable function g(x) for which the following expectations exist, show that Cl9(X), Y] = Ely’ (X)ICIX, Y)- When X = Y, in particular, obtain Stein’s lemma (Proposition 3.2). Exercise 3.18 Let X;, i = 1,2, denote the annual rate of return of security i, and suppose that (Xi, X2) follows the bivariate normal distribution with means i = 0.1, pa = 0.15, variances o1 = 0.12, a2 = 0.18, and correlation coefficient p = —0.4. Calculate the 95% VaR, for 5 days of the portfolio R 0.4X1 +0.6X2, where the current market value of the portfolio is $1 million.CHAPTER 4 Derivative Securities ‘The aim of this chapter is to explain typical (plain vanilla) derivative securities actually traded in the market. A derivative security (or simply a derivative) is a security whose value depends on the values of other more basic underlying variables. In recent years, derivative securities have become more important than ever in financial markets. Futures and options are actively traded in ex- change markets and swaps and many exotic options are traded outside of ex- changes, called the over-the-counter (OTC) markets, by financial institutions and their corporate clients. See, for example, Cox and Rubinstein (1985) and Hull (2000) for more detailed discussions of financial products. 4.1 The Money-Market Account Consider a bank deposit with initial principal 7 = 1. The amount of the deposit after ¢ periods is denoted by By. The interest paid for period ¢ is equal to Beyi — By. If the interest is proportional to the amount By, it is called the compounded interest. That is, the compounded interest is such that the amounts of deposit satisfy the relation Bui-Be=rB, t CP encn where the multiple r > 0 is called the interest rate. Since Bry: = (1 +r) Be, it follows that SORE RRE RR (4.1) The deposit B; is often called the money-market account. Suppose now that the annual interest rate is r and interest is paid n times each year. We divide 1 year into n equally spaced subperiods, so that the interest rate for each period is given by r/n. Following the same arguments as above, it is readily seen that the amount of deposit after m periods is given by (4.2) oo Bm=(1+=)", m=0,1,2, For example, when the interest is semiannual compounding, the amount of deposit after 2 years is given by By = (1+ 1/2) Suppose t = m/n for some integers m and n, and let B() denote the amount of deposit at time # for the compounding interest with annual interest rate r. From (4.2), we have rae (1+ 5) : 65 B(t)66 DERIVATIVE SECURITIES Now, what if we let n tend to infinity? That is, what if the interest is contin- uous compounding? This is easy because, from (1.3), we have B= [Jim (1+2)"]Jset, e209 (4.3) Here, we have employed Proposition 1.3, since the function f(x) = x* is continuous. Consider next the case that the interest rates vary in time. For simplicity, we first assume that the interest rates are a step function of time. That is, the interest rate at time t is given by r(t)=n iftist 0 is the time length covered by the LIBOR interest rates. For example, 5 = 0.25 means 3 months, 6 = 0.5 means 6 months, and so forth. From (4.5) with S(t) being replaced by u(t,t +6) and T by t+ 4, we identify the LIBOR rate L(t,t +4) to be the average interest rate covering the period [e+]. 4.3 Forward and Futures Contracts A forward contract is an agreement to buy or sell an asset at a certain future time, called the maturity, for a certain price, called the delivery price. The forward contract is usually traded on the OTC basis.FORWARD AND FUTURES CONTRACTS, nm ES x ° St) ° 5) FCO) EA (a) Party A (b) Party B Figure 4.2 The payoff functions of the forward contract, Suppose that two parties agree with a forward contract at time t, and one of the parties, say A, wants to buy the underlying asset at maturity T’ for a delivery price Fr(¢). The other party, say B, agrees to sell the asset at the same date for the same price. At the time that the contract is entered into, the delivery price is chosen so that the value of the forward contract to both sides is zero. Hence, the forward contract costs nothing to enter. However. party A must buy the asset for the delivery price F'p(¢) at date T. Since party ‘A can buy the asset at the market price S(T), its gain (or loss) of the contract, is given by X = S(T) - F(t). (4.16) Note that nobody knows the future price $(T). Hence, the payoff X at date T is a random variable. If S(T) = $ is realized at the future date T > t, the payoff is given by X(S) =S- Fr(t). The function X(S) with respect to $ is called the payoff function. The payoff function for party B is X(S) = Fr(t) — 8. Hence, the payoff functions of the forward contract are linear (see Figure 4.2). Example 4.3 Suppose that a hamburger chain is subject to the risk of price fluctuation of meat. If the cost of meat becomes high, the company ought to raise the hamburger price. Then, however, the company will lose approx- imately 30% of existing consumers. If, on the other hand, the cost of meat becomes low, the company can lower the hamburger price to obtain new con- sumers; but the increase is estimated to be only about 5%. Because the down- side risk is much more significant compared to the gain for this company, it decided to enter a forward contract to buy the meat at the current price 2. Now, if the future price of meat is less than «, then the company loses money, because the company could buy the meat at a lower cost if it did not enter the contract. On the other hand, even when the price of meat becomes extremely high, the company can purchase the meat at the fixed cost ©.ua DERIVATIVE SECURITIES The forward price for a contract is defined as the delivery price that makes the contract have zero value. Hence, the forward and delivery prices coincide at the time that the contract is entered into. As time passes, the forward price changes according to the price evolution of the underlying asset (and the interest rates too) while the delivery price remains the same during the life of the contract. This is so, because the forward price Fr(u) at time u>t depends on the price S(u), not on S(t), while the delivery price Fr(t) is determined at the start of the contract by S(t). As we shall see in Chapter 6, the time-t forward price of an asset with no dividends is given by sO Fr(t t0, fale = max(a,0}={ § 229 Similarly, the payoff function of a put option with maturity T and strike price K is given by (4.19) X = {K -S(T)}4 (4.20) ‘These payoff functions are depicted in Figure 4.3. Observe the non-linearity of the payoff functions. As we shall see later, the non-linearity makes the valuation of options considerably difficult. Example 4.4 Suppose that a pension fund is invested to an index of a finan- cial market. The return of the index is relatively high, but subject to the risk of price fluctuation. At the target date T, if the index is below a promised level K, the fund may run out of money to repay the pension. Because of this downside risk, the company running the pension fund decided to purchase put options written on the index with strike price K. If the level S(T) of the index at the target date T is less than K, then the company exercises the right to obtain the gain K — S(T), which compensates the shortfall, since S(T) + [K - S(T)] = K. If, on the other hand, S(T) is greater than K, the company abandons the right to do nothing; but in this case, the index may be sufficiently high for the purposes. The put option used to hedge downside risk in this way is called a protective put.1 DPRIVATIVE SECURITIES Note that the payoffs of options are always non-negative (see Figure 4.3) Hence, unlike forward and futures contracts, we cannot choose the strike price so that the value of an option is zero. This means that an investor must pay some amount of money to purchase an option. The cost is termed as the option premium. One of the goals in this book is to obtain the following Black-Scholes formula for a Buropean call option: o(S,t) = SB(d) — Ke T-O(d - oVT 8), (4.21) where log(S/K)+r(T—t) | covT=t a + SS,” ovT 2 S(t) = S is the current price of the underlying security, K is the strike price, T is the maturity of the call option, r is the spot rate, o is the volatility, and &(q) is the distribution function (3.14) of the standard normal distribution. ‘The volatility is defined as the instantaneous standard deviation of the rate of return of the underlying security. Note that the spot rate as well as the volatil- ity is constant in (4.21)-(4.22). The Black-Scholes formula will be discussed in detail later. See Exercise 4.7 for numerical calculations of the Black—Scholes formula. See also Exercises 4.8 and 4.9 for related problems. (4.22) 4.5 Interest-Rate Derivatives In this section, we introduce some commonly traded interest-rate derivatives in the market. For example, any type of interest-rate options is traded in the market. Let u(t,r) be the time-t price of a bond (not necessarily a discount bond) with maturity 7. Then, the payoff of a European call option at the maturity T is given by X={v(T,r)-K}4, tsT 0. For notational simplicity, we assume that the notional principal is unity and cash flows are exchanged at date T, = Tp +15, i =1,2,...,n. Let § be the swap rate. Then, in the fixed side, party A pays to party B the interest 5’ = 6S at dates T; and so the present value of the total payments is given by Varx = $'v(0,Ti) + $'0(0,T2) +--+ $'v(0,Tr) = 55 D> (0,7), (4.24) a where v(t, 7) denotes the time-t price of the default-free discount bond with maturity T. On the other hand, in the floating side, suppose that party B pays to party A the interest Li = 5L(T;,Ti41) at date Tj41, where L(t,t + 6) denotes the 5-LIBOR rate at time t. Since the future LIBOR rate L(T;,Ti+1) is not observed now (t = 0), it is a random variable. However, as we shall show later, the present value for the floating side is given by Ver = v(0, Zo) — v(0, Tn). (4.25) Since the swap rate S is determined so that the present values (4.24) and (4.25) in both sides are equal, it follows that (4.26) Observe that the swap rate S does not depend on the LIBOR rates. The swap rates for various maturities are quoted in the swap market. Let S(t) be the time-t swap rate of the plain vanilla. interest-rate swap. The swap rate observed in the market changes, as interest rates change. Hence, the swap rate can be an underlying asset for an option. Such an option is called a swaption, Suppose that a swaption has maturity r, and consider an interest-rate swap that starts at r with notional principal 1 and payments being exchanged at T, = 7 + #5, i = 1,2,...,n. Then, similar to (4.24), the value of the total payments for the fixed side at time 7 is given by Verx(r) = 58(r) Sone +16). i If an investor (for the fixed side) purchases a (European) swaption with strike rate K, then his/her payoff for the swaption at future time 7 is given by 5{S(7) — K}4. D> v(r,7 +48), (4.27) i=l76 DERIVATIVE SECURITIES Figure 4.4 Capping the LIBOR rate. since the value of the total payments for the fixed side is fixed to be Note that (4.27) is the payoff of a call option. Similarly, in the floating side, the payoff is 5{K — S(r)}4 Do v(r,7 + 18), which is the payoff of a put option. 4.5.2 Caps and Floors As in the case of swaps, let T; be the time epoch that the ith payment is made. We denote the LIBOR rate that covers the interval (Tj, Ti41] by Li L(Ti,Ti+1). An interest-rate derivative whose payoff is given by 6{li-K}4, &=Ti-T, (4.28) is called a caplet, where K is called the cap rate. A caplet is a call option written on the LIBOR rate. A cap is a sequence of caplets. If we possess a cap, then we can obtain {I,K}, from the cap. Hence, the effect is to “cap” the LIBOR rate at the cap rate K from above, since Li —{li—- K}4 SK. Figure 4.4 depicts a picture for capping the LIBOR rate by a cap. ‘A put option written on the LIBOR rate, that is, the payoff is given by 6{K —Li}e, 6 =Taa -Ty is called a floorlet, and a sequence of floorlets is a floor. If we possess a floor, then we can obtain {K — Li}+ from the floor, and the effect is to “floor” the LIBOR rate at K from below.EXERCISES 7 4.6 Exercises Exercise 4.1 Suppose that the current time is 0 and that the current forward rate of the default-free discount bonds is given by F(0,t) = 0.01 + 0.005 xt, #>0. Consider a coupon-bearing bond with 5 yen coupons and a face value of 100 yen. Coupons will be paid semiannually and the maturity of the bond is 4 years. Supposing that the bond is issued today and the interest is the contin- uous compounding, obtain the present value of the coupon-bearing bond. Exercise 4.2 In Example 4.1, suppose that the rate of return (yield) is caleu- lated in the continuous compounding. Prove that the yield is a unique solution of the equation q=Ce-¥/? + CeV + --- + Ce-P¥/? +. (1+ C)e~5¥ Explain how the yield y is obtained by the bisection method. Exercise 4.3 Prove the following identities, JE fltss)ds Sr f(t. s)ds T-t 7-T Exercise 4.4 In the discrete-time case, prove the following. @) Y@7)= @) fet.) = T-1 TIla+ Fes) act @ BO@=T]O+-) ey fon TT? Here, we use the convention []9_, a; = 1. Exercise 4.5 Suppose that the current time is 0 and that the current term structure of the default-free discount bonds is given by 0.5 1.0 2.0 5.0 10.0 (0,7) 0.9875 0.9720 0.9447 0.8975 0.7885 Obtain (a) the yields ¥ (0,2) and Y(0,5), (b) the forward yield f(0, 1,2), (c) the LIBOR rate L(0,0.5) as the semiannual interest rate, and (d) the forward price of »(0,10) maturing at 2 years later. Exercise 4.6 Draw graphs of the payoff functions of the following positions: (1) Sell a call option with strike price K. (2) Sell a put option with strike price K’, (3) Buy a call option with strike price Ki and, at the same time, buy a put option with strike price K2 and the same maturity (you may assume Ky < Ka, but try the other case too) Exercise 4.7 In the Black-Scholes formula (4.21)-(4.22), suppose that $ = K = 100, t = 0, and T = 0.4. Draw the graphs of the option premiums with respect to the volatility ¢, 0.01 < o < 0.6, for the cases that r = 0.05, 0.1, and 0.2.78 DERIVATIVE SECURITIES Exercise 4.8 Let ¢(z) denote the density function (3.13) of the standard normal distribution. Prove that $(@) = 7 “IY 6(d —o/TF—4), where dis given by (4.22). Using this, prove the following for the Black-Scholes formula (4.21): es = &(d)>0, So _— oe = are) + Kre"" @(d —ov/r) > 0, cK = —e"7®(d-oVF) <0, co = S¥rd(d)>0, and c = Kre"7&(d—ov7) > 0. Here, r = T—t and cz denotes the partial derivative of c with respect to x. Exercise 4.9 (Put-Call Parity) Consider call and put options with the same strike price K and the same maturity T’ written on the same underlying security S(t). Show that {S(T) - K}4 + K = {K - S(T)}4 + S(D)- Exercise 4.10 In the same setting as Exercise 4.5, obtain the swap rate with maturity T = 10. Assume that the swap starts now and the interests are exchanged semiannually. Interpolate the discount bond prices by a straight line if necessary. Exercise 4.11 (Implied Volatility) In the Black-Scholes model, the volatil- ity o is not actually observed in the market. Hence, given the other parame- ters, the implied volatility oa is calculated from Equation (4.21)~(4.22) and the market price car of the call option. Suppose ¢ = 0, r = 0.02, S = 100, K = 100, and T = 0.5. If the market price is cy = 2.1, calculate the implied volatility omCHAPTER 5 Change of Measures and the Pricing of Insurance Products ‘The change of measures provides an essential tool for the pricing of derivative securities. In this chapter, we describe the key idea of this extremely impor- tant tool for financial engineering for the case that the measure is changed by using a positive random variable. A direct application of this simple transfor- mation includes the pricing of insurance products. We explain how the Esscher transform is derived within this framework. The reader is referred to, for ex- ample, Borch (1990) and Gerber and Shiu (1994) for detailed discussions of insurance pricing. The change of measures based on a more general setting will be explained later. However, the idea is essentially the same even in the general setting. ‘Throughout this chapter, we fix the probability space (0, F,P) and denote the expectation operator by E? when the underlying probability measure is P. 5.1 Change of Measures Based on Positive Random Variables Let 7 be a positive, integrable" random variable defined on (9,F,P), and consider the set function Q: F + R by EF [ian] A)= Al AEF, 6.1) WA) = Be ea) where 14 denotes the indicator function, meaning that 14 = 1 if w € A and la = 0 otherwise. The set function Q defines a probability measure, since Q(M) = 1 and Q(A) > 0. The g-additivity (2.1) of Q follows from that of the indicator function. That is, for mutually exclusive events Aj, i= 1,2,..., we have lpg, ac= ola Note that the right-hand side of (5.1) corresponds to the original probability measure P, and the transformation defines a new probability measure Q. Such a transformation is called the change of measure. This is the basic idea of the change of measures. In the rest of this section, we study the transformation (5.1) from various perspectives. * A random variable X is said to be integrable if EP{|X|] < oo. 7980 CHANGE OF MEASURES AND THE PRICING OF INSURANCE PRODUCTS Example 5.1 (Esscher Transform) The positivity of 7 is always ensured if we set 7 = e+ for some random variable X for which the moment generating function (MGF) exists for the A. Then, the transformation (5.1) is rewritten 8s Pr eX EPtige*] (A) = eA, AEF. QA) = “Bex ‘The expectation of X under Q is given by EP[Xe*] eey" (5.2) E°[x] = which is called the Esscher transform in the actuarial literature and often usec for the pricing of insurance products. See Bihlmann et al. (1998) for details Since we assume that 7 > 0 is integrable, by defining 7/ > we pe EP[n] always have E®[7'] = 1. Hence, we can assume that E?[n] = 1 without loss o generality. The transformation (5.1) is then rewritten as Q(A) =E*[lan], AEF. (5.3; In the following, we describe the change of measure formula (5.3) in the integral form. Note first that Q(A) = E®[1,]. It follows that (5.3) can be written in integral form as [raw = fi nerrten, 267. (54 a i Since the formula (5.4) holds for any A € F, we have where X = Y a.s. (almost surely) if and only if P{w : X(w) = ¥(w)} = 1. The relationship for the random variables is stated as 7 = Sp, and the 17 is callec ae dc is also positive and integrable. Therefore, if P(A) > 0 then Q(A) > 0, and vio versa. In this case, the probability measures P and Q are said to be equivalent the Radon—Nikodym derivative. Since 7 is positive and integrable, 17 Theorem 5.1 (Radon-Nikodym) Suppose that two probability measure P and Q are equivalent. Then, there exists a positive, integrable random vari able n satisfying (5.4) uniquely. From Theorem 5.1, it is always possible to define transformation (5.4) fo any equivalent probability measures. That is, there always exists a Radon Nikodym derivative n > 0 for equivalent probability measures P and Q. Next, we describe the change of measure formula in terms of distributior functions (or density functions). To this end, we consider the expectatio: E{A(X)] for random variable X and real-valued function A(x) for which thCHANGE OF MEASURES BASED ON POSITIVE RANDOM VARIABLES 81 expectation exists. It follows from the definition of expectation and (5, 4) that E%A(X) fx a(eu) = [HX w)nleyPCaw) = ET nh(x)] Hence, we obtain E°(a(X)] = EP[nh(X)), (5.5) where 7 > 0 and EP{n] = 1. In fact, if we set h(X) = 14, we recover (5, 3) from (5.5). In order to state the expectations in (5.5) in terms of distribution functions, we denote the joint distribution function of (7, X) under P by F?(n,2), nS 0, 2 € R. This is so, because 7 and X may not be independent. The marginal distribution function of X under Q is denoted by F@(x). Then, transformation (5.5) is restated as 72 (de) = -” P finer (az) Lf nh(2)F¥ (dn, dz) Since function h(x) is arbitrary, we obtain FO (dz) [we anae, 2eR. (5.6) lb Hence, the change of measure formula (5.3) can be viewed as the transforma- tion of distribution functions of X from FP(x) = [5° F(dy,) under P to Fz) under Q defined in (5.6). If n and X are mutually independent, we have from (5.6) that Fax) = [nF *anFT(as), re R. Since [5° nF*(dn) = EF[n] = 1, it follows that F@(x) = FP(x). Hence, this case provides the trivial change of measures. Next, suppose that the Radon-Nikodym derivative 7 can be written as a function of X. That is, suppose that =X), EF[q]=1 for some function \(z) > 0. Then, from (5.5), we have E2(A(X)] = EPACX)A(X)] Suppose further that X admits a density function f? (zr) under P. Then, from (5.6), X also admits a density function f@(z) under Q given by $2) =Ae)fP(2), wER 6.7) Note that, since E*{\(X)] = 1, we have f° f@(x)dx = 1. ‘The following result is useful for the comparison between the expectations under P and Q in the transformation (5.7). See Kijima and Obnishi (1996) for details. Definition 5.1 (Likelihood Ratio Ordering) Suppose that two random variables X; admit density functions f;(x), i = 1,2. If the likelihood ratio82. CHANGE OF MEASURES AND THE PRICING OF INSURANCE PRODUCTS fila)/ fa(a) is non-decreasing in x, then X; (or fi(#)) is said to be greater than ‘Xz (or fo(x)) in the sense of likelihood ratio ordering, denoted by X1 21 X2 The proof of the next result is left in Exercise 5.4. Theorem 5.2 For two random variables X,, X2 such that X; > X2 and for non-decreasing function h(a), we have E{h(X1)] > Elh(Xa)] for which the expectations exist. When X(z) in (6.7) is non-decreasing (non-increasing, respectively) in x, the density function f(z) of X under Q is greater (smaller) than f?(z) under P in the sense of likelihood ratio ordering, and so we have E®[h(X)] > BP(h(X)] (E®(n(X)] < EP{A(X)]) for any non-decreasing function h(x). Example 5.2 (Esscher Transform, Continued) The Esscher Transform introduced in Example 5.1 can be viewed in terms of distribution functions as follows. Suppose that the payoff of an insurance product is given by h(X). The premium of the insurance is calculated from (5.2) as P fed = E%A(X)] = ae rey | enero Define . FA mie evFP(dy), ce R. It is easy to see that F(z) defines a distribution function on R, since FO (a is continuous and non-decreasing in x, F?(—co) = 0 and F®(oo) = 1. Also, EK(X)] = f a(x) aF (zx). Hence, F&(z) is a distribution function of X under Q. If X has a density function f*(<) under P, the density function under Q is given by 1 Az £P Pa) = BRO, eR If A > 0, then f@(x) >. fP(#) and E®{h(X)} = E®[h(X)] for any non- decreasing function A(z). 5.2 Black-Scholes Formula and Esscher Transform ‘As we shall see later, the Black-Scholes formula (4.21)-(4.22) can be obtained by change of measure formulas. In this section, assuming the interest rate r to be zero, we show that it is in fact a result of the Esscher transform. Suppose that the stock price at time t is modeled as S(t) = Se—27/)#+0x9), E> 0, (5.8) where jz and ¢ are positive constants and z(t) is a normally distributed random variable with mean 0 and variance ¢. The random variable z(t) with timeBLACK-SCHOLES FORMULA AND ESSCHER TRANSFORM 83. index ¢ is called a stochastic process. As we shall define later, the process z(t) is called a Brownian motion. Consider a financial product that promises to pay the amount C = h(S(T)) at some future time T. Note that S(7) is a random variable, because so is 2(T) ~ N(0,T). Hence, the random payment C is also a function of 2(T), say C = f(z(T)). According to the Esscher transform (5.2), the price of this product is given by EP[f(2(Z))e¥@] (C) Pty (5.9) for some 2. In this section, we assume that d=-08, = 4%. c ‘The meaning of the parameters will be explained later (see Example 14.4) Note that \ is negative in this setting. Now, since 2(T) ~ N(0,T), we have Ered] = e772 due to the MGF formula (3.17) of the normal distribution N(0,T). It follows from (5.9) that (0) =BP/(e(T))e*], X= 27) - 507. (6.10) Note that X is normally distributed with mean —6?7'/2 and variance 67. Hence, E?{e*] = 1 so that we can define a new probability measure Q ac- cording to Theorem 3.1. That is, the density function of X under Q is given by F%(z) =e7 fF), TER, which is the density function of (6/2, 6?T). Therefore, from (5.10), the price of the financial product is given by n(C) = E® [ (-F - 37) | , where X follows NV (07/2, 677) under Q. It follows that, using the standard normal random variable —Z, we obtain a(C) = EM f(VTZ ~ 67), (5.11) where Z ~ N(0,1) under Q Consider a call option whose payoff function is given by h(S) = {S— K}., so that, from (5.8), we have Selo? /2)T4on _ se) = {5 7 As in (5.10), the call option price in the Black-Scholes setting with r = 0 is obtained as x(C) =EP [* {sewretrareeacn = 5}, (6.12)84 CHANGE OF MEASURES AND THE PRICING OF INSURANCE PRODUCTS It follows from (5.11) that a(C) =E® [{sorstrverevee -K} | : (5.18) + since 06 = 1s by definition, where Z ~ N(0, 1) under Q. The calculation of the expectation in (5.13) leads to the Black-Scholes formula (see Exercise 5.5). Of course, we need more formal arguments for the rigorous derivation of the Black-Scholes formula; see Chapter 14. In the rest of this section, we calculate the expectation in (5.13) by applying the change of measure technique. Let A = {S(T) = K} be the event that the call option becomes in-the-money at the maturity. Recall that S(T) = Se" PTP+2VTZ, Z ~ N(O,1), under Q. Then, from (5.13), we obtain #(C) = E%(S(T)14] — E°[K 14) ‘The second expectation in this formula is just, E2K1,4] = KQ{S(T) > K} = K8(d-oVvT), (5.14) where ©(z) is the distribution function of the standard normal distribution and au WS) vr oVT 2 The proof is left in Exercise 5.6. Recall that we have assumed r = 0. Next, in order to calculate the first expectation in the above formula, we consider the following change of measure: ex (A) = Ee a GA) = Ean, = gaia where we set X = /TZ. It follows that E2(S(T)14] = SEL 49] = SE%{14] = SO{S(T) > K}. But, from Theorem 3.1, since X ~ N(0,T) under Q, we observe that X ~ N(oT, T) under Q. We then obtain Q{S(T) > K} = &(d). (5.15; Hence, we arrive at the Black-Scholes formula x(C) = S8(d) — KS(d-o VT), (5.16) where we set r = 0; cf. (4.21)-(4.22). The meaning of the measure @ anc the terms ®(d) and &(d — oVT) will be explained in Example 6.8. See alsc Examples 7.1 and 12.2 for related resultsPREMIUM PRINCIPLES FOR INSURANCE PRODUCTS 85. 5.3 Premium Principles for Insurance Products ‘This section overviews the pricing principle of insurance products in the ac- tuarial literature from the perspective of the change of measures. Throughout this section, the interest rate is kept to be zero for the sake of simplicity. ‘Let X denote the random variable that describes the claim of an insurance during the unit of time, and Jet c be the insurance premium. The simplest premium principle is the one that charges the mean of X as the insurance premium. However, in this case, since the insurance company has no profit in average, it will be bankrupt for sure. Hence, the company needs to add some risk margin to the mean EP(X]. See Example 12.4 for another concept of risk premium from the viewpoint of insurance companies. ‘The popular premium principles among others are the following. For some 5>0, 1. Variance Principle: ¢ = E?[X] + 6V?[X] 2. Standard Deviation Principle: ¢ = EP(X] + 6./V*[X] a? [eX] 3. Exponential Principle: ¢ = ok) sole: [xXe°%] 4, Bsscher Principle: = “pricsay 5. Wang Transform: c = E®[X]; F(x) = ®(-1(FP(2)) — 4) Here, VP denotes the variance operator under P and 6 is associated with risk premium. In the following, we explain the exponential principle, Esscher principle, and Wang transform. The others are straightforward. 5.3.1 Exponential Principle Suppose that the utility function of an insurance company is u(x). When the company sells an insurance at the premium c for claim X, the expected utility is given by EP [u(x + ¢ — X)], where « represents the initial wealth of the company. If the company sells nothing, its utility is just u(z). Hence, at least, the company wants to set the premium so that u(x) < EPfu(x + ¢—X)]. (6.17) Otherwise, the insurance company has no incentive to sell the product. Of course, in the highly competitive market, the company must determine the premium as cheap as possible. Suppose, in particular, that the company has the exponential utility func- tion u(x) = —e~®, 6 > 0. Then, from (5.17), we obtain ef < BPeX] = mx(8)86 CHANGE OF MEASURES AND THE PRICING OF INSURANCE PRODUCTS Hence, the premium 7(X) of the claim must be given by log mx (0) a When the claim follows a normal distribution (4,07), we have log mx (6) = 8 + 0767/2, m(X) = 1 0>0. (5.18) hence 6. m(X) = EP[X] + 5VFIX], 8 >0, which coincides with the variance principle. 5.3.2 Esscher Principle In the Esscher principle, the insurance fee of claim X is calculated by applying the Esscher transform (5.2), that is, BP[Xe°%] Brey for some @ > 0. Recall from (5.7) that the density function of X under Q is given by a(X) = (5.19) ex Qn) = Pe), A(t) = =a F(a) = A@)FP(@), A@) BPX] When @ > 0, since e% is increasing in x, Theorem 5.2 ensures that E®[X] > E?[X]. That is, when @ > 0,1 the insurance premium calculated by the Esscher principle (5.19) is higher than the mean value EP[X]. From the definition of covariance, the Esscher principle is written as CPX, 2%] Pex)’ When @ > 0, since we have C?[X,e"*] > 0, the risk margin « in the Esscher principle is given by a(X) = EPIX] + 6>0. CPLEX, 0°*] Pre, Note that, when the MGF mx (6) = E?[e®*] exists, we have my (6) = EP[xe°*}. >0. Hence, the Bsscher principle can be alternatively stated as m'(0) mx (6) ‘This means that the insurance premium in the Esscher principle is calculated m(X) = = (log mx (0))'. t In the transformation (5.9) to derive the Black-Scholes formula, we take 4 <0 and the call price is smaller than the mean of the call payoff. This is so, because the Black-Scholes formula evaluates the risk of asset (call option). On the other hand, the Esscher principle (5.19) evaluates the risk of debt (insurance),PREMIUM PRINCIPLES FOR INSURANCE PRODUCTS: 87 once the cumulant generating function logm (@) of the claim X is obtained. Also, when @ is close to zero, the exponential principle (5.18) is the same as the Esscher principle. In particular, when X_~ N(u,0?), that is, the claim follows a normal distribution, the Esscher principle becomes n(X) = EP[X] + 6V"[X], which agrees with the variance principle. This is a result from Theorem 3.1 that the Esscher transform changes normal distribution N(u,0*) to another normal distribution N(u + 907, 0). ‘The drawback of the Esscher principle is that, when the claim follows a log- normal distribution, the premium cannot be calculated, because the MGF of log-normal distributions does not exist. In actual markets, insurance products often follow log-normal distributions, hence this may be e serious drawback in practice. 5.3.3 Wang Transform Wang (2000) proposed a universal pricing method based on the following transformation: F%Xx) = 6[“1(FP(x)) 0], @>0, ER, (6.20) where denotes the standard normal cumulative distribution function (CDF for short), F(z) is the CDF for the risk Y and 2 is a positive constant. The transform is called the Wang transform and produces a risk-adjusted CDF Fz). The mean value evaluated under F@(c) will define a risk-adjusted “fair value” of risk Y at some future time, which will then be discounted to the current time using the risk-free interest rate. Note that, in contrast to the Esscher transform, the Wang transform (5.20) can be applied to the case that the CDF F®(z) is a log-normal distribution. ‘Also, the Wang transform not only possesses various desirable properties as ‘a pricing method, but also has a clear economic interpretation, which can be justified by a sound economic equilibrium argument for risk exchange. For example, the Wang transform (as well as the Esscher transform) is the only distortion function among the family of distortions that can recover the CAPM (capital asset pricing model) for underlying assets and the Black~ Scholes formula for option pricing. See Wang (2002) for details. From (5.20), when @ > 0, we have ®"1(F@(z)) < 71(F?(z)), so that F(x) < FP(c) for any x € R. It follows that, when 6 > 0, we have EX] > E?(X], that is, the insurance premium is greater than the mean of X. The proof is found in Exercise 5.4. By differentiating (5.20) with respect to 2, we obtain a) = rw, ¢= oF). (6.21) + ‘The transform (5.20) describes a distortion of CDF from FF to F®.