Math & ScienceAll new equations generated with the same equation number

Information and discussion about LaTeX's math and science related features (e.g. formulas, graphs).
bazman
Posts: 78
Joined: Mon Jan 26, 2009 3:24 am

All new equations generated with the same equation number

Post by bazman »

Hi there,

Please excuse the mamouth code sample. But the nature of the problem is such that I don't know how to reduce the size while still retaining the error.

Basically all my eqautions are supposed to be numbered but ever since squation (3.41) all the equations have been numbered (3.41)?

Is there a limit to the amount of numbered equations you can have? Surely not?

Code: Select all

\documentclass{article} % Your input file must contain these two lines
\usepackage{natbib}
\usepackage{amsmath}
\usepackage{amsfonts}
\usepackage{bm}
\usepackage{grffile}
\usepackage{graphicx}
\usepackage[%
  font=small,
  labelfont=bf,
  figurewithin=section,
  tablewithin=section,
  tableposition=top
]{caption}
\numberwithin{equation}{section}

\makeatletter
  \def\env@matrix{\hskip -\arraycolsep
  \let\@ifnextchar\new@ifnextchar
  \array{*\c@MaxMatrixCols l}}
\makeatother

\begin{document} % plus the \end{document} command at the end
\parskip = 1pc %change spacing between paragraphs
\parindent = 0pc %change paragraph indentation
\section{Introduction}

\section{LMM}

\subsection{Introduction} % This command makes a section title.
The Black model was already well established in the interest rate market. This allowed trader to price caps and swaptions individually i.e. in their own specific measure. However there was no framework to price caps swaptions or any other LIBOR product of a different maturity (and so measure) consistently. The seminal work of Heath, Jarrow and Morton (1992) whose great insight was that the no arbitrage of the state variables (e.g.) forward rates could be expressed as a function of the volatilities of and correlations between the state variables themselves. The HJM was originally cast is terms of instantenous forward rates, which don't actually trade in the market, also a point raised in the HJM paper was that in the continuous time limit for truely instantaneous and log-normal forward rates, their process explodes with positive probability. This lead early implementation of the approach that tried to steer clear of log normality: this was inconsistent with the already widely accepted Black approach and ultimately proved to be a dead end.

A new approach besed on HJM and first described in the papers by Brace et al(1996), Jamshidian(1997) and Musiela abd Rutkowski (1997) appeared in the mid 90's. 

\begin{itemize}
	\item Recast the yield curve in terms of market observable discrete sets of foreard rates
	\item The no arbitrage drifts for the forwards were translated from the continuous time HJM setting tothe new discrete setting.
	\item a numeraire had to be chosen (in early attempts this a discretely compounded money market account was invented, but forward and swap measures soon followed.
	\item the log normal distribution assumtption for forward rates was introduced
\end{itemize}
  
The LMM (like its HJM forebearer) is not a model per se  but rather a set of no arbitrage conditions among forward rates. There are various incarnations of the model. It can be formulated in terms of forward rates, swap rates, they can be normally, lognomally or otherwise distributed, the associated numeraire could be a zero coupon bond, a swap annuity or the money market account. These choice along with the specific forms of the instantaneous volatility and correaltion functions used fully specuify the model. The simultaneous specification of these time-dependent volatilities and correlations became \textit{the} problem in the specification of the LMM model. 
 

When using the forwards as the state variabels and log normality is assumes, caplet vols can be recovered exactly and in a very strightforward maner. Indeed the Lognormal forward market model prices caps using blacks formula so calibration is very strightforward. This is one of the reasons the model has become so popular it uses the market prices of underlying instruments as building blocks, hence the name the `market` model.
 
It should be noted that the Libor Forward Model and Libor Swap Model are theoretically inconsistent but this is a not a concern in practive as excellent approxiamtions exist see Brigo (2007) and Rebonato (2002).

As we will see the LMM lends itself well to monte carlo simulation and the pricing of path dependent options in multi-factor framework. However because the forward process implied by the LMM is in general non-Markovian it does not lend it self easily to recombining lattice type pricing techniques, consequently dealing with early exercise features can be problematic. However Carr and Yang 1997 provide an example of how to approximate a Bermuden price using a markov chain, as does Andersen (1999) (who approxiamtes the early exercise boundary as an function of intrinsic value and `sill alive' nested European swaptions. Further a general method for combining backward induction with Monte Carlo simulation has been proposed by Longstaff and Schwartz (2001). 


\subsection{Model Set-Up}

Following the presentation in Rebonato et al 2009 \nocite{*}.

A discrete set of default-free discount bonds are assumed to trade in the economy $P_t^i$. Spanning forward rates are denoted by.



\begin{equation}
f(t,T_i,T_{i+1}) = f_i^t \qquad i=1,2,...,N \\
\end{equation} 

The instantaneous volatilities of the forward reates are denoted:

\begin{equation}
\sigma(t,T_i)=\sigma_t^i \qquad i=1,2,...,N \\
\end{equation} 

The instantaneous volatility between forward rate i and j is denoted:

\begin{equation}
\rho(t,T_i,T_j)=\rho_{i,j}^t \qquad i,j=1,2,...,N \\
\end{equation} 

We choose as numeraire the discount bond $P_t^i=P(t,T_i)$

The forward rate can be expressed in terms of discount bonds as follows:

\begin{equation}
f_t^i= (\frac{P_t^i}{P_t^{i+1}}-1)\frac{1}{\tau_i}  \\
\end{equation} 

The description of the discrete yield curve is them completed by providing the value of the spot rate, i.e., the rate for lending/borrowing from spot time to $T_1$, given by:

\begin{equation}
r_0= (\frac{1}{P_0^{1}}-1)\frac{1}{\tau_1}  \\
\end{equation} 

Thus in the determinisitic-volatiltiy LMM the evolution of the forward rates is given by:

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma^i(t,T_i)dz_t^i \\
\end{equation} 

with

\begin{equation}
\mathbb{E}[dz_t^idz_t^j] = \rhp(t,T_i,T_j)dt  \\
\end{equation} 


Where $\textbf{f}_{t}$ is a vector of spanning forward rates, $\bm{\sigma}_t$ is the vector of associated volatilities, and $\bm{\rho}$ the correlation matrix


The following result is taken from Brigo et al 2006, under the lognormal assumption, we obtain the dynamics of $f_k$ under the forward adjusted measure $Q^i$ in the three cases $i<k$, i=k and $i>k$. 

\begin{equation}
  df_k(t) =
  \begin{cases}
    \sigma_k(t)f_k(t) \displaystyle\sum_{j=i+1}^k \frac{\rho_{k,j}\tau_j \sigma_j(t)f_j(t)}
      {1+\tau_jf_j(t)}dt +\sigma_k(t)f_k(f)dZ_k(t) &\text{if }i<k, t \leq T_i.\\
    \sigma_k(t)f_k(f)dZ_k(t) &\text{if }i=k, t \leq T_i .\\
    -\sigma_k(t)f_k(t) \displaystyle\sum_{j=k+1}^i \frac{\rho_{k,j}\tau_j \sigma_j(t)f_j(t)}
      {1+\tau_jf_j(t)}dt +\sigma_k(t)f_k(f)dZ_k(t) &\text{if }i > k, t \leq T_i.
  \end{cases}
\end{equation}


where $Z=Z^i$ is brownian motion under $Q^i$. All of the equations above asmit a unique strong solution if the coefficients $\sigma(.)$ are bounded.


Going back to equaiton (2.6) it can be re-written as:

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sum_{k=1}^{m} \sigma_{ik}dz_k \\
\end{equation} 

where we assume that we are dealing with m $(m \leq	 N)$ factors and that the Brownian motions are independent. The quantities $ \sigma_{ik} $ can be interpreted as the loading of the ith forard rate on to the kth factor. Hence:


\begin{equation}
\sigma_i(t)= \sqrt{\sum_{k=1}^m \sigma_{ik}^2(t)} \\
\end{equation} 

If the function has been chosen such that

\begin{equation}
\int_0^{T_i} \sigma_i(t)^2 dt = \hat{\sigma}_i^2 T_i \\
\end{equation} 

holds true, then the market caplets will be correctly priced. $ \hat{\sigma}$ represents the Black impled volatility.

If each loading $\sigma_{ik}$ is now multiplied and divided by the volatility $\sigma_i$ of the ithe forward rate:

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma_{i} \sum_{k=1}^{m} \frac{\sigma_{ik}}{\sigma_i}dz_k \\
\end{equation} 

Using (2.10) this can be re-written as


\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma_{i} \sum_{k=1}^{m} \frac{\sigma_{ik}}{\sqrt{\sum_{k'=1}^{m}\sigma_{ik'}^2}}dz_k \\
\end{equation} 

Defining $b_{ik}$ as

\begin{equation}
b_{ik} \equiv \frac{\sigma_{ik}}{\sqrt{\sum_{k'=1}^m \sigma_{ik'}^2}} \\
\end{equation} 

thus (2.13) can be expressed as

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma_{i} \sum_{k=1}^{m} b_{ik} dz_k \\
\end{equation} 

where $ \bm{b} $ the $[N*m]$ matrix of elements $b_{jk}$. It can readily be shown that the correlation matrix is 


\begin{equation}
\bm{b}\bm{b^T}= \bm{\rho} \\
\end{equation} 

Equation (2.15) allows the stochastic part to be decomposed into a pure volatility component which can be easily calibrated to caplet prices and a correlation component $\bm{b}$ which can for instance be fitted to historical correlation matrices.

\subsection{Volatility Function}

As stated earlier a given implentation of the LMM depends laregely on the calibration of the volatlity and correlation functions.

There are various choices but again following Rebonato et al 2009. I show an example of a volatility function for the determinsitc case. This is both for paedological reasons and because this function is used and expanded by Rebonato et al when the apply the SABR model. This model has been shown to be a good match empirically in Rebonato (2002) and (2004). 

\begin{equation}
\sigma_t^T=[a+b(\tau)]\exp^{[-c(\tau)]}+d \\
\end{equation} 

\begin{itemize}
	\item As shown empirically in Rebonato (2002) and (2004) the humped shape shown is prevalent in `normal' trading environments, while the monotonically decreasing shape is prevalent in `excited' trading environments after major market dislocations
	\item It is time homogeneous which as shown in  Rebonato (2002) and (2004) is a very desirable property matching empirical behaviour of the instantaneous volatility curve
	\item It is square integrable
	\item It is very intuitive to use. a+d is the value of the instantaneous volatility  of the expiry reaches zero; d is the instantaneous vol for long maturities and the position of the hump (assuming there is one) is given by $\hat{\tau}=\frac{1}{c}- \frac{a}{b}$
	\item When fitted to ATM swaptions or caplets natural fits are obtained. Although some fine tuning is often necessary see below.
\end{itemize}

In general an extra paremeter is required to attain a prefect fit to caption prices. This parameter is maturity specific and so not time homogeneous. However it is constrained during the optimisation process so that it is as close as possible to one. Rebonato himself confeses to this being a fudge factor but it ends up playing an inportant role in the SABR version of the model.

\begin{equation}
k_{Ti}^2=\frac{\hat{\sigma_{T_i}^2T_i}}{\int_0^{T_i}([a+b(\tau)]\exp^{-c(\tau)]}+d)^2d\tau} \\
\end{equation} 


\begin{figure}[h]
\centering
\includegraphics[scale=0.5, angle=-1]{volcurve.eps}
\caption{Possible shapes of the volatility function in equation (2.17)}\label{fig:2.1}
\end{figure}






\subsection{Correlation Function}

\subsubsection{Simple Exponential Function}

\begin{equation}
\rho(t,T_i,T_j) = \exp^{[-\beta|T_i-T_j|]} \qquad t \leqmin(T_j,T_j)  \\
\end{equation} 

\begin{itemize}
	\item Assuming $\beta$ is positive the further apart two forwart rates are the more decorrelatied they will be which matches empirical observation
	\item For any positive $\beta$ the corresponding matrix $\rho$ will always be real symmentric matrix with positive eigenvalues
	\item However it gives the same decorrealtion between forwards at 1y and 1y 1m as ones at 10y and 10y 1m. This does not match empirical observation. However for instumetns like swaptions where it is only the net level of correlation that matters not the shape of the correaltion matrix this makes very little difference in practice. However for products like CMS spread options this would be a very serious flaw.  
\end{itemize}

A further advantage is that this simple function has no dependence on  time variable in:

\begin{equation}
C(i,j,k) = \int_{T_k}^{T_{k+1}} \sigma_u^i \sigma_u^j \rho(u,T_i,T_j) du  \\
\end{equation} 


and therefore one can write
\begin{equation}
 C(i,j,k) = \rho_{i,j} \int_{T_k}^{T_{k+1}} \sigma_u^i \sigma_u^j du   \\
\end{equation} 

 
which simplifies the computational burden


\subsubsection{Doust Correlation Function}

Obviously we would like to find a way to introduce a dependence of the correlation factor $\beta$  on the epries of teh forward rates, $\beta=\beta(T_i,T_j)$ in such a way that the resulting correlation matric reamins positive definite. 

Assuming we takea 5 x 5 real symmetric matrix, where we retrict ourselves to only specifying (5-1) quantities, $a_1, a_2, a_3$ and $a_4$ keeping then between 1 and -1. Then Doust (2007) (as reported in Robonato et al 2009) showed that the resulting matrix is always positive defintie and so a possible correlation matrix.



\begin{equation}
\begin{bmatrix} 
1 & a_1 & a_1a_2 & a_1a_2a_3 & a_1a_2a_3a_4 \\
a_1 & 1 & a_2 & a_2a_3 & a_2a_3a_4  \\
a_1a_2 & a_2 & 1 & a_3 & a_3a_4 \\
a_1a_2a_3 & a_2a_3 & a_3 & 1 & a_4 \\
a_1a_2a_3a_4 & a_2a_3a_4 & a_3a_4 & a_4 & 1 \end{bmatrix} &&
\end{equation}


The result admits a cholesky decomposition

\begin{equation}
\begin{flalign}
\begin{bmatrix} 
1 & 0 & 0 & 0 & 0 \\
a_1 & \sqrt{1-a_1^2} & 0 & 0 & 0  \\
a_1a_2 & a_2 \sqrt{1-a_1^2} & \sqrt{1-a_2^2} & 0 & 0 \\
a_1a_2a_3 & a_2a_3\sqrt{1-a_1^2} & a_3\sqrt{1-a_2^2} & \sqrt{1-a_3^2} & 0 \\
a_1a_2a_3a_4 & a_2a_3a_4\sqrt{1-a_1^2} & a_3a_4\sqrt{1-a_2^2} & a_4\sqrt{1-a_3^2} & \sqrt{1-a_4^2} \end{bmatrix} &&
\end{flalign}
\end{equation} 

The condition that ensures a valid correlation matrix is that all the lements on the main diagonal must be real. This will always be the case so long as

\begin{equation}
 -1 \leq a_i \leq 1 \qquad \text{for any i}   \\
\end{equation} 

Therefore tp ensure that you are contructing a valid correlation matrix one proceeds af follows. All the diagonal elements are set to

\begin{equation}
 \rho_{ii} = 1   \\
\end{equation} 

The elements in the first row are then defines as:

\begin{equation}
 \rho_{i,i} = \prod_{k=1}^{j-1} a_k = \rhp_{j,1}, \qquad j =2,...,n   \\
\end{equation} 

Where assuming $i > j$ 

\begin{equation}
 \rho_{i,j} = \frac{\rho{1,i}}{\rho{j,1}} = \frac{\rho{1,i}}{\rho{1,j}}= \frac{\prod_{k=1}^{i-1}a_k}{\prod_{k=1}^{j-1}a_k}=\prod_{k=j}^{i-1}a_k   \\
\end{equation} 

The remaining elements may be found by symmetry \rho_{ij} =\rho_{ji}$ 

Rebonato et al 2009 suggest that the $a_i$ be chosen in the following maner.

\begin{equation}
 a_k = \exp^{[-\beta_k \Delta T]}  \\
\end{equation} 

where $\Delta T$ is the spacing between the forward rates. Therefore

\begin{equation}
 \rho_{i,j} = \prod_{k=j}^{i-1} a_k = \prod_{k=j}^{i-1} \exp^{[-\beta_k \Delta T]} = \exp^{[-sum_{k=j}^{i-1}\beta_k \Delta T]}   \\
\end{equation} 

When $\beta_k = \beta_0$ for all k's the expression degenerates to the simple exponential equation shown earlier

\begin{equation}
 \rho_{i,j}^0 = \exp^{[-(i-j) \beta_0 \Delta T]} = \exp^{[-\beta_0|T_i - t_j|]}   \\
\end{equation}

If we set $\beta_k = g_0+g1/k+g2/k^2 + ... $ for positive $g_i$ then the $\beta_k$ are both decreasing in k and always positive. Which is in line with the emirical finding outlines earlier.

The Doust correlation function can be simply augmented to allow the long term correlation to level off at some level rather that decreasign montonely to zero


\begin{equation}
 \rho_{ij}(t) = LongCorr _(1-LongCorr) \hat{\rho_{ij}}(t)  \\
\end{equation}

where $ \hat{\rho_{ij}}(t) $ is a valis correaltion matrix produced in this instance by the Doust method.

In the Doust model the decorrelation does not depend just depend on teh distance between two forwards also on their maturities. This means that correlation can no longer be pulled out of the covariance integral

 
\begin{equation}
C(i,j,k) = \int_{T_k}^{T_{k+1}} \sigma_u^i \sigma_u^j \rho(u,T_i,T_j) du \equiv I_k  \\
\end{equation} 

However because the Doust correlation function is time homogeneous, so thatthe decorrelation between contracts 4 and 5 at time $T_1$ will be equal to the decorrelation between contracts 3 and 4 at time $T_0$. Thus in the determinsitic volatility case is the tims steps of the monte carlo simulation are chosen to match the maturities of the underlying forwards then calander time will not enter the covariance integral.

Things are not quite so straightforward in the stochastic volatility case. However at worst one simply has to store a covariance matrix at each time step (expiry of each forward) and then use linear interploation between them. The covariance matrices themselves thanks to the time homogeneous property of the Doust correlation function.

\begin{figure}[h]
\centering
\includegraphics[scale=0.1, angle=-1]{correlation.eps}
\caption{Example of Doust correlation structure}\label{fig:2.2}
\end{figure}

\subsection{Intrinsic Incompleteness of LMM}

In what follows I give a couple of very strightforwad example from Rebonato (2004) that highlights some of the problems surrounding the calibration of the LMM model even in the deterministic volatility environment. 

\begin{figure}[h]
\centering
\includegraphics[scale=0.1]{swaption.eps}
\caption{The reset and expiry times of the two forwards and of the swap rate}\label{fig:2.3}
\end{figure}


If we consider two forwards $f_1(t,T_1,T_2), f_s(t,T_2,T_3), SR_{12} = SR_{12}(t,T_1,T_3)$. The swap rate is a linear combination of the two forwards

\begin{equation}
 SR_{12} = w_1f_1 +w_2f_2  \\
\end{equation}
 

If we assume that all volatilities and correlation to be piecewise constant over the two time steps. Further assume that the forwards f1 and f2 both trade in the market with implied Black volatilities of $\hat{\sigma_1}$ and $\hat{\sigma_2}$ of $20\%$. The swaption Black volatility is taken to be $\hat{\sigma_{SR}}$ is $18\%$. 

As shown in Rebonato and Jackel (2002) a reasonably accurate approximation for the volatility of the swaption is given by


\begin{equation}
\sigma_{SR}^2SR^2= w_1^2f_1^2 \sigma_{1,1}^2+w_2^2f_2^2 \sigma_{2,1}^2 + w_1w_2f_1f_2\sigma_{1,1}\sigma_{2,1} \rho  \\
\end{equation} 

Clearly the volatility of the first forward must simply equal its Black volatiltity of $20\%$. Therefore so long as the net volatility of the second forward over the time periods is still matched to $20\%$

Then the problem of matching the swaption price is findamentally over specified. We have twp desgrees of freedom and only one price to match. This is true even though there is some constraint on the value if the 2nd forward.

\begin{equation}
\hat{\sigma_{2}^2}(t_2-T_0)= \int_{T_0}^{T_2} \sigma_2^2 du = \sigma_{2,1}^2)(T_1-T_0) + \sigma_{2,2}^2)(T_2-T_1) = 20\%  \\
\end{equation}

In the extrame we could take $\rho = 1$ and reduce the volatility between $T_1$ and $T_2$, clearly we would have to raise teh volatility of the same forward between times$T_2$ and $T_3$ to keep equation 2.35 balanced. Or we could keep the volatilties flat at $20\%$ and reduce the correlation. Or indeed any of an infinite number of combinations varrying the correlation and the instantaneous volatilities. 

As you can see the caplet and swaption surfaces can be fitted in an infintiy of ways. This is a problem because each fit corresponds to a different future evelotion of instantaneous volatilities and correlations and so give rise to different exotics prices. Thus the caplet and swaption markets are instrinsically incomplete even in the determinsitic volatility universe.

This is even more of a problem in the smile case. The LMM does allow the trader to at least directly express their views about the future term structure of volatilities and other unknowns, in a much more straight forwardly then with tradtional spot rate term structure models. However 
since vega hedging is essential to exotics derivaties trading, future re hedging costs must be considered and this requires future modelling of implied volatilities. This is why a plausible evolution of the smile is essential. If a model implies an implausible shape for the future smile as for example local-volatilitiy models do, it also implies implausible future prices for caplets and swaptions and therefore implausible exotics prices.


\section{Smile Models used with LMM}

\subsection{Local-Volatility Models}

\subsubsection{Shifted-Lognormal Model}

Following Brigo (2006) assume that the forward $f_j$ evolves under its $T_j£ forward measure according to

\begin{align}
	\begin{split}
f_j(t) &=X_j(f) + \alpha    \\   
dX_j(t) &=\beta(t)X_j(t)dW_t  
	\end{split}
\end{align}

where $\alpha$ is a real constant and $\beta$ is a deterministic function of time, $W_t$ is a standard Brownian motion.

Therefore a strighforward application of Ito's lemma and somesimple algebra gives:

\begin{align}
df_j(t) = \beta(t)(f_j(t)-\alpha)dW_t 
\end{align}

thus for $t < T \leq T_{j-1}$, the forward  rate $f_j$ can be written as

\begin{equation}
df_j(t) = \alpha + (f_j(t) -\alpha)\exp^{-\frac{1}{2}\int_t^T\beta^2(u)du + \int_t^T\beta(u)dW_u} 
\end{equation}

The distribution of f_j(T), conditional on f_jt(t), $t < T \leq T_{j-1}$ is then a shifted lognormal distribution with density

\begin{equation}
p_{f_j(T)|f_j(t)}(x) = \frac{1}{(x-\alpha)U(t,T)\sqrt(2pi)} \exp^{-\frac{1}{2}(\frac{\ln(\frac{x-\alpha}{f_j(t)-\alpha})+\frac{1}{2}U^2(t,T)}{U(t,T})^2)
\end{equation}

for$ x > \alpha$, where

\begin{equation}
U(t,T) := \sqrt{\int_t^T \beta^2(u)du}
\end{equation}

so that for $ \alpha < K$ the caplet price \textbf{Cpl}$(t,T_{j-1},T_j,\tau_j,N,K)$ associated with equation (3.2) is given by

\begin{equation}
\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,N,K)=\tau_jNP(t,T_j)\textbf{Black}(K-\alpha,f_j(t)-\alpha,U(t,T_{j-a}))
\end{equation}

Thus

\begin{equation}
(f_j(0)- \alpha)[2 \Phi(\frac{1}{2}U(0,T_{j-1}))-1] = f_j(0)[2 \Phi(\frac{1}{2}\sqrt{T_{j-1}}\hat{\sigma}(f_j(0), \alpha))-1]
\end{equation}

clearly increasing $\alpha$ ont eh left hand side needs a corresponding reduction in $\hat{\sigma}$ on the right hand side. Further differentiating a(3.6) with respect to $\alpha$ clearly produces a value which is always negative.

Therefore for $\alpha < 0$ we always havea decreasing curve. Also for $\alpha < 0$ the whole curve is moved downwards and vice versa.

\begin{figure}[h]
\centering
\includegraphics[scale=0.1]{ShiftedLognormal.eps}
\caption{Caplet Volatility Structure $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.1}
\end{figure}

\subsubsection{Constant Elasticity of Variance Model}

Following Andersen and Andreasen 

\begin{equation}
df_j(t)=\phi(f_j(t))\sigma_j(t)dZ_j^j(t)
\end{equation}

where $\phi$ is a general function. Andersen and Andreasen suggest

\begin{equation}
\phi(f_j(t))=[f_j(t)]^\gamma
\end{equation}

with $0<\gamma<1$. The limiting cases of $\gamma=0$ and $\gamma=1$ which are the fully normal and lognormal dynamics respectively. For values ibn between the dynmaics are a mixture of teh two processes.

Thus the model is

\begin{equation}
\phi(f_j(t))=\sigma_j(t)[f_j(t)]^\gamma dW_t \qquad f_j=0 \text{ absorbing boundary when } 0< \gamma<\frac{1}{2}
\end{equation}

Where W is a one-dimensional Brownian motion under teh $T_j$ forward measure

For $0<\gamma<\frac{1}{2}$  (3.10) does not have a unique solution unless we set $f_j=0$ as an absorbing boundary for the above SDE. Andersen and Andreasen also consider the case of $\gamma>1$ but note that this can leas to explosions when leaving the $T_j$ measure. 

Time dependence of $\sigma_j$ can be dealt with through a determinstic time change

\begin{equation}
\upsilon(\tau,T) = \int_{\tau}^T \sigma_j(s)^2ds
\end{equation} 

definig 

\begin{equation}
\tilde{W}(\upsilon(0,t)) := \int_0^t \sigma_j(s)dW(s)
\end{equation}

which gives us a Brownian motion $\tilde{W}$ with time parameter $\upsilon$. This is substituted in equation (3.10) 

\begin{equation}
df_j(\upsilon) = f_j(\upsilon)^{\gamma}\tilde{dW}(\upsilon) \qquad f_j=0 \text{ absorbing boundary when } 0< \gamma<\frac{1}{2}
\end{equation}

 
This process can be transformed into a Bessel  process via a change of variable. Some manipulations allow us to find teh transition density of f. Remembering the time change the continuous part of the density function of $f_j(T)$ conditional on $f_j(t), $t < T \leq T_{j-1}$ is then given by



\begin{align}
	\begin{split}
p_{f_j}(T)|f_j(t) &= 2(1-\gamma)k^{1/(2-2\gamma)}(uw^{1/(4-4\gamma)\exp^{-u-w}\textbf{I}_{1/(2-2\gamma)}(2\sqrt{uw}) \\
k &=\frac{1}{2} \upsilon(t,T)(1-\gamma)^2} \\
u &= k[f_j(t)]^{2(1-\gamma)} \\ 
w &=kx^{2(1-\gamma)} \\
	\end{split}
\end{align}


With $I_q$ denoting the modified Bessel function of the first kind of order q.
Denoting by $G(y,z) = \frac{\exp^{-z}z^{y-1}}{\Gamma(y)}$ the gamma density function and by $G(y,x)= \int_x^{+\infty} g(y,z)dz$ the complmentary gamma distribution, the probability that $f_j(T) =0$ conditional in $f_j(t)$ is $G(\frac{1}{2(1-\gamma)},u)$ 

From these results an explicit solution for caplet prices can be derived

\begin{equation}
\begin{split}
\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,N,K) = \tau_jNP(t,T_j)[f_j(t).\sum_{n=0}^{+\infty}g(n+1,u)G(c_n,kK^{2(1-\gamma)}) \\ - K \sum_{n=0}^{+\infty}g(c_n,u)G(n+1,kK^{2(1-\gamma)})]
\end{split}
\end{equation}

where

\begin{equation}
c_n := n + 1 + \frac{1}{2(1- \gamma)}
\end{equation}


\begin{figure}[h]
\centering
\includegraphics[scale=1]{CEV.eps}
\caption{Caplet Volatility Structure $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.2}
\end{figure}


Andersen and Andreasen also propose an extension called the Limited CEV process that addresses te problem absorption at $F_j=0$. They propose a new process where


\begin{equation}
\phi(f) = f\text{ } min(\epsilon^{\gamma-1}.f^{\gamma-1})
\end{equation}

Basically the model collapses the process to a lognormal one for very low Levels of f. 



\subsubsection{Comment on CEV and Shifted Lognormal Models }

From visual inspection of Fig 3.1 and 3.2 it is quite clear that the smiles produces by the two models are qualitively similar. Indeed Rebonato (2002) shows that for a range of futures values there is a derect equivalence between the two models. He also shows how both these approaches can be applied within the LMM framework in a relatively straightforward maner.

The economic intuition behind these models is that in a log normal world and in a rising term structure environment a with rates at 1y equal to 3% and 2y equal to 6%. Then a shock in rates should produce a price move twice as large in the 6% rate. In a fully normal world the would both move by the same absolute amount. Empirically it has been obsrved that although the higher rate does tend to move more it does not move twice as much. As mentioned before the CEV model can be set to work as fully lognormal when $\gamma=1$ or normal when $\gamma=0$. Choosing a value in between allows the market behavious to be matched.

Simple skews like this were witnessed inteh market as far back as 1996. It was only later after the Russia crises that full smiles were winessed in interest rate markets.


\subsubsection{Brigo and Mercurio Mixture Type Models }


Brigo and Mercurio (2001, 2002) proposed a class of analytically tractable diffusions that are flexible enough to cover more general voltility surfaces. 

Let the dynamics of the forward rate $F_j$ under the forward measure $Q^j$ be denoted by

\begin{equation}
dF_j(t) = \sigma(t,F_j(t))F_j(t)dW_t
\end{equation}

Where $\sigma$ is the local volatility and must be chosen such that it is a well behave deterministic function.

It is chosen to so that it grants a unique strong solution to the SDE (3.18). In particular it is assumed that $\sigma$ satisfies for a suitable positive constant L, the linear growth condition

\begin{equation}
\sigma^2(t,y)Y^2 \leq L(1+y^2) \text{ uniformly in t}
\end{equation}

which is there to ensure a strong solution.

They then consider the N diffusion process with dynamics given by

\begin{equation}
dG_i(t) \upsilon(t,G_i(t))dW_t \text{   i=1,...,N,   }G_i(0)=F_j(0)
\end{equation}

Where similar to equation $\sigma$ in (3.18) the $\upsilon_i(t,y)$ are real functions each satisfying the same linear growth conditions.

For each t, we denote by $p_t^i(.)$ the density function of $G_i(t)$, i.e., $p_t^i(y) = d(Q^j{G_i(t) \leq y})/dy$. $p_0^i$ is the $\delta$-Dirac function centred on $G_i(0)

The problem is to derive the loical volatiltiy  $\sigma(t,S_t)$ such that the $Q^j$-density of $F_j(t)$ satisfies for each time t,

\begin{equation}
p_t(y) := \frac{d}{dy}Q^j{F_j(t) \leq y} = \sum_{i=1}^N \lambda_i \frac{d}{dy}Q^j{G_i(t) \leq y} = \sum{i=1}^N\lambda_ip_t^i(y)  \\
\end{equation}

whre teh $\lambda_i$'s are strictly positive constants such that $\sum_{i=1}^N \lambda_i =1$. Indeed $p_t(.)$ is a proper $Q^j$ density function since by definition

\begin{equation}
\int_0^{+\infty} yp_t(y)dy = \sum_{i=1}^N \lambda_i \int_0^{+\infty} yp_y^i(y)dy = \sum_{i=1}^N \lambda_iG_i(0) = F_j(0))  \\
\end{equation}

Brigo and Mercurio (2006) show that this is essentially the reverse of finding  the marginal density function of the solution of an SDE when teh coefficients are known. $\sigma(t,F_j(t)) can be found by solving the Fokker-Planck equation.

\begin{equation}
\frac{\partial}{\partial t}p_t(y) = \frac{1}{2}\frac{\partial^2}{\partial y^2}(\sigma^2(t,y)y^2p_t(y))
\end{equation}
 
as each density $p_t^i(y)$ satisfies the Fokker-Planck equation

\begin{equation}
\frac{\partial}{\partial t^i}p_t(y) = \frac{1}{2}\frac{\partial^2}{\partial y^2}(\upsilon^2(t,y)y^2p_t^i(y))
\end{equation}

Applying he definition (3.21) and the linearity of the derivative operator(3.23) can be written as

\begin{equation}
\sum_{i=1}^N\lambda_i\frac{\partial}{\partial t}p_t^i(y)= \sum_{i=1}^N\lambda_i[-\frac{\partial}{\partial y}(\muyp_t^i(y))]+\sum_{i=1}^N\lambda_i[\frac{1}{2}\frac{\partial^2}{\partial y^2}(\sigma^2(t,y)y^2p_t^i(y))}]
\end{equation}

which by subtituting from (3.24) becomes

\begin{equation}
\sum_{i=1}^N\lambda_i[\frac{1}{2}\frac{\partial^2}{\partial y^2}(\upsilon_i^2(t,y)p_t^i(y))}]= \sum_{i=1}^N\lambda_i[\frac{1}{2}\frac{\partial^2}{\partial y^2}(\sigma^2(t,y)y^2p_t^i(y))}]
\end{equation}

again using the linearity of the second order derivative operator

\begin{equation}
\frac{\partial^2}{\partial y^2}[\sum_{i=1}^N \lambda_i \upsilon_i^2(t,y)p_t^i(y))}]=\frac{\partial^2}{\partial y^2}[\sigma^2(t,y)y^2\sum_{i=1}^N \lambda_ip_t^i(y))}]
\end{equation}

The general solution of this second order differential equation is found as

\begin{equation}
\sigma^2(t,y)y^2\sum_{i=1}^N\lambda_ip_t^i(y) = \sum_{i=1}^N\lambda_i\upsilon_i^2(t,y)p_t^i(y)+A_tY_B_t
\end{equation}

Where A and B are suitable function of time. The regulatoy conditions as shown for £\sigma$ in (3.19) imply that the left hand side of the equation has zero  limit for $y \rightarrow \infty$. As a consequence, the RHS must have a zero limit as well. This is only true if $A_t=B_t=0 \text{ } \forall t$. Therefore the $\sigma$ consistent with the marginal density (3.21) and the regularity contraints for $(t,y)>(0,0)$

\begin{equation}
\sigma(t,y) =\sqrt{\frac{\sum_{i=1}^N\lambda_i\upsilon_i^2(t,y)p_t^i(y)}{\sum_{i=1}^N\lambda_iy^2p_t^i(y)}} \\
\end{equation}

setting

\begin{equation}
\Lambda_i(t,y) :=\frac{\lambda_i p_t^i(y)}{\sum_{i=1}^N\lambda_iy^2p_t^i(y)} \\
\end{equation}

for each i=1,...,N and $(t,y)>(0,0)$, we can write

\begin{equation}
\sigma(t,y) = \sum_{n=1}^N \Lambda_i(t,y) \frac{\upsilon_i^2(t,y) }{y^2} \\
\end{equation}

so that the square of the volatility $\sigma$ can be written as a stochastic convex combination of the squared volatilities of basic processes (3.20). For each (t,y), $\Lambda_i(t,y) \geq 0$ for each i and $\sum_{i=1}^N \Lambda_i(t,y) = 1$. Moreover, by (3.19) and setting $L := max_{i=1,...,N}L_i$ the consition (3.19) is fufilled since

\begin{equation}
\sigma^2(t,y)y^2 = \sum_{i=1}^N \Lambda_i(t,y)\upsilon_i^2(t,y) \leq \sum_{i=1}^N \Lambda_i(t,y)L_i(i+y^2) \leq L(1 +y^2) \\
\end{equation}

Forula (3.29) leads to the following SDE of the forward rate under the measure $Q^j$

\begin{equation}
dF_j(t) = \sqrt{\frac{\sum_{i=1}^N \lambda_i\upsilon_i^2(t,F_j(t))p_t^i(F_j(t))}{\sum_{i=1}^N \lambda_iF_j(t)^2p_t^i(F_j(t))}}F_j(t)dW_t \\
\end{equation}
 
This SDE simply defines some candidate dynamics that leads to the marginal density (3.21), If $\sigma$ is bounded, then the SDE is well defined, but the conditions imposed are by themselves not enough to guarantee this, so verification must be done on a case by case basis.

If we assume that the SDE has a strong solution the caplet prices associated with the base model (3.20) can be shown to be

\begin{center}
\begin{align}
\begin{split}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_k,K) &= \tau_jP(0,T_j)E^j{[F_j(T)-K)]^+} \\
 &= \tau_jP(0,T_j)\sum_{i=1}^N \lambda_i \int_0^{+\infty} [y-K]^+ p_{T_j}^i(y)dy \\
 &= \sum_{i=1}^N \lambda_i \taxtbf{Cpl}^i(0,T_j-1,T_j,\tau_j,K) \\
\end{split}
\end{align}
\end{center}

In this set up $F_j$ can be viewed as a process at time t coincides withthe basic sensity $p_t^i$ with probability $\lambda_i$. As noted by Brigo and Mercurio (2006) when proposing altenative dymanics it can be difficult to come up with analytical formula for caplets. Here such a problem can be avaoided by design as analytically tractable densities have been used $p^i$. Moreover given that N is unbounded a virtually unlimited number of patamters can be introduced allowing for an arbitrarily good fit to the market. In practice one has to trade off such flexibility againt over parameterising the model.


\subsubsection{Lognormal Mixture Model}

This is an applciation of the general dynamics (3.33).

\begin{equation}
\upsilon_i(t,y) = \sigma_i(t)y
\end{equation}

where all the $\sigma_i$'s are determinsitc and continuous functions of time that are bounded above and below by positive constants.
  
\begin{align}
\begin{split}
p_t^i(y) &= \frac{1}{y V_i(t)\sqrt{2 \pi}} \exp^\left{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}+ \frac{1}{2}V_i^2(t)]^2}\right} \\
V_i(t) &:= \sqrt{\int_0^t \sigma_i^2(u)du} \\
\end{split}
\end{align}


Then as shown by Brigo and Mercurio (2006)

\begin{equation}
\upsilon(t,y) &:= \sqrt{\frac{\sum_{i=1}^N\lambda_i \sigma_i^2(t) \frac{1}{V_i(t)} \exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}+\frac{1}{2}V_i^2(t)}]^2 \}}}{\sum_{i=1}^N\lambda_i \frac{1}{V_i(t)} \exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}+\frac{1}{2}V_i^2(t)}]^2 \}}}} \\
\end{equation}

for $(t,y) > (0,0)$ and $\upsilon(t,y) = \sigma_0$ for  for $(t,y) = (0,F_j(0))$, the SDE


\begin{equation}
dF_j(t)=\upsilon(t,F_j(t))F_j(t)dW_t \\
\end{equation}


The marginal density of $G_i(t)$, for each time t, is then lognormal and given by (3.21) of lognormals (3.36). This equation above is the analytical epxpresion for the diffusion coefficient in the SDE (3.18) such that the resulting equation has a unique string solution whose marginal density is given by (3.21)

The quare of the local volatility(t,y) can be viewed as a weighted average of teh badic volatilities $\sigma_1^2(t),..., \sigma_N^2(t)$, whre the weights are all functions of the lognormal marginal densities. That is for each i,....,N and $(t,y)>(0,0)$

\begin{align}
	\begin{split}
\upsilon^2(t,y) &= \sum_{i=1}^N \Lambda_i(t,y) \sigma_i^2(t)\\
\Lambda_i(t,y) &:= \frac{\lambda_ip_t^i(y)}{\sum_{i=1}^N\lambda_ip_y^i(y)} \\
	\end{split}
\end{align}

As a consequence for each $t>0$ and $y>0$ the function $\upsilon$ is bounded from above and below by stoctly positive constants.

At $t=0$ the pricing a caplet can be under the forward rate dynamics (3.38) can be shown to be

\begin{align}
	\begin{split}
P(0,T_j)E^j \{ [F_j(T_{j-1})-K]^+ \} &= P(0,T_j) \int_0^{+ \infty} (y-K)^+p_{T_{j-1}}(y)dy \\
 &= P(0,T_j) \sum_{i=1}^N \lambda_i(y-K)^+ p_{T_{j-1}}^i(y)dy \\
	\end{split}
\end{align}

so that the caplet price is given by

\begin{equation}
\textbf{Cpl}(0,T_[j-1},T_j,\tau_j,\tilde{N},K)=\tau_j\tilde{N}P(0,T_j)\sum_{i=1}^N\lambda_i \text{Bl}(K,F_j(0),V_i(T_j-1))
\end{equation


An exmaple of the smile these dynamics can produce is shown in fig (3.3). It can produce smiles with the minimum volatility being the ATM. It can also produce skew type shapes, but with zero slope at the ATM level.

\begin{figure}[h]
\centering
\includegraphics[scale=0.06]{LM.eps}
\caption{Caplet Volatility Structure $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.3}
\end{figure}

\subsubsection{LMM dynamics under Lognormal Mixture Model}

As shown in Brigo 2006 thw LMM dynamics for the Lognormal mixture model are

\begin{equation}
  dF_k(t) =
  \begin{cases}
    \upsilon_j(t,F_j(t))F_j(t) \displaystyle\sum_{k=i+1}^j \frac{\rho_{j,k}\tau_k \upsilon_k(t,F_k(t))F_k(t)}
      {1+\tau_kF_k(t)}dt +\upsilon_j(t,F_j(t))F_j(t)dW_j^i(t) &\text{if }i<j, t \leq T_i.\\
    \upsilon_j(t,F_j(t))F_j(t)dW_j^i(t) &\text{if }i=j, t \leq T_i .\\
    -\upsilon_j(t,F_j(t))F_j(t) \displaystyle\sum_{k=j+1}^i \frac{\rho_{j,k}\tau_k \upsilon_k(t,F_k(t))D_k(t)}
      {1+\tau_kF_k(t)}dt +\upsilon_j(t,F_j(t))F_j(t)dW_j^i(t) &\text{if }i > j, t \leq T_i.
  \end{cases}
\end{equation}

where $V_i,j(t) $ and $\upsilon(t,y)$ are as defined in euqations 3.36 and 3.37 respectively.

The same equations are valid for other model in this section you simply substitute the appropriate expression for $\upsilon$.

\subsubsection{Shifting the Lognormal Dynamics}

Brigo and Mercurio (2000b) proposed to entend the lognormal misture model by combining it with the displced diffusion technique. Where the forward process is given by

\begin{equation}
F_j(t)= \alpha + \overline{F}_j(t)
\end{equation}

where $\alpha$ is a real constant and $\overline{F}_j$ evolves according to the LM dynamics given in (3.37). Brigo and Mercurio state that this is the genertal affine transformation for which the forward-rate process is still a martingale under its canonical measure.

The analytical expression for the marginal density of the process is given by shifted mixture of lognormals

has a unique strong solution whose marginal density is given by the mixture  of lognormals


\begin{equation}
p_t(y)= \sum_{i=1}^N \lambda_i \frac{1}{(y-\alpha)V_i(t)\sqrt{2 \pi}}\exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y-\alpha}{F_j(0)- \alpha}+\frac{1}{2}V_i^2(t)}]^2\}}
\end{equation}


with $y > \alpha$

From Ito's lemma we obtain the forward rate process evolves according to

\begin{equation}
dF_j(t)= \upsilon(t,F_j(t)- \alpha)(F_j(t)- \alpha)dW_t
\end{equation}

The model for the forward rate process preserves the analytical tractability of the original process $F_j$

\begin{equation}
	P(0,T_j)E^j\{[F_j(T_{j-1})-K]^+\} = P(0,T_j)E^j\{[\overline{F}(T_{j-1})-(K-\alpha)]^+\}
\end{equation}

so that for $ \alpha < K$  the caplet price is given by

\begin{equation}
	\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,\overline{N},K)=\tau_j\overline{N}P(t,T_j)\sum_{i=1}^N \lambda_iBl(k-\alpha,F_j(0)-\alpha,V_i(T_{j-1}))
\end{equation}

Moreover Brigo and Mercurio show that the implied volatiltiy is given by

\begin{equation}
\hat{\sigma}(0)=\frac{2}{\sqrt{T_{j-1}}}\Phi^{-1}(\frac{F_j(0)-\alpha}{F_j(0)}\sum_{i=1}^^N \lambda_i \Phi(\frac{1}{2}V_i(T_{j-1})+\frac{\alpha}{2F_j(0)}
\end{equation}


For $\alpha=0$ $F_j$ and $\overline{F}_j$ are the same. Decreasing $\alpha$ the variance of each forward rate increases while meiantianin the same expectation. $\alpha$ affects the shape of the implied volatiltiy curve in two ways, firstit causes a parallel shift in the curve. Second it moves the position of the minimum volatiltiy. Specifically if $\alpha>0$ the minium is reached for strikes lower than the ATM.




\subsubsection{Lognormal-Mixture with Different Means}

Following Brigo, Mercurio and Sartolli(2003) here the $p_t^i$'s are still lognormal but their means are now assumed to be different. Specifically $G_i$ is assumed to evolve user $Q^j$ according to

\begin{equation}
dG_i(t)=\mu_i(t)G_i(t)dt+\sigma_i(t)G_i(t)dW_t \text{ i=1,...,N } G_i(0)=F_j(0)
\end{equation}

where aginan the $\sigma_i$'s satisfy the same conditionas as in the Lognormal mixture model and the $\mu_i"$'s are deterministic functions of time. The density of $G_i$ at time t is given by

\begin{align}
	\begin{split}
p_t^i(y) &= \frac{1}{yV_i(t)\sqrt{2\pi}}\exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}-M_i(t) + \frac{1}{2}V_i^2(t)}]^2\}} \\
M_i(t) &:= \int_0^t \mu_i(u) du \\
	\end{split}
\end{align}

Where $V_i$ is defined as before. The functions $\mu_i$ must be chosen such that

\begin{equation}
\sum_{i=1}^N \lambda_i \exp^{M_i(t)}=1 \text{  } \forall{t} > 0
\end{equation}

This is to ensure $p_t(y)=\sum_{i=1}^N \lambda_ip_t^i(y)$ must have a constant mean equal to $F_j(0)$ 


As before we look for a diffusion coefficient such that $\psi(.,.)$ such that the SDE

\begin{equation}
dF_j(t) = \psi(t,F_j(t))F_j(t)dW_t
\end{equation}
  
has a solution with marginal density $p_t(y) = \sum_{i=1}^N \lambda_ip_t^i(y)$. Applying again the Fokker-Planck equation to processes $F_j$ and $G_i$'s

\begin{align}
	\begin{split}
\psi(t,y)^2 &:= \upsilon(t,y)^2 + \frac{2 \sum_{i=1}^N \lambda_u \mu_i(t) \int_y^{+\infty} x p_t^i(x)dx}{y^2\sum_{i=1}^N\lambda_ip_t^i(y)} \\ 
&= \upsilon(t,y)^2 + \frac{2F_j(0)\sum_{i=1}^N \lambda_i \mu_i(t) \exp^{M_i(t)}\Phi(\frac{\ln{\frac{F_j(0)}{y}+M_i(t)+\frac{1}{2}V_i^2(t)}}{V_i(t)})}{y^2\sum_{i=1}^N\lambda_ip_t^i(y)} \\
	\end{split}
\end{align}

with $\upsilon$ defined as

\begin{equation}
\upsilon(t,y)^2 = \frac{\sum_[i=1}^N\lambda_i\sigma_i(t)^2p_t^i(y)}{\sum_{i=1}^M\lambda_ip_t^i(y)}
\end{equation}
 

Brigo,Mercurio and Sartorelli (2003) go on to show that the price of a caplet is given by

\begin{equation}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_j,K) = \tau_j P(0,T_j) \sum_{i=1}^N \lambda_i \exp^{M_i(T_{j-1})} Bl(K \exp^{-M_i(T_{j-1}},F_j(0),V_i(T_{j-1}))
\end{equation}


The non zero drifts in the $G_i$-dynamics allows steeper and more skewed smiles to be produced than in the zero drift case, with minimums that can be shifted further from the ATM strike. 

 
\subsubsection{Hyperbolic-Sine Processes}

Follwoing again Brigo and Mercurio (2006) it is now assumed that the processes $G_i$ evolve under $Q^j$ acording to a hyperbolic-sine process

\begin{equation}
G_i(t) = \beta_i(t)sinh[\int_0^t \alpha_i(u)dW_u-L_i] \text{  i=1,...,N,  } G_i(0)=F_j(0) 
\end{equation}
 
where the $\alpha$'s are positive and determinisitc functions of time, L_i's are negative constants, and $\beta_i$'s are chosen so as to render the $G_i$'s martingales

\begin{equation}
\beta_i(t) = \frac{F_j(0)\exp^{-\frac{1}{2}A_i^2(t)}}{\sinh{(-\alpha_iLi)}} 
\end{equation}

where we set $A_i(t) := \sqrt{\int_0^t \alpha_i^2(u)du}

Each $G_i$ is defined as increasing function of a time changed Brownian motion

\begin{equation}
dG_i(t)=\alpha_i(t)\sqrt{\beta_i^2(t) +G_i^2(t)}dW_t \text{ i=1,...,N } 
\end{equation}

From visual inspection of the SDE's diffusion coefficient we immediately notice that it is roughly deterministic for small values of $G_i(t)$, whereas it is roughly proportional to $G_i(t)$ for large values og $G_i(t)$. Therefore, in the former case, teh dynamics are approximately of the gaussian type, whereas in the latter they are approximately lognormal.

The cumulative distribution function of the process $G_i$ at each time t is easily defined as follows

\begin{align}
 \begin{split}
Q^j \{G_i(t) \leq y \} &= Q^j \{ \int_0^t \alpha_i(u)dW_u \leq L_i + \sinh^{-1}(\frac{y}{\beta_i(t)})\} \\
&= \phi(\frac{L_i}{A_i(t)}+\frac{1}{A_i(t)} \sinh^{-1}(\frac{y}{\beta_i(t)})) 
 \end{split}
\end{align}

so that the time-t marginal density of $G_i$ is

\begin{equation}
p_t^i(y) = \frac{exp^{\{-\frac{1}{2A_i^2(t)}[L-i + \sinh^{-1}(\frac{y}{\beta_i(t)})]^2 \}}}{A_i(t)\sqrt{2\pi}\sqrt{\beta_i^2(t) + y^2}}
\end{equation}

Further they show that the associated caplet price is

\begin{equation}
\begin{split}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_j,K)=\tau_j P(0,T-j)[\frac{F_j(0)}{2\sinh(-L_i)}(exp^{-L_i}   \Phi(\overline{y}_i(T_{j-1}) + \\ A_i(T_{j-1})-\exp^{L_i}\Phi(\overline(y)_i(T_{j-1}-A_i(T{j-1})))-K\phi(\overline(y)_i(T_{j-1}))]
\end{split}
\end{equation}
 
\subsubsection{Empirical Comparison of Models}
 
Brigo and Mercurio (2006) found that the Lognormal mixture model (LM) was usually good enough to capture smile shaped curves. Some example fits are shown in Figs 3.5 and 3.6 However in the presense of clear asymmetries, one has to allow for different means in the lognormal densities, particularly to produce a non zero slope at the ATM point. For skew shaped curves the Hyperbolic-Sine curve was found to perfom best. However even in this case steep skews for low strikes nad short maturities can still be difficult to accomodate. 
They also report that the LM model provides a much better fir than the CEV and shifted lognormal models.

Whilst a good fit to the current data what is perhaps evenmore important is future implied volatilities produced by the model. In practice all exotics traders have to vega hedge and so having a model that can predict realistic future evolutions of the implied volatlilty surface are crucial to reduce hedging errors. in this respect the CEV and shifted lognormal models are preferable as they allow the analytical expression of thr transition density of the forward rate process, while the lognormal mixture model does not. The next set of local volatility models try to retain the superior fit of the lognormal misture model while keeping them analytically tractable.

\begin{figure}[h]
\centering
\includegraphics[scale=1]{ImpVol1.eps}
\caption{Implied Volatility Curves for the 4 year maturity $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.4}
\end{figure}

\begin{figure}[h]
\centering
\includegraphics[scale=1]{ImpVol2.eps}
\caption{Implied Volatility Curves for the 4 year maturity $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.5}
\end{figure}


\subsubsection{Analytically Tractable Local Volatility Models}

Brigo and Mercurio (2003) introduce a new class odf models that seeks to retain a good fits to cap data whilt still giving the highest degree of analytical tractability. 

Assume that $F_j$ can be expressed by the following (time-dependent) transformationof the Brownian motion W

\begin{equation}
F_t(j) = h(t,W_t) \text{ for each } 0 \leq t \leq T_{j-1}
\end{equation}

where the h function satisfies

\begin{enumerate}
	\item h belongs to $C^{1,2}(D_j)$, with $D_j :=[0,T_{j-1}]*\mathbb{R}$
	\item $h(t,\omega)>0$ for each (t,\omega) $\in D_j$
	\item for each $t>0$ the function $h_t : \mathbb{R} \rightarrow \mathbb{R}^+. \oemga \mapsto h_t(w) := h(t,w)$ has zero limit at minus infinity, $lim_{w \rightarrow -\infty}ht(w)=0,$ and is stictly increasing i.e. $dh_t(\omega)/d\omega > 0$ , so that for each $t>0$ the function $h_t$ is invertible and the inverse function $h_t^{-1}$ is differentiable 
	\item $E^j\{h(T_{j-1},W_{T_{j-1}}\}$ exists is finite and $E^j\{h(T_{j-1},W_{T_{j-1}}\|F_t}=h(t,W_t)$ for each $0 \leq t \leq T_{j-1}$ so that $F_j$ is a martingale 
\end{enumerate}


A simple example of such a process is $h(t,\omega) =a \exp^{(-b^2\frac{t}{2}+b\omega)}$ with $a,b > 0$ which leads to a  geometric Brownian motion.

The follwoing section presesnts the general framework which preserves the analytical tractability of the Brownian motion, but includes models that can produce realistic implied volatility smiles.

The SDE followed by the forward rate $F_j$ can be shown via Ito's Lemma to be

\begin{align}
 \begin{split}
dF_j(t) &= [\frac{\partial h}{\partial t}(t,W_t) + \frac{1}{2}\frac{\partial^2h}{\partial \omega^2}(t,W_t)]dt+\frac{\partial h}{\patial \omega}(t,W_t)dW_t \\
			  &= \frac{\partial h}{\partial \omega}(t,h_t^{-1}(F_j(t)))dW_t \\
			  &= \sigma(t,F_j(t))F_j(t)dW_t
 \end{split}
\end{align}

where the definition of the local volatility term had been ustilised, and where the drift due to the last assumption on h and the Feynman-Kac theorem. The process $F_j$ id therefore a one-dimensional diffusion.

The marginal density can be shown to be

\begin{equation}
p_t(t) = \frac{d}{dx}Q^j \{F_j(t) \leq x \} = \frac{1}{\sqrt{2 \pi t}} \exp^{1\frac{1}{2t}[h_t^{-1}(x)]^2} \frac{d}{dx}h_t^{-1}(x)
\end{equation}

The forward rate $F_j(T)$ conditional in $F_j(t)$ can be written as


\begin{equation}
F_j(T) = h(T,h_t^{-1}(F_j(t)) + W_t-W_t)
\end{equation}

Tha transitional density denoted $p(t,y;T,x)$ i.e. the density of $F_j(T)$ conditional on $F_j(t)=y$ can be shown to be


\begin{align}
 \begin{split}
p(t,y;T,x)  &= \frac{d}{dx}Q^j \{F_j(t) \leq x |F_j9t)=y \}
						&= \frac{1}{\sqrt{2\pi(T-t)}} \exp^{-\frac{1}{2(T-t)[h_t^{-1}(x)-h_t^{-1}]^2\frac{d}{dx}h_t^{-1}(x)}
 \end{splt}
\end{align}
 

For each $f<T_j-1$, the function $h_t$ can be written in terms of $h_{T_{j-1}}^{-1}$ as


\begin{equation}
h_t(\omega) = \int_{-\infty}^{+\infty} \Phi(\frac{\omega-h_{T_{j-1}}^{-1}(z)}{\sqrt{T_{j-1}-t}})dz
\end{equation}

For each $t<T_{j-1}$ the derivative of function $h_t$ can be shown to be


\begin{equation}
\frac{d}{d \omega} h_t(\omega) = \int_0^{+\infty} \frac{\exp^{-\frac{1}{2(T{j-1}-t}[\omega-h_{T_{j-1}}(z)]^2}}{\sqrt{2 \pi (T_{j-1}-t}}dz
\end{equation}

Finally the caplet price in the framework can be shown to be


\begin{equation}

\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,K) = 
\tau_j P(t,T_j) \int_{h_T_{j-1}(K)-h_t^{-1}(F_j(t))}^{+\infty}} \frac{h_t^{-1}(F_j(t))+\omega}{\sqrt{2 \pi(T_{j-1}-t)}} \exp^{-\frac{\omega^2}{2(T_{j-1}-t}}d \omega -K\tau_jP(t,T_j)\Phi(\frac{h_t^{-1}(F_j(t))-h_{t{j-1}}^{-1}(K)}{\sqrt{T_{j-1}-t}})

\end{equation}


where in general the final intergal must be evaluated numerically. The following sections show examples where this integral can be obtained explicitly.

\subsubsection{Mixture of GBM's}

A particular case of the dynamics in (3.41) described in Brigo and Mercurio (2003) consists of a linear combination of N driftless geometric Brownian motions that are prefectly instantaneously correlated

\begin{align}
 \begin{split}
	F_j(t) &= h(t,W_t) \text{  for each  } t \geq 0 \\
	h(t,\omega) &= h_t(\omega) = \sum_{i=1}^N \psi_i \exp^{- \frac{1}{2} \beta_i^2 t + \beta_i \omega}  
 \end{split}
\end{align}
 
where $F_j(0)$,$B_i$'s and $\psi$'s are postive constants

This function satisfies the four assumptions and the derivative of its inverse is $h_t^{-1}$ 

\begin{equation}
\frac{d}{dx}h_t^{-1}(x) = \frac{1}{\frac{d}{d \omega}h_t(h_t^{-1}(x))} = \frac{1}{ \sum_{i=1}^N \psi_i \beta_i \exp^{-\frac{1}{2}\beta_i^2t+\beta_ih_t^{-1}(x)}}
\end{equation}




**check out Pitarberg's critique of mixing models
Imply from the market what you can hedge and esimate econometrically what you can not robonato et al 2009.

COVER CAP AND SWAPTION PRICING GIVING EXAMPLES OF BOTH. EXAPLIN NOT UNIQUE EVEN IN DESCRETE VOL CASE. lINK TO MARKET PRICE OF RISK WICH CAN CHANGE AND CANNOT BE LOCKED IN.


kahneman and Tversky 2000 are utility models useless?
chap 23  real life dynamics rebonato 2004
add about tacolomy of LMM very brief history of term structure models? 
What is the smile see Rebonato 2002.
smile models Rebonato 2004.
LFM and SLM incompatible but are difference s material rebonato2002 no Brigo?
Brigo 204  nice example of why correaltion are material to swaption prices
How many factors for good correlation cover

\bibliographystyle{plainnat}
\bibliography{volsmile}
\end{document}

Recommended reading 2024:

LaTeXguide.org • LaTeX-Cookbook.net • TikZ.org

NEW: TikZ book now 40% off at Amazon.com for a short time.

kaiserkarl13
Posts: 707
Joined: Tue Mar 25, 2008 5:02 pm

All new equations generated with the same equation number

Post by kaiserkarl13 »

The code you posted contains numerous errors (missing dollar signs, misplaced parentheses and braces). In short, this is not a working example.

I tried processing your markup and just ignoring all the errors. The resulting PDF does indeed stop numbering the equations after 3.41, but that's the least of your problems. I'd guess that there are at least a dozen errors up to that point in the document (which is why LaTeX generated a messed up PDF). Garbage in, garbage out.

Try reprocessing your markup without ignoring the errors. Fix said errors, and see if you still have a problem.
User avatar
gmedina
Posts: 2313
Joined: Wed Jul 11, 2007 11:45 pm

Re: All new equations generated with the same equation number

Post by gmedina »

Hi,

your code contains lots of errors. Some of them:

1) In lines 99 and 290 you wrote \rhp instead of \rho
2) In line 219 you wrote \leqmin instead of \leq\min
3) in line 258 you have two ampersand characters that must be deleted.
4) In line 264 you are incorrectly nesting an flalign environment inside an equation environment.
5) In line 299 there's a $ missing.

And there, I lost my patience
1,1,2,3,5,8,13,21,34,55,89,144,233,...
bazman
Posts: 78
Joined: Mon Jan 26, 2009 3:24 am

All new equations generated with the same equation number

Post by bazman »

OK, I had hitherto blithely ignored the error messages so long as everything seemed to work. However I have now gone through and reduced the count by roughly half. Of the rest most seem to be producing multiple errors from the same piece of code.

I don't seem to be using align properly almost every time this produces errors but it prints out in pdf OK, and having looked at the documentation I can't see anything obviously wrong?

>In line 264 you are incorrectly nesting an flalign environment >inside an equation environment.

I have left this for now simply because with it the matrix looks the way I want it to. If there is better way to do it please let me know.

also the orininal problem of incorrect equation numbering remains.

Code: Select all

\documentclass{article} % Your input file must contain these two lines
\usepackage{natbib}
\usepackage{amsmath}
\usepackage{amsfonts}
\usepackage{bm}
\usepackage{grffile}
\usepackage{graphicx}
\usepackage[%
  font=small,
  labelfont=bf,
  figurewithin=section,
  tablewithin=section,
  tableposition=top
]{caption}
\numberwithin{equation}{section}

\makeatletter
  \def\env@matrix{\hskip -\arraycolsep
  \let\@ifnextchar\new@ifnextchar
  \array{*\c@MaxMatrixCols l}}
\makeatother

\begin{document} % plus the \end{document} command at the end
\parskip = 1pc %change spacing between paragraphs
\parindent = 0pc %change paragraph indentation
\section{Introduction}

\section{LMM}

\subsection{Introduction} % This command makes a section title.
The Black model was already well established in the interest rate market. This allowed trader to price caps and swaptions individually i.e. in their own specific measure. However there was no framework to price caps swaptions or any other LIBOR product of a different maturity (and so measure) consistently. The seminal work of Heath, Jarrow and Morton (1992) whose great insight was that the no arbitrage of the state variables (e.g.) forward rates could be expressed as a function of the volatilities of and correlations between the state variables themselves. The HJM was originally cast is terms of instantenous forward rates, which don't actually trade in the market, also a point raised in the HJM paper was that in the continuous time limit for truely instantaneous and log-normal forward rates, their process explodes with positive probability. This lead early implementation of the approach that tried to steer clear of log normality: this was inconsistent with the already widely accepted Black approach and ultimately proved to be a dead end.

A new approach besed on HJM and first described in the papers by Brace et al(1996), Jamshidian(1997) and Musiela abd Rutkowski (1997) appeared in the mid 90's. 

\begin{itemize}
	\item Recast the yield curve in terms of market observable discrete sets of foreard rates
	\item The no arbitrage drifts for the forwards were translated from the continuous time HJM setting tothe new discrete setting.
	\item a numeraire had to be chosen (in early attempts this a discretely compounded money market account was invented, but forward and swap measures soon followed.
	\item the log normal distribution assumtption for forward rates was introduced
\end{itemize}
  
The LMM (like its HJM forebearer) is not a model per se  but rather a set of no arbitrage conditions among forward rates. There are various incarnations of the model. It can be formulated in terms of forward rates, swap rates, they can be normally, lognomally or otherwise distributed, the associated numeraire could be a zero coupon bond, a swap annuity or the money market account. These choice along with the specific forms of the instantaneous volatility and correaltion functions used fully specuify the model. The simultaneous specification of these time-dependent volatilities and correlations became \textit{the} problem in the specification of the LMM model. 
 

When using the forwards as the state variabels and log normality is assumes, caplet vols can be recovered exactly and in a very strightforward maner. Indeed the Lognormal forward market model prices caps using blacks formula so calibration is very strightforward. This is one of the reasons the model has become so popular it uses the market prices of underlying instruments as building blocks, hence the name the `market` model.
 
It should be noted that the Libor Forward Model and Libor Swap Model are theoretically inconsistent but this is a not a concern in practive as excellent approxiamtions exist see Brigo (2007) and Rebonato (2002).

As we will see the LMM lends itself well to monte carlo simulation and the pricing of path dependent options in multi-factor framework. However because the forward process implied by the LMM is in general non-Markovian it does not lend it self easily to recombining lattice type pricing techniques, consequently dealing with early exercise features can be problematic. However Carr and Yang 1997 provide an example of how to approximate a Bermuden price using a markov chain, as does Andersen (1999) (who approxiamtes the early exercise boundary as an function of intrinsic value and `sill alive' nested European swaptions. Further a general method for combining backward induction with Monte Carlo simulation has been proposed by Longstaff and Schwartz (2001). 


\subsection{Model Set-Up}

Following the presentation in Rebonato et al 2009 \nocite{*}.

A discrete set of default-free discount bonds are assumed to trade in the economy $P_t^i$. Spanning forward rates are denoted by.



\begin{equation}
f(t,T_i,T_{i+1}) = f_i^t \qquad i=1,2,...,N \\
\end{equation} 

The instantaneous volatilities of the forward reates are denoted:

\begin{equation}
\sigma(t,T_i)=\sigma_t^i \qquad i=1,2,...,N \\
\end{equation} 

The instantaneous volatility between forward rate i and j is denoted:

\begin{equation}
\rho(t,T_i,T_j)=\rho_{i,j}^t \qquad i,j=1,2,...,N \\
\end{equation} 

We choose as numeraire the discount bond $P_t^i=P(t,T_i)$

The forward rate can be expressed in terms of discount bonds as follows:

\begin{equation}
f_t^i= (\frac{P_t^i}{P_t^{i+1}}-1)\frac{1}{\tau_i}  \\
\end{equation} 

The description of the discrete yield curve is them completed by providing the value of the spot rate, i.e., the rate for lending/borrowing from spot time to $T_1$, given by:

\begin{equation}
r_0= (\frac{1}{P_0^{1}}-1)\frac{1}{\tau_1}  \\
\end{equation} 

Thus in the determinisitic-volatiltiy LMM the evolution of the forward rates is given by:

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma^i(t,T_i)dz_t^i \\
\end{equation} 

with

\begin{equation}
\mathbb{E}[dz_t^idz_t^j] = \rho(t,T_i,T_j)dt  \\
\end{equation} 


Where $\textbf{f}_{t}$ is a vector of spanning forward rates, $\bm{\sigma}_t$ is the vector of associated volatilities, and $\bm{\rho}$ the correlation matrix


The following result is taken from Brigo et al 2006, under the lognormal assumption, we obtain the dynamics of $f_k$ under the forward adjusted measure $Q^i$ in the three cases $i<k$, i=k and $i>k$. 

\begin{equation}
  df_k(t) =
  \begin{cases}
    \sigma_k(t)f_k(t) \displaystyle\sum_{j=i+1}^k \frac{\rho_{k,j}\tau_j \sigma_j(t)f_j(t)}
      {1+\tau_jf_j(t)}dt +\sigma_k(t)f_k(f)dZ_k(t) &\text{if }i<k, t \leq T_i.\\
    \sigma_k(t)f_k(f)dZ_k(t) &\text{if }i=k, t \leq T_i .\\
    -\sigma_k(t)f_k(t) \displaystyle\sum_{j=k+1}^i \frac{\rho_{k,j}\tau_j \sigma_j(t)f_j(t)}
      {1+\tau_jf_j(t)}dt +\sigma_k(t)f_k(f)dZ_k(t) &\text{if }i > k, t \leq T_i.
  \end{cases}
\end{equation}


where $Z=Z^i$ is brownian motion under $Q^i$. All of the equations above asmit a unique strong solution if the coefficients $\sigma(.)$ are bounded.


Going back to equaiton (2.6) it can be re-written as:

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sum_{k=1}^{m} \sigma_{ik}dz_k \\
\end{equation} 

where we assume that we are dealing with m $(m \leq	 N)$ factors and that the Brownian motions are independent. The quantities $ \sigma_{ik} $ can be interpreted as the loading of the ith forard rate on to the kth factor. Hence:


\begin{equation}
\sigma_i(t)= \sqrt{\sum_{k=1}^m \sigma_{ik}^2(t)} \\
\end{equation} 

If the function has been chosen such that

\begin{equation}
\int_0^{T_i} \sigma_i(t)^2 dt = \hat{\sigma}_i^2 T_i \\
\end{equation} 

holds true, then the market caplets will be correctly priced. $ \hat{\sigma}$ represents the Black impled volatility.

If each loading $\sigma_{ik}$ is now multiplied and divided by the volatility $\sigma_i$ of the ithe forward rate:

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma_{i} \sum_{k=1}^{m} \frac{\sigma_{ik}}{\sigma_i}dz_k \\
\end{equation} 

Using (2.10) this can be re-written as


\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma_{i} \sum_{k=1}^{m} \frac{\sigma_{ik}}{\sqrt{\sum_{k'=1}^{m}\sigma_{ik'}^2}}dz_k \\
\end{equation} 

Defining $b_{ik}$ as

\begin{equation}
b_{ik} \equiv \frac{\sigma_{ik}}{\sqrt{\sum_{k'=1}^m \sigma_{ik'}^2}} \\
\end{equation} 

thus (2.13) can be expressed as

\begin{equation}
\frac{df_t^i}{f_t^i}= \mu^i(\{\textbf{f}_{t}\},\{ \bm{\sigma}_t\}, \bm{\rho},t)dt + \sigma_{i} \sum_{k=1}^{m} b_{ik} dz_k \\
\end{equation} 

where $ \bm{b} $ the $[N*m]$ matrix of elements $b_{jk}$. It can readily be shown that the correlation matrix is 


\begin{equation}
\bm{b}\bm{b^T}= \bm{\rho} \\
\end{equation} 

Equation (2.15) allows the stochastic part to be decomposed into a pure volatility component which can be easily calibrated to caplet prices and a correlation component $\bm{b}$ which can for instance be fitted to historical correlation matrices.

\subsection{Volatility Function}

As stated earlier a given implentation of the LMM depends laregely on the calibration of the volatlity and correlation functions.

There are various choices but again following Rebonato et al 2009. I show an example of a volatility function for the determinsitc case. This is both for paedological reasons and because this function is used and expanded by Rebonato et al when the apply the SABR model. This model has been shown to be a good match empirically in Rebonato (2002) and (2004). 

\begin{equation}
\sigma_t^T=[a+b(\tau)]\exp^{[-c(\tau)]}+d \\
\end{equation} 

\begin{itemize}
	\item As shown empirically in Rebonato (2002) and (2004) the humped shape shown is prevalent in `normal' trading environments, while the monotonically decreasing shape is prevalent in `excited' trading environments after major market dislocations
	\item It is time homogeneous which as shown in  Rebonato (2002) and (2004) is a very desirable property matching empirical behaviour of the instantaneous volatility curve
	\item It is square integrable
	\item It is very intuitive to use. a+d is the value of the instantaneous volatility  of the expiry reaches zero; d is the instantaneous vol for long maturities and the position of the hump (assuming there is one) is given by $\hat{\tau}=\frac{1}{c}- \frac{a}{b}$
	\item When fitted to ATM swaptions or caplets natural fits are obtained. Although some fine tuning is often necessary see below.
\end{itemize}

In general an extra paremeter is required to attain a prefect fit to caption prices. This parameter is maturity specific and so not time homogeneous. However it is constrained during the optimisation process so that it is as close as possible to one. Rebonato himself confeses to this being a fudge factor but it ends up playing an inportant role in the SABR version of the model.

\begin{equation}
k_{Ti}^2=\frac{\hat{\sigma_{T_i}^2T_i}}{\int_0^{T_i}([a+b(\tau)]\exp^{-c(\tau)]}+d)^2d\tau} \\
\end{equation} 


\begin{figure}[h]
\centering
\includegraphics[scale=0.5, angle=-1]{volcurve.eps}
\caption{Possible shapes of the volatility function in equation (2.17)}\label{fig:2.1}
\end{figure}






\subsection{Correlation Function}

\subsubsection{Simple Exponential Function}

\begin{equation}
\rho(t,T_i,T_j) = \exp^{[-\beta|T_i-T_j|]} \qquad t \leq \min(T_j,T_j)  \\
\end{equation} 

\begin{itemize}
	\item Assuming $\beta$ is positive the further apart two forwart rates are the more decorrelatied they will be which matches empirical observation
	\item For any positive $\beta$ the corresponding matrix $\rho$ will always be real symmentric matrix with positive eigenvalues
	\item However it gives the same decorrealtion between forwards at 1y and 1y 1m as ones at 10y and 10y 1m. This does not match empirical observation. However for instumetns like swaptions where it is only the net level of correlation that matters not the shape of the correaltion matrix this makes very little difference in practice. However for products like CMS spread options this would be a very serious flaw.  
\end{itemize}

A further advantage is that this simple function has no dependence on  time variable in:

\begin{equation}
C(i,j,k) = \int_{T_k}^{T_{k+1}} \sigma_u^i \sigma_u^j \rho(u,T_i,T_j) du  \\
\end{equation} 


and therefore one can write
\begin{equation}
 C(i,j,k) = \rho_{i,j} \int_{T_k}^{T_{k+1}} \sigma_u^i \sigma_u^j du   \\
\end{equation} 

 
which simplifies the computational burden


\subsubsection{Doust Correlation Function}

Obviously we would like to find a way to introduce a dependence of the correlation factor $\beta$  on the epries of the forward rates, $\beta=\beta(T_i,T_j)$ in such a way that the resulting correlation matric reamins positive definite. 

Assuming we takea 5 x 5 real symmetric matrix, where we retrict ourselves to only specifying (5-1) quantities, $a_1, a_2, a_3$ and $a_4$ keeping then between 1 and -1. Then Doust (2007) (as reported in Robonato et al 2009) showed that the resulting matrix is always positive defintie and so a possible correlation matrix.



\begin{equation}
\begin{bmatrix} 
1 & a_1 & a_1a_2 & a_1a_2a_3 & a_1a_2a_3a_4 \\
a_1 & 1 & a_2 & a_2a_3 & a_2a_3a_4  \\
a_1a_2 & a_2 & 1 & a_3 & a_3a_4 \\
a_1a_2a_3 & a_2a_3 & a_3 & 1 & a_4 \\
a_1a_2a_3a_4 & a_2a_3a_4 & a_3a_4 & a_4 & 1 \end{bmatrix} 
\end{equation}


The result admits a cholesky decomposition

\begin{equation}
\begin{flalign}
\begin{bmatrix} 
1 & 0 & 0 & 0 & 0 \\
a_1 & \sqrt{1-a_1^2} & 0 & 0 & 0  \\
a_1a_2 & a_2 \sqrt{1-a_1^2} & \sqrt{1-a_2^2} & 0 & 0 \\
a_1a_2a_3 & a_2a_3\sqrt{1-a_1^2} & a_3\sqrt{1-a_2^2} & \sqrt{1-a_3^2} & 0 \\
a_1a_2a_3a_4 & a_2a_3a_4\sqrt{1-a_1^2} & a_3a_4\sqrt{1-a_2^2} & a_4\sqrt{1-a_3^2} & \sqrt{1-a_4^2} \end{bmatrix} &&
\end{flalign}
\end{equation} 

The condition that ensures a valid correlation matrix is that all the lements on the main diagonal must be real. This will always be the case so long as

\begin{equation}
 -1 \leq a_i \leq 1 \qquad \text{for any i}   \\
\end{equation} 

Therefore tp ensure that you are contructing a valid correlation matrix one proceeds af follows. All the diagonal elements are set to

\begin{equation}
 \rho_{ii} = 1   \\
\end{equation} 

The elements in the first row are then defines as:

\begin{equation}
 \rho_{i,i} = \prod_{k=1}^{j-1} a_k = \rho_{j,1}, \qquad j =2,...,n   \\
\end{equation} 

Where assuming $i > j$ 

\begin{equation}
 \rho_{i,j} = \frac{\rho{1,i}}{\rho{j,1}} = \frac{\rho{1,i}}{\rho{1,j}}= \frac{\prod_{k=1}^{i-1}a_k}{\prod_{k=1}^{j-1}a_k}=\prod_{k=j}^{i-1}a_k   \\
\end{equation} 

The remaining elements may be found by symmetry $\rho_{ij} =\rho_{ji}$ 

Rebonato et al 2009 suggest that the $a_i$ be chosen in the following maner.

\begin{equation}
 a_k = \exp^{[-\beta_k \Delta T]}  \\
\end{equation} 

where $\Delta T$ is the spacing between the forward rates. Therefore

\begin{equation}
 \rho_{i,j} = \prod_{k=j}^{i-1} a_k = \prod_{k=j}^{i-1} \exp^{[-\beta_k \Delta T]} = \exp^{[-sum_{k=j}^{i-1}\beta_k \Delta T]}   \\
\end{equation} 

When $\beta_k = \beta_0$ for all k's the expression degenerates to the simple exponential equation shown earlier

\begin{equation}
 \rho_{i,j}^0 = \exp^{[-(i-j) \beta_0 \Delta T]} = \exp^{[-\beta_0|T_i - t_j|]}   \\
\end{equation}

If we set $\beta_k = g_0+g1/k+g2/k^2 + ... $ for positive $g_i$ then the $\beta_k$ are both decreasing in k and always positive. Which is in line with the emirical finding outlines earlier.

The Doust correlation function can be simply augmented to allow the long term correlation to level off at some level rather that decreasign montonely to zero


\begin{equation}
 \rho_{ij}(t) = \text{LongCorr} + (1-\text{LongCorr}) \hat{\rho_{ij}}(t)  \\
\end{equation}

where $ \hat{\rho_{ij}}(t) $ is a valis correaltion matrix produced in this instance by the Doust method.

In the Doust model the decorrelation does not depend just depend on teh distance between two forwards also on their maturities. This means that correlation can no longer be pulled out of the covariance integral

 
\begin{equation}
C(i,j,k) = \int_{T_k}^{T_{k+1}} \sigma_u^i \sigma_u^j \rho(u,T_i,T_j) du \equiv I_k  \\
\end{equation} 

However because the Doust correlation function is time homogeneous, so thatthe decorrelation between contracts 4 and 5 at time $T_1$ will be equal to the decorrelation between contracts 3 and 4 at time $T_0$. Thus in the determinsitic volatility case is the tims steps of the monte carlo simulation are chosen to match the maturities of the underlying forwards then calander time will not enter the covariance integral.

Things are not quite so straightforward in the stochastic volatility case. However at worst one simply has to store a covariance matrix at each time step (expiry of each forward) and then use linear interploation between them. The covariance matrices themselves thanks to the time homogeneous property of the Doust correlation function.

\begin{figure}[h]
\centering
\includegraphics[scale=0.1, angle=-1]{correlation.eps}
\caption{Example of Doust correlation structure}\label{fig:2.2}
\end{figure}

\subsection{Intrinsic Incompleteness of LMM}

In what follows I give a couple of very strightforwad example from Rebonato (2004) that highlights some of the problems surrounding the calibration of the LMM model even in the deterministic volatility environment. 

\begin{figure}[h]
\centering
\includegraphics[scale=0.1]{swaption.eps}
\caption{The reset and expiry times of the two forwards and of the swap rate}\label{fig:2.3}
\end{figure}


If we consider two forwards $f_1(t,T_1,T_2), f_s(t,T_2,T_3), SR_{12} = SR_{12}(t,T_1,T_3)$. The swap rate is a linear combination of the two forwards

\begin{equation}
 SR_{12} = w_1f_1 +w_2f_2  \\
\end{equation}
 

If we assume that all volatilities and correlation to be piecewise constant over the two time steps. Further assume that the forwards f1 and f2 both trade in the market with implied Black volatilities of $\hat{\sigma_1}$ and $\hat{\sigma_2}$ of $20\%$. The swaption Black volatility is taken to be $\hat{\sigma_{SR}}$ is $18\%$. 

As shown in Rebonato and Jackel (2002) a reasonably accurate approximation for the volatility of the swaption is given by


\begin{equation}
\sigma_{SR}^2SR^2= w_1^2f_1^2 \sigma_{1,1}^2+w_2^2f_2^2 \sigma_{2,1}^2 + w_1w_2f_1f_2\sigma_{1,1}\sigma_{2,1} \rho  \\
\end{equation} 

Clearly the volatility of the first forward must simply equal its Black volatiltity of $20\%$. Therefore so long as the net volatility of the second forward over the time periods is still matched to $20\%$

Then the problem of matching the swaption price is findamentally over specified. We have twp desgrees of freedom and only one price to match. This is true even though there is some constraint on the value if the 2nd forward.

\begin{equation}
\hat{\sigma_{2}^2}(t_2-T_0)= \int_{T_0}^{T_2} \sigma_2^2 du = \sigma_{2,1}^2)(T_1-T_0) + \sigma_{2,2}^2)(T_2-T_1) = 20\%  \\
\end{equation}

In the extrame we could take $\rho = 1$ and reduce the volatility between $T_1$ and $T_2$, clearly we would have to raise teh volatility of the same forward between times$T_2$ and $T_3$ to keep equation 2.35 balanced. Or we could keep the volatilties flat at $20\%$ and reduce the correlation. Or indeed any of an infinite number of combinations varrying the correlation and the instantaneous volatilities. 

As you can see the caplet and swaption surfaces can be fitted in an infintiy of ways. This is a problem because each fit corresponds to a different future evelotion of instantaneous volatilities and correlations and so give rise to different exotics prices. Thus the caplet and swaption markets are instrinsically incomplete even in the determinsitic volatility universe.

This is even more of a problem in the smile case. The LMM does allow the trader to at least directly express their views about the future term structure of volatilities and other unknowns, in a much more straight forwardly then with tradtional spot rate term structure models. However 
since vega hedging is essential to exotics derivaties trading, future re hedging costs must be considered and this requires future modelling of implied volatilities. This is why a plausible evolution of the smile is essential. If a model implies an implausible shape for the future smile as for example local-volatilitiy models do, it also implies implausible future prices for caplets and swaptions and therefore implausible exotics prices.


\section{Smile Models used with LMM}

\subsection{Local-Volatility Models}

\subsubsection{Shifted-Lognormal Model}

Following Brigo (2006) assume that the forward $f_j$ evolves under its $T_j$ forward measure according to

\begin{align}
	\begin{split}
f_j(t) &=X_j(f) + \alpha    \\   
dX_j(t) &=\beta(t)X_j(t)dW_t  
	\end{split}
\end{align}

where $\alpha$ is a real constant and $\beta$ is a deterministic function of time, $W_t$ is a standard Brownian motion.

Therefore a strighforward application of Ito's lemma and somesimple algebra gives:

\begin{align}
df_j(t) = \beta(t)(f_j(t)-\alpha)dW_t 
\end{align}

thus for $t < T \leq T_{j-1}$, the forward  rate $f_j$ can be written as

\begin{equation}
df_j(t) = \alpha + (f_j(t) -\alpha)\exp^{-\frac{1}{2}\int_t^T\beta^2(u)du + \int_t^T\beta(u)dW_u} 
\end{equation}

The distribution of $f_j(T)$, conditional on $f_jt(t)$, $t < T \leq T_{j-1}$ is then a shifted lognormal distribution with density

\begin{equation}
p_{f_j(T)|f_j(t)}(x) = \frac{1}{(x-\alpha)U(t,T)\sqrt{2 \pi}} \exp^{-\frac{1}{2}(\frac{\ln(\frac{x-\alpha}{f_j(t)-\alpha})+\frac{1}{2}U^2(t,T)}{U(t,T})^2)}
\end{equation}

for$ x > \alpha$, where

\begin{equation}
U(t,T) := \sqrt{\int_t^T \beta^2(u)du}
\end{equation}

so that for $ \alpha < K$ the caplet price \textbf{Cpl}$(t,T_{j-1},T_j,\tau_j,N,K)$ associated with equation (3.2) is given by

\begin{equation}
\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,N,K)=\tau_jNP(t,T_j)\textbf{Black}(K-\alpha,f_j(t)-\alpha,U(t,T_{j-a}))
\end{equation}

Thus

\begin{equation}
(f_j(0)- \alpha)[2 \Phi(\frac{1}{2}U(0,T_{j-1}))-1] = f_j(0)[2 \Phi(\frac{1}{2}\sqrt{T_{j-1}}\hat{\sigma}(f_j(0), \alpha))-1]
\end{equation}

clearly increasing $\alpha$ ont eh left hand side needs a corresponding reduction in $\hat{\sigma}$ on the right hand side. Further differentiating a(3.6) with respect to $\alpha$ clearly produces a value which is always negative.

Therefore for $\alpha < 0$ we always havea decreasing curve. Also for $\alpha < 0$ the whole curve is moved downwards and vice versa.

\begin{figure}[h]
\centering
\includegraphics[scale=0.1]{ShiftedLognormal.eps}
\caption{Caplet Volatility Structure $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.1}
\end{figure}

\subsubsection{Constant Elasticity of Variance Model}

Following Andersen and Andreasen 

\begin{equation}
df_j(t)=\phi(f_j(t))\sigma_j(t)dZ_j^j(t)
\end{equation}

where $\phi$ is a general function. Andersen and Andreasen suggest

\begin{equation}
\phi(f_j(t))=[f_j(t)]^\gamma
\end{equation}

with $0<\gamma<1$. The limiting cases of $\gamma=0$ and $\gamma=1$ which are the fully normal and lognormal dynamics respectively. For values ibn between the dynmaics are a mixture of teh two processes.

Thus the model is

\begin{equation}
\phi(f_j(t))=\sigma_j(t)[f_j(t)]^\gamma dW_t \qquad f_j=0 \text{ absorbing boundary when } 0< \gamma<\frac{1}{2}
\end{equation}

Where W is a one-dimensional Brownian motion under teh $T_j$ forward measure

For $0<\gamma<\frac{1}{2}$  (3.10) does not have a unique solution unless we set $f_j=0$ as an absorbing boundary for the above SDE. Andersen and Andreasen also consider the case of $\gamma>1$ but note that this can leas to explosions when leaving the $T_j$ measure. 

Time dependence of $\sigma_j$ can be dealt with through a determinstic time change

\begin{equation}
\upsilon(\tau,T) = \int_{\tau}^T \sigma_j(s)^2ds
\end{equation} 

definig 

\begin{equation}
\tilde{W}(\upsilon(0,t)) := \int_0^t \sigma_j(s)dW(s)
\end{equation}

which gives us a Brownian motion $\tilde{W}$ with time parameter $\upsilon$. This is substituted in equation (3.10) 

\begin{equation}
df_j(\upsilon) = f_j(\upsilon)^{\gamma}\tilde{dW}(\upsilon) \qquad f_j=0 \text{ absorbing boundary when } 0< \gamma<\frac{1}{2}
\end{equation}

 
This process can be transformed into a Bessel  process via a change of variable. Some manipulations allow us to find teh transition density of f. Remembering the time change the continuous part of the density function of $f_j(T)$ conditional on $f_j(t)$, $t < T \leq T_{j-1}$ is then given by



\begin{align}
	\begin{split}
p_{f_j(T)|f_j(t)}(x) &= 2(1- \gamma) k^{1/(2-2 \gamma)}(uw^{1- 4 \gamma})^{1/(4-4 \gamma) \exp^{-u-w} \textbf{I}_{1/(2 - 2 \gamma)}(2 \sqrt{uw})
\\ k &= \frac{1}{2 \upsilon(t,T)(1-\gamma)^2} 
\\ u &= k[f_j(t)]^{2(1-\gamma)}  
\\ w &=k x^{2(1-\gamma)} 
	\end{split}
\end{align}


With $I_q$ denoting the modified Bessel function of the first kind of order q.
Denoting by $G(y,z) = \frac{\exp^{-z}z^{y-1}}{\Gamma(y)}$ the gamma density function and by $G(y,x)= \int_x^{+\infty} g(y,z)dz$ the complmentary gamma distribution, the probability that $f_j(T) =0$ conditional on $f_j(t)$ is $G(\frac{1}{2(1-\gamma)},u)$ 

From these results an explicit solution for caplet prices can be derived

\begin{equation}
\begin{split}
\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,N,K) = \tau_jNP(t,T_j)[f_j(t).\sum_{n=0}^{+\infty}g(n+1,u)G(c_n,kK^{2(1-\gamma)}) \\ - K \sum_{n=0}^{+\infty}g(c_n,u)G(n+1,kK^{2(1-\gamma)})]
\end{split}
\end{equation}

where

\begin{equation}
c_n := n + 1 + \frac{1}{2(1- \gamma)}
\end{equation}


\begin{figure}[h]
\centering
\includegraphics[scale=1]{CEV.eps}
\caption{Caplet Volatility Structure $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.2}
\end{figure}


Andersen and Andreasen also propose an extension called the Limited CEV process that addresses te problem absorption at $F_j=0$. They propose a new process where


\begin{equation}
\phi(f) = f\text{ } min(\epsilon^{\gamma-1}.f^{\gamma-1})
\end{equation}

Basically the model collapses the process to a lognormal one for very low Levels of f. 



\subsubsection{Comment on CEV and Shifted Lognormal Models }

From visual inspection of Fig 3.1 and 3.2 it is quite clear that the smiles produces by the two models are qualitively similar. Indeed Rebonato (2002) shows that for a range of futures values there is a derect equivalence between the two models. He also shows how both these approaches can be applied within the LMM framework in a relatively straightforward maner.

The economic intuition behind these models is that in a log normal world and in a rising term structure environment a with rates at 1y equal to $3\%$ and 2y equal to $6\%$. Then a shock in rates should produce a price move twice as large in the $6\%$ rate. In a fully normal world the would both move by the same absolute amount. Empirically it has been obsrved that although the higher rate does tend to move more it does not move twice as much. As mentioned before the CEV model can be set to work as fully lognormal when $\gamma=1$ or normal when $\gamma=0$. Choosing a value in between allows the market behaviours to be matched.

Simple skews like this were witnessed inteh market as far back as 1996. It was only later after the Russia crises that full smiles were winessed in interest rate markets.


\subsubsection{Brigo and Mercurio Mixture Type Models }


Brigo and Mercurio (2001, 2002) proposed a class of analytically tractable diffusions that are flexible enough to cover more general voltility surfaces. 

Let the dynamics of the forward rate $F_j$ under the forward measure $Q^j$ be denoted by

\begin{equation}
dF_j(t) = \sigma(t,F_j(t))F_j(t)dW_t
\end{equation}

Where $\sigma$ is the local volatility and must be chosen such that it is a well behave deterministic function.

It is chosen to so that it grants a unique strong solution to the SDE (3.18). In particular it is assumed that $\sigma$ satisfies for a suitable positive constant L, the linear growth condition

\begin{equation}
\sigma^2(t,y)Y^2 \leq L(1+y^2) \text{ uniformly in t}
\end{equation}

which is there to ensure a strong solution.

They then consider the N diffusion process with dynamics given by

\begin{equation}
dG_i(t) \upsilon(t,G_i(t))dW_t \text{   i=1,...,N,   }G_i(0)=F_j(0)
\end{equation}

Where similar to equation $\sigma$ in (3.18) the $\upsilon_i(t,y)$ are real functions each satisfying the same linear growth conditions.

For each t, we denote by $p_t^i(.)$ the density function of $G_i(t)$, i.e., $p_t^i(y) = d(Q^j{G_i(t) \leq y})/dy$. $p_0^i$ is the $\delta$-Dirac function centred on $G_i(0)$

The problem is to derive the loical volatiltiy  $\sigma(t,S_t)$ such that the $Q^j$-density of $F_j(t)$ satisfies for each time t,

\begin{equation}
p_t(y) := \frac{d}{dy}Q^j{F_j(t) \leq y} = \sum_{i=1}^N \lambda_i \frac{d}{dy}Q^j{G_i(t) \leq y} = \sum{i=1}^N\lambda_ip_t^i(y)  \\
\end{equation}

whre teh $\lambda_i$'s are strictly positive constants such that $\sum_{i=1}^N \lambda_i =1$. Indeed $p_t(.)$ is a proper $Q^j$ density function since by definition

\begin{equation}
\int_0^{+\infty} yp_t(y)dy = \sum_{i=1}^N \lambda_i \int_0^{+\infty} yp_y^i(y)dy = \sum_{i=1}^N \lambda_iG_i(0) = F_j(0))  \\
\end{equation}

Brigo and Mercurio (2006) show that this is essentially the reverse of finding  the marginal density function of the solution of an SDE when the coefficients are known. $\sigma(t,F_j(t))$ can be found by solving the Fokker-Planck equation.

\begin{equation}
\frac{\partial}{\partial t} p_t(y) = \frac{1}{2}\frac{\partial^2}{\partial y^2}(\sigma^2(t,y)y^2 p_t(y))
\end{equation}
 
as each density $p_t^i(y)$ satisfies the Fokker-Planck equation

\begin{equation}
\frac{\partial}{\partial t^i}p_t(y) = \frac{1}{2}\frac{\partial^2}{\partial y^2}(\upsilon^2(t,y)y^2p_t^i(y))
\end{equation}

Applying he definition (3.21) and the linearity of the derivative operator(3.23) can be written as

\begin{equation}
\sum_{i=1}^N \lambda_i \frac{\partial}{\partial t} p_t^i(y)= \sum_{i=1}^N \lambda_i[-\frac{\partial}{\partial y}(\mu y p_t^i(y))]+\sum_{i=1}^N\lambda_i[\frac{1}{2}\frac{\partial^2}{\partial y^2}(\sigma^2(t,y)y^2p_t^i(y))]
\end{equation}

which by substituting from (3.24) becomes

\begin{equation}
\sum_{i=1}^N \lambda_i [\frac{1}{2}\frac{\partial^2}{\partial y^2}(\upsilon_i^2(t,y)p_t^i(y))]= \sum_{i=1}^N \lambda_i [\frac{1}{2} \frac{\partial^2}{\partial y^2}(\sigma^2(t,y) y^2 p_t^i (y))]
\end{equation}

again using the linearity of the second order derivative operator

\begin{equation}
\frac{\partial^2}{\partial y^2}[\sum_{i=1}^N \lambda_i \upsilon_i^2(t,y) p_t^i (y)]= \frac{\partial^2}{\partial y^2}[\sigma^2(t,y) y^2 \sum_{i=1}^N \lambda_i p_t^i(y)]
\end{equation}

The general solution of this second order differential equation is found as

\begin{equation}
\sigma^2(t,y) y^2 \sum_{i=1}^N \lambda_i p_t^i (y) = \sum_{i=1}^N \lambda_i \upsilon_i^2 (t,y) p_t^i(y) +A_{t}y+ B_t
\end{equation}

Where A and B are suitable function of time. The regulatoy conditions as shown for $\sigma$ in (3.19) imply that the left hand side of the equation has zero  limit for $y \rightarrow \infty$. As a consequence, the RHS must have a zero limit as well. This is only true if $A_t=B_t=0 \text{ } \forall t$. Therefore the $\sigma$ consistent with the marginal density (3.21) and the regularity contraints for $(t,y)>(0,0)$

\begin{equation}
\sigma(t,y) =\sqrt{\frac{\sum_{i=1}^N\lambda_i\upsilon_i^2(t,y)p_t^i(y)}{\sum_{i=1}^N\lambda_iy^2p_t^i(y)}} \\
\end{equation}

setting

\begin{equation}
\Lambda_i(t,y) :=\frac{\lambda_i p_t^i(y)}{\sum_{i=1}^N\lambda_iy^2p_t^i(y)} \\
\end{equation}

for each i=1,...,N and $(t,y)>(0,0)$, we can write

\begin{equation}
\sigma(t,y) = \sum_{n=1}^N \Lambda_i(t,y) \frac{\upsilon_i^2(t,y) }{y^2} \\
\end{equation}

so that the square of the volatility $\sigma$ can be written as a stochastic convex combination of the squared volatilities of basic processes (3.20). For each (t,y), $\Lambda_i(t,y) \geq 0$ for each i and $\sum_{i=1}^N \Lambda_i(t,y) = 1$. Moreover, by (3.19) and setting $L := max_{i=1,...,N}L_i$ the consition (3.19) is fufilled since

\begin{equation}
\sigma^2(t,y)y^2 = \sum_{i=1}^N \Lambda_i(t,y) \upsilon_i^2(t,y) \leq \sum_{i=1}^N \Lambda_i(t,y) L_i(i+y^2) \leq L(1 +y^2) \\
\end{equation}

Formula (3.29) leads to the following SDE of the forward rate under the measure $Q^j$

\begin{equation}
dF_j(t) = \sqrt{\frac{\sum_{i=1}^N \lambda_i\upsilon_i^2(t,F_j(t))p_t^i(F_j(t))}{\sum_{i=1}^N \lambda_iF_j(t)^2p_t^i(F_j(t))}}F_j(t)dW_t \\
\end{equation}
 
This SDE simply defines some candidate dynamics that leads to the marginal density (3.21), If $\sigma$ is bounded, then the SDE is well defined, but the conditions imposed are by themselves not enough to guarantee this, so verification must be done on a case by case basis.

If we assume that the SDE has a strong solution the caplet prices associated with the base model (3.20) can be shown to be

\begin{center}
\begin{align}
\begin{split}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_k,K) &= \tau_j P(0,T_j) E^j{[F_j(T)-K)]^+} \\
 &= \tau_j P(0,T_j) \sum_{i=1}^N \lambda_i \int_0^{+\infty} [y-K]^+ p_{T_j}^i(y)dy \\
 &= \sum_{i=1}^N \lambda_i \textbf{Cpl}^i(0,T_j-1,T_j,\tau_j,K) \\
\end{split}
\end{align}
\end{center}

In this set up $F_j$ can be viewed as a process at time t coincides withthe basic sensity $p_t^i$ with probability $\lambda_i$. As noted by Brigo and Mercurio (2006) when proposing altenative dymanics it can be difficult to come up with analytical formula for caplets. Here such a problem can be avaoided by design as analytically tractable densities have been used $p^i$. Moreover given that N is unbounded a virtually unlimited number of patamters can be introduced allowing for an arbitrarily good fit to the market. In practice one has to trade off such flexibility againt over parameterising the model.


\subsubsection{Lognormal Mixture Model}

This is an applciation of the general dynamics (3.33).

\begin{equation}
\upsilon_i(t,y) = \sigma_i(t)y
\end{equation}

where all the $\sigma_i$'s are determinsitc and continuous functions of time that are bounded above and below by positive constants.
  
\begin{align}
\begin{split}
p_t^i(y) &= \frac{1}{y V_i(t)\sqrt{2 \pi}} \exp^{ \left{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}+ \frac{1}{2} V_i^2(t)]^2}\right}} \\
V_i(t) &:= \sqrt{\int_0^t \sigma_i^2(u)du} \\
\end{split}
\end{align}


Then as shown by Brigo and Mercurio (2006)

\begin{equation}
\upsilon(t,y) := \sqrt{\frac{\sum_{i=1}^N\lambda_i \sigma_i^2(t) \frac{1}{V_i(t)} \exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}+\frac{1}{2}V_i^2(t)}]^2 \}}}{\sum_{i=1}^N\lambda_i \frac{1}{V_i(t)} \exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}+\frac{1}{2}V_i^2(t)}]^2 \}}}} \\
\end{equation}

for $(t,y) > (0,0)$ and $\upsilon(t,y) = \sigma_0$ for  for $(t,y) = (0,F_j(0))$, the SDE


\begin{equation}
dF_j(t)=\upsilon(t,F_j(t))F_j(t)dW_t \\
\end{equation}


The marginal density of $G_i(t)$, for each time t, is then lognormal and given by (3.21) of lognormals (3.36). This equation above is the analytical epxpresion for the diffusion coefficient in the SDE (3.18) such that the resulting equation has a unique string solution whose marginal density is given by (3.21)

The quare of the local volatility(t,y) can be viewed as a weighted average of teh badic volatilities $\sigma_1^2(t),..., \sigma_N^2(t)$, whre the weights are all functions of the lognormal marginal densities. That is for each i,....,N and $(t,y)>(0,0)$

\begin{align}
	\begin{split}
\upsilon^2(t,y) &= \sum_{i=1}^N \Lambda_i(t,y) \sigma_i^2(t)\\
\Lambda_i(t,y) &:= \frac{\lambda_ip_t^i(y)}{\sum_{i=1}^N\lambda_ip_y^i(y)} \\
	\end{split}
\end{align}

As a consequence for each $t>0$ and $y>0$ the function $\upsilon$ is bounded from above and below by stoctly positive constants.

At $t=0$ the pricing a caplet can be under the forward rate dynamics (3.38) can be shown to be

\begin{align}
	\begin{split}
P(0,T_j)E^j \{ [F_j(T_{j-1})-K]^+ \} &= P(0,T_j) \int_0^{+ \infty} (y-K)^+p_{T_{j-1}}(y)dy \\
 &= P(0,T_j) \sum_{i=1}^N \lambda_i(y-K)^+ p_{T_{j-1}}^i(y)dy \\
	\end{split}
\end{align}

so that the caplet price is given by

\begin{equation}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_j,\tilde{N},K)=\tau_j\tilde{N}P(0,T_j)\sum_{i=1}^N\lambda_i \text{Bl}(K,F_j(0),V_i(T_j-1))
\end{equation


An exmaple of the smile these dynamics can produce is shown in fig (3.3). It can produce smiles with the minimum volatility being the ATM. It can also produce skew type shapes, but with zero slope at the ATM level.

\begin{figure}[h]
\centering
\includegraphics[scale=0.06]{LM.eps}
\caption{Caplet Volatility Structure $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.3}
\end{figure}

\subsubsection{LMM dynamics under Lognormal Mixture Model}

As shown in Brigo 2006 thw LMM dynamics for the Lognormal mixture model are

\begin{equation}
  dF_k(t) =
  \begin{cases}
    \upsilon_j(t,F_j(t))F_j(t) \displaystyle\sum_{k=i+1}^j \frac{\rho_{j,k}\tau_k \upsilon_k(t,F_k(t))F_k(t)}
      {1+\tau_kF_k(t)}dt \\ +\upsilon_j(t,F_j(t))F_j(t)dW_j^i(t) &\text{if }i<j, t \leq T_i.\\
    \upsilon_j(t,F_j(t))F_j(t)dW_j^i(t) &\text{if }i=j, t \leq T_i .\\
    -\upsilon_j(t,F_j(t))F_j(t) \displaystyle\sum_{k=j+1}^i \frac{\rho_{j,k}\tau_k \upsilon_k(t,F_k(t))D_k(t)}
      {1+\tau_kF_k(t)}dt \\ +\upsilon_j(t,F_j(t))F_j(t)dW_j^i(t) &\text{if }i > j, t \leq T_i.
  \end{cases}
\end{equation}

where $V_i,j(t) $ and $\upsilon(t,y)$ are as defined in euqations 3.36 and 3.37 respectively.

The same equations are valid for other model in this section you simply substitute the appropriate expression for $\upsilon$.

\subsubsection{Shifting the Lognormal Dynamics}

Brigo and Mercurio (2000b) proposed to entend the lognormal misture model by combining it with the displced diffusion technique. Where the forward process is given by

\begin{equation}
F_j(t)= \alpha + \overline{F}_j(t) \\
\end{equation}

where $\alpha$ is a real constant and $\overline{F}_j$ evolves according to the LM dynamics given in (3.37). Brigo and Mercurio state that this is the genertal affine transformation for which the forward-rate process is still a martingale under its canonical measure.

The analytical expression for the marginal density of the process is given by shifted mixture of lognormals

has a unique strong solution whose marginal density is given by the mixture  of lognormals


\begin{equation}
p_t(y)= \sum_{i=1}^N \lambda_i \frac{1}{(y-\alpha)V_i(t)\sqrt{2 \pi}}\exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y-\alpha}{F_j(0)- \alpha}+\frac{1}{2}V_i^2(t)}]^2\}}
\end{equation}


with $y > \alpha$

From Ito's lemma we obtain the forward rate process evolves according to

\begin{equation}
dF_j(t)= \upsilon(t,F_j(t)- \alpha)(F_j(t)- \alpha)dW_t
\end{equation}

The model for the forward rate process preserves the analytical tractability of the original process $F_j$

\begin{equation}
	P(0,T_j)E^j\{[F_j(T_{j-1})-K]^+\} = P(0,T_j)E^j\{[\overline{F}(T_{j-1})-(K-\alpha)]^+\}
\end{equation}

so that for $ \alpha < K$  the caplet price is given by

\begin{equation}
	\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,\overline{N},K)=\tau_j\overline{N}P(t,T_j)\sum_{i=1}^N \lambda_iBl(k-\alpha,F_j(0)-\alpha,V_i(T_{j-1}))
\end{equation}

Moreover Brigo and Mercurio show that the implied volatiltiy is given by

\begin{equation}
\hat{\sigma}(0)=\frac{2}{\sqrt{T_{j-1}}}\Phi^{-1}(\frac{F_j(0)-\alpha}{F_j(0)}) \sum_{i=1}^N \lambda_i \Phi(\frac{1}{2}V_i(T_{j-1})+\frac{\alpha}{2F_j(0)}
\end{equation}


For $\alpha=0$ $F_j$ and $\overline{F}_j$ are the same. Decreasing $\alpha$ the variance of each forward rate increases while meiantianin the same expectation. $\alpha$ affects the shape of the implied volatiltiy curve in two ways, firstit causes a parallel shift in the curve. Second it moves the position of the minimum volatiltiy. Specifically if $\alpha>0$ the minium is reached for strikes lower than the ATM.




\subsubsection{Lognormal-Mixture with Different Means}

Following Brigo, Mercurio and Sartolli(2003) here the $p_t^i$'s are still lognormal but their means are now assumed to be different. Specifically $G_i$ is assumed to evolve user $Q^j$ according to

\begin{equation}
dG_i(t)=\mu_i(t)G_i(t)dt+\sigma_i(t)G_i(t)dW_t \text{ i=1,...,N } G_i(0)=F_j(0)
\end{equation}

where aginan the $\sigma_i$'s satisfy the same conditionas as in the Lognormal mixture model and the $\mu_i$'s are deterministic functions of time. The density of $G_i$ at time t is given by

\begin{align}
	\begin{split}
p_t^i(y) &= \frac{1}{yV_i(t)\sqrt{2\pi}}\exp^{\{-\frac{1}{2V_i^2(t)}[\ln{\frac{y}{F_j(0)}-M_i(t) + \frac{1}{2}V_i^2(t)}]^2\}} \\
M_i(t) &:= \int_0^t \mu_i(u) du \\
	\end{split}
\end{align}

Where $V_i$ is defined as before. The functions $\mu_i$ must be chosen such that

\begin{equation}
\sum_{i=1}^N \lambda_i \exp^{M_i(t)}=1 \text{  } \forall{t} > 0
\end{equation}

This is to ensure $p_t(y)=\sum_{i=1}^N \lambda_ip_t^i(y)$ must have a constant mean equal to $F_j(0)$ 


As before we look for a diffusion coefficient such that $\psi(.,.)$ such that the SDE

\begin{equation}
dF_j(t) = \psi(t,F_j(t))F_j(t)dW_t
\end{equation}
  
has a solution with marginal density $p_t(y) = \sum_{i=1}^N \lambda_ip_t^i(y)$. Applying again the Fokker-Planck equation to processes $F_j$ and $G_i$'s

\begin{align}
	\begin{split}
\psi(t,y)^2 &:= \upsilon(t,y)^2 + \frac{2 \sum_{i=1}^N \lambda_u \mu_i(t) \int_y^{+\infty} x p_t^i(x)dx}{y^2\sum_{i=1}^N\lambda_ip_t^i(y)} \\ 
&= \upsilon(t,y)^2 + \frac{2F_j(0)\sum_{i=1}^N \lambda_i \mu_i(t) \exp^{M_i(t)}\Phi(\frac{\ln{\frac{F_j(0)}{y}+M_i(t)+\frac{1}{2}V_i^2(t)}}{V_i(t)})}{y^2\sum_{i=1}^N\lambda_ip_t^i(y)} \\
	\end{split}
\end{align}

with $\upsilon$ defined as

\begin{equation}
\upsilon(t,y)^2 = \frac{\sum_{i=1}^N\lambda_i\sigma_i(t)^2p_t^i(y)}{\sum_{i=1}^M\lambda_ip_t^i(y)}
\end{equation}
 

Brigo,Mercurio and Sartorelli (2003) go on to show that the price of a caplet is given by

\begin{equation}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_j,K) = \tau_j P(0,T_j) \sum_{i=1}^N \lambda_i \exp^{M_i(T_{j-1})} Bl(K \exp^{-M_i(T_{j-1}},F_j(0),V_i(T_{j-1}))
\end{equation}


The non zero drifts in the $G_i$-dynamics allows steeper and more skewed smiles to be produced than in the zero drift case, with minimums that can be shifted further from the ATM strike. 

 
\subsubsection{Hyperbolic-Sine Processes}

Follwoing again Brigo and Mercurio (2006) it is now assumed that the processes $G_i$ evolve under $Q^j$ acording to a hyperbolic-sine process

\begin{equation}
G_i(t) = \beta_i(t)sinh[\int_0^t \alpha_i(u)dW_u-L_i] \text{  i=1,...,N,  } G_i(0)=F_j(0) 
\end{equation}
 
where the $\alpha$'s are positive and determinisitc functions of time, $L_i$'s are negative constants, and $\beta_i$'s are chosen so as to render the $G_i$'s martingales

\begin{equation}
\beta_i(t) = \frac{F_j(0)\exp^{-\frac{1}{2}A_i^2(t)}}{\sinh{(-\alpha_iLi)}} 
\end{equation}

where we set $A_i(t) := \sqrt{\int_0^t \alpha_i^2(u)du}$

Each $G_i$ is defined as increasing function of a time changed Brownian motion

\begin{equation}
dG_i(t)=\alpha_i(t)\sqrt{\beta_i^2(t) +G_i^2(t)}dW_t \text{ i=1,...,N } 
\end{equation}

From visual inspection of the SDE's diffusion coefficient we immediately notice that it is roughly deterministic for small values of $G_i(t)$, whereas it is roughly proportional to $G_i(t)$ for large values og $G_i(t)$. Therefore, in the former case, the dynamics are approximately of the gaussian type, whereas in the latter they are approximately lognormal.

The cumulative distribution function of the process $G_i$ at each time t is easily defined as follows

\begin{align}
 \begin{split}
Q^j \{G_i(t) \leq y \} &= Q^j \{ \int_0^t \alpha_i(u)dW_u \leq L_i + \sinh^{-1}(\frac{y}{\beta_i(t)})\} \\
&= \phi(\frac{L_i}{A_i(t)}+\frac{1}{A_i(t)} \sinh^{-1}(\frac{y}{\beta_i(t)})) 
 \end{split}
\end{align}

so that the time-t marginal density of $G_i$ is

\begin{equation}
p_t^i(y) = \frac{exp^{\{-\frac{1}{2A_i^2(t)}[L-i + \sinh^{-1}(\frac{y}{\beta_i(t)})]^2 \}}}{A_i(t)\sqrt{2\pi}\sqrt{\beta_i^2(t) + y^2}}
\end{equation}

Further they show that the associated caplet price is

\begin{equation}
\begin{split}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_j,K)=\tau_j P(0,T-j)[\frac{F_j(0)}{2\sinh(-L_i)}(exp^{-L_i}   \Phi(\overline{y}_i(T_{j-1}) + \\ A_i(T_{j-1})-\exp^{L_i}\Phi(\overline(y)_i(T_{j-1}-A_i(T{j-1})))-K\phi(\overline(y)_i(T_{j-1}))]
\end{split}
\end{equation}
 
\subsubsection{Empirical Comparison of Models}
 
Brigo and Mercurio (2006) found that the Lognormal mixture model (LM) was usually good enough to capture smile shaped curves. Some example fits are shown in Figs 3.5 and 3.6 However in the presense of clear asymmetries, one has to allow for different means in the lognormal densities, particularly to produce a non zero slope at the ATM point. For skew shaped curves the Hyperbolic-Sine curve was found to perfom best. However even in this case steep skews for low strikes nad short maturities can still be difficult to accomodate. 
They also report that the LM model provides a much better fir than the CEV and shifted lognormal models.

Whilst a good fit to the current data what is perhaps evenmore important is future implied volatilities produced by the model. In practice all exotics traders have to vega hedge and so having a model that can predict realistic future evolutions of the implied volatlilty surface are crucial to reduce hedging errors. in this respect the CEV and shifted lognormal models are preferable as they allow the analytical expression of thr transition density of the forward rate process, while the lognormal mixture model does not. The next set of local volatility models try to retain the superior fit of the lognormal misture model while keeping them analytically tractable.

\begin{figure}[h]
\centering
\includegraphics[scale=1]{ImpVol1.eps}
\caption{Implied Volatility Curves for the 4 year maturity $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.4}
\end{figure}

\begin{figure}[h]
\centering
\includegraphics[scale=1]{ImpVol2.eps}
\caption{Implied Volatility Curves for the 4 year maturity $\hat{\sigma}(K,\alpha)$ plotted against strike}\label{fig:3.5}
\end{figure}


\subsubsection{Analytically Tractable Local Volatility Models}

Brigo and Mercurio (2003) introduce a new class odf models that seeks to retain a good fits to cap data whilt still giving the highest degree of analytical tractability. 

Assume that $F_j$ can be expressed by the following (time-dependent) transformationof the Brownian motion W

\begin{equation}
F_t(j) = h(t,W_t) \text{ for each } 0 \leq t \leq T_{j-1}
\end{equation}

where the h function satisfies

\begin{enumerate}
	\item h belongs to $C^{1,2}(D_j)$, with $D_j :=[0,T_{j-1}]*\mathbb{R}$
	\item $h(t,\omega)>0$ for each $(t,\omega)$ $\in D_j$
	\item for each $t>0$ the function $h_t : \mathbb{R} \rightarrow \mathbb{R}^+. \omega \mapsto h_t(w) := h(t,w)$ has zero limit at minus infinity, $lim_{w \rightarrow -\infty}ht(w)=0,$ and is stictly increasing i.e. $dh_t(\omega)/d\omega > 0$ , so that for each $t>0$ the function $h_t$ is invertible and the inverse function $h_t^{-1}$ is differentiable 
	\item $E^j \{ h(T_{j-1},W_{T_{j-1}})\}$ exists is finite and $E^j \{h(T_{j-1},W_{T_{j-1}})|F_t \}=h(t,W_t)$ for each $0 \leq t \leq T_{j-1}$ so that $F_j$ is a martingale 
\end{enumerate}


A simple example of such a process is $h(t,\omega) =a \exp^{(-b^2\frac{t}{2}+b\omega)}$ with $a,b > 0$ which leads to a  geometric Brownian motion.

The follwoing section presesnts the general framework which preserves the analytical tractability of the Brownian motion, but includes models that can produce realistic implied volatility smiles.

The SDE followed by the forward rate $F_j$ can be shown via Ito's Lemma to be

\begin{align}
 \begin{split}
dF_j(t) &= [\frac{\partial h}{\partial t}(t,W_t) + \frac{1}{2}\frac{\partial^2h}{\partial \omega^2}(t,W_t)]dt+\frac{\partial h}{\partial \omega}(t,W_t)dW_t \\
			  &= \frac{\partial h}{\partial \omega}(t,h_t^{-1}(F_j(t)))dW_t \\
			  &= \sigma(t,F_j(t))F_j(t)dW_t
 \end{split}
\end{align}

where the definition of the local volatility term had been ustilised, and where the drift due to the last assumption on h and the Feynman-Kac theorem. The process $F_j$ id therefore a one-dimensional diffusion.

The marginal density can be shown to be

\begin{equation}
p_t(t) = \frac{d}{dx}Q^j \{F_j(t) \leq x \} = \frac{1}{\sqrt{2 \pi t}} \exp^{1\frac{1}{2t}[h_t^{-1}(x)]^2} \frac{d}{dx}h_t^{-1}(x)
\end{equation}

The forward rate $F_j(T)$ conditional in $F_j(t)$ can be written as


\begin{equation}
F_j(T) = h(T,h_t^{-1}(F_j(t)) + W_t-W_t)
\end{equation}

Tha transitional density denoted $p(t,y;T,x)$ i.e. the density of $F_j(T)$ conditional on $F_j(t)=y$ can be shown to be


\begin{align}
 \begin{split}
p(t,y;T,x)  &= \frac{d}{dx}Q^j \{F_j(t) \leq x |F_j9t)=y \}
						&= \frac{1}{\sqrt{2\pi(T-t)}} \exp^{-\frac{1}{2(T-t)[h_t^{-1}(x)-h_t^{-1}]^2\frac{d}{dx}h_t^{-1}(x)}
 \end{split}
\end{align}
 

For each $f<T_j-1$, the function $h_t$ can be written in terms of $h_{T_{j-1}}^{-1}$ as


\begin{equation}
h_t(\omega) = \int_{-\infty}^{+\infty} \Phi(\frac{\omega-h_{T_{j-1}}^{-1}(z)}{\sqrt{T_{j-1}-t}})dz
\end{equation}

For each $t<T_{j-1}$ the derivative of function $h_t$ can be shown to be


\begin{equation}
\frac{d}{d \omega} h_t(\omega) = \int_0^{+\infty} \frac{\exp^{-\frac{1}{2(T{j-1}-t}[\omega-h_{T_{j-1}}(z)]^2}}{\sqrt{2 \pi (T_{j-1}-t}}dz
\end{equation}

Finally the caplet price in the framework can be shown to be


\begin{equation}
\textbf{Cpl}(t,T_{j-1},T_j,\tau_j,K) = 
\tau_j P(t,T_j) \int_{h_{T_{j-1}}(K)-h_t^{-1}(F_j(t))}^{+\infty} \frac{h_t^{-1}(F_j(t))+\omega}{\sqrt{2 \pi(T_{j-1}-t)}} \exp^{-\frac{\omega^2}{2(T_{j-1}-t}}d \omega -K\tau_jP(t,T_j)\Phi(\frac{h_t^{-1}(F_j(t))-h_{t{j-1}}^{-1}(K)}{\sqrt{T_{j-1}-t}})
\end{equation}


where in general the final intergal must be evaluated numerically. The following sections show examples where this integral can be obtained explicitly.

\subsubsection{Mixture of GBM's}

A particular case of the dynamics in (3.41) described in Brigo and Mercurio (2003) consists of a linear combination of N driftless geometric Brownian motions that are prefectly instantaneously correlated

\begin{align}
 \begin{split}
	F_j(t) &= h(t,W_t) \text{  for each  } t \geq 0 \\
	h(t,\omega) &= h_t(\omega) = \sum_{i=1}^N \psi_i \exp^{- \frac{1}{2} \beta_i^2 t + \beta_i \omega}  
 \end{split}
\end{align}
 
where $F_j(0)$,$B_i$'s and $\psi$'s are postive constants

This function satisfies the four assumptions and the derivative of its inverse is $h_t^{-1}$ 

\begin{equation}
\frac{d}{dx}h_t^{-1}(x) = \frac{1}{\frac{d}{d \omega}h_t(h_t^{-1}(x))} = \frac{1}{ \sum_{i=1}^N \psi_i \beta_i \exp^{-\frac{1}{2}\beta_i^2t+\beta_ih_t^{-1}(x)}}
\end{equation}

The initial condition imposes that

\begin{equation}
\sum_{i=1}^N\psi_i = F_j(0)
\end{equation}

so that setting for each i, $\lambda_i := \psi_i/F_j(0)$, we can write the $F_j$ as a mixture of N (driftless Brownian motions starting at $F_j(0)$

\begin{align}
 \begin{split}
	F_j(t) &= \sum_{i=1}^N \lambda_iY_i(t) \\
	dY_i(t) &= Y_i(t) \beta_idW_t, \text{  } Y_i(0) = F_j(0)  \\
 \end{split}
\end{align}
 
Applying Ito's lemma

\begin{align}
 \begin{split}
	dF_j(t) &= \sum_{i=1}^N \lambda_iY_i(t) \beta_i dW_t \\
	&= \sum_{i=1}^N \psi_i \beta_i \exp^{\frac{1}{2}\beta_i^2t+\beta_i h_t^{-1}(F_j(t))} dW_t  \\
	&= \overline{\sigma}(t,F_j(t))F_j(t)dW_t
 \end{split}
\end{align}

which is obviously consistent with (x.xx) and where the local volatiltiy function $\overline{\sigma}$ is defined by the last equality. Hence

\begin{align}
 \begin{split}
dF_j(t) &= F_j(t) \sum_{i=1}^N \Lambda_i(t,F_j(t)) \beta_i dW_t   \\
\Lambda_i(t,z) &:= \frac{\psi_i \exp^{-\frac{1}{2}\beta_i^2t+\beta_ih_t^{-1}(z)}}{\sum_{k=1}^N\psi_k \exp^{-\frac{1}{2}\beta_k^2t+\beta_k h_t^{-1}(z)}}  	
 \end{split}
\end{align}

The local colatility $\overline{\sigma} $ can be thus be viewed as a stochastic weighted average of teh basic volatilities $\beta_i$'s and since the $\Lambda_i$'s are positive and sum to one.

Mercurio and Brido then go on to show that the caplet price is given by



\begin{equation}
\begin{split}
\begin{flushleft}
\textbf{Cpl}(0,T_{j-1},T_j,\tau_j,K) = \tau_j P(0,T_j) [\sum_{i=1}^N \psi_i \exp^{-\frac{1}{2} \beta_i^2t+\beta_ih_t^{-1}(F_j(t))} \Phi(\frac{\beta_i(T_{j-1}-t)-h_{T_{j-1}}^{-1}(K)+ h_t^{-1}(F_j(t))}{\sqrt{T_{j-1}-t}}) -K\Phi(-\frac{h_t^{-1}(F_j(t))-h_{T_{j-1}}^{-1}(K)}{\sqrt{T_{j-1}-t}})]
\end{flushleft}
\end{split}
\end{equation}

The implied volatility curves obtained from the caplet price equation above are increasing and concave in the strike. This makes them unsuitable for use in practice. The same thing happensin teh shifted log-normal model when the shift paramter $\alpha$ is positive . The reason is that the shifted lognormal is a particular case of the GBM mixture model (set N=2. $\beta_i=0$ and $\beta_2>0$).

\subsubsection{Extended Mixture of GBM's}

Brigo and Mercurio (2006) postulate that dealing with positive combinators $\psi$ is too restictive to adequately fit the implied volatility surface. So they allow for negative combinators without acompromising the analytical tractability of the model.


**check out Pitarberg's critique of mixing models
Imply from the market what you can hedge and esimate econometrically what you can not robonato et al 2009.

COVER CAP AND SWAPTION PRICING GIVING EXAMPLES OF BOTH. EXAPLIN NOT UNIQUE EVEN IN DESCRETE VOL CASE. lINK TO MARKET PRICE OF RISK WICH CAN CHANGE AND CANNOT BE LOCKED IN.


kahneman and Tversky 2000 are utility models useless?
chap 23  real life dynamics rebonato 2004
add about tacolomy of LMM very brief history of term structure models? 
What is the smile see Rebonato 2002.
smile models Rebonato 2004.
LFM and SLM incompatible but are difference s material rebonato2002 no Brigo?
Brigo 204  nice example of why correaltion are material to swaption prices
How many factors for good correlation cover

\bibliographystyle{plainnat}
\bibliography{volsmile}
\end{document}

User avatar
gmedina
Posts: 2313
Joined: Wed Jul 11, 2007 11:45 pm

Re: All new equations generated with the same equation number

Post by gmedina »

Hi,

your code still contains errors. Once you correct them, the numbering problem disappears. In some hours, I will try to give you some hints to get rid of all the errors that I found.
1,1,2,3,5,8,13,21,34,55,89,144,233,...
User avatar
localghost
Site Moderator
Posts: 9202
Joined: Fri Feb 02, 2007 12:06 pm

All new equations generated with the same equation number

Post by localghost »

bazman wrote:[...] Please excuse the mamouth code sample. But the nature of the problem is such that I don't know how to reduce the size while still retaining the error. [...]
You can avoid such a mass of errors by fixing the first one directly as it occurs. Moreover you better attach such long code as complete file by uploading it to the forum server.


Best regards
Thorsten
bazman
Posts: 78
Joined: Mon Jan 26, 2009 3:24 am

Re: All new equations generated with the same equation number

Post by bazman »

Hi there,

I have uploaded the transfer document as you suggested.

I is still showing about a total of 50 errors but I don't think there are more than approx 5 erros remaining. The same error seems to produce multiple error messages.

I so not know how to resolve the remaining errors any help you can give would be greatfully received.

Thanks
Attachments
Transfer Panel Document.tex
(56.53 KiB) Downloaded 234 times
User avatar
gmedina
Posts: 2313
Joined: Wed Jul 11, 2007 11:45 pm

Re: All new equations generated with the same equation number

Post by gmedina »

I regret that you couldn't wait until I pointed out the errors in your other post, as I said I would do. Now, I don't have time to look at yet another document.
1,1,2,3,5,8,13,21,34,55,89,144,233,...
User avatar
localghost
Site Moderator
Posts: 9202
Joined: Fri Feb 02, 2007 12:06 pm

All new equations generated with the same equation number

Post by localghost »

bazman wrote:[...] I so not know how to resolve the remaining errors any help you can give would be greatfully received. [...]
Take the code and shift the \end{document} line systematically from the beginning to the end for fixing the errors. The log file is very clear in pointing out where an error can be found and what it was caused by. Clean up the code and if the initial problem persists we will look further.
bazman
Posts: 78
Joined: Mon Jan 26, 2009 3:24 am

All new equations generated with the same equation number

Post by bazman »

Please accept my apologies the last thing I want to do is piss either of you off!!

I'm going through trying to sort the problems out

The follwoing equaiton doesn't generate any errors but it comes out right justified?

I tried using flushleft but that doesn't seem to work with align can you advise please.

Code: Select all

\documentclass[12pt]{article}
\usepackage{natbib}
\usepackage{amsmath}
\usepackage{amsfonts}
\usepackage{bm}
\usepackage{grffile}
\usepackage{graphicx}
\usepackage[%
  font=small,
  labelfont=bf,
  figurewithin=section,
  tablewithin=section,
  tableposition=top
]{caption}
\numberwithin{equation}{section}

\makeatletter
  \def\env@matrix{\hskip -\arraycolsep
  \let\@ifnextchar\new@ifnextchar
  \array{*\c@MaxMatrixCols l}}
\makeatother
\begin{document}


\begin{equation}
\begin{split}
\textbf{Cpl} (0,T_{j-1},T_j,\tau_j,K) =  \\ \tau_j P(0,T_j) [\sum_{i=1}^N \psi_i \exp^{-\frac{1}{2} \beta_i^2 t+ \beta_i h_t^{-1}(F_j(t))} \Phi( \frac{\beta_i(T_{j-1}-t)-h_{T_{j-1}}^{-1}(K)+ h_t^{-1}(F_j(t))}{\sqrt{T_{j-1}-t}}) \\ -K \Phi(\frac{h_t^{-1}(F_j(t))-h_{T_{j-1}}^{-1}(K)}{\sqrt{T_{j-1}-t}})]
\end{split}
\end{equation}

\end{document}
Post Reply