2. Literature Review
This paper explores the behaviour of the SA interest rates in terms of historical time series and a cross-section of yields across a maturity spectrum. Inspired by the seminal work of [
1], we proceed by implementing their model and its maximal counterpart. Among three options of models tested on US Treasury swaps, their
was found to perform better than other models, followed by their
. Initially, they consider a comprehensive framework for the specification, analysis, and classification of ATSMs. They provide a complete characterisation of admissible and identified ATSMs from which it is required that sufficient general conditions exist; see [
11], who describe the regular affine process. They also characterise the sufficient general conditions that must be met for a process to be affine; see [
7,
8] among others.
ATSMs are among popular models in the vast literature on interest rates term structure and bond pricing. Few examples are the early generation consisting of a single-factor Gaussian of [
12], and a square-root process by [
13], extended by [
14] into a multi-factor. The next generation are the correlated mixture affine models of [
1,
9], among others. The reason for their popularity is the ability to accommodate stochastic volatility, jumps and correlations among factors driving the asset returns; and lead to computationally tractable closed-form prices, and estimation through moment equations; see [
7]. Among research problems addressed using ATSMs is the description and treatment of the co-movement of short and long-term bond yields. An affine process
Y is defined as one in which a conditional mean
and variance
are affine functions of
Y. The process is further defined and characterised by [
11] as regular affine process, a class of time-homogeneous Markov processes. They consider a state space
, for integers
and
, from which the logarithm of a characteristic function of a transitional probability
of such process is affine with respect to the initial state
. [
7] conveniently formalises it in terms of their exponential-affine Fourier (for continuous-time) and Laplace (for discrete-time) transforms. The affine relationship is defined by coefficients which are solved by a family of ordinary differential equations (ODEs). These ODEs are the essence of tractability of regular affine processes. [
9] apply the ODEs as time-dependent drivers of the solution to a zero-coupon bond, provided the parameters are admissible. An inverted form of these zero-coupon bond gives rise to a yield as a state variable. They also exploit the idea of a yield-only analysis without including additional economic variables as latent factors.
[
15,
16] are among several authors who have approached the application of ATSMs in discrete-time although they are known to have less popularity compared to their continuous-time counterparts. Earlier models exhibited a tendency of perfectly correlated returns of bonds of all maturities, which is an unrealistic behaviour and unsuitable for hedging; see [
17]. Several authors extended these one-factor Markov representation of a short-rate by introducing a range of multi-factor models with the long-run mean
, and the stochastic volatility
of
that are affine functions of
for which [
1] explores several specifications. [
18] endorse a parsimonious representation of the yield curve matching the time series and cross-sectional variation of bond yields through three-factor models. They develop a simple estimation approach by exploiting the exponential-affine structure of these models; see also [
19] on the stochastic mean and stochastic volatility and three-factor model of the term structure of interest rates and its applications in derivatives pricing and risk management
A specification of an ATSM should be "admissible" and therefore lead to well-defined bond prices. The admissibility property is completely characterised by [
11] in the "canonical" state space
, with a non-negative diagonal matrix. However, this property has a problem of imposing parameter restrictions on the affine process to ensure that it is well defined. One typical scenario is the restriction of parameters to ensure that the conditional variance of a state variable remains non-negative. The requirements for admissibility become more complex as the number of state variables determining conditional variances increase; see [
7]. The admissibility condition ensures that the process does not exit the domain
. A family of
models with a domain
are a common admissible family of models; where
M factors evolve in a positive state space while
evolve in an unrestricted space; see [
20]. [
1] verifies this easily through admissible
factor ATSMs that are uniquely classified into
non-nested subfamilies.
Admissible models should also be canonical, meaning that they are economically identified, and maximally flexible; see [
7]. As a result, the
benchmark ATSM models should have a canonical representation and also satisfy the non-negative and non-explosive solution of ([
21]). Their drift should satisfy a Lipschitz condition, and the diffusion should satisfy the uniqueness condition of [
22]; see [
8]. These conditions have an effect of restricting the correlation structure of the affine diffusions. Exploiting the Gaussian and square-root form of diffusions, there still appear to be non-satisfaction of the regularity conditions of non-explosive growth and uniqueness, giving rise to need for a Feller condition;
1 see [
8]. A multi-dimensional extension of a Feller condition was implemented by [
9], which was found to handle the general correlated affine diffusions. The condition ensures that only positive factors enter the volatility
. This involves restrictions on the state variables that prevent the instantaneous conditional variances
from becoming negative. This condition is sufficient for the existence of a unique solution to the affine SDE according to [
11].
For each of the
subfamilies, there exists a maximal model that is econometrically plausible for all other models within this subfamily. They describe further the maximal models in relation to the
classification; and highlight an interaction within the family of ATSMs between the dependence of the conditional variance of each
on
and the admissible structure of the correlation matrix for
Y. A key advantage of maximal models is that of overcoming the overidentifying restrictions that are imposed on yield curve dynamics; see [
1]. The admissibility property is also confirmed by the no-arbitrage solution for a zero-coupon bond following [
9].
[
1] specification applied the continuous-time approach to the ATSMs which is popular to a majority of empirical literature. They explore the structural differences and relative goodness-of-fits of ATSMs. They refer to a trade-off between flexibility in modeling the conditional correlations and volatilities of the risk factors. They classify a family of
factor affine into
non-nested subfamilies of models. From their three-factor ATSMs, empirical analysis suggests that some subfamilies of ATSMs are better suited than others to explaining historical interest rate behaviour.
The focus of the research is to implement the specifications of [
1] to test the pricing of zero-coupon bonds and forecasting the yield curve dynamics when using the SA bond yield. It also attempts to extract the latent factors from the yield itself, without any consideration for other economic factors; see [
9]. ATSMs are proven to dominate both theoretical and empirical frameworks in term structure modelling; see [
8]. A link between the cross-sectional and time series properties is made consistent by the ATSMs. Evolution of unobserved factors from the risk-neutral dynamics of the yield are proved to have both the drift and the diffusion coefficients as affine functions of such factors by the ATSMs; see [
8]. Several methods of estimation are available and require mostly the knowledge of the joint conditional density of yields. In this study, we follow the estimation method of Fourier inversion for the characteristic function of a state variable, which is assumed to lead to a conditional density. This method leads to a closed-form solution where the maximum likelihood is an efficient estimator.