1 Introduction
One of the well-established benchmarks for evaluating option pricing models is comparing the model-generated Black–Scholes implied volatility surface $(\tau ,\kappa )\hspace{-0.1667em}\mapsto \hspace{-0.1667em}\widehat{\sigma }(\tau ,\kappa )$ with the empirically observed one $(\tau ,\kappa )\mapsto {\widehat{\sigma }_{\text{emp}}}(\tau ,\kappa )$. In this context, τ represents the time to maturity and $\kappa :=\log \frac{K}{{e^{r\tau }}{S_{0}}}$ is the log-moneyness with K denoting the strike, ${S_{0}}$ the current price of an underlying asset and r being the instantaneous interest rate. In particular, for any fixed τ, the values of ${\widehat{\sigma }_{\text{emp}}}(\tau ,\kappa )$ plotted against κ are known to produce convex “smiley” patterns with negative slopes at-the-money (i.e. when $\kappa \approx 0$). Furthermore, as reported in, e.g., [8, 16, 20] or [12, Subsection 2.2], the smile at-the-money becomes progressively steeper as $\tau \to 0$ with a rule-of-thumb behavior
(1)
\[ \bigg|\frac{{\widehat{\sigma }_{\text{emp}}}(\tau ,\kappa )-{\widehat{\sigma }_{\text{emp}}}(\tau ,{\kappa ^{\prime }})}{\kappa -{\kappa ^{\prime }}}\bigg|\propto {\tau ^{-\frac{1}{2}+H}},\hspace{1em}\kappa ,{\kappa ^{\prime }}\approx 0,\hspace{2.5pt}H\in \bigg(0,\frac{1}{2}\bigg).\]
The phenomenon (1) is known as the power law of the at-the-money implied volatility skew, and if one wants to replicate it, one may look for a model with
(2)
\[ {\bigg|\frac{\partial \widehat{\sigma }}{\partial \kappa }(\tau ,\kappa )\bigg|_{\kappa =0}}=O\big({\tau ^{-\frac{1}{2}+H}}\big),\hspace{1em}\tau \to 0.\]
However, it turns out that the property (2) is not easy to obtain: for example, as discussed in [1, Section 7.1] or [23, Remark 11.3.21], classical Brownian diffusion stochastic volatility models fail to produce implied volatilities with the power law (2). In the literature, (2) is usually replicated by introducing a volatility process with very low Hölder regularity within the rough volatility framework popularized by Gatheral, Jaisson and Rosenbaum in their landmark paper [20]. The efficiency of this approach can be explained as follows.
• On the one hand, a theoretical result of Fukasawa [17] suggests that the volatility process cannot be Hölder continuous of high order in continuous arbitrage-free models exhibiting the property (2). In other words, roughness of the volatility is, in some sense, a necessary condition for reproducing (2) (at least in the fully continuous setting).
• On the other hand, as proved in the seminal 2007 paper [1] by Alòs, León and Vives, the short-term explosion (2) of the implied volatility skew can be deduced from the explosion of the Malliavin derivative of volatility. In particular, the latter characteristic is exhibited by fractional Brownian motion with $H\lt 1/2$, a common driver in the rough volatility literature.
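To illustrate how the exponent in (1) is extracted in practice: on data that follows the power law exactly, the least-squares slope of $\log |\text{skew}|$ against $\log \tau$ recovers $H-\frac{1}{2}$. Below is a minimal sketch on synthetic skews; the value of $H$ and the proportionality constant are made up purely for the illustration.

```python
import math

# Synthetic ATM skews following |skew| = c * tau^(H - 1/2) with H = 0.1.
H_true, c = 0.1, 0.4
taus = [1 / 252, 1 / 52, 1 / 12, 1 / 4, 1 / 2, 1.0]
skews = [c * tau ** (H_true - 0.5) for tau in taus]

# Least-squares slope of log|skew| against log(tau) gives H - 1/2.
xs = [math.log(t) for t in taus]
ys = [math.log(s) for s in skews]
n = len(xs)
xbar, ybar = sum(xs) / n, sum(ys) / n
slope = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) \
    / sum((x - xbar) ** 2 for x in xs)
H_est = slope + 0.5
print(H_est)  # recovers H = 0.1 up to floating-point error
```

On real data one would of course regress over noisy skew observations, but the slope-plus-one-half recipe is the same.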
However, despite the ability to reproduce the power law (2), rough volatility models are not perfect. In particular,
– there is no guaranteed procedure of transition between physical and pricing measures: it is not always clear whether the volatility process $\sigma =\{\sigma (t),\hspace{2.5pt}t\in [0,T]\}$ hits zero, and therefore the integral ${\textstyle\int _{0}^{t}}\frac{1}{{\sigma ^{2}}(s)}ds$ that is typically present in martingale densities (see, e.g., [5]) may be poorly defined;
– just like many classical Brownian stochastic volatility models (see, e.g., [2]), they may suffer from moment explosions in price, which results in complications with the pricing of some assets, quadratic hedging, and numerical methods.
For more details on rough volatility, we refer the reader to the recent review [12, Subsection 3.3.2] or the regularly updated literature list on the subject [28].
Recently, a series of papers [9–11] introduced the Sandwiched Volterra Volatility (SVV) model which accounts for all the problems mentioned above. More precisely, the volatility process $Y=\{Y(t),\hspace{2.5pt}t\in [0,T]\}$ is assumed to follow the stochastic differential equation
\[ Y(t)=Y(0)+{\int _{0}^{t}}b\big(s,Y(s)\big)ds+Z(t),\hspace{1em}t\in [0,T],\]
driven by a general Hölder continuous Gaussian Volterra process
\[ Z(t)={\int _{0}^{t}}\mathcal{K}(t,s)dB(s),\hspace{1em}t\in [0,T].\]
The special part of the equation above is the drift b. It is assumed that there are two continuous functions $0\lt \varphi \lt \psi $ such that for some $\varepsilon \gt 0$
\[\begin{array}{r@{\hskip0pt}l@{\hskip0pt}r}\displaystyle b(t,y)& \displaystyle \ge \frac{C}{{(y-\varphi (t))^{\gamma }}},\hspace{2em}& \displaystyle y\in \big(\varphi (t),\varphi (t)+\varepsilon \big),\\ {} \displaystyle b(t,y)& \displaystyle \le -\frac{C}{{(\psi (t)-y)^{\gamma }}},\hspace{2em}& \displaystyle y\in \big(\psi (t)-\varepsilon ,\psi (t)\big).\end{array}\]
Such an explosive nature of the drift, resembling the one in SDEs for Bessel processes (see, e.g., [27, Chapter XI]) or the singular SDEs of [21], ensures that, with probability 1,
\[ \varphi (t)\lt Y(t)\lt \psi (t)\hspace{1em}\text{for all}\hspace{2.5pt}t\in [0,T],\]
which immediately solves the moment explosion problem (see, e.g., [9, Theorem 2.6]) and allows for a transparent transition between physical and pricing measures [9, Subsection 2.2]. In addition, the flexibility in the choice of the kernel $\mathcal{K}$ should allow one to replicate both the long memory and the power law behavior (2). The main goal of this paper is to give a theoretical justification for the latter claim: we prove that, with a correct choice of the Volterra kernel $\mathcal{K}$, the SVV model indeed reproduces (2). In order to do that, we employ the fundamental result [1, Theorem 6.3] by Alòs, León and Vives mentioned above and check that the Malliavin derivative $DY(t)$ indeed exhibits explosive behavior. The difficulty of this approach is as follows. While the first-order Malliavin differentiability of $Y(t)$ is established in [9, Section 3] with
\[ {D_{s}}Y(t)=\mathcal{K}(t,s)+{\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}du,\]
[1, Theorem 6.3] actually demands the existence of the second-order Malliavin derivative. In principle, it is intuitively clear what this derivative should look like:
(3)
\[ \begin{aligned}{}{D_{r}}{D_{s}}Y(t)& ={D_{r}}{\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}du\\ {} & ={\int _{s}^{t}}\mathcal{K}(u,s){D_{r}}\Bigg[{b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\Bigg]du\\ {} & ={\int _{s}^{t}}\mathcal{K}(u,s)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{D_{r}}\big[{b^{\prime }_{y}}\big(u,Y(u)\big)\big]du\\ {} & \hspace{1em}+{\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime }_{y}}\big(u,Y(u)\big){D_{r}}\Bigg[\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\Bigg]du\\ {} & ={\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime\prime }_{yy}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{D_{r}}\big[Y(u)\big]du\\ {} & \hspace{1em}+{\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\times \\ {} & \hspace{2em}\hspace{2em}\times {\int _{u}^{t}}{b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{r}}\big[Y(v)\big]dvdu.\end{aligned}\]
However, justifying the computations in (3) is far from straightforward. For example, the functions $y\mapsto {b^{\prime }_{y}}(t,y)$ and $y\mapsto {b^{\prime\prime }_{yy}}(t,y)$ demonstrate explosive behavior as $y\to \varphi (t)+$ and $y\to \psi (t)-$ for any $t\in [0,T]$. This makes it impossible to use the classical Malliavin chain rules such as [25, Proposition 1.2.3], which requires boundedness of the derivative, or [25, Proposition 1.2.4], which demands the Lipschitz condition. In order to overcome this issue, we have to use some special properties of the volatility process established in [11] and tailor a version of the Malliavin chain rule specifically to our needs.
The paper is organized as follows. In Section 2, we provide some necessary details about the sandwiched volatility process Y. In Section 3, we prove second-order Malliavin differentiability of $Y(t)$. Finally, in Section 4, we use [1, Theorem 6.3] to determine conditions on the kernel under which the SVV model reproduces (2). In Appendix A, we gather some necessary facts from Malliavin calculus, list some of the notation and, in addition, prove a general Malliavin product rule that fits our purposes and that we were not able to find in the literature.
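For concreteness, the benchmark quantities from the beginning of this section, the log-moneyness κ and the Black–Scholes implied volatility, can be computed as in the following minimal sketch (toy inputs; the bisection solver is one of many possible choices):

```python
import math

def bs_call(S0, K, r, tau, sigma):
    """Black-Scholes price of a European call."""
    d1 = (math.log(S0 / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    N = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    return S0 * N(d1) - K * math.exp(-r * tau) * N(d2)

def implied_vol(price, S0, K, r, tau):
    """Invert the Black-Scholes formula in sigma by bisection."""
    lo, hi = 1e-6, 5.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if bs_call(S0, K, r, tau, mid) < price:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

S0, K, r, tau, sigma = 100.0, 105.0, 0.02, 0.5, 0.3
kappa = math.log(K / (math.exp(r * tau) * S0))   # log-moneyness
price = bs_call(S0, K, r, tau, sigma)
print(kappa, implied_vol(price, S0, K, r, tau))  # recovers sigma = 0.3
```

Inverting market prices strike by strike in this way is what produces the empirical surface ${\widehat{\sigma }_{\text{emp}}}(\tau ,\kappa )$ discussed above.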
2 Preliminaries on sandwiched processes
In this section, we gather all the necessary details about the main object of our study: the class of sandwiched processes driven by Hölder-continuous Gaussian Volterra noises.
Fix some $T\in (0,\infty )$ and consider a kernel $\mathcal{K}:{[0,T]^{2}}\to \mathbb{R}$ satisfying the following assumptions.
Assumption 1.
Let $B=\{B(t),\hspace{2.5pt}t\in [0,T]\}$ be a standard Brownian motion. Assumption 1 allows one to define a Gaussian Volterra process
and, moreover, Assumption 1(K2) together with [3, Theorem 1 and Corollary 4] implies that Z has a modification with Hölder continuous trajectories of any order $\lambda \in (0,H)$. In what follows, we always use this modification of Z: in other words, with probability 1, for any $\lambda \in (0,H)$ there exists a random variable $\Lambda =\Lambda (\lambda )\gt 0$ such that for all $0\le {t_{1}}\le {t_{2}}\le T$
(6)
\[ \big|Z({t_{2}})-Z({t_{1}})\big|\le \Lambda {({t_{2}}-{t_{1}})^{\lambda }}.\]
Furthermore, as stated in [3, Theorem 1], the random variable Λ from (6) can be chosen such that
(7)
\[ \mathbb{E}\big[{\Lambda ^{r}}\big]\lt \infty \hspace{1em}\text{for all}\hspace{2.5pt}r\in \mathbb{R}.\]
In what follows, we assume that (7) always holds. Next, denote
(8)
\[ \begin{aligned}{}\mathcal{D}& :=\big\{(t,y)\in [0,T]\times \mathbb{R}\hspace{2.5pt}|\hspace{2.5pt}\varphi (t)\lt y\lt \psi (t)\big\},\\ {} \overline{\mathcal{D}}& :=\big\{(t,y)\in [0,T]\times \mathbb{R}\hspace{2.5pt}|\hspace{2.5pt}\varphi (t)\le y\le \psi (t)\big\}.\end{aligned}\]
Take $H\in (0,1)$ from Assumption 1(K2), consider two H-Hölder continuous functions φ, ψ: $[0,T]\to \mathbb{R}$ such that
\[ \varphi (t)\lt \psi (t)\hspace{1em}\text{for all}\hspace{2.5pt}t\in [0,T],\]
and define a function b: $\mathcal{D}\to \mathbb{R}$ as
(9)
\[ b(t,y):=\frac{{\theta _{1}}(t)}{{(y-\varphi (t))^{{\gamma _{1}}}}}-\frac{{\theta _{2}}(t)}{{(\psi (t)-y)^{{\gamma _{2}}}}}+a(t,y),\]
where the coefficients in (9) satisfy the following assumption.
Assumption 2.
The constants ${\gamma _{1}}$, ${\gamma _{2}}\gt 0$ and functions ${\theta _{1}}$, ${\theta _{2}}$, a are such that
(B1) ${\gamma _{1}}\gt \frac{1}{H}-1$, ${\gamma _{2}}\gt \frac{1}{H}-1$ with $H\in (0,1)$ being from Assumption 1(K2);
(B2) the functions ${\theta _{1}}$, ${\theta _{2}}$: $[0,T]\to \mathbb{R}$ are strictly positive and continuous;
(B3) the function a: $[0,T]\times \mathbb{R}\to \mathbb{R}$ is locally Lipschitz in y uniformly in t, i.e. for any $N\gt 0$ there exists a constant ${C_{N}}\gt 0$ that does not depend on t such that
\[ \big|a(t,{y_{1}})-a(t,{y_{2}})\big|\le {C_{N}}|{y_{1}}-{y_{2}}|,\hspace{1em}t\in [0,T],\hspace{2.5pt}{y_{1}},{y_{2}}\in [-N,N];\]
(B4) a: $[0,T]\times \mathbb{R}\to \mathbb{R}$ is twice differentiable w.r.t. the spatial variable y with a, ${a^{\prime }_{y}}$, ${a^{\prime\prime }_{yy}}$ all being continuous on $[0,T]\times \mathbb{R}$.
Remark 2.
Note that ${b^{\prime }_{y}}$ is bounded from above on $\mathcal{D}$: indeed,
\[\begin{aligned}{}{b^{\prime }_{y}}(t,y)& =-\frac{{\gamma _{1}}{\theta _{1}}(t)}{{(y-\varphi (t))^{{\gamma _{1}}+1}}}-\frac{{\gamma _{2}}{\theta _{2}}(t)}{{(\psi (t)-y)^{{\gamma _{2}}+1}}}+{a^{\prime }_{y}}(t,y)\\ {} & \lt \underset{(t,y)\in \overline{\mathcal{D}}}{\max }{a^{\prime }_{y}}(t,y)\lt \infty .\end{aligned}\]
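The repelling behavior of the drift (9) and the upper bound on ${b^{\prime }_{y}}$ from Remark 2 are easy to visualize numerically. Below is a toy implementation with hypothetical coefficients (constant bounds $\varphi \equiv 0$, $\psi \equiv 1$, ${\theta _{1}}={\theta _{2}}=1$, ${\gamma _{1}}={\gamma _{2}}=3$, $a(t,y)=0.5y$), chosen purely for illustration:

```python
# Toy implementation of the drift (9): phi = 0, psi = 1, theta1 = theta2 = 1,
# gamma1 = gamma2 = 3, a(t, y) = 0.5 * y (all hypothetical choices).
phi, psi = 0.0, 1.0
theta1 = theta2 = 1.0
gamma1 = gamma2 = 3.0
a = lambda t, y: 0.5 * y
a_y = lambda t, y: 0.5        # derivative of a in y

def b(t, y):
    return (theta1 / (y - phi) ** gamma1
            - theta2 / (psi - y) ** gamma2 + a(t, y))

def b_y(t, y):
    return (-gamma1 * theta1 / (y - phi) ** (gamma1 + 1)
            - gamma2 * theta2 / (psi - y) ** (gamma2 + 1) + a_y(t, y))

# b explodes to +infinity near phi and to -infinity near psi, while b_y
# stays bounded above by max a'_y = 0.5, as in Remark 2.
ys = [0.001 + i * 0.998 / 999 for i in range(1000)]
print(b(0.0, 0.001), b(0.0, 0.999))   # strongly inward-pointing drift
print(max(b_y(0.0, y) for y in ys))   # bounded above by 0.5
```

The drift is huge and positive just above φ, huge and negative just below ψ, which is exactly the mechanism that keeps Y sandwiched.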
Finally, fix $\varphi (0)\lt {y_{0}}\lt \psi (0)$ and consider a stochastic differential equation of the form
(10)
\[ Y(t)={y_{0}}+{\int _{0}^{t}}b\big(s,Y(s)\big)ds+{\int _{0}^{t}}\mathcal{K}(t,s)dB(s),\hspace{1em}t\in [0,T].\]
By [11, Theorem 4.1], under Assumptions 1 and 2, the SDE (10) has a unique strong solution $Y=\{Y(t),\hspace{2.5pt}t\in [0,T]\}$. Moreover, with probability 1,
(11)
\[ \varphi (t)\lt Y(t)\lt \psi (t)\hspace{1em}\text{for all}\hspace{2.5pt}t\in [0,T].\]
In what follows, we will need to analyze the behavior of the stochastic processes $|b(t,Y(t))|$, $|{b^{\prime }_{y}}(t,Y(t))|$ and $|{b^{\prime\prime }}(t,Y(t))|$, $t\in [0,T]$. In this regard, the property (11) alone is not sufficient: the process Y can, in principle, approach the bounds φ and ψ which results in an explosive growth of the processes mentioned above. Luckily, [11, Theorem 4.2] provides a refinement of (11) allowing for a more precise control of Y near φ and ψ. We give a slightly reformulated version of this result below.
Theorem 1.
Let Assumptions 1 and 2 hold and $\lambda \in (0,H)$, $\Lambda =\Lambda (\lambda )\gt 0$ be from (6). Then there exist deterministic constants ${C_{Y}}={C_{Y}}(\lambda )\gt 0$ and $\beta =\beta (\lambda )\gt 0$ such that
\[ \varphi (t)+\frac{{C_{Y}}}{{(1+\Lambda )^{\beta }}}\le Y(t)\le \psi (t)-\frac{{C_{Y}}}{{(1+\Lambda )^{\beta }}}\hspace{1em}\textit{for all}\hspace{2.5pt}t\in [0,T].\]
In particular, since Λ can be chosen to have moments of all orders, for all $r\ge 0$
\[ \mathbb{E}\Bigg[\underset{t\in [0,T]}{\sup }\frac{1}{{(Y(t)-\varphi (t))^{r}}}\Bigg]\lt \infty \hspace{1em}\text{and}\hspace{1em}\mathbb{E}\Bigg[\underset{t\in [0,T]}{\sup }\frac{1}{{(\psi (t)-Y(t))^{r}}}\Bigg]\lt \infty .\]
We finalize this section by citing the first-order Malliavin differentiability result for the sandwiched process (10) proved in [9, Section 3].
Theorem 2.
Let Assumptions 1 and 2 hold. Then, for any $t\in [0,T]$, $Y(t)\in {\mathbb{D}^{1,2}}$ and, for a.a. $s\in [0,T]$,
(12)
\[ {D_{s}}Y(t)=\mathcal{K}(t,s)+{\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}du.\]
Remark 4.
The result above actually holds for more general drifts than the one given in (9). The same is also, in principle, true for the results of the subsequent sections. Namely, it would be sufficient to assume that there exist deterministic constants $c\gt 0$, $r\gt 0$, $\gamma \gt \frac{1}{H}-1$ and $0\lt {y_{\ast }}\lt {\max _{t\in [0,T]}}|\psi (t)-\varphi (t)|$ such that
• b: $\mathcal{D}\to \mathbb{R}$ is continuous on $\mathcal{D}$ and has continuous partial derivatives ${b^{\prime }_{y}}$, ${b^{\prime\prime }_{yy}}$;
• for any $0\lt \varepsilon \lt \frac{1}{2}{\max _{t\in [0,T]}}|\psi (t)-\varphi (t)|$,
• b has an explosive growth to ∞ near φ and explosive decay to $-\infty $ near ψ of order $\gamma \gt \frac{1}{H}-1$, i.e.\[\begin{array}{r@{\hskip0pt}l@{\hskip0pt}r}\displaystyle b(t,y)& \displaystyle \ge \frac{c}{{(y-\varphi (t))^{\gamma }}},\hspace{2em}& \displaystyle y\in \big(\varphi (t),\varphi (t)+{y_{\ast }}\big),\\ {} \displaystyle b(t,y)& \displaystyle \le -\frac{c}{{(\psi (t)-y)^{\gamma }}},\hspace{2em}& \displaystyle y\in \big(\psi (t)-{y_{\ast }},\psi (t)\big);\end{array}\]
However, since (9) is the most natural choice satisfying these assumptions, we stick to this shape for notational convenience.
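For intuition, a naive Euler discretization of a sandwiched process with a fractional-type kernel shows the sandwiching in action. This is a rough illustrative sketch, not a convergent scheme from the cited works; the drift cap and the clamp are ad hoc numerical safeguards against overshooting, and all parameter choices are hypothetical:

```python
import math, random

# Naive Euler sketch of a sandwiched process: toy drift with phi = 0, psi = 1,
# theta1 = theta2 = 1, gamma1 = gamma2 = 3, a = 0, and a scaled
# fractional-type kernel K(t, s) = 0.1 * (t - s)^(H - 1/2).
random.seed(1)
H, n, T = 0.3, 500, 1.0
dt = T / n
K = lambda t, s: 0.1 * (t - s) ** (H - 0.5)
b = lambda y: 1.0 / y ** 3 - 1.0 / (1.0 - y) ** 3

dB = [random.gauss(0.0, math.sqrt(dt)) for _ in range(n)]
# Z[i] approximates the Volterra integral of K(t_{i+1}, .) against dB
Z = [sum(K((i + 1) * dt, j * dt) * dB[j] for j in range(i + 1))
     for i in range(n)]

Y = [0.5]
zprev = 0.0
for i in range(n):
    drift = max(min(b(Y[-1]) * dt, 0.2), -0.2)   # tamed drift step (safeguard)
    y = Y[-1] + drift + (Z[i] - zprev)
    Y.append(min(max(y, 1e-3), 1.0 - 1e-3))      # clamp (safeguard)
    zprev = Z[i]

print(min(Y), max(Y))   # the path stays strictly inside (0, 1)
```

Even with the safeguards removed and a finer grid, the repelling drift keeps the simulated path away from the bounds with overwhelming probability, which is the qualitative content of Theorem 1 below.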
3 Second-order Malliavin differentiability
Let Assumptions 1 and 2 hold and $Y=\{Y(t),\hspace{2.5pt}t\in [0,T]\}$ be the sandwiched process defined by (10) with the drift (9).
Notation.
Here and in the sequel, C will denote any positive deterministic constant the exact value of which is not relevant. Note that C may change from line to line (or even within one line).
The main goal of this section is to establish the second-order Malliavin differentiability of the sandwiched process (10) and compute the corresponding derivative explicitly. As mentioned above, the main difficulty lies in controlling the behavior of $b(t,Y(t))$, ${b^{\prime }_{y}}(t,Y(t))$ and ${b^{\prime\prime }_{yy}}(t,Y(t))$ whenever $Y(t)$ approaces the bounds. Luckily, Theorem 1 gives all the necessary tools to do that as summarized in the following proposition.
Proposition 1.
Let Assumptions 1 and 2 hold. Then there exists a random variable ξ with $\mathbb{E}[{\xi ^{r}}]\lt \infty $ for all $r\gt 0$ such that, with probability 1, for all $t\in [0,T]$
\[ \big|b\big(t,Y(t)\big)\big|\le \xi ,\hspace{1em}\big|{b^{\prime }_{y}}\big(t,Y(t)\big)\big|\le \xi ,\hspace{1em}\big|{b^{\prime\prime }_{yy}}\big(t,Y(t)\big)\big|\le \xi .\]
Proof.
Fix $\lambda \in (0,H)$ and take the corresponding $\Lambda \gt 0$ from (6) and ${C_{Y}},\beta \gt 0$ being from Theorem 1. Then
\[\begin{aligned}{}\big|b\big(t,Y(t)\big)\big|& \le \frac{|{\theta _{1}}(t)|}{{(Y(t)-\varphi (t))^{{\gamma _{1}}}}}+\frac{|{\theta _{2}}(t)|}{{(\psi (t)-Y(t))^{{\gamma _{2}}}}}+\big|a\big(t,Y(t)\big)\big|\\ {} & \le \frac{{\sup _{t\in [0,T]}}|{\theta _{1}}(t)|{(1+\Lambda )^{\beta {\gamma _{1}}}}}{{C_{Y}^{{\gamma _{1}}}}}\\ {} & \hspace{1em}+\frac{{\sup _{t\in [0,T]}}|{\theta _{2}}(t)|{(1+\Lambda )^{\beta {\gamma _{2}}}}}{{C_{Y}^{{\gamma _{2}}}}}\\ {} & \hspace{1em}+\underset{(t,y)\in \mathcal{D}}{\sup }\big|a(t,y)\big|\\ {} & :={\xi _{0}},\\ {} \big|{b^{\prime }_{y}}\big(t,Y(t)\big)\big|& \le \frac{{\gamma _{1}}|{\theta _{1}}(t)|}{{(Y(t)-\varphi (t))^{{\gamma _{1}}+1}}}+\frac{{\gamma _{2}}|{\theta _{2}}(t)|}{{(\psi (t)-Y(t))^{{\gamma _{2}}+1}}}+\big|{a^{\prime }_{y}}\big(t,Y(t)\big)\big|\\ {} & \le \frac{{\gamma _{1}}{\sup _{t\in [0,T]}}|{\theta _{1}}(t)|{(1+\Lambda )^{\beta ({\gamma _{1}}+1)}}}{{C_{Y}^{{\gamma _{1}}+1}}}\\ {} & \hspace{1em}+\frac{{\gamma _{2}}{\sup _{t\in [0,T]}}|{\theta _{2}}(t)|{(1+\Lambda )^{\beta ({\gamma _{2}}+1)}}}{{C_{Y}^{{\gamma _{2}}+1}}}\\ {} & \hspace{1em}+\underset{(t,y)\in \mathcal{D}}{\sup }\big|{a^{\prime }_{y}}(t,y)\big|\\ {} & :={\xi _{1}},\\ {} \big|{b^{\prime\prime }_{yy}}\big(t,Y(t)\big)\big|& \le \frac{{\gamma _{1}}({\gamma _{1}}+1)|{\theta _{1}}(t)|}{{(Y(t)-\varphi (t))^{{\gamma _{1}}+2}}}+\frac{{\gamma _{2}}({\gamma _{2}}+1)|{\theta _{2}}(t)|}{{(\psi (t)-Y(t))^{{\gamma _{2}}+2}}}+\big|{a^{\prime\prime }_{yy}}\big(t,Y(t)\big)\big|\\ {} & \le \frac{{\gamma _{1}}({\gamma _{1}}+1){\sup _{t\in [0,T]}}|{\theta _{1}}(t)|{(1+\Lambda )^{\beta ({\gamma _{1}}+2)}}}{{C_{Y}^{{\gamma _{1}}+2}}}\\ {} & \hspace{1em}+\frac{{\gamma _{2}}({\gamma _{2}}+1){\sup _{t\in [0,T]}}|{\theta _{2}}(t)|{(1+\Lambda )^{\beta ({\gamma _{2}}+2)}}}{{C_{Y}^{{\gamma _{2}}+2}}}\\ {} & \hspace{1em}+\underset{(t,y)\in \mathcal{D}}{\sup }\big|{a^{\prime\prime }_{yy}}(t,y)\big|\\ {} & :={\xi _{2}}.\end{aligned}\]
Note that ${\xi _{0}}$, ${\xi _{1}}$ and ${\xi _{2}}$ have moments of all orders by the properties of Λ, see (7), and hence, putting
\[ \xi :={\xi _{0}}+{\xi _{1}}+{\xi _{2}},\]
we obtain the required result. □
As noted in Theorem 2, $Y(t)\in {\mathbb{D}^{1,2}}$ for each $t\ge 0$. In fact, Proposition 1 together with the shape (12) of the derivative allows one to establish a more general result.
Proposition 2.
For any $t\in [0,T]$ and $p\gt 1$, $Y(t)\in {\mathbb{D}^{1,p}}$.
Proof.
Note that, by (11), $\mathbb{E}[|Y(t){|^{p}}]\lt \infty $ for any $p\gt 1$, so, by Lemma 1 from the Appendix, it is sufficient to prove that
\[ \mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]\lt \infty \]
for any $p\gt 1$. Note that, by Remark 2,
\[ {b^{\prime }_{y}}\big(v,Y(v)\big)\le c,\hspace{1em}v\in [0,T],\]
where
\[ c:=\max \Big\{\underset{(t,y)\in \overline{\mathcal{D}}}{\max }{a^{\prime }_{y}}(t,y),\hspace{2.5pt}0\Big\},\]
and, by Proposition 1, there exists a random variable ξ having all moments such that
\[ \underset{u\in [0,T]}{\sup }\big|{b^{\prime }_{y}}\big(u,Y(u)\big)\big|\le \xi .\]
Hence
(13)
\[ \begin{aligned}{}\big|{D_{s}}& Y(t)\big|\\ {} & \le \big|\mathcal{K}(t,s)\big|+{\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\big|{b^{\prime }_{y}}\big(u,Y(u)\big)\big|\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}du\\ {} & \le \big|\mathcal{K}(t,s)\big|+\xi \exp \{cT\}{\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|du.\end{aligned}\]
By Assumption 1 and Remark 1,
\[ \underset{t\in [0,T]}{\sup }{\int _{0}^{T}}{\mathcal{K}^{2}}(t,s)ds\lt \infty ,\]
therefore
(14)
\[ \begin{aligned}{}\mathbb{E}& \Bigg[{\Bigg({\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \le C{\Bigg({\int _{0}^{T}}{\mathcal{K}^{2}}(t,s)ds\Bigg)^{\frac{p}{2}}}\\ {} & \hspace{1em}+C\mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\int _{0}^{t}}{\mathcal{K}^{2}}(u,s){\big({b^{\prime }_{y}}\big(u,Y(u)\big)\big)^{2}}\exp \Bigg\{2{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}duds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \le C{\Bigg({\int _{0}^{T}}{\mathcal{K}^{2}}(t,s)ds\Bigg)^{\frac{p}{2}}}+C\mathbb{E}\big[{\xi ^{p}}\big]\exp \{pcT\}{\Bigg({\int _{0}^{T}}{\int _{0}^{t}}{\mathcal{K}^{2}}(u,s)duds\Bigg)^{\frac{p}{2}}}\\ {} & \lt \infty ,\end{aligned}\]
which ends the proof. □
Our next goal is to establish the Malliavin chain rule for the random variables ${b^{\prime }_{y}}(t,Y(t))$ and $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}$.
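As an aside, the variation-of-constants structure of the first-order derivative formula (12) can be sanity-checked numerically: if ${b^{\prime }_{y}}(\cdot ,Y(\cdot ))$ is frozen at a constant value lam (a toy simplification, not the model itself), the closed-form expression solves the linear Volterra equation $D_{s}Y(t)=\mathcal{K}(t,s)+\mathrm{lam}\int _{s}^{t}D_{s}Y(u)du$. All parameters below are arbitrary.

```python
import math

# Toy check: with constant b'_y = lam, the closed form
#   D(t) = K(t, s) + lam * integral_s^t K(u, s) * exp(lam * (t - u)) du
# solves the linear Volterra equation D(t) = K(t, s) + lam * integral_s^t D(u) du.
H, lam, s, t, n = 0.7, -0.7, 0.2, 1.0, 20000
K = lambda a, c: (a - c) ** (H - 0.5)
h = (t - s) / n

# closed form, midpoint rule for the integral
closed = K(t, s) + lam * h * sum(
    K(s + (i + 0.5) * h, s) * math.exp(lam * (t - s - (i + 0.5) * h))
    for i in range(n))

# direct forward solve of the Volterra equation on the same grid
D, run = [0.0] * (n + 1), 0.0
for i in range(1, n + 1):
    run += D[i - 1] * h                  # left-endpoint quadrature of the integral
    D[i] = K(s + i * h, s) + lam * run

print(closed, D[n])   # the two values agree up to discretization error
```

In the actual model ${b^{\prime }_{y}}(u,Y(u))$ is random and unbounded near the lateral bounds, which is precisely why the rigorous argument below requires the tailored chain rule rather than this naive picture.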
Proposition 3.
Let $0\le u\lt t\le T$ and $p\gt 1$. Then ${b^{\prime }_{y}}(t,Y(t))\in {\mathbb{D}^{1,p}}$ and $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}\in {\mathbb{D}^{1,p}}$ with
(15)
\[ {D_{s}}\big[{b^{\prime }_{y}}\big(t,Y(t)\big)\big]={b^{\prime\prime }_{yy}}\big(t,Y(t)\big){D_{s}}Y(t)\]
and
(16)
\[ {D_{s}}\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}=\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{\int _{u}^{t}}{b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{s}}Y(v)dv.\]
Proof.
1) We start by proving that ${b^{\prime }_{y}}(t,Y(t))\in {\mathbb{D}^{1,p}}$. Note that ${b^{\prime }_{y}}$ is not a bounded function itself and does not have bounded derivatives, so the classical chain rule from [25, Section 1.2] cannot be applied here in a straightforward manner. In order to overcome this issue, we use an approach in the spirit of [26, Lemma A.1] or [9, Proposition 3.4]. For the reader’s convenience, we divide the proof into steps.
Step 0. First of all, observe that ${b^{\prime }_{y}}(t,Y(t))\in {L^{2}}(\Omega )$ as a direct consequence of Proposition 1. Also, for any $p\gt 1$,
\[ \mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\big({b^{\prime\prime }_{yy}}\big(t,Y(t)\big){D_{s}}Y(t)\big)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]\lt \infty .\]
Indeed, again by Proposition 1 together with the proof of Proposition 2, we have
\[\begin{aligned}{}\mathbb{E}& \Bigg[{\Bigg({\int _{0}^{T}}{\big({b^{\prime\prime }_{yy}}\big(t,Y(t)\big){D_{s}}Y(t)\big)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \le \mathbb{E}\Bigg[{\xi ^{p}}{\Bigg({\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \le {\big(\mathbb{E}\big[{\xi ^{2p}}\big]\big)^{\frac{1}{2}}}{\Bigg(\mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\Bigg)^{p}}\Bigg]\Bigg)^{\frac{1}{2}}}\\ {} & \lt \infty .\end{aligned}\]
Therefore, by Lemma 1, it is sufficient to prove that ${b^{\prime }_{y}}(t,Y(t))\in {\mathbb{D}^{1,2}}$ with (15) being the corresponding Malliavin derivative.
Step 1. Let $\phi \in {C^{1}}(\mathbb{R})$ be a compactly supported function such that $\phi (x)=x$ whenever $|x|\le 1$ and $|\phi (x)|\le |x|$ for all $|x|\gt 1$. Fix $t\in [0,T]$ and, for $m\ge 1$, put
\[ {f_{m}}(y):=m\phi \bigg(\frac{{b^{\prime }_{y}}(t,y)}{m}\bigg),\hspace{1em}y\in \big(\varphi (t),\psi (t)\big).\]
Observe that
\[ {f^{\prime }_{m}}(y)={b^{\prime\prime }_{yy}}(t,y){\phi ^{\prime }}\bigg(\frac{{b^{\prime }_{y}}(t,y)}{m}\bigg)\]
is bounded. Indeed, let $0\lt {\varepsilon _{m}}\lt \psi (t)-\varphi (t)$ be such that
\[ -\frac{{\gamma _{1}}{\theta _{1}}(t)}{{\varepsilon _{m}^{{\gamma _{1}}+1}}}+\underset{\varphi (t)\le x\le \psi (t)}{\max }{a^{\prime }_{y}}(t,x)\lt m\inf \operatorname{supp}\phi \]
and
\[ -\frac{{\gamma _{2}}{\theta _{2}}(t)}{{\varepsilon _{m}^{{\gamma _{2}}+1}}}+\underset{\varphi (t)\le x\le \psi (t)}{\max }{a^{\prime }_{y}}(t,x)\lt m\inf \operatorname{supp}\phi .\]
Then,
• if $y\in (\varphi (t),\varphi (t)+{\varepsilon _{m}})$, then\[\begin{aligned}{}{b^{\prime }_{y}}(t,y)& =-\frac{{\gamma _{1}}{\theta _{1}}(t)}{{(y-\varphi (t))^{{\gamma _{1}}+1}}}-\frac{{\gamma _{2}}{\theta _{2}}(t)}{{(\psi (t)-y)^{{\gamma _{2}}+1}}}+{a^{\prime }_{y}}(t,y)\\ {} & \le -\frac{{\gamma _{1}}{\theta _{1}}(t)}{{\varepsilon _{m}^{{\gamma _{1}}+1}}}+\underset{\varphi (t)\le x\le \psi (t)}{\max }{a^{\prime }_{y}}(t,x)\\ {} & \lt m\inf \operatorname{supp}\phi ,\end{aligned}\]so $\frac{{b^{\prime }_{y}}(t,y)}{m}\notin \operatorname{supp}\phi $, ${f_{m}}(y)=0$ and ${f^{\prime }_{m}}(y)=0$;
• if $y\in (\psi (t)-{\varepsilon _{m}},\psi (t))$, then, similarly,\[\begin{aligned}{}{b^{\prime }_{y}}(t,y)& =-\frac{{\gamma _{1}}{\theta _{1}}(t)}{{(y-\varphi (t))^{{\gamma _{1}}+1}}}-\frac{{\gamma _{2}}{\theta _{2}}(t)}{{(\psi (t)-y)^{{\gamma _{2}}+1}}}+{a^{\prime }_{y}}(t,y)\\ {} & \le -\frac{{\gamma _{2}}{\theta _{2}}(t)}{{\varepsilon _{m}^{{\gamma _{2}}+1}}}+\underset{\varphi (t)\le x\le \psi (t)}{\max }{a^{\prime }_{y}}(t,x)\\ {} & \lt m\inf \operatorname{supp}\phi ,\end{aligned}\]so $\frac{{b^{\prime }_{y}}(t,y)}{m}\notin \operatorname{supp}\phi $, ${f_{m}}(y)=0$ and ${f^{\prime }_{m}}(y)=0$;
• on the compact set $[\varphi (t)+{\varepsilon _{m}},\psi (t)-{\varepsilon _{m}}]$, both ${f_{m}}$ and its derivative ${f^{\prime }_{m}}$ are continuous and hence bounded.
Therefore, the function ${f_{m}}$ satisfies the conditions of the classical Malliavin chain rule [25, Proposition 1.2.3], so ${f_{m}}(Y(t))\in {\mathbb{D}^{1,2}}$ and, with probability 1 for a.a. $s\in [0,T]$,
\[ {D_{s}}{f_{m}}\big(Y(t)\big)={b^{\prime\prime }_{yy}}\big(t,Y(t)\big){\phi ^{\prime }}\bigg(\frac{{b^{\prime }_{y}}(t,Y(t))}{m}\bigg){D_{s}}Y(t).\]
Now it remains to prove that
\[ {f_{m}}\big(Y(t)\big)\to {b^{\prime }_{y}}\big(t,Y(t)\big)\]
in ${L^{2}}(\Omega )$ and
\[ D{f_{m}}\big(Y(t)\big)\to {b^{\prime\prime }_{yy}}\big(t,Y(t)\big)DY(t)\]
in ${L^{2}}(\Omega \times [0,T])$ as $m\to \infty $; then the result will follow immediately from the closedness of the Malliavin derivative operator D.
Step 2: ${f_{m}}(Y(t))\to {b^{\prime }_{y}}(t,Y(t))$ in ${L^{2}}(\Omega )$ as $m\to \infty $. By the definitions of ${f_{m}}$ and ϕ, ${f_{m}}(Y(t))\to {b^{\prime }_{y}}(t,Y(t))$ a.s. as $m\to \infty $. Moreover, with probability 1, $|{f_{m}}(Y(t))|\le |{b^{\prime }_{y}}(t,Y(t))|\in {L^{2}}(\Omega )$ and hence the required convergence follows from the dominated convergence theorem.
Step 3: $D{f_{m}}(Y(t))\to {b^{\prime\prime }_{yy}}(t,Y(t))DY(t)$ in ${L^{2}}(\Omega \times [0,T])$ as $m\to \infty $. By the definitions of ${f_{m}}$ and ϕ, with probability 1,
\[\begin{aligned}{}& {\bigg({b^{\prime\prime }_{yy}}\big(t,Y(t)\big){\phi ^{\prime }}\bigg(\frac{{b^{\prime }_{y}}(t,Y(t))}{m}\bigg)\bigg)^{2}}{\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\\ {} & \hspace{1em}\to {\big({b^{\prime\prime }_{yy}}\big(t,Y(t)\big)\big)^{2}}{\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\end{aligned}\]
as $m\to \infty $. Moreover, since ϕ has compact support, ${\max _{y\in \mathbb{R}}}{({\phi ^{\prime }}(y))^{2}}\lt \infty $, so we can write
\[\begin{aligned}{}{\int _{0}^{T}}{\big({D_{s}}{f_{m}}\big(Y(t)\big)\big)^{2}}ds& ={\bigg({b^{\prime\prime }_{yy}}\big(t,Y(t)\big){\phi ^{\prime }}\bigg(\frac{{b^{\prime }_{y}}(t,Y(t))}{m}\bigg)\bigg)^{2}}{\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\\ {} & \le \underset{y\in \mathbb{R}}{\max }{\big({\phi ^{\prime }}(y)\big)^{2}}{\big({b^{\prime\prime }_{yy}}\big(t,Y(t)\big)\big)^{2}}{\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\in {L^{2}}(\Omega ).\end{aligned}\]
Therefore, by the dominated convergence theorem,
\[ D{f_{m}}\big(Y(t)\big)\to {b^{\prime\prime }_{yy}}\big(t,Y(t)\big)DY(t)\hspace{1em}\text{in}\hspace{2.5pt}{L^{2}}(\Omega \times [0,T]),\]
which proves the first claim of the Proposition.
2) Let us proceed with the second claim and verify that
\[ \exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\in {\mathbb{D}^{1,p}}\]
with (16) being the corresponding Malliavin derivative. Note that, since ${b^{\prime }_{y}}$ is bounded from above, $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}$ is also bounded from above and hence is an element of ${L^{p}}(\Omega )$ for any $p\gt 1$. Moreover, by Proposition 1, boundedness of $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}$ and (13), we can write
\[\begin{aligned}{}\mathbb{E}& \Bigg[{\Bigg({\int _{0}^{T}}{\Bigg(\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{\int _{u}^{t}}{b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{s}}Y(v)dv\Bigg)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \le C\mathbb{E}\Bigg[{\xi ^{p}}{\Bigg({\int _{0}^{T}}{\int _{u}^{t}}{\big({D_{s}}Y(v)\big)^{2}}dvds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \le C\mathbb{E}\Bigg[{\xi ^{p}}{\Bigg({\int _{0}^{T}}{\int _{u}^{t}}{\mathcal{K}^{2}}(v,s)dvds\Bigg)^{\frac{p}{2}}}\Bigg]\\ {} & \hspace{1em}+C\exp \{pcT\}\mathbb{E}\big[{\xi ^{2p}}\big]{\Bigg({\int _{0}^{T}}{\int _{u}^{t}}{\int _{s}^{v}}{\mathcal{K}^{2}}(u,s)dudvds\Bigg)^{\frac{p}{2}}}\\ {} & \lt \infty ,\end{aligned}\]
and hence it is sufficient to prove that $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}\in {\mathbb{D}^{1,2}}$.
Since the Malliavin derivative operator D is closed and the expression ${\textstyle\int _{u}^{t}}{b^{\prime\prime }_{yy}}(v,Y(v)){D_{s}}Y(v)dv$ is well-defined by Proposition 1, Step 1 of the current proof and Hille’s theorem [22, Theorem 1.2.4] guarantee that
\[ {\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\in {\mathbb{D}^{1,2}}\]
and
\[ {D_{s}}{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv={\int _{u}^{t}}{b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{s}}Y(v)dv.\]
Finally, the function $x\mapsto {e^{x}}$ satisfies the conditions of the chain rule from [9, Proposition 3.4] and hence $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}\in {\mathbb{D}^{1,2}}$ and (16) holds. □
Corollary 1.
For any $0\le s\lt t\le T$ and $p\gt 1$,
\[ {b^{\prime }_{y}}\big(s,Y(s)\big)\exp \Bigg\{{\int _{s}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\in {\mathbb{D}^{1,p}}\]
and
(17)
\[ \begin{aligned}{}{D_{u}}& \Bigg[{b^{\prime }_{y}}\big(s,Y(s)\big)\exp \Bigg\{{\int _{s}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\Bigg]\\ {} & ={b^{\prime\prime }_{yy}}\big(s,Y(s)\big)\exp \Bigg\{{\int _{s}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{D_{u}}Y(s)\\ {} & \hspace{1em}+{b^{\prime }_{y}}\big(s,Y(s)\big)\exp \Bigg\{{\int _{s}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{\int _{s}^{t}}{b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{u}}Y(v)dv.\end{aligned}\]Proof.
For fixed $0\le s\lt t\le T$, denote
\[ {X_{1}}:={b^{\prime }_{y}}\big(s,Y(s)\big),\hspace{1em}{X_{2}}:=\exp \Bigg\{{\int _{s}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}.\]
By Proposition 3 and Lemma 2 from the Appendix, it is sufficient to check that for all $p\ge 2$
All conditions (i)–(iii) can be checked in a straightforward manner using Proposition 1 and arguments similar to the proof of Proposition 2. □
We are now ready to formulate the main result of this section.
Theorem 3.
For any $t\in [0,T]$ and $p\ge 2$, $Y(t)\in {\mathbb{D}^{2,p}}$.
Proof.
Our goal is to prove that $Y(t)\in {\mathbb{D}^{2,p}}$ and
(18)
\[ \begin{aligned}{}{D_{r}}{D_{s}}Y(t)& ={\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime\prime }_{yy}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{D_{r}}\big[Y(u)\big]du\\ {} & \hspace{1em}+{\int _{s}^{t}}\mathcal{K}(u,s){b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{\int _{u}^{t}}{b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{r}}\big[Y(v)\big]dvdu.\end{aligned}\]
It is sufficient to show that, for a.a. $0\le r,s\lt t\le T$,
\[\begin{aligned}{}{D_{r}}{D_{s}}Y(t)& ={\int _{s}^{t}}\mathcal{K}(u,s){D_{r}}\Bigg[{b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\Bigg]du\\ {} & ={\int _{s}^{t}}\mathcal{K}(u,s){D_{r}}\big[{F_{1}}(t,u)\big]du,\end{aligned}\]
since, in such case, (18) follows immediately from Corollary 1. Recall that
\[ {D_{s}}Y(t)=\mathcal{K}(t,s)+{\int _{s}^{t}}\mathcal{K}(u,s){F_{1}}(t,u)du,\hspace{1em}\text{where}\hspace{2.5pt}{F_{1}}(t,u):={b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}.\]
Clearly, for any $0\le r,s\lt t\le T$,
\[ {D_{r}}{D_{s}}Y(t)={D_{r}}{\int _{s}^{t}}\mathcal{K}(u,s){F_{1}}(t,u)du,\]
so, by closedness of D and Hille’s theorem [22, Theorem 1.2.4], it is enough to show that
(i) for a.a. $0\le s\le u\lt t\le T$, $\mathcal{K}(u,s){F_{1}}(t,u)\in {\mathbb{D}^{1,p}}$ and
\[ {D_{r}}\big[\mathcal{K}(u,s){F_{1}}(t,u)\big]=\mathcal{K}(u,s){D_{r}}\big[{F_{1}}(t,u)\big];\]
(ii) for a.a. $0\le s\lt t\le T$,\[\begin{aligned}{}{\int _{0}^{T}}& {\Bigg(\mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\big({D_{r}}\big[\mathcal{K}(u,s){F_{1}}(t,u)\big]\big)^{2}}dr\Bigg)^{\frac{p}{2}}}\Bigg]\Bigg)^{\frac{1}{p}}}du\\ {} & ={\int _{0}^{T}}\mathcal{K}(u,s){\Bigg(\mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\big({D_{r}}\big[{F_{1}}(t,u)\big]\big)^{2}}dr\Bigg)^{\frac{p}{2}}}\Bigg]\Bigg)^{\frac{1}{p}}}du\\ {} & \lt \infty .\end{aligned}\]
Item (i) above follows immediately from Corollary 1. As for item (ii), observe that, by Proposition 1, (13) as well as the boundedness of $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}$, we have
\[\begin{aligned}{}{\big({D_{r}}\big[{F_{1}}(t,u)\big]\big)^{2}}& \le C\Bigg({\big({b^{\prime\prime }_{yy}}\big(u,Y(u)\big)\big)^{2}}\exp \Bigg\{2{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}{\big({D_{r}}Y(u)\big)^{2}}\\ {} & \hspace{1em}+{\big({b^{\prime }_{y}}\big(u,Y(u)\big)\big)^{2}}\exp \Bigg\{2{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\times \\ {} & \hspace{2em}\hspace{2em}\times {\int _{u}^{t}}{\big({b^{\prime\prime }_{yy}}\big(v,Y(v)\big){D_{r}}Y(v)\big)^{2}}dv\Bigg)\\ {} & \le C\Bigg({\xi ^{2}}{\big({D_{r}}Y(u)\big)^{2}}+{\xi ^{4}}{\int _{u}^{t}}{\big({D_{r}}Y(v)\big)^{2}}dv\Bigg)\\ {} & \le C{\xi ^{2}}\Bigg({\mathcal{K}^{2}}(u,r)+{\int _{r}^{u}}{\mathcal{K}^{2}}(z,r)dz\Bigg)\\ {} & \hspace{1em}+C{\xi ^{4}}\Bigg({\int _{u}^{t}}{\mathcal{K}^{2}}(v,r)dv+{\int _{u}^{t}}{\int _{r}^{v}}{\mathcal{K}^{2}}(z,r)dzdv\Bigg).\end{aligned}\]
Hence, for any $p\ge 2$, Remark 1 implies
\[\begin{aligned}{}{\int _{0}^{T}}{\big({D_{r}}\big[{F_{1}}(t,u)\big]\big)^{2}}dr& \le C{\xi ^{2}}\Bigg({\int _{0}^{T}}{\mathcal{K}^{2}}(u,r)dr+{\int _{0}^{T}}{\int _{r}^{u}}{\mathcal{K}^{2}}(z,r)dzdr\Bigg)\\ {} & \hspace{1em}+C{\xi ^{4}}{\int _{0}^{T}}{\int _{u}^{t}}{\mathcal{K}^{2}}(v,r)dvdr\\ {} & \hspace{1em}+C{\xi ^{4}}{\int _{0}^{T}}{\int _{u}^{t}}{\int _{r}^{v}}{\mathcal{K}^{2}}(z,r)dzdvdr\\ {} & \le C\big({\xi ^{2}}+{\xi ^{4}}\big),\end{aligned}\]
so, since ξ has moments of all orders, (ii) holds, which finalizes the proof. □
Finally, denote ${\mathbb{L}^{2,p}}:={L^{p}}([0,T];{\mathbb{D}^{2,p}})$. We complete the section with the following result.
Proof.
By the definition of the $\| \cdot {\| _{2,p}}$-norm in (32) from Appendix A, it is sufficient to check that
(19)
\[ {\int _{0}^{T}}\mathbb{E}\big[|Y(t){|^{p}}\big]dt\lt \infty ,\]
(20)
\[ {\int _{0}^{T}}\mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\big({D_{s}}Y(t)\big)^{2}}ds\Bigg)^{\frac{p}{2}}}\Bigg]dt\lt \infty \]
and
(21)
\[ {\int _{0}^{T}}\mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\int _{0}^{T}}{\big({D_{r}}{D_{s}}Y(t)\big)^{2}}dsdr\Bigg)^{\frac{p}{2}}}\Bigg]dt\lt \infty .\]
By (11), (19) holds automatically. Next, (20) can be easily deduced from (14). Finally, using Proposition 1 and the boundedness of $\exp \{{\textstyle\int _{u}^{t}}{b^{\prime }_{y}}(v,Y(v))dv\}$, it is easy to prove a bound similar to (14) for
\[ \mathbb{E}\Bigg[{\Bigg({\int _{0}^{T}}{\int _{0}^{T}}{\big({D_{r}}{D_{s}}Y(t)\big)^{2}}dsdr\Bigg)^{\frac{p}{2}}}\Bigg],\]
which implies (21). By this, the proof is complete. □
4 Power law in SVV model
Having the second-order Malliavin differentiability in place, we now possess all the necessary tools to analyze the behavior of the implied volatility skew of a model with the sandwiched process (10) as stochastic volatility. Namely, we consider a (risk-free) market model with the price process $S=\{S(t),\hspace{2.5pt}t\in [0,T]\}$ of the form
(22)
\[ \begin{aligned}{}S(t)& ={e^{X(t)}},\\ {} X(t)& ={x_{0}}+rt-\frac{1}{2}{\int _{0}^{t}}{Y^{2}}(s)ds+{\int _{0}^{t}}Y(s)\big(\rho d{B_{1}}(s)+\sqrt{1-{\rho ^{2}}}d{B_{2}}(s)\big),\\ {} Y(t)& ={y_{0}}+{\int _{0}^{t}}b\big(s,Y(s)\big)ds+{\int _{0}^{t}}\mathcal{K}(t,s)d{B_{1}}(s),\end{aligned}\]
where ${B_{1}},{B_{2}}$ are two independent Brownian motions, $X=\{X(t),t\in [0,T]\}$ denotes the (risk-free) log-price of an asset starting from some level ${x_{0}}\in \mathbb{R}$, r is a constant instantaneous interest rate, and $\rho \in (-1,1)$ is a correlation coefficient that accounts for the leverage effect. As previously, the drift b and the Volterra kernel $\mathcal{K}$ satisfy Assumptions 1 and 2.

The goal of this section is to establish conditions under which (22) reproduces the power law (2) of the short-term at-the-money implied volatility skew. Namely, we have the following result.
Theorem 4.
Let Assumptions 1 and 2 hold with $H\in (\frac{1}{6},\frac{1}{2})$. Assume that the Volterra kernel $\mathcal{K}$ is such that, for any $0\le s\lt t\le T$,
\[ \big|\mathcal{K}(t,s)\big|\le C|t-s{|^{-\frac{1}{2}+H}}\]
for some constant $C\gt 0$, and
(23)
\[ \frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}\mathcal{K}(t,s)dtds\to {K_{Y}},\hspace{1em}\tau \to 0+,\]
for some finite constant ${K_{Y}}$. Then, with probability 1, the SVV implied volatility $\widehat{\sigma }$ exhibits the property
\[ \underset{\tau \to 0}{\lim }{\tau ^{\frac{1}{2}-H}}\frac{\partial \widehat{\sigma }}{\partial \kappa }(\tau ,\kappa ){\bigg|_{\kappa =0}}=\frac{\rho }{{y_{0}}}{K_{Y}}.\]
In particular, if $\rho {K_{Y}}\ne 0$, the SVV model (22) reproduces the power law (2) of the at-the-money implied volatility skew.
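To illustrate condition (23), consider the pure power kernel $\mathcal{K}(t,s)={(t-s)^{H-\frac{1}{2}}}{1_{s\lt t}}$ (the $n=0$, ${\alpha _{0}}=1$ case of Example 1 below). Both integrals can be computed in closed form:

```latex
\[\begin{aligned}{}\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}{(t-s)^{H-\frac{1}{2}}}dtds& =\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}\frac{{(\tau -s)^{H+\frac{1}{2}}}}{H+\frac{1}{2}}ds\\ {} & =\frac{1}{(H+\frac{1}{2})(H+\frac{3}{2})},\end{aligned}\]
```

so (23) holds with ${K_{Y}}=\frac{1}{(H+\frac{1}{2})(H+\frac{3}{2})}\gt 0$; for this kernel the scaled integral does not depend on τ at all, so the limit is attained exactly.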
Remark 6.
The behavior of empirically observed implied volatilities (see, e.g., [12]) shows that realistic market models should produce $\widehat{\sigma }$ with
(24)
\[ \frac{\partial \widehat{\sigma }}{\partial \kappa }(\tau ,\kappa ){\bigg|_{\kappa =0}}\lt 0.\]
In the SVV setting (22), Theorem 4 guarantees that (24) holds for all small enough τ provided that $\rho {K_{Y}}\lt 0$.

To prove Theorem 4, we will apply the fundamental result [1, Theorem 6.3] which connects the shape of the skew with the Malliavin derivative of the volatility.
Remark 8.
In the recent literature (see, e.g., [4, 8, 12, 20]), it is typical to characterize the implied volatility skew in terms of $\frac{\partial \widehat{\sigma }}{\partial \kappa }$ with $\kappa =\log \frac{K}{{e^{r\tau +{x_{0}}}}}$ being the log-moneyness. In [1], a slightly different parametrization ${\widehat{\sigma }_{\text{log-price}}}(\tau ,{x_{0}})$ is considered with
\[ {\widehat{\sigma }_{\text{log-price}}}(\tau ,x)=\widehat{\sigma }\bigg(\tau ,\log \frac{K}{{e^{r\tau }}}-x\bigg).\]
With this parametrization,
\[ \frac{\partial {\widehat{\sigma }_{\text{log-price}}}(\tau ,x)}{\partial x}=-\frac{\partial \widehat{\sigma }(\tau ,\log \frac{K}{{e^{r\tau }}}-x)}{\partial \kappa }\]
and the power law (2) is equivalent to
\[ \bigg|\frac{\partial {\widehat{\sigma }_{\text{log-price}}}}{\partial x}(\tau ,x)\bigg|\propto {\tau ^{-\frac{1}{2}+H}},\hspace{1em}x\approx \log \frac{K}{{e^{r\tau }}}.\]
Theorem 5.
Consider a risk-free log-price
(25)
\[ X(t)={x_{0}}+rt-\frac{1}{2}{\int _{0}^{t}}{\sigma ^{2}}(s)ds+{\int _{0}^{t}}\sigma (s)\big(\rho d{B_{1}}(s)+\sqrt{1-{\rho ^{2}}}d{B_{2}}(s)\big),\]
where ${B_{1}}$, ${B_{2}}$ are two independent Brownian motions, ${x_{0}}\in \mathbb{R}$ is a deterministic initial value, r is an instantaneous interest rate, $\rho \in (-1,1)$ is a correlation coefficient and $\sigma =\{\sigma (t),\hspace{2.5pt}t\in [0,T]\}$ is a square-integrable stochastic process with right-continuous trajectories adapted to the filtration $\mathcal{F}=\{{\mathcal{F}_{t}},\hspace{2.5pt}t\in [0,T]\}$ generated by ${B_{1}}$. Assume that
(H1) $\sigma \in {\mathbb{L}^{2,4}}$ with respect to ${B_{1}}$;
(H2) there exists a constant ${\varphi _{\ast }}\gt 0$ such that, with probability 1, $\sigma (t)\gt {\varphi _{\ast }}$ for all $t\in [0,T]$;
(H4) σ has a.s. right-continuous trajectories;
(H5) ${\sup _{r,s,t\in [0,\tau ]}}\mathbb{E}[{(\sigma (s)\sigma (t)-{\sigma ^{2}}(r))^{2}}]\to 0$ when $\tau \to 0+$.
Finally, assume that there exists a constant ${K_{\sigma }}\gt 0$ such that, with probability 1,
Then, with probability 1,
Remark 9.
Observe that the SVV model (22) automatically satisfies a number of assumptions of Theorem 5:
• assumption (H2) with ${\varphi _{\ast }}:={\min _{t\in [0,T]}}\varphi (t)\gt 0$;
• assumption (H4) since Y is continuous a.s.;
• assumption (H1) by the results of Section 3 above.
Therefore, it remains to check (H3), (H5), and (28). Naturally, given the shape of the Malliavin derivative (12), both (H3) and (28) require additional assumptions on the kernel, so let us start with (H5).
Proof.
By [10, Lemma 3.6], there exists a positive random variable $\Upsilon ={\Upsilon _{T}}$ such that, for all ${t_{1}},{t_{2}}\in [0,T]$,
\[ \big|Y({t_{1}})-Y({t_{2}})\big|\le \Upsilon |{t_{1}}-{t_{2}}{|^{\lambda }}\]
and, for any $r\gt 0$,
\[ \mathbb{E}\big[{\Upsilon ^{r}}\big]\lt \infty .\]
Therefore, given that ${\max _{t\in [0,T]}}Y(t)\lt {\max _{t\in [0,T]}}\psi (t)$ by (11),
\[\begin{aligned}{}\mathbb{E}& \big[{\big(Y(s)Y(t)-{Y^{2}}(r)\big)^{2}}\big]\\ {} & =\mathbb{E}\big[{\big(Y(s)\big(Y(t)-Y(r)\big)+Y(r)\big(Y(s)-Y(r)\big)\big)^{2}}\big]\\ {} & \le 2\mathbb{E}\big[{Y^{2}}(s){\big(Y(t)-Y(r)\big)^{2}}\big]+2\mathbb{E}\big[{Y^{2}}(r){\big(Y(s)-Y(r)\big)^{2}}\big]\\ {} & \le 2|t-r{|^{2\lambda }}\underset{v\in [0,T]}{\max }{\psi ^{2}}(v)\mathbb{E}\big[{\Upsilon ^{2}}\big]+2|s-r{|^{2\lambda }}\underset{v\in [0,T]}{\max }{\psi ^{2}}(v)\mathbb{E}\big[{\Upsilon ^{2}}\big]\end{aligned}\]
and hence
\[\begin{aligned}{}\underset{r,s,t\in [0,\tau ]}{\sup }\mathbb{E}\big[{\big(Y(s)Y(t)-{Y^{2}}(r)\big)^{2}}\big]& \le 4{\tau ^{2\lambda }}\underset{s\in [0,T]}{\max }{\psi ^{2}}(s)\mathbb{E}\big[{\Upsilon ^{2}}\big]\to 0\end{aligned}\]
as $\tau \to 0+$. □

Our next step is to handle (28).
Proof.
Recall that
\[ {F_{1}}(t,u):={b^{\prime }_{y}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\]
and that, by Proposition 1,
\[ \big|{F_{1}}(t,u)\big|\le {e^{cT}}\xi ,\]
where $c:={\max _{(t,y)\in \overline{\mathcal{D}}}}{a^{\prime }_{y}}(t,y)$. Then we can write
\[\begin{aligned}{}\frac{1}{{\tau ^{\frac{3}{2}+H}}}& {\int _{0}^{\tau }}{\int _{s}^{\tau }}\mathbb{E}\big[{D_{s}}Y(t)\big]dtds\\ {} & =\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}\mathcal{K}(t,s)dtds\\ {} & \hspace{1em}+\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}{\int _{s}^{t}}\mathcal{K}(u,s)\mathbb{E}\big[{F_{1}}(t,u)\big]dudtds\\ {} & =\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}\mathcal{K}(t,s)dtds\\ {} & \hspace{1em}+\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}\mathcal{K}(u,s)\Bigg({\int _{u}^{\tau }}\mathbb{E}\big[{F_{1}}(t,u)\big]dt\Bigg)duds.\end{aligned}\]
The term $\frac{1}{{\tau ^{\frac{3}{2}+H}}}{\textstyle\int _{0}^{\tau }}{\textstyle\int _{s}^{\tau }}\mathcal{K}(t,s)dtds$ converges to ${K_{Y}}$ by (23). As for the second term, note that, with probability 1, for any $u\in [0,\tau ]$,
\[\begin{aligned}{}{\int _{u}^{\tau }}\big|\mathbb{E}\big[{F_{1}}(t,u)\big]\big|dt& \le C\mathbb{E}[\xi ]\tau \end{aligned}\]
and hence, given (23), with probability 1,
\[ \frac{1}{{\tau ^{\frac{3}{2}+H}}}{\int _{0}^{\tau }}{\int _{s}^{\tau }}\mathbb{E}\big[{D_{s}}Y(t)\big]dtds\to {K_{Y}},\hspace{1em}\tau \to 0+,\]
which ends the proof. □

Finally, let us deal with (H3).
Proposition 6.
Proof.
Fix $0\lt r,s\lt t$. Then, taking into account (29), with probability 1,
(31)
\[ \begin{aligned}{}\big|{D_{s}}Y(t)\big|& \le \big|\mathcal{K}(t,s)\big|+{\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\big|{F_{1}}(t,u)\big|du\\ {} & \le C\Bigg(|t-s{|^{-\frac{1}{2}+H}}+\xi {\int _{s}^{t}}|u-s{|^{-\frac{1}{2}+H}}du\Bigg)\\ {} & \le C(1+T\xi )|t-s{|^{-\frac{1}{2}+H}}\\ {} & =:\zeta |t-s{|^{-\frac{1}{2}+H}},\end{aligned}\]
which immediately implies (26). Next, by Proposition 1,
\[ \big|{b^{\prime\prime }_{yy}}\big(v,Y(v)\big)\big|\le \xi \]
for any $v\in [0,T]$ and, for any $0\le u\le t\le T$,
\[\begin{aligned}{}\big|{F_{2}}(t,u)\big|& =\Bigg|{b^{\prime\prime }_{yy}}\big(u,Y(u)\big)\exp \Bigg\{{\int _{u}^{t}}{b^{\prime }_{y}}\big(v,Y(v)\big)dv\Bigg\}\Bigg|\\ {} & \le {e^{cT}}\xi \end{aligned}\]
with $c:={\max _{(t,y)\in \overline{\mathcal{D}}}}{a^{\prime }_{y}}(t,y)$, so we can write
\[\begin{aligned}{}& \big|{D_{r}}{D_{s}}Y(t)\big|\\ {} & \hspace{1em}\le {\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\big|{F_{1}}(t,u)\big|\Bigg({\int _{u}^{t}}\big|{b^{\prime\prime }_{yy}}\big(v,Y(v)\big)\big|\big|{D_{r}}Y(v)\big|dv\Bigg)du\\ {} & \hspace{2em}+{\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\big|{F_{2}}(t,u)\big|\big|{D_{r}}Y(u)\big|du\\ {} & \hspace{1em}\le C\Bigg({\xi ^{2}}{\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\Bigg({\int _{u}^{t}}\big|{D_{r}}Y(v)\big|dv\Bigg)du+\xi {\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\big|{D_{r}}Y(u)\big|du\Bigg)\\ {} & \hspace{1em}=C\Bigg({\xi ^{2}}{\int _{s}^{t}}\big|\mathcal{K}(u,s)\big|\Bigg({\int _{u\vee r}^{t}}\big|{D_{r}}Y(v)\big|dv\Bigg)du+\xi {\int _{r\vee s}^{t}}\big|\mathcal{K}(u,s)\big|\big|{D_{r}}Y(u)\big|du\Bigg).\end{aligned}\]
Taking into account (30) and (31),
\[\begin{aligned}{}\big|{D_{r}}{D_{s}}Y(t)\big|& \le C\Bigg({\xi ^{2}}\zeta {\int _{s}^{t}}|u-s{|^{-\frac{1}{2}+H}}\Bigg({\int _{u\vee r}^{t}}|v-r{|^{-\frac{1}{2}+H}}dv\Bigg)du\\ {} & \hspace{2em}+\xi \zeta {\int _{r\vee s}^{t}}|u-s{|^{-\frac{1}{2}+H}}|u-r{|^{-\frac{1}{2}+H}}du\Bigg)\\ {} & \le C\Bigg({\xi ^{2}}\zeta {\int _{s}^{t}}|u-s{|^{-\frac{1}{2}+H}}|t-r{|^{\frac{1}{2}+H}}du\\ {} & \hspace{2em}+\xi \zeta {\int _{r\vee s}^{t}}|u-s{|^{-\frac{1}{2}+H}}|u-r{|^{-\frac{1}{2}+H}}du\Bigg).\end{aligned}\]
Note that
\[\begin{aligned}{}{\int _{s}^{t}}|u-s{|^{-\frac{1}{2}+H}}|t-r{|^{\frac{1}{2}+H}}du& \le C|t-r{|^{\frac{1}{2}+H}}|t-s{|^{\frac{1}{2}+H}}\\ {} & \le C{\bigg(\frac{t-r}{t-s}\bigg)^{\frac{1}{2}-H}}.\end{aligned}\]
As for the integral ${\textstyle\int _{r\vee s}^{t}}|u-s{|^{-\frac{1}{2}+H}}|u-r{|^{-\frac{1}{2}+H}}du$, we have two cases:
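The case analysis, not displayed above, can be sketched as follows (a reconstruction using only $t-r,t-s\le T$ and the standing condition $H\gt \frac{1}{6}$):

```latex
% Case s <= r: then u - s >= u - r on the integration domain, and the
% middle bound uses (t-r)^{3H - 1/2} <= T^{3H - 1/2}, which is exactly
% where H > 1/6 is needed; the last bound uses t - s <= T.
\[ {\int _{r}^{t}}|u-s{|^{-\frac{1}{2}+H}}|u-r{|^{-\frac{1}{2}+H}}du\le \frac{{(t-r)^{2H}}}{2H}\le C{(t-r)^{\frac{1}{2}-H}}\le C{\bigg(\frac{t-r}{t-s}\bigg)^{\frac{1}{2}-H}}.\]
% Case r < s: then t - r >= t - s, so the ratio (t-r)/(t-s) is at least 1, and
\[ {\int _{s}^{t}}|u-s{|^{-\frac{1}{2}+H}}|u-r{|^{-\frac{1}{2}+H}}du\le \frac{{(t-s)^{2H}}}{2H}\le C\le C{\bigg(\frac{t-r}{t-s}\bigg)^{\frac{1}{2}-H}}.\]
```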
In any case,
\[\begin{aligned}{}\big|{D_{r}}{D_{s}}Y(t)\big|& \le C\xi \zeta (\xi +1){\bigg(\frac{t-r}{t-s}\bigg)^{\frac{1}{2}-H}},\end{aligned}\]
where ξ and ζ are random variables having all moments, and hence (27) holds. □

Having in mind all of the results above, we are ready to prove the main result of this section, namely Theorem 4.
Example 1.
Let $\frac{1}{6}\lt {H_{0}}\lt {H_{1}}\lt \cdots \lt {H_{n}}\lt 1$ with ${H_{0}}\lt \frac{1}{2}$, and let ${\alpha _{k}}\gt 0$, $k=0,\dots ,n$. Then the kernel
\[ \mathcal{K}(t,s)=\Bigg({\sum \limits_{k=0}^{n}}{\alpha _{k}}{(t-s)^{{H_{k}}-\frac{1}{2}}}\Bigg){1_{s\lt t}}\]
satisfies the assumptions of Theorem 4, so the corresponding SVV model generates the power law (2) with $H={H_{0}}$ provided that $\rho \ne 0$ in (22); the empirically relevant negative skew (24) corresponds to $\rho \lt 0$.
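For a concrete feel for the model, the SVV dynamics (22) with a kernel of this type can be simulated by a straightforward (if $O({n^{2}})$) Euler-type scheme. The sketch below is illustrative only: the function names are ours, the mean-reverting drift $b(s,y)=\theta (\mu -y)$ is a hypothetical stand-in (a genuine sandwiched drift must in addition keep Y strictly between the bounds φ and ψ of Assumption 1), and no convergence rate is claimed.

```python
import numpy as np

def power_law_kernel(alpha, Hs):
    """Example 1 kernel: K(t, s) = sum_k alpha_k (t - s)^{H_k - 1/2}, s < t."""
    def K(t, s):
        return sum(a * (t - s) ** (h - 0.5) for a, h in zip(alpha, Hs))
    return K

def simulate_svv(x0, y0, r, rho, b, K, T=1.0, n=400, seed=42):
    """Euler-type scheme for (22).  Y is a Volterra process driven by B1;
    since Y is non-Markovian, its stochastic convolution is recomputed at
    every grid point.  X is driven by rho*B1 + sqrt(1 - rho^2)*B2."""
    rng = np.random.default_rng(seed)
    dt = T / n
    t = np.linspace(0.0, T, n + 1)
    dB1 = rng.normal(0.0, np.sqrt(dt), size=n)
    dB2 = rng.normal(0.0, np.sqrt(dt), size=n)

    # Volterra volatility: Y(t_i) = y0 + int_0^{t_i} b(s, Y(s)) ds
    #                                  + int_0^{t_i} K(t_i, s) dB1(s).
    Y = np.empty(n + 1)
    Y[0] = y0
    for i in range(1, n + 1):
        drift = dt * sum(b(t[j], Y[j]) for j in range(i))
        conv = sum(K(t[i], t[j]) * dB1[j] for j in range(i))
        Y[i] = y0 + drift + conv

    # Log-price: dX = (r - Y^2/2) dt + Y (rho dB1 + sqrt(1 - rho^2) dB2).
    X = np.empty(n + 1)
    X[0] = x0
    for i in range(n):
        X[i + 1] = (X[i] + (r - 0.5 * Y[i] ** 2) * dt
                    + Y[i] * (rho * dB1[i] + np.sqrt(1.0 - rho ** 2) * dB2[i]))
    return t, X, Y

# Illustrative parameters: the smallest exponent H_0 = 0.3 dominates
# the short-time behavior of the skew.
K = power_law_kernel(alpha=[1.0, 0.5], Hs=[0.3, 0.45])
t, X, Y = simulate_svv(x0=0.0, y0=0.3, r=0.01, rho=-0.7,
                       b=lambda s, y: 2.0 * (0.3 - y), K=K)
```

Averaging payoffs of such paths over many seeds and inverting the Black–Scholes formula would give a Monte Carlo estimate of $\widehat{\sigma }(\tau ,\kappa )$, against which the limit in Theorem 4 can be checked numerically.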