1 Introduction
A game contingent claim (GCC) or game option, which was introduced in [8], is defined as a contract between the seller and the buyer of the option such that both have the right to exercise it at any time up to a maturity date (horizon) T. If the buyer exercises the contract at time t then he receives the payment ${Y_{t}}$, but if the seller exercises (cancels) the contract before the buyer then the latter receives ${X_{t}}$. The difference ${\Delta _{t}}={X_{t}}-{Y_{t}}$ is the penalty which the seller pays to the buyer for the cancellation of the contract. In short, if the seller exercises at a stopping time $\sigma \le T$ and the buyer at a stopping time $\tau \le T$ then the former pays to the latter the amount $H(\sigma ,\tau )$ where
(1)
\[ H(\sigma ,\tau ):={X_{\sigma }}{\mathbb{I}_{\sigma <\tau }}+{Y_{\tau }}{\mathbb{I}_{\tau \le \sigma }}\]
and we set ${\mathbb{I}_{Q}}=1$ if an event Q occurs and ${\mathbb{I}_{Q}}=0$ if not.
A hedge (for the seller) against a GCC is defined here as a pair $(\pi ,\sigma )$ which consists of a self–financing strategy π and a stopping time σ which is the cancellation time of the seller. A hedge is called perfect if, no matter what exercise time the buyer chooses, the seller can cover his liability to the buyer (with probability 1). The option price is defined as the minimal initial capital which is required for a perfect hedge. Recall (see [8]) that pricing a GCC in a complete market leads to the value of a zero sum optimal stopping (Dynkin’s) game under the unique martingale measure. For additional information about the pricing of game options see, for instance, [5–7, 10–12].
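As a small illustration (not part of the original formulation), the payoff rule (1) can be written in a few lines of Python; the arrays X, Y and the integer indices sigma, tau below are hypothetical discrete-time stand-ins for the payoff processes and the stopping times above.

```python
import numpy as np

def game_payoff(X, Y, sigma, tau):
    """Payoff H(sigma, tau) of formula (1) on a discrete time grid.

    X, Y  : arrays with X[t] >= Y[t] (cancellation and exercise payoffs),
    sigma : cancellation index chosen by the seller,
    tau   : exercise index chosen by the buyer.
    """
    if sigma < tau:      # seller cancels strictly first: buyer receives X at sigma
        return X[sigma]
    return Y[tau]        # buyer exercises first (or simultaneously): Y at tau

# toy example: a cancellation penalty of 0.1 on top of the exercise payoff
Y = np.array([1.0, 1.2, 0.9, 1.5])
X = Y + 0.1
print(game_payoff(X, Y, sigma=2, tau=3))  # seller cancels at t=2 -> X[2] = 1.0
```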
In real market conditions an investor (seller) may not be willing, for various reasons, to tie up in a hedging portfolio the full initial capital required for a perfect hedge. In this case the seller is ready to accept the risk that his portfolio value at an exercise time may be less than his obligation to pay, and that he will then need additional funds to fulfill the contract.
We consider the shortfall risk measure which is given by (see [2])
\[ R(\pi ,\sigma ):=\underset{\tau }{\sup }{\mathbb{E}_{\mathbb{P}}}\left[{\left(H(\sigma ,\tau )-{V_{\sigma \wedge \tau }^{\pi }}\right)^{+}}\right]\]
where ${\{{V_{t}^{\pi }}\}_{t=0}^{T}}$ is the wealth process of the portfolio strategy π and ${\mathbb{E}_{\mathbb{P}}}$ denotes the expectation with respect to the market measure. The supremum is taken over all exercise times of the buyer and corresponds to the case where the investor has no information about the buyer's exercise strategy; the only assumption is that this strategy is a stopping time with respect to a given filtration.
A natural question to ask is whether, for a given initial capital, there exists a hedging strategy which minimizes the shortfall risk (an optimal hedge). For American options the existence of an optimal hedging strategy is proved by applying the Komlós lemma and relies heavily on the fact that the shortfall risk measure is a convex functional of the wealth process (see [14, 16]). For the game options setup, the shortfall risk measure, as a functional of the wealth process, is given by
(2)
\[ R(\pi ):=\underset{\sigma }{\inf }\underset{\tau }{\sup }{\mathbb{E}_{\mathbb{P}}}\left[{\left(H(\sigma ,\tau )-{V_{\sigma \wedge \tau }^{\pi }}\right)^{+}}\right],\]
where the infimum is taken over the cancellation times of the seller.
This functional is not necessarily convex (because of the infimum) and so the Komlós lemma cannot be applied here.
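As a toy illustration (not taken from the paper) of how an infimum destroys convexity: the maps $v\mapsto {(1-v)^{+}}$ and $v\mapsto v$ are both convex, yet their pointwise minimum is not, since
\[ \min \left({(1-\tfrac{1}{2})^{+}},\tfrac{1}{2}\right)=\frac{1}{2}>\frac{1}{2}\left[\min \left({(1-0)^{+}},0\right)+\min \left({(1-1)^{+}},1\right)\right]=0.\]
In (2) the role of v is played by the wealth process, and the infimum over cancellation times plays the role of the minimum.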
In this paper we treat the simplest complete, continuous time model, namely the Black–Scholes (BS) model. Our first result (Theorem 1), which is proved in the next section, says that when the option can be exercised only on a finite set of times, there exists an optimal hedging strategy. The proof is based on the dynamic programming approach and the randomization technique developed in [17, 18]. To date there are several existence results for risk minimization in the game options setup (see [2, 3] and Section 5.2 in [9]). These papers treat essentially discrete time trading, and due to admissibility conditions the corresponding sets of trading strategies are compact. In the current setup trading is done continuously, and so a new method of proof is required.
In Section 3 we provide the second result of the paper (Theorem 2). This is an example which demonstrates that when the GCC can be stopped on the whole time interval, optimal portfolio strategies need not exist. We combine the machinery developed in [13] with additional ideas which allow us to treat the shortfall risk measure for game options. Formally, we show that the infimum in (2), which ruins the convexity, can lead to the nonexistence of optimal hedging strategies.
2 Existence result
Consider a complete probability space $(\Omega ,\mathcal{F},\mathbb{P})$ together with a standard one–dimensional Brownian motion ${\{{W_{t}}\}_{t=0}^{\infty }}$, and the filtration ${\mathcal{F}_{t}}=\sigma \{{W_{s}}|s\le t\}$ completed by the null sets. We consider a simple BS financial market with time horizon $T<\infty $, which consists of a riskless savings account bearing zero interest (for simplicity) and of a risky asset S, whose value at time t is given by
\[ {S_{t}}={S_{0}}\exp \left(\kappa {W_{t}}+(\vartheta -{\kappa ^{2}}/2)t\right),\hspace{2.5pt}\hspace{2.5pt}t\in [0,T]\]
where ${S_{0}},\kappa >0$ and $\vartheta \in \mathbb{R}$ are constants. Define the exponential martingale
(3)
\[ {Z_{t}}:=\exp \left(-\frac{\vartheta }{\kappa }{W_{t}}-\frac{{\vartheta ^{2}}}{2{\kappa ^{2}}}t\right),\hspace{2.5pt}\hspace{2.5pt}t\in [0,T].\]
From the Girsanov theorem it follows that the probability measure $\mathbb{Q}$ which is given by
(4)
\[ \frac{d\mathbb{Q}}{d\mathbb{P}}{\Big|_{{\mathcal{F}_{t}}}}:={Z_{t}},\hspace{2.5pt}\hspace{2.5pt}t\in [0,T],\]
is the unique martingale measure for the risky asset S.
Next, let $\mathbb{T}:=\{0={T_{0}}<{T_{1}}<\cdots <{T_{n}}=T\}$ be a finite set of deterministic times. Consider a game option that can be exercised on the set $\mathbb{T}$. Denote by ${\mathcal{T}_{\mathbb{T}}}$ the set of all stopping times with values in $\mathbb{T}$. For any $k=0,1,\dots ,n$ the payoffs at time ${T_{k}}$ are path–independent and given by ${Y_{{T_{k}}}}={f_{k}}({S_{{T_{k}}}})$ and ${X_{{T_{k}}}}={g_{k}}({S_{{T_{k}}}})$ where ${f_{k}},{g_{k}}:(0,\infty )\to \mathbb{R}$ are measurable functions and $0\le {f_{k}}\le {g_{k}}$. The payoff function H is given by (1). We will assume the following integrability condition:
(5)
\[ {\mathbb{E}_{\mathbb{P}}}\left[\underset{0\le k\le n}{\max }{g_{k}}({S_{{T_{k}}}})\right]<\infty .\]
A portfolio strategy with an initial capital $x\ge 0$ is a pair $\pi =(x,\gamma )$ such that $\gamma ={\{{\gamma _{t}}\}_{t=0}^{T}}$ is a predictable S–integrable process and the corresponding wealth process
\[ {V_{t}^{\pi }}:=x+{\int _{0}^{t}}{\gamma _{u}}\hspace{0.1667em}d{S_{u}},\hspace{2.5pt}\hspace{2.5pt}t\in [0,T],\]
satisfies the admissibility condition ${V_{t}^{\pi }}\ge 0$ a.s. for all t.
Let us recall some elementary properties that will be used in the sequel (for details see Chapters IV–V in [19]). The continuity of S implies that the wealth process ${\{{V_{t}^{\pi }}\}_{t=0}^{T}}$ is continuous as well. Moreover, since ${\{{S_{t}}\}_{t=0}^{T}}$ is a $\mathbb{Q}$–martingale, the wealth process ${\{{V_{t}^{\pi }}\}_{t=0}^{T}}$ is a $\mathbb{Q}$–local martingale, and so from the admissibility condition we get that ${\{{V_{t}^{\pi }}\}_{t=0}^{T}}$ is a $\mathbb{Q}$–supermartingale. On the other hand, due to the martingale representation theorem, for any nonnegative $\mathbb{Q}$–martingale ${\{{M_{t}}\}_{t=0}^{T}}$ there exists a portfolio strategy π such that ${V_{t}^{\pi }}={M_{t}}$ for all t a.s.
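The following short Monte Carlo sketch (an illustration only; the parameter values are arbitrary) simulates the terminal values of S and of the density Z from (3) and checks numerically that ${\mathbb{E}_{\mathbb{P}}}[{Z_{T}}]=1$ and ${\mathbb{E}_{\mathbb{P}}}[{Z_{T}}{S_{T}}]={S_{0}}$, i.e. that S is a martingale under the measure $\mathbb{Q}$ of (4).

```python
import numpy as np

rng = np.random.default_rng(0)
S0, kappa, theta, T = 1.0, 0.3, 0.1, 1.0   # arbitrary illustrative parameters
n_paths = 10**6

W_T = np.sqrt(T) * rng.standard_normal(n_paths)
S_T = S0 * np.exp(kappa * W_T + (theta - 0.5 * kappa**2) * T)
Z_T = np.exp(-(theta / kappa) * W_T - 0.5 * (theta / kappa)**2 * T)

print(Z_T.mean())          # approx 1: Z is a P-martingale started at 1
print((Z_T * S_T).mean())  # approx S0: E_Q[S_T] = E_P[Z_T S_T], so S is a Q-martingale
```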
For any $x\ge 0$ denote by $\mathcal{A}(x)$ the set of all portfolio strategies with an initial capital x. A hedging strategy with an initial capital x is a pair $(\pi ,\sigma )\in \mathcal{A}(x)\times {\mathcal{T}_{\mathbb{T}}}$.
The shortfall risk measure is given by
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {R_{\mathbb{T}}}(\pi ,\sigma ):={\sup _{\tau \in {\mathcal{T}_{\mathbb{T}}}}}{\mathbb{E}_{\mathbb{P}}}\left[{\left(H(\sigma ,\tau )-{V_{\sigma \wedge \tau }^{\pi }}\right)^{+}}\right],\hspace{2.5pt}\hspace{2.5pt}(\pi ,\sigma )\in \mathcal{A}(x)\times {\mathcal{T}_{\mathbb{T}}},\\ {} & \displaystyle {R_{\mathbb{T}}}(x):={\inf _{(\pi ,\sigma )\in \mathcal{A}(x)\times {\mathcal{T}_{\mathbb{T}}}}}{R_{\mathbb{T}}}(\pi ,\sigma ).\end{array}\]
Now we are ready to formulate our first result.
Theorem 1.
For any initial capital $x\ge 0$ there exists a hedging strategy $(\hat{\pi },\hat{\sigma })\in \mathcal{A}(x)\times {\mathcal{T}_{\mathbb{T}}}$ such that ${R_{\mathbb{T}}}(\hat{\pi },\hat{\sigma })={R_{\mathbb{T}}}(x)$.
Remark 1.
We emphasize that, in contrast to previous work on game options (see [2, 3] and Section 5.2 in [9]), the trading in our setup is done continuously. Namely, the investor trades the risky asset continuously, but the GCC can be exercised only on a finite set of deterministic times. This can be viewed as a game version of Bermudan options.
2.1 Proof of Theorem 1
We start with some preparations. Let $U:[0,\infty )\times (0,\infty )\to \mathbb{R}$ be a measurable function such that for any $y>0$, $U(\cdot ,y)$ is a bounded, nondecreasing and continuous function. Let ${U_{c}}:[0,\infty )\times (0,\infty )\to \mathbb{R}$ be the concave envelope of U with respect to the first variable. Namely, for any $y>0$ the function ${U_{c}}(\cdot ,y)$ is the minimal concave function which satisfies ${U_{c}}(\cdot ,y)\ge U(\cdot ,y)$. Clearly, ${U_{c}}$ is continuous in the first variable. Thus, for any $y>0$ the set $\{x:\hspace{2.5pt}U(x,y)<{U_{c}}(x,y)\}$ is open and so can be written as a countable union of disjoint intervals
(6)
\[ \{x:\hspace{2.5pt}U(x,y)<{U_{c}}(x,y)\}=\bigcup \limits_{n\in \mathbb{N}}({a_{n}}(y),{b_{n}}(y)).\]
From Lemma 2.8 in [17] it follows that ${U_{c}}(\cdot ,y)$ is affine on each of the intervals $({a_{n}}(y),{b_{n}}(y))$. Since U and ${U_{c}}$ are continuous in the first variable, the functions ${a_{n}},{b_{n}}:(0,\infty )\to {\mathbb{R}_{+}}$, $n\in \mathbb{N}$ are determined by the countable collection of functions $U(q,\cdot ),{U_{c}}(q,\cdot ):(0,\infty )\to \mathbb{R}$ for nonnegative rational q, and so they are measurable.
For any $0\le {t_{1}}<{t_{2}}\le T$ and an ${\mathcal{F}_{{t_{1}}}}$–measurable random variable ${\Theta _{1}}\ge 0$, denote by ${\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$ the set of all random variables ${\Theta _{2}}\ge 0$ which are ${\mathcal{F}_{{t_{2}}}}$–measurable and satisfy ${\mathbb{E}_{\mathbb{Q}}}({\Theta _{2}}|{\mathcal{F}_{{t_{1}}}})\le {\Theta _{1}}$.
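For intuition, the concave envelope ${U_{c}}(\cdot ,y)$ and the intervals $({a_{n}}(y),{b_{n}}(y))$ in (6) can be visualised numerically. The sketch below (illustrative only, on a finite grid rather than on $[0,\infty )$, with an arbitrary sample function) computes the upper concave hull of grid points by the standard monotone–chain construction and reports where it differs from the function itself.

```python
import numpy as np

def concave_envelope(x, u):
    """Upper concave hull of the points (x[i], u[i]); x must be increasing."""
    hull = []  # vertices of the hull, scanned left to right
    for p in zip(x, u):
        # drop the last vertex while it lies on or below the chord hull[-2] -> p
        while len(hull) >= 2:
            (x0, y0), (x1, y1) = hull[-2], hull[-1]
            if (x1 - x0) * (p[1] - y0) - (y1 - y0) * (p[0] - x0) >= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    hx, hy = zip(*hull)
    return np.interp(x, hx, hy)  # piecewise linear interpolation between hull vertices

# a bounded, nondecreasing, continuous but non-concave sample function
x = np.linspace(0.0, 3.0, 601)
u = 0.5 * (1.0 + np.tanh(4.0 * (x - 1.0)))
uc = concave_envelope(x, u)

gap = uc - u                      # positive exactly where U < U_c, cf. (6)
idx = np.where(gap > 1e-9)[0]
print("envelope differs from U roughly on (%.3f, %.3f)" % (x[idx[0]], x[idx[-1]]))
```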
The following auxiliary result is an extension of Theorem 5.1 in [17].
Lemma 1.
Let $0\le {t_{1}}<{t_{2}}\le T$ and let ${\Theta _{1}}\ge 0$ be an ${\mathcal{F}_{{t_{1}}}}$–measurable random variable. For a function U as above, assume that there exists a function $G:\mathbb{R}\to \mathbb{R}$ such that $|U(x,y)|\le G(y)$ for all x, y and ${\mathbb{E}_{\mathbb{P}}}[G({S_{{t_{2}}}})]<\infty $. Then there exists a random variable $\Theta \in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$ such that
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\mathbb{E}_{\mathbb{P}}}\left[U(\Theta ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle =ess{\sup _{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}}{\mathbb{E}_{\mathbb{P}}}\left[U({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle =ess{\sup _{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}}{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\end{array}\]
Proof.
Since ${U_{c}}\ge U$, it is sufficient to show that there exists $\Theta \in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$ such that
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\mathbb{E}_{\mathbb{P}}}\left[U(\Theta ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle =ess{\sup _{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}}{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\end{array}\]
Choose a sequence ${\Theta ^{(n)}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$, $n\in \mathbb{N}$ such that
(7)
\[ \underset{n\to \infty }{\lim }{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Theta ^{(n)}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]=ess\underset{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}{\sup }{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\]
From Lemma A1.1 in [4] we obtain a sequence ${\Lambda ^{(m)}}\in conv({\Theta ^{(m)}},{\Theta ^{(m+1)}},\dots )$, $m\in \mathbb{N}$ converging $\mathbb{P}$ a.s. to a random variable Λ. The Fatou lemma implies that $\Lambda \in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$. By applying the dominated convergence theorem, the inequality $|{U_{c}}(\cdot ,{S_{{t_{2}}}})|\le G({S_{{t_{2}}}})$ and the fact that ${U_{c}}$ is concave and continuous in the first variable, we obtain
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}(\Lambda ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle ={\lim \nolimits_{n\to \infty }}{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Lambda ^{(n)}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle \ge {\lim \nolimits_{n\to \infty }}{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Theta ^{(n)}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\end{array}\]
This together with (7) gives
(8)
\[ {\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}(\Lambda ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]=ess\underset{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}{\sup }{\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\]
Next, introduce the normal random variable
\[ \Gamma :=({W_{\frac{{t_{1}}+{t_{2}}}{2}}}-{W_{{t_{1}}}})-\frac{1}{2}({W_{{t_{2}}}}-{W_{{t_{1}}}}).\]
Observe that Γ is independent of ${\mathcal{F}_{{t_{1}}}}$ (it is built from increments of W after ${t_{1}}$) and that ${\mathbb{E}_{\mathbb{P}}}[\Gamma {W_{{t_{2}}}}]=0$; since the random variables involved are jointly Gaussian, we conclude that Γ is independent of the σ–algebra generated by ${W_{t}},t\in [0,{t_{1}}]\cup \{{t_{2}}\}$. From Theorem 1 in [20] it follows that there exists a measurable function $\Phi :C[0,{t_{1}}]\times {\mathbb{R}^{2}}\to \mathbb{R}$ such that we have the following equality of the joint laws
\[ \left(\left({W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Lambda \right);\mathbb{P}\right)=\left(\left({W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Phi \left({W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Gamma \right)\right);\mathbb{P}\right).\]
In particular $\Phi \left({W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Gamma \right)\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$ and
\[ {\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}\left(\Lambda ,{S_{{t_{2}}}}\right)|{\mathcal{F}_{{t_{1}}}}\right]={\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}\left(\Phi \left({W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Gamma \right),{S_{{t_{2}}}}\right)|{\mathcal{F}_{{t_{1}}}}\right].\]
Thus, without loss of generality we assume that $\Lambda =\Phi \left({W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Gamma \right)$.
We arrive at the final step of the proof. Introduce the normal random variable
\[ \hat{\Gamma }:=({W_{\frac{{t_{1}}+2{t_{2}}}{3}}}-{W_{{t_{1}}}})-\frac{2}{3}({W_{{t_{2}}}}-{W_{{t_{1}}}})-\frac{2}{3}\Gamma .\]
Observe that ${\mathbb{E}_{\mathbb{P}}}[\hat{\Gamma }{W_{{t_{2}}}}]={\mathbb{E}_{\mathbb{P}}}[\hat{\Gamma }\Gamma ]=0$. Thus, $\hat{\Gamma }$ is independent of the σ–algebra generated by ${W_{t}},t\in [0,{t_{1}}]\cup \{{t_{2}}\}$ and Γ. Let ${F^{-1}}$ be the inverse function of the cumulative distribution function $F(\cdot ):=\mathbb{P}(\hat{\Gamma }\le \cdot )$. Recall (6) and define the random variable
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle \Theta :=\Lambda {\mathbb{I}_{\Lambda \notin {\textstyle\bigcup _{n\in \mathbb{N}}}({a_{n}}({S_{{t_{2}}}}),{b_{n}}({S_{{t_{2}}}}))}}\\ {} & \displaystyle +{\textstyle\sum _{n\in \mathbb{N}}}{b_{n}}({S_{{t_{2}}}}){\mathbb{I}_{\Lambda \in ({a_{n}}({S_{{t_{2}}}}),{b_{n}}({S_{{t_{2}}}}))}}{\mathbb{I}_{\hat{\Gamma }<{F^{-1}}\left(\frac{\Lambda -{a_{n}}({S_{{t_{2}}}})}{{b_{n}}({S_{{t_{2}}}})-{a_{n}}({S_{{t_{2}}}})}\right)}}\\ {} & \displaystyle +{\textstyle\sum _{n\in \mathbb{N}}}{a_{n}}({S_{{t_{2}}}}){\mathbb{I}_{\Lambda \in ({a_{n}}({S_{{t_{2}}}}),{b_{n}}({S_{{t_{2}}}}))}}{\mathbb{I}_{\hat{\Gamma }>{F^{-1}}\left(\frac{\Lambda -{a_{n}}({S_{{t_{2}}}})}{{b_{n}}({S_{{t_{2}}}})-{a_{n}}({S_{{t_{2}}}})}\right)}}.\end{array}\]
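To see explicitly how this extra randomization acts (the computation is implicit in the next two displays): $\hat{\Gamma }$ is independent of ${W_{t}},t\in [0,{t_{1}}]\cup \{{t_{2}}\}$ and of Γ, and $\mathbb{P}(\hat{\Gamma }<{F^{-1}}(p))=p$ for $p\in (0,1)$. Hence, conditionally on these variables and on the event $\{\Lambda \in ({a_{n}}({S_{{t_{2}}}}),{b_{n}}({S_{{t_{2}}}}))\}$, the random variable Θ takes the two values ${b_{n}}({S_{{t_{2}}}})$ and ${a_{n}}({S_{{t_{2}}}})$ with probabilities $\frac{\Lambda -{a_{n}}({S_{{t_{2}}}})}{{b_{n}}({S_{{t_{2}}}})-{a_{n}}({S_{{t_{2}}}})}$ and $\frac{{b_{n}}({S_{{t_{2}}}})-\Lambda }{{b_{n}}({S_{{t_{2}}}})-{a_{n}}({S_{{t_{2}}}})}$, and so
\[ {\mathbb{E}_{\mathbb{P}}}\left(\Theta |{W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Gamma \right)=\Lambda \hspace{2.5pt}\hspace{2.5pt}\text{and}\hspace{2.5pt}\hspace{2.5pt}{\mathbb{E}_{\mathbb{P}}}\left({U_{c}}(\Theta ,{S_{{t_{2}}}})|{W_{[0,{t_{1}}]}},{W_{{t_{2}}}},\Gamma \right)={U_{c}}(\Lambda ,{S_{{t_{2}}}}),\]
where the second equality uses that ${U_{c}}(\cdot ,y)$ is affine on each interval $({a_{n}}(y),{b_{n}}(y))$, while outside these intervals $\Theta =\Lambda $.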
Let $\mathcal{G}$ be the σ–algebra generated by ${W_{t}},t\in [0,{t_{1}}]\cup \{{t_{2}}\}$ and Γ. From the Bayes theorem, the tower property for conditional expectation and (4) we get
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\mathbb{E}_{\mathbb{Q}}}\left(\Theta |{\mathcal{F}_{{t_{1}}}}\right)={\mathbb{E}_{\mathbb{P}}}\left(\frac{\Theta {Z_{{t_{2}}}}}{{Z_{{t_{1}}}}}|{\mathcal{F}_{{t_{1}}}}\right)\\ {} & \displaystyle ={\mathbb{E}_{\mathbb{P}}}\left({\mathbb{E}_{\mathbb{P}}}\left(\frac{\Theta {Z_{{t_{2}}}}}{{Z_{{t_{1}}}}}|\mathcal{G}\right)|{\mathcal{F}_{{t_{1}}}}\right)={\mathbb{E}_{\mathbb{P}}}\left(\frac{\Lambda {Z_{{t_{2}}}}}{{Z_{{t_{1}}}}}|{\mathcal{F}_{{t_{1}}}}\right)={\mathbb{E}_{\mathbb{Q}}}\left(\Lambda |{\mathcal{F}_{{t_{1}}}}\right).\end{array}\]
Thus $\Theta \in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$. Finally, let us notice that $U(\Theta ,{S_{{t_{2}}}})={U_{c}}(\Theta ,{S_{{t_{2}}}})$, and so, from the tower property of conditional expectation and the fact that ${U_{c}}(\cdot ,y)$ is affine on each of the intervals $({a_{n}}(y),{b_{n}}(y))$ we obtain
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\mathbb{E}_{\mathbb{P}}}\left[U(\Theta ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]={\mathbb{E}_{\mathbb{P}}}\left[{\mathbb{E}_{\mathbb{P}}}\left(U(\Theta ,{S_{{t_{2}}}})|\mathcal{G}\right)|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle ={\mathbb{E}_{\mathbb{P}}}\left[{\mathbb{E}_{\mathbb{P}}}\left({U_{c}}(\Theta ,{S_{{t_{2}}}})|\mathcal{G}\right)|{\mathcal{F}_{{t_{1}}}}\right]={\mathbb{E}_{\mathbb{P}}}\left[{U_{c}}(\Lambda ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\end{array}\]
This together with (8) completes the proof.  □
We arrive at the following corollary.
Corollary 1.
Let $B:[0,\infty )\times (0,\infty )\to \mathbb{R}$ be a measurable function such that for any $y>0$, $B(\cdot ,y)$ is a bounded, nonincreasing and continuous function. Let ${B^{c}}:[0,\infty )\times (0,\infty )\to \mathbb{R}$ be the convex envelope of B with respect to the first variable.
(i). Let $0\le {t_{1}}<{t_{2}}\le T$ and let ${\Theta _{1}}\ge 0$ be an ${\mathcal{F}_{{t_{1}}}}$–measurable random variable. Assume that there exists a function $G:\mathbb{R}\to \mathbb{R}$ such that $|B(x,y)|\le G(y)$ for all x, y and ${\mathbb{E}_{\mathbb{P}}}[G({S_{{t_{2}}}})]<\infty $. Then there exists a random variable $\Theta \in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})$ such that
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\mathbb{E}_{\mathbb{P}}}\left[B(\Theta ,{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle =ess{\inf _{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}}{\mathbb{E}_{\mathbb{P}}}\left[B({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right]\\ {} & \displaystyle =ess{\inf _{{\Theta _{2}}\in {\mathcal{H}_{{t_{1}},{t_{2}}}}({\Theta _{1}})}}{\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}({\Theta _{2}},{S_{{t_{2}}}})|{\mathcal{F}_{{t_{1}}}}\right].\end{array}\]
(ii). Let ${t_{1}}=0$. The function $b:[0,\infty )\to \mathbb{R}$ which is defined by
\[ b(x):=\underset{{\Theta _{2}}\in {\mathcal{H}_{0,{t_{2}}}}(x)}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[B({\Theta _{2}},{S_{{t_{2}}}})\right]\hspace{2.5pt}\left(=\underset{{\Theta _{2}}\in {\mathcal{H}_{0,{t_{2}}}}(x)}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}({\Theta _{2}},{S_{{t_{2}}}})\right]\hspace{2.5pt}\text{by part (i)}\right)\]
is convex and continuous.
Proof.
(i). The result follows immediately by applying Lemma 1 to $U:=-B$.
(ii). The convexity of b follows from the convexity of ${B^{c}}$ in the first variable and the fact that for any ${x_{1}},{x_{2}}\ge 0$ and $\lambda \in (0,1)$,
\[ \lambda \mathcal{A}({x_{1}})+(1-\lambda )\mathcal{A}({x_{2}})\subset \mathcal{A}(\lambda {x_{1}}+(1-\lambda ){x_{2}}).\]
In particular b is continuous on $(0,\infty )$. It remains to prove continuity at $x=0$. Since B is nonincreasing in the first variable, b is nonincreasing as well. Thus, it is sufficient to show that $b(0)\le {\lim \nolimits_{n\to \infty }}b(1/n)$. To that end, choose ${\Theta ^{(n)}}\in {\mathcal{H}_{0,{t_{2}}}}(1/n)$, $n\in \mathbb{N}$ such that
\[ \underset{n\to \infty }{\lim }{\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}({\Theta ^{(n)}},{S_{{t_{2}}}})\right]=\underset{n\to \infty }{\lim }b(1/n).\]
From Lemma A1.1 in [4] we obtain a sequence ${\Lambda ^{(m)}}\in conv({\Theta ^{(m)}},{\Theta ^{(m+1)}},\dots )$, $m\in \mathbb{N}$ converging $\mathbb{P}$ a.s. to a random variable Λ. The Fatou lemma implies that $\Lambda =0$, and so by applying the dominated convergence theorem together with the convexity and continuity of ${B^{c}}$ in the first variable we get
\[ b(0)={\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}(0,{S_{{t_{2}}}})\right]={\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}(\Lambda ,{S_{{t_{2}}}})\right]=\underset{n\to \infty }{\lim }{\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}({\Lambda ^{(n)}},{S_{{t_{2}}}})\right]\le \underset{n\to \infty }{\lim }{\mathbb{E}_{\mathbb{P}}}\left[{B^{c}}({\Theta ^{(n)}},{S_{{t_{2}}}})\right]=\underset{n\to \infty }{\lim }b(1/n),\]
and continuity follows.  □
Now we are ready to prove Theorem 1.
Proof.
Let $x\ge 0$. For any $\pi \in \mathcal{A}(x)$ we define ${R_{\mathbb{T}}}(\pi )$ as in (2) where the infimum and the supremum are taken over the set ${\mathcal{T}_{\mathbb{T}}}$.
Moreover, define the random variables ${\Psi _{k}^{\pi }}$, $k=0,1,\dots ,n$ by
\[ {\Psi _{n}^{\pi }}:={\left({Y_{{T_{n}}}}-{V_{{T_{n}}}^{\pi }}\right)^{+}}\]
and for $k=0,1,\dots ,n-1$ by the recursive relations
(9)
\[ {\Psi _{k}^{\pi }}:=\min \left({\left({X_{{T_{k}}}}-{V_{{T_{k}}}^{\pi }}\right)^{+}},\max \left({\left({Y_{{T_{k}}}}-{V_{{T_{k}}}^{\pi }}\right)^{+}},{\mathbb{E}_{\mathbb{P}}}({\Psi _{k+1}^{\pi }}|{\mathcal{F}_{{T_{k}}}})\right)\right).\]
In view of (5) the random variables ${\Psi _{k}^{\pi }}$, $k=0,1,\dots ,n$ are well defined. From the standard theory of zero–sum Dynkin games (see [15]) it follows that
\[ {R_{\mathbb{T}}}(\pi )={\Psi _{0}^{\pi }}.\]
Moreover, for the stopping time
\[ \sigma :=T\wedge \min \left\{t\in \mathbb{T}:\hspace{2.5pt}{\Psi _{t}^{\pi }}={\left({X_{t}}-{V_{t}^{\pi }}\right)^{+}}\right\}\]
we have ${R_{\mathbb{T}}}(\pi )={R_{\mathbb{T}}}(\pi ,\sigma )$. Thus, in order to conclude the proof we need to show that there exists $\hat{\pi }\in \mathcal{A}(x)$ such that
\[ {R_{\mathbb{T}}}(\hat{\pi })=\underset{\pi \in \mathcal{A}(x)}{\inf }{R_{\mathbb{T}}}(\pi ).\]
We apply dynamic programming. Introduce the functions ${B_{k}}:[0,\infty )\times (0,\infty )\to \mathbb{R}$, $k=0,1,\dots ,n$ by
\[ {B_{n}}(z,y):={\left({f_{n}}(y)-z\right)^{+}}\]
and for $k=0,1,\dots ,n-1$ by the recursive relations
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {B_{k}}(z,y)=\min \left({\left({g_{k}}(y)-z\right)^{+}},\max \left({\left({f_{k}}(y)-z\right)^{+}},\right.\right.\\ {} & \displaystyle \left.\left.{\inf _{{\Theta _{k+1}}\in {\mathcal{H}_{0,{T_{k+1}}-{T_{k}}}}(z)}}{\mathbb{E}_{\mathbb{P}}}\left[{B_{k+1}}({\Theta _{k+1}},y{S_{{T_{k+1}}-{T_{k}}}})\right]\right)\right).\end{array}\]
Let us argue by backward induction that for any k, ${B_{k}}(z,y)$ is jointly measurable, and that for any y the function ${B_{k}}(\cdot ,y)$ is continuous and nonincreasing. For $k=n$ this is clear. Assume that the statement holds for $k+1$; let us prove it for k. From Corollary 1(ii) it follows that for any y the function ${B_{k}}(\cdot ,y)$ is continuous and nonincreasing. For any $z>0$ the measurability of the function ${B_{k}}(z,\cdot )$ follows from the fact that the set ${\mathcal{H}_{0,{T_{k+1}}-{T_{k}}}}(z)$ is separable (with respect to convergence in probability). Since ${B_{k}}$ is continuous in the first variable we conclude joint measurability and complete the argument.
Next, from Corollary 1(i) it follows that we can construct a sequence of random variables ${D_{0}},{D_{1}},\dots ,{D_{n}}$ such that ${D_{0}}=x$ and for any $k=1,\dots ,n$, ${D_{k}}\in {\mathcal{H}_{{T_{k-1}},{T_{k}}}}({D_{k-1}})$ satisfies
(11)
\[ {\mathbb{E}_{\mathbb{P}}}\left[{B_{k}}({D_{k}},{S_{{T_{k}}}})|{\mathcal{F}_{{T_{k-1}}}}\right]=ess\underset{{\Theta _{k}}\in {\mathcal{H}_{{T_{k-1}},{T_{k}}}}({D_{k-1}})}{\inf }\hspace{2.5pt}{\mathbb{E}_{\mathbb{P}}}\left[{B_{k}}({\Theta _{k}},{S_{{T_{k}}}})|{\mathcal{F}_{{T_{k-1}}}}\right].\]
Since ${B_{k}}$, $k=0,1,\dots ,n$ are nonincreasing in the first variable, without loss of generality we may assume that ${\mathbb{E}_{\mathbb{Q}}}[{D_{k}}|{\mathcal{F}_{{T_{k-1}}}}]={D_{k-1}}$ for all k.
Finally, the completeness of the BS model implies that there exists $\hat{\pi }\in \mathcal{A}(x)$ such that ${V_{{T_{k}}}^{\hat{\pi }}}={D_{k}}$ for all $k=0,1,\dots ,n$. Observe that $\frac{{S_{{T_{k}}}}}{{S_{{T_{k-1}}}}}$ is independent of ${\mathcal{F}_{{T_{k-1}}}}$ and has the same distribution as ${S_{{T_{k}}-{T_{k-1}}}}$. Thus, from (9) and (11) we obtain (by backward induction)
(12)
\[ {B_{k}}({V_{{T_{k}}}^{\hat{\pi }}},{S_{{T_{k}}}})={\Psi _{k}^{\hat{\pi }}}\hspace{2.5pt}\hspace{2.5pt}\text{a.s}.\hspace{2.5pt}\hspace{2.5pt}\forall k=0,1,\dots ,n.\]
On the other hand, for an arbitrary $\pi \in \mathcal{A}(x)$ we have ${V_{{T_{k}}}^{\pi }}\in {\mathcal{H}_{{T_{k-1}},{T_{k}}}}({V_{{T_{k-1}}}^{\pi }})$, $k=1,\dots ,n$. Hence, arguments similar to those leading to (12) yield
\[ {B_{k}}({V_{{T_{k}}}^{\pi }},{S_{{T_{k}}}})\le {\Psi _{k}^{\pi }}\hspace{2.5pt}\hspace{2.5pt}\text{a.s}.\hspace{2.5pt}\hspace{2.5pt}\forall k=0,1,\dots ,n.\]
In particular, for $k=0$ we get ${R_{\mathbb{T}}}(\hat{\pi })={\Psi _{0}^{\hat{\pi }}}={B_{0}}(x,{S_{0}})\le {\Psi _{0}^{\pi }}={R_{\mathbb{T}}}(\pi )$ for any $\pi \in \mathcal{A}(x)$, and the proof is complete.  □
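The recursion (9) is easy to implement once the conditional expectations ${\mathbb{E}_{\mathbb{P}}}({\Psi _{k+1}^{\pi }}|{\mathcal{F}_{{T_{k}}}})$ can be computed. The sketch below is an illustration only, not the construction used in the proof: it replaces the BS model by a coarse binomial approximation under the market measure, takes the trivial portfolio with constant wealth ${V_{t}^{\pi }}\equiv x$ (i.e. $\gamma \equiv 0$) and hypothetical put–style payoffs ${f_{k}}(s)={(K-s)^{+}}$, ${g_{k}}(s)={f_{k}}(s)+\delta $, and evaluates ${\Psi _{0}^{\pi }}={R_{\mathbb{T}}}(\pi )$ by the backward recursion (9).

```python
import numpy as np

# illustrative parameters (all hypothetical)
S0, kappa, theta = 1.0, 0.3, 0.1
T, n, x = 1.0, 50, 0.05          # horizon, number of exercise dates, initial capital
K, delta = 1.0, 0.02             # put strike and cancellation penalty

dt = T / n
u, d = np.exp(kappa * np.sqrt(dt)), np.exp(-kappa * np.sqrt(dt))
p = (np.exp(theta * dt) - d) / (u - d)   # real-world (P) up-probability matching the drift

f = lambda s: np.maximum(K - s, 0.0)     # Y-payoff
g = lambda s: f(s) + delta               # X-payoff, g >= f

def stock_levels(k):
    """Binomial stock values at step k, ordered from k up-moves down to 0 up-moves."""
    j = np.arange(k, -1, -1)
    return S0 * u**j * d**(k - j)

# terminal condition: Psi_n = (Y_T - V_T)^+ with V == x
psi = np.maximum(f(stock_levels(n)) - x, 0.0)

# backward recursion (9): Psi_k = min((X_k - V)^+, max((Y_k - V)^+, E_P[Psi_{k+1} | F_k]))
for k in range(n - 1, -1, -1):
    cont = p * psi[:-1] + (1.0 - p) * psi[1:]   # one-step conditional expectation on the tree
    s = stock_levels(k)
    psi = np.minimum(np.maximum(g(s) - x, 0.0),
                     np.maximum(np.maximum(f(s) - x, 0.0), cont))

print("approximate shortfall risk of the trivial hedge:", psi[0])
```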
3 Example where no optimal strategy exists
In this section we consider a game option which can be exercised at any time in the interval $[0,1]$. The payoffs are given by
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {X_{t}}=(1+\sin (\pi t))\max ({Z_{t}},1/2),\hspace{2.5pt}\hspace{2.5pt}t\in [0,1]\\ {} & \displaystyle {Y_{1}}={X_{1}},\\ {} & \displaystyle {Y_{t}}=0,\hspace{2.5pt}\hspace{2.5pt}\text{for}\hspace{2.5pt}\hspace{2.5pt}t<1\end{array}\]
where ${Z_{t}}$ was defined in (3). Notice that ${\mathbb{E}_{\mathbb{P}}}[{\sup _{0\le t\le 1}}{X_{t}}]<\infty $.
Denote by $\mathcal{T}$ the set of all stopping times with values in the interval $[0,1]$. Obviously, the equalities ${Y_{[0,1)}}\equiv 0$ and ${Y_{1}}={X_{1}}$ imply that the buyer of the game option will not stop before $t=1$, and so the shortfall risk measure is given by
\[ R(\pi ,\sigma )={\mathbb{E}_{\mathbb{P}}}\left[{\left({X_{\sigma }}-{V_{\sigma }^{\pi }}\right)^{+}}\right].\]
As in (2), for a portfolio strategy π we have
\[ R(\pi ):=\underset{\sigma \in \mathcal{T}}{\inf }R(\pi ,\sigma )=\underset{\sigma \in \mathcal{T}}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[{\left({X_{\sigma }}-{V_{\sigma }^{\pi }}\right)^{+}}\right].\]
Similarly to Section 2, for an initial capital x we define
\[ R(x):=\underset{\pi \in \mathcal{A}(x)}{\inf }R(\pi )=\underset{(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}}{\inf }R(\pi ,\sigma ).\]
For any π the process ${\{{({X_{t}}-{V_{t}^{\pi }})^{+}}\}_{t=0}^{1}}$ is continuous, and so from the general theory of optimal stopping (see Section 6 in [9]) it follows that there exists $\sigma =\sigma (\pi )$ such that $R(\pi ,\sigma )=R(\pi )$. Consequently, the existence of an optimal hedging strategy is equivalent to the existence of an optimal portfolio strategy. We say that $\pi \in \mathcal{A}(x)$ is an optimal portfolio strategy if $R(\pi )=R(x)$.
We arrive at the main result.
Theorem 2.
Assume that $\vartheta \ne 0$ and define
(14)
\[ \nu :=\frac{1}{2}{\mathbb{E}_{\mathbb{P}}}\left[{Z_{1}}{\mathbb{I}_{{Z_{1}}<1/2}}\right].\]
Then for any initial capital $x\in (0,\nu )$ there is no optimal hedging strategy, i.e. there is no $(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}$ with $R(\pi ,\sigma )=R(x)$.
Remark 3.
If $\vartheta =0$ then $\mathbb{P}=\mathbb{Q}$. In this specific case (see Theorem 7.1 in [2]) there exists an optimal hedging strategy.
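The threshold ν is explicit: since ${Z_{1}}$ is the density of $\mathbb{Q}$ on ${\mathcal{F}_{1}}$, we have $\nu =\frac{1}{2}{\mathbb{E}_{\mathbb{P}}}[{Z_{1}}{\mathbb{I}_{{Z_{1}}<1/2}}]=\frac{1}{2}\mathbb{Q}({Z_{1}}<1/2)$, and a direct Gaussian computation (not carried out in the paper) gives $\nu =\frac{1}{2}\Phi \left(-\frac{\ln 2}{|\vartheta /\kappa |}-\frac{|\vartheta /\kappa |}{2}\right)$, where Φ is the standard normal distribution function. The sketch below (with illustrative parameters) checks this against a plain Monte Carlo estimate.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)
kappa, theta = 0.3, 0.1            # illustrative parameters, theta != 0
a = theta / kappa                  # market price of risk

# Monte Carlo estimate of nu = 0.5 * E_P[Z_1 * 1_{Z_1 < 1/2}]
W1 = rng.standard_normal(10**6)
Z1 = np.exp(-a * W1 - 0.5 * a**2)
nu_mc = 0.5 * np.mean(Z1 * (Z1 < 0.5))

# closed form: nu = 0.5 * Q(Z_1 < 1/2) = 0.5 * Phi(-ln(2)/|a| - |a|/2)
nu_cf = 0.5 * norm.cdf(-np.log(2.0) / abs(a) - abs(a) / 2.0)

print(nu_mc, nu_cf)   # both small here: with |a| = 1/3 the event {Z_1 < 1/2} is unlikely
```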
Remark 4.
Let us notice that for a given stopping time $\sigma \in \mathcal{T}$ the functional
\[ \pi \to R(\pi ,\sigma ):=\underset{\tau \in \mathcal{T}}{\sup }{\mathbb{E}_{\mathbb{P}}}\left[{\left(H(\sigma ,\tau )-{V_{\sigma \wedge \tau }^{\pi }}\right)^{+}}\right]\]
is convex. Thus, by following the same arguments as in [14] (which are based on the Komlós lemma) one can prove that (for a given σ) the infimum in the expression ${\inf _{\pi \in \mathcal{A}(x)}}R(\pi ,\sigma )$ is attained. Hence, Theorem 2 implies that for any $x\in (0,\nu )$ we have the following:
\[ R(x)<\underset{\pi \in \mathcal{A}(x)}{\min }R(\pi ,\sigma )\hspace{2.5pt}\hspace{2.5pt}\text{for any}\hspace{2.5pt}\hspace{2.5pt}\sigma \in \mathcal{T},\]
that is, the outer infimum over the cancellation times is never attained.
Before we prove Theorem 2 we will need some auxiliary results. We start with the following lemma.
Lemma 2.
The function $R:[0,\infty )\to [0,\infty )$ is convex and continuous. Namely, the shortfall risk measure R is convex and continuous as a function of the initial capital.
Proof.
The proof will be done by approximating $R(\cdot )$. For any $n\in \mathbb{N}$ let ${\mathcal{T}_{n}}$ be the set of all stopping times with values in the set $\{1/n,2/n,\dots ,1\}$ (0 is not included). Set,
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {R_{n}}(\pi ):={\inf _{\sigma \in {\mathcal{T}_{n}}}}R(\pi ,\sigma )\\ {} & \displaystyle {R_{n}}(x):={\inf _{\sigma \in {\mathcal{T}_{n}}}}{\inf _{\pi \in \mathcal{A}(x)}}R(\pi ,\sigma ).\end{array}\]
We argue that ${R_{n}}$ converge uniformly to R. First, we have the obvious observation ${R_{n}}(\cdot )\ge R(\cdot )$. Next, let $x\ge 0$ and $(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}$. Define ${\sigma _{n}}\in {\mathcal{T}_{n}}$ by
\[ {\sigma _{n}}:=\min \left\{t\in \{1/n,2/n,\dots ,1\}:\hspace{2.5pt}t\ge \sigma \right\}.\]
Clearly, ${\sigma _{n}}\ge \sigma $. Thus, there exists a portfolio ${\pi _{n}}\in \mathcal{A}(x)$ such that ${V_{{\sigma _{n}}}^{{\pi _{n}}}}={V_{\sigma }^{\pi }}$. From the inequality ${\sigma _{n}}-\sigma \le 1/n$ we obtain
\[ R({\pi _{n}},{\sigma _{n}})-R(\pi ,\sigma )\le {\mathbb{E}_{\mathbb{P}}}\left[|{X_{\sigma }}-{X_{{\sigma _{n}}}}|\right]\le {\mathbb{E}_{\mathbb{P}}}\left[\underset{|t-s|\le 1/n}{\sup }|{X_{t}}-{X_{s}}|\right].\]
Since $(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}$ was arbitrary we conclude that
\[ 0\le {R_{n}}(x)-R(x)\le {\mathbb{E}_{\mathbb{P}}}\left[\underset{|t-s|\le 1/n}{\sup }|{X_{t}}-{X_{s}}|\right]\hspace{2.5pt}\hspace{2.5pt}\text{for all}\hspace{2.5pt}\hspace{2.5pt}x\ge 0.\]
From the dominated convergence theorem
\[ \underset{n\to \infty }{\lim }{\mathbb{E}_{\mathbb{P}}}\left[\underset{|t-s|\le 1/n}{\sup }|{X_{t}}-{X_{s}}|\right]=0\]
and uniform convergence follows.
It remains to argue that for any n the function ${R_{n}}:[0,\infty )\to [0,\infty )$ is convex and continuous. Fix $n\in \mathbb{N}$. For any $k=1,\dots ,n$ let ${g_{k}}:(0,\infty )\to (0,\infty )$ be such that ${X_{k/n}}={g_{k}}({S_{k/n}})$. Introduce the functions ${\hat{B}_{k}}:[0,\infty )\times (0,\infty )\to \mathbb{R}$, $k=0,1,\dots ,n$ by
\[ {\hat{B}_{n}}(z,y):={\left({g_{n}}(y)-z\right)^{+}},\]
for $k=1,\dots ,n-1$ by the recursive relations
\[ {\hat{B}_{k}}(z,y)=\min \left({\left({g_{k}}(y)-z\right)^{+}},\underset{{\Theta _{k+1}}\in {\mathcal{H}_{0,1/n}}(z)}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[{\hat{B}_{k+1}}({\Theta _{k+1}},y{S_{1/n}})\right]\right),\]
and for $k=0$
(15)
\[ {\hat{B}_{0}}(z,y)=\underset{{\Theta _{1}}\in {\mathcal{H}_{0,1/n}}(z)}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[{\hat{B}_{1}}({\Theta _{1}},y{S_{1/n}})\right].\]
Observe that ${R_{n}}(\cdot )$ is “almost” the same as ${R_{\mathbb{T}}}(\cdot )$ defined in Section 2 for the set $\mathbb{T}:=\{0,1/n,2/n,\dots ,1\}$; the only difference is that for ${R_{n}}(x)$ stopping at zero is not allowed. This is why in (15) we do not take the minimum with ${({g_{0}}(y)-z)^{+}}$. Using similar arguments as in the proof of Theorem 1 we obtain that ${R_{n}}(x)={\hat{B}_{0}}(x,{S_{0}})$. Finally, from Corollary 1(ii) we get that for any y, ${\hat{B}_{0}}(\cdot ,y)$ is convex and continuous. This completes the proof.  □
Next, we observe that for any stopping time $\sigma \in \mathcal{T}$ and $\lambda >0$
(16)
\[ \underset{\Upsilon \ge 0}{\inf }\left[{({X_{\sigma }}-\Upsilon )^{+}}+\lambda {Z_{\sigma }}\Upsilon \right]={X_{\sigma }}\min (1,\lambda {Z_{\sigma }}).\]
Indeed, for each fixed ω the expression in the brackets is piecewise linear in Υ, and the infimum is attained at $\Upsilon ={X_{\sigma }}{\mathbb{I}_{\lambda {Z_{\sigma }}<1}}$ (a choice that is used below). This brings us to introducing the function
(17)
\[ F(\lambda )=\underset{\sigma \in \mathcal{T}}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[{X_{\sigma }}\min (1,\lambda {Z_{\sigma }})\right],\hspace{2.5pt}\hspace{2.5pt}\lambda >0.\]
Obviously $F:(0,\infty )\to [0,\infty )$ is concave and nondecreasing. Inspired by Corollary 8.3 in [13] we prove the following.
Lemma 3.
(i). For any $x\ge 0$ and $\lambda >0$ we have $R(x)\ge F(\lambda )-\lambda x$.
(ii). Let $\lambda >0$ be such that the derivative ${F^{\prime }}(\lambda )$ exists and set $x:={F^{\prime }}(\lambda )$. Then $R(x)=F(\lambda )-\lambda x$.
Proof.
(i). Let $x\ge 0$ and $\lambda >0$. Choose an arbitrary $(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}$. Then, from the super–martingale property of an admissible portfolio we have
(18)
\[ x={V_{0}^{\pi }}\ge {\mathbb{E}_{\mathbb{Q}}}[{V_{\sigma }^{\pi }}]={\mathbb{E}_{\mathbb{P}}}[{Z_{\sigma }}{V_{\sigma }^{\pi }}].\]
This together with (16) gives
\[ R(\pi ,\sigma )+\lambda x\ge {\mathbb{E}_{\mathbb{P}}}\left[{({X_{\sigma }}-{V_{\sigma }^{\pi }})^{+}}+\lambda {Z_{\sigma }}{V_{\sigma }^{\pi }}\right]\ge F(\lambda ).\]
Since $(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}$ was arbitrary we complete the proof of (i).
(ii). In view of (i), it is sufficient to show that $R(x)\le F(\lambda )-\lambda x$. Let ${\sigma _{\lambda }}\in \mathcal{T}$ be an optimal stopping time in (17), i.e.
(19)
\[ F(\lambda )={\mathbb{E}_{\mathbb{P}}}\left[{X_{{\sigma _{\lambda }}}}\min (1,\lambda {Z_{{\sigma _{\lambda }}}})\right].\]
Such a stopping time exists because the process ${\{{X_{t}}\min (1,\lambda {Z_{t}})\}_{t=0}^{1}}$ is continuous. Set ${\Upsilon _{\lambda }}={X_{{\sigma _{\lambda }}}}{\mathbb{I}_{{Z_{{\sigma _{\lambda }}}}<1/\lambda }}$. From (16) it follows that for any $\tilde{\lambda }>0$
\[ F(\tilde{\lambda })\le {\mathbb{E}_{\mathbb{P}}}\left[{({X_{{\sigma _{\lambda }}}}-{\Upsilon _{\lambda }})^{+}}+\tilde{\lambda }{Z_{{\sigma _{\lambda }}}}{\Upsilon _{\lambda }}\right].\]
On the other hand from (19)
\[ F(\lambda )={\mathbb{E}_{\mathbb{P}}}\left[{({X_{{\sigma _{\lambda }}}}-{\Upsilon _{\lambda }})^{+}}+\lambda {Z_{{\sigma _{\lambda }}}}{\Upsilon _{\lambda }}\right].\]
Thus,
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle \frac{F(\tilde{\lambda })-F(\lambda )}{\tilde{\lambda }-\lambda }\le {\mathbb{E}_{\mathbb{P}}}\left[{Z_{{\sigma _{\lambda }}}}{\Upsilon _{\lambda }}\right],\hspace{2.5pt}\hspace{2.5pt}\text{for}\hspace{2.5pt}\hspace{2.5pt}\tilde{\lambda }>\lambda \\ {} & \displaystyle \text{and}\hspace{2.5pt}\hspace{2.5pt}\frac{F(\tilde{\lambda })-F(\lambda )}{\tilde{\lambda }-\lambda }\ge {\mathbb{E}_{\mathbb{P}}}\left[{Z_{{\sigma _{\lambda }}}}{\Upsilon _{\lambda }}\right]\hspace{2.5pt}\hspace{2.5pt}\text{for}\hspace{2.5pt}\hspace{2.5pt}\tilde{\lambda }<\lambda .\end{array}\]
From the fact that ${F^{\prime }}(\lambda )=x$ we conclude that
\[ x={\mathbb{E}_{\mathbb{P}}}\left[{Z_{{\sigma _{\lambda }}}}{\Upsilon _{\lambda }}\right]={\mathbb{E}_{\mathbb{Q}}}\left[{\Upsilon _{\lambda }}\right].\]
The completeness of the BS model implies that there exists $\pi \in \mathcal{A}(x)$ such that ${V_{{\sigma _{\lambda }}}^{\pi }}={\Upsilon _{\lambda }}$. From (19) we get
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle R(x)+\lambda x\le R(\pi ,{\sigma _{\lambda }})+\lambda x={\mathbb{E}_{\mathbb{P}}}\left[{({X_{{\sigma _{\lambda }}}}-{\Upsilon _{\lambda }})^{+}}+\lambda {Z_{{\sigma _{\lambda }}}}{\Upsilon _{\lambda }}\right]\\ {} & \displaystyle ={\mathbb{E}_{\mathbb{P}}}\left[{X_{{\sigma _{\lambda }}}}\min (1,\lambda {Z_{{\sigma _{\lambda }}}})\right]=F(\lambda )\end{array}\]
as required.  □
While Lemmas 2–3 are quite general, the following lemma uses the explicit structure of the payoff process ${\{{X_{t}}\}_{t=0}^{1}}$.
Lemma 4.
(i). For any $\lambda \ge 2$, $F(\lambda )=1$.
(ii). The derivative of F from the left (exists because F is concave) satisfies ${F^{\prime }_{-}}(2)\ge \nu $ where ν is given by (14).
Proof.
(i). Let $\lambda \ge 2$. Obviously, ${\mathbb{E}_{\mathbb{P}}}[{Z_{\sigma }}]=1$ for all $\sigma \in \mathcal{T}$. Hence, from the simple formula $\max (z,1/2)\min (1,2z)\equiv z$ we obtain
\[ F(\lambda )\ge F(2)=\underset{\sigma \in \mathcal{T}}{\inf }{\mathbb{E}_{\mathbb{P}}}\left[{Z_{\sigma }}(1+\sin (\pi \sigma ))\right]\ge 1.\]
On the other hand, taking $\sigma \equiv 0$ in (17), we get $F(\lambda )\le 1$, and so $F\equiv 1$ on the interval $[2,\infty )$.
(ii). Choose $\lambda <2$. Clearly (taking $\sigma \equiv 1$ in (17)),
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle F(\lambda )\le {\mathbb{E}_{\mathbb{P}}}\left[\max ({Z_{1}},1/2)\min (1,\lambda {Z_{1}})\right]\\ {} & \displaystyle \le {\mathbb{E}_{\mathbb{P}}}\left[{Z_{1}}{\mathbb{I}_{{Z_{1}}>1/2}}+\frac{\lambda }{2}{Z_{1}}{\mathbb{I}_{{Z_{1}}<1/2}}\right]=1-\frac{2-\lambda }{2}{\mathbb{E}_{\mathbb{P}}}\left[{Z_{1}}{\mathbb{I}_{{Z_{1}}<1/2}}\right].\end{array}\]
This together with the equality $F(2)=1$ gives ${F^{\prime }_{-}}(2)\ge \nu $.  □
Now we have all the ingredients for the proof of Theorem 2.
Proof.
From Lemma 3(i) (applied with $\lambda =2$) and Lemma 4(i) it follows that for any $x\ge 0$
\[ R(x)\ge F(2)-2x=1-2x.\]
Let us prove that
(20)
\[ R(x)=1-2x\hspace{2.5pt}\hspace{2.5pt}\text{for all}\hspace{2.5pt}\hspace{2.5pt}x\in [0,{F^{\prime }_{-}}(2)].\]
Since R is convex (Lemma 2) then it is sufficient to show that $R(0)\le 1$ and $R({F^{\prime }_{-}}(2))\le 1-2{F^{\prime }_{-}}(2)$.
The first inequality is trivial, $R(0)\le {X_{0}}=1$. Let us show the second inequality. The concavity of F implies that there exists a sequence ${\lambda _{n}}\uparrow 2$ such that for any n the derivative ${F^{\prime }}({\lambda _{n}})$ exists. Hence, from the continuity of R (Lemma 2), the concavity of F and Lemma 3(ii) we obtain
\[ R({F^{\prime }_{-}}(2))=\underset{n\to \infty }{\lim }R({F^{\prime }}({\lambda _{n}}))=\underset{n\to \infty }{\lim }[F({\lambda _{n}})-{\lambda _{n}}{F^{\prime }}({\lambda _{n}})]=1-2{F^{\prime }_{-}}(2)\]
and (20) follows.
Next, let $x\in (0,\nu )$. Assume by contradiction that there exists a hedging strategy $(\pi ,\sigma )\in \mathcal{A}(x)\times \mathcal{T}$ such that $R(\pi ,\sigma )=R(x)$. From Lemma 4(ii) and (20) we obtain
(21)
\[ R(\pi ,\sigma )+2x=R(x)+2x=1.\]
Observe that if σ takes on values (with positive probability) in the interval $(0,1)$ then
\[ {\mathbb{E}_{\mathbb{P}}}\left[{X_{\sigma }}\min (1,2{Z_{\sigma }})\right]={\mathbb{E}_{\mathbb{P}}}\left[{Z_{\sigma }}(1+\sin (\pi \sigma ))\right]>{\mathbb{E}_{\mathbb{P}}}[{Z_{\sigma }}]=1.\]
Thus, from (16) and (18)
\[ R(\pi ,\sigma )+2x\ge {\mathbb{E}_{\mathbb{P}}}\left[{({X_{\sigma }}-{V_{\sigma }^{\pi }})^{+}}+2{Z_{\sigma }}{V_{\sigma }^{\pi }}\right]>1\]
which is a contradiction to (21). On the other hand, if $\sigma \equiv 0$ then (recall that ${X_{0}}=1$ and ${V_{0}^{\pi }}=x<\nu <1/2$)
\[ R(\pi ,\sigma )+2x={(1-x)^{+}}+2x=1+x>1,\]
also a contradiction to (21).
We conclude that the only remaining possibility is $\sigma \equiv 1$. Let us show that there is a contradiction in this case as well. Introduce the event
\[ A:=\{{Z_{1}}<1/2\}\cap \{{V_{1}^{\pi }}<1/2\}.\]
Observe that on the event A we have
(22)
\[ {({X_{1}}-{V_{1}^{\pi }})^{+}}+2{Z_{1}}{V_{1}^{\pi }}=(1/2-{Z_{1}})(1-2{V_{1}^{\pi }})+{Z_{1}}>{Z_{1}}={X_{1}}\min (1,2{Z_{1}}).\]
From (18) and the fact that $x<\nu $ it follows that
\[ {\mathbb{E}_{\mathbb{P}}}[{Z_{1}}{V_{1}^{\pi }}]<\nu =\frac{1}{2}{\mathbb{E}_{\mathbb{P}}}\left[{Z_{1}}{\mathbb{I}_{{Z_{1}}<1/2}}\right].\]
This together with the inequality ${V_{1}^{\pi }}\ge 0$ gives $\mathbb{P}(A)>0$. Thus, by combining (16), (18) and (22) we obtain
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle R(\pi ,\sigma )+2x={\mathbb{E}_{\mathbb{P}}}\left[{({X_{1}}-{V_{1}^{\pi }})^{+}}+2{Z_{1}}{V_{1}^{\pi }}\right]\\ {} & \displaystyle >{\mathbb{E}_{\mathbb{P}}}\left[{X_{1}}\min (1,2{Z_{1}})\right]={\mathbb{E}_{\mathbb{P}}}[{Z_{1}}]=1\end{array}\]
which is a contradiction to (21).  □
We end this section with the following two remarks.
Remark 5.
The message of Theorem 2 is that the infimum in (2), which ruins the convexity of the shortfall risk functional $R(\pi )$, can lead to the nonexistence of an optimal strategy. Observe that in the above constructed example the payoff process X is continuous, while the payoff process Y has a positive jump at the maturity date.
One can ask: what if we require both of the payoff processes X and Y to be continuous, is there a counterexample in this case as well?
The answer is yes. Let us apply Theorem 2 in order to construct a counterexample with continuous payoffs.
Consider a simple BS financial market with time horizon $T=2$ which consists of a riskless savings account bearing zero interest and of a risky asset S, whose value at time t is given by
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {S_{t}}={S_{0}}\exp \left(\kappa {W_{t}}+(\vartheta -{\kappa ^{2}}/2)t\right),\hspace{2.5pt}\hspace{2.5pt}t\in [0,1]\\ {} & \displaystyle {S_{t}}={S_{1}}\exp \left(\kappa ({W_{t}}-{W_{1}})-{\kappa ^{2}}(t-1)/2\right),\hspace{2.5pt}\hspace{2.5pt}t\in (1,2]\end{array}\]
where, as before, ${S_{0}},\kappa >0$ and $\vartheta \ne 0$ are constants. Namely, this is a BS model which has a drift jump at $t=1$. Obviously this market is complete and the unique martingale measure is given by $\frac{d\mathbb{Q}}{d\mathbb{P}}{|_{{\mathcal{F}_{t}}}}:={Z_{t\wedge 1}}$ where ${Z_{t}}$ is given by (3). Consider a game option with the continuous payoffs
\[\begin{array}{r@{\hskip10.0pt}c}& \displaystyle {\hat{X}_{t}}=(1+\sin (\pi t))\max ({Z_{t}},1/2),\hspace{2.5pt}\hspace{2.5pt}t\in [0,1]\\ {} & \displaystyle {\hat{X}_{t}}={\hat{X}_{1}},\hspace{2.5pt}\hspace{2.5pt}t\in (1,2],\\ {} & \displaystyle {\hat{Y}_{t}}=0,\hspace{2.5pt}\hspace{2.5pt}t\in [0,1]\\ {} & \displaystyle {\hat{Y}_{t}}=(t-1){\hat{X}_{1}},\hspace{2.5pt}\hspace{2.5pt}t\in (1,2].\end{array}\]
Denote by $\hat{R}$ the corresponding shortfall risk. We argue that for an initial capital $0<x<\nu :=\frac{1}{2}{\mathbb{E}_{\mathbb{P}}}\left[{Z_{1}}{\mathbb{I}_{{Z_{1}}<1/2}}\right]$ there is no optimal hedging strategy.
Indeed, let π be an admissible portfolio strategy and σ be a stopping time with values in the interval $[0,2]$. From the super–martingale property of the portfolio value and the fact that Z is a constant random variable after $t=1$ we obtain
\[ {V_{\sigma \wedge 1}^{\pi }}\ge {\mathbb{E}_{\mathbb{P}}}[{V_{\sigma }^{\pi }}|{\mathcal{F}_{1}}].\]
This together with the Jensen inequality and the fact that $\hat{X}$ is a constant random variable after $t=1$ gives
(23)
\[ {\mathbb{E}_{\mathbb{P}}}\left[{({\hat{X}_{\sigma \wedge 1}}-{V_{\sigma \wedge 1}^{\pi }})^{+}}\right]\le {\mathbb{E}_{\mathbb{P}}}\left[{\mathbb{E}_{\mathbb{P}}}\left[{({\hat{X}_{\sigma }}-{V_{\sigma }^{\pi }})^{+}}|{\mathcal{F}_{1}}\right]\right]={\mathbb{E}_{\mathbb{P}}}\left[{({\hat{X}_{\sigma }}-{V_{\sigma }^{\pi }})^{+}}\right].\]
From (23) and the relations ${\hat{Y}_{[0,1]}}\equiv 0$, ${\hat{Y}_{2}}={\hat{X}_{2}}$ we obtain
\[ \hat{R}(\pi ,\sigma \wedge 1)={\mathbb{E}_{\mathbb{P}}}\left[{({\hat{X}_{\sigma \wedge 1}}-{V_{\sigma \wedge 1}^{\pi }})^{+}}\right]\le {\mathbb{E}_{\mathbb{P}}}\left[{({\hat{X}_{\sigma }}-{V_{\sigma }^{\pi }})^{+}}\right]\le \hat{R}(\pi ,\sigma ).\]
Namely, we can restrict the investor to stopping times in the interval $[0,1]$, but this is exactly the setup that was studied in Theorem 2. From Theorem 2 we conclude that there is no optimal hedging strategy for $x\in (0,\nu )$.
Remark 6.
From Theorem 1 it follows that for any n there exists a hedging strategy $({\pi _{n}},{\sigma _{n}})\in \mathcal{A}(x)\times {\mathcal{T}_{n}}$ such that
\[ {R_{n}}(x)={\mathbb{E}_{\mathbb{P}}}\left[{\left({X_{{\sigma _{n}}}}-{V_{{\sigma _{n}}}^{{\pi _{n}}}}\right)^{+}}\right]\]
where ${R_{n}}$ was defined at the beginning of the proof of Lemma 2.
Theorem 2 implies that we should not expect these optimal hedging strategies $({\pi _{n}},{\sigma _{n}})\in \mathcal{A}(x)\times {\mathcal{T}_{n}}$, $n\in \mathbb{N}$ to converge in a strong sense as n goes to infinity. Indeed, if these hedging strategies converged in a strong sense, then we could argue that the limit is an optimal hedging strategy for the continuous time problem, a contradiction to Theorem 2 (at least for $x\in (0,\nu )$).
By applying the weak convergence theory we can show that $({\pi _{n}},{\sigma _{n}})\in \mathcal{A}(x)\times {\mathcal{T}_{n}}$, $n\in \mathbb{N}$ has a cluster point (with respect to convergence in law). However, in view of Theorem 2 we conclude that the representation of such a cluster point will require an enlargement of the probability space, i.e. an additional randomization.
An interesting question, which is left for future research, is whether allowing the investor to randomize from the start (in the spirit of [1]) provides the existence of an optimal hedging strategy.