Exponential utility maximization in small/large financial markets

Rásonyi, Miklós; Sayit, Hasanjan

doi:10.15559/24-VMSTA270

Abstract

Obtaining a utility-maximizing optimal portfolio in a closed form is a challenging issue when the return vector follows a more general distribution than the normal one. In this paper, for markets based on finitely many assets, a closed-form expression is given for optimal portfolios that maximize an exponential utility function when the return vector follows normal mean-variance mixture models. Especially, the used approach expresses the closed-form solution in terms of the Laplace transformation of the mixing distribution of the normal mean-variance mixture model and no distributional assumptions on the mixing distribution are made.

Also considered are large financial markets based on normal mean-variance mixture models, and it is shown that the optimal exponential utilities in small markets converge to the optimal exponential utility in the large financial market. This shows, in particular, that to reach the best utility level investors need to diversify their investments to include infinitely many assets into their portfolio, and with portfolios based on only finitely many assets they will never be able to reach the optimum level of utility.

1 Introduction

We consider a frictionless financial market with $d+1$ assets. We assume the first asset is a risk-free asset with risk-free interest rate ${r_{f}}$ and the remaining d assets are risky assets with returns modeled by a d-dimensional random vector X. In this note, we assume that X follows a normal mean-variance mixture (NMVM) distribution,

(1)

\[ X\stackrel{d}{=}\mu +\gamma Z+\sqrt{Z}AN,\]

where $\mu \in {\mathbb{R}^{d}}$ is location parameter, $\gamma \in {\mathbb{R}^{d}}$ controls the skewness, $Z\sim G$ is a nonnegative random variable with distribution function G, $A\in {\mathbb{R}^{d\times d}}$ is a symmetric and positive definite $d\times d$ matrix of real numbers, $N\sim N(0,I)$ is a d-dimensional Gaussian random vector with identity covariance matrix I in ${\mathbb{R}^{d}}\times {\mathbb{R}^{d}}$, and N is independent of the mixing distribution Z.

In this paper we use the following notations. For any vectors $x={({x_{1}},{x_{2}},\dots ,{x_{d}})^{T}}$ and $y={({y_{1}},{y_{2}},\dots ,{y_{d}})^{T}}$ in ${\mathbb{R}^{d}}$, where the superscript T stands for the transpose of a vector, $\lt x,y\gt ={x^{T}}y={\textstyle\sum _{i=1}^{d}}{x_{i}}{y_{i}}$ denotes the scalar product of the vectors x and y, and $|x|=\sqrt{{\textstyle\sum _{i=1}^{d}}{x_{i}^{2}}}$ denotes the Euclidean norm of the vector x. We sometimes use the short-hand notation $X\sim N(\mu +\gamma z,z\Sigma )\circ G$ for (1), where $\Sigma ={A^{T}}A$. $\mathbb{R}$ denotes the set of real numbers and ${\mathbb{R}_{+}}=[0,+\infty )$ denotes the set of nonnegative real numbers. Following the notations of [13], $\mathcal{J}$ denotes the family of infinitely divisible random variables on ${\mathbb{R}_{+}}$, $\mathcal{S}$ denotes the set of self-decomposable random variables on ${\mathbb{R}_{+}}$, and $\mathcal{G}$ denotes the class of generalized gamma convolutions (GGCs) on ${\mathbb{R}_{+}}$ that will be introduced later. The Laplace transformation of any distribution G is denoted by ${\mathcal{L}_{G}}(s)=\textstyle\int {e^{-sy}}G(dy)$. A gamma random variable with density function $f(x)=\frac{1}{\Gamma (\alpha ){\beta ^{\alpha }}}{x^{\alpha -1}}{e^{-x/\beta }}$ is denoted by $G=G(\alpha ,\beta )$.

A prominent example of the NMVM models is generalized hyperbolic (GH) distributions, where the mixing distribution Z follows a generalized inverse Gaussian (GIG) distribution denoted as $\mathit{GIG}(\lambda ,a,b)$. The probability density function of a GIG distribution, denoted by ${f_{\mathit{GIG}}}(\lambda ,a,b)$, takes the form

(2)

\[ {f_{\mathit{GIG}}}(x;\lambda ,a,b)={\bigg(\frac{b}{a}\bigg)^{\lambda }}\frac{1}{{K_{\lambda }}(ab)}{x^{\lambda -1}}{e^{-\frac{1}{2}({a^{2}}{x^{-1}}+{b^{2}}x)}}{1_{(0,+\infty )}}(x),\]

where ${K_{\lambda }}(x)$ denotes the modified Bessel function of third kind with index λ and the allowed parameter ranges for λ, a, b in (2) are (i) $a\ge 0$, $b\gt 0$ if $\lambda \gt 0$, (ii) $a\gt 0$, $b\ge 0$ if $\lambda \lt 0$, (iii) $a\gt 0$, $b\gt 0$ if $\lambda =0$. Here the case $a=0$ in (i) or the case $b=0$ in (ii) above need to be understood in limiting cases of (2) and in these special cases we have

(3)

\[ \begin{aligned}{}{f_{\mathit{GIG}}}(x;\lambda ,0,b)& ={\bigg(\frac{{b^{2}}}{2}\bigg)^{\lambda }}\frac{{x^{\lambda -1}}}{\Gamma (\lambda )}{e^{-\frac{{b^{2}}}{2}x}}{1_{(0,+\infty )}}(x),\hspace{1em}\lambda \gt 0,\\ {} {f_{\mathit{GIG}}}(x;\lambda ,a,0)& ={\bigg(\frac{2}{{a^{2}}}\bigg)^{\lambda }}\frac{{x^{\lambda -1}}}{\Gamma (-\lambda )}{e^{-\frac{{a^{2}}}{2x}}}{1_{(0,+\infty )}}(x),\hspace{1em}\lambda \lt 0,\end{aligned}\]

where $\Gamma (x)$ denotes the Gamma function. Here ${f_{\mathit{GIG}}}(x;\lambda ,0,b)$ is the density function of a Gamma distribution $G(\lambda ,\frac{2}{{b^{2}}})$ and ${f_{\mathit{GIG}}}(x;\lambda ,a,0)$ is the density function of an inverse Gamma distribution $\mathit{iG}(\lambda ,\frac{{a^{2}}}{2})$.

The GH distribution in dimension d is denoted by ${\mathit{GH}_{d}}(\lambda ,\alpha ,\beta ,\delta ,\mu ,\Sigma )$ and it satisfies ${\mathit{GH}_{d}}(\lambda ,\alpha ,\beta ,\delta ,\mu ,\Sigma )\sim N(\mu +z\Sigma \beta ,z\Sigma )\circ \mathit{GIG}(\lambda ,\delta ,\sqrt{{\alpha ^{2}}-{\beta ^{T}}\Sigma \beta })$. The parameter ranges of this distribution are $\lambda \in \mathbb{R}$, $\alpha ,\delta \in {\mathbb{R}_{+}}$, $\beta ,\mu \in {\mathbb{R}^{d}}$ and (i′) $\delta \ge 0$, $0\le \sqrt{{\beta ^{T}}\Sigma \beta }\lt \alpha $ if $\lambda \gt 0$, (ii′) $\delta \gt 0$, $0\le \sqrt{{\beta ^{T}}\Sigma \beta }\lt \alpha $ if $\lambda =0$, (iii′) $\delta \gt 0$, $0\le \sqrt{{\beta ^{T}}\Sigma \beta }\le \alpha $ if $\lambda \lt 0$. The class of GH distributions includes two popular models in finance: if $\lambda =-\frac{1}{2}$ we have a normal inverse Gaussian distribution which is denoted by ${\mathit{NIG}_{d}}(\alpha ,\beta ,\delta ,\mu ,\Sigma )$, and when $\lambda =\frac{1+d}{2}$ we have the class of hyperbolic distributions denoted by ${\mathit{HYP}_{d}}(\alpha ,\beta ,\delta ,\mu ,\Sigma )$. As in the case of the GIG distributions, the case $\delta =0$ in (i′) above and the case $\sqrt{{\beta ^{T}}\Sigma \beta }=\alpha $ or $\alpha =0$ in (iii′) above need to be understood as limiting cases of the GH distributions. If $\lambda \gt 0$, $\delta \to 0$ in case (i′) above then

(4)

\[\begin{aligned}{}{\mathit{GH}_{d}}(\lambda ,\alpha ,\beta ,\delta ,\mu ,\Sigma )& \stackrel{w}{\to }{N_{d}}(\mu +z\Sigma \beta ,z\Sigma )\circ G\bigg(\lambda ,\frac{{\alpha ^{2}}-{\beta ^{T}}\Sigma \beta }{2}\bigg)\\ {} & =:V{G_{d}}(\lambda ,\alpha ,\beta ,\mu ,\Sigma ),\end{aligned}\]

where $\stackrel{w}{=}$ denotes weak convergence of distributions and ${\mathit{VG}_{d}}$ represents the class of variance gamma distributions. If $\lambda \lt 0$ and $\alpha \to 0$ as well as $\beta \to 0$ in case (iii′) above we have the shifted t distributions with degrees of freedom $-2\lambda $

(5)

\[ {\mathit{GH}_{d}}(\lambda ,\alpha ,\beta ,\delta ,\mu ,\Sigma )\stackrel{w}{\to }N(\mu ,z\Sigma )\circ \mathit{iG}\bigg(\lambda ,\frac{{\delta ^{2}}}{2}\bigg)=:{t_{d}}(\lambda ,\delta ,\mu ,\Sigma ).\]

If $\alpha \to \infty $, $\delta \to \infty $ and $\frac{\delta }{\alpha }\to {\sigma ^{2}}\lt \infty $, we have the following relation that shows that the normal random vectors are limiting cases of the GH distributions,

(6)

\[ {\mathit{GH}_{d}}(\lambda ,\alpha ,\beta ,\delta ,\mu ,\Sigma )\stackrel{w}{\to }N(\mu +z\Sigma \beta ,z\Sigma )\circ {\epsilon _{{\sigma ^{2}}}}=:N\big(\mu +{\sigma ^{2}}\Sigma \beta ,{\sigma ^{2}}\Sigma \big),\]

where ${\epsilon _{{\sigma ^{2}}}}$ is the Dirac function that equals to 1 when $z={\sigma ^{2}}$ and equals to zero otherwise, see Chapter 2 of [10] for the details. All of normal inverse Gaussian, hyperbolic, variance gamma, and Student t distributions are very popular models in finance, see [12], [1], [3], [8], [11], [21], [20], [14], [22] for this.

The class of GIG distributions belongs to the class of GGCs. A positive random variable Z is a GGC, without translation term, if there exists a positive Radon measure ν on ${\mathbb{R}_{+}}$ such that

(7)

\[ {\mathcal{L}_{Z}}(s)=E{e^{-sZ}}={e^{-{\textstyle\textstyle\int _{0}^{\infty }}\ln (1+\frac{s}{z})\nu (dz)}},\]

with

(8)

\[ {\int _{0}^{1}}|lnx|\nu (dx)\lt \infty ,\hspace{2em}{\int _{1}^{\infty }}\frac{1}{x}\nu (dx)\lt \infty .\]

The measure ν is called Thorin’s measure associated with Z. For the definition of the GGCs, see the survey paper [13]. In Proposition 1.1 of [13], it was shown that any GGC random variable can be written as the Wiener-Gamma integral

(9)

\[ Z={\int _{0}^{\infty }}h(s)d{\gamma _{s}},\]

where $h(s):{\mathbb{R}_{+}}\to {\mathbb{R}_{+}}$ is a deterministic function with ${\textstyle\int _{0}^{\infty }}\mathrm{ln}(1+h(s))ds\lt \infty $ and $\{{\gamma _{s}}\}$ is a standard Gamma process with the Lévy measure ${e^{-x}}\frac{dx}{x},\hspace{2.5pt}x\gt 0$.

Proposition 1.23 of [10] shows that the class of GIG random variables belongs to the class GGC. It provides the description of the corresponding Thorin’s measures (in terms of the functions ${U_{\mathit{GIG}}}$ in the proposition) for all the cases of parameters of GIG. The class of GGC distributions is rich as stated in the introduction of [13] and we have the relation $\mathcal{G}\subset \mathcal{S}\subset \mathcal{J}$. In our model (1) the mixing distribution Z can be any distribution in $\mathcal{J}$. In fact, Z can be any nonnegative random variable.

Given an initial endowment ${W_{0}}\gt 0$, the investor must determine the portfolio weights x on the d risky assets to maximize the expected utility of the next period wealth. The wealth that corresponds to the portfolio weight x on the risky assets is given by

(10)

\[\begin{aligned}{}W(x)=& {W_{0}}\big[1+\big(1-{x^{T}}1\big){r_{f}}+{x^{T}}X\big]\\ {} =& {W_{0}}(1+{r_{f}})+{W_{0}}\big[{x^{T}}(X-\mathbf{1}{r_{f}})\big]\end{aligned}\]

and the investor’s problem is

(11)

\[ \underset{x\in D}{\max }\hspace{0.2778em}EU\big(W(x)\big),\]

for some domain D of the portfolio set D. Note here that x represents the portfolio weights on the risky assets and $1-{x^{T}}\mathbf{1}$ is the proportion of the initial wealth invested on the risk-free asset. The portfolio weights x on risky assets are allowed to be any vector in D.

The main goal of this paper is to discuss the solution to the problem (11) for an exponential utility function U when the returns of the risky assets have an NMVM distribution as in (1). This type of utility maximization problems in one period models were studied in many papers in the past, see [17], [18], [15], [29], [2]. Especially, the recent paper [3] made an interesting observation that, with generalized hyperbolic models and with exponential utility, the optimal portfolios of the corresponding expected utility maximization problems can be written as a sum of two portfolios that are determined by the location and skewness parameters of the model (1) separately. The present paper extends their result to a more general class of NMVM models as a compliment.

The paper is organized as follows. In Section 2 below we present a closed-form solution for an optimal portfolio when the utility function U is exponential. In Section 3 we show that the optimal expected utilities in small financial markets converge to an overall best-expected utility in a large financial market. In Section 4 we present examples as applications of our results.

2 Closed-form solution for optimal portfolios under an exponential utility

In this section, we study the solution to the problem (11) when the utility function of the investor is exponential,

(12)

\[ U(W)=-{e^{-aW}},\hspace{1em}a\gt 0,\]

and when the investment opportunity set consists of the above-stated $d+1$ assets. Below we obtain an expression that relates $EU(W)$ to the Laplace transformation of the mixing distribution Z as in (14) below. First, observe that we have

(13)

\[ W(x)\stackrel{d}{=}{W_{0}}(1+{r_{f}})+{W_{0}}\big[{x^{T}}(\mu -\mathbf{1}{r_{f}})+{x^{T}}\gamma Z+\sqrt{{x^{T}}\Sigma x}\sqrt{Z}N(0,1)\big].\]

Lemma 2.1.

For any portfolio $x\in {\mathbb{R}^{d}}$ such that $EU(W(x))$ is finite, we have

(14)

\[ EU\big(W(x)\big)=-{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}\bigg(a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x\bigg),\]

where ${\mathcal{L}_{Z}}(s)=E{e^{-sZ}}$ is the Laplace transformation of Z.

Proof.

From (13), we have

\[\begin{aligned}{}EU\big(W(x)\big)=& -E{e^{-a{W_{0}}(1+{r_{f}})-a{W_{0}}[{x^{T}}(\mu -\mathbf{1}{r_{f}})+{x^{T}}\gamma Z+\sqrt{{x^{T}}\Sigma x}\sqrt{Z}N(0,1)]}}\\ {} =& -{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}\\ {} & \times {\int _{0}^{+\infty }}E{e^{-a{W_{0}}{x^{T}}\gamma z-a{W_{0}}\sqrt{{x^{T}}\Sigma x}\sqrt{z}N(0,1)}}{f_{Z}}(z)dz\\ {} =& -{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}\\ {} & \times {\int _{0}^{+\infty }}{e^{-a{W_{0}}{x^{T}}\gamma z}}E{e^{-a{W_{0}}\sqrt{{x^{T}}\Sigma x}\sqrt{z}N(0,1)}}{f_{Z}}(z)dz\\ {} =& -{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\int _{0}^{+\infty }}{e^{-a{W_{0}}{x^{T}}\gamma z}}{e^{\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma xz}}{f_{Z}}(z)dz\\ {} =& -{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\int _{0}^{+\infty }}{e^{-(a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x)z}}{f_{Z}}(z)dz\\ {} =& -{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}\bigg(a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x\bigg).\end{aligned}\]

□

Remark 2.2.

If $\mu -\mathbf{1}{r_{f}}=0$ in our model (1), from (14) we have

\[ EU\big(W(x)\big)=-{e^{-a{W_{0}}(1+{r_{f}})}}{\mathcal{L}_{Z}}\bigg(a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x\bigg).\]

Since ${\mathcal{L}_{Z}}(s)$ is a strictly decreasing function, the expected utility maximization problem becomes the maximization problem of the quadratic function $a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x$ in this case. Especially, if the risk-free interest rate ${r_{f}}$ is zero and our model (1) is such that the location parameter μ is zero, then the utility optimizing portfolio can be found by optimizing a quadratic function. Therefore for the rest of the paper, we assume that our model (1) is such that $\mu -\mathbf{1}{r_{f}}\ne 0$. Also we assume that $Z\ne 0$ with positive probability.

Remark 2.3.

By using the relation (11) and by checking the first order condition for optimality, it is easy to see that the optimal portfolio ${x^{\mathrm{\star }}}$ satisfies the relation

(15)

\[ {x^{\mathrm{\star }}}=\frac{1}{a{W_{0}}}\bigg[{\Sigma ^{-1}}\gamma -\frac{{\mathcal{L}_{Z}}(g({x^{\mathrm{\star }}}))}{{\mathcal{L}^{\prime }_{Z}}(g({x^{\mathrm{\star }}}))}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\bigg],\]

where $g(x)$ is given in the expression (16) below. There are several questions that one needs to address when applying the direct approach (15) in obtaining the optimal portfolio ${x^{\mathrm{\star }}}$: (i) if the function $x\to EU(W(x))$ is continuously differentiable; (ii) if the optimal portfolio is the interior point of the corresponding domain; (iii) if the equation (15) has a unique solution. After these questions are addressed the next challenge becomes how to compute ${x^{\mathrm{\star }}}$ numerically. This problem is not trivial if the dimension d is a large number, i.e. $x\in {\mathbb{R}^{d}}$ for large d. To overcome these problems, in this paper we take different approach and obtain ${x^{\mathrm{\star }}}$ in near closed form: to calculate ${x^{\mathrm{\star }}}$ we only need to find the minimizing point of a convex function on the real line.

Lemma 2.1 expresses the expected utility in terms of the linear function ${x^{T}}(\mu -\mathbf{1}{r_{f}})$ and the quadratic function $a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x$ of the portfolio $x\in {\mathbb{R}^{n}}$. For convenience, we introduce the notations

(16)

\[\begin{aligned}{}g(x)=:& a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x,\\ {} G(x)=:& {e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}\bigg(a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x\bigg),\\ {} =& {e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}\big(g(x)\big).\end{aligned}\]

Then the relation (14) becomes

(17)

\[ EU(W)=-{e^{a{W_{0}}(1+{r_{f}})}}G(x)=-{e^{a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}\big(g(x)\big).\]

Therefore we have the obvious relation

(18)

\[ \arg \underset{x\in D}{\max }EU(W)=\arg \underset{x\in D}{\min }G(x)\]

for any domain $D\in {\mathbb{R}^{d}}$ of the portfolio set. Note here that the equality in (18) means the equality of two sets if there is more than one optimizing point.

Our goal in this section is to give a closed-form solution to the problem (11) for some domains of the portfolio set. Before we start our analysis, we first present the following example.

Example 2.4.

Consider the model (1) with $\gamma =0$ and with the mixing distribution $Z\sim {e^{N(0,1)}}$. Then for any $x\ne 0$ we have

\[ EU\big(W(x)\big)=-\infty .\]

To see this, assume that there is $x\ne 0$ such that $EU(W(x))$ is finite. Then by Lemma 2.1 we have

\[ EU\big(W(x)\big)=-{e^{-a{W_{0}}(1+{r_{f}})}}{e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}\bigg(-\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x\bigg).\]

For any $x\ne 0$ we have ${x^{T}}\Sigma x\gt 0$ as Σ is positive definite by the assumption of the model (1). Now it is well known that when $Z\sim {e^{N(0,1)}}$ we have ${\mathcal{L}_{Z}}(s)=+\infty $ whenever $s\lt 0$. Therefore ${\mathcal{L}_{Z}}(-\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x)=+\infty $ whenever $x\ne 0$ and this contradicts the finiteness assumption of $EU(W(x))$ made above. Thus we have $EU(W(x))=-\infty $ whenever $x\ne 0$. Therefore the problem (11) does not have a solution when the domain D does not include the zero vector. But if $0\in D$, then $x=0$ is the optimal portfolio and ${\max _{x\in D}}\hspace{0.2778em}EU(W(x))=-{e^{-a{W_{0}}(1+{r_{f}})}}$. This case corresponds to investing all the initial wealth ${W_{0}}$ on the risk-free asset as an optimal portfolio. We remark here that since $\gamma =0$ by Jensen’s inequality we have

\[ EU\big(W(x)\big)\le U\big(EW(x)\big)=U\big({W_{0}}(1+{r_{f}})+{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})\big).\]

From this relation it is difficult to see that 0 is the expected utility optimizing portfolio when $Z\sim {e^{N(0,1)}}$. But with the assistance of Lemma 2.1 it becomes trivial to determine that 0 is the optimal portfolio as discussed earlier.

Example 2.4 shows that when the model (1) satisfies the conditions in the example and when $0\in D$, the zero portfolio $x=0$ is an optimal portfolio as when $x\ne 0$ one has $EU(W(x))=-\infty $ always. It is obvious that, in this case, the function $x\to EU(W(x))$ is not differentiable at $x=0$. Therefore we call $x=0$ an irregular solution to the optimization problem (18). Before we give the formal definition of irregularity, we first introduce the following definition.

Definition 2.5.

For any mixing distribution Z, if ${\mathcal{L}_{Z}}(s)\lt \infty $ for all $s\in \mathbb{R}$, we set $\hat{s}=-\infty $ and if ${\mathcal{L}_{Z}}(s)\lt \infty $ for some $s\in \mathbb{R}$ and ${\mathcal{L}_{Z}}(s)=+\infty $ for some $s\in \mathbb{R}$, we let $\hat{s}$ be the real number such that

(19)

\[ {\mathcal{L}_{Z}}(s)=E{e^{-sZ}}\lt \infty ,\hspace{2.5pt}\forall s\gt \hat{s}\hspace{1em}\text{and}\hspace{1em}{\mathcal{L}_{Z}}(s)=E{e^{-sZ}}=+\infty ,\hspace{2.5pt}\forall s\lt \hat{s}.\]

We call $\hat{s}$ the critical value (CV) of Z under the Laplace transformation. We use the acronym CV-L from now on, where L means that CV is in the context of the Laplace transformation. One can also define this CV in the context of moment-generating functions and in this case an acronym CV-M can be used. Observe that since Z is a nonnegative random variable we always have $\hat{s}\le 0$.

Remark 2.6.

In Definition 2.5, the value of ${\mathcal{L}_{Z}}(s)$ at $s=\hat{s}$ is not specified. Both the cases ${\mathcal{L}_{Z}}(\hat{s})\lt \infty $ and ${\mathcal{L}_{Z}}(\hat{s})=+\infty $ are possible. For example, if $Z\sim {e^{N(0,1)}}$, then $\hat{s}=0$ and clearly ${\mathcal{L}_{Z}}(0)=1\lt \infty $. If $Z\sim {x^{\alpha -1}}{e^{-x/\beta }}/[\Gamma (\alpha ){\beta ^{\alpha }}]$ is a Gamma distribution, then ${\mathcal{L}_{Z}}(s)=1/[{(1+\beta s)^{\alpha }}]$. In this case $\hat{s}=-1/\beta $ and we have ${\mathcal{L}_{Z}}(\hat{s})=+\infty $.

Below we define some domains for the portfolio set.

(20)

\[ \begin{aligned}{}{S_{a}}=:& \bigg\{x\in {\mathbb{R}^{d}}:a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x\gt \hat{s}\bigg\},\\ {} \partial {S_{a}}=:& \bigg\{x\in {\mathbb{R}^{d}}:a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x=\hat{s}\bigg\},\\ {} {\bar{S}_{a}}=:& {S_{a}}\cup \partial {S_{a}}.\end{aligned}\]

Remark 2.7.

Our main objective in this section is to find a closed-form solution for the optimal portfolio for the problem

(21)

\[ \underset{x\in {\mathbb{R}^{d}}}{\max }\hspace{0.2778em}EU\big(W(x)\big).\]

The following relations are easy to see:

(22)

\[ \underset{x\in {\mathbb{R}^{d}}}{\max }\hspace{0.2778em}EU\big(W(x)\big)=\underset{x\in {S_{a}}}{\max }EU\big(W(x)\big),\]

if ${\mathcal{L}_{Z}}(\hat{s})=+\infty $, and

(23)

\[ \underset{x\in {\mathbb{R}^{d}}}{\max }\hspace{0.2778em}EU\big(W(x)\big)=\underset{x\in {\bar{S}_{a}}}{\max }EU\big(W(x)\big),\]

if ${\mathcal{L}_{Z}}(\hat{s})\lt +\infty $. Observe here that if $\hat{s}\lt 0$, then ${S_{a}}$ is a nonempty set as the zero vector $x=0$ is in it. If $\hat{s}=0$, then the set ${\bar{S}_{a}}$ is nonempty as $x=0$ is in it.

In this section we attempt to give closed-form solutions to the problems (22) and (23) above. Our approach for this is based on the following idea: we fix the term ${x^{T}}(\mu -\mathbf{1}{r_{f}})$ at some constant level c and optimize the quadratic term $a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x$ in (14). More specifically, we solve the optimization problem

(24)

\[ \begin{aligned}{}\underset{x}{\max }\hspace{0.2778em}& a{W_{0}}{x^{T}}\gamma -\frac{{a^{2}}{W_{0}^{2}}}{2}{x^{T}}\Sigma x,\\ {} \mathrm{s}.\mathrm{t}.\hspace{2.5pt}& {x^{T}}(\mu -{r_{f}}\mathbf{1})=c\end{aligned}\]

first, and plug in the solution, which we denote by ${x_{c}}$, into the expression (14) so that the utility maximization problem becomes an optimization problem of a function of one variable c.

Lemma 2.8.

Consider the optimization problem (21). Let $\bar{x}\in {\mathbb{R}^{d}}$ be a solution to this problem. Then $\bar{x}$ solves (24) for some c.

Proof.

Define $\bar{c}=:{\bar{x}^{T}}(\mu -\mathbf{1}{r_{f}})$. Let $\tilde{x}$ be the solution to the problem (24) with c replaced by $\bar{c}$ (here the solution is unique as Σ is positive definite by assumption). By the optimality of $\tilde{x}$, we have $g(\bar{x})\le g(\tilde{x})$. Since ${\mathcal{L}_{Z}}(s)$ is a decreasing function, we have ${\mathcal{L}_{Z}}(g(\tilde{x}))\le {\mathcal{L}_{Z}}(g(\bar{x}))$. Since $\bar{c}={\bar{x}^{T}}(\mu -\mathbf{l}{r_{f}})={\tilde{x}^{T}}(\mu -\mathbf{l}{r_{f}})$, we have $G(\tilde{x})\le G(\bar{x})$. This shows that $EU(W(\tilde{x}))\ge EU(W(\bar{x}))$. But $\bar{x}$ is optimal for (11) with $D={\mathbb{R}^{d}}$. Therefore we should have $EU(W(\tilde{x}))=EU(W(\bar{x}))$. This implies $G(\tilde{x})=G(\bar{x})$ and this in turn implies $g(\bar{x})=g(\tilde{x})$ again due to $\bar{c}={\bar{x}^{T}}(\mu -\mathbf{l}{r_{f}})={\tilde{x}^{T}}(\mu -\mathbf{l}{r_{f}})$. The uniqueness of the optimization point for (24) then implies $\bar{x}=\tilde{x}$. □

Remark 2.9.

Lemma 2.8 gives a characterization of the optimal portfolios for the problem (11). But it doesn’t tell us if the optimal portfolio for the problem (2.8) is unique. It shows only that any optimal portfolio for the problem (11) solves a quadratic optimization problem (24) for some appropriate c. Now consider the case of Example 2.4. In the setting of this example, consider the utility maximization problem (11). Since $0\in {\mathbb{R}^{d}}$, as explained in Example 2.4, the vector $\hat{x}=0$ is the solution to the optimization problem (11). Now let ${x^{\mathrm{\star }}}$ be the optimal solution to the problem (24) with $c=0$ (which means ${({x^{\mathrm{\star }}})^{T}}(\mu -{r_{f}}\mathbf{1}))=0$). Then we should have $g({x^{\mathrm{\star }}})\ge g(\hat{x})$. But if $g({x^{\mathrm{\star }}})\gt g(\hat{x})$, then $\hat{x}=0$ cannot be an optimal solution to (11). Therefore we should have $g({x^{\mathrm{\star }}})=g(\hat{x})$. The uniqueness of the optimal solution to (24) with $c=0$ then implies ${x^{\mathrm{\star }}}=\hat{x}=0$.

Definition 2.10.

Consider the optimization problem (11) for some given model (1) and for some domain $D\subset {\mathbb{R}^{d}}$. Let $\hat{s}$ denote the CV-L of the mixing distribution Z. Let ${x^{\mathrm{\star }}}\in D$ be a solution to (11). We say that ${x^{\mathrm{\star }}}$ is irregular if $g({x^{\mathrm{\star }}})=\hat{s}$. If $g({x^{\mathrm{\star }}})\gt \hat{s}$, we call the solution ${x^{\mathrm{\star }}}$ regular.

Remark 2.11.

Clearly, the definition of irregular and regular solutions depends on the CV-L number $\hat{s}$ of the mixing distribution Z in (1). If ${\mathcal{L}_{Z}}(\hat{s})=+\infty $, then the solution to (11) cannot be irregular. Therefore, the irregularity can happen only when ${\mathcal{L}_{Z}}(\hat{s})\lt +\infty $. Observe that the solution $x=0$ in Example 2.4 is an irregular solution.

Remark 2.12.

Consider the optimization problem (11). From Lemma 2.8, any optimal portfolio ${x^{\mathrm{\star }}}$ is a solution to the quadratic optimization problem (24) with ${x^{T}}(\mu -{r_{f}}\mathbf{1})={c^{\mathrm{\star }}}$ for some fixed ${c^{\mathrm{\star }}}$. If ${x^{\mathrm{\star }}}$ is irregular, then $g({x^{\mathrm{\star }}})=\hat{s}$. The optimality and uniqueness (on the hyperplane ${x^{T}}(\mu -{r_{f}}\mathbf{1})={c^{\mathrm{\star }}}$) of ${x^{\mathrm{\star }}}$ implies that we have $g(x)\lt g({x^{\mathrm{\star }}})=\hat{s}$ for all $x\ne {x^{\mathrm{\star }}}$ on the hyperplane ${x^{T}}(\mu -{r_{f}}\mathbf{1})={c^{\mathrm{\star }}}$. Therefore we have $EU(W(x))=-\infty $ for all $x\ne {x^{\mathrm{\star }}}$ on the hyperplane ${x^{T}}(\mu -{r_{f}}\mathbf{1})={c^{\mathrm{\star }}}$. From this we conclude that if the optimal portfolio for the problem (24) is irregular, then any small neighborhood of this portfolio contains some portfolios with infinite expected utility. In comparison, if the optimal portfolio is regular, then it has a small ball around it with finite expected value for each portfolio in this small ball.

As it was shown in Lemma 2.8, the solutions to the utility maximization problem (11) can be obtained by solving the quadratic optimization problem (24). For a given optimization problem (11), if we know the corresponding c in (24) such that the solution to (24) is the solution to (11), then we just need to solve the optimization problem (24) to obtain the optimal portfolio. But figuring out such an c is not a trivial issue. We first prove following lemma.

Lemma 2.13.

For any real number c, when ${x^{T}}(\mu -\mathbf{1}{r_{f}})=c$, the maximizing point ${x_{c}}$ of $g(x)$ is given by

(25)

\[ {x_{c}}=\frac{1}{a{W_{0}}}\big[{\Sigma ^{-1}}\gamma -{q_{c}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big],\]

and we have

(26)

\[ g({x_{c}})=\frac{1}{2}{\gamma ^{T}}{\Sigma ^{-1}}\gamma -\frac{{q_{c}^{2}}}{2}{(\mu -\mathbf{1}{r_{f}})^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}}),\]

where

(27)

\[ {q_{c}}=\frac{{\gamma ^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})-a{W_{0}}c}{{(\mu -\mathbf{1}{r_{f}})^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})}.\]

Proof.

We form the Lagrangian $L=g(x)+\lambda (c-{x^{T}}(\mu -\mathbf{1}{r_{f}}))$ with the Lagrangian parameter λ. Denoting the maximizing point by ${x_{c}}$, the first order condition gives

(28)

\[ {x_{c}}=\frac{1}{a{W_{0}}}{\Sigma ^{-1}}\gamma -\frac{\lambda }{{a^{2}}{W_{0}^{2}}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}}).\]

We plug ${x_{c}}$ into ${x_{c}^{T}}(\mu -\mathbf{1}{r_{f}})=c$ and obtain

(29)

\[ c=\frac{1}{a{W_{0}}}{\gamma ^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})-\frac{\lambda }{{a^{2}}{W_{0}^{2}}}{(\mu -\mathbf{1}{r_{f}})^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}}).\]

From this we find λ as

(30)

\[ \lambda =\frac{a{W_{0}}{\gamma ^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})-c{a^{2}}{W_{0}^{2}}}{{(\mu -\mathbf{1}{r_{f}})^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})}.\]

Then we plug λ into the expression (28) of ${x_{c}}$ above and obtain (25). To obtain (26), we plug ${x_{c}}$ into $g(x)$ in (16). After doing some algebra, we obtain

(31)

\[ g({x_{c}})=\frac{1}{2}{\gamma ^{T}}{\Sigma ^{-1}}\gamma -\frac{1}{2}{q_{c}^{2}}{(\mu -\mathbf{1}{r_{f}})^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}}),\]

with ${q_{c}}$ given as in (27). This completes the proof. □

For the rest of the paper, as in [3], for convenience, we use the notations

(32)

\[ \mathcal{A}={\gamma ^{T}}{\Sigma ^{-1}}\gamma ,\hspace{1em}\mathcal{C}={(\mu -\mathbf{1}{r_{f}})^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}}),\hspace{1em}\mathcal{B}={\gamma ^{T}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}}).\]

We first observe that $\mathcal{C}\gt 0$ due to the assumption in Remark 2.2 and the assumption on positive definiteness of Σ. With these notations we have

(33)

\[ g({x_{c}})=\frac{\mathcal{A}}{2}-\frac{{q_{c}^{2}}}{2}\mathcal{C},\hspace{2em}{q_{c}}=\frac{\mathcal{B}}{\mathcal{C}}-\frac{a{W_{0}}}{\mathcal{C}}c.\]

From the relation (33), we express c as a function of ${q_{c}}$ as

(34)

\[ c=\frac{1}{a{W_{0}}}[\mathcal{B}-\mathcal{C}{q_{c}}].\]

We define the function

(35)

\[ Q(\theta )={e^{\mathcal{C}\theta }}{\mathcal{L}_{Z}}\bigg[\frac{1}{2}\mathcal{A}-\frac{{\theta ^{2}}}{2}\mathcal{C}\bigg],\]

and we define $\hat{\theta }=:\sqrt{\frac{\mathcal{A}-2\hat{s}}{\mathcal{C}}}$, where $\hat{s}$ is the IN of Z. If $\hat{s}=-\infty $, the $\hat{\theta }$ is understood to be equal to $+\infty $. Note here that $\hat{s}\le 0$ as Z is a nonnegative random variable. Therefore $\hat{\theta }$ is well defined. If ${\mathcal{L}_{Z}}(\hat{s})\lt +\infty $, $Q(\theta )$ is finite iff $\frac{1}{2}\mathcal{A}-\frac{{\theta ^{2}}}{2}\mathcal{C}\ge \hat{s}$ and this translates into: $Q(\theta )$ is finite iff $\theta \in [-\hat{\theta },\hat{\theta }]$. If ${\mathcal{L}_{Z}}(\hat{s})=+\infty $, $Q(\theta )$ is finite iff $\frac{1}{2}\mathcal{A}-\frac{{\theta ^{2}}}{2}\mathcal{C}\gt \hat{s}$ and this translates into: $Q(\theta )$ is finite iff $\theta \in (-\hat{\theta },\hat{\theta })$.

Next we prove the following lemma that relates Q to G.

Lemma 2.14.

Let ${x_{c}}$ be the solution to the problem (24) for a given c. Assume ${x_{c}}\in {S_{a}}$ if ${\mathcal{L}_{Z}}(\hat{s})=+\infty $ and ${x_{c}}\in {\bar{S}_{a}}$ if ${\mathcal{L}_{Z}}(\hat{s})\lt +\infty $. Then, for any x with ${x^{T}}(\mu -\mathbf{1}{r_{f}})=c$, we have

(36)

\[ {e^{-\mathcal{B}}}Q({q_{c}})\le G(x),\]

where ${q_{c}}$ is given by (27) and $\mathcal{B}$ is given by (32). We also have ${e^{-\mathcal{B}}}Q({q_{c}})=G({x_{c}})$.

Proof.

Note that $G(x)={e^{-a{W_{0}}{x^{T}}(\mu -\mathbf{1}{r_{f}})}}{\mathcal{L}_{Z}}(g(x))$. The conditions stated on ${x_{c}}$ in the lemma ensure that $G({x_{c}})={e^{-a{W_{0}}c}}{\mathcal{L}_{Z}}(g({x_{c}}))$ is finite. Since $g(x)\le g({x_{c}})$ for any x with ${x^{T}}(\mu -\mathbf{1}{r_{f}})=c$ by the definition of ${x_{c}}$ (the optimizing point) and also since ${\mathcal{L}_{Z}}(s)$ is a decreasing function of s, we have

(37)

\[ G({x_{c}})\le G(x)\]

for any x with ${x^{T}}(\mu -\mathbf{1}{r_{f}})=c$. We plug c in (34) into the expression of $G({x_{c}})$ and obtain

(38)

\[ G({x_{c}})={e^{-\mathcal{B}}}{e^{\mathcal{C}{q_{c}}}}{\mathcal{L}_{Z}}\bigg[\frac{1}{2}\mathcal{A}-\frac{{q_{c}^{2}}}{2}\mathcal{C}\bigg]={e^{-\mathcal{B}}}Q({q_{c}}).\]

□

Remark 2.15.

Lemma 2.14 shows that the function $G(x)$ achieves its unique (as the solution to (24) is unique in a hyperplane) minimum value on the hyperplane ${x^{T}}(\mu -{r_{f}}\mathbf{1})=c$ at ${x_{c}}$ and its minimum value is given by ${e^{-\mathcal{B}}}Q({q_{c}})$ with ${q_{c}}$ in (33). For any ${\theta _{0}}\in [-\hat{\theta },\hat{\theta }]$, we can let ${c_{0}}$ be such that ${q_{{c_{0}}}}={\theta _{0}}$. Let ${x_{0}}$ be the optimal solution to (24) with c replaced by ${c_{0}}$. From Lemma 2.13, we have $g({x_{0}})=\frac{1}{2}\mathcal{A}-\frac{{q_{{c_{0}}}^{2}}}{2}\mathcal{C}$. If $|{q_{{c_{0}}}}|=\hat{\theta }$, then $g({x_{0}})=\hat{s}$. If $|{q_{{c_{0}}}}|\lt \hat{\theta }$, then $g({x_{0}})\gt \hat{s}$.

Theorem 2.16.

Consider the optimization problem (21). A portfolio ${x^{\mathrm{\star }}}$ is a solution to (21) if and only if

(39)

\[ {x^{\mathrm{\star }}}=\frac{1}{a{W_{0}}}\big[{\Sigma ^{-1}}\gamma -{q_{\mathit{min}}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big]\]

for some

(40)

\[ {q_{\mathit{min}}}\in \arg \underset{\theta \in \Theta }{\min }Q(\theta ),\]

where $\Theta =[-\hat{\theta },\hat{\theta }]$ if $\hat{\theta }=\sqrt{\frac{\mathcal{A}-2\hat{s}}{\mathcal{C}}}\lt \infty $ and $\Theta =(-\infty ,+\infty )$ if $\hat{\theta }=+\infty $. Here $\hat{s}$ is the CV-L of the mixing distribution Z.

Proof.

First we show that if $\hat{x}$ is a solution to (21), then $\hat{x}$ is given by (39). By Lemma 2.8, $\hat{x}$ is a solution to the optimization problem (24) with some $c=\hat{c}$. By Lemma 2.13, $\hat{x}$ takes the form

\[ \hat{x}=\frac{1}{a{W_{0}}}\big[{\Sigma ^{-1}}\gamma -\hat{q}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big],\]

with $\hat{q}=\mathcal{B}/\mathcal{C}-(a{W_{0}}/\mathcal{C})\hat{c}$. Again by Lemma 2.13 we have (see (33))

\[ g(\hat{x})=\frac{\mathcal{A}}{2}-\frac{{(\hat{q})^{2}}}{2}\mathcal{C}.\]

Since $\hat{x}$ is a solution to (21) we have $G(\hat{x})\lt \infty $ and this implies $g(\hat{x})\ge \hat{s}$ if $\hat{s}$ is finite and $g(\hat{x})\gt \hat{s}$ if $\hat{s}=-\infty $ (note that $g(\hat{x})=-\infty $ implies $G(\hat{x})=+\infty $ due to the assumption $Z\ne 0$ in Remark 2.2 and $G(\hat{x})={e^{-a{W_{0}}{\hat{x}^{T}}(\mu -{r_{f}}\mathbf{1})}}{\mathcal{L}_{Z}}(g(\hat{x}))$). The expression of $g(\hat{x})$ above then implies $\hat{q}\in \Theta $ (note here that for the case $\hat{\theta }=+\infty $, we can’t have ${\hat{q}^{2}}=+\infty $ as $g(\hat{x})$ is finite as explained above).

Now we need to show $\hat{q}\in \arg {\min _{\theta \in \Theta }}Q(\theta )$. From Lemma 2.14, we have $G(\hat{x})={e^{-\mathcal{B}}}Q(\hat{q})$. Take any ${\theta _{0}}\in \Theta $ (including the case $\Theta =(-\infty ,+\infty )$). Let ${c_{0}}$ be such that ${\theta _{0}}={q_{{c_{0}}}}$ (see Remark 2.15). Let ${x_{0}}$ be the solution to (24) with c replaced by ${c_{0}}$. By Lemma 2.13 we have $g({x_{0}})=\frac{\mathcal{A}}{2}-\frac{{({q_{{c_{0}}}})^{2}}}{2}\mathcal{C}$. Since ${\theta _{0}}={q_{{c_{0}}}}\in \Theta $, we have $g({x_{0}})\ge \hat{s}$ if $\hat{s}$ is finite and $g({x_{0}})\gt \hat{s}$ if $\hat{s}=-\infty $. Therefore, either ${x_{0}}\in {S_{a}}$ or ${x_{0}}\in {\bar{S}_{a}}$. Then by Lemma 2.14 we have $G({x_{0}})={e^{-\mathcal{B}}}Q({q_{{c_{0}}}})$. Since $\hat{x}$ is the optimal portfolio, it is the minimizing point for the function $G(x)$ (see (18) for this). Therefore we have $G(\hat{x})\le G({x_{0}})$. This implies $Q(\hat{q})\le Q({q_{{c_{0}}}})=Q({\theta _{0}})$. Since ${\theta _{0}}$ is arbitrary, we conclude that $\hat{q}\in \arg {\min _{\theta \in \Theta }}Q(\theta )$.

Next we show that any portfolio of the form (39) is an optimal portfolio for (21). Fix an arbitrary ${q_{m}}\in \arg {\min _{\theta \in \Theta }}Q(\theta )$. Then ${q_{m}}\in [-\hat{\theta },\hat{\theta }]$ if $\hat{\theta }$ is finite and ${q_{m}}\in (-\infty ,+\infty )$ if $\hat{\theta }=+\infty $. Let ${c_{m}}$ be such that ${q_{m}}={q_{{c_{m}}}}$ and let ${x_{m}}$ be the solution to (24) with c replaced by ${c_{m}}$. By Lemma 2.13, we have

\[ {x_{m}}=\frac{1}{a{W_{0}}}\big[{\Sigma ^{-1}}\gamma -{q_{m}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big],\]

and $g({x_{m}})=\frac{\mathcal{A}}{2}-\frac{{q_{m}^{2}}}{2}\mathcal{C}$. The condition on ${q_{m}}$ above implies $g({x_{m}})\ge \hat{s}$ if $\hat{s}$ is finite and $g({x_{m}})\gt -\infty $ if $\hat{s}=-\infty $. Therefore, either ${x_{m}}\in {S_{a}}$ or ${x_{m}}\in {\bar{S}_{a}}$. By Lemma 2.14 we have $G({x_{m}})={e^{-\mathcal{B}}}Q({q_{m}})$ which is a finite number. To show ${x_{m}}$ is an optimal portfolio we need to show $G({x_{m}})\le G(x)$ for any x that $G(x)$ is finite (note that either $G(x)=+\infty $ or it is finite). Fix an arbitrary $\bar{x}$ with $G(\bar{x})\lt +\infty $. Let $\bar{c}={\bar{x}^{T}}(\mu -{r_{f}}\mathbf{1})$. Let ${x_{\bar{c}}}$ be the solution to (24) with c replaced by $\bar{c}$. Since $G(x)\lt \infty $, we either have $x\in {\bar{S}_{a}}$ or $x\in {S_{a}}$. This means that ${x_{\bar{c}}}\in {\bar{S}_{a}}$. By Lemma 2.13 we have $g({x_{\bar{c}}})=\frac{\mathcal{A}}{2}-\frac{{q_{\bar{c}}^{2}}}{2}\mathcal{C}$, where ${q_{\bar{c}}}$ is given by (33) with c replaced by $\bar{c}$. Therefore, we have ${q_{\bar{c}}}\in [-\hat{\theta },\hat{\theta }]$ if $\hat{\theta }$ is finite and ${q_{\bar{c}}}\in (-\infty ,+\infty )$ if $\hat{\theta }=+\infty $. By the definition of ${q_{m}}$, we have $Q({q_{m}})\le Q({q_{\bar{c}}})$. Therefore, we have $G({x_{m}})={e^{-\mathcal{B}}}Q({q_{m}})\le {e^{-\mathcal{B}}}Q({q_{\bar{c}}})=G(\bar{x})$. □

Proposition 2.17.

Consider the optimization problem (21). If ${x^{\mathrm{\star }}}$ is a regular solution to (21) then

(41)

\[ {x^{\mathrm{\star }}}=\frac{1}{a{W_{0}}}\big[{\Sigma ^{-1}}\gamma -{q_{\mathit{min}}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big],\]

for some

(42)

\[ {q_{min}}\in \arg \underset{\theta \in (-\hat{\theta },\hat{\theta })}{\min }Q(\theta ),\]

where $\hat{\theta }=:\sqrt{\frac{\mathcal{A}-2\hat{s}}{\mathcal{C}}}$ and $\hat{s}$ is the CV-L of the mixing distribution Z.

Proof.

Let $\hat{x}$ be a regular solution. By Lemma 2.8, $\hat{x}$ is a solution to the optimization problem (24) with some $c=\hat{c}$. By Lemma 2.13, $\hat{x}$ takes the form

\[ \hat{x}=\frac{1}{a{W_{0}}}\big[{\Sigma ^{-1}}\gamma -\hat{q}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big]\]

with $\hat{q}=\mathcal{B}/\mathcal{C}-(a{W_{0}}/\mathcal{C})\hat{c}$. Again by Lemma 2.13 we have (see (33))

\[ g(\hat{x})=\frac{\mathcal{A}}{2}-\frac{{(\hat{q})^{2}}}{2}\mathcal{C}.\]

Since $\hat{x}$ is regular, we have $g(\hat{x})\gt \hat{s}$. From this we conclude $\hat{q}\in (-\hat{\theta },\hat{\theta })$. From Lemma 2.14, we have $G(\hat{x})={e^{-\mathcal{B}}}Q(\hat{q})$. Note that $\hat{q}={q_{\hat{c}}}$. Now we show that $\hat{q}\in \arg {\min _{\theta \in (-\hat{\theta },\hat{\theta })}}Q(\theta )$. Take any ${\theta _{0}}\in (-\hat{\theta },\hat{\theta })$. Let ${c_{0}}$ be such that ${\theta _{0}}={q_{{c_{0}}}}$ (see Remark 2.15). Let ${x_{0}}$ be the solution to (24) with c replaced by ${c_{0}}$. By Lemma 2.13 we have $g({x_{0}})=\frac{\mathcal{A}}{2}-\frac{{({q_{{c_{0}}}})^{2}}}{2}\mathcal{C}$. Since ${\theta _{0}}={q_{{c_{0}}}}\in (-\hat{\theta },\hat{\theta })$, we have $g({x_{0}})\gt \hat{s}$. Therefore ${x_{0}}\in {S_{a}}$. Then by Lemma 2.14 we have $G({x_{0}})={e^{-\mathcal{B}}}Q({q_{{c_{0}}}})$. Since $\hat{x}$ is the optimal portfolio, it is the minimizing point for the function $G(x)$ (see (18) for this). Therefore, we have $G(\hat{x})\le G({x_{0}})$. This implies $Q(\hat{q})\le Q({q_{{c_{0}}}})=Q({\theta _{0}})$. Since ${\theta _{0}}$ is arbitrary, we conclude that $\hat{q}\in \arg {\min _{\theta \in (-\hat{\theta },\hat{\theta })}}Q(\theta )$. □

Remark 2.18.

Let us look at the case of Example 2.4. From the analysis in this example the optimal solution to the problem (21) is ${x^{\mathrm{\star }}}=0$ and it is unique. Here we would like to check that this optimal portfolio ${x^{\mathrm{\star }}}=0$ can also be derived from (39). To see this, note that in this example $\gamma =0$. Therefore we have $Q(\theta )={e^{\mathcal{C}\theta }}{\mathcal{L}_{Z}}(-\frac{{\theta ^{2}}}{2}\mathcal{C})$ and ${q_{c}}=-\frac{a{W_{0}}}{\mathcal{C}}c$. Observe that $0\in \{{x^{T}}(\mu -\mathbf{1}{r_{f}}):x\in {\mathbb{R}^{n}}\}$. Also for any $\theta \ne 0$ we have $Q(\theta )=+\infty $ as the CV-L of $Z\sim {e^{N(0,1)}}$ is $\hat{s}=0$. Therefore $\arg {\min _{\theta \in \Theta }}Q(\theta )$ has only one element ${q_{\mathit{min}}}=0$. Then (39) gives ${\bar{x}^{\mathrm{\star }}}=0$ as the only optimal solution. Observe that in fact in this example we have $\mathcal{A}=0$ and therefore $\hat{\theta }=0$. Thus ${\bar{q}_{\mathit{min}}}=\arg {\min _{\theta \in \{0\}}}Q(\theta )=0$.

Remark 2.19.

We remark here that our closed-form formula (39) expresses the optimal portfolio in terms of the critical value (see Definition 2.10) of the mixing distribution Z and its Laplace transformation which is hidden in the function $Q(\theta )$. This has some advantage in determining the optimal portfolio for some cases of models (1), see our Corollary 4.5 below for this.

3 Large financial markets

In the previous section we gave a closed-form solution for the optimal portfolio for an exponential utility maximizer in a market that contains one risk-free asset and finitely many risky assets with return vector that follow (1). Our Theorem 2.16 gives the complete characterization of the optimal portfolio in such small markets.

The next natural question to ask is what happens if the consumer with an exponential utility wants to increase her expected utility as much as possible by adding as many as necessary assets into her portfolio. We can best investigate this possibility by working in mathematical models with countably infinitely many assets.

In this section we consider a sequence of economies with increasing number of assets. In the nth economy, there are n risky assets and one riskless asset. The return vector of the risky assets in the nth economy satisfies (1). A consumer with an exponential utility maximizes her expected utility based on the $n+1$ assets in each nth economy. Our main concern in this section is to investigate if the optimal expected utility of the consumer converges to a limit as $n\to \infty $, and we would like to identify this limit as the optimizer in the market with infinitely many assets.

Such “stability” of optimal investment problems was proved in [7] for a wide range of models. The methods of [7], however, cannot deal with exponential utilities. So we need to apply somewhat different, new arguments.

Our main result in this section shows that the consumer can achieve the maximum possible (in a market where she can trade on countably infinitely many risky assets) expected utility by following the sequence of optimal trading strategies in each nth economy, which are shown to converge to a limit (see our Lemma 3.6 below). We call this limit portfolio the “overall best optimal portfolio” in this paper.

An economy that allows to trade on countably infinitely many risky assets is called a large financial market in the literature. They serve well to describe, e.g., bonds of various maturities. The first model of this type, the “Arbitrage Pricing Model” (APM), goes back to [26]. We consider a slight extension of that model in the present section. As the main result of this section, we will show that the exponential utiliy maximization problem in a large financial market can be approximated by similar problems for finitely many assets (and the latter can be solved by the results of the previous sections).

Before we state and prove our main result of this section, we first specify the structure of our nth economy for all n. The return on the bank account is ${R_{0}}:={r_{f}}$ where ${r_{f}}\ge 0$ is the risk-free interest rate. For simplicity we assume ${r_{f}}=0$ henceforth. For $i=1$, ${R_{1}}:={\gamma _{1}}Z+{\mu _{1}}+{\bar{\beta }_{1}}\sqrt{Z}{\varepsilon _{1}}$ is the return on the “market portfolio”, which may be thought of as an investment into an index. For $i\ge 2$, let the return on risky asset i be given by

(43)

\[ {R_{i}}={\gamma _{i}}Z+{\mu _{i}}+{\beta _{i}}\sqrt{Z}{\varepsilon _{1}}+{\bar{\beta }_{i}}\sqrt{Z}{\varepsilon _{i}}.\]

Here ${({\varepsilon _{i}})_{i\ge 1}}$ are assumed to be independent standard Gaussian variables, Z is a positive random variable, independent of ${\varepsilon _{i}}$, ${\beta _{i}}$, $i\ge 2$, ${\bar{\beta }_{i}}\ne 0,{\gamma _{i}},{\mu _{i}}$, $i\ge 1$ are constants. The classical APM corresponds to $Z\equiv 1$. We refer to [26] for further discussions on that model.

We consider investment strategies in finite market segments. A strategy investing in the first n assets is a sequence of numbers ${\phi _{0}},{\phi _{1}},\dots ,{\phi _{n}}$. For simplicity, we assume 0 to be initial capital and also that every asset has price 1 at time 0. Self-financing imposes ${\textstyle\sum _{i=0}^{n}}{\phi _{i}}=0$, so a strategy is, in fact, described by ${\phi _{1}},\dots ,{\phi _{n}}$ which can be arbitrary real numbers. The return on the portfolio ϕ is thus

\[ V(\phi )={\sum \limits_{i=1}^{n}}{\phi _{i}}{R_{i}},\]

noting also that ${R_{0}}=0$ is assumed.

For a utility maximization problem to be well-posed, one should assume a certain arbitrage-free property for the market. Notice that a probability ${Q_{n}}\sim P$ is a martingale measure for the first n assets (that is, ${E_{{Q_{n}}}}[{R_{i}}]=0$ for all $1\le i\le n$) provided that

(44)

\[ {E_{{Q_{n}}}}[{\varepsilon _{1}}|Z=z]={b_{1}}(z):=-\frac{{\gamma _{1}}\sqrt{z}}{{\bar{\beta }_{1}}}-\frac{{\mu _{1}}}{\sqrt{z}{\bar{\beta }_{1}}},\hspace{1em}z\in (0,\infty ),\]

and, for each $i\ge 2$,

(45)

\[ {E_{{Q_{n}}}}[{\varepsilon _{i}}|Z=z]={b_{i}}(z):=-\frac{{\gamma _{i}}\sqrt{z}}{{\bar{\beta }_{i}}}-\frac{{\mu _{i}}}{\sqrt{z}{\bar{\beta }_{i}}}-\frac{{\beta _{i}}{b_{1}}(z)\sqrt{z}}{{\bar{\beta }_{i}}},\hspace{1em}z\in (0,\infty ).\]

Now notice that, in fact, the set of such $V(\phi )$ coincides with the set of

\[ V(h):={\sum \limits_{i=1}^{n}}{h_{i}}\sqrt{Z}\big({\varepsilon _{i}}-{b_{i}}(Z)\big)\]

where ${h_{1}},\dots ,{h_{n}}$ are arbitrary real numbers. We denote by ${H_{n}}$ the set of all n-tuples $({h_{1}},\dots ,{h_{n}})$. It is more convenient to use this “h-parametrization” in the sequel.

Assumption 3.1.

There are finite real numbers $0\lt c\lt C$, such that $c\le Z\le C$.

Let us define ${d_{i}}:={\sup _{z\in [c,C]}}|{b_{i}}(z)|$, $i\ge 1$. The next assumption is similar in spirit to the no-arbitrage condition derived in [26], see also [25].

Assumption 3.2.

We stipulate ${\textstyle\sum _{i=1}^{\infty }}{d_{i}^{2}}\lt \infty $.

Fact. If X is standard normal then $E[{e^{-\theta X-{\theta ^{2}}/2}}]=1$ and $E[X{e^{-\theta X-{\theta ^{2}}/2}}]=\theta $, for all $\theta \in \mathbb{R}$. Notice also that, for all $p\ge 1$,

(46)

\[ E\big[{e^{-p\theta X-p{\theta ^{2}}/2}}\big]={e^{({p^{2}}-p){\theta ^{2}}/2}}.\]

Let us now define

\[ {f_{n}}(z):=\exp \Bigg(-{\sum \limits_{i=1}^{n}}\big[{b_{i}}(z){\varepsilon _{i}}+{b_{i}}{(z)^{2}}\big]\Bigg).\]

Clearly, $E[{f_{n}}(z)]=1$ and $E[{f_{n}}(z){\varepsilon _{i}}]={b_{i}}(z)$ for $i=1,\dots ,n$. Then ${Q_{n}}$ defined by $d{Q_{n}}/dP:={f_{n}}(Z)$ will be a martingale measure for the first n assets. Indeed,

\[ E\big[{f_{n}}(Z)\big]={\int _{[c,C]}}E\big[{f_{n}}(z)\big]\hspace{0.1667em}\mathrm{Law}(Z)(dz)=1\]

and

\[ E\big[{f_{n}}(Z){\varepsilon _{i}}|Z=z\big]=E\big[\big({\varepsilon _{i}}-{b_{i}}(z)\big){e^{-{b_{i}}(z){\varepsilon _{i}}-{b_{i}}{(z)^{2}}/2}}\big]=0,\hspace{1em}1\le i\le n.\]

It follows from (46) and from Assumption 3.2 that ${\sup _{n}}E[{(d{Q_{n}}/dP)^{2}}]\lt \infty $ hence $dQ/dP:={\lim \nolimits_{n\to \infty }}d{Q_{n}}/dP$ exists almost surely and in ${L^{2}}$, and this is a martingale measure for all the assets, that is, ${E_{Q}}[{R_{i}}]=0$ for all $i\ge 1$. Note also that $E[{(dQ/dP)^{2}}]\lt \infty $.

Using the previous sections, we may find ${h_{n}^{\ast }}\in {H_{n}}$ such that

\[ {U_{n}}:=E\big[{e^{-V({h_{n}^{\ast }})}}\big]=\underset{h\in {H_{n}}}{\min }E\big[{e^{-V(h)}}\big].\]

If we wish to find (asymptotically) optimal strategies for this large financial market, then we also need to verify that ${U_{n}}\to U:={\inf _{h\in {\cup _{n\ge 1}}{H_{n}}}}E[{e^{-V(h)}}]$ as $n\to \infty $.

Let us introduce

\[ {\ell _{2}}:=\Bigg\{{({h_{i}})_{i\ge 1}},\hspace{0.1667em}{h_{i}}\in \mathbb{R},\hspace{0.1667em}i\ge 1,\hspace{0.1667em}{\sum \limits_{i=1}^{\infty }}{h_{i}^{2}}\lt \infty \Bigg\}\]

which is a Hilbert space with the norm $||h|{|_{{\ell _{2}}}}:=\sqrt{{\textstyle\sum _{i=1}^{\infty }}{h_{i}^{2}}}$. We may and will identify each $({h_{1}},\dots ,{h_{n}})\in {H_{n}}$ with $({h_{1}},{h_{2}},\dots )\in {\ell _{2}}$ for all $n\ge 1$. Also define $d:=({d_{1}},{d_{2}},\dots )\in {\ell _{2}}$.

Theorem 3.3.

Under Assumptions 3.1 and 3.2, one has ${U_{n}}\to U$, $n\to \infty $.

Proof.

It follows from Lemma 3.6 below that there is ${\bar{h}^{\ast }}\in {\ell _{2}}$ such that $U=E[{e^{-V({\bar{h}^{\ast }})}}]$. Define now ${\tilde{h}_{n}}:=({\bar{h}_{1}^{\ast }},\dots ,{\bar{h}_{n}^{\ast }})\in {H_{n}}$. It is clear that ${U_{n}}\ge U$ and $E[{e^{-V({\tilde{h}_{n}})}}]\ge {U_{n}}$ for all $n\ge 1$. Hence it remains to establish $E[{e^{-V({\tilde{h}_{n}})}}]\to U$.

Noting that $V({\tilde{h}_{n}})\to V({h^{\ast }})$ almost surely, it suffices to show that ${\sup _{n\in \mathbb{N}}}E[{e^{-2V({\tilde{h}_{n}})}}]\lt \infty $. This follows from

\[ E\big[{e^{-2V({\tilde{h}_{n}})}}\big]\le {e^{2\sqrt{C}||{\tilde{h}_{n}}|{|_{2}}||d|{|_{2}}}}E\big[{e^{2\sqrt{C}||{\tilde{h}_{n}}|{|_{2}}|N|}}\big]\le {e^{2\sqrt{C}||{h^{\ast }}|{|_{2}}||d|{|_{2}}}}E\big[{e^{2\sqrt{C}||{h^{\ast }}|{|_{2}}|N|}}\big],\]

where N is a standard normal random variable. □

Remark 3.4.

The main message of Theorem 3.3 is that the sequence of optimal expected utilities in the small markets defined above is a convergent sequence, the limit being a finite number. This means that after the consumer increases the number of assets in her/his portfolio to a certain level, a further increase of the number of assets will not bring significant increments of the expected utility. It is not trivial to have some estimations on the number of assets needed for the optimal expected utility to be sufficiently close to the overall best utility level. It would be interesting to see how fast this sequence converges to the overall best utility level U. We leave this for further discussions.

Lemma 3.5.

There exists $\alpha \gt 0$ such that, for all $h\in {\ell _{2}}$ with $\| h{\| _{{\ell _{2}}}}=1$, $P(V(h)\le -\alpha )\ge \alpha $ holds.

Proof.

We follow closely the proof of Proposition 3.2 in [7], see also [6]. We argue by contradiction. Assume that for all $n\ge 1$, there is ${g_{n}}=({g_{n}}(1),{g_{n}}(2),\dots )\in {\cup _{n\ge 1}}{H_{n}}$ with $\| {g_{n}}{\| _{{\ell _{2}}}}=1$ and $P(V({g_{n}})\le -1/n)\le 1/n$.

Clearly, $V{({g_{n}})^{-}}\to 0$ in probability as $n\to \infty $. We claim that ${E_{Q}}[V{({g_{n}})^{-}}]\to 0$. By the Cauchy–Schwarz inequality

\[ {E_{Q}}\big[V{({g_{n}})^{-}}\big]\le \| dQ/dP{\| _{{L^{2}}(P)}}{\big(E\big[{\big(V{({g_{n}})^{-}}\big)^{2}}\big]\big)^{1/2}}.\]

However,

(47)

\[ V{({g_{n}})^{-}}\le |V({g_{n}})|\le \sqrt{C}[|N|+||d|{|_{2}}]\]

for some standard normal N. This implies $E[{(V{({g_{n}})^{-}})^{2}}]$, $n\to \infty $, and hence our claim.

Since ${E_{Q}}[V({g_{n}})]=0$ by the martingale measure property of Q, we also get that ${E_{Q}}[V{({g_{n}})^{+}}]\to 0$. It follows that ${E_{Q}}[|V({g_{n}})|]\to 0$, hence $V({g_{n}})$ goes to zero Q-a.s. (along a subsequence) and, as Q is equivalent to P, P-a.s. Using that $|V({g_{n}}){|^{2}}$, $n\in \mathbb{N}$, is uniformly P-integrable by (47), we get $E[V{({g_{n}})^{2}}]\to 0$. An auxiliary calculation gives

\[ E\big[V{({g_{n}})^{2}}\big]=\| {g_{n}}{\| _{{\ell _{2}}}^{2}}E[Z]+{\sum \limits_{i=1}^{\infty }}{g_{n}^{2}}(i)E\big[{b_{i}^{2}}(Z)Z\big]\ge E[Z]\gt 0,\]

a contradiction proving our lemma. □

Lemma 3.6.

There is ${h^{\ast }}\in {\ell _{2}}$ such that $U=E[{e^{-V({h^{\ast }})}}]$.

Proof.

There are ${h_{n}}\in {\cup _{j\in \mathbb{N}}}{H_{j}}$, $n\in \mathbb{N}$, such that $E[{e^{-V({h_{n}})}}]\to U$. If we had ${\sup _{n}}||{h_{n}}|{|_{{\ell _{2}}}}=\infty $, then (taking a subsequence still denoted by n), $||{h_{n}}|{|_{{\ell _{2}}}}\to \infty $, $n\to \infty $. By Lemma 3.5,

\[ P\big(V({h_{n}})\le -\alpha ||{h_{n}}|{|_{{\ell _{2}}}}\big)\ge \alpha \]

and this implies $E[{e^{-V({h_{n}})}}]\to \infty $, which contradicts $E[{e^{-V({h_{n}})}}]\to U\le E[{e^{0}}]=1$.

Then necessarily ${\sup _{n}}||{h_{n}}|{|_{{\ell _{2}}}}\lt \infty $ and the Banach–Saks theorem implies that convex combinations ${\bar{h}_{n}}$ of ${h_{n}}$ converge to some ${h^{\ast }}\in {\ell _{2}}$ (in the norm of ${\ell _{2}}$). By Fatou’s lemma,

\[ E\big[{e^{-V({h^{\ast }})}}\big]\le \underset{n\to \infty }{\liminf }E\big[{e^{-V({\bar{h}_{n}})}}\big]\le \underset{n\to \infty }{\liminf }E\big[{e^{-V({h_{n}})}}\big]=U,\]

using also convexity of the exponential function. This proves the statement. □

4 Applications and examples

Our Theorem 2.16 gives a closed-form expression for the optimal portfolios for the problem (21) by using the function $Q(\theta )$ defined in (35). In this section, we first study some properties of this function. Then we present some examples.

Let ${\mathcal{M}_{Z}}(s)=E{e^{sZ}}$ and ${\mathcal{K}_{Z}}(s)=\ln {\mathcal{M}_{Z}}(s)$ denote the moment generating function (MGF) and the cumulant generating function (CGF) of the mixing distribution Z, respectively. We have the obvious relation

\[ Q(\theta )={e^{\mathcal{C}\theta }}{\mathcal{M}_{Z}}\bigg(\frac{\mathcal{C}}{2}{\theta ^{2}}-\frac{\mathcal{A}}{2}\bigg),\hspace{2em}\ln Q(\theta )=\mathcal{C}\theta +{\mathcal{K}_{Z}}\bigg(\frac{\mathcal{C}}{2}{\theta ^{2}}-\frac{\mathcal{A}}{2}\bigg).\]

Therefore the minimizing points of $Q(\theta )$ in (40) can also be found by using the MGF or KGF of Z. In the following lemma we state some properties of the function $Q(\theta )$.

Lemma 4.1.

Consider the model (1) with a nontrivial mixing distribution Z. Let $\hat{s}$ denote the CV-L of Z and $\hat{\theta }$ be defined as in Section 2. Let the function $Q(\theta )$ be defined by (35). Assume our model (1) is such that either $\mathcal{A}\ne 0$ or $\hat{s}\ne 0$ which ensures $\hat{\theta }=\sqrt{(\mathcal{A}-2\hat{s})/\mathcal{C}}\ne 0$ and hence $(-\hat{\theta },\hat{\theta })$ is a nonempty open interval. Then we have the following.

a) The function $Q(\theta )$ is infinitely differentiable on $(-\hat{\theta },\hat{\theta })$. If $\hat{s}$ is finite and ${\mathcal{L}_{Z}}(\hat{s})=+\infty $ or if $\hat{s}=-\infty $, we have

(48)
\[ \underset{\theta \to {\hat{\theta }^{-}}}{\lim }Q(\theta )=+\infty ,\hspace{2em}\underset{\theta \to -{\hat{\theta }^{+}}}{\lim }Q(\theta )=+\infty .\]
When $\hat{s}$ is finite and ${\mathcal{L}_{Z}}(\hat{s})\lt \infty $ we have $Q(\hat{\theta })\lt \infty $ and $Q(-\hat{\theta })\lt \infty $. When $\hat{s}$ is finite and $\theta \notin [-\hat{\theta },\hat{\theta }]$ we have $Q(\theta )=+\infty $.
b) The function $Q(\theta )$ is strictly increasing on $[0,\hat{\theta }]$ when $\hat{s}$ is finite. It is strictly increasing on $[0,+\infty )$ when $\hat{s}=-\infty $. We have ${Q^{\prime }}(0)\ne 0$ which implies ${q_{\mathit{min}}}$ in (39) cannot be zero under the stated conditions.
c) The function $Q(\theta )$ is strictly convex on the open interval $(-\hat{\theta },\hat{\theta })$ when $\hat{s}$ is finite and $\mathcal{L}(\hat{s})=+\infty $ or when $\hat{s}=-\infty $. $Q(\theta )$ is strictly convex on $[-\hat{\theta },\hat{\theta }]$ when $\hat{s}$ is finite and $\mathcal{L}(\hat{s})\lt \infty $.

Proof.

a) It is sufficient to prove that the function $\theta \to {\mathcal{L}_{Z}}(\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}})$ is infinitely differentiable when $\theta \in (-\hat{\theta },\hat{\theta })$. This function is a composition of two functions $s\to {\mathcal{L}_{Z}}(s)$ and $\theta \to \frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}}$. So it is sufficient to prove the infinite differentiability of $s\to {\mathcal{L}_{Z}}(s)$ in the corresponding domain. If ${\mathcal{L}_{Z}}(s)$ is k-times differentiable then we will have ${\mathcal{L}_{Z}^{(k)}}(s)={(-s)^{k}}E[{Z^{k}}{e^{-sZ}}]$. To justify the change of the order of derivative with expectation for this we need to show $E[{Z^{k}}{e^{-sZ}}]\lt \infty $. Let us look at the case $\hat{s}\ne 0$ first. In this case we have $E{e^{sZ}}\lt \infty $ in $(-\infty ,|\hat{s}|)$. Thus all the moments of Z are finite. This implies $E[{Z^{k}}{e^{-sZ}}]\lt \infty $ for any positive integer k and all $s\in (\hat{s},+\infty )$. If $\theta \in (-\hat{\theta },\hat{\theta })$, then $\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}}\in (\hat{s},\frac{\mathcal{A}}{2})$. Therefore, when $\hat{s}\ne 0$, the infinite differentiability of $Q(\theta )$ follows. Now let us look at the case $\hat{s}=0$. In this case $\hat{\theta }=\sqrt{\frac{\mathcal{A}}{\mathcal{C}}}$ and for any $\theta \in (-\hat{\theta },\hat{\theta })$ we have $\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}}\in (0,\frac{\mathcal{A}}{2})$. Therefore, it is sufficient to prove infinite differentiability of ${\mathcal{L}_{Z}}(s)$ on $(0,\frac{\mathcal{A}}{2})$. Fix an arbitrary positive integer k. When $s\in (0,\frac{\mathcal{A}}{2})$ we have ${Z^{k}}/{e^{sZ}}=({Z^{k}}/{e^{sZ}}){1_{\{Z\le M\}}}+({Z^{k}}/{e^{sZ}}){1_{\{Z\gt M\}}}$ for any positive number M. For sufficiently large $M={M_{0}}$, we have $({Z^{k}}/{e^{sZ}}){1_{\{Z\gt {M_{0}}\}}}\le 1$ and ${Z^{k}}/{e^{sZ}}=({Z^{k}}/{e^{sZ}}){1_{\{Z\le {M_{0}}\}}}$ is a bounded random variable. Thus $E({Z^{k}}{e^{-sZ}})\lt \infty $ for any positive integer k when $s\in (0,\frac{\mathcal{A}}{2})$. This shows that $\theta \to {\mathcal{L}_{Z}}(\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}})$ is infinitely differentiable when $\hat{s}=0$ also.

When $\hat{s}$ is finite and when $\theta \to \hat{\theta }$ from the left-hand side or when $\theta \to -\hat{\theta }$ from the right-hand side, the function $\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}}$ decreasingly converges to $\hat{s}$ (in some neighborhood of $\hat{s}$). Then the monotone convergence theorem gives the claim (48). Now assume $\hat{s}=-\infty $ which happens when the mixing distribution Z is a bounded nontrivial random variable. The result ${\lim \nolimits_{\theta \to +\infty }}Q(\theta )=+\infty $ is clear as both ${e^{\mathcal{C}\theta }}$ and ${\mathcal{L}_{Z}}(\frac{\mathcal{A}}{2}-\frac{{\theta ^{2}}}{2}\mathcal{C})$ go to $+\infty $. The limit ${\lim \nolimits_{\theta \to -\infty }}Q(\theta )=+\infty $ is less clear as ${e^{\mathcal{C}\theta }}\to 0$ and ${\mathcal{L}_{Z}}(\frac{\mathcal{A}}{2}-\frac{{\theta ^{2}}}{2}\mathcal{C})\to +\infty $ in this case. But since $Z\ne 0$ with positive probability, we have a positive number $\delta \gt 0$ with $P(Z\ge \delta )\gt 0$. We have

(49)

\[ Q(\theta )=E{e^{[\frac{\mathcal{C}}{2}{\theta ^{2}}-\frac{\mathcal{A}}{2}]Z+\mathcal{C}\theta }}\ge {e^{[\frac{\mathcal{C}}{2}{\theta ^{2}}-\frac{\mathcal{A}}{2}]\delta +\mathcal{C}\theta }}P(Z\ge \delta )\]

for all θ with $\frac{\mathcal{C}}{2}{\theta ^{2}}-\frac{\mathcal{A}}{2}\gt 0$. Then, since the right-hand side of (49) goes to $+\infty $ when $\theta \to -\infty $, the claim follows. The remaining property of Q in part a) above is obvious by the definition of $\hat{\theta }$.

b) For any $\theta \in (-\hat{\theta },\hat{\theta })$ we have

(50)

\[ {Q^{\prime }}(\theta )=\mathcal{C}{e^{\mathcal{C}\theta }}{\mathcal{L}_{Z}}\bigg[\frac{\mathcal{A}}{2}-\frac{{\theta ^{2}}}{2}\mathcal{C}\bigg]-\theta \mathcal{C}{e^{\mathcal{C}\theta }}{\mathcal{L}^{\prime }_{Z}}\bigg[\frac{\mathcal{A}}{2}-\frac{{\theta ^{2}}}{2}\mathcal{C}\bigg].\]

Observe that $0\in (-\hat{\theta },\hat{\theta })$ always (in both cases $\hat{s}\ne 0$ and $\hat{s}=0$). Therefore, ${Q^{\prime }}(0)$ always exists and from (50) we see that ${Q^{\prime }}(0)\ne 0$. Now since ${\mathcal{L}_{Z}}(s)$ is a strictly decreasing function, we have ${\mathcal{L}^{\prime }_{Z}}(s)\lt 0$. Therefore, ${Q^{\prime }}(\theta )$ is finite and ${Q^{\prime }}(\theta )\gt 0$ when $\theta \in (0,\hat{\theta })$. At $\theta =0$, we have $Q(0)=\mathcal{C}{\mathcal{L}_{Z}}(\mathcal{A}/2)$ and clearly we have $Q(0)\lt Q(\theta )$ for all $\theta \in (0,\hat{\theta })$. At $\theta =\hat{\theta }$, we have $Q(\theta )={\mathcal{L}_{Z}}(\hat{s})$ which is either $+\infty $ or finite. When it is finite we have $Q(\theta )\lt Q(\hat{\theta })$ for all $\theta \in [0,\hat{\theta })$ also.

c) Define ${f_{z}}(\theta )=:{e^{\frac{\mathcal{C}}{2}z{\theta ^{2}}+\mathcal{C}\theta -\frac{\mathcal{A}}{2}z}}$ for any real number $z\ge 0$ and for all $\theta \in \mathbb{R}$. We have ${f^{\prime }_{z}}(\theta )=(\mathcal{C}z\theta +\mathcal{C}){e^{\frac{\mathcal{C}}{2}z{\theta ^{2}}+\mathcal{C}\theta -\frac{\mathcal{A}}{2}z}}$ and ${f^{\prime\prime }_{z}}(\theta )=\mathcal{C}z{e^{\frac{\mathcal{C}}{2}z{\theta ^{2}}+\mathcal{C}\theta -\frac{\mathcal{A}}{2}z}}+{(\mathcal{C}z\theta +\mathcal{C})^{2}}{e^{\frac{\mathcal{C}}{2}z{\theta ^{2}}+\mathcal{C}\theta -\frac{\mathcal{A}}{2}z}}\gt 0$ for any $z\ge 0$. Therefore, ${f_{z}}(\theta )$ is a strictly convex function for any fixed $z\ge 0$. Therefore, we have

\[ {f_{z}}\big(\lambda {\theta _{1}}+(1-\lambda ){\theta _{2}}\big)\lt \lambda {f_{z}}({\theta _{1}})+(1-\lambda ){f_{z}}({\theta _{2}})\]

for any $\lambda \in [0,1]$ and for all ${\theta _{1}},{\theta _{2}}\in \mathbb{R}$ for each fixed $z\ge 0$. This strict inequality also holds when $z=Z$. Also, observe that when $\hat{s}$ is finite and ${\mathcal{L}_{Z}}(\hat{s})=+\infty $ or when $\hat{s}=-\infty $, for ${\theta _{1}},{\theta _{2}}\in (-\hat{\theta },\hat{\theta })$ we have $E{f_{Z}}({\theta _{1}})\lt \infty $ and $E{f_{Z}}({\theta _{2}})\lt \infty $. When $\hat{s}$ is finite and ${\mathcal{L}_{Z}}(\hat{s})\lt \infty $, for all ${\theta _{1}},{\theta _{2}}\in [-\hat{\theta },\hat{\theta }]$ we have $E{f_{Z}}({\theta _{1}})\lt \infty $ and $E{f_{Z}}({\theta _{2}})\lt \infty $. We take expectation to the above inequality when $z=Z$ and obtain $Q(\lambda {\theta _{1}}+(1-\lambda ){\theta _{2}})\lt {\lambda _{1}}Q({\theta _{1}})+(1-\lambda )Q({\theta _{2}})$. This shows the strict convexity of $Q(\theta )$ stated in the lemma. □

Remark 4.2.

The main message of Lemma 4.1 is that the optimal solution to the problem (21) is always unique. Now assume ${\mathcal{L}_{Z}}(\hat{s})\lt \infty $. In this case, if the optimal portfolio ${x^{\mathrm{\star }}}$ for the problem (21) is irregular then ${q_{\mathit{min}}}$ in (39) satisfy ${q_{\mathit{min}}}=-\hat{\theta }$. This means that $-\hat{\theta }$ is the minimizing point of $Q(\theta )$ in $[-\hat{\theta },\hat{\theta }]$. As $Q(\theta )$ is a strictly convex function on $[-\hat{\theta },\hat{\theta }]$ as shown in Lemma 4.1, we conclude that $Q(\theta )$ is a strictly increasing, strictly convex function on $[-\hat{\theta },\hat{\theta }]$. In comparison, when the solution to (21) is regular, then the corresponding $Q(\theta )$ is strictly convex but not strictly increasing on $[-\hat{\theta },\hat{\theta }]$.

Example 4.3.

Assume the mixing distribution Z in our model (1) takes finitely many values ${\{{z_{i}}\}_{1\le i\le m}}$ with corresponding probabilities ${({p_{i}})_{1\le i\le m}}$. Then X in (1) is a mixture of normal random vectors

(51)

\[ X\sim {\sum \limits_{i=1}^{m}}{p_{i}}{N_{d}}(\mu +\gamma {z_{i}},{z_{i}}\Sigma ).\]

In this case, the function $Q(\theta )$ takes the form

(52)

\[ Q(\theta )={\sum \limits_{i=1}^{m}}{p_{i}}{e^{(\frac{{\theta ^{2}}}{2}C-\frac{1}{2}A){z_{i}}+\theta C}}.\]

From part c) of the above Lemma 4.1 we know that the function $Q(\theta )$ is strictly convex on $(-\infty ,+\infty )$. Thus the solution to the optimization problem (21) is unique and it is given by (41) with ${q_{\mathit{min}}}=\arg {\min _{\theta \in (-\infty ,0)}}Q(\theta )$. Now, assume $Z=1$ with probability one instead. Then ${\mathcal{L}_{Z}}(s)={e^{-s}}$ and in this case it is easy to see that

\[ Q(\theta )={e^{\frac{C}{2}({\theta ^{2}}+2\theta )-\frac{A}{2}}}.\]

The minimizing point of this function is $\theta =-1$ and so ${q_{\mathit{min}}}=-1$. Then, from (39), the optimal portfolio is given by

\[ {x^{\mathrm{\star }}}=\frac{1}{a{W_{0}}}{\Sigma ^{-1}}(\gamma +\mu -\mathbf{1}{r_{f}}).\]

Note here that since we assumed $Z=1$, X in (1) is a Gaussian random vector and therefore one can obtain the above optimal portfolio by direct calculation as our utility function is exponential. However, our above approach seems more convenient.

In the next example, we look at the case of GH models.

Example 4.4.

Let us look at the case of the model (1) when the mixing distribution Z is given by GIG models. First assume $Z\sim \mathit{iG}(\lambda ,\frac{{a^{2}}}{2})$, the inverse Gaussian distribution. In this case, we have $\lambda \lt 0$ by the definition of inverse Gaussian random variable. From Proposition 9 of [10] we have ${\mathcal{L}_{Z}}(s)={(\frac{2}{a\sqrt{2s}})^{\lambda }}\frac{2{K_{\lambda }}(a\sqrt{2s})}{\Gamma (-\lambda )}$ and therefore $Q(\theta )={e^{\mathcal{C}\theta }}{(\frac{2}{a\sqrt{\mathcal{A}-\mathcal{C}{\theta ^{2}}}})^{\lambda }}\frac{2{K_{\lambda }}(a\sqrt{\mathcal{A}-\mathcal{C}{\theta ^{2}}})}{\Gamma (-\lambda )}$. In this case, the CV-L is $\hat{s}=0$ and $\hat{\theta }=\sqrt{\mathcal{A}/\mathcal{C}}$. If $\gamma =0$, as discussed in Example 2.4, the optimal solution to (21) is ${x^{\mathrm{\star }}}=0$. In this case, the solution ${x^{\mathrm{\star }}}=0$ is irregular. Note that in this case $\mathcal{A}=0$ and therefore $\hat{\theta }=0$. If $\gamma \ne 0$, then $\hat{\theta }\gt 0$ and in this case ${q_{\mathit{min}}}$ in (39) is given by ${q_{\mathit{min}}}=\arg {\min _{\theta \in [-\sqrt{\mathcal{A}/\mathcal{C}},0)}}Q(\theta )$ (due to Lemma 4.1). Note that either by using the fact $\hat{s}=0$ or by using the property (A. 8) in [10] directly, one can easily check that ${(\frac{2}{a\sqrt{\mathcal{A}-\mathcal{C}{\theta ^{2}}}})^{\lambda }}\frac{2{K_{\lambda }}(a\sqrt{\mathcal{A}-\mathcal{C}{\theta ^{2}}})}{\Gamma (-\lambda )}\to 1$ when ${\theta ^{2}}\to \mathcal{A}/\mathcal{C}$. Therefore $Q(-\sqrt{\frac{\mathcal{A}}{\mathcal{C}}})={e^{-\sqrt{\mathcal{A}\mathcal{C}}}}$. In this case, it is not clear if ${q_{\mathit{min}}}=-\sqrt{\frac{\mathcal{A}}{\mathcal{C}}}$ (the solution ${x^{\mathrm{\star }}}$ is irregular) or ${q_{\mathit{min}}}\in (-\sqrt{\frac{\mathcal{A}}{\mathcal{C}}},0)$ (the solution ${x^{\mathrm{\star }}}$ is regular).

Now let us look at the case $Z\sim \mathit{GIG}(\lambda ,a,b)$ when $a\gt 0$, $b\gt 0$. Again from Proposition 9 of [10] we have ${\mathcal{L}_{Z}}(s)={(\frac{b}{\sqrt{{b^{2}}+2s}})^{\lambda }}\frac{{K_{\lambda }}(a\sqrt{{b^{2}}+2s})}{{K_{\lambda }}(ab)}$ and $Q(\theta )={e^{\mathcal{C}\theta }}{(\frac{b}{\sqrt{{b^{2}}+\mathcal{A}-\mathcal{C}{\theta ^{2}}}})^{\lambda }}\frac{{K_{\lambda }}(a\sqrt{{b^{2}}+\mathcal{A}-\mathcal{C}{\theta ^{2}}})}{{K_{\lambda }}(ab)}$. In this case $\hat{s}=-{b^{2}}/2$ and $\hat{\theta }=\sqrt{\frac{\mathcal{A}+{b^{2}}}{\mathcal{C}}}$. One can easily check that ${\mathcal{L}_{Z}}(\hat{s})=+\infty $ in this case. Therefore the unique optimal solution to (21) is given by (41) and it is regular.

Corollary 4.5.

Consider the model (1) with $\gamma =0$. In this case the distribution of X is Elliptical distribution. Assume the CV-L of the mixing distribution Z is $\hat{s}=0$. Then the corresponding optimization problem (21) has a unique solution ${x^{\mathrm{\star }}}=0$. The CV-L of Z is $\hat{s}=0$ if $E{Z^{n}}=+\infty $ for some positive integer n.

Proof.

Observe that in this case $\mathcal{A}=0$ and therefore $\hat{\theta }=0$. Then $[-\hat{\theta },\hat{\theta }]=\{0\}$. Therefore ${q_{\mathit{min}}}$ in (39) is ${q_{\mathit{min}}}=0$. As $\gamma =0$ also by assumption, we have ${x^{\mathrm{\star }}}=0$ by (39). It is clear that this solution is unique. If $\hat{s}\ne 0$, then the Laplace transformation of Z is finite in $(-\infty ,|\hat{s}|)$ and this would imply that all the moments of Z are finite. Therefore infiniteness of one of the moments of Z implies $\hat{s}=0$. □

Example 4.6 (Stable distributions).

Let us look at the case of α-stable distributions. Here we look at the 1-parametrization of the stable distributions (see Definition 1.5 of [24]). For other parameterizations, see [24]. A distribution W follows α-stable distribution with parameters $\alpha \in (0,2]$, $\beta \in [-1,1]$, $\sigma \gt 0$, $u\in \mathbb{R}$, and we write $W\sim S(\alpha ,\beta ,\sigma ,u)$ if its characteristic function is given by

(53)

\[ \phi (t)=E{e^{itW}}=\left\{\begin{array}{l@{\hskip10.0pt}l}{e^{-{\sigma ^{\alpha }}|t{|^{\alpha }}[1-i\beta \mathrm{sign}(t)\mathrm{tan}(\frac{\pi \alpha }{2})]+itu}},& \alpha \ne 1,\\ {} {e^{-\sigma |t|[1+i\beta \frac{2}{\pi }\mathrm{sign}(t)\ln |t|]+itu}},& \alpha =1.\end{array}\right.\]

When $\alpha =2$, a stable distribution is a normal distribution. When $\alpha \in (0,2)$, $E{W^{2}}=+\infty $ for all $\beta \in [-1,1]$, $\sigma \gt 0$, $u\in \mathbb{R}$. Therefore, for the mixing distributions $Z=|W|$, $\alpha \in (0,2)$, $\beta \in [-1,1]$, $\sigma \gt 0$, $u\in \mathbb{R}$, the corresponding CV-L is $\hat{s}=0$. Thus when $\gamma =0$ and when $Z=|W|$, $\alpha \in (0,2)$, $\beta \in [-1,1]$, $\sigma \gt 0$, $u\in \mathbb{R}$, in the model (1), the optimization problem (21) has a unique solution ${x^{\mathrm{\star }}}=0$. This means that when the mixing distribution Z in (1) is equal to the absolute value of a stable distribution with $\alpha \in (0,2)$ and when $\gamma =0$, then the optimal portfolio for an exponential utility maximizer is to invest all her/his wealth into the risk-free asset.

Remark 4.7.

Stable distributions are infinitely divisible. The characteristic functions (53) of the stable laws can be obtained directly from their Lévy–Khintchine representations. The generelized central limit theorem states that stable laws are the only nontrivial limits of normalized sums of independent identically distributed random variables. As such they were proposed to model many empirical (heavy tails, skewness, etc.) financial phenomena in the past. The heavy-tailedness of them is related with the CV-L of them being $\hat{s}=0$. Example 4.6 shows that time-changed Brownian motion models with stable subordinators (the ones with Elliptical marginal distributions) always give the trivial portfolio, investing everything on the risk-free asset, as the optimal portfolio for an exponential utility maximizer.

As pointed out in Remark 4.2, our Lemma 4.1 shows that the solution to the problem (21) is unique. Part b) of this lemma shows that $\theta =0$ is not the minimizing point of the function $Q(\theta )$ under the condition that $\mathcal{A}\ne 0$ or $\hat{s}\ne 0$. For this unique minimizing point $\theta \ne 0$ of $Q(\theta )$ the first order condition (50) can equivalently be written as

(54)

\[ \frac{{\mathcal{L}^{\prime }_{Z}}(\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}})}{{\mathcal{L}_{Z}}(\frac{\mathcal{A}}{2}-\frac{\mathcal{C}}{2}{\theta ^{2}})}=\frac{1}{\theta }.\]

A change of variable $\eta =\mathcal{A}/2-(\mathcal{C}/2){\theta ^{2}}$, which gives $\theta =-\sqrt{(\mathcal{A}-2\beta )/\mathcal{C}}$ due to $\theta \lt 0$ by Lemma 4.1, then gives

(55)

\[ \frac{{\mathcal{L}^{\prime }_{Z}}(\beta )}{{\mathcal{L}_{Z}}(\beta )}=-\sqrt{\mathcal{C}/(\mathcal{A}-2\beta )},\hspace{1em}\hat{s}\lt \beta \lt \mathcal{A}/2.\]

From this we can conclude that if ${x^{\mathrm{\star }}}$ is a regular solution to (21), then ${\beta _{\mathit{min}}}=:\mathcal{A}/2-(\mathcal{C}/2){q_{\mathit{min}}^{2}}$ with ${q_{\mathit{min}}}$ in (41) satisfies the relation (55). This observation is useful if it can be confirmed that the solution to the equation (55) is unique. Then this unique solution equals to ${\beta _{\mathit{min}}}$. Consider, for example, the case $Z=1$ in the model (1). As discussed in Example 4.3, in this case we have ${\mathcal{L}_{Z}}(s)={e^{-s}}$. Then ${\mathcal{L}^{\prime }_{Z}}(\beta )/{\mathcal{L}_{Z}}(\beta )=-1$ and it is clear that the equation $1=\sqrt{\mathcal{C}/(\mathcal{A}-2\beta )}$ has a unique solution $\beta =\mathcal{A}/2-\mathcal{C}/2$. This implies ${q_{\mathit{min}}^{2}}=1$ which then shows that ${q_{\mathit{min}}}=-1$ is the minimizing point of $Q(\theta )$.

A positive random variable Z is a GGC with a generating pair $(\tau ,\nu )$ if

(56)

\[ {\mathcal{L}_{Z}}(s)=E{e^{-sZ}}={e^{-\tau -{\textstyle\textstyle\int _{0}^{\infty }}\ln (1+\frac{s}{z})\nu (dz)}}.\]

If Z is a GGC with a generating pair $(\tau ,\nu )$, then $\frac{{\mathcal{L}^{\prime }_{Z}}(\beta )}{{\mathcal{L}_{Z}}(\beta )}=-\tau -{\textstyle\int _{0}^{+\infty }}\frac{1}{t-\beta }\nu (dt)$. So if the solution to (21) is regular, then ${\beta _{\mathit{min}}}$ defined above satisfies the equation

\[ -\tau -{\int _{|\hat{s}|}^{+\infty }}\frac{1}{t-\beta }\nu (dt)=-\sqrt{\mathcal{C}/(\mathcal{A}-2\beta )},\]

where $\hat{s}$ is the CV-L of the GGC random variable Z.

Now consider the case of positive α-stable random variables $Z=S(\alpha ,1,\sigma ,u)$, $0\lt \alpha \lt 1$, $u\gt 0$. Here we took $\beta =1$ (see Lemma 1.1 of [24]). After normalization these mixing distributions have the Laplace transformation ${\mathcal{L}_{Z}}(s)={e^{-{s^{\alpha }}}}$ (see Proposition 1 of [4] and also see [28]). Thus we have ${\mathcal{L}^{\prime }_{Z}}(s)/{\mathcal{L}_{Z}}(s)=-{s^{\alpha }}\ln s$. Assume the problem (21) has a regular solution (a necessary condition for this is $\gamma \ne 0$, see Corollary 4.5). Let ${\beta _{\mathit{min}}}=\mathcal{A}/2-(\mathcal{C}/2){q_{\mathit{min}}^{2}}$ with ${q_{\mathit{min}}}$ in (41). Then $0\lt {\beta _{\mathit{min}}}\lt \mathcal{A}/2$ and due to (55) it satisfies the equation

\[ {\beta ^{\alpha }}\ln \beta =\sqrt{\mathcal{C}/(\mathcal{A}-2\beta )}.\]

We square both sides of this equation and obtain

\[ \mathcal{A}{\beta ^{2\alpha }}{(\ln \beta )^{2}}-2{\beta ^{2\alpha +1}}{(\ln \beta )^{2}}=\mathcal{C}.\]

As discussed earlier, if this equation has a unique solution β then it is ${\beta _{\mathit{min}}}$.

Remark 4.8.

We should mention here that the formula (39) for the optimal portfolio for the problem (21) is related to the Laplace transformation of the mixing distribution Z in the model (1) only. Namely, we don’t need to know the probability density function of Z to find the optimal portfolio for the optimization problem (21). The relation (55) gives a convenient approach to locating the unique optimal portfolio as discussed earlier.

Next, we discuss the applications of our results in continuous time financial modeling. First, we recall Lemma 2.6 of [10] here. According to this lemma, for each model $F={N_{d}}(\mu +\gamma z,z\Sigma )\circ G$ in (1) there is a corresponding Lévy process

(57)

\[ {Y_{t}}=\mu t+\gamma {\tau _{t}}+{\bar{B}_{{\tau _{t}}}},\]

with $\mathit{Law}({Y_{1}})=F$ and $\mathit{Law}({\tau _{1}})=G$ as long as $G\in \mathcal{J}$ (note that if $G\in \mathcal{J}$ then $X\in \mathcal{J}$ also from Lemma 2.5 of [10]). In the model (57), ${({\bar{B}_{t}})_{t\ge 0}}={(A{B_{t}})_{t\ge 0}}$ where ${B_{t}}$ is an n-dimensional standard Brownian motion independent from ${({\tau _{t}})_{t\ge 0}}$ and ${({\tau _{t}})_{t\ge 0}}$ is a subordinator (a nonnegative Lévy process with increasing sample paths). We denote the Lévy measure of this subordinator by ρ and its Laplace transformation by

(58)

\[ {\mathcal{L}_{{\tau _{t}}}}(s)={e^{-t\Psi (s)}},\]

where $\Psi (s)=bs+{\textstyle\int _{0}^{\infty }}(1-{e^{-sy}})\rho (dy)$ with a constant $b\ge 0$. As stated in Proposition 2.3 of [16], the function $\Psi (s)$ is continuous, nondecreasing, nonnegative, and convex. At each time point $t\gt 0$ we have

(59)

\[ {Y_{t}}\stackrel{d}{=}\mu t+\gamma {\tau _{t}}+\sqrt{{\tau _{t}}}A{N_{d}}.\]

Now consider a market with n risky assets with the price process ${S_{t}}\in {\mathbb{R}^{d}}$ and one risk-free asset with price process ${B_{t}}={e^{t{r_{f}}}}$. Assume the log-return process ${Y_{t}}=({Y_{t}^{(1)}},{Y_{t}^{(2)}},\dots ,{Y_{t}^{(d)}})$, where ${Y_{t}^{(i)}}=\ln ({S_{t}^{(i)}}/{S_{0}^{(i)}})$, has the dynamics as in (57). The log-return in the risk-free asset is $\ln ({B_{t}}/{B_{0}})={r_{f}}t$. An exponential utility maximizer wants to determine the optimal portfolio at each time point t based on the log-return vector of risky assets $R\in {\mathbb{R}^{d}}$ with components ${R^{(i)}}=\ln ({S_{t+\triangle }^{(i)}}/{S_{t}^{(i)}})$ and the log-return of the risk-free asset ${R^{(0)}}=\ln ({B_{t+\triangle }}/{B_{t}})=\triangle {r_{f}}$ in the time horizon $[t,t+\triangle ]$. Assume the time increment is $\triangle =1$. Then we have

(60)

\[ R\stackrel{d}{=}\mu +\gamma {\tau _{1}}+\sqrt{{\tau _{1}}}A{N_{d}},\]

and from our Theorem 2.16 the exponential utility maximizer’s optimal portfolio at time t is

(61)

\[ {x_{t}^{\mathrm{\star }}}=\frac{1}{a{W_{0}^{(t)}}}\big[{\Sigma ^{-1}}\gamma -{q_{\mathit{min}}^{(t)}}{\Sigma ^{-1}}(\mu -\mathbf{1}{r_{f}})\big],\]

where ${W_{0}^{(t)}}$ is its (initial) wealth that it invests in the $n+1$ assets for the period $[t,t+\triangle ]$ and ${q_{\mathit{min}}^{(t)}}$ in (61) is given by ${q_{\mathit{min}}^{(t)}}=\arg {\min _{\theta \in \Theta }}Q(\theta )$ in the corresponding domain θ. Here

(62)

\[ Q(\theta )={e^{C\theta -\Psi (\frac{1}{2}A-\frac{{\theta ^{2}}}{2}C)}},\]

due to (58).

Example 4.9 (Variance-gamma model).

Consider the financial market that was discussed in the paper [21]. The stock price is given by $S(t)=S(0){e^{mt+X(t;\hspace{0.2778em}{\sigma _{S}},\hspace{0.2778em}{\nu _{S}},\hspace{0.2778em}{\theta _{S}})+{\omega _{S}}t}}$ in their equation (21), where m is the mean-rate of return on the stock under the statistical probability measure, ${\omega _{S}}=\frac{1}{{\nu _{S}}}\ln (1-{\theta _{S}}{\nu _{S}}-{\sigma _{S}^{2}}{\nu _{S}}/2)$, and $X(t;{\sigma _{S}},{\nu _{S}},{\theta _{S}})=b(\gamma (t;1,{\nu _{S}});{\theta _{S}},{\sigma _{S}})$ with $b(t;\theta ,\sigma )=\theta t+\sigma W(t)$ being a Brownian motion with drift θ and volatility σ. Here the gamma process $\gamma (t;\mu ,\nu )$ has mean rate μ and variance rate ν (note here that $\gamma (t;\mu ,\nu )\sim G({\mu ^{2}}/\nu ,\nu /\mu )$ with our notation for gamma random variables in this paper). The increment ${g_{0}}=:\gamma (t+1;1,{\nu _{S}})-\gamma (t;1,{\nu _{S}})\stackrel{d}{=}\gamma (1;1,{\nu _{S}})$ of this process has the Laplace transformation

(63)

\[ {\mathcal{L}_{{g_{0}}}}(s)={\bigg(\frac{1}{1+s{\nu _{S}}}\bigg)^{\frac{1}{{\nu _{S}}}}},\]

which can be seen also from the characteristic function expression in (3) of [21] for gamma processes. The risk-free asset in this financial market is given by ${B_{t}}={B_{0}}{e^{t{r_{f}}}}$. The log-returns of these two assets in the time horizon $[t,t+1]$ are given by

\[\begin{aligned}{}R=:& \ln \big(S(t+1)/S(t)\big)\stackrel{d}{=}m+{\omega _{S}}+{\theta _{S}}\gamma (1;1,\nu )+{\sigma _{S}}\sqrt{\gamma (1;1,{\nu _{S}})}N(0,1),\\ {} {R^{0}}=:& \ln ({B_{t+1}}/{B_{t}})={r_{f}}.\end{aligned}\]

An exponential utility maximizer with the utility function $u(x)=-{e^{-ax}}$, $a\gt 0$, and wealth ${W_{0}^{(t)}}$ at time t wants to decide on the optimal proportion ${x^{\mathrm{\star }}}$ on the risky asset of his wealth for the period $[t,t+1]$. His acceptable set for ${x^{\mathrm{\star }}}$ is given by

(64)

\[ {S_{a}}=\bigg\{x\in \mathbb{R}:a{W_{0}^{(t)}}{\theta _{S}}x-\frac{{a^{2}}{({W_{0}^{(t)}})^{2}}}{2}{\sigma _{S}^{2}}{x^{2}}\gt -\frac{1}{{\nu _{S}}}\bigg\},\]

as $\hat{s}=-\frac{1}{{\nu _{S}}}$ in this case. The corresponding expressions for $\mathcal{A}$, $\mathcal{B}$, $\mathcal{C}$ in (32) are given by

\[ \mathcal{A}={\bigg(\frac{{\theta _{S}}}{{\sigma _{S}}}\bigg)^{2}},\hspace{2em}\mathcal{C}={\bigg(\frac{m+{\omega _{S}}-{r_{f}}}{{\sigma _{S}}}\bigg)^{2}},\mathcal{B}=\frac{{\theta _{S}}(m+{\omega _{S}}-{r_{f}})}{{\sigma _{S}^{2}}}.\]

Since the mixing distribution is of a gamma random variable, the solution to the corresponding problem (21) is regular. Our Theorem 2.16 shows that the optimal portfolio is given by

(65)

\[ {x^{\mathrm{\star }}}=\frac{1}{a{W_{0}}}\bigg[\frac{1}{{\sigma _{S}^{2}}}{\theta _{S}}-{q_{\mathit{min}}}\frac{1}{{\sigma _{S}^{2}}}(m+{\omega _{S}}-{r_{f}})\bigg].\]

where ${q_{\mathit{min}}}=\arg {\min _{\theta \in (-\hat{\theta },\hat{\theta })}}Q(\theta )$ with $Q(\theta )$ given by (35). Here $\hat{\theta }=\sqrt{\frac{\mathcal{A}+2/{\nu _{S}}}{\mathcal{C}}}$. Next, we calculate ${q_{\mathit{min}}}$ explicitly. We have $Q(\theta )={e^{\mathcal{C}\theta }}{\mathcal{L}_{{g_{0}}}}(\mathcal{A}/2-(\mathcal{C}/2){\theta ^{2}})$ and from this we get $\ln Q(\theta )=C\theta -\frac{1}{{v_{S}}}\ln (1+\frac{A}{2}{v_{S}}-\frac{C}{2}{v_{S}}{\theta ^{2}})$. The first order condition for the minimizing point of $\ln Q(\theta )$ gives ${(\theta +\frac{1}{\mathcal{C}{\nu _{S}}})^{2}}=\frac{1+\mathcal{C}{\nu _{S}}(2+\mathcal{A}{\nu _{S}})}{{\mathcal{C}^{2}}{\nu _{S}^{2}}}$. This gives two solutions $\theta =-\frac{1}{\mathcal{C}{\nu _{S}}}\pm \frac{1}{\mathcal{C}{\nu _{S}}}\sqrt{1+\mathcal{C}{\nu _{S}}(2+\mathcal{A}{\nu _{S}})}$. But since θ needs to be negative due to Lemma 4.1, we take ${q_{\mathit{min}}}=\theta =-\frac{1}{\mathcal{C}{\nu _{S}}}-\frac{1}{\mathcal{C}{\nu _{S}}}\sqrt{1+\mathcal{C}{\nu _{S}}(2+\mathcal{A}{\nu _{S}})}$. We then plug this into (39) and obtain

(66)

\[ {x^{\mathrm{\star }}}=\frac{1}{a{W_{0}^{(t)}}{\sigma _{S}^{2}}}\bigg[{\theta _{S}}+\frac{m+{\omega _{S}}-{r_{f}}}{\mathcal{C}{\nu _{S}}}+\frac{m+{\omega _{S}}-{r_{f}}}{\mathcal{C}{\nu _{S}}}\sqrt{1+\mathcal{C}{\nu _{S}}(2+\mathcal{A}{\nu _{S}})}\bigg].\]

Therefore in this case we have a closed-form expression for the optimal portfolio. We should mention that one can use similar calculations to obtain a closed-form expression for optimal portfolio in a market where risky assets are modeled by multidimensional variance gamma (MVG) model, see [20] for the details of MVG models.

Remark 4.10.

Price processes with log-returns of the type (57) have been quite popular in financial literature in the past. Such models include inverse Gaussian Lévy processes, hyperbolic Lévy motions, variance gamma models, and CGYM models, and all of these models were shown to fit empirical data quite well, see [5, 9, 27, 8, 19] and the references therein for this. In fact, every semimartingale can be written as a time-change of Brownian motion, see [23] for this. This means that all the Lévy processes are time-changed Brownian motions. In all these cases, if the time-changing subordinator is independent of the Brownian motion, then our Theorem 2.16 is applicable in principle. However, it is not easy to find the time-change used for general semimartingales. Recently, the paper [19] obtained the time-change used for the CGMY model and Meixner processes. Our results in this paper can be applied to such processes to determine optimal portfolios for an exponential utility maximizer in a market where single or multiple risky asset dynamics follow such models.

5 Conclusion

The main result of this paper is Theorem 2.16 where we show that the problem of locating the optimal portfolio for (11) when the utility function is exponential boils down to finding the minimum point of a real-valued function on the real line, improving Theorem 1 of [3] for the case of GH models and in the meantime extending it from the class of GH models to the general class of NMVM models. Our Theorem 3.3 shows that an optimal exponential utility in small markets converge to the overall best exponential utility in the large financial market. While optimal portfolio problems under expected utility criteria for exponential utility functions have been discussed extensively in the past financial literature, an explicit solution of the optimal portfolio as in Theorem 2.16 above seems to be new. This is partly due to the condition we impose on the return vector X of being an NMVM model. However, despite this restrictive condition on X, asset price dynamics with NMVM distributions in their log-returns often show up in financial literature like exponential variance gamma and exponential generalized hyperbolic Lévy motions.

Authors

Abstract

1 Introduction

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

(9)

(10)

(11)

2 Closed-form solution for optimal portfolios under an exponential utility

(12)

(13)

Lemma 2.1.

(14)

Proof.

Remark 2.2.

Remark 2.3.

(15)

(16)

(17)

(18)

Example 2.4.

Definition 2.5.

(19)

Remark 2.6.

(20)

Remark 2.7.

(21)

(22)

(23)

(24)

Lemma 2.8.

Proof.

Remark 2.9.

Definition 2.10.

Remark 2.11.

Remark 2.12.

Lemma 2.13.

(25)

(26)

(27)

Proof.

(28)

(29)

(30)

(31)

(32)

(33)

(34)

(35)

Lemma 2.14.

(36)

Proof.

(37)

(38)

Remark 2.15.

Theorem 2.16.

(39)

(40)

Proof.

Proposition 2.17.

(41)

(42)

Proof.

Remark 2.18.

Remark 2.19.

3 Large financial markets

(43)

(44)

(45)

Assumption 3.1.

Assumption 3.2.

(46)

Theorem 3.3.

Proof.