On closeness of two discrete weighted sums

Čekanavičius, Vydas; Vellaisamy, Palaniappan

doi:10.15559/18-VMSTA103

Abstract

The effect that weighted summands have on each other in approximations of $S={w_{1}}{S_{1}}+{w_{2}}{S_{2}}+\cdots +{w_{N}}{S_{N}}$ is investigated. Here, ${S_{i}}$’s are sums of integer-valued random variables, and ${w_{i}}$ denote weights, $i=1,\dots ,N$. Two cases are considered: the general case of independent random variables when their closeness is ensured by the matching of factorial moments and the case when the ${S_{i}}$ has the Markov Binomial distribution. The Kolmogorov metric is used to estimate the accuracy of approximation.

1 Introduction

Let us consider a typical cluster sampling design: the entire population consists of different clusters, and the probability for each cluster to be selected into a sample is known. The sum of sample elements is then equal to $S={w_{1}}{S_{1}}+{w_{2}}{S_{2}}+\cdots +{w_{N}}{S_{N}}$. Here, ${S_{i}}$ is the sum of independent identically distributed (iid) random variables (rvs) from the i-th cluster. A similar situation arises in actuarial mathematics when the sum S models the discounted amount of the total net loss of a company, see, for example, [24]. Note that then ${S_{i}}$ may be the sum of dependent rvs. Of course, in actuarial models, ${w_{i}}$ are also typically random, which makes our research just a first step in this direction. In many papers, the limiting behavior of weighted sums is investigated with the emphasis on weights or tails of distributions, see, for example, [6, 16–18, 23, 25–30], and references therein. We, however, concentrate on the impact of $S-{w_{i}}{S_{i}}$ on ${w_{i}}{S_{i}}$. Our research is motivated by the following simple example. Let us assume that ${S_{i}}$ is in some sense close to ${Z_{i}}$, $i=1,2$. Then a natural approximation to ${w_{1}}{S_{1}}+{w_{2}}{S_{2}}$ is ${w_{1}}{Z_{1}}+{w_{2}}{Z_{2}}$. Suppose that we want to estimate the closeness of both sums in some metric $d(\cdot ,\cdot )$. The standard approach which works for the majority of metrics then gives

(1)

\[ d({w_{1}}{S_{1}}+{w_{2}}{S_{2}},{w_{1}}{Z_{1}}+{w_{2}}{Z_{2}})\leqslant d({w_{1}}{S_{1}},{w_{1}}{Z_{1}})+d({w_{2}}{S_{2}},{w_{2}}{Z_{2}}).\]

The triangle inequality (1) is not always useful. For example, let ${S_{1}}$ and ${Z_{1}}$ have the same Poisson distribution with parameter n and let ${S_{2}}$ and ${Z_{2}}$ be Bernoulli variables with probabilities 1/3 and 1/4, respectively. Then (1) ensures the trivial order of approximation $O(1)$ only. Meanwhile, both S and Z can be treated as small (albeit different) perturbations to the same Poisson variable and, therefore, one can expect closeness of their distributions at least for large n. The ‘smoothing’ effect that other sums have on the approximation of ${w_{i}}{S_{i}}$ is already observed in [7] (see also references therein). For some general results involving the concentration functions, see, for example, [10, 20].

To make our goals more explicit, we need additional notation. Let $\mathbb{Z}$ denote the set of all integers. Let $\mathcal{F}$ (resp. ${\mathcal{F}_{Z}}$, resp. $\mathcal{M}$) denote the set of probability distributions (resp. distributions concentrated on integers, resp. finite signed measures) on $\mathbb{R}$. Let ${I_{a}}$ denote the distribution concentrated at real a and set $I={I_{0}}$ . Henceforth, the products and powers of measures are understood in the convolution sense. Further, for a measure M, we set ${M}^{0}=I$ and $\exp \{M\}={\sum _{k=0}^{\infty }}{M}^{k}/k!$. We denote by $\widehat{M}(t)$ the Fourier–Stieltjes transform of M. The real part of $\widehat{M}(t)$ is denoted by $Re\widehat{M}(t)$. Observe also that $\widehat{\exp \{M(t)\}}=\exp \{\widehat{M}(t)\}$. We also use $\mathcal{L}(\xi )$ to denote the distribution of ξ.

The Kolmogorov (uniform) norm $|M{|_{K}}$ and the total variation norm $\| M\| $ of M are defined by

\[ |M{|_{K}}=\underset{x\in \mathbb{R}}{\sup }\big|M\big((-\infty ,x]\big)\big|,\hspace{2em}\| M\| ={M}^{+}\{\mathbb{R}\}+{M}^{-}\{\mathbb{R}\},\]

respectively. Here $M={M}^{+}-{M}^{-}$ is the Jordan–Hahn decomposition of M. Also, for any two measures M and V, $|M{|_{K}}\leqslant \| M\| $, $|MV{|_{K}}\leqslant \| M\| \cdot |V{|_{K}}$, $|\widehat{M}(t)|\leqslant \| M\| $, $\| \exp \{M\}\| \leqslant \exp \{\| M\| \}$. If $F\in \mathcal{F}$, then $|F{|_{K}}=\| F\| =\| \exp \{F-I\}\| =1$. Observe also that, if M is concentrated on integers, then

\[ M={\sum \limits_{k=-\infty }^{\infty }}M\{k\}\hspace{0.1667em}{I_{k}},\hspace{2em}\widehat{M}(t)={\sum \limits_{k=-\infty }^{\infty }}{\mathrm{e}}^{\mathrm{i}tk}M\{k\},\hspace{2em}\| M\| ={\sum \limits_{k=-\infty }^{\infty }}\big|M\{k\}\big|.\]

For $F\in \mathcal{F}$, $h\geqslant 0$, Lévy’s concentration function is defined by

\[ Q(F,h)=\underset{x}{\sup }F\big\{[x,x+h]\big\}.\]

All absolute positive constants are denoted by the same symbol C. Sometimes to avoid possible ambiguities, the constants C are supplied with indices. Also, the constants depending on parameter N are denoted by $C(N)$. We also assume usual conventions ${\sum _{j=a}^{b}}=0$ and ${\prod _{j=a}^{b}}=1$, if $b<a$. The notation Θ is used for any signed measure satisfying $\| \varTheta \| \leqslant 1$. The notation θ is used for any real or complex number satisfying $|\theta |\leqslant 1$.

2 Sums of independent rvs

The results of this section are partially inspired by a comprehensive analytic research of probability generating functions in [12] and the papers on mod-Poisson convergence, see [2, 13, 14], and references therein. Assumptions in the above-mentioned papers are made about the behavior of characteristic or probability generating functions. The inversion inequalities are then used to translate their differences to the differences of distributions. In principle, mod-Poisson convergence means that if an initial rv is a perturbation of some Poisson rv, then their distributions must be close. Formally, it is required for $\exp \{-{\tilde{\lambda }_{n}}({\mathrm{e}}^{\mathrm{i}t}-1)\}{f_{n}}(t)$ to have a limit for some sequence of Poisson parameters ${\tilde{\lambda }_{n}}$, as $n\to \infty $. Here, ${f_{n}}(t)$ is a characteristic function of an investigated rv. Division by a certain Poisson characteristic function is one of the crucial steps in the proof of Theorem 2.1 below, which makes it applicable to rvs satisfying the mod-Poisson convergence definition, provided they can be expressed as sums of independent rvs. Though we use factorial moments, similar to Section 7.1 in [2], our work is much more closer in spirit to [21], where general lemmas about the closeness of lattice measures are proved.

In this section, we consider a general case of independent non-identically distributed rvs, forming a triangular array (a scheme of series). Let ${S_{i}}={X_{i1}}+{X_{i2}}+\cdots +{X_{i{n_{i}}}}$, ${Z_{i}}={Z_{i1}}+{Z_{i2}}+\cdots +{Z_{i{n_{i}}}}$, $i=1,2,\dots ,N$. We assume that all the ${X_{ij}}$, ${Z_{ij}}$ are mutually independent and integer-valued. Observe that, in general, $S={\sum _{i=1}^{N}}{w_{i}}{S_{i}}$ and $Z={\sum _{i=1}^{N}}{w_{i}}{Z_{i}}$ are not integer-valued and, therefore, the standard methods of estimation of lattice rvs do not apply. Note also that, since any infinitely divisible distribution can be expressed as a sum of rvs, Poisson, compound Poisson and negative binomial rvs can be used as ${Z_{i}}$.

The distribution of ${X_{ij}}$ (resp. ${Z_{ij}}$) is denoted by ${F_{ij}}$ (resp. ${G_{ij}}$). The closeness of characteristic functions will be determined by the closeness of corresponding factorial moments. Though it is proposed in [2] to use standard factorial moments even for rvs taking negative values, we think that right-hand side and left-hand side factorial moments, already used in [21], are more natural characteristics. Let, for $k=1,2,\dots \hspace{0.1667em}$, and any $F\in {\mathcal{F}_{Z}}$,

\[\begin{aligned}{\nu _{k}^{+}}({F_{ij}})& ={\sum \limits_{m=k}^{\infty }}m(m-1)\cdots (m-k+1){F_{ij}}\{m\},\\{} {\nu _{k}^{-}}({F_{ij}})& ={\sum \limits_{m=k}^{\infty }}m(m-1)\cdots (m-k+1){F_{ij}}\{-m\}.\end{aligned}\]

For the estimation of the remainder terms we also need the following notation: ${\beta _{k}^{\pm }}({F_{ij}},{G_{ij}})={\nu _{k}^{\pm }}({F_{ij}})+{\nu _{k}^{\pm }}({G_{ij}})$, ${\sigma _{ij}^{2}}=\max (\mathrm{Var}({X_{ij}}),\mathrm{Var}({Z_{ij}}))$, and

\[\begin{aligned}{u_{ij}}& =\min \bigg\{1-\frac{1}{2}\big\| {F_{ij}}({I_{1}}-I)\big\| ;1-\frac{1}{2}\big\| {G_{ij}}({I_{1}}-I)\big\| \bigg\}\\{} & =\min \Bigg\{{\sum \limits_{k=-\infty }^{\infty }}\min \big({F_{ij}}\{k\},{F_{ij}}\{k-1\}\big);{\sum \limits_{k=-\infty }^{\infty }}\min \big({G_{ij}}\{k\},{G_{ij}}\{k-1\}\big)\Bigg\}.\end{aligned}\]

For the last equality, see (1.9) and (5.15) in [5]. Next we formulate our assumptions. For some fixed integer $s\geqslant 1$, $i=1,\dots ,N,\hspace{2.5pt}j=1,\dots ,{n_{i}}$,

(2)

\[\begin{aligned}{u_{ij}}& >0,\hspace{2em}{\sum \limits_{j=1}^{{n_{i}}}}{u_{ij}}\geqslant 1,\hspace{2em}{n_{i}}\geqslant 1,\hspace{2em}{w_{i}}>0,\end{aligned}\]

(3)

\[\begin{aligned}{\nu _{k}^{+}}({F_{ij}})& ={\nu _{k}^{+}}({G_{ij}}),\hspace{2em}{\nu _{k}^{-}}({F_{ij}})={\nu _{k}^{-}}({G_{ij}}),\hspace{1em}k=1,2,\dots ,s\end{aligned}\]

(4)

\[\begin{aligned}{\beta _{s+1}^{+}}({F_{ij}},{G_{ij}})& +{\beta _{s+1}^{-}}({F_{ij}},{G_{ij}})<\infty .\end{aligned}\]

Now we are in position to formulate the main result of this section.

Theorem 2.1.

Let assumptions (2)–(4) hold. Then

(5)

\[\begin{aligned}\hspace{-0.1667em}\hspace{-0.1667em}\hspace{-0.1667em}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& \leqslant C(N,s)\frac{\,{\max _{j}}{w_{j}}}{\,{\min _{j}}{w_{j}}}{\Bigg({\sum \limits_{i=1}^{N}}{\sum \limits_{l=1}^{{n_{i}}}}{u_{il}}\Bigg)}^{-1/2}{\prod \limits_{l=1}^{N}}\Bigg(1+{\sum \limits_{k=1}^{{n_{l}}}}{\sigma _{lk}^{2}}/{\sum \limits_{k=1}^{{n_{l}}}}{u_{lk}}\Bigg)\\{} & \hspace{1em}\times {\sum \limits_{i=1}^{N}}{\sum \limits_{j=1}^{{n_{i}}}}\big[{\beta _{s+1}^{+}}({F_{ij}},{G_{ij}})+{\beta _{s+1}^{-}}({F_{ij}},{G_{ij}})\big]{\Bigg({\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg)}^{-s/2}.\end{aligned}\]

If, in addition, s is even and ${\beta _{s+2}^{+}}({F_{ij}},{G_{ij}})+{\beta _{s+2}^{-}}({F_{ij}},{G_{ij}})<\infty $, then

(6)

\[\begin{aligned}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& \leqslant C(N,s)\frac{\,{\max _{j}}{w_{j}}}{\,{\min _{j}}{w_{j}}}{\Bigg({\sum \limits_{i=1}^{N}}{\sum \limits_{l=1}^{{n_{i}}}}{u_{il}}\Bigg)}^{-1/2}{\prod \limits_{l=1}^{N}}\Bigg(1+{\sum \limits_{k=1}^{{n_{l}}}}{\sigma _{lk}^{2}}/{\sum \limits_{k=1}^{{n_{l}}}}{u_{lk}}\Bigg)\\{} & \hspace{1em}\times {\sum \limits_{i=1}^{N}}{\sum \limits_{j=1}^{{n_{i}}}}{\Bigg({\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg)}^{-s/2}\Bigg(\big|{\beta _{s+1}^{+}}({F_{ij}},{G_{ij}})-{\beta _{s+1}^{-}}({F_{ij}},{G_{ij}})\big|\\{} & \hspace{1em}+\big[{\beta _{s+2}^{+}}({F_{ij}},{G_{ij}})+{\beta _{s+2}^{-}}({F_{ij}},{G_{ij}})\\{} & \hspace{1em}+{\beta _{s+1}^{-}}({F_{ij}},{G_{ij}})\big]{\Bigg({\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg)}^{-1/2}\Bigg).\end{aligned}\]

The factor ${({\sum _{i=1}^{n}}{\sum _{j=1}^{{n_{i}}}}{u_{ij}})}^{-1/2}$ estimates the impact of S on approximation of ${w_{i}}{S_{i}}$. The estimate (6) takes care of a possible symmetry of distributions.

If, in each sum ${S_{i}}$ and ${Z_{i}}$, all the rvs are identically distributed, then we can get rid of the factor containing variances. We say that condition (ID) is satisfied if, for each $i=1,2,\dots ,N$, all rvs ${X_{ij}}$ and ${Z_{ij}}$ ($j=1,\dots ,{n_{i}}$) are iid with distributions ${F_{i}}$ and ${G_{i}}$, respectively. Observe, that if condition (ID) is satisfied, then the characteristic functions of S and Z are respectively equal to

\[ {\prod \limits_{i=1}^{N}}{\widehat{F}_{i}^{{n_{i}}}}({w_{i}}t),\hspace{2em}{\prod \limits_{i=1}^{N}}{\widehat{G}_{i}^{{n_{i}}}}({w_{i}}t).\]

We also use notation ${u_{i}}$ instead of ${u_{ij}}$, since now ${u_{i1}}={u_{i2}}=\cdots ={u_{i{n_{i}}}}$.

Theorem 2.2.

Let the assumptions (2)–(4) and the condition (ID) hold. Then

(7)

\[\begin{aligned}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& \leqslant C(N,s)\frac{\,{\max _{j}}{w_{j}}}{\,{\min _{j}}{w_{j}}}{\Bigg({\sum \limits_{i=1}^{N}}{n_{i}}{u_{i}}\Bigg)}^{-1/2}\\{} & \hspace{1em}\times {\sum \limits_{i=1}^{N}}\frac{{\beta _{s+1}^{+}}({F_{i}},{G_{i}})+{\beta _{s+1}^{-}}({F_{i}},{G_{i}})}{{n_{i}^{s/2-1}}{u_{i}^{s/2}}}.\end{aligned}\]

How does Theorem 2.1 compare to the known results? In [4], compound Poisson-type approximations to non-negative iid rvs in each sum were considered under the additional Franken-type condition:

(8)

\[ {\nu _{1}^{+}}({F_{j}})-{\big({\nu _{1}^{+}}({F_{j}})\big)}^{2}-{\nu _{2}^{+}}({F_{j}})>0,\]

see [8]. Similar assumptions were used in [7, 21]. Observe that Franken’s condition requires almost all probabilistic mass to be concentrated at 0 and 1. Indeed, then ${\nu _{1}^{+}}({F_{j}})<1$ and ${F_{j}}\{1\}\geqslant {\sum _{k=3}^{\infty }}k(k-2){F_{j}}\{k\}$. Meanwhile, Theorems 2.1 and 2.2 hold under much milder assumptions and, as demonstrated in the example below, can be useful even if (8) is not satisfied. Therefore, even for the case of one sum when $N=1$, our results are new.

Example. Let $N=2$, ${w_{1}}=1$, ${w_{2}}=\sqrt{2}$, and ${F_{j}}$ and ${G_{j}}$ be defined by ${F_{j}}\{0\}=0.375$, ${F_{j}}\{1\}=0.5$, ${F_{j}}\{4\}=0.125$, ${G_{j}}\{0\}=0.45$, ${G_{j}}\{1\}=0.25$, ${G_{j}}\{2\}=0.25$, ${G_{j}}\{5\}=0.05$, $(j=1,2)$. We assume that ${n_{2}}=n$ and ${n_{1}}=⌈\sqrt{n}\hspace{0.1667em}⌉$ is the smallest integer greater or equal to $\sqrt{n}$. Then ${\nu _{k}^{+}}({F_{j}})={\nu _{k}^{+}}({G_{j}})$, $k=1,2,3$, ${\beta _{4}^{+}}({F_{j}},{G_{j}})=9$, ${u_{j}}=3/8$, $(j=1,2)$. Therefore, by Theorem 2.2

\[ {\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}\leqslant \frac{C}{\sqrt{{n_{1}}+{n_{2}}}}\bigg(\frac{1}{{n_{1}}}+\frac{1}{{n_{2}}}\bigg)=O\big({n}^{-1}\big).\]

In this case, Franken’s condition (8) is not satisfied, since ${\nu _{1}^{+}}({F_{j}})-{\nu _{2}^{+}}({F_{j}})-{({\nu _{1}^{+}}({F_{j}}))}^{2}<0$.

Next we apply Theorem 2.2 to the negative binomial distribution. For real $r>0$ and $0<\tilde{p}<1$, let $\xi \sim \mathrm{NB}(r,\tilde{p})$ denote the distribution with

\[ \mathrm{P}(\xi =k)=\left(\genfrac{}{}{0pt}{}{r+k-1}{k}\right){\tilde{p}}^{r}{\tilde{q}}^{k},\hspace{1em}k=0,1,\dots .\]

Here $\tilde{q}=1-\tilde{p}$. Note that r is not necessarily an integer.

Let ${X_{1j}}$ be concentrated on non-negative integers (${\nu _{k}^{-}}({F_{j}})=0$). We approximate ${S_{i}}$ by ${Z_{i}}\sim \mathrm{NB}({r_{i}},{p_{i}})$ with

\[ {r_{i}}=\frac{{(\mathrm{E}{S_{i}})}^{2}}{\mathrm{Var}{S_{i}}-\mathrm{E}{S_{i}}},\hspace{2em}{\tilde{p}_{i}}=\frac{\mathrm{E}{S_{i}}}{\mathrm{Var}{S_{i}}},\]

so that $\mathrm{E}{S_{i}}={r_{i}}{\tilde{q}_{i}}/{\tilde{p}_{i}}$ and $\mathrm{Var}{S_{i}}={r_{i}}{\tilde{q}_{i}}/{\tilde{p}_{i}^{2}}$. Observe that

(9)

\[ {\widehat{G}_{j}}(t)={\bigg(\frac{{\tilde{p}_{j}}}{1-{\tilde{q}_{j}}{\mathrm{e}}^{\mathrm{i}t}}\bigg)}^{{r_{j}}/{n_{j}}}.\]

Corollary 2.1.

Let assumptions of Theorem 2.2 hold with ${X_{1j}}$ concentrated on non-negative integers and let $\mathrm{E}{X_{1j}^{3}}<\infty $, $(j=1,\dots ,N)$. Let ${G_{j}}$ be defined by (9). Then

(10)

\[\begin{aligned}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& \leqslant C\frac{\,{\max _{j}}{w_{j}}}{\,{\min _{j}}{w_{j}}}{\Bigg({\sum \limits_{i=1}^{N}}{n_{i}}{\tilde{u}_{i}}\Bigg)}^{-1/2}\\{} & \hspace{1em}\times {\sum \limits_{k=1}^{N}}\bigg[{\nu _{3}^{+}}({F_{k}})+{\nu _{1}^{+}}({F_{k}}){\nu _{2}^{+}}({F_{k}})+{\big({\nu _{1}^{+}}({F_{k}})\big)}^{3}\\{} & \hspace{1em}+\frac{{({\nu _{2}^{+}}({F_{k}})-{({\nu _{1}^{+}}({F_{k}}))}^{2})}^{2}}{{\nu _{1}^{+}}({F_{k}})}\bigg]{\tilde{u}_{k}^{-1}}.\end{aligned}\]

Here

\[ {\tilde{u}_{k}}=1-\frac{1}{2}\max \bigg(\big\| ({I_{1}}-I){F_{k}}\big\| ,{\bigg({r_{k}}\ln \frac{1}{{\tilde{p}_{k}}}\bigg)}^{-1/2}\bigg).\]

Remark 2.1.

(i) Note that

\[ {r_{k}}\ln \frac{1}{{\tilde{p}_{k}}}=\frac{{({\nu _{1}^{+}}({F_{k}}))}^{2}}{{\nu _{2}^{+}}({F_{k}})-{({\nu _{1}^{+}}({F_{k}}))}^{2}}\ln \frac{{\nu _{2}^{+}}({F_{k}})-{({\nu _{1}^{+}}({F_{k}}))}^{2}+{\nu _{1}^{+}}({F_{k}})}{{\nu _{1}^{+}}({F_{k}})}.\]

(ii) Let ${\nu _{k}^{+}}({F_{j}})\asymp C,{w_{j}}\asymp C$. Then the accuracy of approximation in (10) is of the order $O({({n_{1}}+\cdots +{n_{N}})}^{-1/2})$.

3 Sums of Markov Binomial rvs

We already mentioned that it is not always natural to assume independence of rvs. In this section, we still assume that $S={w_{1}}{S_{1}}+{w_{2}}{S_{2}}+\cdots +{w_{N}}{S_{N}}$ with mutually independent ${S_{i}}$. On the other hand, we assume that each ${S_{i}}$ has a Markov Binomial (MB) distribution, that is, ${S_{i}}$ is a sum of Markov dependent Bernoulli variables. Such a sum S has a slightly more realistic interpretation in actuarial mathematics. Assume, for example, that we have N insurance policy holders, i-th of whom can get ill during an insurance period and be paid a claim ${w_{i}}$. The health of the policy holder depends on the state of her/his health in the previous period. Therefore, we have a natural two state (healthy, ill) Markov chain. Then ${S_{i}}$ is an aggregate claim for ith insurance policy holder after ${n_{i}}$ periods, meanwhile S is an aggregate claim of all holders. Limit behavior of the MB distribution is a popular topic among mathematicians, discussed in numerous papers, see, for example, [3, 9, 11], and references therein.

Let $0,{\xi _{i1}},\dots ,{\xi _{i{n_{i}}}},\dots \hspace{0.1667em}$ , ($i=1,2,\dots ,N$) be a Markov chain with the transition probabilities

\[\begin{aligned}& \mathrm{P}({\xi _{ik}}=1\hspace{0.1667em}|\hspace{0.1667em}{\xi _{i,k-1}}=1)={p_{i}},\hspace{2em}\mathrm{P}({\xi _{ik}}=0\hspace{0.1667em}|\hspace{0.1667em}{\xi _{i,k-1}}=1)={q_{i}},\\{} & \mathrm{P}({\xi _{i,k}}=1\hspace{0.1667em}|\hspace{0.1667em}{\xi _{i,k-1}}=0)={\overline{q}_{i}},\hspace{2em}\mathrm{P}({\xi _{ik}}=0\hspace{0.1667em}|\hspace{0.1667em}{\xi _{i,k-1}}=0)={\overline{p}_{i}},\\{} & {p_{i}}+{q_{i}}={\overline{q}_{i}}+{\overline{p}_{i}}=1,\hspace{1em}\hspace{2em}{p_{i}},{\overline{q}_{i}}\in (0,1),\hspace{1em}k\in \mathbb{N}.\end{aligned}\]

The distribution of ${S_{i}}={\xi _{i1}}+\cdots +{\xi _{i{n_{i}}}}$ $({n_{i}}\in \mathbb{N})$ is called the Markov binomial distribution with parameters ${p_{i}},{q_{i}},{\overline{p}_{i}},{\overline{q}_{i}},{n_{i}}$. The definition of a MB rv slightly differs from paper to paper. We use the one from [3]. Note that the Markov chain, considered above, is not necessarily stationary. Furthermore, the distribution of ${w_{i}}{S_{i}}$ is denoted by ${H_{in}}=\mathcal{L}({w_{i}}{S_{i}})$. For approximation of ${H_{in}}$ we use the signed compound Poisson (CP) measure with matching mean and variance. Such signed CP approximations usually outperform both the normal and CP approximations, see, for example, [1, 3, 20]. Let

\[ {\gamma _{i}}=\frac{{q_{i}}{\overline{q}_{i}}}{{q_{i}}+{\overline{q}_{i}}},\hspace{2em}{\widehat{Y}_{i}}(t)=\frac{{q_{i}}{\mathrm{e}}^{\mathrm{i}{w_{i}}t}}{1-{p_{i}}{\mathrm{e}}^{\mathrm{i}{w_{i}}t}}-1.\]

Observe that ${\widehat{Y}_{i}}(t)+1$ is the characteristic function of the geometric distribution. Let ${Y_{i}}$ be a measure corresponding to ${\widehat{Y}_{i}}(t)$. For approximation of ${H_{in}}$ we use the signed CP measure ${D_{in}}$

(11)

\[\begin{aligned}{D_{in}}& =\exp \bigg\{\bigg(\frac{{\gamma _{i}}({\overline{q}_{i}}-{p_{i}})}{{q_{i}}+{\overline{q}_{i}}}+{n_{i}}{\gamma _{i}}\bigg){Y_{i}}\\{} & \hspace{1em}-{n_{i}}\bigg(\frac{{q_{i}}{\overline{q}_{i}^{2}}}{{({q_{i}}+{\overline{q}_{i}})}^{2}}\bigg({p_{i}}+\frac{{q_{i}}}{{q_{i}}+{\overline{q}_{i}}}\bigg)+\frac{{\gamma _{i}^{2}}}{2}\bigg){Y_{i}^{2}}\bigg\}.\end{aligned}\]

The CP limit occurs when $n{\overline{q}_{i}}\to \tilde{\lambda }$, see, for example, [3]. Therefore, we assume ${\overline{q}_{i}}$ to be small, though not necessarily vanishing. Let, for some fixed integer ${k_{0}}\geqslant 2$,

(12)

\[ {\overline{q}_{i}}\hspace{0.1667em}\geqslant \hspace{0.1667em}\frac{1}{{n}^{{k_{0}}}},\hspace{2em}0\hspace{0.1667em}<\hspace{0.1667em}{p_{i}}\hspace{0.1667em}\leqslant \hspace{0.1667em}\frac{1}{2},\hspace{2em}{\overline{q}_{i}}\hspace{0.1667em}\leqslant \hspace{0.1667em}\frac{1}{30},\hspace{2em}{w_{i}}\hspace{0.1667em}>\hspace{0.1667em}0,\hspace{2em}{n_{i}}\hspace{0.1667em}\geqslant \hspace{0.1667em}1,\hspace{1em}i\hspace{0.1667em}=\hspace{0.1667em}1,\dots ,N.\]

In principle, the first assumption in (12) can be dropped, but then exponentially vanishing remainder terms appear in all results, making them very complicated.

Theorem 3.1.

Let ${H_{in}}=\mathcal{L}({w_{i}}{S_{i}})$ and let ${D_{in}}$ be defined by (11), $i=1,\dots ,N$. Let the conditions stated in (12) be satisfied. Then

(13)

\[ {\Bigg|{\prod \limits_{i=1}^{N}}{H_{in}}-{\prod \limits_{i=1}^{N}}{D_{in}}\Bigg|}_{K}\leqslant C(N,{k_{0}})\frac{\max {w_{i}}}{\min {w_{i}}}\cdot \frac{{\textstyle\sum _{i=1}^{N}}{\overline{q}_{i}}({p_{i}}+{\overline{q}_{i}})}{\sqrt{{\textstyle\sum _{k=1}^{N}}\max ({n_{k}}{\overline{q}_{k}},1)}}.\]

Remark 3.1.

Let all ${\overline{q}_{i}}\geqslant C$, $i=1,\dots ,N$. Then, obviously, the right-hand side of (13) is majorized by

\[ C(N,{k_{0}})\frac{\max {w_{i}}}{\min {w_{i}}}\cdot \frac{1}{\sqrt{\max {n_{k}}}}.\]

Therefore, even in this case, the result is comparable with the Berry–Esseen theorem.

4 Auxiliary results

Lemma 4.1.

Let $h>0$, $W\in \mathcal{M}$, $W\{\mathbb{R}\}=0$, $U\in \mathcal{F}$ and $|\widehat{U}(t)|\leqslant C\widehat{V}(t)$, for $|t|\leqslant 1/h$ and some symmetric distribution V having non-negative characteristic function. Then

\[\begin{aligned}|WU{|_{K}}& \leqslant C{\int _{|t|\leqslant 1/h}}\bigg|\frac{\widehat{W}(t)\widehat{U}(t)}{t}\bigg|\hspace{0.1667em}\mathrm{d}t+C\| W\| Q(U,h)\\{} & \leqslant C\bigg(\underset{|t|\leqslant 1/h}{\sup }\frac{|\widehat{W}(t)|}{|t|}\cdot \frac{1}{h}+\| W\| \bigg)Q(V,h).\end{aligned}\]

Lemma 4.1 is a version of Le Cam’s smoothing inequality, see Lemma 9.3 in [5] and Lemma 3 on p. 402 in [15].

Lemma 4.2.

Let $F\in \mathcal{F}$, $h>0$ and $a>0$. Then

(14)

\[\begin{aligned}Q(F,h)& \leqslant {\bigg(\frac{96}{95}\bigg)}^{2}h{\int _{|t|\leqslant 1/h}}\big|\widehat{F}(t)\big|\hspace{0.1667em}\mathrm{d}t,\end{aligned}\]

(15)

\[\begin{aligned}Q(F,h)& \leqslant \bigg(1+\bigg(\frac{h}{a}\bigg)\bigg)Q(F,a),\end{aligned}\]

(16)

\[\begin{aligned}Q\big(\exp \big\{a(F-I)\big\},h\big)& \leqslant \frac{C}{\sqrt{aF\{|x|>h\}}}.\end{aligned}\]

If, in addition, $\widehat{F}(t)\geqslant 0$, then

(17)

\[ h{\int _{|t|\leqslant 1/h}}\big|\widehat{F}(t)\big|\hspace{0.1667em}\mathrm{d}t\leqslant CQ(F,h).\]

Lemma 4.2 contains well-known properties of Levy’s concentration function, see, for example, Chapter 1 in [19] or Section 1.5 in [5].

Expansion in left-hand and right-hand factorial moments for Fourier–Stieltjes transforms is given in [21]. Here we need its analogue for distributions.

Lemma 4.3.

Let $F\in {\mathcal{F}_{Z}}$ and, for some $s\geqslant 1$, ${\nu _{s+1}^{+}}(F)+{\nu _{s+1}^{-}}(F)<\infty $. Then

(18)

\[\begin{aligned}F& =I+{\sum \limits_{m=1}^{s}}\frac{{\nu _{m}^{+}}(F)}{m!}{({I_{1}}-I)}^{m}+{\sum \limits_{m=1}^{s}}\frac{{\nu _{m}^{-}}(F)}{m!}{({I_{-1}}-I)}^{m}\\{} & \hspace{1em}+\frac{{\nu _{s+1}^{+}}(F)+{\nu _{s+1}^{-}}(F)}{(s+1)!}{({I_{1}}-I)}^{s+1}\varTheta .\end{aligned}\]

Proof.

For measures, concentrated on non-negative integers, (18) is given in [5], Lemma 2.1. Observe that distribution F can be expressed as a mixture $F={p}^{+}{F}^{+}+{p}^{-}{F}^{-}$ of distributions ${F}^{+}$, ${F}^{-}$ concentrated on non-negative and negative integers, respectively. Then Lemma 2.1 from [5] can be applied in turn to ${F}^{+}$ and to ${F}^{-}$ (with ${I_{-1}}$). The remainder terms can be combined, since $({I_{-1}}-I)={I_{-1}}(I-{I_{1}})=({I_{1}}-I)\varTheta $. □

Lemma 4.4.

Let $F,G\in {\mathcal{F}_{Z}}$ and, for some $s\geqslant 1$, ${\nu _{j}^{+}}(F)={\nu _{j}^{+}}(G)$, ${\nu _{j}^{-}}(F)={\nu _{j}^{-}}(G)$, $(j=1,2,\dots ,s)$. If ${\beta _{s+1}^{+}}(F,G)+{\beta _{s+1}}(F,G)<\infty $, then

\[ F-G=\frac{{\beta _{s+1}^{+}}(F,G)+{\beta _{s+1}^{-}}(F,G)}{(s+1)!}{({I_{1}}-I)}^{s+1}\varTheta .\]

If, in addition, ${\beta _{s+2}^{+}}(F,G)+{\beta _{s+2}}(F,G)<\infty $ and s is even, then

\[\begin{aligned}F-G& =\frac{{\beta _{s+1}^{+}}(F,G)-{\beta _{s+1}^{-}}(F,G)}{(s+1)!}{({I_{1}}-I)}^{s+1}\\{} & \hspace{1em}+\big[{\beta _{s+2}^{+}}(F,G)+{\beta _{s+2}^{-}}(F,G)+{\beta _{s+1}^{-}}(F,G)\big]{({I_{1}}-I)}^{s+2}\varTheta C(s).\end{aligned}\]

Proof.

Observe that

\[\begin{aligned}{({I_{1}}-I)}^{s+1}+{({I_{-1}}-I)}^{s+1}& ={({I_{1}}-I)}^{s+1}-{({I_{-1}})}^{s+1}{({I_{1}}-I)}^{s+1}\\{} & ={({I_{1}}-I)}^{s+1}{I_{-1}}({I_{1}}-I){\sum \limits_{j=1}^{s+1}}{({I_{-1}})}^{s+1-j}\\{} & ={({I_{1}}-I)}^{s+2}\varTheta (s+1).\end{aligned}\]

The lemma now follows from (18). □

Lemma 4.5.

Let $F\in {\mathcal{F}_{Z}}$ with mean $\mu (F)$ and variance ${\sigma }^{2}(F)$, both finite. Then, for all $|t|\leqslant \pi $,

(19)

\[\begin{aligned}\big|\widehat{F}(t)\big|& \leqslant 1-\frac{(1-\| ({I_{1}}-I)F\| /2){t}^{2}}{4\pi }\\{} & \leqslant \exp \bigg\{-\frac{(1-\| ({I_{1}}-I)F\| /2)}{\pi }{\sin }^{2}\frac{t}{2}\bigg\},\end{aligned}\]

(20)

\[\begin{aligned}\big|{\big(\widehat{F}(t){\mathrm{e}}^{-\mathrm{i}t\mu (F)}\big)}^{\prime }\big|& \leqslant {\pi }^{2}{\sigma }^{2}(F)\big|\sin (t/2)\big|.\end{aligned}\]

The first estimate in (19) is given in [2] p. 884, the second estimate in (19) is trivial. For the proof of (20), see p. 81 in [5].

Lemma 4.6.

Let $M\in \mathcal{M}$ be concentrated on $\mathbb{Z}$, $\,{\sum _{k\in \mathbb{Z}}}\,|k||M\{k\}|<\infty $. Then, for any $a\in \mathbb{R}$, $b>0$ the following inequality holds

\[ \| M\| \leqslant {(1+b\pi )}^{1/2}{\Bigg(\frac{1}{2\pi }{\int _{-\pi }^{\pi }}\bigg({\big|\widehat{M}(t)\big|}^{2}+\frac{1}{{b}^{2}}{\big|{\big({\mathrm{e}}^{-\mathrm{i}ta}\widehat{M}(t)\big)}^{\prime }\big|}^{2}\bigg)\hspace{0.1667em}\mathrm{d}t\Bigg)}^{1/2}.\]

Lemma 4.6 is a well-known inversion inequality for lattice distributions. Its proof can be found, for example, in [5], Lemma 5.1.

Lemma 4.7.

Let ${H_{in}}=\mathcal{L}({w_{i}}{S_{i}})$ and let ${D_{in}}$ be defined by (11), $i=1,\dots ,N$. Let conditions (12) hold. Then, for $i=1,2,\dots ,N$,

\[\begin{aligned}{H_{in}}-{D_{in}}& ={\overline{q}_{i}}({p_{i}}+{\overline{q}_{i}}){Y_{i}}\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/60\}\varTheta C+({p_{i}}+{\overline{q}_{i}})({I_{{w_{i}}}}-I)\varTheta C{\mathrm{e}}^{-{C_{i}}{n_{i}}},\\{} {H_{in}}& =\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/30\}\varTheta C+({p_{i}}+{\overline{q}_{i}})({I_{{w_{i}}}}-I)\varTheta C{\mathrm{e}}^{-{C_{i}}{n_{i}}},\\{} {D_{in}}& =\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/30\}\varTheta C,\hspace{2em}{\mathrm{e}}^{-{C_{i}}{n_{i}}}\leqslant \frac{C({k_{0}}){\overline{q}_{i}}}{\sqrt{\max ({n_{i}}{\overline{q}_{i}},1)}},\\{} \big|{\widehat{Y}_{i}}(t)\big|& \leqslant 4\big|\sin (t{w_{i}}/2)\big|,\hspace{2em}Re{\widehat{Y}_{i}}(t)\geqslant -\frac{4}{3}{\sin }^{2}(t{w_{i}}/2),\hspace{2em}\frac{{\overline{q}_{i}}}{2}\leqslant {\gamma _{i}}\leqslant {\overline{q}_{i}}.\end{aligned}\]

Proof.

The statements follow from Lemma 5.4, Lemma 5.1 and the relations given on pp. 1131–1132 in [3]. The estimate for ${\mathrm{e}}^{-{C_{i}}{n_{i}}}$ follows from the first assumption in (12) and the following simple estimate

\[\begin{aligned}{\mathrm{e}}^{-{C_{i}}{n_{i}}}& \leqslant {\mathrm{e}}^{-{C_{i}}{n_{i}}/2}{\mathrm{e}}^{-{C_{i}}{n_{i}}{\overline{q}_{i}}/2}\leqslant \frac{C({k_{0}})}{{n_{i}^{{k_{0}}}}}\frac{2}{1+{C_{i}}{n_{i}}{\overline{q}_{1}}}\\{} & \leqslant \frac{C({k_{0}}){\overline{q}_{i}}}{\min (1,{C_{i}})(1+{n_{i}}{\overline{q}_{i}})}\leqslant \frac{C({k_{0}}){\overline{q}_{i}}}{\min (1,{C_{i}})\max ({n_{i}}{\overline{q}_{i}},1)}.\end{aligned}\]

□

5 Proofs for sums of independent rvs

Proof of Theorem 2.1.

Let ${F_{ij,w}}$ (resp. ${G_{ij,w}}$) denote the distribution of ${w_{i}}{X_{ij}}$ (resp. ${w_{i}}{Z_{ij}}$). Note that ${\widehat{F}_{ij,w}}(t)={\widehat{F}_{ij}}({w_{i}}t)$. By the triangle inequality

\[\begin{aligned}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& ={\Bigg|{\prod \limits_{i=1}^{N}}\mathcal{L}({w_{i}}{S_{i}})-{\prod \limits_{i=1}^{N}}\mathcal{L}({w_{i}}{Z_{i}})\Bigg|}_{K}\\{} & \leqslant {\sum \limits_{i=1}^{N}}{\Bigg|\big(\mathcal{L}({w_{i}}{S_{i}})-\mathcal{L}({w_{i}}{Z_{i}})\big){\prod \limits_{l=1}^{i-1}}\mathcal{L}({w_{l}}{S_{l}}){\prod \limits_{l=i+1}^{N}}\mathcal{L}({w_{l}}{Z_{l}})\Bigg|}_{K}.\end{aligned}\]

Similarly,

\[\begin{aligned}\mathcal{L}({w_{i}}{S_{i}})-\mathcal{L}({w_{i}}{Z_{i}})& ={\prod \limits_{j=1}^{{n_{i}}}}{F_{ij,w}}-{\prod \limits_{j=1}^{{n_{i}}}}{G_{ij,w}}\\{} & ={\sum \limits_{j=1}^{{n_{i}}}}({F_{ij,w}}-{G_{ij,w}}){\prod \limits_{k=1}^{j-1}}{F_{ik,w}}{\prod \limits_{k=j+1}^{{n_{i}}}}{G_{ik,w}}.\end{aligned}\]

For the sake of brevity, let

\[\begin{aligned}{E_{ij}}& :={\prod \limits_{k=1}^{j-1}}{F_{ik,w}}{\prod \limits_{k=j+1}^{{n_{i}}}}{G_{ik,w}},\\{} {T_{i}}& :={\prod \limits_{l=1}^{i-1}}\mathcal{L}({w_{l}}{S_{l}}){\prod \limits_{l=i+1}^{N}}\mathcal{L}({w_{l}}{Z_{l}})={\prod \limits_{l=1}^{i-1}}{\prod \limits_{m=1}^{{n_{l}}}}{F_{lm,w}}{\prod \limits_{l=i+1}^{N}}{\prod \limits_{m=1}^{{n_{l}}}}{G_{lm,w}}.\end{aligned}\]

Then, combining both equations given above with Lemma 4.4 , we get

(21)

\[\begin{aligned}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& \leqslant C(s){\sum \limits_{i=1}^{N}}{\sum \limits_{j=1}^{{n_{i}}}}\big[{\beta _{s+1}^{+}}({F_{ij}},{G_{ij}})\\{} & \hspace{1em}+{\beta _{s+1}^{-}}({F_{ij}},{G_{ij}})\big]{\big|{({I_{{w_{i}}}}-I)}^{s+1}{E_{ij}}{T_{i}}\big|}_{K}.\end{aligned}\]

Let $|t|\leqslant \pi /\,{\max _{i}}{w_{i}}$. Then it follows from (19) that

(22)

\[ \big|{\widehat{E}_{ij}}(t){\widehat{T}_{i}}(t)\big|\leqslant {\mathrm{e}}^{{u_{ij}}{\sin }^{2}(t{w_{i}}/2)/\pi }\exp \Bigg\{-\frac{1}{\pi }{\sum \limits_{l=1}^{N}}{\sum \limits_{m=1}^{{n_{l}}}}{u_{lm}}{\sin }^{2}\frac{t{w_{l}}}{2}\Bigg\}.\]

Observe that ${\mathrm{e}}^{{u_{ij}}{\sin }^{2}(t{w_{i}}/2)/\pi }\leqslant {\mathrm{e}}^{1/\pi }=C$. Next, let

(23)

\[ L:=\frac{1}{8\pi }{\sum \limits_{l=1}^{N}}{\sum \limits_{m=1}^{{n_{l}}}}{u_{lm}}\big[({I_{{w_{l}}}}-I)+({I_{-{w_{l}}}}-I)\big].\]

It is not difficult to check, that $\exp \{L\}$ is a CP distribution with non-negative characteristic function. Also, by the definition of exponential measure, $\exp \{-L\}$, which can be called the inverse to $\exp \{L\}$, is a signed measure with finite variation. We have

(24)

\[ |{({I_{{w_{i}}}}-I)}^{s+1}{E_{ij}}{T_{i}}{|_{K}}=|{({I_{{w_{i}}}}-I)}^{s+1}{E_{ij}}{T_{i}}\exp \{-L\}\exp \{L\}{|_{K}}.\]

Next step is similar to the definition of mod-Poisson convergence. We apply Lemma 4.1 with $h=\max {w_{i}}/\pi $ and ${U_{1}}=\exp \{L\}$ and ${W_{1}}={({I_{{w_{i}}}}-I)}^{s+1}{E_{ij}}{T_{i}}\exp \{-L\}$. By Lemma 4.2,

(25)

\[\begin{aligned}Q\big(\exp \{L\},h\big)& \leqslant C\frac{\max {w_{i}}}{\min {w_{i}}}\cdot Q\big(\exp \{L\},\min {w_{i}}/2\big)\\{} & \leqslant C\frac{\max {w_{i}}}{\min {w_{i}}}{\Bigg({\sum \limits_{l=1}^{N}}{\sum \limits_{m=1}^{{n_{l}}}}{u_{lm}}\Bigg)}^{-1/2}.\end{aligned}\]

From (22) and (23), it follows that

(26)

\[\begin{aligned}\bigg|\frac{{\widehat{W}_{1}}(t)}{t}\bigg|\cdot \frac{1}{h}& \leqslant C(s)\frac{|\sin (t{w_{i}}/2){|}^{s+1}}{h|t|}\exp \Bigg\{-\frac{1}{2\pi }{\sum \limits_{l=1}^{N}}{\sum \limits_{m=1}^{{n_{l}}}}{u_{lm}}{\sin }^{2}\frac{t{w_{l}}}{2}\Bigg\}\\{} & \leqslant C(s)\frac{{w_{i}}}{h}{\big|\sin (t{w_{i}}/2)\big|}^{s}\exp \Bigg\{-\frac{1}{2\pi }{\sum \limits_{m=1}^{{n_{i}}}}{u_{im}}{\sin }^{2}(t{w_{i}}/2)\Bigg\}\\{} & \leqslant C(s){\Bigg({\sum \limits_{m=1}^{{n_{i}}}}{u_{im}}\Bigg)}^{-s/2}.\end{aligned}\]

It remains to estimate $\| {W_{1}}\| $. Let

\[\begin{aligned}{\varPhi _{lm,w}}& :={F_{lm,w}}\exp \bigg\{\frac{1}{8\pi }{u_{lm}}\big[({I_{{w_{l}}}}-I)+({I_{-{w_{l}}}}-I)\big]\bigg\},\\{} {\varPsi _{lm,w}}& :={G_{lm,w}}\exp \bigg\{\frac{1}{8\pi }{u_{lm}}\big[({I_{{w_{l}}}}-I)+({I_{-{w_{l}}}}-I)\big]\bigg\}\end{aligned}\]

Then by the properties of the total variation norm,

(27)

\[\begin{aligned}\| {W_{1}}\| & \leqslant \bigg\| \exp \bigg\{\frac{1}{8}{u_{ij}}\big[({I_{{w_{i}}}}-I)+({I_{-{w_{i}}}}-I)\big]\bigg\}\bigg\| \\{} & \hspace{1em}\times \Bigg\| {({I_{{w_{i}}}}-I)}^{s+1}{\prod \limits_{k=1}^{j-1}}{\varPhi _{ik,w}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\varPsi _{ik,w}}\Bigg\| \\{} & \hspace{1em}\times {\prod \limits_{l=1}^{i-1}}\Bigg\| {\prod \limits_{m=1}^{{n_{l}}}}{\varPhi _{lm,w}}\Bigg\| {\prod \limits_{l=i+1}^{N}}\Bigg\| {\prod \limits_{m=1}^{{n_{l}}}}{\varPsi _{lm,w}}\Bigg\| .\end{aligned}\]

The first norm in (27) is bounded by $\exp \{\frac{1}{8}{u_{ij}}[\| {I_{{w_{i}}}}-I\| +\| {I_{-{w_{i}}}}-I\| ]\}\leqslant \exp \{1/2\}$. The total variation norm is invariant with respect to scale. Therefore, without loss of generality, we can switch to ${w_{l}}=1$. In this case, we use the notations ${\varPhi _{ik}},{\varPsi _{ik}}$. Then, again employing the inverse CP measures, we get

\[\begin{aligned}& \Bigg\| {({I_{{w_{i}}}}-I)}^{s+1}{\prod \limits_{k=1}^{j-1}}{\varPhi _{ik,w}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\varPsi _{ik,w}}\Bigg\| \\{} & \hspace{1em}=\Bigg\| {({I_{1}}-I)}^{s+1}{\prod \limits_{k=1}^{j-1}}{\varPhi _{ik}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\varPsi _{ik}}\Bigg\| \\{} & \hspace{1em}=\Bigg\| {({I_{1}}-I)}^{s+1}{\prod \limits_{k=1}^{j-1}}{\varPhi _{ik}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\varPsi _{ik}}\exp \big\{{u_{ij}}({I_{1}}-I)\big\}\exp \big\{{u_{ij}}(I-{I_{1}})\big\}\Bigg\| \\{} & \hspace{1em}\leqslant {\mathrm{e}}^{2}\Bigg\| {({I_{1}}-I)}^{s+1}\exp \big\{{u_{ij}}({I_{1}}-I)\big\}{\prod \limits_{k=1}^{j-1}}{\varPhi _{ik}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\varPsi _{ik}}\Bigg\| .\end{aligned}\]

We apply Lemma 4.6 with $a={u_{ij}}+{\sum _{k\ne i}^{{n_{i}}}}{\mu _{ik}}$, $b=1$, where ${\mu _{ik}}={\nu _{1}^{+}}({F_{ik}})+{\nu _{1}^{-}}({F_{ik}})$ is the mean of ${F_{ik}}$ and, due to assumption (3), of ${G_{ik}}$. Let

\[ \widehat{\Delta }(t):={\big({\mathrm{e}}^{\mathrm{i}t}-1\big)}^{s+1}\exp \big\{{u_{ij}}\big({\mathrm{e}}^{\mathrm{i}t}-1-it\big)\big\}{\prod \limits_{k=1}^{j-1}}{\widehat{\varPhi }_{ik}}(t){\mathrm{e}}^{-\mathrm{i}t{\mu _{ik}}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\widehat{\varPsi }_{ik}}{\mathrm{e}}^{-\mathrm{i}t{\mu _{ik}}}.\]

It follows from (19) that

\[\begin{aligned}\big|\Delta (t)\big|& \leqslant C(s){\big|\sin (t/2)\big|}^{s+1}\exp \Bigg\{-\frac{1}{2\pi }{\sum \limits_{m=1}^{{n_{i}}}}{u_{im}}{\sin }^{2}(t/2)\Bigg\}\\{} & \leqslant C(s){\Bigg({\sum \limits_{m=1}^{{n_{i}}}}{u_{im}}\Bigg)}^{-s/2}.\end{aligned}\]

For the estimation of $|{\Delta ^{\prime }}(t)|$, observe that by (19) and (20)

\[\begin{aligned}\big|{\big({\widehat{\varPhi }_{ik}}(t){\mathrm{e}}^{-\mathrm{i}t{\mu _{ik}}}\big)}^{\prime }\big|& \leqslant \bigg|{\widehat{F}_{ik}}(t){\mathrm{e}}^{-\mathrm{i}t{\mu _{ik}}}\frac{{u_{ik}}}{\pi }\sin (t/2){\mathrm{e}}^{({u_{ik}}/2\pi ){\sin }^{2}(t/2)}\bigg|\\{} & \hspace{1em}+\big|{\big({\widehat{F}_{ik}}(t){\mathrm{e}}^{-\mathrm{i}t{\mu _{ik}}}\big)}^{\prime }{\mathrm{e}}^{({u_{ik}}/2\pi ){\sin }^{2}(t/2)}\big|\\{} & \leqslant C(s)\big({u_{ik}}+{\sigma _{ik}^{2}}\big)\big|\sin (t/2)\big|\\{} & \leqslant C(s)\big({u_{ik}}+{\sigma _{ik}^{2}}\big)\big|\sin (t/2)\big|\exp \bigg\{-\frac{{u_{ik}}}{\pi }{\sin }^{2}(t/2)\bigg\}{\mathrm{e}}^{1/\pi }.\end{aligned}\]

The same bound holds for $|{({\widehat{\varPsi }_{ik}}(t)\exp \{-\mathrm{i}t{\mu _{ik}}\})}^{\prime }|$. The direct calculation shows that

\[ \big|{\big({\big({\mathrm{e}}^{\mathrm{i}t}-1\big)}^{s+1}\exp \big\{{u_{ij}}\big({\mathrm{e}}^{\mathrm{i}t}-1-\mathrm{i}t\big)\big\}\big)}^{\prime }\big|\leqslant C(s){\big|\sin (t/2)\big|}^{s}\exp \bigg\{-\frac{1}{\pi }{u_{ij}}{\sin }^{2}(t/2)\bigg\}.\]

Taking into account of the previous two estimates, it is not difficult to prove that

\[\begin{aligned}\big|{\Delta ^{\prime }}(t)\big|& \leqslant C(s){\big|\sin (t/2)\big|}^{s}\exp \Bigg\{-\frac{1}{\pi }{\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}{\sin }^{2}(t/2)\Bigg\}\\{} & \hspace{1em}\times \Bigg(1+{\sin }^{2}(t/2){\sum \limits_{k=1,k\ne j}^{{n_{i}}}}\big({u_{ik}}+{\sigma _{ik}^{2}}\big)\Bigg)\\{} & \leqslant C(s){\Bigg({\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg)}^{-s/2}\Bigg(1+{\sum \limits_{k=1}^{{n_{i}}}}{\sigma _{ik}^{2}}/{\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg).\end{aligned}\]

From Lemma 4.6, it follows that

(28)

\[ \Bigg\| {({I_{{w_{i}}}}-I)}^{s+1}{\prod \limits_{k=1}^{j-1}}{\varPhi _{ik,w}}{\prod \limits_{k=j+1}^{{n_{i}}}}{\varPsi _{ik,w}}\Bigg\| \leqslant C(s){\Bigg({\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg)}^{-s/2}\Bigg(1+{\sum \limits_{k=1}^{{n_{i}}}}{\sigma _{ik}^{2}}/{\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg).\]

The remaining two norms in (27) can be estimated similarly:

(29)

\[ \Bigg\| {\prod \limits_{m=1}^{{n_{l}}}}{\varPhi _{lm,w}}\Bigg\| ,\Bigg\| {\prod \limits_{m=1}^{{n_{l}}}}{\varPsi _{lm,w}}\Bigg\| \leqslant C\Bigg(1+{\sum \limits_{m=1}^{{n_{l}}}}{\sigma _{lm}^{2}}/{\sum \limits_{m=1}^{{n_{l}}}}{u_{lm}}\Bigg).\]

Substituting (28), (29) into (27), we obtain

(30)

\[ \| {W_{1}}\| \leqslant C(N,s){\Bigg({\sum \limits_{m=1}^{{n_{i}}}}{u_{im}}\Bigg)}^{-s/2}{\prod \limits_{l=1}^{N}}\Bigg(1+{\sum \limits_{k=1}^{{n_{l}}}}{\sigma _{lk}^{2}}/{\sum \limits_{k=1}^{{n_{l}}}}{u_{lk}}\Bigg).\]

Combining (30) with (25), (26) and (24), we get

\[\begin{aligned}{\big|{({I_{{w_{i}}}}-I)}^{s+1}{E_{ij}}{T_{i}}\big|}_{K}& \leqslant C(N,s)\frac{\,{\max _{j}}{w_{j}}}{\,{\min _{j}}{w_{j}}}{\Bigg({\sum \limits_{i=1}^{N}}{\sum \limits_{k=1}^{{n_{i}}}}{u_{ik}}\Bigg)}^{-1/2}\\{} & \hspace{1em}\times {\Bigg({\sum \limits_{m=1}^{{n_{i}}}}{u_{im}}\Bigg)}^{-s/2}{\prod \limits_{l=1}^{N}}\Bigg(1+{\sum \limits_{k=1}^{{n_{l}}}}{\sigma _{lk}^{2}}/{\sum \limits_{k=1}^{{n_{l}}}}{u_{lk}}\Bigg).\end{aligned}\]

Substituting the last estimate into (21) we complete the proof of (5). The proof of (6) is very similar and, therefore, omitted. □

Proof of Theorem 2.2.

We outline only the differences from the proof of Theorem 2.1. No use of convolution with the inverse Poisson measure is required, since we have powers of ${F_{i}^{{n_{i}}}}$, which can be used for Levy’s concentration function. Let $\lfloor a\rfloor $ denote an integer part of a and let $a(k):=\lfloor (k-1)/2\rfloor $, $b(k):=\lfloor ({n_{i}}-k)/2\rfloor $. Then, as in the proof of Theorem 2.1, we obtain

\[\begin{aligned}{\big|\mathcal{L}(S)-\mathcal{L}(Z)\big|_{K}}& \leqslant C(s){\sum \limits_{i=1}^{N}}{\sum \limits_{k=1}^{{n_{i}}}}\big({\beta _{s+1}^{+}}({F_{i}},{G_{i}})+{\beta _{s+1}^{-}}({F_{i}},{G_{i}})\big)\\{} & \hspace{1em}\times {\Bigg|{({I_{{w_{i}}}}-I)}^{s+1}{F_{iw}^{a(k)}}{G_{iw}^{b(k)}}{F_{iw}^{a(k)}}{G_{iw}^{b(k)}}{\prod \limits_{j=1}^{i-1}}{F_{jw}^{{n_{j}}}}{\prod \limits_{j=i+1}^{N}}{G_{jw}^{{n_{j}}}}\Bigg|_{K}}.\end{aligned}\]

Here ${F_{iw}}$ and ${G_{iw}}$ denote the distributions of ${w_{i}}{X_{ij}}$ and ${w_{i}}{Z_{ij}}$, respectively. We can apply Lemma 4.1 to the Kolmogorov norm given above, taking $W={({I_{{w_{i}}}}-I)}^{s+1}{F_{iw}^{a(k)}}{G_{iw}^{b(k)}}$. The remaining distribution is used in Levy’s concentration function. The Fourier–Stieltjes transform $\widehat{W}(t)/t$ is estimated exactly as in the proof of Theorem 2.1. The total variation of any distribution is equal to 1, therefore $\| W\| \leqslant \| {I_{{w_{i}}}}-I\| \leqslant 2$ and we can avoid application of Lemma 4.6. □

Proof of Corollary 2.1.

As proved in [1], p. 144,

\[ \frac{1}{2}\big\| {G_{k}}({I_{1}}-I)\big\| \leqslant {\bigg(\frac{{p_{k}}{\nu _{1}^{+}}({F_{k}})}{{q_{k}}}\ln \frac{1}{{p_{k}}}\bigg)}^{-1/2}.\]

Observe that ${\nu _{1}^{+}}({F_{j}})={\nu _{1}^{+}}({G_{j}})$ and ${\nu _{2}^{+}}({F_{j}})={\nu _{2}^{+}}({G_{j}})$. It remains to find ${\nu _{3}^{+}}({G_{j}})$ and apply Theorem 2.2. □

6 Proof of Theorem 3.1

The proof is similar to the one given in [22]. Let ${A_{i}}=\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/30\}$. From Lemma 4.7, it follows that

\[ {H_{in}}={A_{i}}{\varTheta _{i}}C+{\mathrm{e}}^{-{C_{i}}{n_{i}}}{\varTheta _{i}}C,\hspace{2em}{D_{in}}={A_{i}}{\varTheta _{i}}C,\hspace{1em}i=1,2,\dots ,N.\]

Here we have added index to ${\varTheta _{i}}$ emphasizing that they might be different for different i. As usual, we assume that the convolution ${\prod _{k=N+1}^{N}}={\prod _{k=1}^{0}}=I$. Let also denote by ${\sum _{i}^{\ast }}$ summation over all indices $\{{j_{1}},{j_{2}},\dots ,{j_{i-1}}\in \{0,1\}\}$. Taking into account Lemma 4.7 and the properties of the Kolmogorov and total variation norms given in the Introduction, we get

(31)

\[\begin{aligned}& {\Bigg|{\prod \limits_{i=1}^{N}}{H_{in}}-{\prod \limits_{i=1}^{N}}{D_{in}}\Bigg|}_{K}\\{} & \hspace{1em}\leqslant {\sum \limits_{i=1}^{N}}{\Bigg|({H_{in}}-{D_{in}}){\prod \limits_{k=1}^{i-1}}{H_{kn}}{\prod \limits_{k=i+1}^{N}}{D_{kn}}\Bigg|}_{K}\\{} & \hspace{1em}\leqslant {\sum \limits_{i=1}^{N}}\Bigg|({H_{in}}-{D_{in}}){\sum \nolimits_{i}^{\ast }}{\prod \limits_{k=1}^{i-1}}{A_{k}^{{j_{k}}}}{\varTheta _{k}}C\\{} & \hspace{2em}\times {\prod \limits_{k=i+1}^{N}}{A_{k}}{\varTheta _{k}}C{\prod \limits_{k=1}^{i-1}}{\mathrm{e}}^{-(1-{j_{k}}){n_{k}}{C_{k}}}{\varTheta _{k}}C\Bigg|{_{K}}\\{} & \hspace{1em}\leqslant C(N){\sum \limits_{i=1}^{N}}{\overline{q}_{i}}({p_{i}}+{\overline{q}_{i}}){\sum \nolimits_{i}^{\ast }}\,{\Bigg|{Y_{i}}\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/60\}{\prod \limits_{k=1}^{i-1}}{A_{k}^{{j_{k}}}}{\prod \limits_{k=i+1}^{N}}{A_{k}}\Bigg|}_{K}\\{} & \hspace{2em}\times {\prod \limits_{k=1}^{i-1}}{\mathrm{e}}^{-(1-{j_{k}}){n_{k}}{C_{k}}}+C{\sum \limits_{i=1}^{N}}({p_{i}}+{\overline{q}_{i}}){\mathrm{e}}^{-{C_{i}}{n_{i}}}\\{} & \hspace{2em}\times {\sum \nolimits_{i}^{\ast }}{\Bigg|({I_{{w_{i}}}}-I){\prod \limits_{k=1}^{i-1}}{A_{k}^{{j_{k}}}}{\prod \limits_{k=i+1}^{N}}{A_{k}}\Bigg|}_{K}{\prod \limits_{k=1}^{i-1}}{\mathrm{e}}^{-(1-{j_{k}}){n_{k}}{C_{k}}}.\end{aligned}\]

Both summands on the right-hand side of (31) are estimated similarly. Observe that

\[\begin{aligned}& \Bigg|{Y_{i}}\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/60\}{{\prod \limits_{k=1}^{i-1}}{A_{k}^{{j_{k}}}}{\prod \limits_{k=i+1}^{N}}{A_{k}}}_{K}\Bigg|\\{} & \hspace{1em}=\,{\Bigg|{Y_{i}}\exp \Bigg\{\frac{{n_{i}}{\gamma _{i}}{Y_{i}}}{60}+\frac{1}{30}{\sum \limits_{k=1}^{i-1}}{j_{k}}{n_{k}}{\gamma _{k}}{Y_{k}}+\frac{1}{30}{\sum \limits_{k=i+1}^{N}}{n_{k}}{\gamma _{k}}{Y_{k}}\Bigg\}\Bigg|}_{K}.\end{aligned}\]

Next we apply Lemma 4.1 with $W={Y_{i}}$ and $h=\max {w_{i}}/\pi $ and V with

\[\begin{aligned}\widehat{V}(t)& =\exp \Bigg\{-\frac{1}{90}\Bigg[{\sum \limits_{k=1}^{i-1}}{j_{k}}\max ({n_{k}}{\overline{q}_{k}},1){\sin }^{2}(t{w_{k}}/2)\\{} & \hspace{1em}+{\sum \limits_{k=i}^{N}}\max ({n_{k}}{\overline{q}_{k}},1){\sin }^{2}(t{w_{k}}/2)\Bigg]\Bigg\}.\end{aligned}\]

By Lemma 4.7

\[ \frac{|{\widehat{Y}_{i}}(t)|}{t}\frac{1}{h}+\| {Y_{i}}\| \leqslant C.\]

Observe that

\[\begin{aligned}& \Bigg|\exp \Bigg\{\frac{{n_{i}}{\gamma _{i}}}{60}{\widehat{Y}_{i}}(t)+\frac{1}{30}{\sum \limits_{k=1}^{i-1}}{j_{k}}{n_{k}}{\gamma _{k}}{\widehat{Y}_{k}}(t)+\frac{1}{30}{\sum \limits_{k=i+1}^{N}}{\gamma _{k}}{\widehat{Y}_{k}}(t)\Bigg\}\Bigg|\\{} & \hspace{1em}\leqslant \exp \Bigg\{-\frac{{n_{i}}{\gamma _{i}}{\sin }^{2}(t{w_{i}}/2)}{45}-\frac{2}{45}{\sum \limits_{k=1}^{i-1}}{j_{k}}{n_{k}}{\gamma _{k}}{\sin }^{2}(t{w_{k}}/2)\\{} & \hspace{2em}-\frac{2}{45}{\sum \limits_{k=i+1}^{N}}{n_{k}}{\gamma _{k}}{\sin }^{2}(t{w_{k}}/2)\Bigg\}\\{} & \hspace{1em}\leqslant \exp \Bigg\{-\frac{1}{90}\Bigg[{\sum \limits_{k=1}^{i-1}}{j_{k}}{n_{k}}{\overline{q}_{k}}{\sin }^{2}(t{w_{k}}/2)+{\sum \limits_{k=i}^{N}}{n_{k}}{\overline{q}_{k}}{\sin }^{2}(t{w_{k}}/2)\Bigg]\Bigg\}\\{} & \hspace{1em}\leqslant {\mathrm{e}}^{N/90}\exp \Bigg\{-\frac{1}{90}\Bigg[{\sum \limits_{k=1}^{i-1}}{j_{k}}({n_{k}}{\overline{q}_{k}}+1){\sin }^{2}(t{w_{k}}/2)\\{} & \hspace{2em}+{\sum \limits_{k=i}^{N}}({n_{k}}{\overline{q}_{k}}+1){\sin }^{2}(t{w_{k}}/2)\Bigg]\Bigg\}\\{} & \hspace{1em}\leqslant {\mathrm{e}}^{N/90}\exp \Bigg\{-\frac{1}{90}\Bigg[{\sum \limits_{k=1}^{i-1}}{j_{k}}\max ({n_{k}}{\overline{q}_{k}},1){\sin }^{2}(t{w_{k}}/2)\\{} & \hspace{2em}+{\sum \limits_{k=i}^{N}}\max ({n_{k}}{\overline{q}_{k}},1){\sin }^{2}(t{w_{k}}/2)\Bigg]\Bigg\}\\{} & \hspace{1em}={\mathrm{e}}^{N/90}\widehat{V}(t).\end{aligned}\]

Therefore, using Lemma 4.2, we prove

(32)

\[\begin{aligned}& \,{\Bigg|{Y_{i}}\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/60\}{\prod \limits_{k=1}^{i-1}}{A_{k}^{{j_{k}}}}{\prod \limits_{k=i+1}^{N}}{A_{k}}\Bigg|}_{K}\\{} & \hspace{1em}\leqslant C(N)Q\Big(V,\underset{i}{\max }{w_{i}}/h\Big)\\{} & \hspace{1em}\leqslant C(N)\bigg(\frac{\max {w_{i}}}{\min {w_{i}}}\bigg)Q(V,\min {w_{i}}/2)\\{} & \hspace{1em}\leqslant C(N)\bigg(\frac{\max {w_{i}}}{\min {w_{i}}}\bigg){\Bigg({\sum \limits_{k=1}^{i-1}}{j_{k}}\max ({n_{k}}{\overline{q}_{k}},1)+{\sum \limits_{k=i+1}^{N}}\max ({n_{k}}{\overline{q}_{k}},1)\Bigg)}^{-1/2}.\end{aligned}\]

Next observe that by Lemma 4.7,

\[\begin{aligned}\Bigg|{\prod \limits_{k=1}^{i-1}}{\mathrm{e}}^{-(1-{j_{k}}){n_{k}}{C_{k}}}\Bigg|& =C\exp \Bigg\{-{\sum \limits_{k=1}^{i-1}}(1-{j_{k}}){C_{k}}{n_{k}}\Bigg\}\\{} & \leqslant \frac{C({k_{0}},N)}{\max (1,\sqrt{{\textstyle\sum _{k=1}^{i-1}}(1-{j_{k}})\max ({n_{k}}{\overline{q}_{k}},1)})}.\end{aligned}\]

The last estimate, (32) and the trivial inequality $1/(ab)<2/(a+b)$, valid for any $a,b\geqslant 1$, allow us to obtain

\[\begin{aligned}& {\sum \limits_{i=1}^{N}}{\overline{q}_{i}}({p_{i}}+{\overline{q}_{i}}){\sum \nolimits_{i}^{\ast }}\,{\Bigg|{Y_{i}}\exp \{{n_{i}}{\gamma _{i}}{Y_{i}}/60\}{\prod \limits_{k=1}^{i-1}}{A_{k}^{{j_{k}}}}{\prod \limits_{k=i+1}^{N}}{A_{k}}\Bigg|}_{K}{\prod \limits_{k=1}^{i-1}}{\mathrm{e}}^{-(1-{j_{k}}){n_{k}}{C_{k}}}\\{} & \hspace{1em}\leqslant C({k_{0}},N)\frac{\max {w_{j}}}{\min {w_{j}}}\cdot \frac{{\textstyle\sum _{i=1}^{N}}{\overline{q}_{i}}({p_{i}}+{\overline{q}_{i}})}{\sqrt{{\textstyle\sum _{k=1}^{N}}\max ({n_{k}}{\overline{q}_{k}},1)}}.\end{aligned}\]

The estimation of the second sum in (31) is almost identical and, therefore, omitted. □

Authors

Abstract

1 Introduction

(1)

2 Sums of independent rvs

(2)

(3)

(4)

Theorem 2.1.

(5)

(6)

Theorem 2.2.

(7)

(8)

(9)

Corollary 2.1.

(10)

Remark 2.1.

3 Sums of Markov Binomial rvs

(11)

(12)

Theorem 3.1.

(13)

Remark 3.1.

4 Auxiliary results

Lemma 4.1.

Lemma 4.2.

(14)

(15)

(16)

(17)

Lemma 4.3.

(18)

Proof.

Lemma 4.4.

Proof.

Lemma 4.5.

(19)

(20)

Lemma 4.6.

Lemma 4.7.

Proof.

5 Proofs for sums of independent rvs

Proof of Theorem 2.1.

(21)

(22)

(23)

(24)

(25)

(26)

(27)

(28)

(29)

(30)

Proof of Theorem 2.2.

Proof of Corollary 2.1.

6 Proof of Theorem 3.1

(31)

(32)

Acknowledgement

References

Export citation

Copy and paste formatted citation

Download citation in file