Asymptotic genealogies for a class of generalized Wright–Fisher models

Huillet, Thierry; Möhle, Martin

doi:10.15559/21-VMSTA196

Modern Stochastics: Theory and Applications

Asymptotic genealogies for a class of generalized Wright–Fisher models

Volume 9, Issue 1 (2022), pp. 17–43

Thierry Huillet Martin Möhle

https://doi.org/10.15559/21-VMSTA196

Pub. online: 15 December 2021 Type: Research Article

Open Access

Received
8 March 2021

Revised
30 September 2021

Accepted
27 November 2021

Published
15 December 2021

Abstract

A class of Cannings models is studied, with population size N having a mixed multinomial offspring distribution with random success probabilities ${W_{1}},\dots ,{W_{N}}$ induced by independent and identically distributed positive random variables ${X_{1}},{X_{2}},\dots $ via ${W_{i}}:={X_{i}}/{S_{N}}$, $i\in \{1,\dots ,N\}$, where ${S_{N}}:={X_{1}}+\cdots +{X_{N}}$. The ancestral lineages are hence based on a sampling with replacement strategy from a random partition of the unit interval into N subintervals of lengths ${W_{1}},\dots ,{W_{N}}$. Convergence results for the genealogy of these Cannings models are provided under assumptions that the tail distribution of ${X_{1}}$ is regularly varying. In the limit several coalescent processes with multiple and simultaneous multiple collisions occur. The results extend those obtained by Huillet [J. Math. Biol. 68 (2014), 727–761] for the case when ${X_{1}}$ is Pareto distributed and complement those obtained by Schweinsberg [Stoch. Process. Appl. 106 (2003), 107–139] for models where sampling is performed without replacement from a supercritical branching process.

1 Introduction

Let ${X_{1}},{X_{2}},\dots $ be independent copies of a random variable X taking values in $(0,\infty )$. For $N\in \mathbb{N}:=\{1,2,\dots \}$ define ${S_{N}}:={X_{1}}+\cdots +{X_{N}}$ and ${W_{i}}:={X_{i}}/{S_{N}}$, $i\in \{1,\dots ,N\}$. The weights ${W_{1}},\dots ,{W_{N}}$ are exchangeable random variables with ${W_{1}}+\cdots +{W_{N}}=1$. In particular, $\mathbb{E}({W_{i}})=1/N$, $i\in \{1,\dots ,N\}$. Consider the Cannings model [6, 7] with population size N and nonoverlapping generations such that, conditional on ${W_{1}},\dots ,{W_{N}}$, the offspring sizes ${\nu _{1}},\dots ,{\nu _{N}}$ have a multinomial distribution with parameters N and ${W_{1}},\dots ,{W_{N}}$. Thus, the offspring distribution is

(1)

\[ \mathbb{P}({\nu _{1}}={i_{1}},\dots ,{\nu _{N}}={i_{N}})\hspace{2.5pt}=\hspace{2.5pt}\frac{N!}{{i_{1}}!\cdots {i_{N}}!}\mathbb{E}({W_{1}^{{i_{1}}}}\cdots {W_{N}^{{i_{N}}}}),\]

${i_{1}},\dots ,{i_{N}}\in {\mathbb{N}_{0}}:=\{0,1,2,\dots \}$ with ${i_{1}}+\cdots +{i_{N}}=N$. For degenerate X, i.e. $\mathbb{P}(X=c)=1$ for some real constant $c>0$, this model reduces to the classical Wright–Fisher model with deterministic weights ${W_{i}}=1/N$, $i\in \{1,\dots ,N\}$. It is straightforward to check that the offspring sizes have joint descending factorial moments

(2)

\[ \mathbb{E}({({\nu _{1}})_{{k_{1}}}}\cdots {({\nu _{N}})_{{k_{N}}}})={(N)_{{k_{1}}+\cdots +{k_{N}}}}\mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{N}^{{k_{N}}}}),\hspace{1em}{k_{1}},\dots ,{k_{N}}\in {\mathbb{N}_{0}},\]

where ${(x)_{0}}:=1$ and ${(x)_{k}}:=x(x-1)\cdots (x-k+1)$ for $x\in \mathbb{R}$ and $k\in \mathbb{N}$. In [15] this model is studied for the case when X is Pareto distributed. If X is gamma distributed with density $x\mapsto {x^{r-1}}{e^{-x}}/\Gamma (r)$, $x>0$, for some $r>0$, then $({W_{1}},\dots ,{W_{N}})$ is symmetric Dirichlet distributed with parameter r, leading to the Cannings model with the offspring distribution

\[ \mathbb{P}({\nu _{1}}={i_{1}},\dots ,{\nu _{N}}={i_{N}})\hspace{2.5pt}=\hspace{2.5pt}\frac{N!}{{i_{1}}!\cdots {i_{N}}!}\frac{{[r]_{{i_{1}}}}\cdots {[r]_{{i_{N}}}}}{{[rN]_{N}}},\]

${i_{1}},\dots ,{i_{N}}\in {\mathbb{N}_{0}}$ with ${i_{1}}+\cdots +{i_{N}}=N$, where ${[x]_{0}}:=1$ and ${[x]_{i}}:=x(x+1)\cdots (x+i-1)$ for $x\in \mathbb{R}$ and $i\in \mathbb{N}$. This Dirichlet multinomial model has been studied extensively in the literature (see, for example, Griffiths and Spanò [13]). In a series of papers [16, 17, 19] a subclass of Cannings models, called conditional branching process models in the spirit of Karlin and McGregor [23, 24], has been investigated, whose offspring distributions are (by definition) obtained by assuming that $\mathbb{P}({X_{1}}+\cdots +{X_{N}}=N)>0$ and conditioning on the event that ${X_{1}}+\cdots +{X_{N}}=N$. This construction based on conditioning is rather different from the construction based on sampling from a random partition of the unit interval we are dealing with in this article. Note however that several concrete examples (such as the classical Wright–Fisher model and the above mentioned Dirichlet multinomial model) can be constructed in both ways, either by sampling or by conditioning. For example, the Dirichlet multinomial model is obtained by taking N independent and identically distributed negative binomial random variables ${X_{1}},\dots ,{X_{N}}$ with parameter $r>0$ and $p\in (0,1)$, so with distribution $\mathbb{P}({X_{1}}=k)=\left(\genfrac{}{}{0.0pt}{}{r+k-1}{k}\right){p^{r}}{(1-p)^{k}}$, $k\in {\mathbb{N}_{0}}$, and conditioning on the event that ${X_{1}}+\cdots +{X_{N}}=N$.

The closely related model studied by Schweinsberg [37] differs from ours, since sampling is performed without replacement from a discrete super-critical Galton–Watson branching process, as explained in [37, Section 1.3]. In that model, X is integer valued and satisfies $\mathbb{E}(X)>1$. In our model, X does not need to be integer valued and its mean is allowed to be less than 1. Moreover, the sampling in our multinomial model is with replacement, whereas in Schweinsberg’s model it is without replacement.

The same multinomial scheme with an additional dormancy mechanism is considered in the recent work by Cordero et al. [8]. A class of Dirichlet models in the domain of attraction of the Kingman coalescent is also studied in two recent works by Boenkost et al. [4, 5] with an emphasis on Haldane’s formula [14]. We refer the reader to Athreya [1] for some more information on Haldane’s formula.

Fix $n\in \{1,\dots ,N\}$ and sample n individuals from the current generation. For $r\in {\mathbb{N}_{0}}$ define a random partition ${\Pi _{r}^{(N,n)}}$ of $\{1,\dots ,n\}$ such that $i,j\in \{1,\dots ,n\}$ belong to the same block of ${\Pi _{r}^{(N,n)}}$ if and only if the individual i and j share a common parent r generations backward in time. The process ${\Pi ^{(N,n)}}:={({\Pi _{r}^{(N,n)}})_{r\in {\mathbb{N}_{0}}}}$, called the discrete-time n-coalescent, takes values in the space ${\mathcal{P}_{n}}$ of partitions of $\{1,\dots ,n\}$. As in [15] we are interested in the limiting behavior of the discrete-time n-coalescent as the total population size N tends to infinity. It is easily seen (and well known) that the discrete-time n-coalescent is a time-homogeneous Markovian process. The transition probabilities ${p_{\pi {\pi ^{\prime }}}}:=\mathbb{P}({\Pi _{r+1}^{(N,n)}}={\pi ^{\prime }}\hspace{0.1667em}|\hspace{0.1667em}{\Pi _{r}^{(N,n)}}=\pi )$ are given by

(3)

\[ {p_{\pi {\pi ^{\prime }}}}\hspace{2.5pt}=\hspace{2.5pt}{(N)_{j}}\mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}})\hspace{2.5pt}=:\hspace{2.5pt}{\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}}),\hspace{2em}\pi ,{\pi ^{\prime }}\in {\mathcal{P}_{n}},\]

if each block of ${\pi ^{\prime }}$ is a union of some blocks of π, where $j:=|{\pi ^{\prime }}|$ denotes the number of blocks of ${\pi ^{\prime }}$ and ${k_{1}},\dots ,{k_{j}}$ are the group sizes of merging blocks of π. Note that ${\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})$ is defined for all $N,j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}$. Since the random variables ${W_{1}},\dots ,{W_{N}}$ are exchangeable and satisfy ${W_{1}}+\cdots +{W_{N}}=1$, it follows for all $N,j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}$ with $j\le N$ that

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle (N-j)\mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}}{W_{j+1}})\hspace{2.5pt}=\hspace{2.5pt}\mathbb{E}\big({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}}({W_{j+1}}+\cdots +{W_{N}})\big)\\ {} & & \displaystyle \hspace{2em}=\mathbb{E}\big({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}}(1-({W_{1}}+\cdots +{W_{j}}))\big)\\ {} & & \displaystyle \hspace{2em}=\mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}})-{\sum \limits_{i=1}^{j}}\mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{i-1}^{{k_{i-1}}}}{W_{i}^{{k_{i}}+1}}{W_{i+1}^{{k_{i+1}}}}\cdots {W_{j}^{{k_{j}}}}).\end{array}\]

Multiplication by ${(N)_{j}}$ ($=0$ for $j>N$) shows that the consistency relation

(4)

\[\begin{aligned}{}& {\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})\\ {} & \hspace{2em}={\Phi _{j+1}^{(N)}}({k_{1}},\dots ,{k_{j}},1)+{\sum \limits_{i=1}^{j}}{\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{i-1}},{k_{i}}+1,{k_{i+1}},\dots ,{k_{j}})\end{aligned}\]

holds for all $N,j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}$. Moreover, for all $j,l\in \mathbb{N}$ with $j\ge l$ and all ${k_{1}},\dots ,{k_{j}},{m_{1}},\dots ,{m_{l}}\in \mathbb{N}$ with ${k_{1}}\ge {m_{1}},\dots ,{k_{l}}\ge {m_{l}}$, the monotonicity relation

(5)

\[ {\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})\hspace{2.5pt}\le \hspace{2.5pt}{\Phi _{l}^{(N)}}({m_{1}},\dots ,{m_{l}})\]

holds. Note that (5) follows from (4) by induction on the difference $d:=j-l\in {\mathbb{N}_{0}}$. We refer the reader to [30, Definition 2.2] and the remark thereafter for similar statements and proofs for the full class of Cannings models. Choosing $j=1$ and ${k_{1}}=2$ in (3) shows that two individuals share a common ancestor one generation backward in time with probability ${c_{N}}:={\Phi _{1}^{(N)}}(2)=N\mathbb{E}({W_{1}^{2}})$, the so-called coalescence probability. We also introduce the effective population size ${N_{e}}:=1/{c_{N}}$. Note that ${c_{N}}=N\mathbb{E}({W_{1}^{2}})\ge N{(\mathbb{E}({W_{1}}))^{2}}=1/N$ or, equivalently, ${N_{e}}\le N$. All Cannings models having an effective population size strictly larger than N (such as the Moran model having effective population size ${N_{e}}=N(N-1)/2>N$ for $N\ge 4$ and most of the extended Moran models studied by Eldon and Wakeley [11] and Huillet and Möhle [18]) therefore do not belong to the class of models we are dealing with in this article.

General results for Cannings models concerning the convergence of their genealogical tree to an exchangeable coalescent process as the total population size tends to infinity are provided in [32]. For information on the theory of exchangeable coalescent processes we refer the reader to Pitman [33], Sagitov [34] and Schweinsberg [35, 36]. Coalescents with multiple collisions (Λ-coalescents) are Markovian stochastic processes taking values in the set of partitions of $\mathbb{N}$. They are characterized by a finite measure Λ on the unit interval. Important examples are Dirac-coalescents, where $\Lambda ={\delta _{a}}$ is the Dirac measure at a given point $a\in [0,1]$, including the prominent Kingman coalescent (Kingman [26, 25, 27]), where $\Lambda ={\delta _{0}}$ is the Dirac measure at 0, and the star-shaped coalescent, where $\Lambda ={\delta _{1}}$. Other important examples are beta coalescents, where $\Lambda =\beta (a,b)$ is the beta distribution with parameters $a,b>0$, including the Bolthausen–Sznitman coalescent, where Λ is the uniform distribution on the unit interval ($a=b=1$).

The full class of exchangeable coalescent processes (Ξ-coalescents) allowing for simultaneous multiple collisions of ancestral lineages is characterized by a finite measure Ξ on the infinite simplex $\Delta :=\{x=({x_{1}},{x_{2}},\dots ):{x_{1}}\ge {x_{2}}\ge \cdots \ge 0,{\textstyle\sum _{i=1}^{\infty }}{x_{i}}\le 1\}$. An example is the two-parameter Poisson–Dirichlet coalescent with parameters $\alpha >0$ and $\theta >-\alpha $, where the characterizing measure $\nu (\mathrm{d}x):=\Xi (\mathrm{d}x)/{\textstyle\sum _{i=1}^{\infty }}{x_{i}^{2}}$ on Δ is (by definition) the Poisson–Dirichlet distribution $\nu =\mathrm{PD}(\alpha ,\theta )$ with parameters $\alpha >0$ and $\theta >-\alpha $. For more information on the Poisson–Dirichlet coalescent we refer the reader to Section 6 of [31]. In most studies, continuous-time coalescent processes ${({\Pi _{t}})_{t\in T}}$ with index set $T=[0,\infty )$ are considered. Note however that all Ξ-coalescents can as well be introduced with discrete time $T={\mathbb{N}_{0}}$. In this case one speaks about a discrete-time Ξ-coalescent ${({\Pi _{r}})_{r\in {\mathbb{N}_{0}}}}$. The following terminology is taken from [16, Definition 2.1].

Definition 1.

(i) A Cannings model is said to be in the domain of attraction of a continuous-time coalescent $\Pi ={({\Pi _{t}})_{t\ge 0}}$ if for each sample size $n\in \mathbb{N}$ the time-scaled ancestral process ${({\Pi _{\lfloor t/{c_{N}}\rfloor }^{(N,n)}})_{t\ge 0}}$ converges in ${D_{{\mathcal{P}_{n}}}}([0,\infty ))$ to ${\Pi ^{(n)}}$ as $N\to \infty $, where ${\Pi ^{(n)}}={({\Pi _{t}^{(n)}})_{t\ge 0}}$ denotes the restriction of Π to a sample of size n.

(ii) Analogously, a Cannings model is said to be in the domain of attraction of a discrete-time coalescent $\Pi ={({\Pi _{r}})_{r\in {\mathbb{N}_{0}}}}$ if for each sample size $n\in \mathbb{N}$ the ancestral process ${({\Pi _{r}^{(N,n)}})_{r\in {\mathbb{N}_{0}}}}$ converges in ${D_{{\mathcal{P}_{n}}}}({\mathbb{N}_{0}})$ to ${\Pi ^{(n)}}$ as $N\to \infty $, where ${\Pi ^{(n)}}={({\Pi _{r}^{(n)}})_{r\in {\mathbb{N}_{0}}}}$ denotes the restriction of Π to a sample of size n.

Conditions on the tails of the distribution of X are provided which ensure that the population model with the offspring distribution (1) is in the domain of attraction of some exchangeable coalescent process. The tail condition is of the standard form $\mathbb{P}(X>x)\sim {x^{-\alpha }}\ell (x)$ as $x\to \infty $, where $\alpha \ge 0$ and ℓ is a function slowly varying at ∞. The results are collected in Theorem 1 in Section 2. It turns out that the three parameter values $\alpha \in \{0,1,2\}$ are boundary cases. Consequently, six different regimes ($\alpha >2$, $\alpha =2$, $\alpha \in (1,2)$, $\alpha =1$, $\alpha \in (0,1)$ and $\alpha =0$) are considered leading to different limiting behaviors of the ancestral process. Theorem 1 also provides the asymptotics of the coalescence probability ${c_{N}}$ as $N\to \infty $ for all six cases. In Section 3 some illustrating examples are provided including the case studied in [15] when X is Pareto distributed. The proofs are provided in the main Section 4. They are based on general convergence-to-the-coalescent theorems for Cannings models provided in [32] and combine (Abelian and Tauberian) arguments from the theory of regularly varying functions in the spirit of Karamata [20–22] with techniques used by Huillet [15] for the Pareto case and by Schweinsberg [37] for the related model where the sampling is performed without replacement.

2 Results

For most of the results it is assumed that there exist a constant $\alpha \ge 0$ and a function $\ell :(0,\infty )\to (0,\infty )$ slowly varying at ∞ such that

(6)

\[ \mathbb{P}(X>x)\hspace{2.5pt}\sim \hspace{2.5pt}{x^{-\alpha }}\ell (x),\hspace{2em}x\to \infty .\]

Our main result (Theorem 1) clarifies the limiting behavior of the ancestral structure of the Cannings model with the offspring distribution (1) as the total population size N tends to infinity under the assumption (6). It turns out that the parameter values $\alpha \in \{0,1,2\}$ are boundary cases. It is hence natural to distinguish six regimes corresponding to the parameter ranges $\alpha >2$, $\alpha =2$, $\alpha \in (1,2)$, $\alpha =1$, $\alpha \in (0,1)$ and $\alpha =0$. In order to state the result it is convenient to introduce the function ${\ell ^{\ast }}:(1,\infty )\to (0,\infty )$ via

(7)

\[ {\ell ^{\ast }}(x)\hspace{2.5pt}:=\hspace{2.5pt}{\int _{1}^{x}}\frac{\ell (t)}{t}\hspace{0.1667em}\mathrm{d}t.\]

Note that ${\ell ^{\ast }}$ is nondecreasing, slowly varying at ∞ and satisfies $\ell (x)/{\ell ^{\ast }}(x)\to 0$ as $x\to \infty $, see, for example, Bingham and Doney [3, p. 717 and 718] or Eq. (1.5.8) on p. 26 of Bingham, Goldie and Teugels [2] and the remarks thereafter. More precisely, for every $\lambda >0$, as $x\to \infty $,

\[ \frac{{\ell ^{\ast }}(\lambda x)-{\ell ^{\ast }}(x)}{\ell (x)}\hspace{2.5pt}=\hspace{2.5pt}\frac{1}{\ell (x)}{\int _{x}^{\lambda x}}\frac{\ell (t)}{t}\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{1}^{\lambda }}\frac{\ell (xu)}{\ell (x)}\frac{1}{u}\hspace{0.1667em}\mathrm{d}u\hspace{2.5pt}\to \hspace{2.5pt}{\int _{1}^{\lambda }}\frac{1}{u}\hspace{0.1667em}\mathrm{d}u\hspace{2.5pt}=\hspace{2.5pt}\log \lambda ,\]

where the convergence holds by the uniform convergence theorem for slowly varying functions. Thus, ${\ell ^{\ast }}$ is a de Haan function (with ℓ-index 1) and hence slowly varying. For general information on de Haan theory we refer the reader to Chapter 3 of [2].

The main (and only) result of this article is the following.

Theorem 1.

For the Cannings model with the offspring distribution (1) the following assertions hold.

(i) If $\mathbb{E}({X^{2}})<\infty $ (in particular if (6) holds with $\alpha >2$) then the model is in the domain of attraction of the continuous-time Kingman coalescent and the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \rho /({\mu ^{2}}N)$ as $N\to \infty $, where $\mu :=\mathbb{E}(X)$ and $\rho :=\mathbb{E}({X^{2}})$.
(ii) If (6) holds with $\alpha =2$ then the model is in the domain of attraction of the continuous-time Kingman coalescent and the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim 2{\ell ^{\ast }}(N)/({\mu ^{2}}N)$ as $N\to \infty $, where $\mu :=\mathbb{E}(X)$ and ${\ell ^{\ast }}$ is defined via (7).
(iii) If (6) holds with $\alpha \in (1,2)$ then the model is in the domain of attraction of the continuous-time Λ-coalescent with $\Lambda :=\beta (2-\alpha ,\alpha )$ being the beta distribution with parameters $2-\alpha $ and α. Moreover, the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \alpha \mathrm{B}(2-\alpha ,\alpha ){\mu ^{-\alpha }}\ell (N)/{N^{\alpha -1}}=\Gamma (2-\alpha )\Gamma (\alpha +1){\mu ^{-\alpha }}\ell (N)/{N^{\alpha -1}}$ as $N\to \infty $, where $\mu :=\mathbb{E}(X)$.
(iv) If (6) holds with $\alpha =1$, then the model is in the domain of attraction of the continuous-time Bolthausen–Sznitman coalescent. If ${({a_{N}})_{N\in \mathbb{N}}}$ is a sequence of positive real numbers satisfying ${\ell ^{\ast }}({a_{N}})\sim {a_{N}}/N$ as $N\to \infty $, where ${\ell ^{\ast }}$ is defined via (7), then the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \ell ({a_{N}})/{\ell ^{\ast }}({a_{N}})\sim N\ell ({a_{N}})/{a_{N}}$ as $N\to \infty $.
(v) If (6) holds with $\alpha \in (0,1)$, then the model is in the domain of attraction of the discrete-time Ξ-coalescent, where the characterizing measure $\nu (\mathrm{d}x):=\Xi (\mathrm{d}x)/{\textstyle\sum _{i=1}^{\infty }}{x_{i}^{2}}$ is the Poisson–Dirichlet distribution $\nu =\mathrm{PD}(\alpha ,0)$ with parameters α and $\theta :=0$. The coalescence probability satisfies ${c_{N}}\to 1-\alpha $ as $N\to \infty $.
(vi) If (6) holds with $\alpha =0$, then the model is in the domain of attraction of the discrete-time star-shaped coalescent and the coalescence probability satisfies ${c_{N}}\to 1$ as $N\to \infty $.

In particular, for the first four cases (i)–(iv), ${c_{N}}\to 0$ as $N\to \infty $.

The six cases of Theorem 1 are summarized in Table 1. In the table, $\mu :=\mathbb{E}(X)$, $\rho :=\mathbb{E}({X^{2}})$, ${\ell ^{\ast }}(x):={\textstyle\int _{1}^{x}}\ell (t)/t\hspace{0.1667em}\mathrm{d}t$, $x>1$, and ${({a_{N}})_{N\in \mathbb{N}}}$ is a sequence such that ${\ell ^{\ast }}({a_{N}})\sim {a_{N}}/N$ as $N\to \infty $.

Remark 1.

If $\ell (x)\equiv C$ for some constant $C>0$, then ${\ell ^{\ast }}(x)=C{\textstyle\int _{1}^{x}}{t^{-1}}\hspace{0.1667em}\mathrm{d}t=C\log x$ as $x\to \infty $. Assume now in addition that $\alpha =1$. In this case, in part (iv) of Theorem 1 one can choose ${a_{1}}:=1$ and ${a_{N}}:=CN\log N$, $N\in \mathbb{N}\setminus \{1\}$. The coalescence probability thus satisfies ${c_{N}}\sim CN/{a_{N}}\sim 1/\log N$, in agreement with Proposition 6 of Huillet [15] for the Pareto example $\mathbb{P}(X>x)=1/x$, $x>1$. The same asymptotics for the coalescence probability holds for the related model considered by Schweinsberg (see [37, Lemma 16]) and, for example, when X is discrete taking the value $k\in \mathbb{N}$ with probability $\mathbb{P}(X=k)=1/(k(k+1))$.

Remark 2.

One may doubt that Theorem 1 is valid when X takes values close to 0 with high probability such that $\mathbb{E}(1/{S_{N}})=\infty $ for all $N\in \mathbb{N}$. Typical examples of this form arise when the Laplace transform ψ of X satisfies $\psi (u)\sim L(u)$ as $u\to \infty $ for some function L slowly varying at ∞, or, equivalently (see Feller [12], p. 445, Theorem 2 and p. 446, Theorem 3), if $\mathbb{P}(X\le x)\sim L(1/x)$ as $x\to 0$. A concrete example is $P(X\le x)=1/(1-\log x)$, $0<x\le 1$. In this case, $L(x)=1/\log x$, $x>0$, and, hence, $\mathbb{E}(1/{S_{N}})={\textstyle\int _{0}^{\infty }}{(\psi (u))^{N}}\hspace{0.1667em}\mathrm{d}u=\infty $ for all $N\in \mathbb{N}$. By Theorem 1 this model is in the domain of attraction of the Kingman coalescent, since $\mathbb{E}({X^{2}})<\infty $.

The finiteness or infiniteness of $\mathbb{E}(1/{S_{N}})$ turns out to be irrelevant for the statements in Theorem 1, since the convergence results of Theorem 1 solely depend on the limiting behavior of the joint moments of the weights ${W_{1}},\dots ,{W_{j}}$ as $N\to \infty $. For example (see Lemma 3), the asymptotics of $\mathbb{E}({W_{1}^{p}})$, $p>0$, as $N\to \infty $ is determined by the values $\psi (u)$ of the Laplace transform ψ for values of u close to 0. For any fixed $\delta >0$ the values $u>\delta $ do not play any role.

Conjectures and open problems.

Table 1.

Asymptotics of the ancestry of mixed multinomial Cannings models of the form (1) under the tail condition $\mathbb{P}(X>x)\sim {x^{-\alpha }}\ell (x)$ as $x\to \infty $

Condition	Limiting coalescent	Coalescence probability
$\mathbb{E}({X^{2}})<\infty $	Kingman	$\sim \displaystyle\frac{\rho }{{\mu ^{2}}N}$
$\alpha =2$	Kingman	$\sim \displaystyle\frac{2{\ell ^{\ast }}(N)}{{\mu ^{2}}N}$
$1<\alpha <2$	$\beta (2-\alpha ,\alpha )$	$\sim \displaystyle\frac{\Gamma (2-\alpha )\Gamma (\alpha +1)\ell (N)}{{\mu ^{\alpha }}{N^{\alpha -1}}}$
$\alpha =1$	Bolthausen–Sznitman	$\sim \displaystyle\frac{\ell ({a_{N}})}{{\ell ^{\ast }}({a_{N}})}\sim \displaystyle\frac{N\ell ({a_{N}})}{{a_{N}}}$
$\alpha \in (0,1)$	discrete time $\mathrm{PD}(\alpha ,0$)	$\sim 1-\alpha $
$\alpha =0$	discrete time star-shaped	$\sim 1$

Theorem 1 should also hold for Schweinsberg’s model [37], since sampling without replacement (instead of sampling with replacement) should neither influence the asymptotics of the coalescence probability nor the limiting processes arising in Theorem 1. Note that in [37] the subclass of models without replacement is studied where the function ℓ in (6) is constant. We leave the analysis of Schweinsberg’s model under the more general assumption (6) for the interested reader.

In contrast, conditional branching process models [16, 17, 19] seem to be harder to analyse and behave quite differently in general. Even for the subclass of so-called compound Poisson models, only partial results are available. Theorems 2.2 and 2.3 of [19] clarify that many unbiased compound Poisson models are in the domain of attraction of the Kingman coalescent, and [19, Theorem 2.5] (subcritical case) demonstrates that the limiting behavior of compound Poisson models can differ substantially from all scenarios arising in Theorem 1. To the best of the authors knowledge, the limiting behavior of the ancestral structure of unbiased conditional branching process models as $N\to \infty $ under assumptions of the form (6) has not been fully addressed in the literature. We leave this analysis for future research.

3 Examples

Example 1 (Pareto distribution).

Let X be Pareto distributed with parameter $\alpha >0$ having tail probabilities $\mathbb{P}(X>x)={x^{-\alpha }}$, $x>1$. Clearly, (6) holds with $\ell \equiv 1$, so Theorem 1 is applicable. Note that $\mathbb{E}({X^{p}})<\infty $ if and only if $p<\alpha $ and in this case $\mathbb{E}({X^{p}})=\alpha {\textstyle\int _{1}^{\infty }}{x^{p-\alpha -1}}\hspace{0.1667em}\mathrm{d}x=\alpha /(\alpha -p)$. In particular $\mu :=\mathbb{E}(X)=\alpha /(\alpha -1)<\infty $ for $\alpha >1$ and $\rho :=\mathbb{E}({X^{2}})=\alpha /(\alpha -2)<\infty $ for $\alpha >2$. By Theorem 1, for $\alpha \ge 2$ the model is in the domain of attraction of the Kingman coalescent, for $\alpha \in [1,2)$ in the domain of attraction of the $\beta (2-\alpha ,\alpha )$-coalescent, and for $\alpha \in (0,1)$ in the domain of attraction of the discrete-time Poisson–Dirichlet coalescent with parameter α.

Note that ${\ell ^{\ast }}(x)={\textstyle\int _{1}^{x}}1/t\hspace{0.1667em}\mathrm{d}t=\log x$, $x>1$. In part (iv) of Theorem 1, we can therefore choose ${a_{N}}:=N\log N$ and obtain ${c_{N}}\sim \ell ({a_{N}})/{\ell ^{\ast }}({a_{N}})=1/{\ell ^{\ast }}({a_{N}})\sim 1/\log N$ as $N\to \infty $. Thus, by Theorem 1, the coalescence probability ${c_{N}}$ satisfies

\[ {c_{N}}\hspace{2.5pt}\sim \hspace{2.5pt}\left\{\begin{array}{c@{\hskip10.0pt}l}\frac{\rho }{{\mu ^{2}}N}\hspace{2.5pt}=\hspace{2.5pt}\frac{{(\alpha -1)^{2}}}{\alpha (\alpha -2)N}& \text{if}\hspace{2.5pt}\alpha >2\text{,}\\ {} \frac{2{\ell ^{\ast }}(N)}{{\mu ^{2}}N}\hspace{2.5pt}=\hspace{2.5pt}\frac{\log N}{2N}& \text{if}\hspace{2.5pt}\alpha =2\text{,}\\ {} \frac{\Gamma (2-\alpha )\Gamma (\alpha +1)}{{\mu ^{\alpha }}{N^{\alpha -1}}}& \text{if}\hspace{2.5pt}\alpha \in (1,2)\text{,}\\ {} \frac{1}{\log N}& \text{if}\hspace{2.5pt}\alpha =1\text{,}\\ {} 1-\alpha & \text{if}\hspace{2.5pt}\alpha \in (0,1)\text{.}\end{array}\right.\]

For $\alpha >2$ these results coincide with Proposition 7 of [15] with $\beta =0$, for $\alpha =2$ with Proposition 9 of [15], for $\alpha \in (1,2)$ with Lemma 4 and Proposition 5 of [15] with $\beta =0$, for $\alpha =1$ with Proposition 6 of [15] with $\beta =0$, and for $\alpha \in (0,1)$ with Theorem 3 of [15] with $\beta =0$.

The Pareto example is easily generalized in various ways by replacing $\ell \equiv 1$ by some other slowly varying function. For example, choosing for ℓ (a power of) the logarithm leads to the following example.

Example 2.

Fix $\alpha \ge 0$ and assume that X has tail behavior $\mathbb{P}(X>x)\sim {x^{-\alpha }}\ell (x)$ as $x\to \infty $ with $\ell (x):=c{(\log x)^{\beta -1}}$, $x>1$, for some constants $c>0$ and $\beta >0$. This example includes the Pareto model ($c=\beta =1$). Clearly, (6) holds, since ℓ slowly varies at ∞. By Theorem 1, for $\alpha \ge 2$ the model is in the domain of attraction of the Kingman coalescent, for $\alpha \in [1,2)$ in the domain of attraction of the $\beta (2-\alpha ,\alpha )$-coalescent, for $\alpha \in (0,1)$ in the domain of attraction of the discrete-time Poisson–Dirichlet coalescent with parameter α, and for $\alpha =0$ in the domain of attraction of the discrete-time star-shaped coalescent. Note that

\[ {\ell ^{\ast }}(x)\hspace{2.5pt}=\hspace{2.5pt}{\int _{1}^{x}}\frac{\ell (t)}{t}\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}c{\int _{1}^{x}}\frac{{(\log t)^{\beta -1}}}{t}\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}\frac{c}{\beta }{(\log x)^{\beta }},\hspace{2em}x\to \infty .\]

The asymptotics of the coalescence probability ${c_{N}}$ as $N\to \infty $ can hence be obtained from the formulas provided in Theorem 1. In particular, for $\alpha >1$ the asymptotics of ${c_{N}}$ depends on the concrete value of $\mu :=\mathbb{E}(X)$. For $\alpha =1$ the asymptotics of ${c_{N}}$ is obtained as follows. The sequence ${({a_{N}})_{N\in \mathbb{N}}}$, defined via ${a_{1}}:=1$ and ${a_{N}}:=(c/\beta )N{(\log N)^{\beta }}$ for $N\in \mathbb{N}\setminus \{1\}$, satisfies ${\ell ^{\ast }}({a_{N}})\sim (c/\beta ){(\log {a_{N}})^{\beta }}\sim (c/\beta ){(\log N)^{\beta }}={a_{N}}/N$ as $N\to \infty $. By Theorem 1 (iv), the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \ell ({a_{N}})/{\ell ^{\ast }}({a_{N}})\sim \beta /\log N$ as $N\to \infty $.

For illustration three examples with discrete X are provided.

Example 3 (Yule–Simon distribution).

Let X be Yule–Simon distributed [28, 38] with parameter $\alpha >0$ having distribution $\mathbb{P}(X=k)=\alpha \mathrm{B}(\alpha +1,k)=\alpha \Gamma (\alpha +1)\Gamma (k)/\Gamma (\alpha +1+k)$, $k\in \mathbb{N}$, where $\mathrm{B}(.,.)$ and $\Gamma (.)$ denote the beta and the gamma function respectively. It is easily checked that $\mathbb{P}(X>k)=\Gamma (\alpha +1)\Gamma (k+1)/\Gamma (k+\alpha +1)$, $k\in {\mathbb{N}_{0}}$. In particular, $\mathbb{P}(X>x)\sim \Gamma (\alpha +1){x^{-\alpha }}$ as $x\to \infty $. Thus, (6) holds with $\ell \equiv \Gamma (\alpha +1)$. Note that $\mathbb{E}({(X)_{k}})<\infty $ if and only if $k<\alpha $ and in this case $\mathbb{E}({(X)_{k}})=\alpha k!\mathrm{B}(\alpha -k,k)$. In particular, $\mu =\mathbb{E}(X)=\alpha /(\alpha -1)$ for $\alpha >1$ and $\mathbb{E}({(X)_{2}})=2\alpha /((\alpha -1)(\alpha -2))$ for $\alpha >2$, which yields $\rho =\mathbb{E}({X^{2}})={\alpha ^{2}}/((\alpha -1)(\alpha -2))$ for $\alpha >2$. By Theorem 1, for $\alpha \ge 2$ the model is in the domain of attraction of the Kingman coalescent, for $\alpha \in [1,2)$ in the domain of attraction of the $\beta (2-\alpha ,\alpha )$-coalescent, and for $\alpha \in (0,1)$ in the domain of attraction of the discrete-time Poisson–Dirichlet coalescent with parameter α. Note that ${\ell ^{\ast }}(x)=\Gamma (\alpha +1){\textstyle\int _{1}^{x}}1/t\hspace{0.1667em}\mathrm{d}t=\Gamma (\alpha +1)\log x$, $x>1$. In part (iv) of Theorem 1 we can thus choose ${a_{N}}:=\Gamma (\alpha +1)N\log N$ and obtain ${c_{N}}\sim \ell ({a_{N}})/{\ell ^{\ast }}({a_{N}})=1/\log {a_{N}}\sim 1/\log N$ as $N\to \infty $. Thus, by Theorem 1, the coalescence probability ${c_{N}}$ satisfies

\[ {c_{N}}\hspace{2.5pt}\sim \hspace{2.5pt}\left\{\begin{array}{c@{\hskip10.0pt}l}\frac{\rho }{{\mu ^{2}}N}\hspace{2.5pt}=\hspace{2.5pt}\frac{\alpha -1}{(\alpha -2)N}& \text{if}\hspace{2.5pt}\alpha >2\text{,}\\ {} \frac{2{\ell ^{\ast }}(N)}{{\mu ^{2}}N}\hspace{2.5pt}=\hspace{2.5pt}\frac{\log N}{N}& \text{if}\hspace{2.5pt}\alpha =2\text{,}\\ {} \frac{\Gamma (2-\alpha ){(\Gamma (\alpha +1))^{2}}}{{\mu ^{\alpha }}{N^{\alpha -1}}}& \text{if}\hspace{2.5pt}\alpha \in (1,2)\text{,}\\ {} \frac{1}{\log N}& \text{if}\hspace{2.5pt}\alpha =1\text{,}\\ {} 1-\alpha & \text{if}\hspace{2.5pt}\alpha \in (0,1)\text{.}\end{array}\right.\]

The Yule–Simon model is a discrete analog of the Pareto model discussed in Example 1. We refer the reader to Kozubowski and Podgórski [28] for some further information on Sibuya and Yule–Simon distributions.

Example 4 (Sibuya distribution).

Let X be Sibuya distributed with parameter $\alpha \in (0,1)$ having probability generating function $f(s)=1-{(1-s)^{\alpha }}$, $s\in [0,1]$. Note that $f(s)={\textstyle\sum _{k=1}^{\infty }}{(-1)^{k-1}}\left(\genfrac{}{}{0.0pt}{}{\alpha }{k}\right){s^{k}}$, so X takes the value $k\in \mathbb{N}$ with probability $\mathbb{P}(X=k)={(-1)^{k-1}}\left(\genfrac{}{}{0.0pt}{}{\alpha }{k}\right)=\alpha \Gamma (k-\alpha )/(\Gamma (1-\alpha )k!)$. The Laplace transform ψ of X satisfies $1-\psi (u)=1-f({e^{-u}})={(1-{e^{-u}})^{\alpha }}\sim {u^{\alpha }}$ as $u\to 0$, i.e. relation (2.1) of Bingham and Doney [3] holds with $n=0$, $\beta =\alpha \in (0,1)$ and $L\equiv 1$. By Theorem A of [3] this relation is equivalent (see Eq. (2.3b) of [3]) to $\mathbb{P}(X>x)\sim {(\Gamma (1-\alpha ))^{-1}}{x^{-\alpha }}$ as $x\to \infty $, which shows that (6) holds with $\ell \equiv 1/\Gamma (1-\alpha )$. Part (v) of Theorem 1 ensures that the model is in the domain of attraction of the Poisson–Dirichlet coalescent with parameter α and the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\to 1-\alpha $ as $N\to \infty $. The same results are valid when X is α-stable, $\alpha \in (0,1)$, with Laplace transform $\psi (u):={e^{-{u^{\alpha }}}}$, $u\ge 0$, since in this case the same asymptotics $1-\psi (u)\sim {u^{\alpha }}$ as $u\to 0$ holds. In this sense the Sibuya example is a discrete version of the α-stable case with $\alpha \in (0,1)$.

Example 5.

Let $\alpha \in (1,2)$ and $b\in (0,1/(\alpha -1)]$. Assume that X has probability generating function $f(s)=(b+1)s+b({(1-s)^{\alpha }}-1)$, $s\in [0,1]$. Note that X is discrete taking values in $\mathbb{N}$ with probabilities ${p_{k}}:=\mathbb{P}(X=k)$, $k\in \mathbb{N}$, given by ${p_{1}}=b+1-b\alpha $ and ${p_{k}}=b{(-1)^{k}}\left(\genfrac{}{}{0.0pt}{}{\alpha }{k}\right)=b\Gamma (k-\alpha )/(\Gamma (-\alpha )k!)$ for $k\in \{2,3,\dots \}$. From ${f^{\prime }}(s)=b+1-b\alpha {(1-s)^{\alpha -1}}$ it follows that $\mu :=\mathbb{E}(X)={f^{\prime }}(1)=b+1$. The Laplace transform ψ of X satisfies $\psi (u)-1+(b+1)u\sim b{u^{\alpha }}$ as $u\to 0$, i.e. relation (2.1) of Bingham and Doney [3] holds with $n=1$, $\beta =\alpha -1\in (0,1)$, and $L\equiv b$. By Theorem A of [3] this relation is equivalent (see Eq. (2.3b) of [3]) to $\mathbb{P}(X>x)\sim b{(-\Gamma (1-\alpha ))^{-1}}{x^{-\alpha }}$ as $x\to \infty $, which shows that (6) holds with $\ell (x)\equiv b/(-\Gamma (1-\alpha ))$. By Theorem 1 (iii) the model is in the domain of attraction of the $\beta (2-\alpha ,\alpha )$-coalescent and ${c_{N}}\sim (\alpha -1)\Gamma (\alpha +1)b/({\mu ^{\alpha }}{N^{\alpha -1}})$ as $N\to \infty $.

We close this section with a concrete example belonging to the boundary case (vi) ($\alpha =0$).

Example 6.

Let $\beta >0$. If $\mathbb{P}(X>x)=1/{(1+\log x)^{\beta }}$, $x\ge 1$, then $\mathbb{P}(X>x)\sim \ell (x)$ as $x\to \infty $ with $\ell (x):=1/{(\log x)^{\beta }}$. By Theorem 1 (vi), the model is in the domain of attraction of the discrete-time star-shaped coalescent and ${c_{N}}\to 1$ as $N\to \infty $.

4 Proofs

The following auxiliary result (Lemma 1) is a modified version of Lemma 5 of Schweinsberg [37], adapted to our model. The result may be also viewed as a weak version of Cramér’s large deviation theorem (see, for example, [10, Theorem 2.2.3]). Recall that $\mu :=\mathbb{E}(X)\in (0,\infty ]$.

Lemma 1.

For every $a\in (0,\mu )$ there exists $q\in (0,1)$ such that $\mathbb{P}({S_{N}}\le aN)\le {q^{N}}$ for all $N\in \mathbb{N}$.

Proof.

Let f denote the moment generating function of $Y:=X/a$, i.e. $f(x):=\mathbb{E}({x^{Y}})$, $x\in [0,1]$. From $\mathbb{E}({x^{{S_{N}}/a}})\ge {\textstyle\int _{\{{S_{N}}\le aN\}}}{x^{{S_{N}}/a}}\hspace{0.1667em}\mathrm{d}\mathbb{P}\ge {x^{N}}\mathbb{P}({S_{N}}\le aN)$ it follows that $\mathbb{P}({S_{N}}\le aN)\le {x^{-N}}\mathbb{E}({x^{{S_{N}}/a}})={({x^{-1}}f(x))^{N}}$ for all $x\in (0,1]$. Since $f(1)=1$ and ${f^{\prime }}(1)=\mathbb{E}(Y)=\mu /a>1$, there exists ${x_{0}}\in (0,1)$ such that $f({x_{0}})<{x_{0}}$. The result follows with $q:={x_{0}^{-1}}f({x_{0}})$. □

We now prove part (i) of Theorem 1.

Poof of Theorem 1 (i).

We first verify that $N{c_{N}}\to \rho /{\mu ^{2}}$ as $N\to \infty $. We have

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle N{c_{N}}\hspace{2.5pt}=\hspace{2.5pt}{N^{2}}\mathbb{E}({W_{1}^{2}})& \displaystyle =& \displaystyle {N^{2}}{\int _{(0,\infty )}}\mathbb{E}\bigg({\bigg(\frac{x}{x+{S_{N-1}}}\bigg)^{2}}\bigg)\hspace{0.1667em}{\mathbb{P}_{X}}(\mathrm{d}x)\\ {} & \displaystyle =& \displaystyle {\int _{(0,\infty )}}{f_{N}}(x)\hspace{0.1667em}{\mathbb{P}_{X}}(\mathrm{d}x),\end{array}\]

where ${f_{N}}(x):=\mathbb{E}({(x/(x/N+{S_{N-1}}/N))^{2}})$. By the law of large numbers, ${(x/(x/N+{S_{N-1}}/N))^{2}}\to {(x/\mu )^{2}}$ almost surely and, hence, also in distribution as $N\to \infty $. For any $r>0$ the map $x\mapsto x\wedge r$ is bounded and continuous on $[0,\infty )$. Thus,

\[ \underset{N\to \infty }{\liminf }{f_{N}}(x)\hspace{2.5pt}\ge \hspace{2.5pt}\underset{N\to \infty }{\liminf }\mathbb{E}\bigg({\bigg(\frac{x}{\frac{x}{N}+\frac{{S_{N-1}}}{N}}\bigg)^{2}}\wedge r\bigg)\hspace{2.5pt}=\hspace{2.5pt}{(x/\mu )^{2}}\wedge r.\]

Letting $r\to \infty $ yields ${\liminf _{N\to \infty }}{f_{N}}(x)\ge {(x/\mu )^{2}}$. Therefore, by Fatou’s lemma,

\[ \underset{N\to \infty }{\liminf }N{c_{N}}\hspace{2.5pt}=\hspace{2.5pt}\underset{N\to \infty }{\liminf }{\int _{(0,\infty )}}{f_{N}}(x)\hspace{0.1667em}{\mathbb{P}_{X}}(\mathrm{d}x)\hspace{2.5pt}\ge \hspace{2.5pt}{\int _{(0,\infty )}}{(x/\mu )^{2}}\hspace{0.1667em}{\mathbb{P}_{X}}(\mathrm{d}x)\hspace{2.5pt}=\hspace{2.5pt}\frac{\rho }{{\mu ^{2}}}.\]

In order to see that ${\limsup _{N\to \infty }}N{c_{N}}\le \rho /{\mu ^{2}}$ fix $a\in (0,\mu )$. By Lemma 1 there exists $q\in (0,1)$ such that $\mathbb{P}({S_{N}}\le aN)\le {q^{N}}$ for all $N\in \mathbb{N}$. Therefore,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle N{c_{N}}& \displaystyle =& \displaystyle {N^{2}}\mathbb{E}({W_{1}^{2}})\hspace{2.5pt}=\hspace{2.5pt}{N^{2}}\mathbb{E}({W_{1}^{2}}{1_{\{{S_{N}}\le aN\}}})+{N^{2}}\mathbb{E}({({X_{1}}/{S_{N}})^{2}}{1_{\{{S_{N}}>aN\}}})\\ {} & \displaystyle \le & \displaystyle {N^{2}}\mathbb{P}({S_{N}}\le aN)+{N^{2}}\mathbb{E}(({({X_{1}}/(aN))^{2}})\hspace{2.5pt}\le \hspace{2.5pt}{N^{2}}{q^{N}}+\frac{\rho }{{a^{2}}}\hspace{2.5pt}\to \hspace{2.5pt}\frac{\rho }{{a^{2}}}\end{array}\]

as $N\to \infty $. Thus, ${\limsup _{N\to \infty }}N{c_{N}}\le \rho /{a^{2}}$. Letting $a\uparrow \mu $ shows that ${\limsup _{N\to \infty }}N{c_{N}}\le \rho /{\mu ^{2}}$ and $N{c_{N}}\to \rho /{\mu ^{2}}$ is established.

It is well known (see [29, Section 4]) that any sequence of Cannings models with population sizes N is in the domain of attraction of the Kingman coalescent if and only if ${\Phi _{1}^{(N)}}(3)/{c_{N}}\to 0$ as $N\to \infty $. Thus, we have to verify that $\mathbb{E}({W_{1}^{3}})/\mathbb{E}({W_{1}^{2}})\to 0$ as $N\to \infty $. Since $\mathbb{E}({W_{1}^{2}})\ge {(\mathbb{E}({W_{1}}))^{2}}=1/{N^{2}}$ it suffices to verify that ${N^{2}}\mathbb{E}({W_{1}^{3}})\to 0$ as $N\to \infty $. Fix again $a\in (0,\mu )$ and choose $q\in (0,1)$ as above. We have

\[ {N^{2}}\mathbb{E}({W_{1}^{3}})\hspace{2.5pt}=\hspace{2.5pt}{N^{2}}\mathbb{E}({W_{1}^{3}}{1_{\{{S_{N}}\le aN\}}})+{N^{2}}\mathbb{E}({W_{1}^{3}}{1_{\{{S_{N}}>aN\}}}).\]

Since ${N^{2}}\mathbb{E}({W_{1}^{3}}{1_{\{{S_{N}}\le aN\}}})\le {N^{2}}\mathbb{P}({S_{N}}\le aN)\le {N^{2}}{q^{N}}\to 0$ as $N\to \infty $ it remains to verify that ${N^{2}}\mathbb{E}({W_{1}^{3}}{1_{\{{S_{N}}>aN\}}})\to 0$ as $N\to \infty $. We have

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle {N^{2}}\mathbb{E}({W_{1}^{3}}{1_{\{{S_{N}}>aN\}}})& \displaystyle =& \displaystyle {N^{2}}\mathbb{E}({X_{1}^{3}}{S_{N}^{-3}}{1_{\{{S_{N}}>aN,{X_{1}}\le aN\}}})\\ {} & & \displaystyle \hspace{28.45274pt}+{N^{2}}\mathbb{E}({W_{1}^{3}}{1_{\{{S_{n}}>aN,{X_{1}}>aN\}}})\\ {} & \displaystyle \le & \displaystyle \frac{1}{{a^{3}}N}\mathbb{E}({X^{3}}{1_{\{X\le aN\}}})+{N^{2}}\mathbb{P}(X>aN).\end{array}\]

Clearly, ${N^{2}}\mathbb{P}(X>aN)\le {a^{-2}}\mathbb{E}({X^{2}}{1_{\{X>aN\}}})\to 0$ as $N\to \infty $, since $\rho :=\mathbb{E}({X^{2}})<\infty $. It hence remains to verify that ${N^{-1}}\mathbb{E}({X^{3}}{1_{\{X\le aN\}}})\to 0$ as $N\to \infty $. Let $\varepsilon >0$. Choose L sufficiently large such that $\mathbb{E}({X^{2}}{1_{\{X>L\}}})\le \varepsilon /(2a)$. Then, for all $N\in \mathbb{N}$ with $N\ge 2\rho L/\varepsilon $,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle {N^{-1}}\mathbb{E}({X^{3}}{1_{\{X\le aN\}}})& \displaystyle =& \displaystyle {N^{-1}}\mathbb{E}({X^{3}}{1_{\{X\le aN,X\le L\}}})+{N^{-1}}\mathbb{E}({X^{3}}{1_{\{L<X\le aN\}}})\\ {} & \displaystyle \le & \displaystyle {N^{-1}}L\rho +a\mathbb{E}({X^{2}}{1_{\{X>L\}}})\hspace{2.5pt}\le \hspace{2.5pt}\frac{\varepsilon }{2}+\frac{\varepsilon }{2}\hspace{2.5pt}=\hspace{2.5pt}\varepsilon ,\end{array}\]

which shows that ${N^{-1}}\mathbb{E}({X^{3}}{1_{\{X\le aN\}}})\to 0$ as $N\to \infty $. □

We now prepare the proofs of the parts (ii) and (iii) of Theorem 1. We need the following two auxiliary results.

Lemma 2.

If (6) holds for some $\alpha \ge 0$ then for all $p>\alpha $,

\[ \mathbb{E}\bigg({\bigg(\frac{X}{X+x}\bigg)^{p}}\bigg)\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\Gamma (\alpha +1)\Gamma (p-\alpha )}{\Gamma (p)}{x^{-\alpha }}\ell (x),\hspace{2em}x\to \infty ,\]

and

\[ \mathbb{E}\bigg({\bigg(\frac{X}{X\vee x}\bigg)^{p}}\bigg)\hspace{2.5pt}\sim \hspace{2.5pt}\frac{p}{p-\alpha }{x^{-\alpha }}\ell (x),\hspace{2em}x\to \infty .\]

Proof.

Let T be a nonnegative random variable and $f:[0,\infty )\to \mathbb{R}$ be a continuous and piecewise continuously differentiable function such that $f(T)$ is integrable. Then,

(8)

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle \mathbb{E}(f(T))-f(0)\\ {} & & \displaystyle \hspace{2em}={\int _{[0,\infty )}}\big(f(x)-f(0)\big)\hspace{0.1667em}{\mathbb{P}_{T}}(\mathrm{d}x)\hspace{2.5pt}=\hspace{2.5pt}{\int _{[0,\infty )}}{\int _{[0,x)}}{f^{\prime }}(t)\hspace{0.1667em}\lambda (\mathrm{d}t)\hspace{0.1667em}{\mathbb{P}_{T}}(\mathrm{d}x)\\ {} & & \displaystyle \hspace{2em}={\int _{[0,\infty )}}{f^{\prime }}(t){\int _{(t,\infty )}}{\mathbb{P}_{T}}(\mathrm{d}x)\hspace{0.1667em}\lambda (\mathrm{d}t)\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{\infty }}{f^{\prime }}(t)\hspace{0.1667em}\mathbb{P}(T>t)\hspace{0.1667em}\mathrm{d}t.\end{array}\]

Let $x>0$. Applying (8) to $T:=X/x$ and $f(t):={(t/(t+1))^{p}}$ shows that

\[ \mathbb{E}\bigg({\bigg(\frac{X}{X+x}\bigg)^{p}}\bigg)\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{\infty }}\frac{p{t^{p-1}}}{{(t+1)^{p+1}}}\mathbb{P}(X>xt)\hspace{0.1667em}\mathrm{d}t.\]

By Theorem 3 of Karamata [22], applied to the function $\varphi (x):=\mathbb{P}(X>x)$, which is regularly varying at ∞ with index $\gamma :=-\alpha $, it follows that, as $x\to \infty $,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}\bigg({\bigg(\frac{X}{X+x}\bigg)^{p}}\bigg)& \displaystyle \sim & \displaystyle \mathbb{P}(X>x){\int _{0}^{\infty }}\frac{p{t^{p-1}}}{{(t+1)^{p+1}}}{t^{-\alpha }}\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle =& \displaystyle \mathbb{P}(X>x)\frac{\Gamma (\alpha +1)\Gamma (p-\alpha )}{\Gamma (p)}.\end{array}\]

The same steps, but applied to $f(t):={(t/(t\vee 1))^{p}}$, show that

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}\bigg({\bigg(\frac{X}{X\vee x}\bigg)^{p}}\bigg)& \displaystyle =& \displaystyle {\int _{0}^{\infty }}{f^{\prime }}(t)\mathbb{P}(X>xt)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}\mathbb{P}(X>x){\int _{0}^{\infty }}{f^{\prime }}(t){t^{-\alpha }}\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle =& \displaystyle \mathbb{P}(X>x){\int _{0}^{1}}p{t^{p-\alpha -1}}\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}\mathbb{P}(X>x)\frac{p}{p-\alpha }.\end{array}\]

□

Lemma 3.

For all $j\in \{1,\dots ,N\}$ and ${p_{1}},\dots ,{p_{j}}>0$,

(9)

\[ \mathbb{E}({W_{1}^{{p_{1}}}}\cdots {W_{j}^{{p_{j}}}})\hspace{2.5pt}=\hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{0}^{\infty }}{u^{p-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u,\]

where $p:={p_{1}}+\cdots +{p_{j}}$ and ${S_{0}}:=0$. Moreover, for any fixed $j\in \mathbb{N}$ the asymptotics of the latter integral as $N\to \infty $ is determined by the values of u close to 0, i.e. for any fixed $j\in \mathbb{N}$ and $\delta >0$, as $N\to \infty $,

(10)

\[ \mathbb{E}({W_{1}^{{p_{1}}}}\cdots {W_{j}^{{p_{j}}}})\hspace{2.5pt}\sim \hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{0}^{\delta }}{u^{p-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u.\]

In particular, for any fixed $\delta >0$,

(11)

\[ \frac{1}{N}\hspace{2.5pt}=\hspace{2.5pt}\mathbb{E}({W_{1}})\hspace{2.5pt}\sim \hspace{2.5pt}{\int _{0}^{\delta }}\mathbb{E}(X{e^{-uX}})\mathbb{E}({e^{-u{S_{N-1}}}})\hspace{0.1667em}\mathrm{d}u,\hspace{2em}N\to \infty .\]

Remark 3.

The fundamental relation (9) is well known from several references (see, for example, Cortines [9, Proposition 4.4] or Huillet [15]).

Proof.

Let $j\in \{1,\dots ,N\}$ and ${p_{1}},\dots ,{p_{j}}>0$. From the representation ${S_{N}^{-p}}={(\Gamma (p))^{-1}}{\textstyle\int _{0}^{\infty }}{u^{p-1}}{e^{-u{S_{N}}}}\hspace{0.1667em}\mathrm{d}u$ it follows that

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}({W_{1}^{{p_{1}}}}\cdots {W_{j}^{{p_{j}}}})& \displaystyle =& \displaystyle \mathbb{E}({X_{1}^{{p_{1}}}}\cdots {X_{j}^{{p_{j}}}}{S_{N}^{-p}})\\ {} & \displaystyle =& \displaystyle \frac{1}{\Gamma (p)}{\int _{0}^{\infty }}{u^{p-1}}\mathbb{E}({X_{1}^{{p_{1}}}}\cdots {X_{j}^{{p_{j}}}}{e^{-u{S_{N}}}})\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle =& \displaystyle \frac{1}{\Gamma (p)}{\int _{0}^{\infty }}{u^{p-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u,\end{array}\]

which is (9). To check (10) fix $j\in \mathbb{N}$ and $\delta >0$ and let ψ denote the Laplace transform of X. Decompose $\mathbb{E}({W_{1}^{{p_{1}}}}\cdots {W_{j}^{{p_{j}}}})={A_{N}}+{B_{N}}$ with

\[ {A_{N}}\hspace{2.5pt}:=\hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{0}^{\delta }}{u^{p-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u\]

and

\[ {B_{N}}\hspace{2.5pt}:=\hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{\delta }^{\infty }}{u^{p-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u.\]

The map $u\mapsto \mathbb{E}({e^{-u{S_{N-1}}}})$ is nonincreasing on $[0,\infty )$. Thus,

\[ {B_{N}}\hspace{2.5pt}\le \hspace{2.5pt}\mathbb{E}({e^{-\delta {S_{N-j}}}})\frac{1}{\Gamma (p)}{\int _{\delta }^{\infty }}{u^{p-1}}{\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u\hspace{2.5pt}=\hspace{2.5pt}{c_{1}}{(\psi (\delta ))^{N-j}}\]

and

\[ {A_{N}}\hspace{2.5pt}\ge \hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{0}^{\delta /2}}{u^{p-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u\hspace{2.5pt}\ge \hspace{2.5pt}{c_{2}}{(\psi (\delta /2))^{N-j}}\]

with constants

\[ {c_{1}}\hspace{2.5pt}:=\hspace{2.5pt}{c_{1}}({p_{1}},\dots ,{p_{j}},\delta )\hspace{2.5pt}:=\hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{\delta }^{\infty }}{u^{p-1}}{\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u\]

and

\[ {c_{2}}\hspace{2.5pt}:=\hspace{2.5pt}{c_{2}}({p_{1}},\dots ,{p_{j}},\delta )\hspace{2.5pt}:=\hspace{2.5pt}\frac{1}{\Gamma (p)}{\int _{0}^{\delta /2}}{u^{p-1}}{\prod \limits_{i=1}^{j}}\mathbb{E}({X^{{p_{i}}}}{e^{-uX}})\hspace{0.1667em}\mathrm{d}u.\]

Note that $0<{c_{1}},{c_{2}}<\infty $ and that ${c_{1}}$ and ${c_{2}}$ do not depend on N. Thus,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle 1\hspace{2.5pt}\le \hspace{2.5pt}\frac{\mathbb{E}({W_{1}^{{p_{1}}}}\cdots {W_{j}^{{p_{j}}}})}{{A_{N}}}& \displaystyle =& \displaystyle 1+\frac{{B_{N}}}{{A_{N}}}\hspace{2.5pt}\le \hspace{2.5pt}1+\frac{{c_{1}}{(\psi (\delta ))^{N-j}}}{{c_{2}}{(\psi (\delta /2))^{N-j}}}\\ {} & \displaystyle =& \displaystyle 1+\frac{{c_{1}}}{{c_{2}}}{\bigg(\frac{\psi (\delta )}{\psi (\delta /2)}\bigg)^{N-j}}\hspace{2.5pt}\to \hspace{2.5pt}1\end{array}\]

as $N\to \infty $, since $\psi (\delta /2)>\psi (\delta )$. Eq. (11) follows by choosing $j:=1$ and ${p_{1}}:=1$ in (10). □

We now turn to the proofs of the parts (ii) and (iii) of Theorem 1. We first consider part (iii) ($1<\alpha <2$). The boundary case $\alpha =2$ (part (ii) of Theorem 1) will be studied afterwards.

Proof of Theorem 1 (iii).

The idea of the proof is to apply the general convergence result [32, Theorem 2.1]. Having (3) in mind the main task is to derive the asymptotics of the moments of ${W_{1}}$ or, more generally, the asymptotics of the joint moments of the random variables ${W_{1}},\dots ,{W_{j}}$ as $N\to \infty $. The following proof is based on Schweinsberg’s [37] method. We first verify that

(12)

\[ \underset{N\to \infty }{\lim }\frac{{(\mu N)^{\alpha }}}{\ell (N)}\mathbb{E}({W_{1}^{k}})\hspace{2.5pt}=\hspace{2.5pt}\alpha \mathrm{B}(k-\alpha ,\alpha ),\hspace{2em}k\in \mathbb{N}\setminus \{1\}.\]

For all $\lambda >\mu :=\mathbb{E}(X)$, by the law of large numbers, $\mathbb{P}({S_{N-1}}\le \lambda N)\to 1$ as $N\to \infty $. Thus,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}({W_{1}^{k}})& \displaystyle \ge & \displaystyle \mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le \lambda N\}}})\\ {} & \displaystyle \ge & \displaystyle \mathbb{E}\bigg({\bigg(\frac{{X_{1}}}{{X_{1}}+\lambda N}\bigg)^{k}}\bigg)\mathbb{P}({X_{2}}+\cdots +{X_{N}}\le \lambda N)\\ {} & \displaystyle \sim & \displaystyle \mathbb{E}\bigg({\bigg(\frac{X}{X+\lambda N}\bigg)^{k}}\bigg)\hspace{2.5pt}\sim \hspace{2.5pt}\alpha \mathrm{B}(k-\alpha ,\alpha )\frac{\ell (N)}{{(\lambda N)^{\alpha }}},\hspace{2em}N\to \infty ,\end{array}\]

where the last asymptotics holds by Lemma 2, since ℓ is slowly varying at ∞. Multiplication with ${N^{\alpha }}/\ell (N)$ and taking lim inf shows that

\[ \underset{N\to \infty }{\liminf }\frac{{N^{\alpha }}}{\ell (N)}\mathbb{E}({W_{1}^{k}})\hspace{2.5pt}\ge \hspace{2.5pt}\alpha \mathrm{B}(k-\alpha ,\alpha )/{\lambda ^{\alpha }}.\]

Letting $\lambda \downarrow \mu $ it follows that ${\liminf _{N\to \infty }}{N^{\alpha }}/\ell (N)\mathbb{E}({W_{1}^{k}})\ge \alpha \mathrm{B}(k-\alpha ,\alpha )/{\mu ^{\alpha }}$.

To handle the lim sup, fix $a\in (0,\mu )$ and decompose

\[ \mathbb{E}({W_{1}^{k}})\hspace{2.5pt}=\hspace{2.5pt}\mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le aN\}}})+\mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}}).\]

From Lemma 1 it follows that there exists ${N_{0}}\in \mathbb{N}$ and $q\in (0,1)$ such that $\mathbb{P}({S_{N-1}}\le aN)\le {q^{N}}$ for all $N>{N_{0}}$. Thus, $\mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le aN\}}})\le \mathbb{P}({X_{2}}+\cdots +{X_{N}}\le aN)=\mathbb{P}({S_{N-1}}\le aN)\le {q^{N}}$ for all $N\in \mathbb{N}$ with $N>{N_{0}}$. It hence suffices to verify that

(13)

\[ \underset{N\to \infty }{\limsup }\frac{{(\mu N)^{\alpha }}}{\ell (N)}\mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}})\hspace{2.5pt}=\hspace{2.5pt}\alpha \mathrm{B}(k-\alpha ,\alpha ).\]

In order to see this, let $\lambda \in (a,\mu )$ and decompose

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle \mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}})\\ {} & & \displaystyle \hspace{2em}=\mathbb{E}({W_{1}^{k}}{1_{\{aN<{X_{2}}+\cdots +{X_{N}}\le \lambda N\}}})+\mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}>\lambda N\}}})\\ {} & & \displaystyle \hspace{2em}\le \mathbb{E}\bigg({\bigg(\frac{{X_{1}}}{{X_{1}}+aN}\bigg)^{k}}\bigg)\mathbb{P}({S_{N-1}}\le \lambda N)\\ {} & & \displaystyle \hspace{2em}\hspace{2em}+\mathbb{E}\bigg({\bigg(\frac{{X_{1}}}{{X_{1}}+\lambda N}\bigg)^{k}}\bigg)\mathbb{P}({S_{N-1}}>\lambda N).\end{array}\]

The two expectations on the right hand side are both $O(\ell (N)/{N^{\alpha }})$ by Lemma 2. Moreover, $\mathbb{P}({S_{N-1}}\le \lambda N)\to 0$ and $\mathbb{P}({S_{N-1}}>\lambda N)\to 1$ as $N\to \infty $. Therefore, only the last term contributes to the lim sup, and we obtain

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle \underset{N\to \infty }{\limsup }\frac{{N^{\alpha }}}{\ell (N)}\mathbb{E}({W_{1}^{k}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}})\\ {} & & \displaystyle \hspace{2em}\le \underset{N\to \infty }{\limsup }\frac{{N^{\alpha }}}{\ell (N)}\mathbb{E}\bigg({\bigg(\frac{{X_{1}}}{{X_{1}}+\lambda N}\bigg)^{k}}\bigg)\mathbb{P}({S_{N-1}}>\lambda N)\\ {} & & \displaystyle \hspace{2em}\sim \frac{{N^{\alpha }}}{\ell (N)}\alpha \mathrm{B}(k-\alpha ,\alpha )\frac{\ell (\lambda N)}{{(\lambda N)^{\alpha }}}\hspace{2.5pt}=\hspace{2.5pt}\alpha \mathrm{B}(k-\alpha ,\alpha )/{\lambda ^{\alpha }}.\end{array}\]

Letting $\lambda \uparrow \mu $ shows that (13) holds. Thus, (12) is established.

Choosing $k=2$ in (12) yields the asymptotic formula for the coalescence probability ${c_{N}}=N\mathbb{E}({W_{1}^{2}})$ stated in Theorem 1 (iii). In particular, ${c_{N}}=O(\ell (N)/{N^{\alpha -1}})$. In summary we conclude that

\[ \frac{{\Phi _{1}^{(N)}}(k)}{{c_{N}}}\hspace{2.5pt}=\hspace{2.5pt}\frac{\mathbb{E}({W_{1}^{k}})}{\mathbb{E}({W_{1}^{2}})}\hspace{2.5pt}\to \hspace{2.5pt}\frac{\Gamma (k-\alpha )}{\Gamma (k)\Gamma (2-\alpha )}\hspace{2.5pt}=\hspace{2.5pt}{\int _{(0,1)}}{x^{k-2}}\Lambda (\mathrm{d}x),\hspace{2em}N\to \infty ,\]

where $\Lambda :=\beta (2-\alpha ,\alpha )$ denotes the beta distribution with parameters $2-\alpha $ and α. Moreover,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}({W_{1}^{2}}{W_{2}^{2}}{1_{\{{S_{N}}>aN\}}})& \displaystyle \le & \displaystyle \mathbb{E}\bigg(\frac{{X_{1}^{2}}{X_{2}^{2}}}{{({X_{1}}\vee aN)^{2}}{({X_{2}}\vee aN)^{2}}}\bigg)\\ {} & \displaystyle =& \displaystyle \bigg(\mathbb{E}{\bigg(\frac{{X^{2}}}{{(X\vee aN)^{2}}}\bigg)\bigg)^{2}}\hspace{2.5pt}\sim \hspace{2.5pt}{\bigg(\frac{2}{2-\alpha }\frac{\ell (aN)}{{(aN)^{\alpha }}}\bigg)^{2}}\\ {} & \displaystyle =& \displaystyle O\bigg(\frac{{(\ell (N))^{2}}}{{N^{2\alpha }}}\bigg).\end{array}\]

Since ${c_{N}}\ge K\ell (N)/{N^{\alpha -1}}$ for some $K>0$ it follows that ${\Phi _{2}^{(N)}}(2,2)/{c_{N}}=O(\ell (N)/{N^{\alpha -1}})=O({c_{N}})\to 0$ as $N\to \infty $. Thus, for all $j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}\setminus \{1\}$, ${\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})/{c_{N}}\le {\Phi _{2}^{(N)}}(2,2)/{c_{N}}\to 0$ as $N\to \infty $. By [32, Theorem 2.1], the model is in the domain of attraction of the $\beta (2-\alpha ,\alpha )$-coalescent. □

We now turn to the boundary case $\alpha =2$, so we prove part (ii) of Theorem 1.

Proof of Theorem 1 (ii).

For all $x>0$,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle \mathbb{E}({X^{2}}{1_{\{X\le x\}}})\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{\infty }}\mathbb{P}({X^{2}}{1_{\{X\le x\}}}>y)\hspace{0.1667em}\mathrm{d}y\\ {} & & \displaystyle \hspace{1em}={\int _{0}^{\infty }}2t\mathbb{P}({X^{2}}{1_{\{X\le x\}}}>{t^{2}})\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{x}}2t\mathbb{P}(t<X\le x)\hspace{0.1667em}\mathrm{d}t\\ {} & & \displaystyle \hspace{1em}={\int _{0}^{x}}2t\big(\mathbb{P}(X>t)-\mathbb{P}(X>x)\big)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{x}}2t\mathbb{P}(X>t)\hspace{0.1667em}\mathrm{d}t-{x^{2}}\mathbb{P}(X>x).\end{array}\]

Since ${\textstyle\int _{0}^{x}}t\mathbb{P}(X>t)\hspace{0.1667em}\mathrm{d}t\sim {\textstyle\int _{1}^{x}}\ell (t)/t\hspace{0.1667em}\mathrm{d}t={\ell ^{\ast }}(x)$, ${x^{2}}\mathbb{P}(X>x)\sim \ell (x)$, and $\ell (x)/{\ell ^{\ast }}(x)\to 0$ as $x\to \infty $, it follows that $\mathbb{E}({X^{2}}{1_{\{X\le x\}}})\sim 2{\ell ^{\ast }}(x)$ as $x\to \infty $. Thus, relation (2.3c) of Bingham and Doney [3] holds with $n=1$ and $L:={\ell ^{\ast }}$. This relation is equivalent (see (2.4) in Theorem A of [3]) to ${\psi ^{\prime\prime }}(u)\sim 2{\ell ^{\ast }}(1/u)$ as $u\to 0$.

Recall that ${c_{N}}=N\mathbb{E}({W_{1}^{2}})$. We now verify the asymptotic relation ${c_{N}}\sim 2{\mu ^{-2}}{\ell ^{\ast }}(N)/N$ as $N\to \infty $ or, equivalently, that

(14)

\[ \underset{N\to \infty }{\lim }\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}})\hspace{2.5pt}=\hspace{2.5pt}\frac{2}{{\mu ^{2}}}.\]

We have

\[ \mathbb{E}({W_{1}^{2}})\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{\infty }}u{\psi ^{\prime\prime }}(u)\mathbb{E}({e^{-u{S_{N-1}}}})\hspace{0.1667em}\mathrm{d}u\hspace{2.5pt}=\hspace{2.5pt}\frac{1}{{N^{2}}}{\int _{0}^{\infty }}t{\psi ^{\prime\prime }}(t/N)\mathbb{E}({e^{-t{S_{N-1}}/N}})\hspace{0.1667em}\mathrm{d}t.\]

Multiplication by ${N^{2}}/{\ell ^{\ast }}(N)$ and Fatou’s lemma yield

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \underset{N\to \infty }{\liminf }\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}})& \displaystyle \ge & \displaystyle {\int _{0}^{\infty }}t\underset{N\to \infty }{\liminf }\frac{{\psi ^{\prime\prime }}(t/N)}{{\ell ^{\ast }}(N)}\mathbb{E}({e^{-t{S_{N-1}}/N}})\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle =& \displaystyle {\int _{0}^{\infty }}2t{e^{-\mu t}}\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}\frac{2}{{\mu ^{2}}},\end{array}\]

since ${\psi ^{\prime\prime }}(t/N)\sim 2{\ell ^{\ast }}(N/t)\sim 2{\ell ^{\ast }}(N)$ and $\mathbb{E}({e^{-t{S_{N-1}}/N}})\to {e^{-\mu t}}$ as $N\to \infty $. To see that ${\limsup _{N\to \infty }}\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}})\le 2/{\mu ^{2}}$, fix $a\in (0,\mu )$. By Lemma 1 there exists ${N_{0}}\in \mathbb{N}$ and $q\in (0,1)$ such that $\mathbb{P}({S_{N-1}}\le aN)\le {q^{N}}$ for all $N\in \mathbb{N}$ with $N>{N_{0}}$. Noting that $\mathbb{E}({W_{1}^{2}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le aN\}}})\le \mathbb{P}({S_{N-1}}\le aN)\le {q^{N}}$, it suffices to verify that

(15)

\[ \underset{N\to \infty }{\limsup }\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}})\hspace{2.5pt}\le \hspace{2.5pt}\frac{2}{{\mu ^{2}}}.\]

In order to see this, let $\lambda \in (a,\mu )$ and decompose $\mathbb{E}({W_{1}^{2}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}})=\mathbb{E}({W_{1}^{2}}{1_{{A_{N}}}})+\mathbb{E}({W_{1}^{2}}{1_{{B_{N}}}})$, where ${A_{N}}:=\{aN<{X_{2}}+\cdots +{X_{N}}\le \lambda N\}$ and ${B_{N}}:=\{{X_{2}}+\cdots +{X_{N}}>\lambda N\}$. We have

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}}{1_{{A_{N}}}})& \displaystyle =& \displaystyle \frac{{N^{2}}}{{\ell ^{\ast }}(N)}{\int _{0}^{\infty }}u{\psi ^{\prime\prime }}(u)\mathbb{E}({e^{-u{S_{N-1}}}}{1_{\{aN<{S_{N-1}}\le \lambda N\}}})\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle \le & \displaystyle \mathbb{P}({S_{N-1}}\le \lambda N)\frac{{N^{2}}}{{\ell ^{\ast }}(N)}{\int _{0}^{\infty }}u{\psi ^{\prime\prime }}(u){e^{-uaN}}\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle =& \displaystyle \mathbb{P}({S_{N-1}}\le \lambda N)\frac{1}{{\ell ^{\ast }}(N)}{\int _{0}^{\infty }}t{\psi ^{\prime\prime }}(t/N){e^{-at}}\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle \sim & \displaystyle \mathbb{P}({S_{N-1}}\le \lambda N)\frac{1}{{\ell ^{\ast }}(N)}{\psi ^{\prime\prime }}(1/N){\int _{0}^{\infty }}t{e^{-at}}\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle \sim & \displaystyle \mathbb{P}({S_{N-1}}\le \lambda N)\frac{2}{{a^{2}}}\hspace{2.5pt}\to \hspace{2.5pt}0,\hspace{2em}N\to \infty ,\end{array}\]

where the second last asymptotics holds by Theorem 3 of Karamata [22], applied with $f(t):=t{e^{-at}}$ and $\varphi :={\psi ^{\prime\prime }}$, which is slowly varying at 0. For the second part we obtain

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}}{1_{{B_{N}}}})& \displaystyle =& \displaystyle \frac{{N^{2}}}{{\ell ^{\ast }}(N)}{\int _{0}^{\infty }}u{\psi ^{\prime\prime }}(u)\mathbb{E}({e^{-u{S_{N-1}}}}{1_{\{{S_{N-1}}>\lambda N\}}})\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle \le & \displaystyle \frac{{N^{2}}}{{\ell ^{\ast }}(N)}{\int _{0}^{\infty }}u{\psi ^{\prime\prime }}(u){e^{-u\lambda N}}\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle =& \displaystyle \frac{1}{{\ell ^{\ast }}(N)}{\int _{0}^{\infty }}t{\psi ^{\prime\prime }}(t/N){e^{-\lambda t}}\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle \sim & \displaystyle \frac{1}{{\ell ^{\ast }}(N)}{\psi ^{\prime\prime }}(1/N){\int _{0}^{\infty }}t{e^{-\lambda t}}\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}\frac{2}{{\lambda ^{2}}},\end{array}\]

where the second last asymptotics holds again by Theorem 3 of Karamata [22], now applied with $f(t):=t{e^{-\lambda t}}$ and $\varphi :={\psi ^{\prime\prime }}$. Therefore,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle \underset{N\to \infty }{\limsup }\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}}{1_{\{{X_{2}}+\cdots +{X_{N}}>aN\}}})\\ {} & & \displaystyle \hspace{2em}\le \underset{N\to \infty }{\limsup }\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}}{1_{{A_{N}}}})+\underset{N\to \infty }{\limsup }\frac{{N^{2}}}{{\ell ^{\ast }}(N)}\mathbb{E}({W_{1}^{2}}{1_{{B_{N}}}})\\ {} & & \displaystyle \hspace{2em}\le 0+\frac{2}{{\lambda ^{2}}}\hspace{2.5pt}=\hspace{2.5pt}\frac{2}{{\lambda ^{2}}}.\end{array}\]

Letting $\lambda \uparrow \mu $ shows that (15) holds. Thus, (14) is established. The rest of the proof now works as follows. By the monotone density theorem (Lemma 4), applied with $\rho =0$,

\[ \frac{-u{\psi ^{\prime\prime\prime }}(u)}{{\psi ^{\prime\prime }}(u)}\hspace{2.5pt}\sim \hspace{2.5pt}\frac{-u{\psi ^{\prime\prime\prime }}(u)}{2{\ell ^{\ast }}(1/u)}\hspace{2.5pt}\to \hspace{2.5pt}0,\hspace{2em}u\to 0.\]

Thus, for every $\varepsilon >0$ there exists $\delta =\delta (\varepsilon )>0$ such that $-u{\psi ^{\prime\prime\prime }}(u)\le \varepsilon {\psi ^{\prime\prime }}(u)$ for all $u\in (0,\delta )$. Therefore, together with Lemma 3, as $N\to \infty $,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}({W_{1}^{3}})& \displaystyle \sim & \displaystyle \frac{1}{2}{\int _{0}^{\delta }}{u^{2}}(-{\psi ^{\prime\prime\prime }}(u)){(\psi (u))^{N-1}}\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle \le & \displaystyle \frac{\varepsilon }{2}{\int _{0}^{\delta }}u{\psi ^{\prime\prime }}(u){(\psi (u))^{N-1}}\hspace{0.1667em}\mathrm{d}u\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\varepsilon }{2}\mathbb{E}({W_{1}^{2}}).\end{array}\]

Thus, ${\limsup _{N\to \infty }}\mathbb{E}({W_{1}^{3}})/\mathbb{E}({W_{1}^{2}})\le \varepsilon /2$. Since ε can be chosen arbitrarily small, it follows that ${\lim \nolimits_{N\to \infty }}{\Phi _{1}^{(N)}}(3)/{c_{N}}={\lim \nolimits_{N\to \infty }}\mathbb{E}({W_{1}^{3}})/\mathbb{E}({W_{1}^{2}})=0$, which is equivalent (see, for example, [29, Section 4]) to the property that the model is in the domain of attraction of the Kingman coalescent. □

We now turn to the proofs of the three remaining parts (iv)–(vi) of Theorem 1. We first consider the case $0<\alpha <1$ corresponding to part (v) of Theorem 1. The boundary cases (iv) ($\alpha =1$) and (vi) ($\alpha =0$) will be considered afterwards. Assume that $0<\alpha <1$. Then (6) is exactly Eq. (2.3b) of Bingham and Doney [3] with $n=0$, $\beta =\alpha \in (0,1)$ and $L(x):=\Gamma (1-\alpha )\ell (x)$. By [3, Theorem A], (6) is hence equivalent (see [3, Eq. (2.1)]) to $1-\psi (u)\sim {u^{\alpha }}L(1/u)=\Gamma (1-\alpha ){u^{\alpha }}\ell (1/u)$ as $u\to 0$.

Proof of Theorem 1 (v).

For $k\in {\mathbb{N}_{0}}$ and $x>0$ define ${h_{k}}(x):={x^{k}}\mathbb{P}(X>x)$. By (6), ${h_{k}}(x)\sim {x^{k-\alpha }}\ell (x)$ as $x\to \infty $. Karamata’s Tauberian theorem [2, Theorem 1.7.6], applied with $U:={h_{k}}$, $\rho :=k-\alpha $ and $c:=\Gamma (\rho +1)$, yields for all $k\in {\mathbb{N}_{0}}$ that ${\widehat{h}_{k}}(u):=u{\textstyle\int _{0}^{\infty }}{e^{-ux}}{x^{k}}\mathbb{P}(X>x)\hspace{0.1667em}\mathrm{d}x\sim \Gamma (k-\alpha +1){u^{\alpha -k}}\ell (1/u)$ as $u\to 0$. Thus, by (8), for all $k\in \mathbb{N}$,

(16)

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle {\varphi _{k}}(u)& \displaystyle :=& \displaystyle \mathbb{E}({X^{k}}{e^{-uX}})\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{\infty }}\frac{\mathrm{d}}{\mathrm{d}x}({x^{k}}{e^{-ux}})\mathbb{P}(X>x)\hspace{0.1667em}\mathrm{d}x\\ {} & \displaystyle =& \displaystyle {\int _{0}^{\infty }}(k{x^{k-1}}{e^{-ux}}-u{x^{k}}{e^{-ux}})\mathbb{P}(X>x)\hspace{0.1667em}\mathrm{d}x\hspace{2.5pt}=\hspace{2.5pt}\frac{k}{u}{\widehat{h}_{k-1}}(u)-{\widehat{h}_{k}}(u)\\ {} & \displaystyle \sim & \displaystyle \frac{k}{u}\Gamma (k-\alpha ){u^{\alpha -(k-1)}}\ell (1/u)-\Gamma (k-\alpha +1){u^{\alpha -k}}\ell (1/u)\\ {} & \displaystyle =& \displaystyle \alpha \Gamma (k-\alpha ){u^{\alpha -k}}\ell (1/u),\hspace{2em}u\to 0.\end{array}\]

We now turn to the joint moments of ${W_{1}},\dots ,{W_{j}}$. Let ${a_{1}},{a_{2}},\dots $ be positive real numbers satisfying $L({a_{N}})\sim {a_{N}^{\alpha }}/N$ as $N\to \infty $. Moreover, fix some $\delta \in (0,\infty )$. The exact value of δ is irrelevant but it is important that δ is finite. Let $j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}$. Define $k:={k_{1}}+\cdots +{k_{j}}$. By Lemma 3, as $N\to \infty $,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle {\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})& \displaystyle =& \displaystyle {(N)_{j}}\mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}})\\ {} & \displaystyle \sim & \displaystyle \frac{{N^{j}}}{\Gamma (k)}{\int _{0}^{\delta }}{u^{k-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}{\varphi _{{k_{i}}}}(u)\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle =& \displaystyle \frac{{N^{j}}}{\Gamma (k){a_{N}^{k}}}{\int _{0}^{\delta {a_{N}}}}{t^{k-1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}}){\prod \limits_{i=1}^{j}}{\varphi _{{k_{i}}}}(t/{a_{N}})\hspace{0.1667em}\mathrm{d}t.\end{array}\]

Corollary 1, an Abelian result á la Karamata provided in the appendix for convenience, applied to ${x_{N}}:=1/{a_{N}}$, ${f_{N}}(t):={t^{k-1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}}){1_{(0,\delta {a_{N}})}}(t)$ and $\varphi :={\textstyle\prod _{i=1}^{j}}{\varphi _{{k_{i}}}}$, which is regularly varying at 0 with index ${\textstyle\sum _{i=1}^{j}}(\alpha -{k_{i}})=j\alpha -k$, yields, as $N\to \infty $,

(17)

\[ {\Phi _{j}^{(N)}}({k_{1}}\dots ,{k_{j}})\hspace{2.5pt}\sim \hspace{2.5pt}\frac{{N^{j}}{\textstyle\textstyle\prod _{i=1}^{j}}{\varphi _{{k_{i}}}}(1/{a_{N}})}{\Gamma (k){a_{N}^{k}}}{\int _{0}^{\delta {a_{N}}}}{t^{j\alpha -1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t.\]

In the following the asymptotic relation (17) is used to verify by induction on $j\in \mathbb{N}$ that, for all ${k_{1}},\dots ,{k_{j}}\in \mathbb{N}$,

(18)

\[ \underset{N\to \infty }{\lim }{\Phi _{j}^{(N)}}({k_{1}}\dots ,{k_{j}})\hspace{2.5pt}=\hspace{2.5pt}{\alpha ^{j-1}}\frac{\Gamma (j)}{\Gamma (k)}{\prod \limits_{i=1}^{j}}\frac{\Gamma ({k_{i}}-\alpha )}{\Gamma (1-\alpha )}.\]

Since ${\Phi _{1}^{(N)}}(1)=N\mathbb{E}({W_{1}})=1$, the choice $j={k_{1}}=1$ in (17) yields

(19)

\[ {\int _{0}^{\delta {a_{N}}}}{t^{\alpha -1}}\mathbb{E}({e^{-t{S_{N-1}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}\frac{{a_{N}}}{N{\varphi _{1}}(1/{a_{N}})}\hspace{2.5pt}\sim \hspace{2.5pt}\frac{1}{\alpha },\hspace{2em}N\to \infty ,\]

where the last asymptotics holds, since ${\varphi _{1}}(1/{a_{N}})\sim \alpha \Gamma (1-\alpha ){a_{N}^{1-\alpha }}\ell ({a_{N}})$ and ${a_{N}^{\alpha }}/N\sim L({a_{N}})=\Gamma (1-\alpha )\ell (N)$. Note that in (19) it is important that $\delta <\infty $ because otherwise the integral on the left hand side of (19) could take the value ∞. For $j=1$ and ${k_{1}}=k\in \mathbb{N}$, (17) thus reduces to

\[ {\Phi _{1}^{(N)}}(k)\hspace{2.5pt}\sim \hspace{2.5pt}\frac{N{\varphi _{k}}(1/{a_{N}})}{\Gamma (k){a_{N}^{k}}}\frac{1}{\alpha }\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\Gamma (k-\alpha )}{\Gamma (k)\Gamma (1-\alpha )},\hspace{2em}N\to \infty ,\]

which shows that (18) holds for $j=1$. In particular, ${c_{N}}={\Phi _{1}^{(N)}}(2)\to 1-\alpha >0$ as $N\to \infty $. The induction step from $j-1$ to j ($\ge 2$) works as follows. By the consistency relation (4) and the induction hypothesis,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle {\Phi _{j}^{(N)}}(1,\dots ,1)& \displaystyle =& \displaystyle {\Phi _{j-1}^{(N)}}(1,\dots ,1)-(j-1){\Phi _{j-1}^{(N)}}(2,1,\dots ,1)\\ {} & \displaystyle \to & \displaystyle {\alpha ^{j-2}}-{\alpha ^{j-2}}(1-\alpha )\hspace{2.5pt}=\hspace{2.5pt}{\alpha ^{j-1}}.\end{array}\]

Thus, (18) holds for ${k_{1}}=\cdots ={k_{j}}=1$ and the choice ${k_{1}}=\cdots ={k_{j}}=1$ in (17) yields

\[ {\int _{0}^{\delta {a_{N}}}}{t^{j\alpha -1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}\hspace{2.5pt}\frac{\Gamma (j){a_{N}^{j}}}{{N^{j}}{({\varphi _{1}}(1/{a_{N}}))^{j}}}{\alpha ^{j-1}}\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\Gamma (j)}{\alpha },\hspace{2em}N\to \infty .\]

Therefore, (17) reduces to

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle {\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})& \displaystyle \sim & \displaystyle \frac{{N^{j}}{\textstyle\textstyle\prod _{i=1}^{j}}{\varphi _{{k_{i}}}}(1/{a_{N}})}{\Gamma (k){a_{N}^{k}}}\frac{\Gamma (j)}{\alpha }\\ {} & \displaystyle \sim & \displaystyle \frac{{N^{j}}\Gamma (j){\textstyle\textstyle\prod _{i=1}^{j}}(\alpha \Gamma ({k_{i}}-\alpha ){a_{N}^{{k_{i}}-\alpha }}\ell ({a_{N}}))}{\alpha \Gamma (k){a_{N}^{k}}}\\ {} & \displaystyle =& \displaystyle {\alpha ^{j-1}}\frac{\Gamma (j)}{\Gamma (k)}{\Big(\frac{N\Gamma (1-\alpha )\ell ({a_{N}})}{{a_{N}^{\alpha }}}\Big)^{j}}{\prod \limits_{i=1}^{j}}\frac{\Gamma ({k_{i}}-\alpha )}{\Gamma (1-\alpha )}\\ {} & \displaystyle \to & \displaystyle {\alpha ^{j-1}}\frac{\Gamma (j)}{\Gamma (k)}{\prod \limits_{i=1}^{j}}\frac{\Gamma ({k_{i}}-\alpha )}{\Gamma (1-\alpha )}\hspace{2.5pt}=:\hspace{2.5pt}{\phi _{j}}({k_{1}},\dots ,{k_{j}}),\end{array}\]

since $N\Gamma (1-\alpha )\ell ({a_{N}})=NL({a_{N}})\sim {a_{N}^{\alpha }}$ as $N\to \infty $. The induction is complete.

In summary, ${\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})\to {\phi _{j}}({k_{1}},\dots ,{k_{j}})$ as $N\to \infty $ for all $j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}$. The quantities ${\phi _{j}}({k_{1}},\dots ,{k_{j}})$ are (see [31, Eq. (16)] for the analogous formula for the rates of the continuous-time Poisson–Dirichlet coalescent) the transition probabilities of the discrete-time two-parameter Poisson–Dirichlet coalescent with parameters α and 0. The convergence result (v) of Theorem 1 therefore follows from [32, Theorem 2.1]. □

Let us now turn to the (boundary) case $\alpha =1$, so we now assume that $\mathbb{P}(X>x)\sim {x^{-1}}\ell (x)$ as $x\to \infty $ for some function ℓ slowly varying at ∞.

Proof of Theorem 1 (iv).

The proof has much in common with that of part (v). The details are however slightly different. For all $x>0$,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle \mathbb{E}(X{1_{\{X\le x\}}})\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{\infty }}\mathbb{P}(X{1_{\{X\le x\}}}>t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{x}}\mathbb{P}(t<X\le x)\hspace{0.1667em}\mathrm{d}t\\ {} & & \displaystyle \hspace{1em}={\int _{0}^{x}}\big(\mathbb{P}(X>t)-\mathbb{P}(X>x)\big)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{x}}\mathbb{P}(X>t)\hspace{0.1667em}\mathrm{d}t-x\mathbb{P}(X>x).\end{array}\]

Using that ${\textstyle\int _{0}^{x}}\mathbb{P}(X>t)\hspace{0.1667em}\mathrm{d}t\sim {\textstyle\int _{1}^{x}}\ell (t)/t\hspace{0.1667em}\mathrm{d}t={\ell ^{\ast }}(x)$, $x\mathbb{P}(X>x)\sim \ell (x)$ and $\ell (x)/{\ell ^{\ast }}(x)\to 0$ as $x\to \infty $, it follows that $\mathbb{E}(X{1_{\{X\le x\}}})\sim {\ell ^{\ast }}(x)$ as $x\to \infty $. Recall that ${\ell ^{\ast }}$ is slowly varying at ∞. Thus, Eq. (2.3c) of Bingham and Doney [3] holds with $n=0$ and $\alpha =\beta =1$ and $L:={\ell ^{\ast }}$, which is equivalent (see [3, Theorem A, Eq. (2.1)]) to

\[ 1-\psi (u)\hspace{2.5pt}\sim \hspace{2.5pt}u{\ell ^{\ast }}(1/u),\hspace{2em}u\to 0\]

and as well (see [3, Theorem A, Eq. (2.4)]) equivalent to

\[ {\varphi _{1}}(u)\hspace{2.5pt}:=\hspace{2.5pt}\mathbb{E}(X{e^{-uX}})\hspace{2.5pt}=\hspace{2.5pt}-{\psi ^{\prime }}(u)\hspace{2.5pt}\sim \hspace{2.5pt}{\ell ^{\ast }}(1/u),\hspace{2em}u\to 0.\]

For $k\in \mathbb{N}\setminus \{1\}$, the asymptotic relation

(20)

\[ {\varphi _{k}}(u)\hspace{2.5pt}:=\hspace{2.5pt}\mathbb{E}({X^{k}}{e^{-uX}})\hspace{2.5pt}\sim \hspace{2.5pt}\Gamma (k-1){u^{1-k}}\ell (1/u),\hspace{2em}u\to 0,\]

is verified exactly as in the proof of part (v) of Theorem 1. In particular, ${\varphi _{k}}$ is regularly varying at 0 with index $1-k$, $k\in {\mathbb{N}_{0}}$.

We now turn to the joint moments of ${W_{1}},\dots ,{W_{j}}$. Let ${a_{1}},{a_{2}},\dots $ be positive real numbers satisfying ${\ell ^{\ast }}({a_{N}})\sim {a_{N}}/N$ as $N\to \infty $. As in the proof of part (v) of Theorem 1, fix some $\delta \in (0,\infty )$. Again, the exact value of δ is irrelevant but it is important that δ is finite. Let $j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}$. Define $k:={k_{1}}+\cdots +{k_{j}}$. By Lemma 3, as $N\to \infty $,

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}})& \displaystyle \sim & \displaystyle \frac{1}{\Gamma (k)}{\int _{0}^{\delta }}{u^{k-1}}\mathbb{E}({e^{-u{S_{N-j}}}}){\prod \limits_{i=1}^{j}}{\varphi _{{k_{i}}}}(u)\hspace{0.1667em}\mathrm{d}u\\ {} & \displaystyle =& \displaystyle \frac{1}{\Gamma (k){a_{N}^{k}}}{\int _{0}^{\delta {a_{N}}}}{t^{k-1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}}){\prod \limits_{i=1}^{j}}{\varphi _{{k_{i}}}}(t/{a_{N}})\hspace{0.1667em}\mathrm{d}t.\end{array}\]

Corollary 1, applied to ${x_{N}}:=1/{a_{N}}$, ${f_{N}}(t):={t^{k-1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}}){1_{(0,\delta {a_{N}})}}(t)$ and $\varphi :={\textstyle\prod _{i=1}^{j}}{\varphi _{{k_{i}}}}$, which is regularly varying at 0 with index ${\textstyle\sum _{i=1}^{j}}(1-{k_{i}})=j-k$, shows that, as $N\to \infty $,

(21)

\[ \mathbb{E}({W_{1}^{{k_{1}}}}\cdots {W_{j}^{{k_{j}}}})\hspace{2.5pt}\sim \hspace{2.5pt}\frac{{\textstyle\textstyle\prod _{i=1}^{j}}{\varphi _{{k_{i}}}}(1/{a_{N}})}{\Gamma (k){a_{N}^{k}}}{\int _{0}^{\delta {a_{N}}}}{t^{j-1}}\mathbb{E}({e^{-t{S_{N-j}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t.\]

Since $\mathbb{E}({W_{1}})=1/N$, the asymptotic relation (21) turns for $j={k_{1}}=1$ into

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \frac{1}{N}& \displaystyle \sim & \displaystyle {a_{N}^{-1}}{\varphi _{1}}(1/{a_{N}}){\int _{0}^{\delta {a_{N}}}}\mathbb{E}({e^{-t{S_{N-1}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle \sim & \displaystyle {a_{N}^{-1}}{\ell ^{\ast }}({a_{N}}){\int _{0}^{\delta {a_{N}}}}\mathbb{E}({e^{-t{S_{N-1}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t,\end{array}\]

or, equivalently,

\[ {\int _{0}^{\delta {a_{N}}}}\mathbb{E}({e^{-t{S_{N-1}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}\frac{{a_{N}}}{N{\ell ^{\ast }}({a_{N}})}\hspace{2.5pt}\sim \hspace{2.5pt}1,\hspace{2em}N\to \infty .\]

Therefore, for $j=1$ and $k={k_{1}}\in \mathbb{N}\setminus \{1\}$, (21) reduces to

\[ \mathbb{E}({W_{1}^{k}})\hspace{2.5pt}\sim \hspace{2.5pt}\frac{{\varphi _{k}}(1/{a_{N}})}{\Gamma (k){a_{N}^{k}}}\hspace{2.5pt}\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\ell ({a_{N}})}{(k-1){a_{N}}},\hspace{2em}N\to \infty ,\]

since ${\varphi _{k}}(1/{a_{N}})\sim \Gamma (k-1){a_{N}^{k-1}}\ell ({a_{N}})$ by (20). Thus, the coalescence probability ${c_{N}}$ satisfies

\[ {c_{N}}\hspace{2.5pt}=\hspace{2.5pt}N\mathbb{E}({W_{1}^{2}})\hspace{2.5pt}\sim \hspace{2.5pt}\frac{N\ell ({a_{N}})}{{a_{N}}}\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\ell ({a_{N}})}{{\ell ^{\ast }}({a_{N}})}\hspace{2.5pt}\to \hspace{2.5pt}0,\hspace{2em}N\to \infty ,\]

and

\[ \frac{{\Phi _{1}^{(N)}}(k)}{{c_{N}}}\hspace{2.5pt}=\hspace{2.5pt}\frac{\mathbb{E}({W_{1}^{k}})}{\mathbb{E}({W_{1}^{2}})}\hspace{2.5pt}\to \hspace{2.5pt}\frac{1}{k-1}\hspace{2.5pt}=\hspace{2.5pt}{\int _{[0,1]}}{x^{k-2}}\hspace{0.1667em}\Lambda (\mathrm{d}x),\hspace{2em}k\in \mathbb{N}\setminus \{1\},\]

where Λ denotes the uniform distribution on $[0,1]$. To see that simultaneous multiple collisions cannot occur in the limit, note that ${(N)_{2}}\mathbb{E}({W_{1}}{W_{2}})=1-{c_{N}}\sim 1$ as $N\to \infty $, or, equivalently, $\mathbb{E}({W_{1}}{W_{2}})\sim 1/{N^{2}}$ as $N\to \infty $. Thus, (21) reduces for $j=2$ and ${k_{1}}={k_{2}}=1$ to

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}\displaystyle \frac{1}{{N^{2}}}& \displaystyle \sim & \displaystyle \frac{{\varphi _{1}^{2}}(1/{a_{N}})}{{a_{N}^{2}}}{\int _{0}^{\delta {a_{N}}}}t\mathbb{E}({e^{-t{S_{N-2}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t\\ {} & \displaystyle \sim & \displaystyle {\Big(\frac{{\ell ^{\ast }}({a_{N}})}{{a_{N}}}\Big)^{2}}{\int _{0}^{\delta {a_{N}}}}t\mathbb{E}({e^{-t{S_{N-2}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t,\end{array}\]

or, equivalently,

(22)

\[ {\int _{0}^{\delta {a_{N}}}}t\mathbb{E}({e^{-t{S_{N-2}}/{a_{N}}}})\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}{\Big(\frac{{a_{N}}}{N{\ell ^{\ast }}({a_{N}})}\Big)^{2}}\hspace{2.5pt}\sim \hspace{2.5pt}1,\hspace{2em}N\to \infty .\]

Therefore, for $j=2$ and ${k_{1}},{k_{2}}\in \mathbb{N}\setminus \{1\}$, (21) reduces to

\[ \mathbb{E}({W_{1}^{{k_{1}}}}{W_{2}^{{k_{2}}}})\hspace{2.5pt}\sim \hspace{2.5pt}\frac{{\varphi _{{k_{1}}}}(1/{a_{N}}){\varphi _{{k_{2}}}}(1/{a_{N}})}{\Gamma (k){a_{N}^{k}}}\hspace{2.5pt}\sim \hspace{2.5pt}\frac{\Gamma ({k_{1}}-1)\Gamma ({k_{2}}-1)}{\Gamma (k)}{\Big(\frac{\ell ({a_{N}})}{{a_{N}}}\Big)^{2}},\]

where the last asymptotics holds since ${\varphi _{{k_{i}}}}(1/{a_{N}})\sim \Gamma ({k_{i}}-1){a_{N}^{{k_{i}}-1}}\ell ({a_{N}})$ by (20). In particular, ${\Phi _{2}^{(N)}}(2,2)={(N)_{2}}\mathbb{E}({W_{1}^{2}}{W_{2}^{2}})\sim {(N\ell ({a_{N}})/{a_{N}})^{2}}/6\sim {c_{N}^{2}}/6$. For $j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}\setminus \{1\}$ it follows from the monotonicity property (5) that ${\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})\le {\Phi _{2}^{(N)}}(2,2)=O({c_{N}^{2}})$, and, therefore, ${\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})/{c_{N}}\to 0$ as $N\to \infty $, which shows that simultaneous multiple collisions cannot occur in the limit.

To summarize, by [32, Theorem 2.1], the model is in the domain of attraction of the Λ-coalescent with Λ the uniform distribution on $[0,1]$, which is the Bolthausen–Sznitman coalescent. □

Remark 4.

Suppose that (6) holds with $\alpha \in (0,1)$. Using the same techniques as in the previous proof, it follows for all $j\in \mathbb{N}$ and ${k_{1}},\dots ,{k_{j}}\ge 2$ that

where ${c_{N}}\sim N\ell ({a_{N}})/{a_{N}}\sim \ell ({a_{N}})/{\ell ^{\ast }}({a_{N}})\to 0$ as $N\to \infty $. Thanks to the monotonicity property (5) this formula is only needed for $j\in \{1,2\}$ in the previous proof.

We finally turn to the case $\alpha =0$ corresponding to the last part (vi) of Theorem 1.

Proof of Theorem 1 (vi).

Let ${Q_{N}}$ denote the distribution of ${X_{2}}+\cdots +{X_{N}}\stackrel{d}{=}{S_{N-1}}$. For all $p>0$,

(23)

\[ \mathbb{E}({W_{1}^{p}})\hspace{2.5pt}=\hspace{2.5pt}\mathbb{E}({W_{1}^{p}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le N\}}})+{\int _{(N,\infty )}}\mathbb{E}\bigg({\bigg(\frac{X}{X+x}\bigg)^{p}}\bigg)\hspace{0.1667em}{Q_{N}}(\mathrm{d}x).\]

From Lemma 1 it follows that there exists $q\in (0,1)$ such that

\[ \mathbb{E}({W_{1}^{p}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le N\}}})\hspace{2.5pt}\le \hspace{2.5pt}\mathbb{P}({S_{N-1}}\le N)\hspace{2.5pt}\le \hspace{2.5pt}{q^{N}}\]

for all sufficiently large N. By Lemma 2, $\mathbb{E}({(X/(X+x))^{p}})\sim \ell (x)$ as $x\to \infty $, which implies that

(24)

\[ {\int _{(N,\infty )}}\mathbb{E}\bigg({\bigg(\frac{X}{X+x}\bigg)^{p}}\bigg)\hspace{0.1667em}{Q_{N}}(\mathrm{d}x)\hspace{2.5pt}\sim \hspace{2.5pt}{\int _{(N,\infty )}}\ell (x)\hspace{0.1667em}{Q_{N}}(\mathrm{d}x),\hspace{2em}N\to \infty .\]

Note that the integral on the right hand side of (24) does not depend on the parameter p. For $p=1$, taking $\mathbb{E}({W_{1}})=1/N$ into account, Eq. (23), multiplied by N, turns into

\[ 1\hspace{2.5pt}=\hspace{2.5pt}N\mathbb{E}({W_{1}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le N\}}})+N{\int _{(N,\infty )}}\mathbb{E}\bigg(\frac{X}{X+x}\bigg)\hspace{0.1667em}{Q_{N}}(\mathrm{d}x).\]

Noting that, for all sufficiently large N,

\[ N\mathbb{E}({W_{1}}{1_{\{{X_{2}}+\cdots +{X_{N}}\le N\}}})\hspace{2.5pt}\le \hspace{2.5pt}N\mathbb{P}({S_{N-1}}\le N)\hspace{2.5pt}\le \hspace{2.5pt}N{q^{N}}\hspace{2.5pt}\to \hspace{2.5pt}0,\hspace{2em}N\to \infty ,\]

it follows that ${\lim \nolimits_{N\to \infty }}N{\textstyle\int _{(N,\infty )}}\mathbb{E}(X/(X+x))\hspace{0.1667em}{Q_{N}}(\mathrm{d}x)=1$, or, equivalently,

\[ \frac{1}{N}\hspace{2.5pt}\sim \hspace{2.5pt}{\int _{(N,\infty )}}\mathbb{E}\bigg(\frac{X}{X+x}\bigg)\hspace{0.1667em}{Q_{N}}(\mathrm{d}x)\hspace{2.5pt}\sim \hspace{2.5pt}{\int _{(N,\infty )}}\ell (x)\hspace{0.1667em}{Q_{N}}(\mathrm{d}x),\hspace{2em}N\to \infty ,\]

where the last asymptotics holds by (24) for $p=1$. Therefore, for every $p>0$ the integral in (24) is asymptotically equal to $1/N$ and it follows from (23) that $N\mathbb{E}({W_{1}^{p}})\to 1$ as $N\to \infty $ for all $p>0$. In particular, ${c_{N}}=N\mathbb{E}({W_{1}^{2}})\to 1$ as $N\to \infty $. Moreover, ${\Phi _{2}^{(N)}}(2,2)={(N)_{2}}\mathbb{E}({W_{1}^{2}}{W_{2}^{2}})\le {(N)_{2}}\mathbb{E}({W_{1}}{W_{2}})=1-{c_{N}}\to 0$ as $N\to \infty $. Thus, for all $j,{k_{1}},\dots ,{k_{j}}\in \mathbb{N}\setminus \{1\}$, ${\Phi _{j}^{(N)}}({k_{1}},\dots ,{k_{j}})\le {\Phi _{2}^{(N)}}(2,2)\to 0$ as $N\to \infty $, which shows that simultaneous multiple collisions cannot occur in the limit. By [32, Theorem 2.1], the model is in the domain of attraction of the discrete-time star-shaped coalescent. □

A Appendix

For convenience we present the following version of the monotone density theorem.

Lemma 4.

Let ${x_{0}}\in (0,\infty ]$ and assume that $G:(0,{x_{0}})\to \mathbb{R}$ has the form $G(x)={\textstyle\int _{(x,{x_{0}})}}g(y)\hspace{0.1667em}\lambda (\mathrm{d}y)$ for some measurable function $g:(0,{x_{0}})\to \mathbb{R}$. If $G(x)\sim {x^{-\rho }}\ell (x)$ as $x\to 0$ for some constant $\rho \in [0,\infty )$ and some function ℓ slowly varying at 0 and if g is monotone in some right neighborhood of 0, then ${\lim \nolimits_{x\to 0}}{x^{\rho +1}}g(x)/\ell (x)=\rho $.

Remark 5.

Note that ${G^{\prime }}(x)=-g(x)$. The statement of the lemma is hence equivalent to ${\lim \nolimits_{x\to 0}}x{G^{\prime }}(x)/G(x)=-\rho $.

The following proof of Lemma 4 almost exactly coincides with the proofs known for standard versions of the monotone density theorem (see, for example, Bingham, Goldie and Teugels [2, Theorem 1.7.2] or Feller [12, p. 446]. The proof is provided, since the monotone density theorem in the form of Lemma 4 is heavily used throughout the proofs in Section 4.

Proof of Lemma 4.

Suppose first that g is nonincreasing in some right neighborhood of 0. If $0<a<b<\infty $, then, for all $x\in (0,{x_{0}}/b)$, $G(ax)-G(bx)={\textstyle\int _{(ax,bx]}}g(y)\hspace{0.1667em}\lambda (\mathrm{d}y)$ so, for x small enough,

\[ \frac{(b-a)xg(bx)}{{x^{-\rho }}\ell (x)}\hspace{2.5pt}\le \hspace{2.5pt}\frac{G(ax)-G(bx)}{{x^{-\rho }}\ell (x)}\hspace{2.5pt}\le \hspace{2.5pt}\frac{(b-a)xg(ax)}{{x^{-\rho }}\ell (x)}.\]

The middle fraction is

\[ \frac{G(ax)}{{(ax)^{-\rho }}\ell (ax)}{a^{-\rho }}\frac{\ell (ax)}{\ell (x)}-\frac{G(bx)}{{(bx)^{-\rho }}\ell (bx)}{b^{-\rho }}\frac{\ell (bx)}{\ell (x)}\hspace{2.5pt}\to \hspace{2.5pt}{a^{-\rho }}-{b^{-\rho }},\hspace{2em}x\to 0,\]

so the first inequality above yields

\[ \underset{x\to 0}{\limsup }\frac{g(bx)}{{x^{-\rho -1}}\ell (x)}\hspace{2.5pt}\le \hspace{2.5pt}\frac{{a^{-\rho }}-{b^{-\rho }}}{b-a}.\]

Taking $b:=1$ and letting $a\uparrow 1$ gives

\[ \underset{x\to 0}{\limsup }\frac{g(x)}{{x^{-\rho -1}}\ell (x)}\hspace{2.5pt}\le \hspace{2.5pt}\underset{a\to 1}{\lim }\frac{{a^{-\rho }}-1}{1-a}\hspace{2.5pt}=\hspace{2.5pt}\rho .\]

By a similar treatment of the right inequality with $a:=1$ and $b\downarrow 1$ we find that the lim inf is at least ρ, and the conclusion follows. The argument when g is nondecreasing in some right neighborhood of 0 is similar. □

The following two results are extended versions of Theorem 2 and Theorem 3 of Karamata [22] adapted to our purposes. Lemma 5 provides conditions under which a slowly varying part inside an integral can be moved in front of the integral without changing the asymptotics of the integral. Corollary 1 is a similar result for the regularly varying case. The results are slightly more general than those provided in [22], since the functions ${g_{N}}$ and ${f_{N}}$ arising in the statements are allowed to depend on N, which is not the case in the formulation of [22].

Lemma 5.

Let $L:(0,\infty )\to (0,\infty )$ be slowly varying at 0 (or ∞), let ${({x_{N}})_{N\in \mathbb{N}}}$ be a sequence of positive real numbers satisfying ${x_{N}}\to 0$ (or ${x_{N}}\to \infty $) as $N\to \infty $. Furthermore, let ${g_{N}}:(0,\infty )\to [0,\infty )$ be nonnegative, integrable functions with $0<{\textstyle\int _{0}^{\infty }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t<\infty $ for all $N\in \mathbb{N}$ and such that, for some $a>0$ and some $\eta >0$,

\[ {\int _{0}^{a}}{t^{-\eta }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}<\hspace{2.5pt}\infty \hspace{1em}\textit{and}\hspace{1em}{\int _{a}^{\infty }}{t^{\eta }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}<\hspace{2.5pt}\infty \]

for all $N\in \mathbb{N}$. Then, as $N\to \infty $,

\[ {\int _{0}^{\infty }}L({x_{N}}t){g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}L({x_{N}}){\int _{0}^{\infty }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t.\]

Proof.

Define $P(x):={x^{\eta }}L(x)$ and $Q(x):={x^{-\eta }}L(x)$, $x>0$. Note that P is regularly varying with index η and Q is regularly varying with index $-\eta $. By [2, Theorem 1.5.2], $P({x_{N}}t)/P({x_{N}})\to {t^{\eta }}$ as $N\to \infty $ uniformly in $t\in (0,a]$ and $Q({x_{N}}t)/Q({x_{N}})\to {t^{-\eta }}$ as $N\to \infty $ uniformly in $t\in [a,\infty )$. Thus, for every $\varepsilon >0$ there exists ${N_{0}}={N_{0}}(\varepsilon )\in \mathbb{N}$ such that, for all $N\in \mathbb{N}$ with $N>{N_{0}}$,

\[ P({x_{N}})(1-\varepsilon )\hspace{2.5pt}\le \hspace{2.5pt}{t^{-\eta }}P({x_{N}}t)\hspace{2.5pt}\le \hspace{2.5pt}P({x_{N}})(1+\varepsilon )\hspace{1em}\text{for all}\hspace{2.5pt}t\in (0,a]\]

and

\[ Q({x_{N}})(1-\varepsilon )\hspace{2.5pt}\le \hspace{2.5pt}{t^{\eta }}Q({x_{N}}t)\hspace{2.5pt}\le \hspace{2.5pt}Q({x_{N}})(1+\varepsilon )\hspace{1em}\text{for all}\hspace{2.5pt}t\in [a,\infty ).\]

For all $N\in \mathbb{N}$ with $N>{N_{0}}$ it follows that

\[\begin{array}{r@{\hskip10.0pt}c@{\hskip10.0pt}l}& & \displaystyle {\int _{0}^{\infty }}L({x_{N}}t){g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\\ {} & & \displaystyle \hspace{2em}={x_{N}^{-\eta }}{\int _{0}^{a}}{t^{-\eta }}P({x_{N}}t){g_{N}}(t)\hspace{0.1667em}\mathrm{d}t+{x_{N}^{\eta }}{\int _{a}^{\infty }}{t^{\eta }}Q({x_{N}}t){g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\\ {} & & \displaystyle \hspace{2em}\le {x_{N}^{-\eta }}P({x_{N}})(1+\varepsilon ){\int _{0}^{a}}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t+{x_{N}^{\eta }}Q({x_{N}})(1+\varepsilon ){\int _{a}^{\infty }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\\ {} & & \displaystyle \hspace{2em}=(1+\varepsilon )L({x_{N}}){\int _{0}^{\infty }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\end{array}\]

and, analogously, ${\textstyle\int _{0}^{\infty }}L({x_{N}}t){g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\ge (1-\varepsilon )L({x_{N}}){\textstyle\int _{0}^{\infty }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t$. □

Corollary 1.

Let $\varphi :(0,\infty )\to (0,\infty )$ be regularly varying at 0 (or ∞) with index $\gamma \in \mathbb{R}$ and let ${({x_{N}})_{N\in \mathbb{N}}}$ be a sequence of positive real numbers satisfying ${x_{N}}\to 0$ (or ${x_{N}}\to \infty $) as $N\to \infty $. Furthermore, let ${f_{N}}:(0,\infty )\to [0,\infty )$, $N\in \mathbb{N}$, be functions such that $0<{\textstyle\int _{0}^{\infty }}{t^{\eta }}{f_{N}}(t)\hspace{0.1667em}\mathrm{d}t<\infty $ for all $N\in \mathbb{N}$ and all η in some neighborhood of γ, i.e. for all $\eta \in (\gamma -\varepsilon ,\gamma +\varepsilon )$ for some $\varepsilon >0$. Then, as $N\to \infty $,

\[ {\int _{0}^{\infty }}\varphi ({x_{N}}t){f_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\sim \hspace{2.5pt}\varphi ({x_{N}}){\int _{0}^{\infty }}{t^{\gamma }}{f_{N}}(t)\hspace{0.1667em}\mathrm{d}t.\]

Proof.

Define $L(x):={x^{-\gamma }}\varphi (x)$ for $x>0$, and ${g_{N}}(t):={t^{\gamma }}{f_{N}}(t)$ for $t>0$. Choose $\eta :=\varepsilon /2>0$. Then, for any $a>0$,

\[ {\int _{0}^{a}}{t^{-\eta }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{0}^{a}}{t^{\gamma -\eta }}{f_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\le \hspace{2.5pt}{\int _{0}^{\infty }}{t^{\gamma -\eta }}{f_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}<\hspace{2.5pt}\infty \]

by assumption and as well

\[ {\int _{a}^{\infty }}{t^{\eta }}{g_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}=\hspace{2.5pt}{\int _{a}^{\infty }}{t^{\gamma +\eta }}{f_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}\le \hspace{2.5pt}{\int _{0}^{\infty }}{t^{\gamma +\eta }}{f_{N}}(t)\hspace{0.1667em}\mathrm{d}t\hspace{2.5pt}<\hspace{2.5pt}\infty \]

by assumption. Thus, Lemma 5 is applicable, which yields the result. □

Acknowledgments

The authors thank two anonymous referees for their useful reports leading to a significant improvement of the proofs of parts (iv) and (v) of Theorem 1.

References

[1]

Athreya, K.B.: Rates of decay for the survival probability of a mutant gene. J. Math. Biol. 30(6), 577–581 (1992). MR1173109

[2]

Bingham, N.H., Goldie, C.M., Teugels, J.L.: Regular Variation. Cambridge University Press, Cambridge (1987). MR0898871

[3]

Bingham, N.H., Doney, R.A.: Asymptotic properties of supercritical branching processes. I. The Galton–Watson process. Adv. Appl. Probab. 6(4), 711–731 (1974). MR0362525

[4]

Boenkost, F., González Casanova, A., Pokalyuk, C., Wakolbinger, A.: Haldane’s formula in Cannings models: the case of moderately weak selection. Electron. J. Probab. 26 (Paper no. 4), 1–36 (2021). MR4216517

[5]

Boenkost, F., González Casanova, A., Pokalyuk, C., Wakolbinger, A.: Haldane’s formula in Cannings models: the case of moderately strong selection. J. Math. Biol. 83(6–7), Article number 70 (2021). 31 pages. MR4348421

[6]

Cannings, C.: The latent roots of certain Markov chains arising in genetics: a new approach. I. Haploid models. Adv. Appl. Probab. 6(2), 260–290 (1974). MR0343949

[7]

Cannings, C.: The latent roots of certain Markov chains arising in genetics: a new approach. II. Further haploid models. Adv. Appl. Probab. 7(2), 264–282 (1975). MR0371430

[8]

Cordero, F., González Casanova, A., Schweinsberg, J., Wilke-Berenguer, M.: Λ-coalescents arising in populations with dormancy (2020). arXiv preprint 2009.09418

[9]

Cortines, A.: The genealogy of a solvable population model under selection with dynamics related to directed polymers. Bernoulli 22(4), 2209–2236 (2016). MR3498028

[10]

Dembo, A., Zeitouni, O.: Large Deviations Techniques and Applications. Springer (2010). MR2571413

[11]

Eldon, B., Wakeley, J.: Coalescent processes when the distribution of offspring number among individuals is highly skewed. Genetics 172(4), 2621–2633 (2006)

[12]

Feller, W.: An Introduction to Probability Theory and Its Applications. Vol. II, 2nd edn. Wiley, New York (1971). MR0270403

[13]

Griffiths, R.C., Spanò, D.: Orthogonal polynomial kernels and canonical correlations for Dirichlet measures. Bernoulli 19(2), 548–598 (2013). MR3037164

[14]

Haldane, J.B.S.: A mathematical theory of neutral and artificial selection, Part V. Selection and mutation. Proc. Camb. Philol. Soc. 23(7), 838–844 (1927)

[15]

Huillet, T.: Pareto genealogies arising from a Poisson branching evolution model with selection. J. Math. Biol. 68(3), 727–761 (2014). MR3152761

[16]

Huillet, T., Möhle, M.: Population genetics models with skewed fertilities: a forward and backward analysis. Stoch. Models 27(3), 521–554 (2011). MR2827443

[17]

Huillet, T., Möhle, M.: Correction on ‘Population genetics models with skewed fertilities: a forward and backward analysis’. Stoch. Models 28(3), 527–532 (2012). MR2959453

[18]

Huillet, T., Möhle, M.: On the extended Moran model and its relation to coalescents with multiple collisions. Theor. Popul. Biol. 87, 5–14 (2013)

[19]

Huillet, T., Möhle, M.: Asymptotics of symmetric compound Poisson population models. Comb. Probab. Comput. 24(1), 216–253 (2015). MR3318045

[20]

Karamata, J.: Neuer Beweis und Verallgemeinerung der Tauberschen Sätze, welche die Laplacesche und Stieltjessche Transformation betreffen. J. Reine Angew. Math. 164, 27–39 (1931). MR1581248

[21]

Karamata, J.: Neuer Beweis und Verallgemeinerung einiger Tauberian-Sätze. Math. Z. 33(1), 294–299 (1931). MR1545213

[22]

Karamata, J.: Some theorems concerning slowly varying functions (1962). Technical Report #369, The Univ. of Wisconsin

[23]

Karlin, S., McGregor, J.: Direct product branching processes and related Markov chains. Proc. Natl. Acad. Sci. USA 51, 598–602 (1964). MR0163362

[24]

Karlin, S., McGregor, J.: Direct product branching processes and related induced Markov chains. I. Calculations of rates of approach to homozygosity. In: Proc. Internat. Res. Sem., Statist. Lab., Univ. California, Berkeley, Calif., 1963, pp. 111–145. Springer (1965). MR0217892

[25]

Kingman, J.F.C.: The coalescent. Stoch. Process. Appl. 13(3), 235–248 (1982). MR0671034

[26]

Kingman, J.F.C.: Exchangeability and the evolution of large populations. In: Exchangeability in Probability and Statistics (Rome, 1981), pp. 97–112. North-Holland, Amsterdam–New York (1982). MR675968

[27]

Kingman, J.F.C.: On the genealogy of large populations. J. Appl. Probab. Special Vol. 19A, 27–43 (1982). Essays in statistical science. MR633178

[28]

Kozubowski, T.J., Podgórski, K.: A generalized Sibuya distribution. Ann. Inst. Stat. Math. 70(4), 855–887 (2018). MR3830290

[29]

Möhle, M.: Total variation distances and rates of convergence for ancestral coalescent processes in exchangeable population models. Adv. Appl. Probab. 32(4), 983–993 (2000). MR1808909

[30]

Möhle, M.: On sampling distributions for coalescent processes with simultaneous multiple collisions. Bernoulli 12(1), 35–53 (2006). MR2202319

[31]

Möhle, M.: Asymptotic results for coalescent processes without proper frequencies and applications to the two-parameter Poisson–Dirichlet coalescent. Stoch. Process. Appl. 120(11), 2159–2173 (2010). MR2684740

[32]

Möhle, M., Sagitov, S.: A classification of coalescent processes for haploid exchangeable population models. Ann. Probab. 29(4), 1547–1562 (2001). MR1880231

[33]

Pitman, J.: Coalescents with multiple collisions. Ann. Probab. 27(4), 1870–1902 (1999). MR1742892

[34]

Sagitov, S.: The general coalescent with asynchronous mergers of ancestral lines. J. Appl. Probab. 36(4), 1116–1125 (1999). MR1742154

[35]

Schweinsberg, J.: Coalescents with simultaneous multiple collisions. Electron. J. Probab. 5 (Paper no. 12), 1–50 (2000). MR1781024

[36]

Schweinsberg, J.: A necessary and sufficient condition for the Λ-coalescent to come down from infinity. Electron. Commun. Probab. 5, 1–11 (2000). MR1736720

[37]

Schweinsberg, J.: Coalescent processes obtained from supercritical Galton–Watson processes. Stoch. Process. Appl. 106(1), 107–139 (2003). MR1983046

[38]

Yule, G.U.: A mathematical theory of evolution based on the conclusions of Dr. J.C. Willis, F.R.S. Philos. Trans. R. Soc. Lond. Ser. B 213, 21–87 (1925)

Reading mode

Table of contents

1 Introduction
2 Results
3 Examples
4 Proofs
A Appendix
Acknowledgments
References

Open access article under the CC BY license.

Keywords

Cannings model exchangeable coalescent regularly varying function simultaneous multiple collisions weak convergence 60J90 92D15

Metrics

since March 2018

647

Article info
views

527

Full article
views

475

PDF
downloads

179

XML
downloads

RSS

Tables
1
Theorems
1

Table 1.

Asymptotics of the ancestry of mixed multinomial Cannings models of the form (1) under the tail condition $\mathbb{P}(X>x)\sim {x^{-\alpha }}\ell (x)$ as $x\to \infty $

Theorem 1.

Table 1.

Asymptotics of the ancestry of mixed multinomial Cannings models of the form (1) under the tail condition $\mathbb{P}(X>x)\sim {x^{-\alpha }}\ell (x)$ as $x\to \infty $

Condition	Limiting coalescent	Coalescence probability
$\mathbb{E}({X^{2}})<\infty $	Kingman	$\sim \displaystyle\frac{\rho }{{\mu ^{2}}N}$
$\alpha =2$	Kingman	$\sim \displaystyle\frac{2{\ell ^{\ast }}(N)}{{\mu ^{2}}N}$
$1<\alpha <2$	$\beta (2-\alpha ,\alpha )$	$\sim \displaystyle\frac{\Gamma (2-\alpha )\Gamma (\alpha +1)\ell (N)}{{\mu ^{\alpha }}{N^{\alpha -1}}}$
$\alpha =1$	Bolthausen–Sznitman	$\sim \displaystyle\frac{\ell ({a_{N}})}{{\ell ^{\ast }}({a_{N}})}\sim \displaystyle\frac{N\ell ({a_{N}})}{{a_{N}}}$
$\alpha \in (0,1)$	discrete time $\mathrm{PD}(\alpha ,0$)	$\sim 1-\alpha $
$\alpha =0$	discrete time star-shaped	$\sim 1$

Theorem 1.

For the Cannings model with the offspring distribution (1) the following assertions hold.

(i) If $\mathbb{E}({X^{2}})<\infty $ (in particular if (6) holds with $\alpha >2$) then the model is in the domain of attraction of the continuous-time Kingman coalescent and the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \rho /({\mu ^{2}}N)$ as $N\to \infty $, where $\mu :=\mathbb{E}(X)$ and $\rho :=\mathbb{E}({X^{2}})$.
(ii) If (6) holds with $\alpha =2$ then the model is in the domain of attraction of the continuous-time Kingman coalescent and the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim 2{\ell ^{\ast }}(N)/({\mu ^{2}}N)$ as $N\to \infty $, where $\mu :=\mathbb{E}(X)$ and ${\ell ^{\ast }}$ is defined via (7).
(iii) If (6) holds with $\alpha \in (1,2)$ then the model is in the domain of attraction of the continuous-time Λ-coalescent with $\Lambda :=\beta (2-\alpha ,\alpha )$ being the beta distribution with parameters $2-\alpha $ and α. Moreover, the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \alpha \mathrm{B}(2-\alpha ,\alpha ){\mu ^{-\alpha }}\ell (N)/{N^{\alpha -1}}=\Gamma (2-\alpha )\Gamma (\alpha +1){\mu ^{-\alpha }}\ell (N)/{N^{\alpha -1}}$ as $N\to \infty $, where $\mu :=\mathbb{E}(X)$.
(iv) If (6) holds with $\alpha =1$, then the model is in the domain of attraction of the continuous-time Bolthausen–Sznitman coalescent. If ${({a_{N}})_{N\in \mathbb{N}}}$ is a sequence of positive real numbers satisfying ${\ell ^{\ast }}({a_{N}})\sim {a_{N}}/N$ as $N\to \infty $, where ${\ell ^{\ast }}$ is defined via (7), then the coalescence probability ${c_{N}}$ satisfies ${c_{N}}\sim \ell ({a_{N}})/{\ell ^{\ast }}({a_{N}})\sim N\ell ({a_{N}})/{a_{N}}$ as $N\to \infty $.
(v) If (6) holds with $\alpha \in (0,1)$, then the model is in the domain of attraction of the discrete-time Ξ-coalescent, where the characterizing measure $\nu (\mathrm{d}x):=\Xi (\mathrm{d}x)/{\textstyle\sum _{i=1}^{\infty }}{x_{i}^{2}}$ is the Poisson–Dirichlet distribution $\nu =\mathrm{PD}(\alpha ,0)$ with parameters α and $\theta :=0$. The coalescence probability satisfies ${c_{N}}\to 1-\alpha $ as $N\to \infty $.
(vi) If (6) holds with $\alpha =0$, then the model is in the domain of attraction of the discrete-time star-shaped coalescent and the coalescence probability satisfies ${c_{N}}\to 1$ as $N\to \infty $.

In particular, for the first four cases (i)–(iv), ${c_{N}}\to 0$ as $N\to \infty $.

Authors

Abstract

1 Introduction

(1)

(2)

(3)

(4)

(5)

Definition 1.

2 Results

(6)

(7)

Theorem 1.

Remark 1.

Remark 2.

Table 1.

3 Examples

Example 1 (Pareto distribution).

Example 2.

Example 3 (Yule–Simon distribution).

Example 4 (Sibuya distribution).

Example 5.

Example 6.

4 Proofs

Lemma 1.

Proof.

Poof of Theorem 1 (i).

Lemma 2.

Proof.

(8)

Lemma 3.

(9)

(10)

(11)

Remark 3.

Proof.

Proof of Theorem 1 (iii).

(12)

(13)

Proof of Theorem 1 (ii).

(14)

(15)

Proof of Theorem 1 (v).

(16)

(17)

(18)

(19)

Proof of Theorem 1 (iv).

(20)

(21)

(22)

Remark 4.

Proof of Theorem 1 (vi).

(23)

(24)

A Appendix

Lemma 4.

Remark 5.

Proof of Lemma 4.

Lemma 5.

Proof.

Corollary 1.

Proof.

Acknowledgments

References

Export citation

Copy and paste formatted citation

Download citation in file

Table 1.

Theorem 1.