Gambler's ruin in Astro and the accuracy of Gaussian approximation [3]

In the previous posts, we calculated the distribution of the so-called “flight time” τα, interpreted as the (random) number of Astro card a player can scratch before loosing more than their initial investment α (typically α is €2), both in closed-form under a continuous time Gaussian setting and empirically using random simulations of the exact Astro loss random walk. These two approaches showed that αμ, where μ is the average loss of random walk step, was a good estimation of E[τα]; in the case of Astro, this estimate is E[τα]3.17. We now show that this estimation is actually correct. We will actually prove a more general result, namely a closed-form expression of the Laplace transform of τα, based on the method of martingales already used in the Gaussian approximation.

We recall that (ξt)tN denotes a i.i.d sequence of random Astro losses and let Xt=s=1tξs be the cumulative loss after tN steps, and that the flight time is defined by the equation τα=inf{tN,Xtα}.

What goes wrong in the method of martingales for discrete random walks?

In the continuous time Gaussian setting, we replaced τα and (Xt)tN with τ¯α={tR+,Ztα} and (Zt)tR+ satisfying the stochastic differential equation dZt=μdt+σdWt, driven by a Brownian motion W. Then for any λR, Mtλ=exp(λσZt(λ22+λμσ)t) defines a martingale and a standard application of Doob’s optional stopping theorem showed that E[Mτ¯αλ]=1. Now in general, this gives information on two random variables simultaneously: τ¯α and Zτ¯α. The whole trick is to disentangle the contribution of each of these variables and extract the Laplace transform of τ¯α, which is E[eβτ¯α] for β>0. In the continuous time setting, the almost sure continuity of tZt and the definition of τ¯α shows that Zτ¯α=α (almost surely), which is not random at all, thus revealing information directly on τ¯α.

By contrast, the discrete time counterpart Xτα does not satisfy a similar equality in general, but only the inequality Xταα. Worse, the distribution of Xτα can be quite difficult to characterise as there are potentially many different sequences of steps ξ1,,ξτα that result in crossing the barrier α. In a special case of step distribution however, Xτα takes a simple form, namely if the process X can only increase by a fixed step size, or decrease by multiples of the same step size. More precisely, let Δ>0, (pk)kN such that pk[0,1] for all kN and kNpk<1, and define the i.i.d. step sequence (ξ)tN by ξt={kΔwith probability pk,Δwith probability 1kNpk. Indeed, this forces the process X to move over the grid ΔZ, and since α>0 and X can only increase by Δ, this implies that Xτα can only take a single value corresponding to Xτα1<α and ξτα=Δ, i.e. Xτα=αΔΔ=:α+, where x denotes the ceil operator (smallest integer larger than x).

As it turns out, this rather specific property is satisfied by the Astro step distribution: ξt is equal to €2 minus the gain of the t-th ticket, and since all gains are multiple of €2, with a minimum of 0€, the above property holds with Δ equal to €2 and (pk)kN the corresponding gain probabilities.

Exponential supermartingale and expected flight time

Using the property that Xτα, the martingale method developed in the first part for the Gaussian approximation can be applied to the process X essentially unchanged. Let ξ denote a generic random variable following the Astro step distribution and define ψ:λR+logE[eλξ]. Then, for any λ>0, the process defined for tN by Mtλ=eXttψ(λ) is a martingale. Using the expression of the step size sequence, we have ψ(λ)=λΔ+log(1kNpk(1eλ(k+1)Δ)). Moreover, eψ(λ)=E[eλξ]eλμ by convexity and Jensen’s inequality, where μ=E[ξ]=Δ(1kN(1+k)pk). and hence ψ(λ)λμ>0.

The same arguments as for the Gaussian approximation (dominated convergence, Doob’s optional stopping theorem) apply and show that E[eψ(λ)τα]=eλα+. Noting that ψ is invertible and using the change of variable β=ψ(λ), we deduce that E[eβτα]=eψ1(β)α+, which is the expression of the Laplace transform of τα. To obtain the expected flight time E[τα], we differentiate with respect to β, which gives βE[eβτα]=E[τα]=(ψ1)(0)α+eψ1(0)α+=(ψ1)(0)α+, since ψ(0)=0.

The term (ψ1)(0) can be calculated using the inverse function rule, which yields (ψ1)(0)=1ψ(0). A direct calculation shows that ψ(λ)=Δ+ΔkN(k+1)pkeλ(k+1)Δ1kNpk(1eλ(k+1)Δ), and thus ψ(0)=Δ(1kN(k+1)pk)=μ. Going back to the expected flight time, we have E[τα]=α+μ. In particular for the Astro distribution, α=Δ=α+; in other words, E[τα]=E[τ¯α], i.e. the Gaussian approximation was actually correct in expectation!

Going further: higher moments

Thanks to the exact calculation of the Laplace transform of τα, it is possible to derive higher moments by differentiating E[eβτα] multiple times. A key observation is that the function ψ fully describes the cumulants (κn)n1 of the distribution of ξ, which are an alternative to moments (intuitively, the n-th cumulant is the component of the n-th moment that is “independent” of the previous moments; for instance, κ2 is the variance of ξ rather than E[ξ2]). More formally, the cumulants are defined by the power series expansion ψ(λ)=n=1+κnλnn!, in particular ψ(n)(0)=κn. Therefore, by successive differentiations of βeψ1(β)α+ or βψ1(β)α+, we can obtain the successive moments or cumulants of τα.

For instance, applying twice the inverse function rule yields (ψ1)(β)=ψ(λ)ψ(λ)3, and thus the variance of τα is V[τα]=α+σ2μ358.46, where σ20.67 is the standard deviation of ξ. As it turns out, this is also exactly the formula provided by the inverse Gaussian distribution IG(αμ,α2σ2), i.e. V[τα]=V[τ¯α]!

A natural question is whether higher moments/cumulants are also well approximated by the inverse Gaussian distribution. Intuitively, this should not be the case since the Gaussian distribution used in this approximation only matches the first two moments of ξ, which is also suggested by the empirical analysis performed in the second post. We confirm this by an explicit calculation of the third cumulant of τα.

Indeed, on the one hand, the third cumulant of the inverse Gaussian distribution is given by κ¯3=E[(τ¯ααμ)3]=3σ4αμ5. On the other hand, yet another application of the inverse function rule shows that (ψ1)(3)(β)=3ψ(λ)2ψ(λ)2ψ(λ)3ψ(3)(λ)ψ(λ)7, and thus κ3=3σ4α+μ5ψ(3)(0)α+μ4=κ¯3ψ(3)(0)α+μ4, where ψ(3)(0)=E[(ξμ)3] is the third cumulant of ξ. In other words, negative skewness in the true Astro distribution translates into higher skewness in the flight time distribution than the inverse Gaussian approximation suggests.

Conclusion

We can summarize the Gaussian approximation using the diagram below. Moving from the true Astro step variable ξ to its continuous time Gaussian approximation dZ preserves both mean and variance (as these are the two degrees of freedom of a Gaussian distribution), and this moment matching property is transferred at the level of flight times. However, this commutative diagram does not hold for higher order moments.

License

Copyright 2022-present Patrick Saux.

Related