The two analytic leaves of the pointwise Shannon–McMillan–Breiman theorem #
This file discharges the two remaining analytic leaves of the pointwise SMB theorem set up in
Oseledets.Krieger.SMBPointwise, making the headline a.e. convergence
(1/n)·iₙ(x) → h(P,T) unconditional (given only the measure-algebra Breiman telescoping R2,
which is not an analytic leaf).
Leaf 1 (Chung stopping-time tail).
chungTail: for each cellPᵢandλ > 0,μ {x ∈ Pᵢ | λ < g* x} ≤ ofReal e^{−λ}, whereg* = ⨆ₖ gₖis the Chung maximal information function andgₖ = condInfoFun 𝒞ₖ Palong the increasing past filtration𝒞ₖ = condLevelSigma. The argument is a Doob/Markov bound on the conditional-probability martingalepₖ = μ⟦Pᵢ | 𝒞ₖ⟧along the first-passage timeτ = inf{k : pₖ < e^{−λ}}(MeasureTheory.setIntegral_condExpon each{τ = k} ∈ 𝒞ₖ). This delivers, via the layer-cake formula, the hypothesishlayeroflintegral_condInfoMaxFun_le_of_layer, henceg* ∈ L¹(lintegral_condInfoMaxFun_lt_top).Leaf 2 (Maker/Breiman dominated-Cesàro).
makerTail: fromg* ∈ L¹and the a.e. Lévy limitgₖ → g∞, the Cesàro tail(1/n)∑_{j<n}(g_{n−j} − g∞)(Tʲx) → 0a.e. This is Maker's ergodic lemma, proved here by the standard ε/truncation split: asup-tailGₙ = ⨆_{k≥n}|F_k| ↓ 0dominated by2g* ∈ L¹, a Birkhoff average for the bounded part, and the orbital decaym⁻¹·h(T^[m]x) → 0(Oseledets.ae_tendsto_orbit_div_atTop_zero) for the small-lag part.
Assembling Leaf 2 into ae_tendsto_div_infoFun_of_tail yields the unconditional pointwise SMB
ae_tendsto_div_infoFun (parameterized by the abstract information function together with its
Breiman telescoping), and its in-measure / equipartition corollaries.
References #
- L. Breiman, The individual ergodic theorem of information theory, Ann. Math. Statist. 28 (1957), 809–811; correction 31 (1960), 809–810.
- K. L. Chung, A note on the ergodic theorem of information theory, Ann. Math. Statist. 32 (1961), 612–614.
- P. C. Shields, The Ergodic Theory of Discrete Sample Paths, GSM 13, AMS (1996), §3.1 (Maker).
- M. Einsiedler, E. Lindenstrauss, T. Ward, Entropy in Ergodic Theory and Topological Dynamics, Ch. 2 (SMB).
Leaf 1: the Chung stopping-time tail estimate #
For a fixed cell A = P.cells i₀ and λ > 0, the maximal information function g* exceeds λ
only on a set of measure ≤ e^{−λ} inside A. The proof tracks the conditional-probability
martingale pₖ(x) = (condExpKernel μ 𝒞ₖ x A).toReal (=ᵐ μ⟦A | 𝒞ₖ⟧), and the first-passage time
τ(x) = inf{k : pₖ(x) < e^{−λ}}. Each first-passage stratum {τ = k} is 𝒞ₖ-measurable, so
μ(A ∩ {τ = k}) = ∫_{τ=k} 𝟙_A = ∫_{τ=k} pₖ ≤ e^{−λ}·μ{τ = k} by setIntegral_condExp; summing the
disjoint strata gives μ(A ∩ {g* > λ}) ≤ e^{−λ}·∑ₖ μ{τ=k} ≤ e^{−λ}.
On the "good" part of a cell A = P.cells i₀ — points lying in no other cell — the
conditional information function is just -log of the conditional probability of A:
condInfoFun 𝒜 P x = -log (condExpKernel μ 𝒜 x A).toReal. Only the i₀-indicator survives.
The a.e. set on which each point lies in exactly one cell of P: it is in its own cell and
no other. Off a null set (the pairwise a.e.-overlaps), every point lies in a unique cell.
Leaf 1 (Chung's stopping-time tail). For each cell Pᵢ₀ and λ > 0, the Chung maximal
information function g* = condInfoMaxFun exceeds λ only on a subset of Pᵢ₀ of measure
≤ e^{−λ}:
μ {x ∈ Pᵢ₀ | ofReal λ < g* x} ≤ ofReal e^{−λ}.
The point x ∈ Pᵢ₀ (in no other cell, a.e.) with g* x > λ has some level gₖ(x) > λ, i.e.
-log pₖ(x) > λ, i.e. pₖ(x) < e^{−λ} (and pₖ(x) > 0, since -log 0 = 0 ≯ λ). So the level set
sits inside Pᵢ₀ ∩ ⋃ₖ {τ = k}, whose measure is ≤ e^{−λ} by the first-passage union bound.
The layer-cake bridge hlayer. Summing the per-cell Chung tail through the layer-cake
formula bounds the maximal information L¹-norm by the per-cell tail integral:
∫⁻ g* ≤ ∫⁻ t in Ioi 0, ∑ᵢ min(μ Pᵢ, e^{−t}).
The supremum g* = ⨆ₙ ofReal Gₙ of the real partial maxima Gₙ is reached by monotone convergence;
each ∫⁻ ofReal Gₙ equals ∫⁻ t in Ioi 0, μ{t < Gₙ} (real layer cake), and the integrand is
bounded by μ{ofReal t < g*} ≤ ∑ᵢ min(μ Pᵢ, e^{−t}) (measure_superlevel_le) uniformly in n.
g* ∈ L¹. Combining the layer-cake bridge with the proved tail-integral bound
lintegral_tail_sum_le (≤ H(P) + 1), the Chung maximal information function is integrable.
Leaf 2: the Maker/Breiman dominated-Cesàro vanishing #
From g* ∈ L¹ (Leaf 1) and the a.e. Lévy limit gₖ → g∞, the Cesàro tail
(1/n)∑_{j<n}(g_{n−j} − g∞)(Tʲx) → 0 a.e. This is Maker's ergodic lemma.
The argument: with Fk k = gk k − g∞ (so Fk k → 0 a.e., dominated by D = g*ℝ + g∞ ∈ L¹), set
the antitone sup-tail Ψ_N = ⨆_{k≥N}|Fk k| (Ψ_N ↓ 0 a.e., Ψ_N ≤ D, so ∫ Ψ_N → 0 by
dominated convergence). Splitting ∑_{j<n}|Fk(n−j)(Tʲx)| at lag n−j ≷ N:
the large-lag part is ≤ birkhoffAverage Ψ_N n x → ∫ Ψ_N (Birkhoff); the small-lag part is the last
N terms |Fk(m)(T^{n−m}x)|/n with m ≤ N, each → 0 by the orbital decay
ae_tendsto_orbit_div_atTop_zero. Hence limsup ≤ ∫ Ψ_N → 0.
The Maker dominated-Cesàro core #
Leaf 2 (Maker/Breiman dominated-Cesàro). For ergodic T, the Cesàro tail of the Breiman
telescoping vanishes a.e.:
(1/n)·∑_{j<n}(g_{n−j} − g∞)(Tʲx) → 0.
Assembling the unconditional pointwise SMB #
With both analytic leaves discharged — Leaf 1 gives g* ∈ L¹ (lintegral_condInfoMaxFun_lt_top),
Leaf 2 gives the dominated-Cesàro vanishing (makerTail) — the Cesàro tail hypothesis of
ae_tendsto_div_infoFun_of_tail is now proved. The Breiman tail decomposition
(1/n)·iₙ(x) − A_n(g∞)(x) = (1/n)∑_{j<n}(g_{n−j} − g∞)(Tʲx) (which holds a.e. via the
measure-algebra telescoping infoWeight_succ_eq, recorded here as the hypothesis hbreiman) then
yields the unconditional pointwise SMB (1/n)·iₙ(x) → h(P,T).
Leaf 2 in the form consumed by ae_tendsto_div_infoFun_of_tail. The Cesàro tail of the
Breiman split — the difference between the information average (1/n)∑_{j<n} g_{n−j}(Tʲx) and the
limit Birkhoff average A_n(g∞) — vanishes a.e.
Pointwise Shannon–McMillan–Breiman (unconditional, given the Breiman telescoping R2).
For ergodic T and any sequence iₙ satisfying the a.e. Breiman telescoping
iₙ(x) = ∑_{j<n} g_{n−j}(Tʲx) (hbreiman), the information averages (1/n)·iₙ(x) converge
μ-a.e. to the Kolmogorov–Sinai entropy h(P,T) = ksEntropyPartition hT P.
Both analytic leaves are now proved: Leaf 1 (lintegral_condInfoMaxFun_lt_top, g* ∈ L¹) and
Leaf 2 (ae_tendsto_breiman_tail, the Maker dominated-Cesàro tail). The only remaining input is
the measure-algebra telescoping hbreiman (Breiman's R2, not an analytic leaf).
In-measure / upper-equipartition corollary. For ergodic T and the Breiman telescoping,
for every δ > 0 the measure of {x : (1/n)·iₙ(x) > h(P,T) + δ} tends to 0: a.e. convergence
(ae_tendsto_div_infoFun) implies convergence in measure of the deviation set.