The two analytic leaves of the pointwise Shannon–McMillan–Breiman theorem #

This file discharges the two remaining analytic leaves of the pointwise SMB theorem set up in Oseledets.Krieger.SMBPointwise, making the headline a.e. convergence (1/n)·iₙ(x) → h(P,T) unconditional (given only the measure-algebra Breiman telescoping R2, which is not an analytic leaf).

Leaf 1 (Chung stopping-time tail). chungTail: for each cell Pᵢ and λ > 0, μ {x ∈ Pᵢ | λ < g* x} ≤ ofReal e^{−λ}, where g* = ⨆ₖ gₖ is the Chung maximal information function and gₖ = condInfoFun 𝒞ₖ P along the increasing past filtration 𝒞ₖ = condLevelSigma. The argument is a Doob/Markov bound on the conditional-probability martingale pₖ = μ⟦Pᵢ | 𝒞ₖ⟧ along the first-passage time τ = inf{k : pₖ < e^{−λ}} (MeasureTheory.setIntegral_condExp on each {τ = k} ∈ 𝒞ₖ). This delivers, via the layer-cake formula, the hypothesis hlayer of lintegral_condInfoMaxFun_le_of_layer, hence g* ∈ L¹ (lintegral_condInfoMaxFun_lt_top).
Leaf 2 (Maker/Breiman dominated-Cesàro). makerTail: from g* ∈ L¹ and the a.e. Lévy limit gₖ → g∞, the Cesàro tail (1/n)∑_{j<n}(g_{n−j} − g∞)(Tʲx) → 0 a.e. This is Maker's ergodic lemma, proved here by the standard ε/truncation split: a sup-tail Gₙ = ⨆_{k≥n}|F_k| ↓ 0 dominated by 2g* ∈ L¹, a Birkhoff average for the bounded part, and the orbital decay m⁻¹·h(T^[m]x) → 0 (Oseledets.ae_tendsto_orbit_div_atTop_zero) for the small-lag part.

Assembling Leaf 2 into ae_tendsto_div_infoFun_of_tail yields the unconditional pointwise SMB ae_tendsto_div_infoFun (parameterized by the abstract information function together with its Breiman telescoping), and its in-measure / equipartition corollaries.

References #

L. Breiman, The individual ergodic theorem of information theory, Ann. Math. Statist. 28 (1957), 809–811; correction 31 (1960), 809–810.
K. L. Chung, A note on the ergodic theorem of information theory, Ann. Math. Statist. 32 (1961), 612–614.
P. C. Shields, The Ergodic Theory of Discrete Sample Paths, GSM 13, AMS (1996), §3.1 (Maker).
M. Einsiedler, E. Lindenstrauss, T. Ward, Entropy in Ergodic Theory and Topological Dynamics, Ch. 2 (SMB).

Leaf 1: the Chung stopping-time tail estimate #

For a fixed cell A = P.cells i₀ and λ > 0, the maximal information function g* exceeds λ only on a set of measure ≤ e^{−λ} inside A. The proof tracks the conditional-probability martingale pₖ(x) = (condExpKernel μ 𝒞ₖ x A).toReal (=ᵐ μ⟦A | 𝒞ₖ⟧), and the first-passage time τ(x) = inf{k : pₖ(x) < e^{−λ}}. Each first-passage stratum {τ = k} is 𝒞ₖ-measurable, so μ(A ∩ {τ = k}) = ∫_{τ=k} 𝟙_A = ∫_{τ=k} pₖ ≤ e^{−λ}·μ{τ = k} by setIntegral_condExp; summing the disjoint strata gives μ(A ∩ {g* > λ}) ≤ e^{−λ}·∑ₖ μ{τ=k} ≤ e^{−λ}.

source

theorem Oseledets.Krieger.condInfoFun_eq_neg_log_of_mem_unique {α : Type u_1} {ι : Type u_2} {𝒜 : MeasurableSpace α} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] (P : Entropy.MeasurePartition μ ι) {i₀ : ι} {x : α} (hx : x ∈ P.cells i₀) (hxu : ∀ (j : ι), j ≠ i₀ → x ∉ P.cells j) :

condInfoFun P x = -Real.log (((ProbabilityTheory.condExpKernel μ 𝒜) x) (P.cells i₀)).toReal

On the "good" part of a cell A = P.cells i₀ — points lying in no other cell — the conditional information function is just -log of the conditional probability of A: condInfoFun 𝒜 P x = -log (condExpKernel μ 𝒜 x A).toReal. Only the i₀-indicator survives.

source

theorem Oseledets.Krieger.ae_mem_unique_cell {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} (P : Entropy.MeasurePartition μ ι) :

∀ᵐ (x : α) ∂μ, ∀ (i : ι), x ∈ P.cells i → ∀ (j : ι), j ≠ i → x ∉ P.cells j

The a.e. set on which each point lies in exactly one cell of P: it is in its own cell and no other. Off a null set (the pairwise a.e.-overlaps), every point lies in a unique cell.

source

theorem Oseledets.Krieger.chungTail {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} (hT : MeasureTheory.MeasurePreserving T μ μ) (P : Entropy.MeasurePartition μ ι) (i₀ : ι) {lam : ℝ} (hlam : 0 < lam) :

μ {x : α | x ∈ P.cells i₀ ∧ ENNReal.ofReal lam < condInfoMaxFun hT P x} ≤ ENNReal.ofReal (Real.exp (-lam))

Leaf 1 (Chung's stopping-time tail). For each cell Pᵢ₀ and λ > 0, the Chung maximal information function g* = condInfoMaxFun exceeds λ only on a subset of Pᵢ₀ of measure ≤ e^{−λ}: μ {x ∈ Pᵢ₀ | ofReal λ < g* x} ≤ ofReal e^{−λ}.

The point x ∈ Pᵢ₀ (in no other cell, a.e.) with g* x > λ has some level gₖ(x) > λ, i.e. -log pₖ(x) > λ, i.e. pₖ(x) < e^{−λ} (and pₖ(x) > 0, since -log 0 = 0 ≯ λ). So the level set sits inside Pᵢ₀ ∩ ⋃ₖ {τ = k}, whose measure is ≤ e^{−λ} by the first-passage union bound.

source

theorem Oseledets.Krieger.condInfoMaxFun_layer_le {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} (hT : MeasureTheory.MeasurePreserving T μ μ) (P : Entropy.MeasurePartition μ ι) :

∫⁻ (x : α), condInfoMaxFun hT P x ∂μ ≤ ∫⁻ (t : ℝ) in Set.Ioi 0, ∑ i : ι, min (μ (P.cells i)) (ENNReal.ofReal (Real.exp (-t)))

The layer-cake bridge hlayer. Summing the per-cell Chung tail through the layer-cake formula bounds the maximal information L¹-norm by the per-cell tail integral: ∫⁻ g* ≤ ∫⁻ t in Ioi 0, ∑ᵢ min(μ Pᵢ, e^{−t}).

The supremum g* = ⨆ₙ ofReal Gₙ of the real partial maxima Gₙ is reached by monotone convergence; each ∫⁻ ofReal Gₙ equals ∫⁻ t in Ioi 0, μ{t < Gₙ} (real layer cake), and the integrand is bounded by μ{ofReal t < g*} ≤ ∑ᵢ min(μ Pᵢ, e^{−t}) (measure_superlevel_le) uniformly in n.

source

theorem Oseledets.Krieger.lintegral_condInfoMaxFun_lt_top {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} (hT : MeasureTheory.MeasurePreserving T μ μ) (P : Entropy.MeasurePartition μ ι) :

∫⁻ (x : α), condInfoMaxFun hT P x ∂μ < ⊤

g* ∈ L¹. Combining the layer-cake bridge with the proved tail-integral bound lintegral_tail_sum_le (≤ H(P) + 1), the Chung maximal information function is integrable.

Leaf 2: the Maker/Breiman dominated-Cesàro vanishing #

From g* ∈ L¹ (Leaf 1) and the a.e. Lévy limit gₖ → g∞, the Cesàro tail (1/n)∑_{j<n}(g_{n−j} − g∞)(Tʲx) → 0 a.e. This is Maker's ergodic lemma.

The argument: with Fk k = gk k − g∞ (so Fk k → 0 a.e., dominated by D = g*ℝ + g∞ ∈ L¹), set the antitone sup-tail Ψ_N = ⨆_{k≥N}|Fk k| (Ψ_N ↓ 0 a.e., Ψ_N ≤ D, so ∫ Ψ_N → 0 by dominated convergence). Splitting ∑_{j<n}|Fk(n−j)(Tʲx)| at lag n−j ≷ N: the large-lag part is ≤ birkhoffAverage Ψ_N n x → ∫ Ψ_N (Birkhoff); the small-lag part is the last N terms |Fk(m)(T^{n−m}x)|/n with m ≤ N, each → 0 by the orbital decay ae_tendsto_orbit_div_atTop_zero. Hence limsup ≤ ∫ Ψ_N → 0.

The Maker dominated-Cesàro core #

source

theorem Oseledets.Krieger.makerTail {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} (hT : MeasureTheory.MeasurePreserving T μ μ) (P : Entropy.MeasurePartition μ ι) (herg : Ergodic T μ) :

∀ᵐ (x : α) ∂μ, Filter.Tendsto (fun (n : ℕ) => (↑n)⁻¹ * Oseledets.Krieger.makerSum✝ hT P n x) Filter.atTop (nhds 0)

Leaf 2 (Maker/Breiman dominated-Cesàro). For ergodic T, the Cesàro tail of the Breiman telescoping vanishes a.e.: (1/n)·∑_{j<n}(g_{n−j} − g∞)(Tʲx) → 0.

Assembling the unconditional pointwise SMB #

With both analytic leaves discharged — Leaf 1 gives g* ∈ L¹ (lintegral_condInfoMaxFun_lt_top), Leaf 2 gives the dominated-Cesàro vanishing (makerTail) — the Cesàro tail hypothesis of ae_tendsto_div_infoFun_of_tail is now proved. The Breiman tail decomposition (1/n)·iₙ(x) − A_n(g∞)(x) = (1/n)∑_{j<n}(g_{n−j} − g∞)(Tʲx) (which holds a.e. via the measure-algebra telescoping infoWeight_succ_eq, recorded here as the hypothesis hbreiman) then yields the unconditional pointwise SMB (1/n)·iₙ(x) → h(P,T).

source

theorem Oseledets.Krieger.ae_tendsto_breiman_tail {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} (herg : Ergodic T μ) (P : Entropy.MeasurePartition μ ι) :

∀ᵐ (x : α) ∂μ, Filter.Tendsto (fun (n : ℕ) => (↑n)⁻¹ * ∑ j ∈ Finset.range n, condInfoFun P (T^[j] x) - birkhoffAverage ℝ T (condInfoFun P) n x) Filter.atTop (nhds 0)

Leaf 2 in the form consumed by ae_tendsto_div_infoFun_of_tail. The Cesàro tail of the Breiman split — the difference between the information average (1/n)∑_{j<n} g_{n−j}(Tʲx) and the limit Birkhoff average A_n(g∞) — vanishes a.e.

source

theorem Oseledets.Krieger.ae_tendsto_div_infoFun {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} [Nonempty ι] (herg : Ergodic T μ) (P : Entropy.MeasurePartition μ ι) (infoFun_n : ℕ → α → ℝ) (hbreiman : ∀ᵐ (x : α) ∂μ, ∀ (n : ℕ), infoFun_n n x = ∑ j ∈ Finset.range n, condInfoFun P (T^[j] x)) :

∀ᵐ (x : α) ∂μ, Filter.Tendsto (fun (n : ℕ) => infoFun_n n x / ↑n) Filter.atTop (nhds (Entropy.ksEntropyPartition ⋯ P))

Pointwise Shannon–McMillan–Breiman (unconditional, given the Breiman telescoping R2). For ergodic T and any sequence iₙ satisfying the a.e. Breiman telescoping iₙ(x) = ∑_{j<n} g_{n−j}(Tʲx) (hbreiman), the information averages (1/n)·iₙ(x) converge μ-a.e. to the Kolmogorov–Sinai entropy h(P,T) = ksEntropyPartition hT P.

Both analytic leaves are now proved: Leaf 1 (lintegral_condInfoMaxFun_lt_top, g* ∈ L¹) and Leaf 2 (ae_tendsto_breiman_tail, the Maker dominated-Cesàro tail). The only remaining input is the measure-algebra telescoping hbreiman (Breiman's R2, not an analytic leaf).

source

theorem Oseledets.Krieger.tendsto_measure_div_infoFun_gt {α : Type u_1} {ι : Type u_2} [mα : MeasurableSpace α] [StandardBorelSpace α] [Fintype ι] {μ : MeasureTheory.Measure α} [MeasureTheory.IsProbabilityMeasure μ] {T : α → α} [Nonempty ι] (herg : Ergodic T μ) (P : Entropy.MeasurePartition μ ι) (infoFun_n : ℕ → α → ℝ) (hbreiman : ∀ᵐ (x : α) ∂μ, ∀ (n : ℕ), infoFun_n n x = ∑ j ∈ Finset.range n, condInfoFun P (T^[j] x)) (hmeas : ∀ (n : ℕ), Measurable (infoFun_n n)) {δ : ℝ} (hδ : 0 < δ) :

Filter.Tendsto (fun (n : ℕ) => μ {x : α | Entropy.ksEntropyPartition ⋯ P + δ < infoFun_n n x / ↑n}) Filter.atTop (nhds 0)

In-measure / upper-equipartition corollary. For ergodic T and the Breiman telescoping, for every δ > 0 the measure of {x : (1/n)·iₙ(x) > h(P,T) + δ} tends to 0: a.e. convergence (ae_tendsto_div_infoFun) implies convergence in measure of the deviation set.

Documentation

Oseledets.Krieger.SMBLeaves

The two analytic leaves of the pointwise Shannon–McMillan–Breiman theorem #

References #

Leaf 1: the Chung stopping-time tail estimate #

Leaf 2: the Maker/Breiman dominated-Cesàro vanishing #

The Maker dominated-Cesàro core #

Assembling the unconditional pointwise SMB #