Monotonicity of conditional entropy under refinement of the σ-algebra #
This file adds the monotonicity of conditional entropy to the conditional-entropy milestone
(GitHub issue #13), continuing Oseledets.Entropy.CondPartition (which defines condEntropy μ 𝒜 s
as the μ-average of the pointwise entropy against the regular conditional probability
condExpKernel μ 𝒜 ω).
The single result is the conditional generalization of condEntropy_le ("conditioning does not
increase entropy"): conditioning on a finer sub-σ-algebra reduces conditional entropy.
condEntropy_mono_of_le: if𝒜 ≤ ℬ ≤ mαthenH(P | ℬ) ≤ H(P | 𝒜).
The proof runs the same Jensen argument as condEntropy_le, but with the conditional Jensen
inequality (ConcaveOn.condExp_map_le) against the coarser σ-algebra 𝒜 in place of the
unconditional one, combined with the tower property of conditional expectation
(condExp_condExp_of_le). Term by term over the cells Pᵢ, writing
f_i ω = (condExpKernel μ ℬ ω Pᵢ).toReal and g_i ω = (condExpKernel μ 𝒜 ω Pᵢ).toReal:
- the kernel-to-condExp links (
condExpKernel_ae_eq_condExp) givef_i =ᵐ μ⟦Pᵢ | ℬ⟧andg_i =ᵐ μ⟦Pᵢ | 𝒜⟧, and the tower property givesμ⟦f_i | 𝒜⟧ =ᵐ g_i; - conditional Jensen for the concave
negMulLoggivesμ⟦negMulLog ∘ f_i | 𝒜⟧ ≤ᵐ negMulLog ∘ μ⟦f_i | 𝒜⟧ =ᵐ negMulLog ∘ g_i; - integrating and using
integral_condExp(theμ-average of a conditional expectation is theμ-average of the function) turns the left side back into∫ negMulLog ∘ f_i ∂μ, giving∫ negMulLog ∘ f_i ∂μ ≤ ∫ negMulLog ∘ g_i ∂μ.
Summing over the finite partition yields the claim.
Main results #
Oseledets.Entropy.condEntropy_mono_of_le: conditioning on a finer σ-algebra reduces conditional entropy.
References #
- François Le Maître, Notes on the Kolmogorov–Sinai theorem (2017), §1.
- Peter Walters, An Introduction to Ergodic Theory, Springer GTM 79, Chapter 4.
Monotonicity of conditional entropy under refinement. Conditioning on the finer
sub-σ-algebra ℬ ⊇ 𝒜 produces the smaller conditional entropy: for any finite measurable partition
P of the probability space and any sub-σ-algebras 𝒜 ≤ ℬ ≤ mα,
H(P | ℬ) ≤ H(P | 𝒜).
This is the conditional generalization of condEntropy_le (which is the case 𝒜 = ⊥). The proof
applies the conditional Jensen inequality (ConcaveOn.condExp_map_le) for the concave negMulLog
against 𝒜, term by term over the cells Pᵢ. Writing f_i ω = (condExpKernel μ ℬ ω Pᵢ).toReal,
the tower property condExp_condExp_of_le identifies the inner 𝒜-conditional expectation
μ⟦f_i | 𝒜⟧ with the coarser kernel mass (condExpKernel μ 𝒜 ω Pᵢ).toReal; integrating the
resulting pointwise Jensen bound and using integral_condExp gives the termwise inequality, which
sums to the claim.