Integration

Measurable transformations

Let `(Omega, F, P)` be a probability space. Then `f: Omega -> RR` is called a random variable if `X^(-1)(-oo, a] = {omega: X(omega} <= a} in F`. This is equivalent to the stronger condition that `X^(-1)(A) in F` for all `A in B(RR)`.

A mapping `T: Omega_1 -> Omega_2` between two probability spaces is measurable with respect to the `sigma`-algebras if `T^(-1)(A) in F_1 AA A in F_2`. (and will be measurable for any `barF_1 sup F_1` and `barF_2 sub F_2`).

In general, this is a difficult property to prove directly, but:

if `F_2 = sigma(:A:)` and `T^(-1)(A) in F_1` then T is `(: F_1, F_2 :)` measurable
and if `T_1: Omega_1 -> Omega_2` is `(: F_1, F_2 :)`-measurable, and `T:_2 Omega_2 -> Omega_3` is `(: F_2, F_3 :)`-measurable, then `T = T_1(T_2)` is `(:F_1, F_3:)`-measurable.
any continuous function is measurable (converse not true, but can find an approximation that is exact except over a set of small measure)
the identity function is measurable

And if `AA n in N`, `f_n : Omega -> barRR` is `(:F, B(barRR) :)` measurable, then:

`spr_n f_n`, `inf_n f_n`, `barlim f_n`, `ul lim f_n` are all measurable
`A = {omega: lim_(n->oo) f_n(omega) < oo}` lies in `F` and `h = [lim_(n->oo) f_n] * I_A` is measurable.

Let `{ f_lambda: lambda in Lambda }` be a family of mappings from `Omega_1 -> Omega_2`, and `F_2` a `sigma`-algebra on `Omega_2`, then `sigma(: {f_lambda^{-1} (A) : A in F_2, lambda in Lambda} :)` is called the `sigma`-algebra generated by `{f_lambda: lambda in Lambda}`. This is the smallest `sigma`-algebra such that all `f_lambda` are measurable.

Let `{f_lambda: lambda in Lambda}` be an uncountable collection of maps `Omega_1 -> Omega_2` then `AA B in sigma(: {f_lambda: lambda in Lambda} :)` then exists a countable set `Lambda_B in Lambda` st. `B in sigma(: {f_lambda: lambda in Lambda_B} :)` (ie. a countable cover)

Induced measures, distribution functions

Suppose X is an rv defined on `(Omega, F, P)`, then `P` governs probabilities assigned to events like `X^(-1)[a,b]`. Since `X` takes values in the real line, it should be possible to express such probabilities as function of `[a, b]`, eg. `P_X(A) = P(X^(-1)(A))`. Is this a probability measure?

Let `(Omega_i, F_i)`, `i=1,2` be measurable spaces, and let `T: Omega_1 -> Omega_2` be a `(: F_1, F_2 :)` measurable function. Then for any measure `mu` on `(Omega_1, Omega_2)`, `mu_T = mu T^(-1) (A) = mu(T^(-1) (A))` is a measure on `F_2`.

For a rv `X` defined on `(Omega, F, P)` the probability distribution of X is the induced measure of `X` under `P` on `RR`.

The cumulative distribution function (CDF) = `F_X(x) = P_X(-oo, x] = P{omega: X(omega) <= x}`, and has the following properties: (any function with these properties is called a cdf)

`x_1 < x_2 => F(x_1) <= F(x_2)`
right continuous
`F(oo) = 1`, `F(-oo) = 0`

Given any cdf it is possible to construct a probability space and a random variable st. it is the cdf of X. (eg. `(RR, B(RR), mu_F)`, `f(x) = x`).

A random variable is discrete if there exists a countable set `A sub RR` st `P(X in A) = 1`, or continuous if `P(X=x)=0 AA x in RR`. The decomposition theorem states that any cdf can be written as the weighted sum of a continuous and a discrete cdf.

Generalisation to higher dimensions is straightforward, eg. for `k=2`

`AA x = (x_1, x_2), y = (y_1, y_2) => F(y_1, y_2) - F(x_1, y_2) - F(y_1, x_2) + F(x_1, x_2) >= 0` (monotone)
right continuous
`F(oo, oo) = 1`, `F(-oo, a) = F(a, -oo) = 0`
`F(a, oo) = P(X_1 <= a)`

Integration

Let `(Omega, F, mu)` be a measure space, and `f: Omega -> R` be a measurable function.

A function `f: Omega -> barRR` is called simple if there exists a finite set `{c_1, c_2, ..., c_k} in barRR` and sets `A_1, A_2, ..., A_k in F, k in NN` st. `f = sum c_i I_(A_i)`. The intergal of a simple function `int_Omega f dmu = sum c_i mu(A_i)`. Note: `0 <= int f dmu <= oo`

If `f` and `g` are two simple non-negative functions then:

linearity: for `alpha,beta >= 0` `int (alpha*f + beta*g) dmu = alpha int f dmu + beta int g dmu`
monotonicity: if `f >= g "a.e." (mu)` then `int f dmu >= int g dmu`
if `f = g "a.e." (mu)` then `int f dmu = int g dmu`

Can extend this definition to any non-negative function by discretising and taking limits. (Need to confirm that all admissable sequences take the same form) ie. `lim_(n->oo) int f_n dmu = int lim_(n->oo) f_n dmu = int f mu`

The monotone convergence theorem proves that `int f dmu = lim_(n->oo) int f_n dmu`, for any increasing sequence of `f_n`. Corollaries:

`int sum h_n dmu = sum int h_n dmu`
`nu(A) = int f I_A dmu` is a measure.

Can extend to any function by defining `f = f^+ - f^-` (where `f^+ = max(0, f)`, `f^- = max(0, -f)`), or more generally `f = f_1 + f_2` for some `f_1, f_2 >= 0`. Said to be integrable if `int |f| < oo`.

Theory	Hypotheses	Results
MCT	`f_n >= 0` `f_n uarr f "ae" (mu)`	`int f_n uArr int f`
Fatou	`f_n >= 0`	`ullim int f_n >= int ullim f_n`
LDCT	`\|f_n\| <=g` `g in L^1(mu)` `f_n -> f`	`f in L^1(mu)` `int f_n -> int f` `int\|f_n - f\| -> 0`
BCT	`mu(Omega) < oo` `\|f_n\| <= K` `f_n -> f`	`f in L^1(mu)` `int \|f_n - f\| -> 0`

`L^p` spaces

`L^p(Omega, F, P) = {f: int |f|^p dmu < oo}`
`L^oo(Omega, F, P) = {f: mu( {|f| >} ) = 0 "for some" K in R^+)}`

If `f` and `g` are integrable (ie. `in L^1(Omega, F, P)`) then:

linearity: for `alpha,beta in RR` `int (alpha*f + beta*g) dmu = alpha int f dmu + beta int g dmu`
monotonicity: if `f >= g "a.e." (mu)` then `int f dmu >= int g dmu`
if `f = g "a.e." (mu)` then `int f dmu = int g dmu`

Riemann and Lebesgue integrals

Let `f` be a real value function bounded on a bounded interval. Let `P = {x_0, x_1, ..., x_n}` be a finite partition of `[a,b]`, and `Delta = Delta(P) = max{(x_1 - x_0), ...}` be the diameter of `P`. Let `M_i = spr{f(x): x_i <= x <= x_(i+1)}` and `m_i = inf{f(x): x_i <= x <= x_(i+1)}`.

The upper and lower Riemann sums of `f` wrt `P` are defined as:

`U(f, P) = sum M_i * (x_(x+1) - x_i)`
`L(f, P) = sum m_i * (x_(x+1) - x_i)`

The upper and lower Rieman integrals are defined as:

`bar int f = inf_(P in sfP) U(f, P)`
`ul int f = spr_(P in sfP) U(f, P)`

Where `sfP` is the set of all partitions.

`f` is said to be Riemann-integrable if `bar int f = ul int f`. `f` is a bounded function on a bounded interval. `f` is Riemann integrable on `[a,b]` if `f` is continuous on `[a,b]` and `f` is Lebesgue integrable and the two integrals are equal.

More on convergence

Name	Symbol	Definition
Pointwise	`f_n -> f`	`lim_(n->oo) f_n(omega) = f(omega) AA omega in Omega`
Almost everywhere	`f_n -> f "ae" mu`	`lim_(n->oo) f_n(omega) = f(omega) AA omega in B^c, mu(B) = 0`
In measure	`f_n stackrel m -> f`	`lim_(n->oo) mu{\|f_n - f\| > epsi} = 0`
In `L^p`	`f_n stackrel (L^p) -> f`	`lim_(n->oo) int \|f_n - f\|^p = 0` or `{:\|\| f_n - f\|\|:}_p -> 0` where `{:\|\| g \|\|:}_p = (int \|g\|^p dmu)^(1/p)`
Uniformly		`lim_(n->oo) spr_{omega in Omega} {\|f_n(omega) - f(omega)\|} = 0`
Nearly (almost) uniformly		`AA epsilon > 0 EE A in F "st" mu(A) lt epsilon` and on `A^c` f converges uniformly

If `mu(Omega) < oo` then convergence almost everywhere implies convergence in measure
If `f_n` converges in measure, then there is some subsequence that converges almost everywhere
If `f_n` converges in `L^p` `f_n` converges in measure
If `f_n >= 0, f_n in L^1, int f_n -> int f` then `f_n -> f "ae"` or `f_n stackrel m -> f` implies convergence in `L^p`

Uniform integrability

Let `a_f(t) = int_{|f| > t} |f|`. If `f in L^1` then `a_lambda(t) -> 0`, that is `AA epsilon > 0 EE t_epsilon "st" t >= t_epsilon => a_f(t)`. A collection of functions `{f_lambda : lambda in Lambda}` is uniformly integrable if `AA epsilon > 0 EE t_epsi "st" t > t_epsi => spr_(lambda in Lambda) a_(f_lambda)(t) <= epsilon`.

If `|Lambda| < oo` then `{f_lambda: lambda in Lambda}` is UI
If `K = spr{ int |f_lambda|^(1+epsilon) dmu: lambda in Lambda} < oo` for some `epsilon > 0` then `{f_lambda: lambda in Lambda}` is UI
If `|f_lambda| <= g "ae" (mu)` and `g in L^1(mu)` then `{f_lambda: lambda in Lambda}` is UI
If `{f_lambda: lambda in Lambda}` and `{g_gamma: gamma in Gamma}` are UI, then so it `{f_lambda + g_gamma: lambda in Lambda, gamma in Gamma}` is UI
If `{f_lambda: lambda in Lambda}` is UI and `mu(Omega) < oo` then `spr_{lambda in Lambda} int |f_lambda| dmu < oo`

If `mu(Omega) < oo` and let `{f_n : n >= 1} sub L^1` st `f_n -> f "ae" (mu)`, f measurable. If `{f_n}_(n>=1)` is UI then `lim_(n->oo) int |f_n - f| dmu = 0`

Ergorov’s theorem: Let `f_n -> f "ae" (mu)`, `mu(Omega) < oo`, then `f_n -> f` almost uniformly.