Stratified sampling

Can improve on SRS given more information.

Basic idea is to split population into strata:

Advantages:

General theory

Population mean = weighted average of stratum means.

\[ \bar{y} = \sum_l W_l \bar{y}_l \]

Population SE(standard error) = weighted average of statrum SE’s

\[ Var[\bar{y}_st] = \sum_l W_l^2 Var[\bar{y}_l] \]

Stratified sampling does well when $\sigma^2_l$ are small (values in each stratum are similar).

Allocation

\[n_l \propto \frac{W_n \sigma_n}{\sqrt{cost_n}} \]

Post-stratification

How many strata?

Goal of stratification is to reduce within stratum variation, so we want more strata, but this comes with increased complexity and cost. Additionally var $\propto \sigma^2_l / n_l$ so decreasing strata sizes will increase variance.

If number of strata, $L$ , chosen, then choose stratum boundaries to minimise $\sum W_l \sigma_l$ (need preliminary smaple of distribution of $y$ ). Approximate rule is to choose $y_h$ so that $W_h(y_h - y_{h-1})$ is constant. With optimal choice of boundaries var $\propto 1/L^2$ . Increasing L too far – within stratum variance dominates.

Conclusion: $L$ often small, < 6.