Stratified sampling

Can improve on SRS given more information.

Basic idea is to split population into strata:

divide population into non-overlapping groups (strata)
SRS each strata
produce estimate for each strata
overall estimates weighted average of strata estimates

Advantages:

protects against getting a really bad sample
improves precision of estimates
can use difference sampling methods in each stratum
separate estimates in each stratum
some protection against non-response

General theory

Population mean = weighted average of stratum means.

\[ \bar{y} = \sum_l W_l \bar{y}_l \]

Population SE(standard error) = weighted average of statrum SE’s

\[ Var[\bar{y}_st] = \sum_l W_l^2 Var[\bar{y}_l] \]

Stratified sampling does well when $\sigma^2_l$ are small (values in each stratum are similar).

Allocation

\[n_l \propto \frac{W_n \sigma_n}{\sqrt{cost_n}} \]

Post-stratification

eg. can’t identify stratum unit is from before sampling
cannot benefit from sample design
can use correct stratum weight $W_l$
if sample size large, post-stratification will be close to proportional allocation
biggest benefit is overcoming some effects of non-response

How many strata?

Goal of stratification is to reduce within stratum variation, so we want more strata, but this comes with increased complexity and cost. Additionally var $\propto \sigma^2_l / n_l$ so decreasing strata sizes will increase variance.

If number of strata, $L$ , chosen, then choose stratum boundaries to minimise $\sum W_l \sigma_l$ (need preliminary smaple of distribution of $y$ ). Approximate rule is to choose $y_h$ so that $W_h(y_h - y_{h-1})$ is constant. With optimal choice of boundaries var $\propto 1/L^2$ . Increasing L too far – within stratum variance dominates.

Conclusion: $L$ often small, < 6.