Direct |
Horvitz-Thompson estimator |
|
Only based on real sample units. |
Easy to calculate and unbiased for large samples. |
It is unreliable and can not use auxiliary data. |
Only if sample size is large enough. |
Generalised regression or GREG estimator |
|
Based on real data and weighted least square (WLS) estimate of regression coefficient. |
Can use auxiliary data at small area level, and approximately design and model unbiased. |
It could be negative in some cases and not a consistent estimator due to high residuals. |
When sample size is large and reliable auxiliary data are available at small area level. |
Modified direct estimator |
|
Based on real sample, auxiliary data and WLS estimate of regression. |
Design unbiased and uses overall aggregated data for coefficient estimation. |
Borrows strength from the overall data but can not increase effective sample. |
When the overall sample size is large and reliable. |
Indirect modelling |
Implicit models |
Synthetic estimator |
|
Requires actual sample and auxiliary data for a large scale domain. |
Straightforward formula and very easy and inexpensive to calculate. |
All small areas are similar to large area assumption is not tenable & estimate is biased. |
Used in various areas in government and social statistics. |
Composite estimator |
|
Based on direct and synthetic estimators. |
Have choices of balancing weight at small areas. |
Biased estimator; depends on the chosen weight. |
If direct and synthetic estimates are possible. |
Demographic estimator |
|
Rooted in data from census, and with time dependent variable. |
Easy to estimate, and the underlying theory is simple and straightforward. |
Only used for population estimates and affected by miscounts in census data. |
Used to find birth and death rates and various population estimates. |
Explicit models2 |
Area level |
|
Based on a two stages model and known as the Fay-Herriot model. |
Can use area specific auxiliary data and direct estimator. |
Assumptions of normality with known variance may untenable at small sample. |
Various areas in statistics fitting with assumptions of the model. |
Unit level |
|
Based on unit level auxiliary data and a nested error model. |
Useful for continuous value variables, two stage and multivariate data. |
Validating is quite complex and unreliable. |
Used successfully in many areas of agricultural statistics. |
General linear mixed model |
|
A general model, which encompasses all other small area models. |
Can allow correlation between small areas, AR(1) and time series data. |
Calculation and structure of matrix for area random effects are very complex. |
In all areas of statistics where data are useful for the general model. |