1. Methodology
  2. Spatial microsimulation
Download icon

Comparing two methods of reweighting a survey file to small area data

  1. Robert Tanton  Is a corresponding author
  2. Paul Williamson  Is a corresponding author
  3. Ann Harding  Is a corresponding author
  1. Institute for Governance and Policy Analysis, University of Canberra, Australia
  2. University of Liverpool, United Kingdom
Research article
Cite this article as: R. Tanton, P. Williamson, A. Harding; 2014; Comparing two methods of reweighting a survey file to small area data; International Journal of Microsimulation; 7(1); 76-99. doi: 10.34196/ijm.00094
5 figures and 8 tables

Figures

SLA level housing stress estimates (%).

Note: GREGWT (GREGWT convergent SLAs).

SLA level housing stress estimates (%).

Note: CO-Min Proportion (GREGWT convergent SLAs).

SLA level housing stress estimates (%): CO-Min Absolute.

Note: (GREGWT convergent SLAs).

Distribution of GREGWT weights for NSW.
Distribution of GREGWT weights for the Australian Capital Territory.

Tables

Table 1
Benchmarks used for creating small-area weights.
Census table Type Dimensiona Fully specifiedb Benchmarks (no.)
Age by sex by labour force status Person Multi Yes 32
Residents in different types of non-private dwelling Person Single Yes 8
Household Type Household Single No 1
Household size – number usual residents Household Single Yes 7
Dwelling tenure by weekly household rent Household Multi No 7
Dwelling tenure by household type Household Multi Yes 15
Dwelling tenure by weekly household income Household Multi No 16
Monthly household mortgage by weekly household income Household Multi Yes 12
Weekly household rental by weekly household income Household Multi Yes 20
Dwelling structure by household family composition Household Multi No 12
Total number of benchmark tabulations 10
Total number of benchmarks 130
  1. a

    Multi-dimensional means cross-tabulations of variables.

  2. b

    Not fully specified means that one or more of the cells in a benchmark tabulation were not used for weight production. For example, for the benchmark table of ‘Household Type’, the count of ‘Private households’ was extracted for use as a benchmark, whilst the count of ‘Non-private dwellings’ was excluded from the reweighting process.

Table 2
Comparison of methods in summary.
GREGWT CO
Approach National household weights from a national survey dataset are reweighted to household weights for SLAs by constraining to small-area census counts Selection of a combination of households from a national survey microdata set that best fit small-area census counts
Weights In fractional numbers In integer numbers
Preparation of census data Needs to address re-allocation of ‘not-stated’ and ‘not applicable’ counts Needs to address re-allocation of ‘not-stated’ and ‘not applicable’ counts
Conflicting benchmark counts due to statistical disclosure measures Causes non-convergence because no set of weights can be found that simultaneously satisfies all benchmarks Seeks to minimise the difference between the final weights and the target benchmarks which typically results in weights that matchthe average of any discrepant benchmarks
Optimisation strategy Algorithm reaches an optimised solution when residual (i.e. difference between an synthetic estimate and the benchmark count) approaches zero Minimise absolute or proportional error
‘Convergent’ & ‘non-convergent’ SLAs In some cases no convergent solution may be found; Average Household Absolute Sum of Residuals is >1 provides a proxy indicator for this non-convergence. No convergence issues, although final ‘optimal’ estimates may still fail to fit all user-supplied benchmarks
  1. Source: NATSEM (GREGWT) and Williamson (CO).

Table 3
Summary measures of goodness of fit.
Measure Description
Overall Total Absolute Error (OTAE) Absolute Sum of Residuals summed across all benchmark counts
Overall Total Absolute Error per household (OTAE/HH) Absolute sum of residuals per household across all benchmark counts
Overall Total Absolute Proportional Error (OTAPE) Absolute difference between benchmark counts when expressed as fraction of the table total
Overall relative sum of Z-square scores (ORSumZ2) For each benchmark table, the Z-score of each benchmark count squared, and summed for the table; then divide by chi-square critical value for table (--> RSumZ2), then sum across all tables (--> ORSumZ2). For a given table, RSumZ2 > 1 shows it is not fitting.
Table 4
Results for constrained variables, Australian Capital Territory.

(GREGWT ‘convergent’ SLAs only)

Measure GREGWT CO (Min Proportion) CO (Min Absolute)
OTAE 139.6 133.4 92.2
OTAE/HH 0.1 0.1 0.1
OTAPE 0.4 0.2 0.2
ORSumZ2 48.4 0.5 27.8
  1. Note: Lower numbers signify greater accuracy.

Table 5
Results for constrained variables, New South Wales.

(GREGWT ‘convergent’ SLAs only)

Measure GREGWT CO (Min Proportion) CO (Min Absolute)
OTAE 602.9 483.1 979.3
OTAE/HH 0.1 0.1 0.1
OTAPE 0.2 0.1 0.2
ORSumZ2 60.5 1.9 29.2
  1. Note: Lower numbers signify greater accuracy.

Table 6
Results for housing stress, Australian Capital Territory and New South Wales.

(GREGWT ‘convergent’ SLAs only)

Number Unaffordable
State ABS GREGWT CO (Min Proportion) CO (Min Absolute)
Australian Capital Territory 5,526 6,147 5,924 5,821
New South Wales 169,823 194,394 191,720 189,269
Total 175,349 200,541 197,644 195,090
% Unaffordable
Australian Capital Territory 5.9 5.9 5.7 5.6
New South Wales 9.1 9.2 9.1 8.9
Combined 9.0 9.0 8.9 8.8
Table 7
Size of weights from CO and GREGWT.

(Table legend)

Method Australian Capital Territory New South Wales
0 > 0 – 1a > 1 0 > 0 – 1a > 1
% % % % % %
CO 95 3 1 79 8 13
GREGWT 53 47 1 36 48 16
  1. a

    For > 0 – 1, the CO value is 1.

Table 8
Weights for GREGWT and CO.

Note: GREGWT convergent SLAs only.

Maximum Average non-zero value
Method New South Wales Australian Capital Territory New South Wales Average Australian Capital Territory Average
CO (Min Proportion) 443 24 3.49 1.45
GREGWT 647 18 1.11 0.15

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Download citations (links to download the citations from this article in formats compatible with various reference manager tools)

Open citations (links to open the citations from this article in various online reference manager services)