What is sampling?
Sampling is used to make inferences about a population from a relatively small number of observations, that are assumed to be representative of the population.
More detailed guidance is contained in the Background
Document (pdf - 192kb)
Why use sampling methods?
In a social researcher's ideal world all data would be collected
by census. However, in practice this approach is not always practical
because of time and cost constraints. As an alternative
sampling methods are used to provide estimates
based on data from a very small percentage of the population.
How does sampling work?
Sampling works because:
- It is not necessary to collect data from all people
in order to generate statistics about that population.
- After a certain sample size, there is no need to collect more
data. The extra data does not improve the accuracy of the estimate
to any great extent.
Sample bias
Population estimates based on survey data will be inaccurate if
the sample is biased. There are a number of reasons why a sample
may be biased:
- Sample bias occurs when the selected sample is systematically
different to the population. The sample must be a fair
representation of the population we are interested in.
- The sample size may be too small to produce
a reliable estimate.
- There may be variability in the population.
If you want your sample to give an estimate that is close to
the population value, you need to take into account how much
variability there is in the variable you are trying to measure.
All else being equal, the greater the variability the larger
the sample size needed.
Sampling frames
The ideal sampling frame is a straightforward list of the elements
you are trying to sample. So, for a population sample, a comprehensive
list of members of the population would be ideal. And for a household
survey a comprehensive list of households would be the ideal.
In practice the ideal sampling frame hardly ever exists. In the
UK, for instance, there is no population register that can be used
for sampling purposes. The closest is the electoral register, but
because inclusion on the electoral register is voluntary large sections
of the population are excluded. This means that the electoral register
tends to give biased samples.
Popular sampling frames
The Postcode Address File (PAF)
The PAF is the Post Office's list of all addresses in the
UK, which receive less than 25 items of post per day. The
list is primarily residential addresses (about 94%), and
the list covers approximately 99% of all residential addresses.
The PAF closely approximates to a sampling frame of households,
which makes it ideal for surveys of households (once non-residential
and unoccupied addresses have been screened out). |
Random Digit Dialling (RDD)
A sample frame can be generated for a telephone survey
using Random Digit Dialling (RDD). RDD involves:
- Selecting 11 digit numbers at random, the first 7 digits
being randomly selected from the published list of prefix
numbers (such as 020 8693 XXXX) and the last four digits
being generated entirely at random.
- The selected numbers are then dialled and numbers not
in use, and business numbers, are screened out to leave
just residential numbers.
|
Sample size
The main decision needed in deciding on a sample
design is sample size. To decide
on this a number of questions need to be decided on:
- What are the key estimates for the study?
- How precise do those estimates need to be?
(i.e. what size of standard error or confidence interval can
be tolerated?)
- Are there key sub-groups for which separate
estimates will be needed?
- Does the survey need to be large enough to detect change
over time between surveys, or differences between
key sub-groups?
Clustering
A 'clustered' sample is defined as a sample that is selected in
two or more hierarchical stages, different 'units' being selected
at each stage, and with multiple sub-units being selected within
higher order units.
A few examples will help to clarify this:
- A sample of children is selected by (a) sampling schools
and then (b) selecting children within schools. This
is a two-stage clustered sample, the clustering being
of children within schools.
- On a general population survey a PAF sample is used
to generate a sample of households. Within each household
up to two adults are selected at random. This is a two-stage
clustered sample, the clustering being of adults within
households.
Note that, had the instruction been to select just one
adult per household, this would not be described as
a clustered sample, because there would no clustering
of the adult sample within a smaller number of households.
|
The most common design for PAF samples is:
- Select a random sample of postcode sectors
- Households are selected
- Households are selected within these postcode sectors
- Individuals might be selected within households
Under this design adults are clustered within households (assuming
more than one adult is selected per household) and households are
clustered within postcode sectors.
Clustering, or multi-stage sampling, is adopted on surveys for
a number of reasons. The two main reasons are:
- Because the sampling frame units cover two or more survey units
and so clustering is the only practical way of selecting a sample
of the units required
- To divide the sample into manageable workloads for interviewers.
The impact of clustering on standard errors
The main objection to clustered samples is that
they tend to give estimates with larger standard errors
than unclustered samples.
The reasoning here is that the more the sample is clustered the
greater the chance of drawing a sample that is extreme. For instance,
imagine a scenario where you are selecting a sample of 1000 people.
Then if you choose to select the 1000 people by selecting just 10
postcode sectors and 100 people per sector, then if you are unlucky
enough to select one or two sectors that are outliers in some sense
then you will get a sample mean that is quite different to the population
mean.
From this example it is hopefully clear that the more the total
sample is spread across clusters the lower the chance of taking
an extreme sample and the lower the standard error. This translates
into: for a fixed sample size, the smaller the sample size per cluster
the smaller the standard error.
Stratification
Stratification essentially means dividing the sampling
frame into groups (strata) before sampling. Stratification
reduces the risk of drawing an extreme sample, unrepresentative
of the population.
A simple example would be to take a sampling frame of, say, business
establishments and then to sort them into size strata before sampling.
The sample would then be described as a sample stratified by size.
|
There are two methods of stratified sampling: proportionate
and disproportionate.
- In a proportionate stratified sample
the sampling frame is divided into strata but the same
sampling fraction is applied per stratum. This means
that each stratum is sampled from in its correct proportion.
- In a disproportionate stratified sample
the sampling fraction differs between strata. This means
that individuals from the strata with the highest sampling
fractions will be over-represented in the sample. Disproportionate
sampling is generally used when there is a need to boost
the sample size within a particular stratum or strata
(e.g. for boosting the number of young people or minority
ethnic groups).
|
Quota sampling
Quota sampling allows substitution of non-respondents with
others willing to respond, but it makes use of stratification
to ensure that the final sample reflects the population on at least
some key variables.
For instance, the quota sample method uses population estimates
of the numbers within each combination of age group, sex, and social
classes to determine the number needed within each of these combinations
for a survey. Interviewers are then given quotas based on these
numbers that they are asked to achieve.
Quota sampling assumes that:
- Those who take part in the survey have the same characteristics,
attitudes, behaviours etc. as those who do not take part.
- It is possible to find variables for the quota that control
for most of the survey variability between individuals.
Quota samples can be biased and for this reason
the UK government has tended to use random sampling even though
quota-sampling methods are cheaper and faster.
Sampling special populations using screening
Some surveys have a particular focus on sub-groups of the general
population. Some examples include:
- The 2003 Health Survey for England which had a focus
on child health and included a boost sample of children
- The 1996 British Crime Survey which included a boost
sample of people from minority ethnic groups
- The Low Income Diet and Nutrition Survey which specifically
includes only low-income households.
The usual approach to selecting the sample in these instances is:
- Select a large PAF based sample
- Carry out a short screening survey at all households (which
might be done on the doorstep if it is very short),
and then,
- Carry out a full interview only in households with the relevant
people.
The success of screening largely depends on our
ability to ask screening questions quickly, and
at the start of interviews.
Survey Weighting
In most surveys it will be the case that some groups are
over-represented in the raw data and others under-represented.
These mis-representations are usually dealt with by weighting the
data.
The idea behind weighting is that:
- Members of sub-groups that are thought to be over- or under-represented
in the survey data are each given a weight
- Over-represented groups are given a weight of less than one
- Under-represented groups are given a weight of greater than
one
- The weight being calculated in such a way that the weighted
frequency of groups matches the population
- All survey estimates are calculated using these weights, so
that averages become weighted averages, and percentages become
weighted percentages, and so on.
Weights for disproportionate sampling are relatively non-controversial,
but weights to adjust for non-response biases are largely dependent
upon judgement.
The Government Statistical Service has established some principles
for weighting survey data. These are:
- Non-equal probabilities of selection (including disproportionate
stratification) should be dealt with by applying weights proportional
to the inverse of the probability of selection
- Non-response should be dealt with by weighting survey data
to published distributions by age, sex and region.
Further reading
- Barnett, V. (2002). Sample Survey Principles & Methods,
Arnold (3rd Edition).
- Barton, J. (1996). Selecting Stratifiers for the Family
Expenditure Survey (FES), in Survey Methodology Bulletin 39,
21-26.
- Cochran, W. G. (1977). Sampling Techniques, Wiley (3rd
edition).
- Elliot, D. (1991). Weighting for Non-Response: A survey
researcher's handbook, OPCS.
- Greenfield, T. (1996). Research Methods: Guidance for Postgraduates,
ed. Arnold.
- Kalton, G. (1983). Introduction to Survey Sampling,
Sage.
- Kalton, G. (1983). Compensating for Missing Survey Data,
Institute for Social Research, University of Michigan.
- Kish, L. (1992). Weighting for Unequal Pi, Journal of Official
Statistics, 8, 183-200.
- Kish, L. (1965). Survey Sampling, Wiley.
- Lynn, P. and Lievesley, D. (1991). Drawing General Population
Samples in Great Britain, SCPR.
- Lynn, P. and Taylor, B. (1995). On the bias and variance
of samples of individuals: a comparison of the electoral registers
and postcode address file as sampling frames, The Statistician,
44,173-194.
- Moser, C. A. and Kalton, G. (1971). Survey Methods in Social
Investigation, Gower (2nd edition 1971)
- Stuart, A. (1984). The Ideas of Sampling, Griffin.
> Magenta Book contents