UNSD Document

Statistical units

A first demand in this respect is that the statistical unit is defined in such a way that the respondent can recognise himself as a real transactor in the economy rather than an artificial construction. This can be realised by stressing the requirements of 'autonomy' and 'data availability' in operational definitions of statistical units, while accepting a certain degree of 'heterogeneity'. There will, however, always remain situations where the respondent is not able or willing to report data on the level of the envisaged statistical unit. Then the 'reporting unit' will deviate from the statistical unit. In case of a 1:n relationship the statistician will have to allocate the data reported, while in the inverted case consolidation is necessary.

Variables: concepts and definitions

Whenever statistical concepts deviate from accounting concepts, this should be considered to be a problem for the statistician, not for the respondent. This means that questionnaires should be designed in such a way that they can be completed directly from bookkeeping records, and that it is, again, up to the statistician to bridge the gap between questionnaire concepts and statistical output concepts.

Variables: number and detail

As with sample sizes, it may be justified to alternate the contents of questionnaires. Once the 'maximum' questionnaire has been designed, one should seriously consider whether it is necessary to apply it 'full size', for each respondent and each reporting period. For smaller businesses, asking less detail should be considered. Whether it is wise to vary number and detail of variables over time, e.g. by asking detail only at intervals of three months, depends on the situation. In certain situations, in particular when respondents have taken special measures to generate the requested data automatically from their computer systems, it is better to maintain a constant rhythm.

Couleur locale

When a survey covers distinct SIC-areas, accounting practices and vocabulary may differ among industry branches. This may ask for different questionnaires for different groups of respondents.

Introduction and presentation

Response burden is not only determined by the time it takes to answer questions, but also by the time and effort needed to read and understand questions, introductory letters and explanatory notes. These should not only be brief and clear but, above all, applicable. Not or hardly applicable instructions and extensive, irrelevant explanations are very irritating. This is another justification for tailoring questionnaires to homogeneous groups of respondents. SIC-code is an important parameter for identifying such groups, as well as data reported in related or previous surveys. Pre-printing of product specifications is an example of the use of the latter.

Two stage sampling

An effective way to avoid overkill is two stage sampling. This requires the conduct of two consecutive surveys. First a few simple questions are asked to a large number of enterprises, e.g. whether or not a business carries out Research and development activities, and if so, whether the expenditure exceeds a certain threshold. The findings of the first stage are then used to narrow down the target population for the second stage. Besides, it is possible that the first stage generates data which can be used as estimators in the second stage, enabling to reduce the sample size.

Feedback of results

Providing respondents with some statistical results of the survey is a measure which, of course, only affects perception of burden. The information to be given should be relevant and readable and must therefore be carefully edited. It makes sense to test whether the statistics are of real interest to the businessman. If not, the effect on his perception may well be negative, instead of positive.