Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »


  1. Chapter 18 describes the concept and structure of metadata (or data that define and describe other data and processes) See International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC), ISO/IEC FDIS 11179-1 "Information technology - Metadata registries - Part1: Framework", March 2004. Available from https://www.iso.org/obp/ui/#iso:std:iso-iec:11179:-1:ed-2:v1:en. for use within the statistical framework for describing the international supply of services, as well as good metadata compilation practices. It underscores that metadata are relevant for the correct understanding of the content, coverage and limitations of the data, and should guide users on their correct interpretation. The chapter contains the following sections: a summary of good practices (section A); an overview of the basic concepts, definitions and role of the Statistical Data and Metadata Exchange (SDMX) (section B); an indicative list of metadata items (section C); metadata standards of international and regional organizations (section D); and country practices (section E).


A.Summary of good practices

  1. The present Guide recognizes that metadata (a) play a crucial role in the statistical production process, as they enable and facilitate sharing, querying, understanding and using data over the different stages of their collection, compilation and dissemination, and at their various levels of aggregation and (b) are indispensable for assessment of the quality of data, as their availability and wide dissemination constitute a basis for the correct interpretation of publicly available statistics and their effective use.


  1. It is advised that compilers take into account that statistical metadata cover the following items: Guidelines on Integrated Economic Statistics, para. 5.91. statistical description, unit of reference, reference period, institutional mandate, confidentiality, release policy, frequency of dissemination, dissemination format, accessibility of documentation, quality management, relevance, data accuracy and reliability, timeliness, comparability, coherence, cost and response burden, data revision and statistical processing.


  1. The use of standard terminology for metadata across the various statistical domains and use of the SDMX information model is also advised, as they will facilitate further integration of statistics, the standardized sharing of data and the international comparability of data.


  1. It is further advised that compilers design their metadata systems in the most efficient way so that metadata items can be conveniently retrieved from the relevant databases, be used in the generation of the intermediate and final data sets or in the production of other metadata, and be updated and synchronized. In that context, compilers are encouraged to design and use a warehousing system for data and metadata to integrate the dissemination of data and metadata with the collection and processing components of the statistical production process.


  1. With regard to compiling metadata, compilers are encouraged to follow standardized metadata concepts, make use of the metadata developed in related statistical domains and already being applied in their national statistical system, define layers of metadata, establish metadata registries, incorporate structural metadata items into the data processing as early as possible, establish clear links between data and metadata and compile reference metadata.


  1. In the case of countries with less developed statistical systems, it is good practice to begin by setting up an exhaustive, consistent and detailed repository (possibly in the form of a metadata registry) with both structural and reference metadata, adopting, as much as possible, metadata concepts that are standardized across all statistical domains, both nationally and internationally. The present Guide strongly advises that the next immediate priority be to grant equal, easy, extensive and timely access to metadata to all user groups, including the general public.


  1. The structural metadata items promoted by the present Guide are those defined within the framework of the Balance of Payments Data Structure Definition (BOP-DSD) defined by the Manual on Statistics of International Trade in Services (MSITS 2010) and the present Guide for FATS and for additional indicators on the international supply of services. Metadata items should cover both monetary and non-monetary (quantitative) data items whose compilation is encouraged by the present Guide.


  1. The metadata standards of international organizations should be carefully considered by compilers, both to improve their metadata collection and compilation and ensure better compliance with their international and regional data and metadata reporting obligations.


  1. It is good practice for any deviations from international standards, as well as the use of estimations and modelling to compile certain data series, to be clearly documented in metadata.


B. Metadata: basic concepts and definitions and the role of the Statistical Data and Metadata Exchange

  1. Definition of metadata Metadata are data that define and describe other data and processes. See International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC), ISO/IEC FDIS 11179-1 "Information technology - Metadata registries - Part1: Framework", March 2004. Available from https://www.iso.org/obp/ui/#iso:std:iso-iec:11179:-1:ed-2:v1:en. Data become metadata when they are used to describe other data. See SDMX Content-Oriented Guidelines (2009) for details.
  2. Scope of statistical metadata Statistical metadata, according to the Statistical Commission of the United Nations, describe various elements of the statistical processes, including collection, processing and production of statistical data, and indicate the data sources and tools that are instrumental in statistical production, such as statistical standards and classifications, business registers and frames, statistical methods, procedures and software. Section C provides an indicative list of structural and reference metadata items relevant in the context of statistics on the international supply of services.
  3. Institutional arrangements for metadata compilation To reduce the burden associated with projects on metadata for statistics compiled within the framework for describing the international supply of services, it is good practice for compilers to cooperate closely with the specific units within the national statistical system responsible for ensuring that metadata is produced, and for the metadata to adhere to a standard format and be properly maintained and updated.


  1. The Statistical Commission recommends the use of standard terminology for metadata across the various statistical domains to facilitate the international comparison of data. Guidelines on Integrated Economic Statistics, p. 40. The Commission is also increasingly encouraging countries to treat metadata compilation and dissemination as integral parts of the statistical process in any statistical domain, and promotes the standardization of the compilation and dissemination of metadata. The important role in this respect was played by the ECE publication Terminology on Statistical Metadata, Conference of European Statisticians Statistical Standards and Studies, No. 53, (United Nations publication, Sales No. E.00.II.E.21).
  2. The way forward: metadata warehousing Statistical agencies have traditionally developed separate databases for each statistical output. While that practice may simplify development processes, it can hinder the successful integration of statistics, especially if there is no effort to standardize variable definitions, labels and formats. Use of a centralized data warehousing system for data and metadata can make creating, maintaining and accessing metadata more efficient and can contribute to the integration of economic statistics. The process is being facilitated as better information and communications technology tools become available. See chapter 21 for more information on the use of information and communications technology (ICT) in the statistical process.
  3. With well-designed data warehouses, the dissemination of data and metadata becomes integrated with the collection and processing components of the statistical production process. A data warehouse should establish a simple and efficient process for accessing data to provide the following:

(a) Comprehensive metadata to facilitate understanding and analysis;
(b) Consistent and coherent long-term time series;
(c) Reliable information about the availability of data;
(d) Information about the availability of updated versions of published series;
(e) Contact details for the people who can provide more information about a statistical output.

  1. The implementation of a more comprehensive metadata system is an important prerequisite for developing an integrated questionnaire in the statistical system. The metadata will eventually provide the necessary coherence among the various estimates and data collection tools involved in the production of statistical information. For sophisticated users, metadata are not only relevant for concepts related to units, variables and classifications, they are also relevant for the quality of data.

B.1 Role of the Statistical Data and Metadata Exchange

  1. The SDMX project was developed by an international consortium Bank for International Settlements, Monetary and Economic Development,Guidelines for Reporting the BIS International Banking Statistics (2013), para. 5.123. for use in data and metadata management. The SDMX information model is applicable for much of the information stored and processed within statistical organizations and its use by such organizations is promoted by the Guidelines on Integrated Economic Statistics of the United Nations. Ibid.
  2. The use of the standardized information management model is very important for compilers of statistics on the international supply of services, as various agencies participate in data collection and compilation at different stages of the statistical production process, and the establishment of a standardized data sharing among them results in additional efficiency.
  3. The development of global DSD, which define the structure for the exchange of data (see section C), by the SDMX consortium and international organizations, enables the broader adoption of the SDMX standard for data collection, exchange and dissemination.

Box 18.1 Statistical Data and Metadata Exchange
The Statistical Data and Metadata Exchange (SDMX) is an international cooperative initiative aimed at developing standards and employing more efficient processes for the exchange and sharing of statistical data and metadata among international organizations and their member countries.

  1. The rationale of SDMX is the standardization of statistical data and metadata access and exchange. With the ever increasing ease of use of the Internet, the electronic exchange and sharing of data are becoming easier, more frequent and important. This heightens the need for the development of a set of common standards for the exchange and sharing of statistical data and metadata, and for making processes more efficient. As statistical data exchange takes place continuously, the gains to be realized from adopting common standards are considerable, for both data providers and users.


  1. The objective is to establish a set of commonly recognized standards, adhered to by all players, making it possible not only to have easy access to statistical data, wherever those data may be, but also to metadata that make the statistics more meaningful and usable. The standards are envisaged to help national organizations fulfil their responsibilities towards users and partners, including international organizations, more efficiently. Among other things, they are seen as facilitating the use of Internet-accessible databases that enable the retrieval of data as soon as they are released. Several quality dimensions can also be improved through the use of SDMX standards, such as timeliness, accessibility, interpretability, coherence and cost effectiveness.
    C.Indicative lists of metadata items
  2. The present section contains the indicative lists of structural and reference metadata items that countries are encouraged to compile as part of their statistics production within the framework for describing the international supply of services. (The issues specifically related to the dissemination of metadata are discussed in chapter 20.) The list of structural metadata provided below stems from the BOP-DSD. The reference metadata are based on the recommendations of MSITS 2010.
  3. In the context of SDMX, the term structural metadata refers to identifiers and descriptors of data, such as concepts and attributes of variables, which are essential for discovering, organizing, retrieving and processing statistical data sets. They can be thought of as the "labels" that need to be associated with each data item in order for it to have any meaning. Reference metadata are of a more general nature and may refer to specific statistical data, to entire data collections or even to the institution that provides the data.


C.1.Structural metadata: Balance of Payments-Data Structure Definition

  1. Structural metadata contain the list of concepts and attributes of variables necessary to codify the reporting requirements of four international agencies, the European Central Bank (ECB), Eurostat, the International Monetary Fund (IMF) and the Organization for Economic Cooperation and Development (OECD), The four agencies are the European Central Bank, Eurostat, IMF and OECD. for data collection exercises of external sector statistics, including international trade in services. At the time of writing, there is not yet a data structure definition (DSD) available neither for FATS nor for non-monetary indicators of modes of supply. A FATS DSD would include specific dimensions such as FATS characteristics, e.g., number of enterprises, number of persons employed, etc. and resident economic activity vs. non-resident economic activity. Items specific to non-monetary indicators on modes of services supply would include direction of trips: inbound, outbound; and country of origin or destination. The Balance of Payments-Data Structure Definition (BOP-DSD) structure is based on the methodology defined in the IMF Balance of Payments and International Investment Position Manual, 6th edition (BPM6), MSITS 2010, and the OECD Benchmark Definition of Foreign Direct Investment, 4th edition (BD4). In order to code trade in services data, the Extended Balance of Payments Services Classification (EBOPS), including the complementary groupings, is included in the "international account item" dimension of the DSD.
  2. The BOP-DSD, presented on the SDMX website, See http://sdmx.org/?page_id=1747. includes 16 concepts and 13 attributes (see box 18.2). Concepts are used to uniquely identify a time series and, when joined together, provide the series code or "time series keys," which are the unique identifiers for a time series. When defining a time series key using SDMX, a valid code must be assigned to each concept of the DSD. Attributes are used to further describe the data.
  3. When coding detailed trade in services by partner country statistics, a number of BOP-DSD concepts are fixed, e.g., the reference and counterpart sectors are defined as the total economy (S1) when the data refer to total trade between related and unrelated parties. In order to eliminate the possibility of having multiple ways of coding the EBOPS 2010 complementary grouping "total services transactions between related entities", that item is not coded in the international accounts dimension of the DSD, but is coded in the annual international trade in services dataflow as follows (example on the credit side): A.N.#.%.S1.S1A.T.C.S._Z._Z._Z.$._T._X.N. However, trade in services between related parties can also be coded by using code S1A "affiliates", in the counterpart sector dimension, whereas S1B should be used for unaffiliated parties.


  1. Other concepts of the DSD are not fixed, such as the "counterpart area", which is used to identify the territory of the non-resident entity of individual time series. The country code list in the counterpart area follows the International Organization for Standardization (ISO) classification and is a "cross-domain" code list. The codes used for various regional groupings were harmonized across international agencies that use the BOP-DSD, wherever possible.



Box 18.2 List of concepts and attributes in the Balance of Payments-Data Structure Definitions
The second column provides information on the coding that should be used in the context of an annual trade in service data submission.

 

Frequency

The code for the annual periodicity is "A".

Adjustment indicator

The code for no adjustment is "N".

Reference country or area

For a reference country, an ISO 3166 code should be used.

Counterpart area

For the partner country, an ISO 3166 code should be used.

Reference sector

The code for total economy, which is used for ITS, in all cases is "S1".

Counterpart sector

Code "S1" for total economy, "S1A" for related enterprises, "S1B" for unrelated enterprises (see reference para. 18.38).

Flows and stocks indicators

The code for transactions, is used in all cases for TIS, is "T".

Accounting entries

Credit "C", debit "D" or net "N".

International accounts item

"S" for total services, "SC" for transportation, etc.

Functional category

Identifies functional categories applicable to financial accounts. It is not applicable for trade in services. Code: "Z".

Instrument and assets classification

Identifies the type of financial instrument reported in the external sector time series. It is not applicable for trade in services. Code: "_Z".

Maturity

Identifies the types of maturity of the financial instrument of the external sector statistics time series. It is not applicable for trade in services. Code: "_Z"

Unit of measure

Refers to a currency unit.

Currency of denomination

Identifies the currency of denomination of the financial instrument. For ITS, a constant "_T" is applied.

Valuation

Identifies the method of valuation for selected transactions and positions data. For ITS, this is coded "unspecified _X".

Compilation methodology

Distinguishes between time series compiled according to the methodology applied to national statistics in opposition to similar time series that follow the specific methodology applied for economic or currency union statistics. For ITS, it is coded as national "N".

The BOP-DSD also uses the following 13 attributes:
1. Time format
2. Observation status
3. Confidentiality status
4. Pre-break value
5. Comments to the observation value
6. Detailed description (title complement)
7. Short title
8. Unit multiplier
9. Decimals
10. Time period collection
11. Reference period detail
12. Compiling organization
13. Underlying compilation

 


C.2Example of a trade in services time series key, coded according to the Balance of Payments-Data Structure Definition

  1. The times series key (A.N.US.FR.S1.S1.T.C.SC._Z._Z._Z.USD._T._X.N) provides an example of codification for a trade in services series. That times series key stands for a series in annual frequency, "A", with no adjustment indicator, "N". The reference country is the United States, "US", the counterpart area is France, "FR", the reference sector is the total economy, "S1" and the counterpart sector is the total economy, "S1". The flows and stocks indicator dimension indicates a transaction, "T", the accounting entry dimension indicates that this is a credit entry, "C", the international accounts item specifies that the series relate to transport services, "SC", the functional category is not applicable, "Z", the instrument and assets classification is not applicable, "Z", and the maturity is not applicable, "Z". The unit of measure is the United States dollar, "USD", and the currency of denomination is set to "all currencies of denomination" ,_T. The valuation is unspecified, _X, and the compilation methodology is National, "N".


  1. As noted above, attributes are used to qualify observations further, that is, they provide information on the "confidentiality" status or the "compiling organization". Attributes do not contribute to the identification of a time series, as that is already done by using the dimensions.

C.3Reference metadata

  1. The following items are typically part of the reference metadata associated with statistics on the international supply of services:

(a) Legal framework and institutional arrangements: references to relevant laws and regulations, the role of all institutions involved in compilation and the description of the coordination of the dissemination of statistics and data-sharing agreements among those institutions, either distinctly or as part of broader statistics (e.g., BOP and other external sectors statistics);
(b) Underlying concepts and definitions: definition of residence, non-residence, residence of units, as applicable, definition of statistical value, scope of statistics on the international supply of services and their relationship to national accounts and international merchandise trade statistics, distinction from other international transactions and classification under relevant services item according to BPM6 and EBOPS 2010 and any deviations from international standards, if any, the ultimate controlling institutional unit (UCI) concept, the definition of a foreign affiliate, the definition of direct or indirect control and definitions of statistical and reporting units, etc.;
(c) Description of core data sources: ITRS, enterprise/establishment surveys, surveys of households or persons, administrative records, statistical models, partner country data or a combination of sources, including specific notes on services categories or activities for which particular data collection arrangements or a combination of sources are employed and comments on limitations of source data in terms of coverage, frequency, level of detail, reliability and availability, etc.;
(d) Description of data collection, data compilation methods and data-processing procedures, including the frequency of data collection, the description of specific procedures used for data collection, validation, editing and aggregation, etc., adjustments made to source data, such as imputations, misclassification, adjustments for non-response or under-coverage, adjustments to standard data processing procedures, such as coding, tabulation errors, etc. and indications of departures from international standards, if any;
(e) Estimation methods, such as descriptions of methods for estimating non-reported transactions or transactions falling below customs and/or ITRS thresholds (e.g., cost, insurance and freight (CIF)-free on board (FOB) adjustments for the transportation item);
(f) Dissemination policy, including release and revision schedules, an indication of the presentation format of data, the level of disaggregation and eventual commentaries accompanying the data, etc.;
(g) Additional explanations and footnotes concerning the data as required: explanatory notes on revisions, breaks in series, definitions of confidentiality flags, etc.;
(h) Quality reporting, including the publication of regular quality reports that use the quality dimensions in the Template for a Generic National Quality Assurance Framework (NQAF) See https://unstats.un.org/unsd/dnss/QualityNQAF/nqaf.aspx; and see chapter 19 for more information on the national quality assurance framework (NQAF) and quality reporting and management in general. and include definitions of such quality dimensions as timeliness, accessibility and comparability;
(info) Confidentiality, including descriptions of confidentiality rules and indications of how much data is affected by such rules. See chapter 20 for more information on statistical confidentiality.

  1. Compilation of metadata Metadata are compiled at all stages of the statistics production process. The present Guide encourages countries to use the following good practices, as applicable, in metadata compilation:

(a) Use standardized metadata concepts. In the same way as any data item, metadata items must also be clearly defined. Even though each statistical domain, including statistics of international trade in services, has its specific metadata items, it is good practice to use applicable standardized concepts that are relevant across statistical domains (e.g., by adopting cross-domain concepts from the SDMX framework or the OECD Glossary of Statistical Terms). The aim should be to promote the harmonization of statistical information and their related high-level metadata across various institutions and statistical domains, even if some specific metadata concepts are not applicable or are organized differently in different domains or institutions;
(b) When developing metadata for statistics compiled within the framework for describing the international supply of services, use the metadata developed in the related statistical domains and used in your country. Statistics of international trade in services is a relatively new statistical domain in many countries. However, it is very likely that the metadata policy is already in place in related statistical domains. Compilers are advised to carefully review and use such metadata;
(c) Define layers of metadata. It is good practice to compile metadata in layers of incremental detail and provide clear links between high-level and specific metadata concepts. Such a layered structure of metadata will allow data users and analysts to access necessary metadata items and to minimize the risk of misinterpretation of data content when, for example, compiling data from various data sources. It will also ensure the clear presentation of metadata to diverse groups of users;
(d) Establish metadata registries. A metadata registry is a central repository (usually a database itself) with information that allows for the linking of the detailed definitions (semantics) with the codes (representations) of the metadata items used to describe a particular statistical data set. It is good practice for compilers to put special emphasis on the development, maintenance and dissemination of metadata registries to improve the harmonization, standardization, use, reuse and exchange of their metadata; The Euro-SDMX registry includes harmonized structural metadata, the DSDs designed for the statistical domains and metadata structure definitions, e.g., Euro-SDMX Metadata Structure (ESMS) and other related information.
(e) Confidentiality and access to metadata during the compilation process. As metadata for statistics on the international supply of services might be compiled by various units of the same agency or by units located in different organizations, there might be cases in which metadata describes data that is subject to confidentiality rules. It is good practice, in that context, for confidentiality rules to be set up in a such way that they will allow compilers to obtain non-confidential data aggregates with the same metadata content;
(f) Incorporate structural metadata items into the data processing as early as possible (e.g., as parts of the records structure). That step will facilitate data processing, including the identification of viable options for data aggregation and subsequent presentation. It is advisable for structural metadata to be made an integral part of the database containing statistics compiled within the framework for describing the international supply of services in such a way that it can be extracted together with any data item and used in data processing to obtain meaningful combined data sets;
(g) Establish clear links between data and metadata. As metadata are generated and processed during every step of the data compilation process, there is a strong requirement to ensure that the appropriate metadata retain their links with data. In that connection, it is good practice to implement metadata-driven management in the various stages of the statistical production process; There are several information model specifications that can contribute to achieving that goal (most notably SDMX and the Data Documentation Initiative (DDI), which are designed to perform different functions, but can be used together in the same system, or complement each other in the compilation and exchange of data and metadata.
(h) Compilation of reference metadata. Reference metadata can be presented as a detailed explanatory note describing the scope, coverage, and quality of a data set and can be made available electronically alongside the database or in special publications.

  1. Priorities in metadata management Although, ideally, the management of metadata would take into account all the recommendations highlighted thus far, countries with less developed systems for statistics on the international supply of services should begin by setting up an exhaustive, consistent and detailed repository (possibly in the form of a metadata registry) with both structural and reference metadata, adopting as much as possible standardized cross-domain metadata concepts, both nationally and internationally. The next immediate priority should be granting to the general public easy, extensive and timely access to metadata. In subsequent phases, the system could be improved by the gradual incorporation of more advanced features, such as a layered presentation of metadata and active links between data and metadata.

D.Metadata standards of international and regional organizations

  1. The international availability of appropriate metadata is of great interest to all organizations with global or regional responsibilities. Those organizations have made efforts to standardize their requirements for the scope and structure of the data and metadata that they would like to obtain from countries. Those requirements should be carefully studied by countries, both to improve their metadata collection and compilation, as well as to ensure better compliance with their international and regional data and metadata reporting obligations.


D.1 Special Data Dissemination Standard and General Data Dissemination System metadata of the International Monetary Fund

  1. The Special Data Dissemination Standard (SDDS) and the General Data Dissemination System (GDDS) are part of the IMF data standards initiative aimed at enhancing member countries' data transparency and promoting the development of sound statistical systems. A dedicated electronic bulletin board on the IMF website http://dsbb.imf.org. posts information that SDDS countries provide to IMF on their dissemination practices, and offers direct links to the economic and financial data that countries disseminate under the SDDS, as well as information that GDDS countries make available to IMF on their statistical practices
  2. For both standards, metadata are organized by country and topic. The SDDS metadata are available in two presentations, the current SDDS format and the Data Quality Assessment Framework (DQAF) format, while the GDDS uses the DQAF format alone. Revisions to metadata made regularly and are available on the IMF website. Metadata aspects related to statistics compiled within the framework for describing the international supply of services are embedded in the various quality dimensions of the BOP framework.

D.2The Statistical Data and Metadata Exchange content-oriented guidelines on metadata of Eurostat

  1. On the basis of SDMX information, model DSDs can be created for data on the international supply of services (including FATS). Available from http://ec.europa.eu/eurostat/data/metadata/metadata-structure. The SDMX content-oriented guidelines have been used to define reference metadata for the European Statistical System (ESS). See also the Commission of the European Communities recommendation 2009/498/EC of 23 June 2009 on reference metadata for the European Statistical System (Official Journal of the European Union, L 168,30.6.2009). Table 18.1 lists the main components of the ESS reference metadata. The present Guide advises compilers of statistics on the international supply of services of other regions to take the European Union recommendations into account, as applicable, when setting up the conceptual structure of their own reference metadata for such statistics.

D.3Ensuring the consistency of metadata and data reported to international organizations

  1. Reporting of trade in services data and metadata to all international organizations should be consistent. This can be achieved by ensuring that the total services and main items are transmitted with identical values to all international organizations, even if a "not allocated" category is used in a compiling country. As mentioned in chapter 17 (section E), compilers should make every effort to allocate all services transactions to the relevant individual EBOPS services categories and to appropriate trading partners, and should avoid, as much as possible, the use of catch-all services categories (e.g., "other business services", "other business services n.i.e.", "services not allocated" or "undefined") or catch-all partner categories. To conduct such allocations, compilers could use information from partner economies. If that information is not available or reliable, then it is advised to use the most appropriate modelling and statistical techniques, on the basis of the information available to compilers. (See chapter 17 for more information on the estimation and modelling of missing data.) Such practices should be clearly documented in the metadata.

Table 18.1 Main components of the European Statistical System reference metadata

 

Concept Name

Descriptions

1.

Contact

Individual or organizational contact points for the data or metadata, including information on how to reach the contact points

2.

Metadata update

The date on which the metadata element was inserted or modified in the database

3.

Statistical presentation

 Data description, classifications used, concepts and definitions etc.

4.

Unit of measure

The unit in which the data values are measured

5.

Reference period

The period of time or point in time to which the measured observation is intended to refer

6.

Institutional mandate

Set of rules or other formal set of instructions assigning responsibility, as well as the authority to an organization for the collection, processing and dissemination of statistics

7.

Confidentiality

A property of data indicating the extent to which their unauthorized disclosure could be prejudicial or harmful to the interest of the source or other relevant parties

8.

Release policy

Rules for disseminating statistical data to interested parties

9.

Frequency of dissemination

The time interval at which the statistics are disseminated over a given time period

10.

Dissemination format

Media by which statistical data and metadata are disseminated

11.

Accessibility of documentation

 

12.

Quality management

Systems and frameworks in place within an organization to manage the quality of statistical products and processes, as well as quality assurance and quality assessment

13.

Relevance

The degree to which statistical information meet current and potential needs of the users

14.

Accuracy and reliability

Accuracy: closeness of computations or estimates to the exact or true values that the statistics were intended to measureReliability: closeness of the initial estimated value to the subsequent estimated value

15.

Timeliness and punctuality

 

16.

Comparability

Measurement of the impact of differences in applied statistical concepts, measurement tools and procedures where statistics are compared between geographical areas or over time

17.

Coherence

Adequacy of statistics to be reliably combined in different ways and for various uses.

18.

Cost and burden

Cost associated with the collection and production of a statistical product and burden on respondents

 

 

 

19.

Data revision

Any change in the value of a statistic released to the public

20.

Statistical processing

Source data, frequency of data collection, data validation, data compilation and adjustments 

21.

Comment

Supplementary descriptive text that can be attached to data or metadata


  1. If there is no acceptable way to allocate all services, compilers should use a category to be labelled "not allocated", which will be shown at the same level as the main services items (i.e., not included within any of the services items) or main partners. The corresponding values should be included at the total services level only if such a category corresponds to the classification shown, or at the total world level if it relates to the partner breakdown shown.


  1. A "services not allocated" item is included in the BOP-DSD because it is part of the reporting requirements of ECB and Eurostat for the quarterly Balance of Payments (QBOP) and of the requirements by Eurostat, OECD and the Statistical Division for annual international trade in services data. By detailed partner countries for the former and by detailed EBOPS 2010 for the latter. However, such a category is not included in the QBOP data reporting requirement of IMF. Consequently, for countries that use the category "services not allocated", the individual services items will not add up to the total services in their report to IMF. In that case, the compiler should indicate in the metadata provided to the agencies that that total services do not correspond to the sum of the main services items owing to the presence of some transactions that are impossible to allocate across services. Table 18.2 presents an example of data reporting in such a case.



Table 18.2 Example of reporting trade in services data to international organizations for cases in which the "services not allocated" category is (or is not) part of the data collection

Credit side

Submission of data to international organizations following EBOPS 2010 ("services not allocated" is part of data collection)

Submission of data to international organizations following BPM6 ("services not allocated" is not part of data collection)

Total services

130

130

Manufacturing services on physical inputs owned by others

10

10

Maintenance and repair services n.i.e.

10

10

Transport

10

10

Travel

10

10

Construction

10

10

Insurance and pension services

10

10

Financial services

10

10

Charges for the use of intellectual property n.i.e

10

10

Telecommunications, computer and information services

10

10

Other business services

10

10

Personal, cultural and recreational services

10

10

Government goods and services n.i.e.

10

10

Services not allocated

10

(Metadata need to explain the difference between total services and sum of subcomponents)


E.Country experience
Italy

  1. Bank of Italy adopts an integrated approach to the broad range of its statistical programmes. The approach of the Bank to the management of metadata is based on two pillars: (a) an information model, capable of fully describing the data, the processing steps and the elaboration algorithms and (b) a software platform, which supports the entire statistical production chain.


  1. Bank of Italy has designed a proprietary model called "Matrix," Vincenzo Del Vecchio, Fabio Di Giovanni and Stefano Pambianco, "The 'Matrix' Model:unified model for statistical data representation and processing" (Banca d'Italia, 2007). Available from https://www.bancaditalia.it/statistiche/raccolta-dati/sistema-informativo-statistico/modellazione/matrixmod.pdf. based on mathematical and statistical theory, to support all phases of the statistics production process (data definition, collection, compilation and dissemination) and all the data of interest (micro/aggregated, registers, questionnaires, etc.). A fundamental infrastructural component of the Bank's system, representing a core part of the actual implementation of the Matrix model, is the central statistical dictionary, a repository describing the entire content of the statistical data warehouse, in terms of structural metadata (e.g., concepts, classifications, data structures and processing rules) and reference metadata (e.g., methodological notes).
  2. The Matrix model was also designed to take into account major international standards, so that, for example, the Matrix data and metadata can be easily transformed into SDMX and other metadata formats. Another essential feature of the Matrix model is that it enables a metadata-driven system by employing a recently introduced software platform for statistical processing called Infostat. Fabio Di Giovanni, Daniele Piazza, "Processing and managing statistical data: a National Central Bank experience", paper presented at the "International Statistical Conference", Prague, 14-15 September 2009. Available from http://www.academia.edu/6455962/Processing_and_managing_statistical_data_a_National_Central_Bank_experience. Consistent with the underlying holistic approach, Infostat supports the statistical production chain end-to-end, by providing the following services:


(a) Identity and access management (e.g., user registration, authentication, user profiling);
(b) Metadata definition;
(c) Online data entry and data upload;
(d) Support for secure data transmission, storing and versioning;
(e) Validation and handling of reporters' feedback;
(f) Calculations;
(g) Data and metadata import, export and exchange;
(h) Event subscription and notification;
(info) User environment for metadata prototyping and data production;
(j) Reporting and publishing;
(k) Information search;
(l) Inquiry and download of data and metadata;
(m) End-to-end monitoring of business processes.

  1. An important advantage of the approach detailed above is that most of the changes in the statistical processes (e.g., the establishment of a new survey, the production of new sets of statistics, the release of a new publication, etc.) can be implemented in a timely manner by metadata administrators, avoiding the need for complex software maintenance. In fact, due to an advanced user interface, metadata administration is rather user-friendly, allowing users who are not information technology specialists to accomplish it directly, without the intervention of technical staff.
  2. The system can handle both qualitative and quantitative indicators, micro- and macrodata, questionnaires, registers and unstructured data (documents), thus allowing broad integration. Infostat also adopts the Matrix model language, called EXL (expression language), to define expressions used in data validation and in data processing phases for calculations. EXL expressions are intuitive, as they are quite similar to spreadsheet formulas, and the language is conceived to be extensible, in order to support the great variety and variability of statistical requirements.


  • No labels