Papers

Scientific publications, dissertations and reports of quality studies in the field of statistics.

Papers in this series cover methods, quality, processes, and information-technology and conceptual topics that are relevant to the work of CBS. They describe the results of (applied) scientific research carried out by CBS researchers, sometimes together with others. The series is not intended for publishing CBS figures. The figures in the papers should therefore not be regarded as official CBS results.

Volunteer work and the Sustainable Development Goals

To map the contribution of volunteer work to the Sustainable Development Goals (SDGs), this paper investigates a method that uses keyword detection to classify vacancies by SDG.

Transition plan for the redesign of the Labour Force Survey

This report proposes a methodology for estimating discontinuities caused by the transition to a new design of the Labour Force Survey, using a parallel run in combination with a time series model.

Rebasing the input price indices for civil engineering (Grond-, weg- en waterbouw, GWW) from 2010=100 to 2015=100

As of the first quarter of 2020, the GWW price indices are published with a new reference year, namely 2015. This publication describes what has changed compared with the GWW series based on 2010=100.

Tourism statistics and the use of social media data

This note describes fundamental research into various ways of using publicly available social media for statistics in the tourism domain.

Cybersecurity companies in the Netherlands

A discussion paper on research into observing and describing the characteristics of cybersecurity companies in the Netherlands in 2019 using web scraping and text mining.

Measuring the internet economy with big data

A discussion paper on research into measuring the internet economy in the Netherlands for the period 2016-2018 using big data. This research repeats a comparable 2016 study covering the year 2015.

Results of the Doctorate Holders Survey 2019

In 2019, CBS conducted its third survey among doctorate holders. The aim of this survey is to increase insight into the careers and the degree of international mobility of doctorate holders after obtaining their doctorate.

Integration treaties and international firms

Research into the relationship between trade and investment treaties and the internationalisation of firms.

Methodology audits at national statistical institutes

Methodology audits at national statistical institutes: an international inventory

Pension entitlements statistics, 2017

Account and results of the pension entitlements statistics 2017.

A dynamic factor model for unemployment statistics

Discussion paper on a method for using a large number of auxiliary series from big data sources in estimating the monthly unemployed labour force.

Accuracy of estimators in binary classification

Publications in official statistics are often broken down by domain. It is then important to determine the effect of misclassifications between domains on the accuracy of these statistics.

Big data in official statistics

The use of big data in official statistics requires methodological innovations.

Finding the websites of businesses

Linking the domain names of websites to legal business units

Stop detection in mobility surveys

In app-assisted travel surveys, sensors are used to measure location and to predict stops and trips.

Is undesirable response behaviour consistent?

This paper investigates whether response behaviour that carries an increased risk of measurement error recurs across multiple surveys, that is, whether it is a trait of some respondents.

Free services, an experimental study

This report discusses one aspect of the digital economy, namely the spread of new and free goods and services.

Financial key figures of healthcare institutions 2018

Description of the financial key figures of healthcare institutions for the year 2018, which make it possible to benchmark individual healthcare institutions.

Machine learning techniques for the CPI

A comparative study of the use of machine learning techniques for classifying products for the CPI.

Study on the tenability of the country-of-residence factor calculation

Is the method for calculating the healthcare contribution of Dutch nationals living abroad tenable?

Time series of collectively agreed (CAO) wages 2010=100

This note describes how the long time series of collectively agreed (CAO) wages 2010=100 was constructed.

Time series modelling of mobility trends

Discussion paper on a method for producing detailed trend estimates of mobility, corrected for methodological discontinuities.

Nitrogen loss from stored manure

Nitrogen loss calculated from the difference in the ratio of nitrogen to phosphate at excretion and at manure removal.

Methods for trend analysis in the Evaluation of the Fertilisers Act

Two methodological reports on trend analyses within the Evaluation of the Fertilisers Act (Evaluatie Meststoffenwet).

Components of unemployment

Components (causes) of unemployment, determined statistically.

Cyclically adjusted unemployment

Cyclically adjusted unemployment has been calculated; it gives an indication of the long-term development.

Children of divorced parents

Children of divorced parents: registered versus reported residential address

Analysis of seasonal differences in reported turnover values

Analysis of seasonal differences in reported turnover values in sample survey data versus VAT data

Revision of the Income Statistics 2011

Describes the aspects of the revision of the Income Statistics and its effect on the results.

Correcting for linkage errors in multiple capture-recapture

Correcting for linkage errors in the multiple capture-recapture method when estimating population size.

Import behaviour and export performance of firms

Research into the relationship between the export performance of Dutch firms and their import behaviour.

Use of social media, 2017

Exploratory research into the possibilities of using social media for business statistics.

Alternatives to Excel in the Iv3 chain

Report, produced in collaboration with the Open State Foundation, that examines alternatives to Excel submissions for reporting Iv3 information.

Quality assessment of Mezuro

This report describes how CBS carried out the quality analysis of Mezuro data at Mezuro's request.

E-commerce Survey Caribbean Netherlands 2018-2019

This report highlights the results of the e-commerce survey Caribbean Netherlands 2018-2019.

Import competition and firm innovation, 2000-2010

The influence of import competition on innovation by firms in Dutch manufacturing in the period 2000-2010.

Protecting tables by means of cell suppression

Statistical offices protect tabular data against disclosure of sensitive information before publication. A well-known method is cell suppression. In this paper we propose a new approach to cell suppression.

Pension entitlements statistics, 2016

Account and results of the pension entitlements statistics 2016.

Measuring online platforms

Measuring online platforms: definitions, actors and statistical challenges

Big data sources for official statistics

Discussion paper about nowcasting monthly unemployment figures using auxiliary series derived from big data sources.

Carry-along export: what gets bundled?

An analysis of carry-along export (CAT) by Dutch goods exporters.

The Netherlands Business Survey at regional level

The relationship between sentiment indicators and realisations per COROP region: statistical-economic and methodological background.

Risk of transport poverty

Which factors play a role in transport poverty, and to what extent does the risk of transport poverty differ between a city with a growing population (Utrecht) and a city with a shrinking population (Heerlen)?

Financial key figures of healthcare institutions 2017

Financial key figures of healthcare institutions, which make it possible to benchmark individual healthcare institutions.

Measuring loneliness

This article is about the method CBS uses to measure loneliness. Two different measurement instruments are compared.

Pension entitlements statistics, 2015

Account and first results of the pension entitlements statistics, renewed as of 2015.

The role of SMEs in the Dutch economy, 2012

The role of different types of firms (SMEs, large enterprises) in the Dutch economy, 2012: method and results.

Quality management for methodology

Methodological departments of national statistical institutes should adopt formal quality certification for ISO 9001.

A MIP approach to generalised error localisation

We present and test an approach to automatic error localisation based on general adjustment actions.

Statistical quality: certification, rules, culture

National statistical institutes should adopt formal quality certification, e.g. ISO or EFQM.

Gas pressure and gas pressure gradients in Groningen

A report containing a statistical analysis of the relationship between gas pressure and gas pressure gradients within the Groningen reservoir prior to earthquakes, commissioned by the State Supervision of Mines (Staatstoezicht op de Mijnen).

Supply and use tables for ecosystem services in the Netherlands

Methodological description and application of thirteen ecosystem service models in physical supply and use tables.

Weibull distributions of earthquakes in Groningen

A report analysing the time intervals between earthquakes in Groningen, in relation to gas extraction.

Transitivity of price indices

Transitivity of price indices is advocated and various ways to construct these are shown.

Potential applications of complexity theory at CBS

This position paper indicates how CBS can apply the insights of complexity theory in official statistics.

Tourism in the Caribbean Netherlands 2017

This publication contains indicators on inbound tourism on the islands of Bonaire, Saba and Sint Eustatius in the period 2015-2017.

Cointegration in structural time series models

Investigating cointegration between inflation indicators within the price dashboard using structural time series models.

Relationship between inflation indicators

Investigating correlation and temporal relationships between the inflation indicators within the price dashboard.

Exploring alternatives to the CUMI scheme

An exploration of alternatives to the CUMI scheme in special education

Work and establishments in the Netherlands

The study addresses differences in definitions and differences in data collection.

Sampling networks with limited access

This paper discusses several topics concerning the sampling of networks with limited access.

Reservoir pressure and numbers of earthquakes in Groningen

Re-analysis of the frequency of earthquakes in Groningen, prompted by the improved daily time resolution of the reservoir gas pressure

Impact of linkage errors and erroneous captures

Capture-recapture methods and violation of the assumptions of perfect linkage and no erroneous captures

Public export credit insurance in the Netherlands

Quantification of the contribution of public export credit insurance to GDP and employment

Estimating methodological discontinuities in surveys

Methods for quantifying the effects of changes in a survey.

Basic situations and methods

Multi-source Statistics: basic situations of combining sources and estimation methods

From GEKS to the cycle method

The GEKS method and several generalizations are discussed, including the cycle method

Making elementary price indices transitive

Transitivizing elementary price indices for internet data using the cycle method

Filtering in the Fourier domain

Filtering in the Fourier domain: a new set of filters for seasonal adjustment of time series and its evaluation

Consistent multivariate seasonal adjustment

Consistent multivariate seasonal adjustment, based on a combination of a multivariate structural time series model.

Reservoir pressure decline and numbers of earthquakes

A report of continued research into the frequency of earthquakes in Groningen, in relation to gas extraction.

Response behaviour in the Safety Monitor

Research into response behaviour and selectivity when using internet and paper modes in the Safety Monitor (Veiligheidsmonitor)

Fraud in online trade

Fraud in online trade: answers from the Safety Monitor compared with the police register

Mass imputation for the virtual census

Estimating highest attained level of education for the Dutch virtual census.

Imputation of Numerical Data under Edit Restrictions

In this paper we propose a new imputation method that takes edit rules into account.

Extending the Materials Monitor with stocks

Feasibility study on physical stocks (the urban mine) in our economy

Big Data Masterclass and DataCamp 2015

This discussion paper describes a method based on latent class modeling

Removing the Gap between Annual and Sub-Annual Statistics

Statistical methods for reconciling data that are published at different frequencies (e.g. monthly and quarterly).

Estimation of level and change for unemployment

Discussion Paper about model-based methods for the estimation of monthly unemployment by province

Scalar measures of uncertainty

Measuring the uncertainty in accounting equations (estimated figures connected by known constraints) by scalar measures.

Estimating unemployment levels using a mix of three interview modes

This article contains an appraisal of two recently proposed methods of inference for sequential mixed-mode surveys.

Profiling of Twitter users: a big data selectivity study

Discussion paper on the potential and implications of profiling big data sources for official statistics.

Extending the Materials Monitor with water

This is a methodological report on the extension of the Materials Monitor.

Design and analysis of experiments embedded in complex sample surveys

Randomized experiments embedded in probability samples typically find their applications in survey methodology to test the effect of alternative survey implementations on the outcomes of a sample survey. The value of empirical research into survey methods is strengthened as conclusions can be generalized to populations larger than the sample that is included in the experiment. This can be achieved by selecting experimental units randomly from a larger target population and naturally leads to randomized experiments embedded in probability samples. This thesis develops statistical methods for design and analysis of such large scale field experiments.

Museums and internet data, 2016

Figures on museums, such as visitor numbers, admission prices and the use of social media, are currently obtained through annual surveys among museums (digital from survey year 2015 onwards).

Multilevel Hierarchical Bayesian vs. State Space Approach in Time Series Small Area Estimation: the Dutch Travel Survey

This study compares two different techniques in a time series small area application: state space models estimated with the Kalman filter with a frequentist approach to hyperparameter estimation, and multilevel time series models estimated within the hierarchical Bayesian framework. The application chosen is the Dutch Travel Survey featuring level breaks caused by the survey redesigns, as well as small sample sizes for the main publication domains. Both models require variances of the design-based domain estimates as prior information. In practice, however, only unstable estimates of the design-based variances are available. In this paper, excessive volatility and a possible bias in design-based variance estimates are removed with the help of a state space model. The multilevel and state space modelling approaches deliver similar results. Slight differences in model-based variance estimates appear mostly in small-scale domains and are due to neglecting uncertainty around the hyperparameter estimates in state space models, and to a lesser extent due to skewness in the posterior distributions of the parameters of interest. The results suggest that the reduction in design-based standard errors with the hierarchical Bayesian approach is over 50% at the provincial level, and over 30% at the national level, averaged over the domains and time.

Undercoverage of the population register in the Netherlands 2010

In this manuscript we consider an important topic in official statistics, namely estimating the number of usual residents. For the Netherlands we investigate the undercoverage of the Population Register. First, the Population Register is linked to an Employment Register and a Crime Suspects Register. Then, we use three-list capture-recapture methodology to estimate the number of usual residents. Two problems arise: 1) none of the three registers has a variable on usual residence; 2) capture-recapture methodology relies on a couple of assumptions, for which it cannot be verified whether they are met. The paper shows how the problems are solved.

Social media as a data source for official statistics; the Dutch Consumer Confidence Index

In this paper the question is addressed how alternative data sources, such as administrative and social media data, can be used in the production of official statistics. Since most surveys at national statistical institutes are conducted repeatedly over time, a multivariate structural time series modelling approach is proposed to model the series observed by repeated surveys with related series obtained from such alternative data sources. Generally, this improves the precision of the direct survey estimates by using sample information observed in preceding periods and information from related auxiliary series. The concept of cointegration is applied to address the question to what extent the alternative series represent the same phenomena as the series observed with the repeated survey. The methodology is applied to the Dutch Consumer Confidence Survey and a sentiment index derived from social media.

Monitor of Sustainable Agro-commodities 2015

The Monitor of Sustainable Agro-commodities 2015 presents figures and trends on making a wide range of agro-commodities, such as palm oil, soy and coffee, more sustainable.

Adjusting measurement bias in sequential mixed-mode surveys using re-interview data

In mixed-mode surveys, mode-differences in measurement bias, also called measurement effects or mode effects, continue to pose a problem to survey practitioners. In this paper, we discuss statistical adjustment of measurement bias to the level of a measurement benchmark mode during inference from mixed-mode data. In doing so, statistical methodology requires auxiliary information which we suggest to collect in a re-interview administered to a sub-set of respondents to the first stage of a sequential mixed-mode survey. In the re-interview, relevant questions from the main survey are repeated. After introducing the design and presenting relevant statistical theory, this paper evaluates the performance of a set of six candidate estimators that exploit re-interview information in a Monte Carlo simulation. In the simulation, a large number of parameters is systematically varied, which define the size and type of measurement and selection effects between modes in the mixed-mode design. Our results indicate that the performance of the estimators strongly depends on the true measurement error model. However, one estimator, called the inverse regression estimator, performs particularly well under all considered scenarios. Our results suggest that the re-interview method is a useful approach to adjust measurement effects in the presence of non-ignorable selectivity between modes in mixed-mode data.

Housing market developments around the Groningen field up to and including 2015 Q2

Report, methodological notes and tables on housing market developments around the Groningen field.

Can survey item characteristics relevant to mode-specific measurement error be coded reliably?

In multi-mode questionnaire design, usually some consideration is given to mode-specific measurement error. Despite this consideration, however, these measurement effects can be unexpectedly large. For this reason, there is a strong incentive to better predict measurement effects. This may be done by constructing profiles of a questionnaire, in which relevant item characteristics are summarized. For all items of a survey, these item characteristics need to be coded and combined. In this paper, we evaluated a list of item characteristics that literature has reported as relevant to mode-specific measurement error. Most importantly, we evaluated the reliability of the coding of such characteristics. Our results showed that intercoder reliability can be low for the most relevant characteristics. This may be explained by the difficulty of defining the item characteristics and the inherent subjectivity with which these item characteristics are coded. Finally, some suggestions are made for coping with low intercoder reliability.

Selecting adaptive survey design strata with partial R-indicators

Recent survey literature shows an increasing interest in survey designs that adapt data collection to characteristics of the survey target population. Given a specified quality objective function, the designs attempt to find an optimal balance between quality and costs. Finding the optimal balance may not be straightforward as corresponding optimization problems are often highly non-linear and non-convex. In this paper, we discuss how to choose strata in such designs and how to allocate these strata in a sequential design with two phases. We use partial R-indicators to build profiles of the data units where more or less attention is required in the data collection. In allocating cases, we look at two extremes: surveys that are run only once, or infrequently, and surveys that are run continuously. We demonstrate the impact of the sample size in a simulation study and provide an application to a real survey, the Dutch Crime Victimisation Survey.

Calibrated Hot Deck Imputation for Numerical Data under Edit Restrictions

A common problem faced by statistical institutes is that some data of otherwise responding units may be missing. This is referred to as item nonresponse. Item nonresponse is usually treated by imputing the missing data. The problem of imputing missing data is complicated by the fact that statistical data often have to satisfy so-called edit rules, which for numerical data usually take the form of linear restrictions. A further complication is that numerical data sometimes have to sum up to known totals. Standard imputation methods for numerical data as described in the literature generally do not take such linear edit restrictions on the data or known totals into account. In this paper we develop simple imputation methods that satisfy edits and preserve known totals. These methods are based on well-known hot deck approaches. Extension of our methods to other types of imputation, such as regression imputation or predictive mean matching, is straightforward.

Statistical matching: Experimental results and future research questions

National statistical institutes try to construct data sets that are rich in information content by combining available data as much as possible. Unfortunately, units, e.g. persons or enterprises, in different data sources cannot always be matched directly with each other, for example because different data sources often contain different units. In such a case one can sometimes resort to statistical matching rather than exact matching. Statistical matching can be used when different data sources contain (different) units with a set of common (background) variables. These common variables may then be used to match similar units in the data sources to each other. From March 2015 till the end of June 2015 two master's students, Sofie Linskens and Janneke van Roij, at Tilburg University evaluated some methods for statistical matching, namely random hot deck, distance hot deck and statistical matching using a parametric model, on categorical data from the Dutch Population Census 2001. In this paper we describe the methods that they examined and the results they obtained.

Purchasing power and income of the working population in the Netherlands

Research description of the purchasing power and personal income of the working population in the Netherlands and the differences that appear when broken down by transport mode used for work-related trips. Commissioned by: Interbest B.V.

Simplifying constraints in data editing

Data editing is the process of checking and correcting data. In practice, these processes are often automated. A large number of constraints needs to be handled in many applications. This paper shows that data editing can benefit from constraint simplification techniques that are often used in Operations Research and Artificial Intelligence. Performance can be improved and a better quality of automatically corrected data can be obtained. First, a new procedure for constraint simplification will be proposed that is especially developed for data editing; a procedure that combines several known algorithms from Operations Research and Artificial Intelligence. Thereafter, it will be demonstrated that real-life edit sets can actually be simplified.

Modelling Measurement Error to Estimate Bias in Administrative and Survey Variables

In this paper, we describe a method for assessing the measurement validity as well as the bias of administrative and survey variables by means of a structural equation model. The method is applied to value-added tax turnover for the Dutch short-term statistics.

Expanding the Material Flow Monitor

A feasibility study on the concepts and data needed to create an integrated measurement system for the circular economy, bio-based economy, eco-taxation and other resource issues.

Statistical inference based on randomly generated auxiliary variables

In most real-life studies, auxiliary variables are available and are employed to explain and understand missing data patterns and to evaluate and control causal relations with variables of interest. Usually their availability is assumed to be a fact, even if the variables are measured without the objectives of the study in mind. As a result, inference with missing data and causal inference require a number of assumptions that cannot easily be validated or checked. In this paper, a framework is constructed in which auxiliary variables are treated as a selection, possibly random, from the universe of variables on a population. This framework provides conditions to make statistical inference beyond the traces of bias or effects found by the auxiliary variables themselves. The utility of the framework is demonstrated for the analysis and reduction of nonresponse in surveys. However, the framework may be more generally used to understand the strength of associations between variables. Important roles are played by the diversity and diffusion of the population of interest, features that are defined in the paper and the estimation of which is discussed.

Evaluating bias of sequential mixed-mode designs against benchmark surveys

This study evaluated three types of bias – total, measurement, and selection bias – in three sequential mixed-mode designs of the Dutch Crime Victimization Survey: telephone, mail, and web, where nonrespondents were followed up face-to-face. In the absence of true scores, all biases were estimated as mode effects against two different types of benchmarks. In the single-mode benchmark (SMB), effects were evaluated against a face-to-face reference survey. In an alternative analysis, a ‘hybrid-mode benchmark’ (HMB) was used, where effects were evaluated against a mix of the measurements of a web survey and the selection bias of a face-to-face survey. A special re-interview design made available additional auxiliary data exploited in estimation for a range of survey variables. Depending on the SMB and HMB perspectives, a telephone, mail, or web design with a face-to-face follow-up (SMB) or a design involving only mail and/or web but not a face-to-face follow-up (HMB) is recommended based on the empirical findings.

Predictive inference for non-probability samples

Non-probability samples provide a challenging source of information for official statistics, because the data generating mechanism is unknown. Making inference from such samples therefore requires a novel approach compared with the classic approach of survey sampling. We propose a framework based on predictive inference and discuss three classes of methods. We conduct a simulation study with a real-world data set.

The Dutch sport economy, 2006-2012 (Dutch only)

Report and tables on the sport satellite account. Commissioned by: Ministry of Health, Welfare and Sport.

Purchasing power of the working population in the Netherlands (Dutch only)

Research description of the purchasing power of the working population in the Netherlands and differences after the breakdown by transport mode used for work-related trips. Commissioned by: Interbest B.V.

Monitor topsectors 2015 (Dutch only)

Report and dashboard containing the results, description of methods and (custom-made) data tables regarding the nine top business sectors within the Netherlands (the so-called ‘topsectors’), along with their regional distribution, as obtained from the Monitor topsectors 2015. Commissioned by the Ministry of Economic Affairs (EZ).

Record linkage for health studies: three demonstration projects

Linking data is becoming increasingly common in statistical and academic research. Combining information from different sources makes it possible to answer questions that are impossible or very difficult to answer with data from a single source. It is important to understand the technical, methodological and legal limitations of data linkage. This report describes three examples in which medical and general data sources were linked. It was produced as part of the Biolink NL project, one of the so-called rainbow projects funded by the Biobanking and Biomolecular Research Infrastructure (BBMRI-NL). It is a collaboration between a number of academic research institutes and CBS. This is the second and final report examining the methodological aspects of file linkage within the Biolink NL project.

General approaches for consistent estimation based on administrative data and surveys

National statistical institutes (NSIs) fulfill an important role as providers of objective and undisputed statistical information on many different aspects of society. To this end NSIs try to construct data sets that are rich in information content and that can be used to estimate a large variety of population figures. At the same time NSIs aim to construct these rich data sets as efficiently and cost effectively as possible. This can be achieved by utilizing already available administrative data as much as possible, and supplementing these administrative data with survey data collected by the NSI. In this paper we will focus on obtaining consistent population estimates from such a combination of administrative data sets and surveys. We will sketch general approaches based on weighting, imputation and macro-integration, and discuss their advantages and drawbacks.

Quantifying the effect of classification errors on the accuracy of mixed-source statistics

Estimating the effect of non-sampling errors on the accuracy of mixed-source statistics is not straightforward. Here we simulate the bias and variance of the turnover estimates in car trade due to classification errors, using a bootstrap approach. In addition, we study the extent to which manual selective editing at micro level can improve the accuracy. We discuss how to develop a practical method that can be implemented in production to estimate the accuracy of register-based estimates.

Data-related issues with composite indicators: availability, frequency and adjustment techniques

Constructing the supplementary pension table

Previous international standards for national accounting provided a somewhat scattered picture of pension arrangements. This made international comparability of statistics difficult, because of the diversity in pension schemes among countries. The new European System of Accounts (ESA 2010) allows for a better analysis and international comparability of the pension systems within and between countries, by introducing a supplementary table on pension schemes. This paper focuses on the construction of the supplementary pension table, including estimates for the size of the unfunded pension schemes. It describes the methodology used, and the choices made by Statistics Netherlands in completing this table for 2012.

Complementing official health statistics with internet search indices

This study investigates if and how internet search indices can contribute to a quantitative picture of health (related) topics in the Netherlands.

Wie wordt bereikt via het Europees Sociaal Fonds? Kenmerken van deelnemers in Nederland, 2015

Report and tables on the characteristics of participants in projects set up under the European Social Fund (ESF) and on the durability of the employment relationship after participation in ESF projects in the period 2010-2014. Commissioned by: the Agency of the Ministry of Social Affairs and Employment (SZW).

Regional differences in long-term care use by elderly and chronically ill people in 2012 (Dutch only)

Tables and a research description on the regional differences in long-term care use by elderly and chronically ill people in 2012. The tables show the regional use of long-term care, an analysis of the possible determinants of long-term care use and the remaining regional differences after correction for these determinants. The research description explains why and how the analysis was conducted. It also contains a reading guide for the tables. Commissioned by: Netherlands Court of Audit (NCA)

Unoccupied dwellings 2013: are they available for the housing market?

Report and tables on the availability of unoccupied dwellings in the Netherlands on 1 July 2013. The tables contain information on energy consumption, possible secondary functions and approved demolition permits of unoccupied dwellings per region. The report describes the main results. Commissioned by the Ministry of the Interior and Kingdom Relations.

Operationalisatie sterfte na ontslag in de HSMR

This study was commissioned by Dutch Hospital Data (DHD). Its aim is to explore a possible further development of the Hospital Standardised Mortality Ratio (HSMR) by including mortality shortly after discharge. The advantage of including post-discharge mortality in the standardised mortality ratio is that the ratio becomes less dependent on hospitals' discharge policies. Pouw et al. (2013) examined the feasibility and effects of such an indicator in the Netherlands and concluded that it would indeed be desirable to include mortality shortly after discharge. That study did not examine the optimal period during which deaths should be counted, whether this period differs by diagnosis group, or whether it should be counted from admission or from discharge. Statistics Netherlands (CBS) investigated these questions on behalf of DHD. The results are presented in this report.

Internationalisation and R&D of Dutch small and medium-sized enterprises (Dutch only)

The research report and the accompanying custom-made tables show to what extent Dutch small and medium-sized enterprises invest in R&D, but are not active on foreign markets in terms of imports, exports or outward investments. Commissioned by: Ministry of Foreign Affairs, Ministry of Economic Affairs and Netherlands Enterprise Agency (RVO)

Internationalisering en R&D in het Nederlands midden- en kleinbedrijf

The research report and the accompanying set of tables show which part of Dutch small and medium-sized enterprises does invest in R&D but is not active on foreign markets.

Bronnenonderzoek Digitale Atlas Natuurlijk Kapitaal

Report describing the possibilities for using CBS sources for the digital atlas of natural capital.

Estimating detailed frequency tables from registers and sample surveys

Repeated weighting is a method for the consistent estimation of multiple frequency tables from registers and sample surveys. Statistics Netherlands uses repeated weighting for the compilation of the Dutch census. The application of the method is not without its problems. Estimation problems may occur, especially for detailed tables in which some of the cells are covered by only a few (or even no) observations. This paper reviews existing solutions for methodological problems and proposes new solutions when necessary. The problems and solutions are illustrated with experiences of the Dutch 2011 Census compilation. The general message of this paper is that repeated weighting can be applied to very complex estimation problems, although it still has its limitations. (Jacco Daalmans)

Small area estimation with zero-inflated data - a simulation study

Many target variables in official statistics follow a semicontinuous distribution with a mixture of zeros and continuously distributed positive values, also called zero-inflated variables. When reliable estimates for subpopulations with small samples are required, a model-based small area estimation method can be used, which improves the accuracy of the estimates by borrowing information from other domains. In this paper, two small area estimators are compared in a simulation study with zero-inflated target variables. The first estimator, the EBLUP, can be considered the standard small area estimator, and is based on a linear model that assumes normal distributions. Therefore it is model-misspecified in our situation. The second estimator is based on a model that takes the zero-inflation into account and is therefore less misspecified. Both estimators are found to improve the accuracy compared to a design-based approach. The gain in accuracy is generally larger for the model that takes the zero-inflation into account. The amount of improvement depends on properties of the population. Furthermore, there are large differences in improvement between the domains.

Expat, wanneer ben je het?

The term expats is widely known, but a generally accepted definition of this group of foreign employees is lacking.

Haalbaarheidsstudie prijswaarneming GVO's

Report describing the possibilities for observing prices of Guarantees of Origin (Garanties van Oorsprong, GVO’s) for Dutch wind energy. GVO’s are certificates linked to the production of renewable energy. Commissioned by: Ministry of Economic Affairs.

Monitor artists and arts graduates (Dutch only)

Paper and tables on various aspects of the labour market position of artists, people with creative jobs, and intermediate and higher level arts graduates. Commissioned by: Ministry of Education, Culture and Science (OCW).

Monitor kunstenaars en afgestudeerden aan creatieve opleidingen

Report and set of tables on the labour market position of artists, people working in other creative occupations, and graduates of creative programmes in higher professional (hbo) and senior secondary vocational (mbo) education. Commissioned by: Ministry of Education, Culture and Science (OCW).

Monitor kwaliteit Stelsel van Basisregistraties. Nulmeting van de kwaliteit van basisregistraties in samenhang, 2014

Report and set of tables on the quality of the base registers (Basisregistraties), based on 19 indicators. The report describes the main results per indicator. The set of tables contains detailed information on the 19 indicators, broken down by a number of interpretive variables. Commissioned by: STelsel Oplossingen en UitvoeringsTraject (STOUT) of the ICTU foundation (ICT uitvoeringsorganisatie).

Gelijk loon voor gelijk werk? Banen en lonen bij de overheid en bedrijfsleven, 2012

This report outlines unadjusted and adjusted pay differences between men and women in government and the private sector in 2008, 2010 and 2012. The publication is accompanied by a methodological explanation and an additional table on wage differences including and excluding special remuneration. Commissioned by: Ministry of Social Affairs and Employment.

Maatwerk voor brancheverenigingen

Report on the characteristics of employees working in the VGB sector (Vereniging van Groothandelaren in Bloemkwekerijproducten). Commissioned by: VGB.

Model selection for small area estimation in repeated surveys

Many surveys are repeated at regular intervals to monitor temporal change in quantities of social or economic importance. When small area estimation methods are applied in such settings, the question arises of which models to use. Model selection techniques are used to identify an optimal set of covariates from a larger set of candidates. In the context of repeated surveys the question arises how to select a model for small area predictions that are comparable over time. To this end, this paper presents and compares four approaches to model selection for repeated surveys. Consecutive editions of the Dutch crime victimization survey are used as a case study.

Monitor topsectoren 2014

What is the (economic) importance of the top sectors for the Netherlands? What is their innovative strength? How are the top sectors developing, and does this differ from other economic sectors?

Multivariate State-Space Approach to Variance Reduction in Series with Level and Variance Breaks

This study illustrates how time-series model-based techniques can improve the production of official statistical figures for repeatedly conducted surveys. Apart from reducing the variance of design-based estimates, time series models provide estimates on the magnitude of discontinuities caused by survey redesigns. In surveys with a subdivision into several domains, such models can be effectively applied in the context of small area estimation problems.

Vooronderzoek satellietrekening cultuur en creatieve industrie

Report on the feasibility of compiling a satellite account for culture and the creative industry. Commissioned by: Ministry of Education, Culture and Science.

Inflation dynamics in the Netherlands; a linear and non-linear analysis and the influence of economic conditions

Insight into the nature of inflation dynamics is crucial for prediction and for understanding inflationary pressure. This study applies a whole range of linear and non-linear time series models to the rate of inflation in the Netherlands between 1970 and 2009 to test which one best describes inflation dynamics.

Traffic intensity as indicator of regional economic activity

In this study it is shown that it is possible to construct indicators of regional economic activity based on local traffic intensity data. Sensor readings on traffic flow from the NDW database were used to construct a monthly indicator of traffic intensity in the Eindhoven region.

Analysing short term developments in key economic indicators: predicting the sign of period on period changes

This study applies a range of predictive techniques to forecasting the sign of the next month’s change in five key economic indicators: consumer confidence, household consumption, producer confidence, manufacturing production and exports, all for the Netherlands. Techniques tested range from standard regression models to advanced machine learning techniques.

Time-series small area estimation for unemployment based on a rotating panel survey

A time-series multi-level model is used to estimate municipal unemployment based on the Dutch Labour Force Survey (LFS) at a quarterly frequency. The model includes random municipality effects as well as random municipality-by-quarter effects. The latter are modelled as independent effects or random walks over time, or as a sum of both. The model is fit using quarterly LFS data from 2003 to 2008.

The impact of survey item characteristics on mode-specific measurement bias in the crime victimisation

In this discussion paper, the interaction between survey item characteristics and survey mode is investigated in terms of measurement bias. For this purpose a typology of survey items was constructed and is presented in the paper. Mode-specific measurement bias was estimated using a large-scale mixed-mode experiment linked to the Dutch Crime Victimisation Survey of 2011. This experiment consisted of a randomized allocation of sample persons to the four survey modes web, paper, telephone and face-to-face. A multi-level model is used to explain mode-specific measurement bias for a set of approximately 125 items from the Crime Victimisation Survey.

Theoretical and empirical support for adjustment of nonresponse by design

Recently, various indicators have been proposed as indirect measures of nonresponse error in surveys. They employ auxiliary variables to detect nonrepresentative or unbalanced response. A class of survey designs known as adaptive survey designs maximizes these indicators by applying different treatments to different subgroups. The natural question is whether the decrease in nonresponse bias caused by adaptive survey designs could also be achieved by nonresponse adjustment methods. We discuss this question and provide theoretical and empirical considerations, supported by a range of household and business surveys. We find evidence that balancing response reduces bias more than adjustment does.

Error localisation using general edit operations

In this paper, a generalisation of the Fellegi-Holt paradigm is proposed that can incorporate various complex edit operations in a natural way. In addition, an algorithm is outlined that may be used to solve the resulting generalised error localisation problem, at least in theory. It is hoped that this generalisation may be used to increase the suitability of automatic editing in practice, and hence to improve the efficiency of data editing processes.

Leegstaande woningen

Report and set of tables on unoccupied dwellings in the Netherlands. The set of tables contains information on the number of unoccupied dwellings by region, characteristics of the unoccupied dwellings and the duration of vacancy. The report describes the main results. Commissioned by: Ministry of the Interior and Kingdom Relations.

Small area estimation with state-space common factor models for rotating panels

Macro-economic indicators about the labour force, published by national statistical institutes, are predominantly based on rotating panels. The sample sizes of most Labour Force Surveys, in combination with the design-based or model-assisted mode of inference usually applied in survey sampling, obstruct the publication of such indicators at a monthly frequency. It is shown how multivariate structural time series models can be used to obtain more precise model-based estimates.

Measuring Inequalities in the Dutch Household Sector

In recent years, the importance of the household sector in measuring economic welfare has increasingly been recognised, and the development of additional indicators to measure inequalities is suggested. This article reports the work done by Statistics Netherlands concerning the development of such indicators. National accounts data has been combined with distributional information to divide income, consumption and wealth over household groups. This paper presents the preliminary results for the standard of living.

Kwaliteitsrichtlijnen

The Kwaliteitsrichtlijnen (quality guidelines) describe the standards that the statistical processes of CBS must meet. The guidelines form the basis for audits and self-assessments of statistical processes. This report can also serve as input for the (re)design of statistical processes. The Kwaliteitsrichtlijnen integrate existing international and national frameworks as well as CBS guidelines and board decisions.

Arbeidsproductiviteit in de elektrotechniek en de machinebouw

This report describes the electrical engineering and mechanical engineering sectors, labour productivity in these sectors and a number of determinants of labour productivity. Commissioned by: Ministry of Economic Affairs. A short technical annex to the report is available on request.

Record Linkage in Health Data: a simulation study

Record linkage is becoming more and more common in statistical and academic research. Linking records makes it possible to combine data from different sources to answer research questions that are very difficult to answer using data from just one source. In the present paper, we demonstrate the influence that the choice of variables and linkage algorithms has on linkage results, but also the importance of the properties of the data sources.

Error localization as a mixed integer problem with the editrules package

Error localization is the problem of finding out which fields in raw data records contain erroneous values. The editrules extension package for the R environment for statistical computing was recently extended with a module that allows for error localization based on a mixed integer programming formulation (MIP). In this paper we describe the MIP formulation of the error localization problem for the case of numerical, categorical, or mixed numerical and categorical datasets.

Selectivity of Big data

Assessment of selectivity of Big data sets is generally not straightforward, if at all possible. Some approaches are proposed in this paper. It is argued that the degree to which selectivity, or its assessment, is an issue depends on the way the data are used for production of statistics. The role Big data can play in that process ranges from minor through supplementary to vital. Methods for inference that are in part or wholly based on Big data need to be developed, with particular attention to their capabilities of dealing with or correcting for selectivity of Big data. This paper elaborates on the current view on these matters at Statistics Netherlands, and concludes with some discussion points for further consideration or research.

Inkomsten uit werk en toch een langdurig laag inkomen, kenmerken en verklaringen

This report contains results on the composition of the group of working people with a long-term low income and the factors that play a role in having a long-term low income. Commissioned by: Ministry of Social Affairs and Employment.

How unusual weather influences GDP

We carried out an analysis at a sectoral level, and tested a score of different, potentially relevant, weather effects. The influence on GDP was then computed by aggregation. We found that, on a quarterly basis, several industries exhibit a significant weather effect. As a result of sectoral effects having opposite signs, the net effect of unusual weather on GDP as a whole is rather modest for some periods, while for other periods, the effect is more substantial.

Verbeteringen en reeksbreuken in economische statistieken

To keep statistics up to date, improvements are introduced regularly. Any breaks in series that may result as a downside are accommodated as well as possible, so that users experience the benefits of the improvements as much as possible and the drawbacks of a break as little as possible. This report is about improvements and breaks in series in economic statistics.

The use of within-subject experiments for estimating measurement effects in mixed-mode surveys

The estimation of measurement effects (MEs) of survey modes in the presence of selection bias poses a great problem to methodologists. We present a new method to estimate MEs by means of “within-subject designs”, in which the same sample is approached by two different modes at two subsequent points in time. The decomposition of mode effects into MEs and selection biases is illustrated for key statistics from the Dutch Crime Victimization Survey using data from a large-scale within-subject experiment conducted within Statistics Netherlands’ project Mode Effects in Social Surveys (abbreviated to MEPS in Dutch).

Explicit and Implicit Calibration of Covariance and Mean Structures

Linear structural equation models (SEMs) are widely used to assess the validity and reliability of survey variables. When population means or totals are of interest, it is also important to assess whether the observed variables contain an intercept bias. Unfortunately, standard identification procedures for SEMs define an arbitrary metric for the latent variables, which prevents the estimation of valid latent means and intercepts in a single population. In this paper, it is shown how an audit sample may be used to estimate a non-arbitrary set of identification restrictions.

Kwaliteitsmaten voor het datacorrectieproces

Data editing benefits from indicators that show, simply and clearly, the influence of the editing process on the data. This report describes several indicators that graphically highlight aspects of change in the data resulting from an editing process. The indicators fall into two types: indicators that relate to values and indicators that relate to rule violations.

Solving the nonresponse problem with sample matching?

This paper explores the use of matched samples as an alternative for estimators based on surveys suffering from a substantial amount of nonresponse.

Information management as tool for standardization in statistics

We describe the role of information management in redesign programs: why it is needed and how it is at present being developed through international cooperation in the Generic Statistical Information Model (GSIM) project, and we give some examples of how information management has been used in practice at Statistics Netherlands. We conclude that GSIM is necessary for more extensive usage of big data sources, for interoperability of statistical tools and for sharing tools between NSIs.

Estimation of household income based on asset ownership

A pilot study was carried out as part of the collaboration between Statistics Netherlands and the National Statistics Office of Georgia (GeoStat). The study aims to improve the quality of the Integrated Household Survey (IHS) of GeoStat. One of the issues is that the household income obtained from the IHS is not reliable and not representative of the population. GeoStat does not have any administrative or other source with information on household income. In order to obtain information related to household income we developed an asset ownership questionnaire. The questionnaire was applied to a small group of households. Using a linear regression model based on this questionnaire we estimated the household income for the IHS data frame in the Tbilisi (Georgia) area. In this paper we recommend that GeoStat use these estimates for stratification of the IHS according to household income. The stratification can also be used for improving the weighting of the IHS and making the income distribution of households more representative. The pilot study was applied only to households in the Tbilisi area.

The impact of contact effort and interviewer performance on mode-specific measurement

In 2011, a large-scale mixed-mode experiment was linked to the Crime Victimisation Survey (CVS). This experiment consisted of a randomized allocation of sample persons to the four survey modes Web, mail, telephone and face-to-face, and a follow-up using only the interviewer modes face-to-face and telephone. The aim of the experiment was to disentangle mode-specific selection and measurement effects. The analyses show that contact effort has little impact on the size of measurement bias and a modest impact on the size of selection bias. Also, interviewer performance plays just a small role in the size of both biases. From these results, we conclude that contact effort and interviewer performance do not have a simultaneous impact on nonresponse and measurement error.

Does balancing of survey response reduce nonresponse bias?

Recently, representativeness indicators, or R-indicators, have been proposed as indirect measures of nonresponse error in surveys. The indicators employ available auxiliary variables in order to detect nonrepresentative response. They may be used as quality objective functions in the design of survey data collection. Such designs are called adaptive survey designs as different subgroups receive different treatments. The obvious question is whether the decrease in nonresponse bias caused by adaptive survey designs could also be achieved by nonresponse adjustment methods that employ the same auxiliary variables. In this paper, we discuss this important question. We provide theoretical and empirical considerations on the role of both the survey design and nonresponse adjustment methods to make response representative. The empirical considerations are supported by a wide range of household and business surveys from Statistics Netherlands.

Sampling and estimation techniques for household panels

A problem with using households as sampling units in the sample design of panels is the instability of these sampling units over time. Changes in the household composition affect the inclusion probabilities required for design-based and model-assisted inference procedures. The required information to derive correct inclusion probabilities is often not available. This problem can be circumvented by sampling persons who are followed over time. At each period the household members of these sampled persons are included in the sample. This comes down to sampling with probabilities proportional to household size where households can be selected more than once but with a maximum equal to the number of household members. In this paper properties of this sample design are described and applied to the Dutch Regional Income Survey.

Stromen in de sociale zekerheid

Custom-made tables on flows in social security of persons with a Wajong or social assistance (bijstand) benefit or a WSW indication. For this target group, the use of benefits under the WW, ZW or WIA/WAO after resuming work has been mapped. Commissioned by: Ministry of Social Affairs and Employment.

An Introduction to data cleaning with R

Data cleaning, or data preparation, is an essential part of statistical analysis. In fact, in practice it is often more time-consuming than the statistical analysis itself. These lecture notes describe a range of techniques, implemented in the R statistical environment, that allow the reader to build data cleaning scripts for data suffering from a wide range of errors and inconsistencies, in textual format. These notes cover technical as well as subject-matter related aspects of data cleaning. Technical aspects include data reading, type conversion and string matching and manipulation. Subject-matter related aspects include topics like data checking, error localization and an introduction to imputation methods in R. References to relevant literature and R packages are provided throughout.

Human capital in the Netherlands

The report presents experimental estimations of the level and growth of human capital in the period 1999-2009, and discusses the plausibility of these estimates based on the lifetime income approach. The figures presented in this report are not official statistics of Statistics Netherlands, and not published as such. The estimates are still under investigation and have a highly explorative character, and should therefore be treated with caution when drawing conclusions based on the current research results. The results just give an indication of the sources of growth of human capital, investment in human capital, and how the estimate of human capital in the Netherlands compares to other types of capital and GDP, and to other countries.

Reconciling theory and practice in environmental accounting

This PhD thesis examines how theory and practice in the field of environmental accounts can be better connected. To this end, statistical research was carried out in a number of areas, including valuing the depletion of natural resources, determining national wealth, and the new field of ecosystem accounts.

Adaptive survey designs to minimize survey mode effects. A case study on the Dutch Labour Force Survey

Assessing the impact of mode effects on survey estimates has become a crucial question due to the increasing appeal of mixed-mode designs. Despite the advantages of a mixed-mode design such as lower costs and increased coverage, there is sufficient evidence that mode effects may sometimes be large relative to the precision. They may lead to incomparable statistics in time or over population subgroups and they may increase bias. Adaptive survey designs offer a flexible mathematical framework to obtain the optimal balance between survey quality and costs. In this paper we employ adaptive designs in order to minimize mode effects. We illustrate our optimization model by means of a case study on the Dutch Labour Force Survey.

Model-based estimation of discontinuities for small domains in the Dutch Crime Victimization Survey

During redesigns of repeated surveys, the old and the new approach are often conducted in parallel to quantify discontinuities which are initiated by the modifications in the survey process. Due to budget limitations, the sample size allocated to the alternative approach is often considerably smaller, compared to the regular survey that is also used for official publication purposes. In this paper, small area estimation techniques are considered to improve the accuracy of domain estimates obtained under the alternative approach. Besides auxiliary information available from registrations, direct domain estimates available from the regular survey are useful auxiliary variables to construct model-based small area estimators. These methods are applied to a redesign of the Dutch Crime Victimization Survey.

Pilot maatwerk voor brancheverenigingen

Report on the characteristics of employees working in the FEDA sector (federatie Aandrijven en Automatiseren). Commissioned by: FME-CWM.

Improving survey fieldwork with paradata

This PhD thesis shows how paradata (process data) and other auxiliary variables can be used to improve survey fieldwork. Examples are: response, and how to accomplish a more representative response by means of adaptive survey design; advance letters, and the different impact they can have on sample persons; call scheduling, the study of timing and spacing of visits. The thesis pays ample attention to the role of interviewers, understanding their behaviour, and the impact of interviewer behaviour on the quality of survey results.

Deugdelijke peilingen

This report describes in simple terms how a sound poll (survey) should be designed and conducted.

Wie wordt bereikt via het Europees Sociaal Fonds? Kenmerken van deelnemers in Nederland 2009-2012

Report and tables on the characteristics of participants (in 2009 and in 2012) in projects set up under the European Social Fund (ESF) and on the durability of the employment relationship after participation in ESF projects in the period 2008-2012. Commissioned by: the Agency of the Ministry of Social Affairs and Employment (SZW).

Web panels for official statistics

New developments in computer technology, but also new challenges in society like increasing nonresponse rates, decreasing budgets, or demands for reducing the response burden, may lead to changes in survey methodology for official statistics. In this paper, the use of a Web panel for high quality statistics about the general population is explored. Such a panel needs to be probability based. A Web panel can be used for either longitudinal or cross-sectional studies. Depending on the type of panel, a number of choices for the implementation need to be made. These involve decisions on survey topics and questionnaires, the recruitment strategy, maintenance of the panel and how to deal with the various types of nonresponse. In this paper, these methodological issues are discussed in more detail.

Automated and manual data editing: a view on process design and methodology

Data editing is arguably one of the most resource-intensive processes at NSIs. Forced by ever increasing budget pressure, NSIs keep searching for more efficient forms of data editing. Efficiency gains can be obtained by selective editing, that is, limiting the manual editing to influential errors, and by automating the editing process as much as possible. In this paper we present a decomposition of the overall editing process into a number of different tasks and give an up-to-date overview of all the possibilities of automatic editing in terms of these tasks.

Quality reporting and sufficient quality

Modernization of official statistics leads not only to other production processes and other products but also to a different approach to quality.

Hidden workers and the hidden worker potential in the Netherlands

From 2006 to 2010 Statistics Netherlands conducted surveys on the size and structure of the hidden labour market. Five to ten percent of the respondents admitted that they did not report all of their income to the tax or social security authorities.

Measurement error calibration in mixed-mode sample surveys

Mixed-mode surveys are known to be susceptible to mode-dependent selection and measurement effects, collectively referred to as mode effects. The use of different data collection modes within the same survey may reduce selectivity of the overall response but is characterized by measurement errors differing across modes. Inference in sample surveys generally proceeds by correcting for selectivity – for example by applying calibration estimators – and ignoring measurement error. When a survey is conducted repeatedly, such inferences are valid only if the measurement error remains constant between surveys. In sequential mixed-mode surveys it is likely that the mode composition of the overall response differs between subsequent editions of the survey, leading to variations in the total measurement error and invalidating classical inferences. An approach to inference in these circumstances, which is based on calibrating the mode composition of the respondents towards fixed levels, is proposed. Assumptions and risks are discussed and explored in a simulation and applied to the Dutch crime victimization survey.

Mobility and knowledge sharing

The paper describes how Statistics Netherlands has developed a Knowledge and Innovation programme. The Knowledge part of the programme has three goals: (i) to preserve knowledge with regard to the expected retirement wave, (ii) to develop and share knowledge in order to be prepared for the future, and (iii) to provide adequate tooling for knowledge sharing. Mobility of employees is an important vehicle for knowledge sharing. Besides job rotation there are other instruments that can be used to create more flexibility in the organisation and stimulate knowledge sharing. The paper presents the first experiences of Statistics Netherlands with instruments like working in flexible, multidisciplinary teams and internal network communities in order to stimulate the sharing and development of knowledge and to make use of best practices.

Innovation at Statistics Netherlands

It is the challenge of statistical offices to use their knowledge and innovation power to the optimal extent in order to remain able to respond pro-actively and in a creative way to these developments. This paper describes how Statistics Netherlands has developed an Innovation programme. A three-stage funnel approach plays a key role in this programme. This approach gives maximum room for bottom-up development of ideas, while it focuses at the same time on maximum contribution to the goals of the organization. The Innovation Lab is an important instrument for the Innovation programme. It offers a suitable environment to support the generation of ideas and test their feasibility.

Estimating the validity of administrative and survey variables through structural equation modeling: a simulation study on robustness

Over the past years, the use of administrative data in both official statistics and academic research has grown. This development has made the problem of assessing the quality of administrative sources for statistical use increasingly important. Two of the main aspects of this are validity and reliability of measurement. Although this problem is often mentioned in qualitative terms, so far, not much research has been done on methods that assess the validity or reliability of administrative variables in a quantitative way. The objective of this paper is to describe a quantitative method for estimating validity and to present results obtained with this method in a simulation study.

Assessing the mode dependency of sample selectivity across the survey response process

Survey researchers currently act on three beliefs when combining survey modes in mixed-mode designs. First, modes elicit distinct patterns of survey response behaviour and nonresponse bias. Second, these selection differences are caused by differences in the response process of differential coverage, contact and cooperation. Third, mode-dependent response patterns might be exploitable by sequential mixed-mode designs ideally yielding samples less biased by selection and hence more ‘representative’. These assumptions are assessed using a factorial design, in which the Dutch Crime Victimisation Survey was administered either in CAPI, CATI, mail, or web.

Linking ICT related Innovation Adoption and Productivity; results from micro-aggregated versus firm-level data
Revisiting the Porter Hypothesis: an empirical analysis of green innovation for the Netherlands

Using a comprehensive unbalanced panel of firm-level data constructed from four surveys, we test within a structural model what is known in the literature as the “weak” and the “strong” version of the Porter hypothesis (PH). Our “Green Innovation” model includes three types of eco investments and non-eco R&D to explain differences in the incidence of innovation. We aim to estimate the relative importance of energy price incentives as a market-based type of environmental regulation (ER) and the direct effect of environmental regulation on eco investment and firms’ decisions regarding the introduction of several types of innovations. We explicitly model the potential synergies of introducing the three types of innovations simultaneously and their synergy in affecting total factor productivity (TFP) performance. The results of our analysis show a strong corroboration of the weak version of the Porter hypothesis but not of the strong version, in this case on TFP performance.

Automatic editing with hard and soft edits

Soft edit rules, i.e. constraints which identify (combinations of) values that are suspicious but not necessarily incorrect, are an important element of many traditional, manual editing processes. It is desirable to use the information contained in these edit rules also in automatic editing. However, current algorithms for automatic editing are not well suited to use soft edits because they treat all edit rules as hard constraints: each edit failure is attributed to an error. Recently at Statistics Netherlands, a new automatic editing method has been developed that can distinguish between hard and soft edits. A prototype implementation of the new algorithm has been written in the R programming language. This paper reports some results of an application of this prototype to data from the Dutch Structural Business Statistics. The paper also introduces and tests several size measures of soft edit failures that can be used with the new automatic editing method.

Improvement of waterflows in the National Water Balance

Improvement of waterflows in the National Water Balance; Water Stocks; feasibility of Water Balances per River Basin. Final report on Grant Agreement No. 50303.2010.001-2010.564

Hot deck imputation of numerical data under edit restrictions

A common problem faced by statistical institutes is that data may be missing from collected data sets. The typical way to overcome this problem is to impute the missing data. The problem of imputing missing data is complicated by the fact that statistical data collected by statistical institutes often have to satisfy certain edit rules, which for numerical data usually take the form of linear restrictions. Standard imputation methods for numerical data as described in the literature generally do not take such linear edit restrictions on the data into account. Hot-deck imputation techniques form a well-known and relatively simple-to-apply class of imputation methods. In this paper we extend this class of imputation methods so that linear edit restrictions are satisfied.

Early school-leaving in the Netherlands. A multidisciplinary study of risk and protective factors explaining early school-leaving

Signs of an increased risk of early school-leaving are already visible in the first year of secondary school. That is one of the conclusions of the PhD thesis with which T. Traag, researcher at Statistics Netherlands, obtained her doctorate at Maastricht University.

At work with reintegration support, fourth measurement

Report and tables on how often people who receive reintegration support find a job. Commissioned by: Ministry of Social Affairs and Employment (SZW)

Report on monitoring child maintenance

Report on the collection of child maintenance payments. Based on data from the Landelijk Bureau Inning Onderhoudsbijdragen (LBIO). Commissioned by: Ministry of Social Affairs and Employment (SZW)

Report on the collection of spousal maintenance

Report on the collection of spousal maintenance payments. Based on data from the Landelijk Bureau Inning Onderhoudsbijdragen (LBIO). Commissioned by: Ministry of Social Affairs and Employment (SZW)

Adaptive survey designs that minimize nonresponse and measurement risk

Recently, survey literature has put forward responsive and adaptive survey designs as means to make efficient tradeoffs between survey quality and survey costs. The designs, however, restrict quality-cost assessments to nonresponse error, while in mixed-mode surveys the measurement or response error plays a dominant role. Furthermore, there is both theoretical and empirical evidence that the two types of error are correlated. In this paper, we investigate adaptive survey designs that minimize both errors simultaneously in the Labour Force Survey. The design features that are selected are self-reporting versus proxy reporting, and the number of contact attempts.

49 Factors that Influence the Quality of Secondary Data Sources

Data quality is important for statistical institutes, not only in relation to the data they produce, but also because they use more and more secondary data sources to produce statistics. Examples of secondary data sources, data produced by others, are administrative data, transactional data and data from the Internet.

Equal pay for equal work? Jobs and wages in government and the private sector, 2010

This report gives a picture of jobs, wages and pay differences in government and the private sector in 2010. Commissioned by: Ministry of Social Affairs and Employment.

Millionaires in figures

This report describes the characteristics of millionaires in the Netherlands in the period 2006-2011. Commissioned by: Van Lanschot Bankiers.

Outcome monitor for the neighbourhood approach (Wijkenaanpak): the second progress report

This publication describes the developments in the 40 priority neighbourhoods (aandachtswijken) compared with the 18 municipalities of which they are part. Commissioned by: Ministry of the Interior and Kingdom Relations (BZK)

Twitter as a potential data source for statistics

An increasing number of people are active on various social media platforms. Here, people voluntarily share information, discuss topics of interest, and contact family and friends. Because the social media platform Twitter is used by a large number of people in the Netherlands and the public messages can be collected relatively easily, we investigated the content and usability of Twitter messages for statistics. This revealed that a considerable share of the messages collected, around 50%, could potentially be used to provide information on work, politics, spare time activities and events.

Trust and Dissemination in Official Statistics

We discuss the relation between trust and statistical dissemination, drawing on examples from the Netherlands and comparing with other European countries. Dutch citizens have a fair amount of confidence in official statistics, even in the recent period of political and economic upheaval. The most important reason for this seems to be the political culture in the Netherlands, which puts a strong emphasis on rational policy making based on evaluations from scientific councils, committees and official research bureaus. We discuss how this came to be, how this influences the trust in the national statistical institute, and the consequences this has for the dissemination of statistical data.

Top Sectors Monitor (first measurement)

Report with results, method description and set of tables from the first measurement of the Top Sectors Monitor. Commissioned by: Ministry of Economic Affairs, Agriculture and Innovation (EL&I)

Shifting paradigms in official statistics: from design-based to model-based to algorithmic inference

Inference in official statistics is traditionally motivated from a design-based perspective, with the model-based approach being gradually adopted in specific circumstances. We take this shifting paradigm one step further, from model-based to algorithmic inference methods. Surveying a sample of the population of interest – typically enterprises or households – is fundamental to the design-based approach, where the design is the basis for inference. Model-based estimation methods may provide a viable alternative in situations where design information is not available. Estimation of the model parameters is pivotal, although in official statistics it is only an intermediate goal, as the model is ultimately used for prediction. Therefore, adopting a data-centred, algorithmic view rather than a model-centred view is possible. The algorithmic view encompasses methods generally attributed to the fields of data mining, machine learning, or statistical learning. Algorithmic methods may be useful in situations where data are not obtained through a sample survey, and where the typical models used in model-based estimation are not tenable.

Optimal adjustments for inconsistency in imputed data

Conflicting information may arise in statistical micro data due to partial imputation, where one part of the imputed record consists of the observed values of the original record and the other of the imputed values. Edit rules that involve variables from both parts of the record will often be violated. One strategy to remedy this problem is to make adjustments to the imputations such that all constraints are simultaneously satisfied and the adjustments are, in some sense, as small as possible. The minimal adjustments are obtained by minimizing a chosen distance metric subject to the constraints and we show how different choices of the distance metric result in different adjustments to the imputed data. As an extension we also consider an approach that does not aim to minimize the adjustments but to make the adjustments as uniform as possible between variables. Under this approach, even the values that are not explicitly involved in any constraints can be adjusted. The properties and interpretations of the proposed methods are illustrated using empirical business-economic data.
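As a rough illustration of the minimal-adjustment idea, the sketch below handles only the simplest case: one linear equality constraint and an unweighted Euclidean (least-squares) distance. The function name, the example figures and the choice of metric are assumptions made for illustration; the paper considers general sets of constraints and several distance metrics.

```python
def adjust_to_constraint(values, coeffs, total):
    """Return the smallest least-squares change to `values` such that
    sum(c * v for c, v in zip(coeffs, values)) equals `total`.

    Hypothetical helper: one equality constraint, unweighted Euclidean distance.
    """
    norm_sq = sum(c * c for c in coeffs)
    if norm_sq == 0:
        raise ValueError("constraint has no non-zero coefficients")
    # How far the current values are from satisfying the constraint.
    gap = sum(c * v for c, v in zip(coeffs, values)) - total
    # Orthogonal projection onto the constraint hyperplane.
    return [v - c * gap / norm_sq for c, v in zip(coeffs, values)]


# Illustrative figures: three imputed components that must add up to 100.
imputed = [40.0, 35.0, 30.0]
adjusted = adjust_to_constraint(imputed, [1.0, 1.0, 1.0], 100.0)
print(adjusted, sum(adjusted))
```

In this sketch a variable with a zero coefficient is left unchanged, which matches the remark that only the alternative, "uniform" approach adjusts values not involved in any constraint.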

The contribution of sport to the Dutch economy

Report and tables of the sport satellite account. Commissioned by: Ministry of Health, Welfare and Sport

Methodological manual of the sport satellite account (in English)

Methodological manual of the sport satellite account. Commissioned by: Ministry of Health, Welfare and Sport

Concurrent use of provisions and income 2009 (follow-up study)

Follow-up study of people who used several provisions at the same time in the domains of care and income in 2009. Figures on the WMO (Wet Maatschappelijke Ondersteuning, the Social Support Act) have been added in this study. Commissioned by: Ministry of SZW.

Assessing polls

Increasingly often, polls ask for the opinion of ‘the Dutch’ on all kinds of subjects. The question, however, is whether all those polls give a good picture of reality. There are good and bad studies among them, and it is not easy to separate the wheat from the chaff. By working through the questions in this checklist one by one, you can get a first impression of the quality of a study. Why the answer to each question matters is explained at the end of the checklist.

At work with reintegration support, fourth measurement (final figures)

Tables on starting a job from a benefit or from a period as a person not entitled to benefits, with and without reintegration support. Commissioned by: Ministry of Social Affairs and Employment (SZW)

On aligned composite estimates from overlapping samples for growth rates and totals

When monthly business surveys are not completely overlapping, there are two different estimators for the monthly growth rate of the turnover: (i) one that is based on the monthly estimated population totals and (ii) one that is purely based on enterprises observed on both occasions in the overlap of the corresponding surveys. The resulting estimates and variances might be quite different. This paper proposes an optimal composite estimator for the growth rate as well as the population totals.

Manipulation of conditional restrictions and error localization with the editrules package

The quality of statistical statements strongly depends on the quality of the underlying data. Since raw data is often inconsistent or incomplete, data editing may consume a substantial amount of the resources available for statistical analyses. Although R has many features for analyzing data, the functionality for data checking and error localization based on logical restrictions (edit rules, or edits) is currently limited. The editrules package is designed to offer a user-friendly toolbox for edit definition, edit manipulation, data checking, and error localization.

Time patterns, geospatial clustering and mobility statistics based on mobile phone network data

In our research we aim to gain insight into the geospatial activity of mobile phone users. Points of interest are the correlation between calling activity and economic activity, population density based on the number of active mobile phones in an area, and movement statistics. A derived research question is how to obtain a tessellation of cell serving areas from a cell plan and how to combine different tessellations. For our research we obtained a dataset from a telecommunication company containing records of all call events on their network in the Netherlands for a time period of two weeks. Each record contains information about the time and serving antenna and an identification key of the phone. The dataset is large (containing over 600 million records) and the cell plan has over 20,000 geo-locations of antennas. We devised a method that uses the Voronoi algorithm to transform this cell plan into a tessellation suitable for geospatial analysis. One result of our research is a geospatial animation from which it is clearly visible that high call intensity coincides with high population density. In addition, using k-means clustering we found useful patterns in the time series of the call activity, providing insight into economic activity over time and space. Using the unique phone identification we obtained information on the movement of Dutch inhabitants.

Disentangling mode-specific selection and measurement bias in social surveys

In 2011, Statistics Netherlands conducted a large-scale mixed-mode experiment linked to the Crime Victimization Survey. The experiment consisted of two waves; one wave with random assignment to one of the modes web, paper, telephone and face-to-face, and one follow-up wave to the full sample with interviewer modes only. The objective of the experiment is to estimate total mode effects and the corresponding mode effect components arising from undercoverage, nonresponse and measurement. The estimated mode effects are used to improve methodology for mixed-mode surveys. In this paper, we define mode-specific selection and measurement bias, and we introduce and discuss estimators for these bias terms based on the experimental design. Furthermore, we investigate whether mode effect estimators based on the first wave only, reproduce the estimates from the full experimental design. The proposed estimators are applied to a number of key survey variables from the Labour Force Survey and the Crime Victimization Survey.

Standard for statistical processes 2012

The objective of this document is to describe a Standard for objects that are relevant for statistical processes and products. It is a Standard that can be applied to individual statistical processes.

Using response probabilities for assessing representativity

Nonresponse in surveys may affect representativity, and therefore lead to biased estimates. A first step in exploring a possible lack of representativity is to estimate response probabilities. This paper proposes using the coefficient of variation of the response probabilities as an indicator for the lack of representativity. The usual approach for estimating response probabilities is by fitting a logit model. A drawback of this model is that it requires the values of the explanatory variables of the model to be known for all nonrespondents. This paper shows this condition can be relaxed by computing response probabilities from weights that have been obtained from some weighting adjustment technique.
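The indicator mentioned above can be sketched as the standard deviation of the estimated response probabilities divided by their mean. The sketch below is only an illustration: the function name and the example probabilities are hypothetical, and the use of an unweighted population standard deviation is an assumption, since the abstract does not specify the exact estimator.

```python
from statistics import mean, pstdev


def response_cv(probabilities):
    """Coefficient of variation of estimated response probabilities.

    Hypothetical helper: unweighted, population standard deviation over the mean.
    """
    if not probabilities:
        raise ValueError("need at least one response probability")
    avg = mean(probabilities)
    if avg <= 0:
        raise ValueError("mean response probability must be positive")
    return pstdev(probabilities) / avg


# Illustrative values only: equal probabilities give no variation at all,
# unequal probabilities give a positive coefficient of variation.
equal = [0.5, 0.5, 0.5, 0.5]
unequal = [0.2, 0.4, 0.6, 0.8]
print(response_cv(equal), response_cv(unequal))
```

A value of zero means every sample unit has the same estimated response probability; larger values indicate stronger variation in response probabilities and hence a greater risk that the response is not representative.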

Semantic networks for automatic coding

In this paper a (fairly) general methodology for automatic coding is suggested. Characteristic for the approach described in this paper is that each code (say for a business activity) is characterized by one or more combinations of code words, or C-words. These combinations of C-words can be seen as definitions of the various codes used. It is assumed that the order of the C-words is irrelevant to describe a code. Because it is unlikely that people will use exactly those C-words when describing a business activity, synonyms, hyponyms and hyperonyms for the C-words are needed as well. They form the bridge between the definition of the codes in the classification used and the descriptions provided by respondents, and are called D-words. A semantic network is used to provide a bridge between the descriptions and the codes. Some inherent difficulties with automatic coding are presented, as well as possible solutions to overcome them, either by solving or by sidestepping them.

Who is reached through the European Social Fund? A closer look at the characteristics of participants 2007-2011

Report and tables on the characteristics of participants (2008 and 2011) in projects set up under the European Social Fund (ESF) and on the durability of the employment relationship after participation in ESF projects in the period 2008-2011. Commissioned by: the agency (Agentschap) of the Ministry of Social Affairs and Employment (SZW)

The satisfaction with life scale examining construct validity

This report describes the validity testing of a multi-item scale of global life satisfaction, namely the Satisfaction With Life Scale (SWLS). This scale has been proposed as an alternative to single-item life satisfaction measures.

Handling incompleteness after linkage to a population frame: incoherence in unit types, variables and periods

This paper concentrates on methods for handling incompleteness caused by differences in units, variables and periods of the observed data set compared to the target one. Especially in economic statistics different unit types are used in different data sets.

Income and wealth in old age: former employees and self-employed persons compared

After the age of 65, self-employed persons receive on average less pension than employees, but they have more wealth. What does this mean for their financial situation? Commissioned by: Ministry of Social Affairs and Employment (SZW)

Movements in the Dutch labour market, 1999-2008

This report describes mobility in the labour market in the period 1999-2008, distinguishing between 28 sectors. Commissioned by: Raad voor Werk en Inkomen (Council for Work and Income)

Editing of errors associated with linking economic data sets to a population frame

In this paper we concentrate on methodological developments to improve the accuracy of a data set after linking economic survey and register data to a population frame. Because in economic data different unit types are used, errors may occur in relations between data unit types and statistical unit types. A population frame contains all units and their relations for a specific period. There may also be errors in the linkage of data sets to the population frame. When variables are added to a statistical unit by linking it to a data source, the effect of an incorrect linkage or relation is that the additional variables are combined with the wrong statistical unit. In the present paper we formulate a strategy for detecting and correcting errors in the linkage and relations between units of integrated data. The detection and correction of potential errors is illustrated for a Dutch case study.

Income and capital of former entrepreneurs, revised version (Dutch only)

Report and tables on income and capital of former self-employed persons, with a few small corrections with regard to December 2011. Commissioned by: Ministry of Social Affairs and Employment (SZW)

The Export Scan

This paper by Daan Zult and Floris van Ruth describes a new type of analytical tool, developed for detecting early signals of changes in the development of exports. It is novel in several ways: it focuses on exports using a demand-pull approach, is based on a structural analysis of the demand for Dutch exports, and takes the form of a disaggregated visual tool. Sixteen selected foreign demand sectors are monitored using leading signaling indicators. The aggregate is a leading indicator of the growth rate of Dutch exports. The disaggregated set-up results in increased early warning capabilities, as changes in development in individual industries are immediately visible.

Analysing the labour market; a tension gauge

In this paper by Floris van Ruth, a graphical tool is presented for analysing the labour market. The state of the labour market, defined as tightness, is characterised via the interaction of supply of and demand for labour, enabling a more comprehensive analysis. Supply is defined as the proportion of the labour force holding a job, whilst demand is represented by the average of several indicators of labour demand. This approach results in a new and more general characterization of the state of the labour market, presented in an easy to interpret visual manner.

The contribution of illegal activities to national income in the Netherlands

Recently Statistics Netherlands has started research into the share of illegal activities in the national income. The total contribution of illegal activities to the national income of the Netherlands increased from 1800 million euro in 1995 to almost 3500 million euro in 2008, equalling 0.6 percent of gross national income. The main illegal sector is drugs, which accounted for over 50 percent of the total income from illegal activities in 2001. In 2008 that share was down to less than 40 percent, whereas finding illegal employment rose from about 10 percent in 1995 to 33 percent in 2008.

Young people 2010

This publication gives a picture of young people (15 to 27 years) in the Netherlands in 2010. The topics covered include education, work and benefits. Commissioned by: Ministry of Social Affairs and Employment.

At work with reintegration support, fourth measurement (provisional figures)

Tables on starting a job from a benefit or from a period as a person not entitled to benefits, with and without reintegration support. Commissioned by: Ministry of Social Affairs and Employment (SZW)

Choosing a language and tool for the Business and Information Architecture of Statistics Netherlands

This is a report for choosing a language and tool for the business and information architecture (BI-architecture) of Statistics Netherlands. This is a first step to improve the current way of BI-architecture development and maintenance. In the first phase of the project the business architects indicated the need to change the current language and tool for others. In the second phase of the project the business architects together with the IT Enterprise architects advised that Archimate and BizzDesign Architect are the best language and tool combination to support the new way of working.

Small Area Estimation Of Turnover Of The Structural Business Survey

Since design-based methods like the generalized regression estimator have large design variances in the case of small sample sizes, model-based techniques can be considered as an alternative. In this paper a simulation study is carried out where small area estimation based on a linear mixed model is applied to the variable turnover of the Structural Business Survey of Statistics Netherlands. By applying the EBLUP, the accuracy of the estimates can be substantially improved compared to the generalized regression estimator. The EBLUP estimates, however, are biased, which is partly caused by the skewed distribution of the variable turnover. It is found that by transforming the target variable both skewness and bias can be substantially reduced, whereas the variance increases. As a result, the accuracy is slightly improved compared to the EBLUP.

Macro-integration techniques with applications to census tables and labour market statistics

Macro-integration is widely used for the reconciliation of macro figures, usually in the form of large multi-dimensional tabulations, obtained from different sources. Traditionally these techniques have been extensively applied in the area of macro-economics, especially in the compilation of the National Accounts. Methods for macro-integration have developed over the years and have become very versatile techniques for solving integration of data from different sources at a macro level. In this paper we propose applications of macro-integration techniques in other domains than the traditional macro-economic applications. In particular, we present two possible applications for macro-integration methods: reconciliation of tables of a virtual census and combining estimates of labour market variables.

White spots in pension coverage 2010

Figures on the ‘white spot’ (witte vlek) in pension coverage in 2010. Commissioned by: Ministry of SZW

Chances of returning to work after entering unemployment benefit (WW)

A description of the factors that influence the chance of returning to work for people who have started receiving WW unemployment benefit. Commissioned by: Ministry of Social Affairs and Employment (SZW)

The editing of statistical data: methods and techniques for the efficient detection and correction of errors and missing values

In order to produce official statistics of sufficient quality, statistical institutes carry out an extensive process of checking and correcting the data that they collect. This process is called statistical data editing. In this article, we give a brief overview of current data editing methodology. In particular, we discuss the application of selective and automatic editing procedures to improve the efficiency and timeliness of the data editing process.

Monitor of diversity in youth policy, follow-up measurement

Tables giving a picture of the living situation and use of services of young people of foreign and of native Dutch background in areas such as family, school, leisure and the labour market, and of the diversity of staff in the youth sector. Commissioned by: ZonMw.

Automatic editing with soft edits

The object of this paper is to present a new formulation of the error localisation problem which can distinguish between hard and soft edits.

Optimal scheduling of contact attempts in mixed-mode surveys

Survey nonresponse occurs when members of a sample cannot or will not participate in the survey. It remains a problem despite the development of statistical methods that aim to reduce nonresponse. In this paper, we address the problem of resource allocation in survey designs in which the focus is on the quality of the survey results given that there will be nonresponse. We propose a novel method by which the optimal allocation of survey resources can be determined. We demonstrate the effectiveness of our method by extensive numerical experiments.

Rising unemployment, stagnating participation

This publication gives a picture of the labour market participation of 20 to 64-year-olds in the Netherlands in 2010. Commissioned by: Ministry of Social Affairs and Employment.

On second-order inclusion probabilities and variances among large randomized PPS samples

This paper presents and discusses some new results on the second-order inclusion probabilities of a systematic probability proportional to size sample drawn from a randomly ordered list, also called randomized PPS sampling. It is shown that some standard approximations of these second-order inclusion probabilities meant for relatively small sample sizes, need not be valid when the sample size n is of the same order as the population size N. In addition, it is shown that under a number of assumptions the variance formulas for rejective Poisson sampling can be applied to randomized PPS sampling designs when both n and N-n are large.

On course, but not there yet: Enterprise architecture conformance and benefits in systems development

Various claims have been made regarding the benefits that Enterprise Architecture (EA) delivers for both individual systems development projects and the organization as a whole. This paper presents the statistical findings of a survey study (n=293) carried out to empirically test these claims. First, we investigated which techniques are used in practice to stimulate conformance to EA. Secondly, we studied which benefits are actually gained. Thirdly, we verified whether EA creators (e.g. enterprise architects) and EA users (e.g. project members) differ in their perceptions regarding EA. Finally, we investigated which of the applied techniques most effectively increase project conformance to and effectiveness of EA. A multivariate regression analysis demonstrates that three techniques have a major impact on conformance: carrying out compliance assessments, management propagation of EA and providing assistance to projects. Although project conformance plays a central role in reaping various benefits at both the organizational and the project level, it is shown that a number of important benefits have not yet been fully achieved.

Simple derivations of variance formulas in two-stage simple random sampling

This paper gives alternative derivations for the standard variance formulas in two-stage sampling. The derivations are based on a direct use of the statistical properties of the sampling errors in the second stage. For the ease of exposition we examine the specific case that simple random sampling is used in both stages. These derivations might be useful for readers looking for more elementary approaches to two-stage sampling.
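
As a rough illustration of the kind of formula concerned, the sketch below evaluates the standard textbook estimator of a population total and its estimated variance when simple random sampling without replacement is used in both stages. It is not taken from the paper; the function name, argument layout and example numbers are assumptions made for illustration only.

```python
from statistics import variance


def two_stage_total_and_variance(N, clusters):
    """Estimate a population total and its variance under two-stage SRS.

    N        -- number of primary units in the population (assumed known)
    clusters -- one (M_i, sample_values) pair per sampled primary unit,
                where M_i is the size of primary unit i
    """
    n = len(clusters)
    # Estimated total of each sampled primary unit: size times sample mean.
    t_hat = [M * sum(y) / len(y) for M, y in clusters]
    total = N / n * sum(t_hat)
    # First-stage component: variation between estimated unit totals.
    between = N ** 2 * (1 - n / N) * variance(t_hat) / n
    # Second-stage component: sampling error within each sampled unit.
    within = N / n * sum(
        M ** 2 * (1 - len(y) / M) * variance(y) / len(y) for M, y in clusters
    )
    return total, between + within


# Hypothetical data: 2 of 4 primary units sampled, 3 elements from each.
total, var = two_stage_total_and_variance(4, [(10, [1, 2, 3]), (20, [2, 4, 6])])
```

The two terms correspond to the sampling errors of the first and second stage respectively, which is the decomposition the derivations in the paper rely on.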

Deductive imputation with the deducorrect package

Numerical and categorical data used for statistical analyses is often plagued with missing values and inconsistencies. In many cases, a number of missing values may be derived, based on the consistency rules imposed on the data and the observed values in a record. The methods used for such derivations are called deductive imputation. In this paper, we describe the newly developed deductive imputation functionality of R package deducorrect.
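
The idea of deductive imputation can be sketched in a few lines. The example below is a minimal Python illustration, not the interface of the R package deducorrect: it assumes a single linear balance rule and fills in a value only when exactly one variable in that rule is missing. The function name, the rule and the record are hypothetical.

```python
from typing import Dict, Optional


def deduce_missing(record: Dict[str, Optional[float]],
                   coeffs: Dict[str, float],
                   rhs: float) -> Dict[str, Optional[float]]:
    """Fill in the one missing value implied by sum(coeffs[v] * record[v]) == rhs."""
    missing = [v for v in coeffs if record.get(v) is None]
    # A value can only be deduced when exactly one variable is unknown.
    if len(missing) != 1 or coeffs[missing[0]] == 0:
        return dict(record)
    target = missing[0]
    known = sum(c * record[v] for v, c in coeffs.items() if v != target)
    filled = dict(record)
    filled[target] = (rhs - known) / coeffs[target]
    return filled


# Hypothetical balance rule: turnover - costs - profit == 0.
balance = {"turnover": 1.0, "costs": -1.0, "profit": -1.0}
record = {"turnover": 120.0, "costs": None, "profit": 30.0}
completed = deduce_missing(record, balance, rhs=0.0)
```

With two or more missing values in the same rule, the sketch leaves the record unchanged, because the rule no longer determines a unique value.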

Manipulation of categorical data edits and error localization with the editrules package

Analyses of categorical data are often hindered by the occurrence of inconsistent or incomplete raw data. Although R has many features for analyzing categorical data, the functionality for error localization and error correction is currently limited. The editrules package is designed to offer a user-friendly toolbox for edit definition, manipulation, and error localization based on the generalized paradigm of Fellegi and Holt.

Young people with a temporary contract in 2009 and 2010

This report follows young people with a temporary contract to see whether they stay in work or lose their job. Commissioned by: Ministry of Social Affairs and Employment (SZW).

Artists in a broader perspective. Artists, arts education and the labour market

This study describes various aspects of the labour market position of artists, persons working in other creative occupations, and several cohorts of graduates of creative hbo (higher professional education) and mbo (senior secondary vocational education) programmes. Commissioned by: Ministry of Education, Culture and Science (OCW).

Into work with reintegration support, third measurement (final figures)

Report and tables on starting a job from a benefit or from a period as a person not entitled to benefits, with and without reintegration support. Commissioned by: Ministry of Social Affairs and Employment (SZW).

Transsexuals in the Netherlands

This report describes the size and the characteristics of the group of transsexuals in the Netherlands. Commissioned by: Ministry of Education, Culture and Science (OCW).

Departed, destination unknown

This report describes demographic and socio-economic characteristics of persons who are recorded in the municipal personal records database (Gemeentelijke Basisadministratie persoonsgegevens, GBA) under the heading 'vertrokken onbekend waarheen' (departed, destination unknown). Commissioned by: Ministry of the Interior and Kingdom Relations.

Enlargement of the European Union 2004 and 2007 - Figures on the Netherlands

Report and tables on characteristics of the potential labour force among persons born in one of the EU enlargement countries and a number of reference groups. Commissioned by: Ministry of Foreign Affairs.

Estimation of monthly unemployment figures in a rotating panel; on the use of auxiliary series in structural time series models

The Dutch Labor Force Survey (LFS) is based on a rotating panel design. Recently an estimation procedure that is based on a multivariate structural time series model has been adopted to produce monthly official statistics about the labor force. This approach handles problems with rotation group bias and small sample sizes in an effective way and enables Statistics Netherlands to produce timely and accurate estimates about the labor market. In this paper the time series model is extended by incorporating an auxiliary series about people registered as unemployed in the register of the Office for Employment and Income. The information of the auxiliary series is used to improve the precision of the monthly unemployment figures by modelling the correlation between the trends of the LFS series and the auxiliary series of the registered unemployed labor force. It appears that the trend of the series of the registered unemployed labor force is cointegrated or almost cointegrated with the trend of the estimated unemployed labor force of the LFS for several domains. This results in a considerable decrease of the standard errors for the monthly unemployed labor force.

Refugees in the Netherlands 2010 (publication 3)

Summary report on various demographic, regional and socio-economic characteristics of refugees in the Netherlands in 2010. Commissioned by: VluchtelingenWerk Nederland.

Bootstrapping Combined Estimator based on Register and Sample Survey Data

This paper describes how the bootstrap resampling method may be used to assess the accuracy of estimates based on a combination of data from registers and sample surveys. We consider three different estimators that may be applied in this context. The validity of the proposed bootstrap method is tested in a simulation study with realistic data from the Dutch Educational Attainment File.

The importance of the cumela sector for rural areas

Tables and maps on employees in the cumela sector (agricultural contracting, fertiliser distribution, land improvement works and earthmoving for infrastructure) who live in rural areas. Commissioned by: CUMELA Nederland.

Manipulation of linear edits and error localization with the editrules package

This paper is the first of two papers describing the editrules package. The current paper is concerned with the treatment of numerical data under linear constraints, while the accompanying paper (Van der Loo and De Jonge, 2011) is concerned with constrained categorical and mixed data. The editrules package is designed to offer a user-friendly interface for edit definition, manipulation and checking. The package offers functionality for error localization based on the paradigm of Fellegi and Holt and a flexible interface to binary programming based on the choice point paradigm. Lower-level functions include echelon transformation of linear systems, variable substitution and a fast Fourier-Motzkin elimination routine. We describe the theory and implementation, and give examples of package usage.

Inference in surveys with sequential mixed-mode data collection

Mixed-mode surveys are susceptible to mode-dependent selection effects and measurement errors, collectively known as mode effects. In sequential mixed-mode surveys, where non-respondents in one mode are re-approached using a different mode, it is likely that the mode composition of the response differs between subpopulations or between subsequent editions of the survey. Such variations in the mode composition lead to variations in the measurement errors, invalidating classical inference. An approach to inference in these circumstances is proposed, by calibrating the mode composition of the response to fixed levels. Assumptions and risks associated with such a procedure are discussed. The case of the Dutch Crime Survey is discussed as an example.

Diversity in Figures 2009

For the diversity index, data were calculated on the 10 percent of jobs with the highest gross annual wage, on the inflow into and outflow from these jobs, and on the total number of jobs. The data are broken down by sex, age, origin group, education, sector, province of residence and RPA region. The data were calculated earlier for 2008 and 2005. Commissioned by: Ministry of the Interior and Kingdom Relations.

Young people 2009

This publication gives a picture of young people (aged 15-26) in the Netherlands in 2009. The topics covered include education, work and benefits. Commissioned by: Ministry of Social Affairs and Employment.

Concurrent use of care and income provisions 2009

Figures on the number of people who had several provisions at the same time in the domains of care and income in 2009. Commissioned by: Ministry of Social Affairs and Employment (SZW).

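Incomes of graduates

Data on the incomes of graduates of mbo-bol (school-based senior secondary vocational education), hbo (higher professional education) and wo (university education) in the period 2007-2009. Commissioned by: Ministry of Education, Culture and Science (OCW).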

Who is reached through the European Social Fund? A closer look at participant characteristics, 2007-2010.

Tables on characteristics of participants (2007 and 2010) in projects set up under the European Social Fund (ESF), and tables on the sustainability of the employment relationship after participation in ESF projects in the period 2007-2010. Commissioned by: the Agency of the Ministry of Social Affairs and Employment.

Correction of rounding, typing, and sign errors with the deducorrect package

Since raw (survey) data usually has to be edited before statistical analysis can take place, the availability of data cleaning algorithms is important to many statisticians. In this paper the implementation of three data correction methods in R is described. The methods of this package can be used to correct numerical data under linear restrictions for typing errors, rounding errors, sign errors and value interchanges. The algorithms, based on earlier work of Scholtus, are described and implementation details with coded examples are given. Although the algorithms were originally developed with financial balance accounts in mind, they are formulated generically and can be applied in a wider range of applications.

Size and dynamics of employees in sectors in the Netherlands, 1999-2008

This publication describes the dynamics in the sectors of the labour market in the period from 1999 up to and including 2008. Commissioned by: Raad voor Werk en Inkomen (Council for Work and Income).

Interaction between the labour market and social security, 2006 and 2008

Tables on the labour market position of the potential labour force at the beginning of 2006 and the end of 2008. Commissioned by: Ministry of Social Affairs and Employment.

Attributes of quality reports

This paper shows that 19 relevant attributes of quality reports can be distinguished. These attributes are useful if we want to systematically manage the quality of quality reports and were established through analysis of documents about quality reporting and the minutes of the SQ-ESAC workshop about quality reporting. Each attribute is defined, but according to the Object-oriented Quality Management model more steps can be taken. Requirements can be formulated for each attribute and causes and effects of problems can be analyzed. Based on these requirements and risk analysis, measures can be taken to assure the quality of quality reports.

Optimizing quality of response through adaptive survey designs

In most surveys all sample units receive the same treatment and the same design features apply to all selected people and households. In this paper, it is explained how survey designs may be tailored to optimize response rates and to reduce nonresponse selectivity. Such designs are called adaptive survey designs. The basic ingredients of such designs are introduced and discussed and illustrated with a number of examples including a pilot study.

Standard for statistical processes 2011

The objective of this document is to describe a Standard for objects that are relevant for statistical processes and products. It is a Standard that can be applied to individual statistical processes.

Inventories and the business cycle

Inventories play a crucial role in explaining business cycle turning points. Inventories contributed 0.7 percentage point to a 4 percent contraction of economic activity in 2009. In light of this, demand for inventory data has been growing since the financial crisis hit in late 2008. This paper analyses Dutch wholesale and manufacturing inventories and relates them to the business cycle.

The inventory to sales ratio in manufacturing: a real leading business cycle indicator

Inventories are a useful statistic for tracking and analysing short-term economic developments. This paper by Floris van Ruth and Marcel van Velzen describes how the index of inventories of finished goods in the manufacturing industry can be used in business cycle analysis. Inventories themselves lag business cycle developments, and are therefore of limited use. Using the turnover index of the manufacturing industry to compute a ratio of inventory to sales (ISR) produces a new and leading business cycle indicator. The ISR is shown to consistently lead Dutch business cycle developments by one to two quarters. It is therefore one of the few real, i.e. non-financial and non-sentiment, leading indicators. Inventories are shown to exhibit clear co-movement with sales and the business cycle. The countercyclical development of the ISR is therefore explained by the fact that turnover reacts more strongly to business cycle developments than inventories.
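
The construction of the ratio can be illustrated with a short sketch. The abstract does not give the exact formula or scaling, so the version below simply divides the inventory index by the turnover index period by period and rescales to 100; the function name and the index values are made up for illustration.

```python
def inventory_to_sales_ratio(inventory_index, turnover_index):
    """Return the inventory-to-sales ratio (ISR) per period, scaled to 100."""
    if len(inventory_index) != len(turnover_index):
        raise ValueError("both index series must cover the same periods")
    return [100.0 * inv / sales
            for inv, sales in zip(inventory_index, turnover_index)]


# Hypothetical downturn: turnover falls faster than inventories adjust.
inventory = [100.0, 101.0, 102.0]
turnover = [100.0, 95.0, 90.0]
isr = inventory_to_sales_ratio(inventory, turnover)
```

In this made-up downturn the ratio rises while turnover falls, which is the countercyclical pattern the paper attributes to turnover reacting more strongly than inventories.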

The case of the missing tax data

In this paper, we describe the controversy that arose between Statistics Netherlands and the Ministry of Economic Affairs in 2009 after the Dutch government announced a tax relief measure for businesses, which deteriorated the quality of tax data used by Statistics Netherlands for producing short-term statistics.

Into work with reintegration support (third measurement)

Tables on starting a job from a benefit or from a period as a person not entitled to benefits, with and without reintegration support. Commissioned by: Ministry of Social Affairs and Employment (SZW).

EU Water Statistics Grant: Water abstraction and use at River basin level

Final report on Grant agreement no. 50303 2008 003 2008 352. In this discussion paper methods are presented to compile water abstraction and water use data at the level of River Basins in the Netherlands, for the years 2004-2008. In general, the methods build upon existing national data on water abstraction and drinking water use.

Information technology, organization change and firm productivity: A panel study of complementarity effects and clustering patterns

Organizational change (OC) is an important complementary factor in the process of creating business value from information technology (IT) investments. This paper investigates complementarities between IT capital and OC initiatives of the firm. It analyzes the productivity impact of different clusters of IT and OC in the manufacturing and services sectors of the economy. Three dimensions of OC are studied: process, structure, and boundary changes. Two distinct econometric approaches are applied to a unique and detailed sample of 32,619 firm-level observations in the Netherlands for the period 1994-2006. The results reveal that the productivity effect of IT significantly increases when technology investments are accompanied by relevant organizational changes. The observed complementarity effects between IT and OC are stronger for services than for manufacturing firms. The effects become stronger if different types of change are combined with each other and form clusters.

Older people without a job, one year later - Labour market prospects of unemployed older people and income sources of all older people, aged 45 to 65

Data on the labour market prospects of older people without a job (aged 45 to 65) in 2008 and one year later, and on the income sources of all older people in 2006-2008. Commissioned by: Raad voor Werk en Inkomen (Council for Work and Income).

How to judge the reliability of provisional National Accounts

It is not only important to produce good quality statistics, but also that the users of these statistics believe that they are of good quality. Therefore, the subsequent provisional estimates of the national accounts need to show a similar picture of economic performance, i.e. the subsequent estimates have to be sufficiently reliable. This paper analyses to what extent this requirement is met, taking into account that this requirement seriously conflicts with timeliness. A computer program was developed to enable a quick reliability check on a large number of economic variables derived from the national accounts.

New data sources for statistics: experiences at Statistics Netherlands

Apart from the traditional sources used by National Statistical Institutes, like sample surveys and administrative sources, nowadays more and more electronic sources of information are available that can potentially be used for the production of statistics. In the paper four sources are studied: i) Product prices on the internet, ii) Mobile phone location data, iii) Twitter text messages, and iv) Global Positioning System (GPS) data and traffic loop information. For each data source an overview is given of the usability of the collected information, as well as the practical and methodological challenges that lie ahead.

Imputation of rounded data

In surveys, respondents have a tendency to round their answers. For example, in the Labour Force Survey people are asked about the period they have been unemployed. There is clearly a tendency to give answers that are rounded to years or half years. Because of this rounding, statistics based on this data tend to be biased. In this paper we introduce a method with which the rounding mechanism is modelled together with the 'true' underlying distribution. These are then used to select observations which are likely to be rounded and to impute new values for these. This method is applied to the Labour Force Survey data. An investigation of robustness shows that the method is robust against misspecification of the model of the underlying distribution and against misspecification of the rounding mechanism.

Review of statistical priorities at Statistics Netherlands

Statistics Netherlands has started a process to review its statistical priorities. The demands of society change, but budget restrictions and the desire to reduce the administrative burden do not allow an increase of staff or surveys. Therefore, negative priorities are needed. A working group, chaired by the chief statistical officer and with members of the statistics divisions, was asked to assess proposals that were put forward by the statistics divisions. In order to make the comparison of the proposals more objective, an assessment model was developed in cooperation with an external consultancy. This paper describes the approach taken in this process.

Representativity of VAT and survey data for short term business statistics

Monthly Short Term Business statistics at Statistics Netherlands can be based on survey data, VAT records or a combination of these two data sources. Both sources are incomplete when statistics need to be produced. The survey response rate increases gradually in time and is still far from 100% after a month of data collection. The VAT register also fills gradually in time because i) quite a few enterprises report on a quarterly or annual basis, and ii) those that report on a monthly basis do so unevenly spread over time. In this paper we investigate and compare the representativity of survey and VAT response as a function of time. The objective is to determine whether VAT is as representative as survey data and can be used to produce accurate statistics. For this purpose we use so-called Representativity (R)-indicators and partial R-indicators. The results can be used in designing data collection for monthly statistics and in assessing the timing of processing survey and register data.
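
For readers unfamiliar with R-indicators, the sketch below shows the usual definition from the literature, R = 1 - 2 S(rho), where S(rho) is the standard deviation of the estimated response propensities. It is a simplified, unweighted version that assumes the propensities have already been estimated elsewhere (for example with a logistic regression); the abstract does not state which estimator the paper uses, and the function name is hypothetical.

```python
from statistics import stdev


def r_indicator(propensities):
    """Unweighted R-indicator: 1 minus twice the std. dev. of response propensities."""
    if len(propensities) < 2:
        raise ValueError("at least two propensities are needed")
    return 1.0 - 2.0 * stdev(propensities)


# Equal propensities for all units: response is fully representative.
r_equal = r_indicator([0.5, 0.5, 0.5, 0.5])
# Strongly varying propensities: response is less representative.
r_varying = r_indicator([0.2, 0.8, 0.2, 0.8])
```

A value of 1 indicates that all units are equally likely to respond; lower values indicate that response depends more strongly on the unit characteristics used to estimate the propensities.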

How to improve the quality of regional accounts estimates

In the European Union tens of billions of euros are spent on regional policy every year. A major part of this amount is allocated on the basis of regional gross domestic product per capita. In this paper by Henk Nijmeijer an inventory is drawn up of recent work on the quality of regional accounts estimates. Special attention is paid to the instrument of process tables. The regional accounts should be compiled in close cooperation with the national accounts. The quality of the national accounts estimates could be improved by the findings of the regional accounts compilation process.

Gaps in pension coverage 2008 and 2009

Figures on gaps in pension coverage (the "witte vlek") in 2008 and 2009. Commissioned by: Ministry of Social Affairs and Employment (SZW).

Chain indices and path independence

Methods are considered to calculate a set of consistent price index numbers from an inconsistent set of chained index numbers. The inconsistencies are due to the existence of cycles in the price index graph. The initial index numbers are calculated using an index formula of the user's choice. The formula is only required to satisfy a few simple consistency conditions, which do not include transitivity. One method (due to Hill) uses spanning trees to solve (or rather: sidestep) the inconsistency problem. The second method seeks to adjust the initial values in such a way that the new index numbers satisfy a transitivity criterion and are close to the original index numbers. The approach in the present paper is inspired by levelling in land surveying.

Response representativeness and measurement profile risks in the LFS

We investigate the relationship between nonresponse error and measurement error as a function of a number of survey design features. Both types of survey error are quantified using indicators. Nonresponse error is analysed in terms of maximal nonresponse bias. Measurement error is decomposed into measurement profile risk and response bias. A measurement profile is a certain response style or behaviour.

Relocations and incomes, municipality of Rotterdam

Tables on relocations in the boroughs of Rotterdam in 2001-2007 by income. Commissioned by: Centrum voor Onderzoek en Statistiek Rotterdam.

Small area estimates of labour status in Dutch municipalities

Within the ongoing redesign program of social surveys at Statistics Netherlands a small area estimation method for labour status has been developed. The model used is the basic unit-level model, which is a linear mixed model with random area effects, where the areas are municipalities. We discuss several issues concerning model choice, including the use of linear (mixed) models for binary variables, the use of posterior means instead of maximum likelihood estimates to prevent zero or too small estimates of between area variance and the use of covariates at both the unit and area level. Several model selection measures and graphical diagnostics have been applied to arrive at a set of covariates used in the model. We focus on the estimation of municipal unemployment fractions, but also discuss estimation of fractions employed and not belonging to the labour force. The municipal estimates are benchmarked such that they are consistent with regularly produced provincial estimates. The small area estimates thus obtained have smaller estimated mean squared errors than the current estimates based on the generalized regression estimator, and display a much more plausible development over time.

Participation potential 2009

Publication announced in December 2010 with data on the labour market participation of 20 to 64-year-olds. Commissioned by: Ministry of Social Affairs and Employment (SZW).

The monthly index of inventories of finished goods for the Dutch manufacturing industry

This paper by Marcel van Velzen and Leendert Hoven describes the method used in the compilation of the monthly volume index of inventories of finished goods for the Dutch manufacturing industry. The index was introduced at the end of 2009. In the paper, the plausibility of the outcomes is assessed. The paper also addresses the potential use of the index in the compilation of production indices.

Participation potential 2009

Data on the labour market participation of 20 to 64-year-olds, calculated on behalf of the Ministry of Social Affairs and Employment (SZW).

Over-fifties at work (update)

Evaluation of the IOW: labour participation and safety-net provisions for older unemployed persons

Into work with or without reintegration support (final figures 2008 and 2009)

This report describes the jobs started in 2008 or 2009 by persons coming from a benefit or from a status as a person not entitled to benefits. Terminated wage cost subsidies are also covered. Commissioned by: Ministry of Social Affairs and Employment (SZW).

With reintegration support on the way to work (2007)

On behalf of the Ministry of Social Affairs and Employment (SZW), the Centre for Policy Related Statistics of CBS (Centrum voor Beleidsstatistiek, CBS-CvB) produced a report on the outflow to regular work of persons within 24 months of the start of reintegration support in 2007.

Scale Adjusted Volume Measurement of Educational Services

This exploratory paper presents a new method to measure the output of government-provided secondary education in the Netherlands. Transferred knowledge and skills, and not pupil-hours, are used as the principal output measure of education. The approach developed involves a transformation of traditional unit cost weights.

Equal pay for equal work? Jobs and wages in government and the private sector, 2008

Equal pay for equal work? This report outlines the jobs, wages and pay differences in government and the private sector in 2008. The supplementary set of tables is available on request. Commissioned by: Ministry of Social Affairs and Employment (SZW).

Critical materials in the Dutch economy. Preliminary results (in English)

This exploratory study examines the impact of critical materials on the Dutch economy. The study uses the 41 critical materials as determined by the EU working group on critical raw materials. The Ministry of Economic Affairs added 3 critical materials to this list. The study offers a first insight into the industries and product groups in which critical materials are used.

No chance or no choice? Typology of the labour potential without income

Commissioned by the Raad voor Werk en Inkomen (Council for Work and Income) and a number of municipalities, the Centre for Policy Related Statistics (Centrum voor Beleidsstatistiek) carried out research into the labour potential without income in 2007.

Outcome monitor Wijkenaanpak, first progress report

This report describes, for the 40 priority neighbourhoods, the most recent data and the development since the baseline measurement. Commissioned by: Ministry of Housing, Spatial Planning and the Environment.

A monthly indicator of private fixed capital formation; a model based approach

This paper by Floris van Ruth describes two methods for computing a monthly statistic based on a lower-frequency, quarterly reference statistic and related monthly indicators. Both methods are based on a state space formulation and the Kalman filter. The first method is an interpolation method, which is used to produce the monthly indicator of fixed capital formation development. The alternative method is a cumulative approach. Both methods produce credible and reliable monthly statistics. The availability of relevant monthly indicators is crucial, in this case indicators of production and imports of capital goods.

Innovation and competition in the Netherlands: testing the inverted U for industries and firms

Competition can be good or bad for innovation. In this paper a model is tested in which an increase of competition stimulates innovation when competition is low, but where innovation is discouraged when the level of competition goes beyond a certain threshold. We use industry- as well as firm-level data, and find evidence for such a threshold using two different competition measures.

Monitoring conditions for consumption, exports and fixed capital formation, the radar concept

This paper by Floris van Ruth describes the concept, working and outcomes of the Statistics Netherlands export, consumption and fixed capital formation radars. These are tools for monitoring and showing how conditions develop for the growth of the target key macro-economic indicators. By showing in one diagram indicators representing the driving factors for the relevant macro-economic quantity, a general picture is given of how conditions for this central indicator are developing. The graphic and dynamic character of the radar-concept allows for easy and quick analysis. This study presents the indicators selected for the export, consumption and fixed capital formation radars and their properties. From these radars a conditions indicator can be derived. The significance of the conditions for explaining the development of the central indicators is tested, as well as their predictive powers.

Labour market position of older people (aged 45 to 65), 2006-2009

In September 2010, the Centrum voor Beleidsstatistiek (CBS-CvB) supplied data on the labour market position of older people (aged 45 to 65) for the years 2006 through 2009, commissioned by the Raad voor Werk en Inkomen. For more information, please contact the Centrum voor Beleidsstatistiek.

Estimation of the monthly unemployment rate for six domains through Structural Time Series Modelling with Cointegrated Trends

A new model for quality management

This paper explains the OQM model developed by SN, and describes nine applications of the model. The applications vary from large-scale (TQM and process assurance) to small-scale. They demonstrate that the concept of quality areas is both powerful and flexible, and can be used in any domain.

ESF monitor of participant characteristics 2009 - Characteristics of participants in ESF projects (provisional results)

Through the European Social Fund (ESF), the European Commission (EC) makes funds available to help people improve their skills and thereby their job prospects. In the Netherlands, the Agentschap Sociale Zaken en Werkgelegenheid (Agentschap SZW) is responsible for implementing the subsidy scheme. In that context, the Agentschap SZW requested data on the characteristics of participants in projects in 2009. These characteristics include age, sex, labour market position, level of education, origin group, generation of origin and receipt of a disability benefit. In 2009, more than 221 thousand people took part in approximately 780 projects that applied for ESF subsidy.

Calibrated hot-deck donor imputation subject to edit restrictions

A major problem that has to be faced by basically all institutes that collect statistical data on persons or enterprises is that data may be missing in the observed data sets. The most common solution to handle missing data is imputation. At national statistical institutes and other statistical institutes, the imputation problem is further complicated owing to the existence of constraints in the form of edit restrictions that have to be satisfied by the data. Examples of such edit restrictions are that someone who is less than 16 years old cannot be married in the Netherlands, and that someone whose marital status is unmarried cannot be the spouse of the head of household. Records that do not satisfy these edits are inconsistent, and are hence considered incorrect. Another additional problem for categorical data is that the frequencies of certain categories are sometimes known from other sources or have already been estimated. In this paper we develop imputation methods for categorical data that take these edits and known frequencies into account while imputing a record.
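
The two example edits mentioned above can be written as simple consistency checks. The following is a minimal sketch only: the record layout and the field names (`age`, `marital_status`, `relation_to_head`) are hypothetical and not taken from the paper, and the code only detects violated edits; it does not implement the imputation methods the paper develops.

```python
# Hypothetical record layout; the field names are illustrative assumptions.
def violated_edits(record):
    """Return the names of the edit restrictions that the record violates."""
    violations = []
    # Edit 1: someone younger than 16 cannot be married.
    if record["age"] < 16 and record["marital_status"] == "married":
        violations.append("under_16_married")
    # Edit 2: an unmarried person cannot be the spouse of the head of household.
    if (record["marital_status"] == "unmarried"
            and record["relation_to_head"] == "spouse"):
        violations.append("unmarried_spouse")
    return violations


consistent = {"age": 34, "marital_status": "married", "relation_to_head": "spouse"}
inconsistent = {"age": 12, "marital_status": "married", "relation_to_head": "child"}

print(violated_edits(consistent))    # no edits violated
print(violated_edits(inconsistent))  # violates the first edit
```

A record with an empty list of violations is consistent in the sense used above; an imputation method that respects the edits should only produce such records.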

Competitive, but too small

In this paper we show that scale effects, market structure, and regulation determine the poor productivity performance of the European business services industry. We apply parametric and nonparametric methods to estimate the productivity frontier and subsequently explain the distance of firms to the productivity frontier by market characteristics, entry and exit dynamics and national regulation. The frontier is assessed using a detailed industry data panel for 13 EU countries. Our estimates suggest that most scale advantages are exhausted after reaching a size of 20 employees. This scale inefficiency is persistent over time and points to weak competitive selection. Market and regulation characteristics explain the persistence of X-inefficiency (sub-optimal productivity relative to the industry frontier). More entry and exit are favourable for productivity performance, while higher market concentration has a negative effect. Regulatory differences also appear to explain part of the business services' productivity performance. In particular, regulation-caused exit and labour reallocation costs have significant and large negative impacts on the process of competitive selection and hence on productivity performance.

Diversity in figures 2008

At the end of 2008, data for the diversity index for the year 2005 were supplied to the Ministry of the Interior and Kingdom Relations. These data covered employee jobs in the Netherlands, the 10 percent of jobs with the highest gross annual wage, and the inflow and outflow of jobs in the reference year. This follow-up study updates the data for the year 2008. The data are broken down by sex, age, origin group, education, sector, province of residence and RPA area.

Over-fifties at work. Evaluation of the IOW: labour participation and safety-net provisions for older unemployed people

In 2006 the Wet Inkomensvoorziening Oudere Werklozen (IOW) was introduced as a safety net for unemployed people who are 60 or older when they apply for WW (unemployment benefit) and who are entitled to WW for more than three months. In connection with the evaluation of the IOW, the Centrum voor Beleidsstatistiek of Statistics Netherlands, commissioned by the Ministry of Social Affairs and Employment, carried out a study of the opportunities for older unemployed people to return to work, and of the background characteristics of the potential IOW population and of people who receive a benefit under the Wet Inkomensvoorziening Oudere en gedeeltelijk Arbeidsongeschikte werkloze Werknemers (IOAW).

Design-based analysis of factorial designs embedded in probability samples

At national statistical institutes, experiments embedded in ongoing sample surveys are frequently conducted, for example to test the effect of modifications in the survey process on the main parameter estimates of the survey, to quantify the effect of alternative survey implementations on these estimates, or to gain insight into the various sources of non-sampling errors. A design-based analysis procedure for factorial completely randomized designs and factorial randomized block designs embedded in probability samples is proposed in this paper. Design-based Wald statistics are developed to test whether estimated population parameters, like means, totals and ratios of two population totals, that are observed under the different treatment combinations of the experiment are significantly different. The methods are illustrated with a real-life application of an experiment embedded in the Dutch Labor Force Survey.

Outline of the CBS Business Architecture

Introductory text: this course document sets out the main features of the business architecture of CBS. The level of this introductory course is ‘intermediate’. The emphasis is not so much on theoretical concepts from business architecture as on their application to CBS. The aim of the course is to give participants an idea of which aspects matter when ‘making a statistic’, and how these aspects relate to one another. That is precisely what architecture aims at: giving a summary picture of one or more artefacts in an environment, in particular concerning coherent choices of function, structure and style. The two main topics of the course are 1) the statistical product, i.e. what a statistic is, and 2) the statistical process, i.e. how a statistic is made.

Nonresponse analysis of the AVO 2007

This note describes an analysis of the (non)response to the 2007 Aanvullend Voorzieningen Onderzoek (AVO), for which CBS carried out the fieldwork. It gives an overview of the response of various groups in society, broken down by the extent to which these groups are contacted and the extent to which people in these groups are willing to take part in the survey.

Standard Processes

Statistical processes can be very complex, and it is not uncommon that they are designed and implemented as one big tangle of statistical activities. This paper is an initiative to structure and standardise the processing of statistical data. Concepts like ‘standard process step’ and ‘standard process’ are introduced and explained by means of both a non-statistical example (fixing a flat bike tyre) and a statistical example (matching two data files).

Application of the Object Oriented Quality Management Model to Secondary Data Sources

The paper describes the application of the Object Oriented Quality Management model to the object ‘secondary data sources’. The results obtained are compared to those of the independently developed Quality framework for Administrative Data Sources. An administrative data source is an example of a secondary data source. This exercise was performed to enable the evaluation of the strengths and weaknesses of the quality management model and the completeness of the quality framework.

The definition of a job and the flow approach to the labour market; a sensitivity analysis for the Netherlands

Measurement of labour market flows depends on three major aspects of the job definition, namely (i) the size of the job; (ii) the length of the job; and (iii) whether, in accordance with national accounting rules, jobs are identified with labour contracts, or whether, in accordance with labour demand theory, unfilled vacancies are also counted as jobs. This paper looks at the sensitivity of measuring job and worker flows with respect to these alternatives for the job definition. Measurement of labour market dynamics appears especially sensitive to the dynamic dimension of the job definition, namely that of (minimum) job length.

Sampling and estimation techniques for the implementation of new classification systems; the change-over from NACE Rev. 1.1 to NACE Rev. 2 in business surveys

This paper describes some of the methodological problems encountered with the change-over from the NACE Rev. 1.1 to the NACE Rev. 2 in business statistics. Different sampling and estimation strategies are proposed to produce reliable figures for the domains under both classifications simultaneously. Furthermore several methods are described that can be used to reconstruct time series for the domains under the NACE Rev. 2.

Is there still room for a new quality model?

This report answers the question of what the quality of the OQM model is. The OQM model was developed by CBS and has also been implemented within CBS. It is a model for managing quality. Until now, the quality of the model itself had not been assessed.

Distribution based outlier detection in univariate data

Two univariate outlier detection methods are introduced. In both methods, the distribution of the bulk of observed data is approximated by regression of the observed values on their estimated QQ plot positions using a model cumulative distribution function.

From the WW to a permanent job? People leaving the WW in 2006 by labour market dynamics in 2006-2008

Paper and erratum. The results of this report are part of a study of people who flow into and out of the Werkloosheidswet (WW). This report contains data on people who left the WW in 2006. The data were compiled by the Centrum voor Beleidsstatistiek of Statistics Netherlands, commissioned by the Ministry of SZW. In addition to the publication, a number of extra breakdowns were supplied as customised tables. These are available on request.

Consumer durables in the Dutch national accounts

This paper presents the methods used for compiling balance sheets for consumer durables in the Netherlands. The Perpetual Inventory Method is used to convert time series of consumption of consumer durables to wealth stocks. Consumer durables are a memorandum item in the non-financial balance sheets.
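
The basic recursion behind a Perpetual Inventory Method can be sketched as follows. This is an illustration under stated assumptions, not the paper's implementation: it assumes a constant geometric depreciation rate and that each period's purchases enter the stock at full value at the end of the period. The function name, the rate and the figures are hypothetical.

```python
def perpetual_inventory(purchases, depreciation_rate, initial_stock=0.0):
    """Return end-of-period stocks using K_t = (1 - d) * K_{t-1} + I_t.

    purchases: expenditure on durables per period (I_t).
    depreciation_rate: assumed constant geometric rate d, between 0 and 1.
    """
    if not 0.0 <= depreciation_rate <= 1.0:
        raise ValueError("depreciation_rate must lie between 0 and 1")
    stocks = []
    stock = initial_stock
    for purchase in purchases:
        # Last period's stock loses a share d of its value; new purchases are added.
        stock = (1.0 - depreciation_rate) * stock + purchase
        stocks.append(stock)
    return stocks


# Illustrative figures only: three periods of equal purchases, 20% depreciation.
stocks = perpetual_inventory([100.0, 100.0, 100.0], depreciation_rate=0.2)
print(stocks)
```

Other depreciation or discard patterns would change the weights applied to past purchases, but the idea of accumulating a consumption series into a stock stays the same.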

A general formulation of the secondary cell suppression problem

Statistical agencies have to ensure that respondents’ private information cannot be revealed from the tables they release. A well-known protection method is cell suppression, where values that provide too much information are left out of the table to be published. In a first step, sensitive cell values are suppressed. This is called primary suppression. In a second step, other values are suppressed as well, to prevent the primarily suppressed values from being recalculated from the values published in the table. This second step is called secondary cell suppression.
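
Why a second step is needed can be shown with a single table row and its published total. The sketch below is a simplified illustration, not the formulation of the paper: the function name and the figures are hypothetical, and it only checks exact recovery within one row, ignoring column totals and the bounds an attacker could derive.

```python
def recoverable_cells(row, total):
    """Return {index: value} for suppressed cells that can be recalculated.

    Suppressed cells are given as None. With a published row total, a single
    suppressed cell equals the total minus the sum of the published cells.
    """
    suppressed = [i for i, value in enumerate(row) if value is None]
    if len(suppressed) != 1:
        # No suppressed cell, or too many unknowns for this one relation.
        return {}
    published_sum = sum(value for value in row if value is not None)
    return {suppressed[0]: total - published_sum}


# Only primary suppression: the sensitive cell can be recalculated.
primary_only = [20, None, 45]
# After secondary suppression of one more cell, this row no longer reveals it.
with_secondary = [None, None, 45]

print(recoverable_cells(primary_only, total=80))
print(recoverable_cells(with_secondary, total=80))
```

In a real table every row and column total adds another relation, which is what makes choosing the secondary suppressions a non-trivial problem.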

New developments in survey data collection methodology for official statistics

This paper describes some new developments in survey methodology that may help to solve problems of survey taking in official statistics. The R-indicator is described as an additional indicator for survey quality. Web surveys are considered as a cheaper means of data collection, either as a single-mode survey or as one of the modes in a mixed-mode survey. Attention is also paid to more flexible ways of conducting the fieldwork of a survey. The R-indicator could play a role in this.

Regional innovation in the Netherlands; Community Innovation Survey 2004 and 2006

Commissioned by the Ministry of Economic Affairs, the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) calculated the innovative behaviour of enterprises at the regional level. The results relate to two reporting periods, namely 2004 and 2006. The enriched research files that form the basis for compiling the tables have been documented and included, under the name ‘CIS-regio’, in the catalogue of microdata files at CBS-CvB.

Macro-integration with inequality constraints - an application to the integration of transport and trade statistics

Macro-integration is the process of combining data from several sources at an aggregate level. We review a Bayesian approach to macro-integration with special emphasis on the inclusion of inequality constraints. In particular, an approximate method of dealing with inequality constraints within the linear macro-integration framework is proposed. This method is based on a normal approximation to the truncated multivariate normal distribution. The framework is then applied to the integration of international trade statistics and transport statistics. By combining these data sources, transit flows can be derived as differences between specific transport and trade flows. Two methods of imposing the inequality restrictions that transit flows must be non-negative are compared. Moreover, the figures are improved by imposing the equality constraints that aggregates of incoming and outgoing transit flows must be equal.

Diversity in Youth Policy Monitor, baseline measurement

The Monitor Diversiteit in het Jeugdbeleid (Diversity in Youth Policy Monitor) is part of the ZonMw programme Diversiteit in het Jeugdbeleid. ZonMw asked the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) and the Sociaal Cultureel Planbureau (SCP) to carry out the baseline measurement of the monitor. The monitor aims to further develop and disseminate knowledge about the use of youth-sector services by ethnic groups, and about the diversity of staff in the youth sector. The monitor also provides information on the living situation of young people, covering aspects such as family, school, leisure and the labour market. This report contains the set of tables and the technical notes for the baseline measurement of the monitor.

Growing up in diversity, description of the baseline measurement of the Diversity in Youth Policy Monitor

The Monitor Diversiteit in het Jeugdbeleid (Diversity in Youth Policy Monitor) is part of the ZonMw programme Diversiteit in het Jeugdbeleid. ZonMw asked the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) and the Sociaal Cultureel Planbureau (SCP) to carry out the baseline measurement of the monitor. The monitor aims to further develop and disseminate knowledge about the use of youth-sector services by ethnic groups, and about the diversity of staff in the youth sector. The monitor also provides information on the living situation of young people, covering aspects such as family, school, leisure and the labour market. The report ‘Opgroeien in diversiteit’ describes the results of the baseline measurement of the monitor.

Estimating accuracy for statistics based on register and survey data

In this paper, we consider the situation where data are collected from both registers and sample surveys. We show how the bootstrap resampling method can be applied in this situation to gain insight into the accuracy of statistics based on combined data. The method is applied to the Dutch Educational Attainment File.
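
The general idea of bootstrap resampling can be sketched as follows. This is the plain bootstrap for a simple random sample, shown only to illustrate the principle; it is not the procedure the paper develops for combined register and survey data. The function name and the sample values are hypothetical.

```python
import random
import statistics


def bootstrap_standard_error(sample, estimator, replicates=1000, seed=0):
    """Estimate the standard error of estimator(sample) by resampling.

    Each replicate draws len(sample) values with replacement and re-applies
    the estimator; the spread of the replicate estimates approximates the
    sampling variability of the original estimate.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(sample)
    estimates = []
    for _ in range(replicates):
        resample = [sample[rng.randrange(n)] for _ in range(n)]
        estimates.append(estimator(resample))
    return statistics.stdev(estimates)


# Illustrative values only.
sample = [3, 5, 4, 6, 2, 5, 4, 3, 6, 4]
se = bootstrap_standard_error(sample, statistics.mean)
print(se)
```

With combined data, the resampling scheme has to reflect how each source was obtained, which is the situation the paper addresses.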

Social desirability bias, response order effect and selection effects in the new Dutch Safety Monitor

In 2008, there was a transition in the survey that measures perceived and actual safety in the Netherlands. This paper by Kraan, Van den Brakel, Buelens, and Huys attempts to explain some significant discontinuities caused by this transition. The observed discontinuities can be explained by the introduction of new data collection modes: Web survey and Self Completion Paper Questionnaire.

People entering the WW in 2007 and their labour market history

Commissioned by the Ministry of SZW, the Centrum voor Beleidsstatistiek of Statistics Netherlands calculated data on the labour market history of people who entered the WW in 2007. The labour market position of entrants is determined at the moment of entry. In addition, the study looks back to 2001 to determine their labour market history. The results are part of a study of dynamics in the WW, which examines flows into and out of the WW by background characteristics of people receiving a WW benefit and by the type of job that precedes and follows a WW benefit. In addition to this publication, a number of extra breakdowns were supplied as customised tables. These are available on request.

Young people 2008

Commissioned by the Ministry for Youth and Families, the Centrum voor Beleidsstatistiek of Statistics Netherlands compiled figures on young people (aged 15-26) in the Netherlands in 2008. Among other things, the report addresses the following questions: who is in education, and who has a job or receives a benefit?

A cross-sectional approach to business cycle analysis

This paper by Floris van Ruth describes how the current stance of the business cycle can be derived from a mixed set containing a limited number of selected indicators. Using the Netherlands and the United States as examples, it is shown that it is possible to extract the business cycle from a mix of leading, coincident and lagging indicators, as long as the set is not skewed towards one particular type of indicator. Different methods, both direct and indirect, of deriving the common business cycle are used to test the robustness of the results. The similarity between the common cycles resulting from the different methods is seen as a confirmation of the validity of the common cycle interpretation of the business cycle.

Exploratory study of the non-western third generation

Members of the non-western third generation have parents who were born in the Netherlands but at least one grandparent who was born in a non-western country. The Ministry of Housing, Spatial Planning and the Environment (VROM) asked the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) to carry out a first exploratory study of the size and social position of the non-western third generation on 1 January 2008.

A multivariate Denton method for benchmarking large data sets

Benchmarking is the process of achieving mathematical consistency between low-frequency (e.g. annual) and high-frequency (e.g. quarterly) data. Statistics Netherlands is going to apply a new benchmarking method to the Dutch national accounts. The new benchmarking method is based on a multivariate Denton method, presented in Bikker and Buijtenhek (2006). In order to incorporate all economic relations into this model, we extended it with new methodological features, such as ratio constraints, soft constraints and inequality constraints. In this paper the new extended multivariate Denton method is presented.

Mobility of university staff 2003-2006

Until now, little was known about the mobility of university staff. For example, information on mobility between universities was lacking, and it was not known where employees went to work after leaving the university. The Vereniging van Universiteiten (VSNU) asked the Centrum voor Beleidsstatistiek (CBS-CvB) to produce tables on the mobility of university staff. The study was made possible by a grant from the SoFoKleS foundation, the Sociaal Fonds voor de Kennissector.

Labour participation of 20 to 64-year-olds, participation potential in 2008

The Ministry of Social Affairs and Employment (SZW) aims for a gross labour participation rate of 80 percent in 2016. This study provides insight into participation in, and distance from, the labour market in 2008. Various benefit groups are distinguished, as are non-working jobseekers registered with UWV Werkbedrijf. The focus is on people aged 20 to 64, and the data are broken down by background characteristics such as sex, origin group and level of education.

On the way to work with reintegration support

The Ministry of Social Affairs and Employment (SZW) monitors the results of reintegration support. Since 2008 the indicator used for this has been ‘outflow to regular work, within 24 months of starting, of people who receive reintegration support’. The target is for the outflow to work to be 60 percent in 2011 (the 60% target). The baseline measurement for 2005 took place last year, and this first follow-up measurement maps the results of reintegration support in 2006.

At work with or without reintegration support, final results 2008

This report describes the final results on the number of jobs started in 2008 by people with a benefit or registered with the CWI (now UWV WERKbedrijf) as non-working jobseekers. It examines how often reintegration support was provided at the same time as, or prior to, the start of the job, whether the outflow to work was complete, and how lasting the employment was. Earlier this year, the Centrum voor Beleidsstatistiek (CBS-CvB), commissioned by the Ministry of Social Affairs and Employment (SZW), already published provisional results on this subject.

People by whether or not they accrue a pension

Commissioned by the Ministry of Social Affairs and Employment (SZW), the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) mapped the ‘witte vlek’ (blank spot) in pension coverage in 2007. The set of tables contains data on ‘white’ employees (employees without a second-pillar pension scheme) and ‘white’ employers (employers who do not offer a pension scheme).

Automatic correction of simple typing errors in numerical data with balance edits

Data collected for the production of structural business statistics consist of a large number of numerical variables, with many mathematical relations between them. These relations are specified in the form of consistency checks, called edit rules or edits. Edits are used to detect errors that occur in the data set.

Offshoring and productivity (growth) in Dutch manufacturing and service industries: An empirical assessment

In this paper the authors analyze the effect of offshoring on productivity (growth) while explicitly accounting for the presence of imperfect competition in the output market. The analysis suggests that offshoring has a positive impact on industrial TFP growth while it compresses competition. This latter result can be explained by the fact that, given output prices, lower production costs lead to higher price cost margins.

Modernising household surveys in the Netherlands: Design, Efficiency Gains and Perspectives

The present paper outlines the current modernisation programme of household surveys carried out by Statistics Netherlands. It focuses on its objectives in terms of data collection and standardisation of processes, efficiency targets and relevance and quality of outputs. The short and medium term goals and the European context are discussed and the experiences gathered so far are presented. This redesign project of household surveys is part of a large-scale modernisation programme of statistics production at Statistics Netherlands.

Chain management in statistics: Best practices

Statistics like the national accounts are compiled using various data sources. Most statistical institutes compile the underlying data using so-called stovepipes. A stovepipe means that a statistic is produced as a “stand alone process”. Stovepipes, however, are inappropriate for producing integrated and coherent information and cause administrative burden. A solution is to integrate all stages of the production process as part of a chain of statistical products, using chain management. So far, chain management has hardly been applied in statistical institutes. The present chapter describes the main components of chain management and how it is organized. It turns out that chain management is not easy to implement; it must be introduced step by step, and it is not something that, once “organised”, will continue to work ever after.

Checklist for the Quality evaluation of Administrative Data Sources

Statistics Netherlands is increasingly making use of administrative and other secondary data sources for the production of statistics. This approach makes Statistics Netherlands highly dependent on the quality of those sources. It is therefore of vital importance that a procedure is available to determine the quality of such data sources in a systematic, objective, and standardized way. For this purpose a quality framework and a checklist have been developed.

Labour Market Policy database; final figures 2007 and provisional figures 2008

Member states of the European Union (EU) and of the Organisation for Economic Co-operation and Development (OECD) publish information on labour market measures every year. They do so by means of, among other things, the ‘Labour Market Policy (LMP) database’. Commissioned by the Ministry of SZW, the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) calculated figures on labour market measures in 2007 and 2008 with which the LMP database can be updated.

Nonresponse in Sample Surveys, Methods for Analysis and Adjustment

The response to a survey will never be 100%, nor will it be completely non-selective. This causes a bias in statistics based on survey research. This PhD thesis by Fannie Cobben describes various methods for the treatment of nonresponse in sample surveys.

Representativity of the LISS panel

The analysis in this paper shows that certain groups are underrepresented in the LISS panel, especially single-person households, households with a high average age and households with first-generation immigrants. We also show that offering a free computer and internet access has a positive effect on the representativity.

Intervention analysis with state-space models to estimate discontinuities due to a survey redesign

An important quality aspect of official statistics produced by national statistical institutes is comparability over time. To maintain uninterrupted time series, surveys conducted by national statistical institutes are often kept unchanged as long as possible. To improve the quality or efficiency of a survey process, however, it remains inevitable to adjust methods or redesign this process from time to time. Adjustments in the survey process generally affect survey characteristics such as response bias and therefore have a systematic effect on the parameter estimates of a sample survey. Therefore it is important that the effects of a survey redesign on the estimated series are explained and quantified. In this paper a structural time series model is applied to estimate discontinuities in series of the Dutch survey on social participation and environmental consciousness due to a redesign of the underlying survey process.

The creation and application of a new quality management model

This paper gives an introduction to the OQM model, discusses its characteristics and advantages, and describes the applications of the model up to now.

Measuring Business Cycles with Rapid Estimates

Statistics Netherlands developed a factsheet containing several indicators measuring conjunctural change. This factsheet can be divided into three parts. The first part is based on 15 macro-economic indicators. The second part consists of the export condition monitor (ECM). The third part consists of 10 rapid indicators. Until now this research was mainly focussed on collecting existing statistical data and putting it in a general context. Some methodological issues arose while collecting the data. These issues are discussed in the last part of this paper and are challenges for further research.

At work with or without reintegration support, provisional figures 2008

Commissioned by the Ministry of Social Affairs and Employment (SZW), the Centrum voor Beleidsstatistiek (CBS-CvB) compiled provisional figures on people who started a job in 2008 from a benefit or NUG situation. The study examined whether they left the benefit completely at that moment, and whether they had received reintegration support in the preceding 12 months. The publication also covers wage cost subsidies that ended in 2008. Provisional results for the first half of 2008 were published last spring. Later this year CBS-CvB will publish a research report with final results, broken down by age, sex and origin, among other characteristics.

First follow-up measurement of the outflow target for reintegration support, provisional figures 2005 and 2006

Commissioned by the Ministry of Social Affairs and Employment, the Centrum voor Beleidsstatistiek of Statistics Netherlands (CBS-CvB) calculated how many people who start reintegration support find work within 24 months. The set of tables now published contains provisional results for the whole of 2005 and the whole of 2006, broken down by benefit position. In the spring, CBS-CvB supplied provisional results for the first half of 2005 and the first half of 2006. Later this year CBS-CvB will publish a research report with final results, broken down by age, sex and origin, among other characteristics.

Measuring capital in the Netherlands

At Statistics Netherlands the development of productivity statistics is addressed as a key field of interest. The recent national accounts revision at Statistics Netherlands was taken as an opportunity to improve capital stock and depreciation statistics. The Perpetual Inventory Method, as now applied at Statistics Netherlands, provides in a consistent way statistics on depreciation, the net capital stock, the productive capital stock and capital services. Further, much attention has been given to estimating average service lives and discard patterns of different asset types based on direct capital stock observations in the manufacturing industry.

Algebraic algorithms for stochastic imputation of item nonresponse with edit restrictions

Raw data records often contain missing or inconsistent items. Estimating the value of those items is usually complicated by restrictions imposed on possible value combinations. In this paper a method is described with which missing values can be estimated for categorical data, while restrictions on value combinations are obeyed. It is also possible to draw random values. The method is based on performing random walks on the set of all possible data files of records which obey the restrictions.

A mixed-mode follow-up of panel refusers in the Dutch LFS

The Dutch Labour Force Survey (LFS) is a rotating panel survey, of which the first wave is CAPI and all the other waves are CATI. LFS statistics based on the first CAPI wave, especially employment status, show small but systematic differences from statistics based on the CATI waves.

Standardisation of Design and Production of Statistics

This paper describes the ‘ideal’ to-be situation of the statistical production process of Statistics Netherlands from a business perspective. Notions such as ‘steady state’, ‘unit base’ and ‘knowledge rule’ are introduced and explained. This ‘ideal’ to-be situation should serve as an architectural framework for future redesigns. The paper also touches upon the latest experiences of SN in achieving this ‘ideal’ situation.

Valuation of oil and gas reserves in the Netherlands 1990-2005

This paper presents the methods used for compiling physical and monetary balance sheets of oil and gas reserves in the Netherlands. Results and sensitivity analyses are shown for the period between 1990 and 2005. The results are used for measuring multi-factor productivity and are published in the Dutch environmental accounts. Furthermore, balance sheets of oil and gas reserves are part of complete balance sheets for non-financial assets that are currently being developed.

ESF monitor of participant characteristics 2008; characteristics of participants in ESF projects (provisional results).

Through the European Social Fund (ESF), money is available to help people improve their skills and thereby their job prospects. In the Netherlands, the Agency of the Ministry of Social Affairs and Employment (Agentschap SZW) is responsible for this. Commissioned by the Agentschap SZW, the Centre for Policy Related Statistics has mapped the characteristics of the ESF participants in 2008. These are provisional results.

Productivity effects of innovation modes

This paper is a collaboration of Michael Polder and George van Leeuwen (CBS) with Pierre Mohnen and Wladimir Raymond (UNU-MERIT and University Maastricht). It investigates the effect of ICT and R&D on innovation, and the effect of innovation on firms' productivity, distinguishing various innovation types (product, process and organisational).

Young people 2007

This report gives a picture of young people aged 15 to 26 in the Netherlands and their participation in society in 2007. It looks at characteristics such as being in education and having a job or receiving a benefit. Because government policy pays a great deal of attention to young people who neither attend school nor have a job, part of the report focuses on this specific group (18 to 26 year-olds).

Counting on Statistics: Statistics Netherlands' Modernization Program

Counting on Statistics, the modernization program of Statistics Netherlands, aims to improve quality, make more use of administrative data, increase efficiency and reduce the number of ICT applications. It consists of five subprograms: business architecture, methodological framework, generic processes, modernization of the main economic statistics, and modernization of other statistics.

Processes, Methods and Tools

This document surveys the desired set of tools based on 1) acknowledged business functions in the production of a statistic, 2) the current set of tools and 3) general criteria that are considered relevant for the selection of tools.

A Comparison of Different Estimation Methods with an Application in the Dutch National Elections of 2003 and 2006

The estimation of changes in voting behaviour can be pursued in two ways: by modelling aggregated election results, and by means of recall data recorded in survey questionnaires. In an application of these methods to the Dutch national elections of 2003 and 2006, we show that the voting transitions estimated by survey techniques and by model-based techniques complement each other, improve the validity of the results, and provide a basis for new research.

Structural time series modelling of the monthly unemployment rate in a rotating panel

In this paper a multivariate structural time series model is described that accounts for the panel design of the Dutch Labour Force Survey and is applied to estimate monthly unemployment rates. Compared to the generalized regression estimator, this approach results in a substantial increase of the accuracy due to a reduction of the standard error and the explicit modelling of the bias between the subsequent waves.

The euro-dollar exchange rate and Dutch imports and exports

This paper quantifies the effects of a changing euro-dollar exchange rate on Dutch imports and exports of goods during the period 1978-2007. We find that these effects did not change significantly over time. The paper also considers the effects of a changing exchange rate on re-exports and exports of Dutch products separately.

Innovation in data collection: new sources and opportunities

This report gives an overview of new secondary data sources that may contain data of interest to CBS. For some of these sources a statistical application is conceivable. A number of new applications are also discussed. The main aim of the report is to offer an inspiring and refreshing view of the use of secondary sources by CBS.

Measuring and Relating Aggregate and Subaggregate Productivity Change without Neoclassical Assumptions

This paper by Bert M. Balk considers the relation between (total factor) productivity measures for individual production units and those for aggregates. It avoids making all kinds of (neoclassical) structural and behavioural assumptions. In addition, dynamic ensembles of production units are treated, characterized by entry and exit.

Careers of Doctorate Holders

This set of tables contains a number of demographic characteristics of doctorate holders in the Netherlands in 2007. The study is part of an international study of the careers of doctorate holders and was commissioned by the Ministry of Education, Culture and Science.

Innovation and performance, a collection of microdata studies

There are still many puzzles to be solved concerning the relation between innovation and firm performance, in particular concerning the distinct roles of information and communication technology (ICT) and Research and Development (R&D) in creating new or improved products or production processes. This thesis provides evidence that both instances of innovation are important drivers of productivity and, thus, economic growth. This book consolidates the results of empirical work covering about 10 years of research aimed at understanding the importance for firm performance of innovation in a broad sense.

Checklist for the Quality of Statistical Output

This report describes nineteen characteristics of statistical output. Each characteristic, also called an aspect, is worked out according to a fixed structure, beginning with its definition. For each characteristic, possible indicators and measures are identified and summarised in a checklist in an appendix. The report serves several purposes. Seven purposes have been identified, such as supporting agreements with users about the quality of statistical output. The report contains no guidelines for CBS and is not binding. It can, however, serve as a basis for developing such guidelines.

Final Report Project KIS

This report describes the results of the project Quality of Statistics Relevant to Statistics Netherlands’ Corporate Image (KIS) as reported to Eurostat. In the project KIS three tools are developed that can be used to manage the quality of statistical output: a checklist, a questionnaire quick scan and a questionnaire deep scan.

Extension of the Outcomemonitor Wijkenaanpak: description of additional indicators

To monitor the progress of the Wijkenaanpak (the neighbourhood approach), the Centre for Policy Related Statistics of Statistics Netherlands (CBS-CvB) developed an outcome monitor for the 40 priority neighbourhoods in 2008 and carried out a baseline measurement. CBS-CvB has investigated whether the Outcomemonitor Wijkenaanpak can be extended with additional indicators. This report describes the additional indicators that have been included in the outcome monitor.

Micro and macro indicators of competition: comparison and relation with productivity change

This paper investigates competition and its relation to productivity growth. The results indicate that firms need time to adjust to changes in competition, raising productivity afterwards. Results are consistent over micro and macro, various indicators and industries.

Potential of XBRL Dimensions for International Trade Statistics

Standardization of financial reporting is desirable to reduce the administrative burden on companies. To achieve this, the eXtensible Business Reporting Language (XBRL) has been developed. Almost 60% of the administrative burden on companies by Statistics Netherlands is caused by surveys about product trade between EU countries. Trade data cannot be filed in XBRL yet, because of the complexity of the Combined Nomenclature, the EU classification of traded products. Here, we describe a prototype of a dimensional XBRL taxonomy that enables reporting of trade concepts specified by product, using the recent XBRL Dimensions 1.0 specification.

Statistics Netherlands Architecture; Business and Information model

This paper describes the business and information architecture of Statistics Netherlands as a process chain of activities. These activities are divided into four business areas, namely policy, design, management, and implementation. Each business area is discussed at length.

Statistics Netherlands Architecture; Context of Change

This paper describes the context architecture of Statistics Netherlands (SN). Among other things, the paper discusses the clients, consumers, inputs and outputs, financers, and several reasons for change. Special attention is given to the Electronic Government.

Calibrated imputation of numerical data under linear edit restrictions

A common problem faced by statistical offices is that data may be missing from collected data sets. The typical way to overcome this problem is to impute the missing data. The problem of imputing missing data is complicated by the fact that statistical data often have to satisfy certain edit rules and that values of variables sometimes have to sum up to known totals. Standard imputation methods for numerical data as described in the literature generally do not take such edit rules and totals into account. In the present paper we describe algorithms for imputation of missing numerical data that do take edit restrictions into account and that ensure that sums are calibrated to known totals.

A conditions monitor for fixed capital formation

Economic indicators such as fixed capital formation do not develop in isolation. The basic idea underlying the conditions monitors is that developments in macro-economic indicators are driven by a relatively low number of fundamental factors. By finding indicators representing developments in these underlying factors, the observed values of the target economic variable can be placed in context and analysed more thoroughly. The underlying factors identified for the private fixed capital formation were availability and cost of capital, capacity utilization and business conditions. The usefulness of the indicator set is greatly enhanced by presenting it in an appropriate graphic visualisation.

On the asymptotic error of a bivariate normal approximation with an application to simple random sampling

This paper gives a formula for the limiting error of the central limit theorem for the bivariate case. Insight into this type of error simplifies the proofs of central limit theorems in probability sampling from finite populations.

Monthly employers' sentiment indicators; Doing more with business survey data

Timely information on the direction of labour market developments is very important. Vacancies are a central indicator: they are most sensitive to business cycle developments and lead the other labour market indicators. Using indicators from the various business surveys, monthly composite sentiment indicators, termed the employers’ sentiment, are constructed. These give a fast and reliable indication of the direction of vacancy development. This is because vacancy development is intimately connected to developments in business conditions, as reflected in business survey indicators.

Research on edit and imputation methodology; the throughput programme

This research programme focuses on methodology for the statistical process between input (data gathering and storage) and output (estimation, analysis and publication). This “throughput process” consists of procedures for the detection of errors, correction of errors and imputation (filling in estimates) of missing values. These throughput procedures are also referred to as editing and imputation (E&I) procedures. The goal of our throughput research programme is to improve the E&I procedures both in cost-effectiveness and in quality.

Measuring inventories in the Dutch national accounts

This paper presents the methods used for compiling complete balance sheets for inventories. Furthermore, the underlying assumptions are explained and results of sensitivity analyses are shown. The balance sheets of inventories are part of complete balance sheets for non-financial assets that are currently being developed.

A monthly workers' labour market sentiment indicator; Doing more with the Consumer Survey

More frequent and timely information on the development of the employed and unemployed labour force can be useful. This can be provided by a composite sentiment indicator, the workers’ sentiment, constructed from outcomes of the Consumer Survey. The high correlation found between consumer survey indicators and labour market developments can be explained by the great importance of job prospects for the average consumer. The constructed composite sentiment indicator is very successful in tracking developments in the direction of the employed and unemployed labour force.

A conditions monitor for exports

Economic indicators such as exports do not develop in isolation. The basic idea underlying the conditions monitors is that developments in macro-economic indicators are driven by a relatively low number of fundamental factors. By finding indicators representing developments in these underlying factors, the observed values of the target economic variable can be placed in context and analysed more thoroughly. The underlying factors identified for Dutch exports were mainly the developments in the major export markets, Germany and the Eurozone, and changes in competitiveness. The usefulness of the indicator set is greatly enhanced by presenting it in an appropriate graphic visualisation.

Labour participation of 20 to 64 year-olds - Participation potential in 2006 and 2007

The Ministry of Social Affairs and Employment (SZW) aims for a gross labour participation rate of 80 percent in 2016. Through this study SZW wants to gain insight into people's distance from the labour market. The focus is on the target group of the participation target, people aged 20 to 64. The figures are also determined for a number of specific age groups and subgroups, for example women and people with a non-Western foreign background.

Measuring Productivity Change without Neoclassical Assumptions: A Conceptual Analysis

This survey discusses the basics of productivity measurement and shows that one can dispense with most if not all of the usual, neoclassical assumptions, on which models for productivity measurement are often based. The measurement model is applicable to individual establishments as well as aggregates such as industries or economies.

A conditions monitor for household consumption

Economic indicators such as consumption do not develop in isolation. The basic idea underlying the conditions monitors is that developments in macro-economic indicators are driven by a relatively low number of fundamental factors. By finding indicators representing developments in these underlying factors, the observed values of the target economic variable can be placed in context and analysed more thoroughly. The underlying factors identified for household consumption were labour market developments, (expected) income development and developments in asset values. The usefulness of the indicator set is greatly enhanced by presenting it in an appropriate graphic visualisation.

On the efficiency of randomized PPS sampling with an application to the Producer Price Index

This paper examines the efficiency of the Horvitz-Thompson estimator from a systematic probability proportional to size sample drawn from a randomly ordered list. Moreover, the efficiency is compared with that of an ordinary ratio estimator. The results are demonstrated by means of a simulation study with Dutch data on the Producer Price Index. The discussion on the efficiency includes a comparison with rejected Poisson sampling.
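The sampling scheme named in this abstract can be sketched as follows. This is a minimal illustration, not the paper's own implementation: it assumes first-order inclusion probabilities proportional to a size measure, with no unit large enough to need a probability above one, and the function names and example data are hypothetical.

```python
import random


def systematic_pps_sample(sizes, n, rng):
    """Systematic PPS sample of n units from a randomly ordered list.

    Returns (sample_indices, inclusion_probs), where inclusion_probs[i]
    equals n * sizes[i] / sum(sizes) for every unit in the population.
    """
    total = sum(sizes)
    probs = [n * size / total for size in sizes]
    if any(p > 1.0 for p in probs):
        raise ValueError("a unit has inclusion probability above 1")

    order = list(range(len(sizes)))
    rng.shuffle(order)            # random ordering of the list

    next_point = rng.random()     # random start in [0, 1)
    cumulative = 0.0
    sample = []
    for i in order:
        cumulative += probs[i]
        if next_point < cumulative:   # a selection point falls in unit i's interval
            sample.append(i)
            next_point += 1.0         # selection points are spaced one apart
    return sample, probs


def horvitz_thompson_total(values, sample, probs):
    """Horvitz-Thompson estimate of the population total of `values`."""
    return sum(values[i] / probs[i] for i in sample)


if __name__ == "__main__":
    rng = random.Random(2024)
    sizes = [10, 20, 30, 40, 50, 60, 70, 80]
    prices = [2.0 * s for s in sizes]   # made-up target variable
    sample, probs = systematic_pps_sample(sizes, 3, rng)
    print(sample, horvitz_thompson_total(prices, sample, probs))
```

When the target variable is exactly proportional to the size measure, as in the made-up data here, every sample reproduces the population total exactly; the efficiency question only becomes interesting when that proportionality is imperfect.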

Indicators of economic dynamics 2001-2005

Commissioned by the Netherlands Competition Authority (NMa), the Centre for Policy Related Statistics has calculated a series of indicators of economic dynamics for the period 2001-2005. For a large number of industries, the indicators give a picture of size and composition, survival rates and economic development. The indicators also show what share of an industry's turnover is generated by its four largest companies, and attention is paid to innovation behaviour in the industries.
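One of the indicators mentioned, the share of industry turnover generated by the four largest companies, is commonly known as a four-firm concentration ratio. A minimal sketch of that calculation follows; the function name and the turnover figures are hypothetical and not taken from the report.

```python
def concentration_ratio(turnovers, k=4):
    """Share of total turnover generated by the k largest firms."""
    total = sum(turnovers)
    if total <= 0:
        raise ValueError("total turnover must be positive")
    largest = sorted(turnovers, reverse=True)[:k]   # the k largest firms
    return sum(largest) / total


if __name__ == "__main__":
    # Made-up turnover figures for six firms in one industry.
    print(concentration_ratio([50, 20, 10, 10, 5, 5]))
```

If an industry has fewer than four firms, the slice simply takes all of them and the ratio equals one.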

The rise of survey sampling

This paper is about the history of survey sampling. It describes how sampling became an accepted scientific method. From the first ideas in 1895 it took some 50 years before the principles of probability sampling were widely accepted. This paper focuses on developments in official statistics in the Netherlands, but it also pays attention to the use of sampling techniques in market research.

Sampling and estimation techniques for the implementation of the NACE Rev.2 in Business Surveys

In all European Union countries the classification of economic activities that is used in the Business Surveys is currently being changed from NACE Rev. 1.1 to NACE Rev. 2. This paper describes some of the methodological problems encountered with the changeover to NACE Rev. 2. Different sampling and estimation strategies are proposed to produce reliable figures for the domains under both classifications simultaneously. Furthermore, several methods are described that can be used to reconstruct time series for the domains under NACE Rev. 2.

On asymptotic distributions in random sampling from finite populations

Existing proofs of central limit theorems in random sampling from finite populations are quite lengthy. The present paper shows how the proof for this kind of theorem in random sampling can be simplified by using the central limit theorem for independent random vectors.

Best practices for business and systems analysis in projects conforming to enterprise architecture

This paper by Ralph Foorthuis and Sjaak Brinkkemper aims to identify best practices for carrying out business and systems analysis in projects that are required to comply with an Enterprise Architecture. The research methods used are Canonical Action Research and Focus Group interviews.

An artifact model for projects conforming to enterprise architecture

This article by Ralph Foorthuis, Sjaak Brinkkemper and Rik Bos presents a model for projects that have to adhere to an Enterprise Architecture (EA) in order for their results to be aligned with the broader organization. The model features project artifacts (i.e. deliverables such as Software Architecture Documents), their mutual relationships, their relationship with EA, and the processes in which they are created and tested on conformance.

Difficult Groups in Survey Research and the Development of Tailor-made Approach Strategies

The subject of this thesis is difficult groups in survey research.

A time series approach to estimate discontinuities due to a survey redesign

To maintain uninterrupted time series, surveys conducted by national statistical institutes are often kept unchanged as long as possible. To improve the quality or efficiency of a survey process, however, it remains inevitable to adjust methods or redesign this process from time to time. Such adjustments generally have a systematic effect on the parameter estimates of a sample survey. Therefore it is important that the effects of a survey redesign on the estimated series are explained and quantified. In this paper a structural time series model is applied to estimate discontinuities in series of the Dutch survey on social participation and environmental consciousness due to a redesign of the underlying survey process.

Producing historical time series for STS-statistics in NACE Rev.2

This contribution discusses possible scenarios and methodologies for the national statistical agencies for backcasting the new classification scheme (NACE Rev. 2.0) in existing time series of business statistics. We provide a discussion of the basic principles of reconstructing time series in general, after which the application of these methods in the area of short term business statistics is handled and illustrated with an example. We conclude that it is possible to obtain reasonable approximations of historic time series using rather simple methodology, but the quality of the backcasted time series is hampered by heterogeneity of classes.

An overview of statistical data editing

This paper gives an overview of statistical data editing. The paper first describes the traditional interactive approach to data editing. It then focuses on modern editing techniques, such as selective editing, automatic editing, and macro-editing. The paper aims to provide an introduction to these topics, and gives many references to the literature.

Object-oriented quality management

This report describes how quality can be managed in an organisation. According to the model, a selection is first made from the relevant focus areas. Within each focus area, the appropriate control measures must then be taken. The model can be applied within any organisation and in any desired area.

Cross your border and look around

This paper analyses which determinants are relevant for catching up to the global and national technological frontier. We focus on innovation, human capital, technology transfers and competition as sources of productivity growth. Our approach integrates the literature on the two faces of R&D, convergence and firm-level heterogeneity in productivity.

Intangible capital in the Netherlands: Measurement and contribution to economic growth

This paper explores the broader range of intangible investment in the Netherlands. Both conceptual and measurement issues are discussed. Furthermore, intangibles are capitalized and their contribution to economic growth by industry is examined.

How accurate are self-selection web surveys?

Due to methodological problems, the quality of the outcomes of web surveys may be seriously affected. This paper addresses one of these problems: self-selection of respondents. Self-selection leads to a lack of representativity and thus to biased estimates. It is shown that the bias of estimators in self-selection surveys can be much larger than in surveys based on traditional probability samples. It is explored whether some correction techniques (adjustment weighting and use of reference surveys) can improve the quality of the outcomes. It turns out that there is no guarantee for success.

Algorithms for correcting some obvious inconsistencies and rounding errors in business survey data

This paper describes methods for automatically detecting and correcting two systematic errors that are found in data collected for structural business statistics. It also discusses a simple heuristic method for resolving rounding errors when the data have to satisfy many balance edit rules.

Representativity of the Time Use Survey

In 2006 the selection of respondents for the Time Use Survey (‘Tijdbestedingsonderzoek’ or ‘TBO’) consisted of three stages. In this paper we employ R-indicators to investigate the representativity of the TBO response. R-indicators are proposed by Schouten and Cobben (2007) as measures to evaluate the similarity between the response and population of interest. This research was sponsored by the Netherlands Institute for Social Research (SCP). The 2005 and 2006 datasets were provided by DANS.

Indicators in the SEEA: identifying the main aggregates in the SEEA Part I

The system of environmental and economic accounting (SEEA) provides a range of important accounting aggregates which can logically be defined within the SEEA’s accounting identities. One key recommendation made in this paper is that these main aggregates should be explicitly pointed out as potential indicators in Part I of the revised SEEA. This paper gives an overview of key aggregates from the physical flow accounts that, it is recommended, should be explicitly presented in the SEEA text of Part I.

Processing of Erroneous and Unsafe Data

The Ph.D. thesis by Ton de Waal examines two different areas: detection of erroneous data and protection of sensitive data. By assuming that as few errors as possible have been made while answering and processing the questionnaires, the detection of erroneous data can be formulated as a mathematical optimisation problem. In the thesis a number of methods are developed to solve this optimisation problem efficiently. Sensitive data of individual respondents or small groups of respondents have to be protected, for instance by not publishing these data. In the thesis several mathematical problems with respect to protecting sensitive data are explored. For a number of problems, such as calculating the information loss due to protecting sensitive data and minimising the information that is not published, solutions are presented.

Service lives and discard patterns of capital goods in the manufacturing industry, based on direct capital stock observations, the Netherlands

This paper presents methods and results with regard to the estimation of service lives and discard patterns, based on directly observed capital stock data and on discard surveys in the manufacturing industry. The presented results are input for the calculation of consumption of fixed capital and net capital stock for the national accounts.

Consistency and beyond - improving quality of national accounts estimates in practice

The system of national accounts shows a consistent quantitative overview of the economic process of a country. Consistency, however, is not a guarantee of good quality. In this paper an overview is given of possible adjustments and extensions of the system. The paper refers to the national accounts practice in the Netherlands.

Constructing a monthly indicator of fixed capital formation via high frequency interpolation. The state space approach.

In this study, a different temporal disaggregation approach is tested for producing a monthly series from a quarterly statistic when monthly observations are lacking. Monthly realisations are obtained by interpolating the quarterly series using related monthly indicators and an autoregressive component. For optimal results, the whole is cast in the state space form. Good monthly indicators can thus be easily obtained, and real-time analyses show that the methods work in practice as well.

A latent variable approach to constructing monthly indicators; fixed capital formation

Econometric techniques can be a powerful aid for the production of statistics. A prime example is the use of so-called benchmarking or interpolation techniques for producing more detailed or frequent statistics when complete data are lacking. Here, a version is tested which aims to produce a monthly indicator of fixed capital formation from the quarterly national accounts series. In this version, three latent monthly indicators are defined for each quarter, and derived from available related monthly indicators. A credible monthly series can be constructed, but unfortunately real time performance is shown to be unsatisfactory.

An empirical validation of R-indicators

In a previous paper (discussion paper 07002, Schouten and Cobben) we introduced so-called R-indicators. These indicators attempt to measure the representativeness of the response to a survey. In the present paper we apply these indicators to several examples at CBS. These examples include surveys with different interview modes, data collection strategies and amounts of pre-paid incentives. We compare the values of the R-indicators to the outcomes of thorough analyses performed on these data sets.
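The abstract does not give the formula. A definition often quoted in the R-indicator literature is R = 1 - 2S, where S is the standard deviation of the estimated response propensities. The sketch below assumes that definition, with optional design weights that sum to the population size; the function name is hypothetical and the code is an illustration, not the authors' implementation.

```python
import math


def r_indicator(propensities, weights=None):
    """R-indicator computed as 1 - 2 * S(propensities).

    Assumption: S is the (design-weighted) standard deviation of the
    estimated response propensities, with denominator sum(weights) - 1.
    Without weights, every sampled unit counts once.
    """
    if weights is None:
        weights = [1.0] * len(propensities)
    total_weight = sum(weights)
    if total_weight <= 1.0:
        raise ValueError("need a total weight above 1")

    mean = sum(w * p for w, p in zip(weights, propensities)) / total_weight
    variance = sum(
        w * (p - mean) ** 2 for w, p in zip(weights, propensities)
    ) / (total_weight - 1.0)
    return 1.0 - 2.0 * math.sqrt(variance)


if __name__ == "__main__":
    print(r_indicator([0.5, 0.5, 0.5, 0.5]))   # equal propensities
    print(r_indicator([0.4, 0.6, 0.4, 0.6]))   # more variation, lower value
```

Under this definition, identical propensities give the maximum value of one, and greater variation in propensities lowers the indicator.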

Nonresponse in the recruitment of an internet panel based on a probability sample

In this paper we study the selectivity in the recruitment of respondents for one of CentERdata’s Internet panels (the CentERpanel). This recruitment is based on a probability sample. It involves three stages: participation in a first telephone interview, willingness to be re-contacted and final agreement to participate in the Internet panel. We find selectivity with regard to age and income in all stages and with regard to PC ownership in the latter two stages only.

Model-Based Estimation for Official Statistics

Design-based and model-assisted estimation procedures are widely applied by most of the European national statistical institutes. There are, however, situations where model-based approaches can have additional value in the production of official statistics, e.g. to deal with small sample sizes, measurement errors and discontinuities due to survey redesigns. In this paper several cases are identified where design-based estimators do not result in sufficiently reliable estimates and model-based procedures are more appropriate for producing official releases.

A framework for project architecture in the context of enterprise architecture

This paper by Ralph Foorthuis and Sjaak Brinkkemper describes several architectures at the project level for projects that conform to Enterprise Architecture. The architectures described include Project Architecture, Project Start Architecture and Software Architecture. They are placed in the context of Enterprise Architecture and Domain Architecture.

Estimation of the monthly unemployment rate through structural time series modelling in a rotating panel design

In this paper a multivariate structural time series model is described that accounts for the panel design of the Dutch Labour Force Survey and is applied to estimate monthly unemployment rates.

An analysis of editing strategies for mixed-mode establishment surveys

For several of Statistics Netherlands' establishment surveys, respondents have the choice to respond on paper or electronically. This paper investigates the consequences for editing processes when two different data streams have to be handled. Special attention is paid to the question of how to quantitatively compare the quality of the data streams.

A proposed model for microintegration of economic and social data

Globalization affects all aspects of economic and social life. In order to study the effects of an open economy on employment and welfare, combined microdata from business surveys, social surveys and administrative registers are required to make causal inferences.

Intangible capital in the Netherlands: A benchmark

This paper is a first attempt to provide a comprehensive measure of spending on intangible capital in the Netherlands. We replicate the approach pioneered by Corrado, Hulten and Sichel (2004, 2005 and 2006) for the U.S.

Market structure, productivity and scale in European business services

Labour productivity in the business services industry tends to lag behind the rest of the economy. This paper investigates whether labour productivity in European business services is affected by unexploited economies of scale. Moreover, it analyses whether the incidence of scale sub-optimality is related to characteristics of the market or to national regulation characteristics.

Imputation of numerical data under linear edit restrictions

A common problem faced by statistical offices is that data may be missing from collected data sets. The typical way to overcome this problem is to impute the missing data. In this paper we describe two algorithms for imputation of missing numerical data that take into account edit restrictions.

Interactive coding in a field: a test

This paper was presented at the QUEST (Questionnaire Evaluation Standards) Workshop, 21-23 October, ZUMA Mannheim, Germany. A Questionnaire lab test will be discussed concerning new questions on respondents’ education, occupation and company as well as automatic coding of the answers.