Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Multiple imputation as one tool to provide longitudinal databases for modelling human height and weight development




Besides large efforts regarding field work, provision of valid databases requires statistical and informational infrastructure to enable long-term access to longitudinal data sets on height, weight and related issues.


To foster use of longitudinal data sets within the scientific community, provision of valid databases has to address data-protection regulations. It is, therefore, of major importance to hinder identifiability of individuals from publicly available databases. To reach this goal, one possible strategy is to provide a synthetic database to the public allowing for pretesting strategies for data analysis. The synthetic databases can be established using multiple imputation tools. Given the approval of the strategy, verification is based on the original data.


Multiple imputation by chained equations is illustrated to facilitate provision of synthetic databases as it allows for capturing a wide range of statistical interdependencies. Also missing values, typically occurring within longitudinal databases for reasons of item non-response, can be addressed via multiple imputation when providing databases.


The provision of synthetic databases using multiple imputation techniques is one possible strategy to ensure data protection, increase visibility of longitudinal databases and enhance the analytical potential.

This is a preview of subscription content, access via your institution

Access options

Rent or buy this article

Get just this article for as long as you need it


Prices may be subject to local taxes which are calculated during checkout


  1. Campbell P . Data's shameful neglect. Nature 2009; 461: 145.

    Google Scholar 

  2. Cole TJ, Green PJ . Smoothing reference centile curves: the LMS method and penalized likelihood. Stat Med 1992; 11: 1305–1319.

    Article  CAS  Google Scholar 

  3. Rubin DB . Multiple Imputation for Nonresponse in Surveys. Wiley-IEEE: Hoboken, NJ, USA, 1987.

    Book  Google Scholar 

  4. van Buuren S, Groothuis-Oudshoorn K . MICE: Multivariate Imputation by Chained Equations. J Stat Softw 2011; 45: 1–67.

    Google Scholar 

  5. Preece MA, Baines MJ . A new family of mathematical models describing the human growth curve. Ann Hum Biol 1978; 5: 1–24.

    Article  CAS  Google Scholar 

  6. van Buuren S . Fully conditional specification. In: Molenberghs G, Fitzmaurice GM, Kenward MG, Tsiatis AA, Verbeke G (eds), Handbook of Missing Data Methodology. Chapman & Hall/CRC Press: Boca Raton, FL, USA, 2015.

    Google Scholar 

  7. Barkow I, Leopold T, Raab M, Schiller D, Wenzig K, Blossfeld HP et al. RemoteNEPS: data dissemination in a collaborative workspace. Z Erziehungswiss 2011; 14: 315–325.

    Article  Google Scholar 

  8. Drechsler J . Synthetic Datasets for Statistical Disclosure Control: Theory and Implementation. Springer, Lecture notes in statistics: Berlin, Germany, 2011.

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to C Aßmann.

Ethics declarations

Competing interests

The author declares no conflict of interest.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Aßmann, C. Multiple imputation as one tool to provide longitudinal databases for modelling human height and weight development. Eur J Clin Nutr 70, 653–655 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:


Quick links