Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Characterizing populations of individuals using pooled samples

Abstract

Biomonitoring involves the assessment of human or animal populations by measuring organic or biological compounds or their metabolites in the body fluids or tissues of individuals in those populations. Pooling samples before making analytical measurements can reduce the costs of biomonitoring by reducing the number of analyses. By proper choice of pooled-sample design, population means can be estimated without measuring individual samples. I present a statistical method for characterizing an entire population distribution of such compounds by exploiting the theoretic relationship between interindividual-sample variance and the variation between pooled samples. I use simulation experiments to determine an optimum pooled-sample design as a function of the number of subpopulations and the number of available samples. Using pooled samples to characterize populations is not only more cost-efficient, but also in some cases it can lead to more precise and less biased parameter estimation than that occurs with individual samples.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1
Figure 2
Figure 3
Figure 4

Similar content being viewed by others

References

  • Affymetrix. Sample Pooling for Microarray Analysis, Technical note, Affymetrix, San Diego, 2004.

  • Aitchison J., and Brown J.A.C. The Lognormal Distribution. Cambridge University Press, London, 1963 pp. 7–8.

    Google Scholar 

  • Baccarelli A., Pfeiffer R., Consonni D., Pesatori A.C., Bonzini M., Patterson Jr. D.G., et al. Handling of dioxin measurement data in the presence of non-detectable values: overview of available methods and their application in the Seveso chloracne study. Chemosphere 2005: 60: 898–906.

    Article  CAS  Google Scholar 

  • Bates M.N., Buckland S.J., Garrett N., Caudill S.P., and Ellis H. Methodological aspects of a national population-based study of persistent organochlorine compounds in serum. Chemosphere 2005: 58: 943–951.

    Article  CAS  Google Scholar 

  • Bates M.N., Buckland S.J., Garrett N., Ellis H., Needham L.L., Turner D.G., et al. Persistent organochlorines in the serum of the non-occupationally exposed New Zealand population. Chemosphere 2004: 54: 1431–1443.

    Article  CAS  Google Scholar 

  • Bignert A., Gothberg A., Jensen S., Litzen K., Odsjo T., Olsson M., et al. The need for adequate biological sampling in ecotoxicological investigations: a retrospective study of 20 years pollution monitoring. Sci Total Environ 1993: 128: 121–139.

    Article  CAS  Google Scholar 

  • Caudill S.P., Wong L.Y., Turner W.E., Lee R., Henderson A., and Patterson Jr. D.G. Chemosphere 2007a: 68: 169–180.

    Article  CAS  Google Scholar 

  • Caudill S.P., Turner W.E., and Patterson Jr. D.G. Geometric mean estimation from pooled samples. Chemosphere 2007b: 69: 371–380.

    Article  CAS  Google Scholar 

  • Gleit A. Estimation of small normal data sets with detection limits. Environ Sci Technol 1985: 19: 1201–12061.

    Article  CAS  Google Scholar 

  • Haas C.N., and Scheff P.A. Estimation of averages in truncated samples. Environ Sci Technol 1990: 24: 912–919.

    Article  CAS  Google Scholar 

  • Hald A. A History of Mathematical Statistics from 1750 to 1930. Wiley Inc., New York, 1998 pp. 303–350.

    Google Scholar 

  • Helsel D.R. Less than obvious – statistical treatment of data below the detection limit. Environ Sci Technol 1990: 24: 1766–1774.

    Article  CAS  Google Scholar 

  • Helsel D.R. Nondetects and Data Analysis. Wiley Inc., New Jersey, 2005 p. 58.

    Google Scholar 

  • Hornung R.W., and Reed L.D. Estimation of average concentration in the presence of nondetectable values. Appl Occup Environ Hyg! 1990: 5: 46–51.

    Article  CAS  Google Scholar 

  • Huybrechts T., Thas O., Dewulf J., and Van Langenhove H. How to estimate moments and quantiles of environmental data sets with non-detected observations? A case study on volatile organic compounds in marine water samples. J Chromatogr A 2002: 975: 123–133.

    Article  CAS  Google Scholar 

  • Johnson N.L., and Kotz S. Distributions in Statistics: Continuous Distributions — 1. Houghton Mifflin, New York, 1970 p. 63.

    Google Scholar 

  • Mary-Huard T., Daudin J., Baccini M., Biggeri A., and Bar-Hen A. Biases induced by pooling samples in microarray experiments. Bioinformatics 2007: 23: i313–i318.

    Article  CAS  Google Scholar 

  • NCEH First National Report on Human Exposure to Environmental Chemicals (NCEH Publication 01-0164, 2001).

  • NCEH Second National Report on Human Exposure to Environmental Chemicals (NCEH Publication 02-0716, 2003).

  • NCEH Third National Report on Human Exposure to Environmental Chemicals (NCEH Publication 05-0570, 2005).

  • Odeh R.E., and Owen D.B. Tables for Normal Tolerance Limits, Sampling Plans, and Screening. Marcel Dekker, New York, 1980 pp. 17–69.

    Google Scholar 

  • Persson T., and Rootzen H. Simple and highly efficient estimators for a type I censored normal sample. Biometrika 1977: 64: 123–128.

    Article  Google Scholar 

  • Travis C.C., and Land M.L. Estimating the mean of data sets with nondetectable values. Environ Sci Technol 1990: 24: 961–962.

    Article  CAS  Google Scholar 

  • Weinberg C.R., and Umbach D.M. Using pooled exposure assessment to improve efficiency in case-control studies. Biometrics 1999: 55: 718–726.

    Article  CAS  Google Scholar 

  • Zhang S.D., and Gant T.W. Effect of pooling samples on the efficiency of comparative studies using microarrays. Bioinformatics 2005: 21: 4378–4383.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

I thank W. E. Turner, D. G. Patterson Jr., J. L. Pirkle, and E. J. Sampson for access to individual measurements of National Health and Nutrition Examination Survey samples and for making possible the pooling of samples.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Samuel P Caudill.

Additional information

Supplementary Information accompanies the paper on the Journal of Exposure Science and Environmental Epidemiology website (http://www.nature.com/jes)

Supplementary information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Caudill, S. Characterizing populations of individuals using pooled samples. J Expo Sci Environ Epidemiol 20, 29–37 (2010). https://doi.org/10.1038/jes.2008.72

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/jes.2008.72

Keywords

This article is cited by

Search

Quick links