This is a preview of subscription content
Subscribe to Nature+
Get immediate online access to the entire Nature family of 50+ journals
Subscribe to Journal
Get full journal access for 1 year
only $9.92 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Tax calculation will be finalised during checkout.
Get time limited or full article access on ReadCube.
All prices are NET prices.
The datasets analyzed during the current study are available in the Qiita repository with the specific study accessions in Supplementary Data 1, and were extracted with Qiita’s redbiom interface.
Lozupone, C. & Knight, R. Appl. Environ. Microbiol. 71, 8228–8235 (2005).
Thompson, L. R. et al. Nature 551, 457–463 (2017).
McDonald, D. et al. mSystems 3, e00031-18 (2018).
Gonzalez, A. et al. Nat. Methods 15, 796–798 (2018).
Caporaso, J. G. et al. Nat. Methods 7, 335–336 (2010).
Chang, Q., Luan, Y. & Sun, F. BMC Bioinformatics 12, 118 (2011).
Chen, J. et al. Bioinformatics 28, 2106–2113 (2012).
McMurdie, P. J. & Holmes, S. PLoS One 8, e61217 (2013).
Amir, A. et al. mSystems 2, e00191-16 (2017).
This work was supported by the NSF (grant DBI-1565100 to D.M., Y.V.-B., Z.X., A.G., and R.K.; award 1664803 to D.K and J.M.), the Alfred P. Sloan Foundation (G-2017-9838 to D.M., Y.V.-B., A.G., and R.K.; G-2015-13933 to A.G. and R.K.), ONR (grant N00014-15-1-2809 to D.M., A.G., and R.K.), and NIH–NIDDK (grant P01DK078669 to A.G. and R.K.). This work was partially supported by XSEDE resource grant BIO150043. Additional support was provided by CRISP, one of six centers in JUMP, a Semiconductor Research Corporation (SRC) program sponsored by DARPA.
R.K. is a founder and CSO of Biota Technology Inc. D.M. is a consultant with Biota Technology Inc.
Integrated supplementary information
(A-B) Walltime and memory distributions of independent processes operating on the full Earth Microbiome Project dataset (n = 26,181) executing on shared compute nodes. An individual partition represents a single independent process, and each process was run with two threads; 32 partitions indicates 32 processes using two threads each. A higher partition count means each individual process is doing less work. Box plots show the median, whiskers are 1.5 times the proportion of the interquartile range past the 25th and 75th percentiles; the number of data points in each box plot is the number of partitions in the processing run. (C) An empirical assessment of the number of proportion vectors required to be retained in memory over increasing tree sizes. This assessment was performed by randomly sampling tips from the Greengenes 99% OTU tree, and counting the maximum number of nodes required to hold proportion vectors resident in memory. Box plots show the median, whiskers are 1.5 times the proportion of the interquartile range past the 25th and 75th percentiles; each box plot represents 10 independent experiments. (D) Empirical assessment of the runtime of Striped UniFrac for 1,024 samples over increasing numbers of tips in a phylogeny. (E) Mantel tests (Pearson) between Striped UniFrac in exact mode, which produces identical results to UniFrac, versus fast mode, in which the UniFrac distances are not computed at the tips of the tree during traversal. Each data point represents n = 10 random subsets (independent experiments) of the Earth Microbiome Project Deblur 90-nt dataset, with the mean R2 value depicted. Error bars are 95% CI around the mean. The figure data can be found in Supplementary Data 3.
Supplementary Figure 1 and Supplementary Note 1
table_s1.xlsx, the Qiita study accessions used.
figure1-data.xlsx, the data necessary to re-create panels c and d in Fig. 1.
figureS1-data.xlsx, the data necessary to re-create Supplementary Fig. 1.
Supplementary SoftwareUnifrac.tar.gz, the version of UniFrac used in the study.
About this article
Cite this article
McDonald, D., Vázquez-Baeza, Y., Koslicki, D. et al. Striped UniFrac: enabling microbiome analysis at unprecedented scale. Nat Methods 15, 847–848 (2018). https://doi.org/10.1038/s41592-018-0187-8
Fecal microbiome of horses transitioning between warm-season and cool-season grass pasture within integrated rotational grazing systems
Animal Microbiome (2022)
Distribution characteristics of ammonia-oxidizing microorganisms and their responses to external nitrogen and carbon in sediments of a freshwater reservoir, China
Aquatic Ecology (2022)
BMC Research Notes (2021)
Nature Biotechnology (2021)
Longitudinal patterns in the skin microbiome of wild, individually marked frogs from the Sierra Nevada, California
ISME Communications (2021)