Creating large genome/phenome collections can require consortium-scale resources. DNA.Land is a digital biobank that collects genetic data from individuals tested by consumer genomics companies, using a fraction of the resources of traditional studies.
Y.E. holds a Career Award at the Scientific Interface from the Burroughs Wellcome Fund. This study was supported by a generous gift from Andria and Paul Heafy to the Erlich Laboratory, funding from the National Breast Cancer Coalition, and support from Amazon Web Services’ Education Grants. J.Y. is supported by the Columbia University Integrative Graduate Education and Research Traineeship (IGERT), funded by NSF research grant number 1144854. We thank the tens of thousands of DNA.Land participants—especially our early adopters, whose feedback was integral in our efforts to improve the site—and genetic genealogist C. Moore for her valuable advice. We welcome inquiries by researchers who are interested in collecting genotype and phenotype information with our resource.
Y.E. is the Chief Science Officer of https://www.MyHeritage.com. J.P. is the CEO and co-founder of Gencove.
Integrated Supplementary Information
a, The number of page visits per week to each type of report on DNA.Land: Ancestry (red), Relative Matching (light blue), Relatives of Relatives (green), and Trait Report pages (dark blue). b, The distribution of new user registrations to DNA.Land by day of the week. c, The percentage of page visits by DNA.Land users by day of the week.
a, Per-week and cumulative numbers of total surveys completed by users. b, Per-week and cumulative numbers of total questions answered by users. c, The distribution of time required by users to complete each type of survey. The surveyed traits are as follows: Chronotype (orange), Coffee Consumption (blue), Myopia (red), Eye Color (green), Neuroticism (pink), Educational Attainment (purple), and Height (yellow).
a, The distribution of the number of inferred relatives among DNA.Land users based on matching IBD segments. Only 10.5% of DNA.Land users have no detected relatives. b, The distribution of degrees of relatedness among matching pairs of DNA.Land users, as calculated by the ERSA algorithm. A degree of 0 indicates either an identical twin or a duplicate genotype file.
a, Self-reported age distribution in DNA.Land. b, Ancestry composition of DNA.Land users with aggregated ancestry categories: Northern European (red), Northeast European (orange), Other European (light orange), Ashkenazi (yellow), African (yellow-green), South Asian (light green), East Asian (turquoise), Native American (blue). Each column represents a single user, and stacked bars on each column indicate the distribution of ancestry groups for a given user. Users are sorted by decreasing percentage of their largest ancestry group. c, Geographic location of DNA.Land users, as determined by IP address.
About this article
Cite this article
Yuan, J., Gordon, A., Speyer, D. et al. DNA.Land is a framework to collect genomes and phenomes in the era of abundant genetic information. Nat Genet 50, 160–165 (2018). https://doi.org/10.1038/s41588-017-0021-8