Large open-access data sets offer unprecedented opportunities for scientific discovery — the current global collapse of bee and frog populations are classic examples. However, we must resist the temptation to do science backwards by posing questions after, rather than before, data analysis.
A scant understanding of the context in which data sets were collected can lead to poorly framed questions and results, and to conclusions that are plain wrong. Scientists intending to make use of large composite data sets need to work closely with those responsible for gathering the data. Standard scientific principles and practice then demand that they first frame the important questions, then design and execute the data analyses needed to answer them.
CONTRIBUTIONS Correspondence may be sent to firstname.lastname@example.org after consulting the author guidelines at go.nature.com/cmchno. Alternatively, readers may comment online: www.nature.com/nature.
Rights and permissions
About this article
Cite this article
Lindenmayer, D., Likens, G. Don't do big-data science backwards. Nature 499, 284 (2013). https://doi.org/10.1038/499284d