Abstract
The question of whether it is possible to automate the scientific process is of both great theoretical interest1,2 and increasing practical importance because, in many scientific areas, data are being generated much faster than they can be effectively analysed. We describe a physically implemented robotic system that applies techniques from artificial intelligence3,4,5,6,7,8 to carry out cycles of scientific experimentation. The system automatically originates hypotheses to explain observations, devises experiments to test these hypotheses, physically runs the experiments using a laboratory robot, interprets the results to falsify hypotheses inconsistent with the data, and then repeats the cycle. Here we apply the system to the determination of gene function using deletion mutants of yeast (Saccharomyces cerevisiae) and auxotrophic growth experiments9. We built and tested a detailed logical model (involving genes, proteins and metabolites) of the aromatic amino acid synthesis pathway. In biological experiments that automatically reconstruct parts of this model, we show that an intelligent experiment selection strategy is competitive with human performance and significantly outperforms, with a cost decrease of 3-fold and 100-fold (respectively), both cheapest and random-experiment selection.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout



Similar content being viewed by others
References
Popper, K. The Logic of Scientific Discovery (Hutchinson, London, 1972)
Sloman, A. The Computer Revolution in Philosophy (Harvester, Hassocks, Sussex, 1978); available online from 〈http://www.cs.bham.ac.uk/research/cogaff/crp/〉
Buchanan, B. G., Sutherland, G. L. & Feigenbaum, E. A. in Machine Intelligence Vol. 4 (eds Meltzer, B. & Michie, D.) 209–254 (Edinburgh Univ. Press, 1969)
Langley, P., Simon, H. A., Bradshaw, G. L. & Zytkow, J. M. Scientific Discovery: Computational Explorations of the Creative Process (MIT Press, Cambridge, Massachusetts, 1987)
Z̈ytkow, J. M., Zhu, J. & Hussam, A. Automated discovery in a chemistry laboratory. in Proceedings of the 8th National Conference on Artificial Intelligence (AAAI-1990) (eds Dietterich, T. & Swartout, W.) 889–894 (MIT, Cambridge, Massachusetts, 1990)
King, R. D., Muggleton, S. H., Srinivasan, A. & Sternberg, M. J. E. Structure–activity relationships derived by machine learning: The use of atoms and their bond connectivities to predict mutagenicity by inductive logic programming. Proc. Natl Acad. Sci. USA 93, 438–442 (1996)
Valdes-Perez, R. E. Discovery tools for science applications. Commun. ACM 42, 37–41 (1999)
Langley, P. The computational support of scientific discovery. Int. J. Hum.–Comput. Stud. 53, 393–410 (2000)
Beadle, G. W. & Tatum, E. I. Genetic control of biochemical reactions in Neurospora. Proc. Natl Acad. Sci. USA 27, 499–506 (1941)
Pierce, C. S. Collected Papers of Charles Sanders Pierce (Harvard Univ. Press, 1958)
Mewes, H. W. et al. MIPS: a database for genomes and protein sequences. Nucleic Acids Res. 30, 31–34 (2002); 〈http://mips.gsf.de/proj/yeast/CYGD/db/index.html〉
Reiser, P. G. K. et al. Developing a logical model of yeast metabolism. Electron. Trans. Artif. Intell. 5, 223–244 (2001)
Kanehisa, M. & Goto, S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000); 〈http://www.genome.ad.jp/kegg/〉
Mitchell, T. M. Machine Learning (McGraw-Hill, New York, 1997)
Langley, P. Elements of Machine Learning (Morgan Kaufmann, San Mateo, California, 1996)
Cohen, D. A., Ghabhramani, Z. & Jordan, M. I. Active learning with statistical models. J. Artif. Intell. Res. 4, 129–145 (1996)
Lin, F.-R. & Shaw, M. J. Active training of backpropagation neural networks using the learning by experimentation methodology. Ann. Oper. Res. 75, 129–145 (1997)
Fedorov, V. V. Theory of Optimal Experiments (Academic, London, 1972)
Muggleton, S. & Page, D. in Machine Intelligence Vol. 15 (eds Furukawa, K., Michie, D. & Muggleton, S.) 248–267 (Oxford Univ. Press, 1999)
Bryant, C. H., Muggleton, S. H., Oliver, S. G., Kell, D. B., Reiser, P. G. K. & King, R. D. Combining inductive logic programming, active learning, and robotics to discover the function of genes. Electron. Trans. Artif. Intell. 5, 1–36 (2001)
Flach, P. & Kaka, A. Abduction and Induction (Kluwer, London, 2000)
Zupan, B. et al. in Proceedings of the Eighth European Conference on Artificial Intelligence in Medicine (eds Qualini, S., Barahona, P. & Andreassen, S.) 304–313 (Springer, Berlin, 2001)
Muggleton, S. Inverse entailment and Progol. New Generation Comput. J. 13, 245–286 (1995)
Rabitz, H., de Vivie-Riedle, R., Motzkus, M. & Kompa, K. Whither the future of controlling quantum phenomena? Science 288, 824–828 (2000)
Cochran, W. G. & Cox, G. M. Experimental Designs (Wiley, New York, 1992)
Brachmann, C. B. et al. Designer deletion strains derived from Saccharomyces cerevisiae S288C: a useful set of strains and plasmids for PCR-mediated gene disruption and other applications. Yeast 14, 115–132 (1998)
Giaever, G. et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature 418, 387–391 (2002)
Acknowledgements
We thank D. Page, U. Sarkans, A. Tamaddoni, M. Sternberg, A. Sloman and D. Michie for their help and advice, and D. Struttman for technical assistance. The work was funded by the BBSRC, the EPSRC, the Wellcome Trust and PharmDM.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
R.D.K. is a co-founder and member of the scientific board of PharmaDM, Kapeldreef 60, B-3001 Heverlee, Belgium.
Supplementary information
Rights and permissions
About this article
Cite this article
King, R., Whelan, K., Jones, F. et al. Functional genomic hypothesis generation and experimentation by a robot scientist. Nature 427, 247–252 (2004). https://doi.org/10.1038/nature02236
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1038/nature02236
This article is cited by
-
A robotic platform for the synthesis of colloidal nanocrystals
Nature Synthesis (2023)
-
Knowledge-integrated machine learning for materials: lessons from gameplaying and robotics
Nature Reviews Materials (2023)
-
BacterAI maps microbial metabolism without prior knowledge
Nature Microbiology (2023)
-
Robotic automation and unsupervised cluster assisted modeling for solving the forward and reverse design problem of paper airplanes
Scientific Reports (2023)
-
Explanatory machine learning for sequential human teaching
Machine Learning (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.