Access

Review

Nature Reviews Genetics 3, 601–610 (1 August 2002) | doi:10.1038/nrg861

Genomics and natural language processing

Mark D. Yandell & William H. Majoros

The Human Genome and MEDLINE are both the foci of intense data-mining efforts worldwide. The biomedical literature has much to say about sequence, but it also seems that sequence can tell us much about the biomedical literature. Biological natural language processing is an emerging field of research that seeks to explore systematically the relationships between genes, sequences and the biomedical literature as a basis for a new generation of data-mining tools.