Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

Protein language model enables fast and sensitive remote homolog detection

To handle increasingly large protein databases, a new ultrafast, highly sensitive method — Dense Homolog Retriever (DHR) — detects remote homologs using dense retrieval and protein language models. Its alignment-free nature makes it much faster than traditional approaches, and the newly found remote homologs benefit our understanding of protein evolution, structure and function.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: DHR for fast detection of remote homologs and an improved understanding of proteins.

References

  1. Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021). This paper proposed AlphaFold2, a deep learning model to predict protein structure accurately.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Buchfink, B. et al. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015). This paper introduced an alignment algorithm that can achieve high sensitivity while being much faster than the gold standards.

    Article  CAS  PubMed  Google Scholar 

  3. Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379, 1123–1130 (2023). This paper proposed a protein language model trained on a large scale and a structure prediction model that uses only a single sequence.

    Article  CAS  PubMed  Google Scholar 

  4. Alexander, L. et al. Protein target highlights in CASP15: analysis of models by structure providers. Proteins 91, 1571–1599 (2023). This paper presented CASP, a challenge aiming at establishing the current state of the art in protein structure prediction.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022). This paper proposed an accelerated method and an accessible platform for protein structure prediction.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Hong, L. et al. Fast, sensitive detection of protein homologs using deep dense retrieval. Nat. Biotechnol. https://doi.org/10.1038/s41587-024-02353-6 (2024).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Protein language model enables fast and sensitive remote homolog detection. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02359-0

Download citation

  • Published:

  • DOI: https://doi.org/10.1038/s41587-024-02359-0

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research