Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

Using machine learning to predict the structure of proteins that bind to DNA and RNA

We developed a machine learning model, RoseTTAFoldNA, that can predict the structures of protein–DNA and protein–RNA complexes. Our model is capable of predicting accurate structures of protein families for which structural information is unknown.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Example predictions of a protein–RNA and protein–DNA complex.

References

  1. Lambert, S. et al. The human transcription factors. Cell 172, 650–665 (2018). This paper gives an overview of human transcription factors, including some understudied families.

    Article  CAS  PubMed  Google Scholar 

  2. Joyce, A. et al. Structure-based modeling of protein: DNA specificity. Brief. Funct. Genomics 14, 39–49 (2021). This paper reviews structure-based predictions of DNA binding specificity.

    Article  Google Scholar 

  3. Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021). This paper reports AlphaFold2, a neural network trained to predict protein structure with a high degree of accuracy.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021). This paper presents RoseTTAFold, an alternative neural network architecture to AlphaFold that also predicts protein structure to a high degree of accuracy.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Castro-Mondragon, J. A. et al. JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 50, D165–D173 (2022). This paper describes the curation of JASPER, a database of known protein–DNA interactions.

    Article  CAS  PubMed  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Baek, M. et al. Accurate prediction of protein–nucleic acid complexes using RoseTTAFoldNA. Nat. Methods https://doi.org/10.1038/s41592-023-02086-5 (2023).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Using machine learning to predict the structure of proteins that bind to DNA and RNA. Nat Methods 21, 22–23 (2024). https://doi.org/10.1038/s41592-023-02088-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41592-023-02088-3

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing