Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Volume 6 Issue 2, February 2024

Dynamic biomolecular complex prediction with generative AI

Predicting the structure of 3D biological binding complexes is a major challenge in structural biology. Qiao et al. report a diffusion model-based generative AI approach known as NeuralPLexer that enables the prediction of protein–ligand structures, including large-scale conformational changes of such structures after ligand binding, based on protein sequences and ligand molecular graphs. The methodology could advance the mechanistic understanding of biological pathways and aid the discovery of new therapeutic agents.

See Qiao et al.

Image: Frederick R. Manby and Matthew Welborn. Cover design: Amie Fernandez

Editorial

  • One of the most successful areas for deep learning in scientific discovery has been protein predictions and engineering. We take a closer look at four studies in this issue that advance protein science with innovative deep learning approaches.

    Editorial

    Advertisement

Top of page ⤴

News & Views

  • Recent work has demonstrated important parallels between human visual representations and those found in deep neural networks. A new study comparing functional MRI data to deep neural network models highlights factors that may determine this similarity.

    • Katja Seeliger
    • Martin N. Hebart
    News & Views
Top of page ⤴

Reviews

  • Training a machine learning model with multiple tasks can create more-useful representations and achieve better performance than training models for each task separately. In this Perspective, Allenspach et al. summarize and compare multi-task learning methods for computer-aided drug design.

    • Stephan Allenspach
    • Jan A. Hiss
    • Gisbert Schneider
    Perspective
  • Machine learning algorithms play important roles in medical imaging analysis but can be affected by biases in training data. Jones and colleagues discuss how causal reasoning can be used to better understand and tackle algorithmic bias in medical imaging analysis.

    • Charles Jones
    • Daniel C. Castro
    • Ben Glocker
    Perspective
Top of page ⤴

Research

  • Recent years have seen many advances in deep learning models for protein design, usually involving a large amount of training data. Focusing on potential clinical impact, Garton et al. develop a variational autoencoder approach trained on sparse data of natural sequences of adenoviruses to generate large proteins that can be used as viral vectors in gene therapy.

    • Suyue Lyu
    • Shahin Sowlati-Hashjin
    • Michael Garton
    Article
  • Machine learning techniques are widely employed in chemical science, but are application specific and their development requires dedicated expertise. Jablonka and colleagues fine-tune the GPT-3 model and show that it can provide surprisingly accurate answers to a wide range of chemical questions.

    • Kevin Maik Jablonka
    • Philippe Schwaller
    • Berend Smit
    Article Open Access
  • Machine learning methods have made great advances in modelling protein sequences for a variety of downstream tasks. The representation used as input for these models has been primarily the sequence of amino acids. Outeiral and Deane show that using codon sequences instead can improve protein representations and lead to model performance.

    • Carlos Outeiral
    • Charlotte M. Deane
    Article Open Access
  • Denoising low-counting statistics data in the presence of multiple, unknown noise profiles is a challenging task in scientific applications where high accuracy is required. Oppliger and colleagues train a deep convolutional neural network on pairs of experimental low- and high-noise X-ray diffraction data and demonstrate better performance on experimental noise filtering compared with the case of training on artificial data pairs.

    • Jens Oppliger
    • M. Michael Denner
    • Johan Chang
    Article Open Access
  • Algorithmic decisions have a history of harming already marginalized populations. In an effort to combat these discriminative patterns, data-driven methods are used to comprehend these patterns, and recently also to identify disadvantaged communities to allocate resources. Huynh et al. analyse one of these tools and show a concerning sensitivity to input parameters that can lead to unintentional biases with substantial financial consequences.

    • Benjamin Q. Huynh
    • Elizabeth T. Chin
    • David H. Rehkopf
    Article Open Access
  • Great advances in protein structure prediction have been made with recent deep learning-based methods, but proteins interact with their environment and can change shape drastically when binding to ligand molecules. To predict the 3D structure of these combined protein–ligand complexes, Qiao et al. developed a generative diffusion model with biophysical constraints and geometric deep learning.

    • Zhuoran Qiao
    • Weili Nie
    • Animashree Anandkumar
    Article
  • Deep learning language models have proved useful for both natural language and protein modelling. Similar to semantics in natural language, protein functions are complex and depend on the context of their environment, rather than on the similarity of sequences. Kulmanov and colleagues present an approach to frame function prediction as semantic entailment using a neuro-symbolic model to augment a large protein language model.

    • Maxat Kulmanov
    • Francisco J. Guzmán-Vega
    • Robert Hoehndorf
    Article Open Access
  • AI-enabled diagnostic applications in healthcare can be powerful, but study design is very important to avoid subtle issues of bias in the dataset and evaluation. Coppock et al. demonstrate how an AI-based classifier for diagnosing SARS-Cov-2 infection from audio recordings can seem to make predictions with high accuracy but shows much lower performance after taking into account confounders, providing insights in study design and replicability in AI-based audio analysis.

    • Harry Coppock
    • George Nicholson
    • Chris Holmes
    Article Open Access
Top of page ⤴

Challenge Accepted

Top of page ⤴

Search

Quick links