Realistic quantum mechanical simulations are computationally costly to perform but can be approximated using neural network models. Li and colleagues propose a forward propagation method in lieu of traditional backpropagation to speed up these neural network-based approaches.
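As a rough illustration of the general idea of forward-mode training (a generic forward-gradient sketch, not the authors' specific algorithm), one can replace backpropagation by sampling a random direction, measuring the objective's directional derivative in a forward evaluation, and updating along that direction scaled by the derivative. A minimal NumPy sketch, with a finite difference standing in for exact forward-mode autodiff:

```python
import numpy as np

def forward_gradient_step(f, w, lr=0.1, eps=1e-6, rng=None):
    if rng is None:
        rng = np.random.default_rng()
    # Sample a random tangent direction v for this step.
    v = rng.standard_normal(w.shape)
    # Directional derivative f'(w; v), approximated by a finite
    # difference (forward-mode autodiff would give it exactly).
    d = (f(w + eps * v) - f(w)) / eps
    # (grad . v) v is an unbiased estimate of the gradient.
    return w - lr * d * v

# Toy quadratic objective with its minimum at w = [1, -2].
f = lambda w: np.sum((w - np.array([1.0, -2.0])) ** 2)
w = np.zeros(2)
rng = np.random.default_rng(0)
for _ in range(500):
    w = forward_gradient_step(f, w, lr=0.05, rng=rng)
print(np.round(w, 2))
```

The estimate is noisy per step, but its expectation over random directions equals the true gradient, so the iterates drift toward the minimum without any backward pass.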
Great advances in protein structure prediction have been made with recent deep learning-based methods, but proteins interact with their environment and can change shape drastically when binding to ligand molecules. To predict the 3D structure of these combined protein–ligand complexes, Qiao et al. developed a generative diffusion model with biophysical constraints and geometric deep learning.
AI-enabled diagnostic applications in healthcare can be powerful, but careful study design is needed to avoid subtle bias in datasets and evaluation. Coppock et al. demonstrate how an AI-based classifier for diagnosing SARS-CoV-2 infection from audio recordings can appear to predict with high accuracy yet perform much worse once confounders are taken into account, providing insights into study design and replicability in AI-based audio analysis.
Machine learning techniques are widely employed in chemical science, but are application specific and their development requires dedicated expertise. Jablonka and colleagues fine-tune the GPT-3 model and show that it can provide surprisingly accurate answers to a wide range of chemical questions.
Recent years have seen many advances in deep learning models for protein design, usually involving a large amount of training data. Focusing on potential clinical impact, Garton et al. develop a variational autoencoder approach trained on sparse data of natural sequences of adenoviruses to generate large proteins that can be used as viral vectors in gene therapy.
Designing antibodies and assessing their biophysical properties for potential therapeutic development is challenging with current computational methods. Ramon et al. have developed a deep learning approach called AbNatiV, based on a vector-quantized variational autoencoder, that accurately assesses the nativeness of antibodies and of nanobodies, the small single-domain antibodies that have recently attracted considerable interest.
Drug design has recently seen immense improvements in computational methods, but models can still struggle to generalize across binding pockets. Feng and colleagues combine a language model with geometric deep learning to provide efficient generation of potential new drugs.
Accurate real-time tracking of dexterous hand movements and interactions has applications in human–computer interaction, the metaverse, robotics and tele-health. Capturing realistic hand movements is challenging due to the large number of articulations and degrees of freedom. Tashakori and colleagues report accurate and dynamic tracking of articulated hand and finger movements using machine-learning-powered stretchable, washable smart gloves.
Magnetic microrobots are of considerable interest for non-invasive biomedical applications, but it is challenging to develop a general strategy for controlling microrobot positions across varying configurations and environments. Choi et al. develop a reinforcement learning control method, training the model in a simulation environment for initial exploration, after which the learning process is transferred to a physical electromagnetic actuation system.
Multi-animal behaviour quantification is pivotal for deciphering animal social behaviours and has broad applications in neuroscience and ecology. Han and colleagues develop a few-shot learning framework for multi-animal 3D pose estimation, identity recognition and social behaviour classification.
Feed-forward neural networks have become powerful tools in machine learning, but their behaviour during optimization is still not well understood. Ciceri and colleagues find that during optimization, class representations first separate and then rejoin, prompted by specific elements of the training set.
Machine learning methods in cheminformatics have made great progress in using chemical structures of molecules, but a large portion of textual information remains scarcely explored. Liu and colleagues trained MoleculeSTM, a foundation model that aligns the structure and text modalities through contrastive learning, and show its utility on the downstream tasks of structure–text retrieval, text-guided editing and molecular property prediction.
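Contrastive alignment of two modalities, as used by MoleculeSTM, trains encoders so that embeddings of matched structure–text pairs score higher than mismatched ones. A minimal NumPy sketch of a symmetric InfoNCE-style loss over precomputed embeddings (illustrative only; the paper's encoders and loss details differ):

```python
import numpy as np

def info_nce(z_struct, z_text, tau=0.1):
    # Cosine-normalize embeddings from the two modalities.
    a = z_struct / np.linalg.norm(z_struct, axis=1, keepdims=True)
    b = z_text / np.linalg.norm(z_text, axis=1, keepdims=True)
    logits = a @ b.T / tau        # pairwise similarity scores
    labels = np.arange(len(a))    # matched pairs lie on the diagonal

    def xent(l):
        # Cross-entropy of each row against its diagonal entry.
        l = l - l.max(axis=1, keepdims=True)
        p = np.exp(l) / np.exp(l).sum(axis=1, keepdims=True)
        return -np.log(p[labels, labels]).mean()

    # Symmetric: structure-to-text and text-to-structure.
    return 0.5 * (xent(logits) + xent(logits.T))

rng = np.random.default_rng(0)
zs = rng.standard_normal((8, 16))                  # structure embeddings
zt_good = zs + 0.1 * rng.standard_normal((8, 16))  # aligned text embeddings
zt_bad = rng.standard_normal((8, 16))              # unrelated text embeddings
print(info_nce(zs, zt_good), info_nce(zs, zt_bad))
```

Minimizing this loss pulls matched pairs together and pushes mismatched pairs apart, which is what later enables structure–text retrieval and text-guided editing.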
Theoretical frameworks aiming to understand deep learning rely on a so-called infinite-width limit, in which the ratio between the training set size and the width of the hidden layers goes to zero. Pacelli and colleagues go beyond this restrictive framework by computing the partition function and generalization properties of fully connected, nonlinear neural networks, both with one and with multiple hidden layers, for the practically more relevant scenario in which the above ratio is finite and arbitrary.
Interest in using large language models such as ChatGPT has grown rapidly, but concerns about safe and responsible use have emerged, in part because adversarial prompts can bypass existing safeguards with so-called jailbreak attacks. Wu et al. build a dataset of various types of jailbreak attack prompt and demonstrate a simple but effective technique to counter these attacks by encapsulating users’ prompts in another standard prompt that reminds ChatGPT to respond responsibly.
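The defence amounts to wrapping the user's raw prompt inside a standing reminder before it reaches the model. A minimal sketch of such an encapsulation step (the wrapper text here is illustrative, not the exact wording used by the authors):

```python
# Illustrative reminder template; the exact phrasing in the paper differs.
REMINDER = (
    "You should be a responsible assistant and should not generate "
    "harmful or misleading content.\n"
    "Please answer the following user query in a responsible way:\n"
    "{query}\n"
    "Remember: respond responsibly and refuse harmful requests."
)

def encapsulate(user_prompt: str) -> str:
    """Wrap the raw user prompt in a self-reminder before sending it on."""
    return REMINDER.format(query=user_prompt)

wrapped = encapsulate("How do I bake bread?")
print(wrapped)
```

Because the jailbreak text is sandwiched between safety reminders, the model is repeatedly cued toward its guidelines without any retraining or filtering of the user's words.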
Machine learning models have been widely used in the inverse design of new materials, but typically only linear properties can be targeted. Bastek and Kochmann show that video diffusion generative models can produce the nonlinear deformation and stress response of cellular materials under large-scale compression.
Virtual drug design has seen recent progress in methods that can generate new molecules with specific properties. Separately, methods have also improved in the task of computationally predicting the outcome of chemical reactions. Qiang and colleagues use the close relation of the two problems to train a model that aims at solving both tasks.
Data-driven surrogate models are used in computational physics and engineering to greatly speed up evaluations of the properties of partial differential equations, but they come with a heavy computational cost associated with training. Pestourie et al. combine a low-fidelity physics model with a generative deep neural network and demonstrate improved accuracy–cost trade-offs compared with standard deep neural networks and high-fidelity numerical solvers.
Single-cell transcriptomics has provided a powerful approach to investigate cellular properties at unprecedented resolution. Sha et al. have developed an optimal transport-based algorithm called TIGON that can connect transcriptomic snapshots from different time points to obtain collective dynamical information, including cell population growth and the underlying gene regulatory network.
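The core idea of coupling snapshots by optimal transport can be illustrated with entropy-regularized transport (Sinkhorn iterations) between two point clouds standing in for cells at consecutive time points. This is a generic sketch, not TIGON itself, which additionally models growth and regulatory dynamics:

```python
import numpy as np

def sinkhorn(cost, reg=0.1, n_iter=200):
    # Entropic optimal transport between two uniform point clouds.
    n, m = cost.shape
    mu, nu = np.full(n, 1 / n), np.full(m, 1 / m)
    K = np.exp(-cost / reg)
    u = np.ones(n)
    for _ in range(n_iter):
        v = nu / (K.T @ u)
        u = mu / (K @ v)
    return u[:, None] * K * v[None, :]   # transport plan

rng = np.random.default_rng(0)
x0 = rng.standard_normal((5, 2))               # "cells" at time t0
x1 = x0 + 0.05 * rng.standard_normal((5, 2))   # slightly moved at t1
C = ((x0[:, None, :] - x1[None, :, :]) ** 2).sum(-1)  # squared distances
P = sinkhorn(C, reg=0.01)
print(np.round(P, 2))
```

The resulting plan `P` assigns each time-t0 cell a distribution over time-t1 cells, which is the kind of soft correspondence from which collective dynamics can then be read off.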
A fundamental question in neuroscience is which constraints shape the structural and functional organization of the brain. By bringing biological cost constraints into the optimization process of artificial neural networks, Achterberg, Akarca and colleagues uncover a joint principle underlying a large set of neuroscientific findings.
Deep learning is a powerful method for processing large datasets and has proved useful in many scientific fields, but models are highly parameterized, and interpretation and generalization remain challenging. David Gleich and colleagues develop a method rooted in computational topology, starting with a graph-based topological representation of the data, to help assess and diagnose predictions from deep learning and other complex prediction methods.