Research articles

Mitigating allocative tradeoffs and harms in an environmental justice data tool

Algorithmic decisions have a history of harming already marginalized populations. In an effort to combat these discriminative patterns, data-driven methods are used to comprehend these patterns, and recently also to identify disadvantaged communities to allocate resources. Huynh et al. analyse one of these tools and show a concerning sensitivity to input parameters that can lead to unintentional biases with substantial financial consequences.

Benjamin Q. Huynh
Elizabeth T. Chin
David H. Rehkopf
ArticleOpen Access16 Feb 2024
Reusability report: Unpaired deep-learning approaches for holographic image reconstruction

A parameterized physical model that uses unpaired datasets for adaptive holographic imaging was published in Nature Machine Intelligence in 2023. Zhang and colleagues evaluate its performance and extend it to non-perfect optical systems by integrating specific optical response functions.

Yuhe Zhang
Tobias Ritschel
Pablo Villanueva-Perez
ArticleOpen Access15 Feb 2024
Protein function prediction as approximate semantic entailment

Deep learning language models have proved useful for both natural language and protein modelling. Similar to semantics in natural language, protein functions are complex and depend on the context of their environment, rather than on the similarity of sequences. Kulmanov and colleagues present an approach to frame function prediction as semantic entailment using a neuro-symbolic model to augment a large protein language model.

Maxat Kulmanov
Francisco J. Guzmán-Vega
Robert Hoehndorf
ArticleOpen Access14 Feb 2024
Weak signal extraction enabled by deep neural network denoising of diffraction data

Denoising low-counting statistics data in the presence of multiple, unknown noise profiles is a challenging task in scientific applications where high accuracy is required. Oppliger and colleagues train a deep convolutional neural network on pairs of experimental low- and high-noise X-ray diffraction data and demonstrate better performance on experimental noise filtering compared with the case of training on artificial data pairs.

Jens Oppliger
M. Michael Denner
Johan Chang
ArticleOpen Access13 Feb 2024
A computational framework for neural network-based variational Monte Carlo with Forward Laplacian

Realistic quantum mechanical simulations are computationally costly to perform but can be approximated using neural network models. Li and colleagues propose a forward propagation method in lieu of traditional backpropagation to speed up these neural network-based approaches.

Ruichen Li
Haotian Ye
Liwei Wang
Article13 Feb 2024
State-specific protein–ligand complex structure prediction with a multiscale deep generative model

Great advances in protein structure prediction have been made with recent deep learning-based methods, but proteins interact with their environment and can change shape drastically when binding to ligand molecules. To predict the 3D structure of these combined protein–ligand complexes, Qiao et al. developed a generative diffusion model with biophysical constraints and geometric deep learning.

Zhuoran Qiao
Weili Nie
Animashree Anandkumar
Article12 Feb 2024
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

AI-enabled diagnostic applications in healthcare can be powerful, but study design is very important to avoid subtle issues of bias in the dataset and evaluation. Coppock et al. demonstrate how an AI-based classifier for diagnosing SARS-Cov-2 infection from audio recordings can seem to make predictions with high accuracy but shows much lower performance after taking into account confounders, providing insights in study design and replicability in AI-based audio analysis.

Harry Coppock
George Nicholson
Chris Holmes
ArticleOpen Access07 Feb 2024
Leveraging large language models for predictive chemistry

Machine learning techniques are widely employed in chemical science, but are application specific and their development requires dedicated expertise. Jablonka and colleagues fine-tune the GPT-3 model and show that it can provide surprisingly accurate answers to a wide range of chemical questions.

Kevin Maik Jablonka
Philippe Schwaller
Berend Smit
ArticleOpen Access06 Feb 2024
Variational autoencoder for design of synthetic viral vector serotypes

Recent years have seen many advances in deep learning models for protein design, usually involving a large amount of training data. Focusing on potential clinical impact, Garton et al. develop a variational autoencoder approach trained on sparse data of natural sequences of adenoviruses to generate large proteins that can be used as viral vectors in gene therapy.

Suyue Lyu
Shahin Sowlati-Hashjin
Michael Garton
Article23 Jan 2024
Assessing antibody and nanobody nativeness for hit selection and humanization with AbNatiV

Designing antibodies and assessing their biophysical properties for potential therapeutic development is challenging with current computational methods. Ramon et al. have developed a deep learning approach called AbNatiV, based on a vector-quantized variational encoder that accurately assesses the nativeness of antibodies and nanobodies, which are small single-domain antibodies that have recently attracted considerable interest.

Aubin Ramon
Montader Ali
Pietro Sormanni
ArticleOpen Access15 Jan 2024
Generation of 3D molecules in pockets via a language model

Drug design has recently seen immense improvements in computational methods, but models can still struggle generalizing across binding pockets. Feng and colleagues combine a language model with geometric deep learning to provide efficient generation of potential new drugs.

Wei Feng
Lvwei Wang
Wenbiao Zhou
ArticleOpen Access15 Jan 2024
Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile gloves

Accurate real-time tracking of dexterous hand movements and interactions has applications in human–computer interaction, the metaverse, robotics and tele-health. Capturing realistic hand movements is challenging due to the large number of articulations and degrees of freedom. Tashakori and colleagues report accurate and dynamic tracking of articulated hand and finger movements using machine-learning powered stretchable, washable smart gloves.

Arvin Tashakori
Zenan Jiang
Peyman Servati
Article12 Jan 2024
Autonomous 3D positional control of a magnetic microrobot using reinforcement learning

Magnetic microrobots are of considerable interest for non-invasive biomedical applications but it is challenging to develop a general strategy for controlling microrobot positions, for varying configurations and environments. Choi et al. develop a reinforcement learning control method, training the model in a simulation environment for initial exploration after which the learning process is transferred to a physical electromagnetic actuation system.

Sarmad Ahmad Abbasi
Awais Ahmed
Hongsoo Choi
Article10 Jan 2024
Multi-animal 3D social pose estimation, identification and behaviour embedding with a few-shot learning framework

Multi-animal behaviour quantification is pivotal for deciphering animal social behaviours and has broad applications in neuroscience and ecology. Han and colleagues develop a few-shot learning framework for multi-animal 3D pose estimation, identity recognition and social behaviour classification.

Yaning Han
Ke Chen
Pengfei Wei
ArticleOpen Access08 Jan 2024
Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalization

Feed-forward neural networks have become powerful tools in machine learning, but their behaviour during optimization is still not well understood. Ciceri and colleagues find that during optimization, class representations first separate and then rejoin, prompted by specific elements of the training set.

Simone Ciceri
Lorenzo Cassani
Marco Gherardi
Article08 Jan 2024
Multi-modal molecule structure–text model for text-based retrieval and editing

Machine learning methods in cheminformatics have made great progress in using chemical structures of molecules, but a large portion of textual information remains scarcely explored. Liu and colleagues trained MoleculeSTM, a foundation model that aligns the structure and text modalities through contrastive learning, and show its utility on the downstream tasks of structure–text retrieval, text-guided editing and molecular property prediction.

Shengchao Liu
Weili Nie
Animashree Anandkumar
Article18 Dec 2023
A statistical mechanics framework for Bayesian deep neural networks beyond the infinite-width limit

Theoretical frameworks aiming to understand deep learning rely on a so-called infinite-width limit, in which the ratio between the width of hidden layers and the training set size goes to zero. Pacelli and colleagues go beyond this restrictive framework by computing the partition function and generalization properties of fully connected, nonlinear neural networks, both with one and with multiple hidden layers, for the practically more relevant scenario in which the above ratio is finite and arbitrary.

R. Pacelli
S. Ariosto
P. Rotondo
Article18 Dec 2023
Defending ChatGPT against jailbreak attack via self-reminders

Interest in using large language models such as ChatGPT has grown rapidly, but concerns about safe and responsible use have emerged, in part because adversarial prompts can bypass existing safeguards with so-called jailbreak attacks. Wu et al. build a dataset of various types of jailbreak attack prompt and demonstrate a simple but effective technique to counter these attacks by encapsulating users’ prompts in another standard prompt that reminds ChatGPT to respond responsibly.

Yueqi Xie
Jingwei Yi
Fangzhao Wu
Article12 Dec 2023
Inverse design of nonlinear mechanical metamaterials via video denoising diffusion models

Machine learning models have been widely used in the inverse design of new materials, but typically only linear properties could be targeted. Bastek and Kochmann show that video diffusion generative models can produce the nonlinear deformation and stress response of cellular materials under large-scale compression.

Jan-Hendrik Bastek
Dennis M. Kochmann
ArticleOpen Access11 Dec 2023
Bridging the gap between chemical reaction pretraining and conditional molecule generation with a unified model

Virtual drug design has seen recent progress in methods that can generate new molecules with specific properties. Separately, methods have also improved in the task of computationally predicting the outcome of chemical reactions. Qiang and colleagues use the close relation of the two problems to train a model that aims at solving both tasks.

Bo Qiang
Yiran Zhou
Zhenming Liu
Article05 Dec 2023

Research articles

Mitigating allocative tradeoffs and harms in an environmental justice data tool

Reusability report: Unpaired deep-learning approaches for holographic image reconstruction

Protein function prediction as approximate semantic entailment

Weak signal extraction enabled by deep neural network denoising of diffraction data

A computational framework for neural network-based variational Monte Carlo with Forward Laplacian

State-specific protein–ligand complex structure prediction with a multiscale deep generative model

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

Leveraging large language models for predictive chemistry

Variational autoencoder for design of synthetic viral vector serotypes

Assessing antibody and nanobody nativeness for hit selection and humanization with AbNatiV

Generation of 3D molecules in pockets via a language model

Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile gloves

Autonomous 3D positional control of a magnetic microrobot using reinforcement learning

Multi-animal 3D social pose estimation, identification and behaviour embedding with a few-shot learning framework

Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalization

Multi-modal molecule structure–text model for text-based retrieval and editing

A statistical mechanics framework for Bayesian deep neural networks beyond the infinite-width limit

Defending ChatGPT against jailbreak attack via self-reminders

Inverse design of nonlinear mechanical metamaterials via video denoising diffusion models

Bridging the gap between chemical reaction pretraining and conditional molecule generation with a unified model

Search

Quick links

Research articles

Filter By:

Search

Quick links