Research articles

  • Deep learning is a powerful method for processing large datasets and has proved useful in many scientific fields, but its models are highly parameterized and often pose challenges for interpretation and generalization. David Gleich and colleagues develop a method rooted in computational topology, starting with a graph-based topological representation of the data, to help assess and diagnose predictions from deep learning and other complex prediction methods.

    • Meng Liu
    • Tamal K. Dey
    • David F. Gleich
    Article | Open Access
  • Continual learning is an innate ability of biological intelligence to accommodate real-world changes, but it remains challenging for artificial intelligence. Wang, Zhang and colleagues model key mechanisms of a biological learning system, in particular active forgetting and parallel modularity, incorporating neuro-inspired adaptability to improve continual learning in artificial intelligence systems.

    • Liyuan Wang
    • Xingxing Zhang
    • Yi Zhong
    Article
  • Prediction of high-level visual representations in the human brain may benefit from multimodal sources in network training and the incorporation of complex datasets. Wang and colleagues show that language pretraining and a large, diverse dataset together build better models of higher-level visual cortex compared to earlier models.

    • Aria Y. Wang
    • Kendrick Kay
    • Leila Wehbe
    Article
  • Graph neural networks have proved useful in modelling proteins and their ligand interactions, but it is not clear whether the patterns they identify have biological relevance or whether interactions are merely memorized. Mastropietro et al. use a Shapley value-based method to identify important edges in protein interaction graphs, enabling explanatory analysis of the model mechanisms.

    • Andrea Mastropietro
    • Giuseppe Pasculli
    • Jürgen Bajorath
    Article
  • Halide perovskites are promising materials for light-emitting devices, given their narrowband emission and solution processability. However, detailed information on device degradation during operation is required to improve their stability, and this is challenging to obtain. Ji et al. propose a self-supervised deep learning method to capture multi-dimensional images of such devices in their operating regime faster than allowed by conventional imaging techniques.

    • Kangyu Ji
    • Weizhe Lin
    • Samuel D. Stranks
    Article | Open Access
  • The reconstruction of dynamic, spatial fields from sparse sensor data is an important challenge in various fields of science and technology. Santos et al. introduce the Senseiver, a deep learning framework that reconstructs spatial fields from few observations using attention layers to encode and decode sparse data, enabling efficient inference.

    • Javier E. Santos
    • Zachary R. Fox
    • Nicholas Lubbers
    Article | Open Access
  • Geometric deep learning has become a powerful tool in virtual drug design, but it is not always obvious when a model makes incorrect predictions. Luo and colleagues improve the accuracy of their deep learning model using uncertainty calibration and Bayesian optimization in an active learning cycle.

    • Yunan Luo
    • Yang Liu
    • Jian Peng
    Article
  • Human and animal motion planning operates at various timescales to allow the completion of complex tasks. Inspired by this natural strategy, Yuan and colleagues present a hierarchical motion planning approach for robotics, using deep reinforcement learning and predictive proprioception.

    • Kai Yuan
    • Noor Sajid
    • Zhibin Li
    Article | Open Access
  • Organisms show complex behaviour resulting from a trade-off between obtaining information (explore) and using current information (exploit). Biswas et al. observe a mode-switching strategy modulated by sensory salience in a diverse range of organisms, including electric fish and humans, and argue that the observed heuristic could inform the design of active-sensing behaviours in robotics.

    • Debojyoti Biswas
    • Andrew Lamperski
    • Noah J. Cowan
    Article | Open Access
  • Prime editors are innovative genome-editing tools, but selecting guide RNAs with high efficiency remains challenging and requires costly experimental efforts. Liu and colleagues develop a method to design prime-editing guide RNAs based on transfer learning for in silico prediction of editing efficacy.

    • Feng Liu
    • Shuhong Huang
    • Wenjie Shu
    Article | Open Access
  • Learning causal relationships between variables in large datasets is an outstanding challenge in various scientific applications. Lagemann et al. introduce a deep neural network approach combining convolutional and graph models intended for causal learning in high-dimensional biomedical problems.

    • Kai Lagemann
    • Christian Lagemann
    • Sach Mukherjee
    Article | Open Access
  • Deep learning methods in natural language processing generally become more effective with larger datasets and bigger networks, but it is not evident whether the same is true for more specialized domains such as cheminformatics. Frey and colleagues provide empirical explorations of chemistry models and find that neural scaling laws hold true even for the largest tested models and datasets.

    • Nathan C. Frey
    • Ryan Soklaski
    • Vijay Gadepally
    Article | Open Access
  • The immense number of Wikipedia articles makes it challenging for volunteers to ensure that cited sources support the claims they are attached to. Petroni et al. use an information-retrieval model to assist Wikipedia users in improving verifiability.

    • Fabio Petroni
    • Samuel Broscheit
    • Sebastian Riedel
    Article | Open Access
  • With the rapid development of natural language processing (NLP) models in the last decade came the realization that high performance levels on test sets do not imply that a model robustly generalizes to a wide range of scenarios. Hupkes et al. review generalization approaches in the NLP literature and propose a taxonomy based on five axes to analyse such studies: motivation, type of generalization, type of data shift, the source of this data shift, and the locus of the shift within the modelling pipeline.

    • Dieuwke Hupkes
    • Mario Giulianelli
    • Zhijing Jin
    Analysis | Open Access
  • The number of publications in artificial intelligence (AI) has been increasing exponentially and staying on top of progress in the field is a challenging task. Krenn and colleagues model the evolution of the growing AI literature as a semantic network and use it to benchmark several machine learning methods that can predict promising research directions in AI.

    • Mario Krenn
    • Lorenzo Buffoni
    • Michael Kopp
    Analysis | Open Access