Deep learning methods in natural language processing generally become more effective with larger datasets and bigger networks, but it is not evident whether the same is true in more specialized domains such as cheminformatics. Frey and colleagues empirically explore chemistry models and find that neural scaling laws continue to hold for the largest models and datasets tested.
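A neural scaling law relates a model's loss to its size (or dataset size) through a power law. As a minimal illustration of the idea, with invented numbers that are not from the paper, such a law can be fitted by linear regression in log-log space:

```python
import numpy as np

# Hypothetical validation losses for models of increasing parameter count.
# A neural scaling law posits loss(N) ≈ a * N**(-b); taking logarithms gives
# log(loss) = log(a) - b * log(N), a straight line we can fit directly.
model_sizes = np.array([1e6, 1e7, 1e8, 1e9])
losses = np.array([2.5, 1.9, 1.45, 1.1])  # illustrative numbers only

slope, intercept = np.polyfit(np.log(model_sizes), np.log(losses), 1)
a, b = np.exp(intercept), -slope
print(f"fitted scaling law: loss ≈ {a:.2f} * N^(-{b:.3f})")
```

If the fitted exponent `b` stays stable as ever-larger models are added, the scaling law "holds" at that scale, which is the kind of empirical question the paper examines for chemistry models.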
The immense number of Wikipedia articles makes it challenging for volunteers to ensure that cited sources support the claims they are attached to. Petroni et al. use an information-retrieval model to assist Wikipedia users in improving verifiability.
Identifying unknown peptides in tandem mass spectrometry is challenging as fragmentation of precursor peptides can be incomplete. Mao and colleagues present a method based on graph neural networks and a path-searching model to create more stable sequence predictions.
With the rapid development of natural language processing (NLP) models in the last decade came the realization that high performance levels on test sets do not imply that a model robustly generalizes to a wide range of scenarios. Hupkes et al. review generalization approaches in the NLP literature and propose a taxonomy based on five axes to analyse such studies: motivation, type of generalization, type of data shift, the source of this data shift, and the locus of the shift within the modelling pipeline.
Computational methods for analysing single 2D tissue slices from spatial transcriptomics studies are well established, but their extension to the 3D domain is challenging. Wang et al. develop a deep learning framework that can perform 3D reconstruction of cellular structures in tissues as well as whole organisms.
Contact prediction between two proteins is still computationally challenging, but is vital for understanding multi-protein complexes. Lin et al. use a geometric deep learning approach to provide accurate predictions of inter-protein residue–residue contacts.
Deconvolution of cell types in tissue proteomic data is a challenging computational task for the bioinformatics community. A deep-learning method termed scpDeconv is introduced that makes efficient use of single-cell proteomics data to deconvolve cell types and states from bulk proteomics measurements.
The number of publications in artificial intelligence (AI) has been increasing exponentially and staying on top of progress in the field is a challenging task. Krenn and colleagues model the evolution of the growing AI literature as a semantic network and use it to benchmark several machine learning methods that can predict promising research directions in AI.
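Predicting promising research directions in such a semantic network amounts to link prediction: ranking pairs of currently unconnected concepts by how likely they are to be connected by future papers. The toy network and the common-neighbours baseline below are illustrative only; the paper benchmarks far stronger learned predictors.

```python
import numpy as np

# Toy semantic network: nodes are concepts; an edge means two concepts
# have co-occurred in a paper. Adjacency matrix for 5 concepts.
adj = np.array([
    [0, 1, 1, 0, 1],
    [1, 0, 1, 1, 0],
    [1, 1, 0, 1, 0],
    [0, 1, 1, 0, 0],
    [1, 0, 0, 0, 0],
])

# (adj @ adj)[i, j] counts length-two paths, i.e. shared neighbours:
# a classic heuristic score for how likely an edge is to form next.
common = adj @ adj
n = adj.shape[0]
candidates = [(i, j) for i in range(n) for j in range(i + 1, n) if adj[i, j] == 0]
best = max(candidates, key=lambda p: common[p])
print(best)  # → (0, 3): the unconnected pair sharing the most neighbours
```

Here concepts 0 and 3 share two neighbours (1 and 2), so the baseline predicts they are the most likely pair to be linked by future work.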
AlphaFold2 has revolutionized bioinformatics, but its ability to predict protein structures with high accuracy comes at the price of a costly database search for multiple sequence alignments. Fang and colleagues pre-train a large-scale protein language model and use it in conjunction with AlphaFold2 as a fully trainable and efficient model for structure prediction.
It is widely known that AI-based recommendation systems on social media and news websites can isolate humans from diverse information, eventually trapping them in so-called information cocoons, where they are exposed to a narrow range of viewpoints. Li et al. introduce an adaptive information dynamics model to uncover the origin of information cocoons in complex human–AI interaction systems, and test their findings on two large real-world datasets.
Deep learning can help develop non-invasive technology for decoding speech from brain activity, which could improve the lives of patients with brain injuries. Défossez et al. report a contrastive-learning approach that decodes perceived speech in human participants, using public databases of non-invasive magnetic and electrical recordings.
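Contrastive learning of this kind trains the model so that the embedding of a brain recording lies closest to the embedding of the audio segment it was recorded with, and far from the other segments in the batch. A minimal NumPy sketch of such an InfoNCE-style objective, with invented data and function names that are not the authors' code:

```python
import numpy as np

def info_nce_loss(brain_emb, audio_emb, temperature=0.1):
    """Contrastive loss: each brain embedding should be most similar to the
    audio embedding it was recorded with (the matching row). Illustrative
    sketch only, not the paper's implementation."""
    # Normalize so the dot product is cosine similarity.
    b = brain_emb / np.linalg.norm(brain_emb, axis=1, keepdims=True)
    a = audio_emb / np.linalg.norm(audio_emb, axis=1, keepdims=True)
    logits = b @ a.T / temperature  # (batch, batch) similarity matrix
    # Matching pairs sit on the diagonal; score them as the "correct class".
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
brain = rng.normal(size=(8, 16))
audio = brain + 0.01 * rng.normal(size=(8, 16))  # well-aligned pairs
print(info_nce_loss(brain, audio))  # small loss when pairs are aligned
```

Shuffling the pairing (so each brain embedding no longer matches its audio row) makes the loss jump, which is exactly the signal the objective exploits during training.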
Online matching platforms are increasingly used for applications with positive social impact such as matching blood donors with recipients, where matching algorithms need to balance fairness with an efficiency objective. The authors demonstrate, both in computational simulations and using real data from the Facebook Blood Donations tool, that introducing a simple online matching policy can substantially increase the likelihood of donor action.
Fine motor skill recovery in hand rehabilitation is challenging because existing rehabilitation gloves offer limited finger-movement sensing and closed-loop control. Sui et al. develop a soft-packaged rehabilitation glove that integrates sensing, actuation, a human–machine interface, power, electronics and a closed-loop algorithm, helping patients recover fine motor skills of the fingers after a stroke in a portable manner.
Identifying interventions that can induce a desired effect is challenging owing to the combinatorial number of possible choices in design space. Zhang and colleagues propose an active learning approach with theoretical guarantees to discover optimal interventions in causal models, and demonstrate the framework in the context of genetic perturbation design using single-cell transcriptomic data.
State-of-the-art image reconstruction for multispectral optoacoustic tomography is currently too slow for clinical applications. Dehner, Zahnd et al. propose a deep learning framework that reconstructs optoacoustic images in real time while maintaining similar image quality.
The recent accessibility of large language models has brought them into contact with a large number of users and, owing to the social nature of language, it is hard to avoid ascribing human characteristics such as intentions to a chatbot. Pataranutaporn and colleagues investigate how framing a bot as helpful or manipulative influences this perception and the behaviour of the humans who interact with it.
Despite their efficiency advantages, the performance of photonic neural networks is hampered by the accumulation of inherent systematic errors. Zheng et al. propose a dual backpropagation training approach, which allows the network to adapt to systematic errors, thus outperforming state-of-the-art in situ training approaches.
Local methods of explainable artificial intelligence identify where important features or inputs occur, while global methods try to understand what features or concepts have been learned by a model. The authors propose a concept-level explanation method that bridges the local and global perspectives, enabling more comprehensive and human-understandable explanations.