Research articles

Deep learning supported discovery of biomarkers for clinical prognosis of liver cancer

The potential of deep learning in pathological prognosis has been hampered by limited interpretability in clinical applications. Liang and colleagues present a human-centric deep learning framework that supports the discovery of prognostic biomarkers in an interpretable way.

Junhao Liang
Weisheng Zhang
Lingjie Kong
Article03 Apr 2023
Testing the limits of SMILES-based de novo molecular generation with curriculum and deep reinforcement learning

Generative models in cheminformatics depend on molecules being representable as structured data, such as the simplified molecular-input line-entry system (SMILES). Mokaya and colleagues investigated how the choice of representation influences the quality of generated compounds, and found that string-based representations can hinder performance in a curriculum learning setting.

Maranga Mokaya
Fergus Imrie
Charlotte M. Deane
Article03 Apr 2023
Characterizing the interaction conformation between T-cell receptors and epitopes with deep learning

Computational modelling of the interactions between T-cell receptors (TCRs) and epitopes is a crucial yet challenging scientific problem. Peng and colleagues develop a deep learning model to capture TCR–epitope binding patterns, providing useful insights for understanding TCR recognition.

Xingang Peng
Yipin Lei
Jianyang Zeng
Article27 Mar 2023
Synthetic data accelerates the development of generalizable learning-based algorithms for X-ray image analysis

Simulated data is an alternative to real data for medical applications where interventional data are needed to train AI-based systems. Gao and colleagues develop a model transfer paradigm to train deep networks on synthetic X-ray data and corresponding labels generated using simulation techniques from CT scans. The approach establishes synthetic data as a viable resource for developing machine learning models that apply to real clinical data.

Cong Gao
Benjamin D. Killeen
Mathias Unberath
Article20 Mar 2023
Neural-network solutions to stochastic reaction networks

Stochastic reaction networks involve solving a system of ordinary differential equations, which becomes challenging as the number of reactive species grows, but a new approach based on evolving a variational autoregressive neural network provides an efficient way to track time evolution of the joint probability distribution for general reaction networks.

Ying Tang
Jiayu Weng
Pan Zhang
Article16 Mar 2023
Predicting metabolomic profiles from microbial composition through neural ordinary differential equations

Computational models can help predict metabolic profiles of microbial communities such as human gut microbiomes or environmental microbiomes, but they lack generalizability and interpretability. To address this challenge, Wang et al. report a deep learning approach for metabolic profile prediction called mNODE that incorporates a neural network module with hidden layers described by ordinary differential equations.

Tong Wang
Xu-Wen Wang
Yang-Yu Liu
Article13 Mar 2023
Evaluation of post-hoc interpretability methods in time-series classification

Various post-hoc interpretability methods exist to evaluate the results of machine learning classification and prediction tasks. To better understand the performance and reliability of such methods, which is particularly necessary in high-risk applications, Turbe et al. have developed a framework for quantitative comparison of post-hoc interpretability approaches in time-series classification.

Hugues Turbé
Mina Bjelogrlic
Gianmarco Mengaldo
ArticleOpen Access13 Mar 2023
A multi-modal pre-training transformer for universal transfer learning in metal–organic frameworks

Metal–organic frameworks are of high interest for a range of energy and environmental applications due to their stable gas storage properties. A new machine learning approach based on a pre-trained multi-modal transformer can be fine-tuned with small datasets to predict structure-property relationships and design new metal-organic frameworks for a range of specific tasks.

Yeonghun Kang
Hyunsoo Park
Jihan Kim
Article13 Mar 2023
A neuro-vector-symbolic architecture for solving Raven’s progressive matrices

Neuro-symbolic artificial intelligence approaches display both perception and reasoning capabilities, but inherit the limitations of their individual deep learning and symbolic artificial intelligence components. By combining neural networks and vector-symbolic architectures, Hersche and colleagues propose a neuro-vector-symbolic framework that can solve Raven’s progressive matrices tests faster and more accurately than other state-of-the-art methods.

Michael Hersche
Mustafa Zeqiri
Abbas Rahimi
Article09 Mar 2023
A typology for exploring the mitigation of shortcut behaviour

Explanatory interactive machine learning methods have been developed to facilitate the learning process between the machine and the user. Friedrich et al. provide a unification of various explanatory interactive machine learning methods into a single typology, and present benchmarks for evaluating such methods.

Felix Friedrich
Wolfgang Stammer
Kristian Kersting
Article09 Mar 2023
Pan-Peptide Meta Learning for T-cell receptor–antigen binding recognition

Machine learning methods can predict and recognize binding patterns between T-cell receptors and human antigens, but they struggle with antigens for which no or little data exist regarding interactions with the immune system. A new method called PanPep based on meta-learning can learn quickly on new binding prediction tasks and accurately predicts pairing between T-cell receptors and new antigens.

Yicheng Gao
Yuli Gao
Qi Liu
Article06 Mar 2023
Labelling instructions matter in biomedical image analysis

High-quality annotation of datasets is critical for machine-learning-based biomedical image analysis. However, a detailed examination of recent image competitions reveals a gap between annotators’ needs and quality of labelling instructions. It is also found that annotator performance can be substantially improved by providing exemplary images.

Tim Rädsch
Annika Reinke
Lena Maier-Hein
ArticleOpen Access02 Mar 2023
Parameter-efficient fine-tuning of large-scale pre-trained language models

Training a deep neural network can be costly but training time is reduced when a pre-trained network can be adapted to different use cases. Ideally, only a small number of parameters needs to be changed in this process of fine-tuning, which can then be more easily distributed. In this Analysis, different methods of fine-tuning with only a small number of parameters are compared on a large set of natural language processing tasks.

Ning Ding
Yujia Qin
Maosong Sun
AnalysisOpen Access02 Mar 2023
Continuous improvement of self-driving cars using dynamic confidence-aware reinforcement learning

Reinforcement learning is a powerful technique to learn complex behaviours, but in the context of self-driving vehicles it might result in unsafe behaviour in previously unseen situations. Cao et al. create a confidence-aware method that improves through reinforcement learning but reverts to safe behaviour when a situation is new.

Zhong Cao
Kun Jiang
Diange Yang
Article23 Feb 2023
Stretchable e-skin and transformer enable high-resolution morphological reconstruction for soft robots

Developing proprioception systems for flexible structures such as soft robots is a challenge. Hu et al. report a stretchable e-skin for soft robot proprioception. Combined with deep learning, the e-skin enables high-resolution 3D geometry reconstruction of the soft robot and can be applied in many scenarios, such as human–robot interaction.

Delin Hu
Francesco Giorgio-Serchi
Yunjie Yang
Article23 Feb 2023
Mixed-modality speech recognition and interaction using a wearable artificial throat

The mechanical signals of the laryngeal vocal organ have not been well utilized by human speech processing technology. The authors develop a prototype of a wearable artificial throat that can sense speech- and vocalization-related actions. The results suggest a new technological pathway for speech recognition and interaction systems.

Qisheng Yang
Weiqiu Jin
Tian-Ling Ren
Article23 Feb 2023
Estimating categorical counterfactuals via deep twin networks

When learning a causal model from data, deriving counterfactual examples from the model can help to evaluate how plausible the mechanisms are and create hypotheses that can be tested with new data. Vlontzos and colleagues develop a deep learning-based method for answering counterfactual queries that can deal with categorical variables, rather than only binary ones, using the notion of ‘counterfactual ordering’.

Athanasios Vlontzos
Bernhard Kainz
Ciarán M. Gilligan-Lee
Article20 Feb 2023
Echo state graph neural networks with analogue random resistive memory arrays

Co-designing hardware platforms and neural network software can help improve the computational efficiency and training affordability of deep learning implementations. A new approach designed for graph learning with echo state neural networks makes use of in-memory computing with resistive memory and shows up to a 35 times improvement in the energy efficiency and 99% reduction in training cost for graph classification on large datasets.

Shaocong Wang
Yi Li
Ming Liu
ArticleOpen Access13 Feb 2023
Predicting the prevalence of complex genetic diseases from individual genotype profiles using capsule networks

Disease phenotypes can be predicted from genetic profiles, but diseases with complex, non-additive interactions between genes are hard to disentangle. An approach called DiseaseCapsule makes use of capsule networks to identify the hierarchical structure in genomic data and can predict complex diseases such as amyotrophic lateral sclerosis with high accuracy.

Xiao Luo
Xiongbin Kang
Alexander Schönhuth
ArticleOpen Access13 Feb 2023
Interpretable bilinear attention network with domain adaptation improves drug–target prediction

Predicting drug–target interaction with computational models has attracted a lot of attention, but it is a difficult problem to generalize across domains to out-of-distribution data. Bai et al. present here a method that aims to model local interactions of proteins and drug molecules while being interpretable and provide cross-domain generalization.

Peizhen Bai
Filip Miljković
Haiping Lu
Article02 Feb 2023