Research articles

Filter By:

Year
  • Deep learning methods in natural language processing generally become more effective with larger datasets and bigger networks. But it is not evident whether the same is true for more specialized domains such as cheminformatics. Frey and colleagues provide empirical explorations of chemistry models and find that neural-scaling laws hold true even for the largest tested models and datasets.

    • Nathan C. Frey
    • Ryan Soklaski
    • Vijay Gadepally
    ArticleOpen Access
  • The immense amount of Wikipedia articles makes it challenging for volunteers to ensure that cited sources support the claim they are attached to. Petroni et al. use an information-retrieval model to assist Wikipedia users in improving verifiability.

    • Fabio Petroni
    • Samuel Broscheit
    • Sebastian Riedel
    ArticleOpen Access
  • With the rapid development of natural language processing (NLP) models in the last decade came the realization that high performance levels on test sets do not imply that a model robustly generalizes to a wide range of scenarios. Hupkes et al. review generalization approaches in the NLP literature and propose a taxonomy based on five axes to analyse such studies: motivation, type of generalization, type of data shift, the source of this data shift, and the locus of the shift within the modelling pipeline.

    • Dieuwke Hupkes
    • Mario Giulianelli
    • Zhijing Jin
    AnalysisOpen Access
  • The number of publications in artificial intelligence (AI) has been increasing exponentially and staying on top of progress in the field is a challenging task. Krenn and colleagues model the evolution of the growing AI literature as a semantic network and use it to benchmark several machine learning methods that can predict promising research directions in AI.

    • Mario Krenn
    • Lorenzo Buffoni
    • Michael Kopp
    AnalysisOpen Access
  • AlphaFold2 has revolutionized bioinformatics, but its ability to predict protein structures with high accuracy comes at the price of a costly database search for multiple sequence alignments. Fang and colleagues pre-train a large-scale protein language model and use it in conjunction with AlphaFold2 as a fully trainable and efficient model for structure prediction.

    • Xiaomin Fang
    • Fan Wang
    • Le Song
    ArticleOpen Access
  • It is widely known that AI-based recommendation systems on social media and news websites can isolate humans from diverse information, eventually trapping them in so-called information cocoons, where they are exposed to a narrow range of viewpoints. Li et al. introduce an adaptive information dynamics model to uncover the origin of information cocoons in complex human–AI interaction systems, and test their findings on two large real-world datasets.

    • Jinghua Piao
    • Jiazhen Liu
    • Yong Li
    Article
  • Deep learning can help develop non-invasive technology for decoding speech from brain activity, which could improve the lives of patients with brain injuries. Défossez et al. report a contrastive-learning approach to decode speech listening from human participants, using public databases of recordings based on non-invasive magnetic and electrical measurements.

    • Alexandre Défossez
    • Charlotte Caucheteux
    • Jean-Rémi King
    ArticleOpen Access
  • Online matching platforms are increasingly used for applications with positive social impact such as matching blood donors with recipients, where matching algorithms need to balance fairness with an efficiency objective. The authors demonstrate, both in computational simulations and using real data from the Facebook Blood Donations tool, that introducing a simple online matching policy can substantially increase the likelihood of donor action.

    • Duncan C. McElfresh
    • Christian Kroer
    • John P. Dickerson
    Article
  • Fine motor skill recovery in hand rehabilitation is a challenge due to limited finger movement sensing and closed-loop control algorithms in existing rehabilitation gloves. Sui et al. develop a soft-packaged rehabilitation glove, integrating sensing, actuation, a human–machine interface, power, electronics and a closed-loop algorithm. The glove aids patients after a stroke to recover fine motor skills of the fingers in a portable manner.

    • Mengli Sui
    • Yiming Ouyang
    • Shiwu Zhang
    Article
  • Identifying interventions that can induce a desired effect is challenging owing to the combinatorial number of possible choices in design space. Zhang and colleagues propose an active learning approach with theoretical guarantees to discover optimal interventions in causal models, and demonstrate the framework in the context of genetic perturbation design using single-cell transcriptomic data.

    • Jiaqi Zhang
    • Louis Cammarata
    • Caroline Uhler
    Article
  • The recent accessibility of large language models brought them into contact with a large number of users and, due to the social nature of language, it is hard to avoid prescribing human characteristics such as intentions to a chatbot. Pataranutaporn and colleagues investigated how framing a bot as helpful or manipulative can influence this perception and the behaviour of the humans that interact with it.

    • Pat Pataranutaporn
    • Ruby Liu
    • Pattie Maes
    Article
  • Despite their efficiency advantages, the performance of photonic neural networks is hampered by the accumulation of inherent systematic errors. Zheng et al. propose a dual backpropagation training approach, which allows the network to adapt to systematic errors, thus outperforming state-of-the-art in situ training approaches.

    • Ziyang Zheng
    • Zhengyang Duan
    • Xing Lin
    Article
  • Local methods of explainable artificial intelligence identify where important features or inputs occur, while global methods try to understand what features or concepts have been learned by a model. The authors propose a concept-level explanation method that bridges the local and global perspectives, enabling more comprehensive and human-understandable explanations.

    • Reduan Achtibat
    • Maximilian Dreyer
    • Sebastian Lapuschkin
    ArticleOpen Access