August 24 Issue

August issue

Mittal, S., Thakral, K., Singh, R. et al. On responsible machine learning datasets emphasizing fairness, privacy and regulatory norms with examples in biometrics and healthcare.

  • Surbhi Mittal
  • Kartik Thakral
  • Tal Hassner
Article

Nature Machine Intelligence is a Transformative Journal; authors can publish using the traditional publishing route OR via immediate gold Open Access.

Our Open Access option complies with funder and institutional requirements.

Advertisement

  • With increasing reliance on public data sources, researchers are concerned whether low-quality or even adversarial data could have detrimental effects on medical models. Yang et al. developed Scorpius, a malicious text generator, to investigate whether large language models can mislead medical knowledge graphs. They show that a single generated paper abstract can mislead a medical reasoning system that has read millions of papers.

    • Junwei Yang
    • Hanwen Xu
    • Sheng Wang
    Article
  • Stabilization of proteins is a key task in protein engineering; however, current methods to predict mutant stability face a number of limitations. Reeves and Kalyaanamoorthy study the performance of self-supervised protein sequence likelihood models for stability prediction and find that combining them with task-specific supervised models can lead to appreciable practical gains.

    • Shawn Reeves
    • Subha Kalyaanamoorthy
    Article
  • Extracting time traces and spatial footprints of single neurons from population calcium imaging data presents challenges. Zhang et al. introduce a deep learning method that efficiently segments neuronal footprints and extracts activity traces from these data. The method surpasses existing approaches in both quality and speed, providing a robust tool for large-scale neuronal circuit analysis.

    • Kangning Zhang
    • Sean Tang
    • Weijian Yang
    Article
  • Denoising methods introduce useful priors in pre-training methods for molecular property prediction, but chemically unaware noise can lead to inaccurate predictions in downstream tasks. A molecular pre-training framework that uses fractional denoising to improve molecular distribution modelling is proposed, resulting in better predictions in various property prediction tasks.

    • Yuyan Ni
    • Shikun Feng
    • Yanyan Lan
    Article