Computational biology and bioinformatics articles within Nature Communications

Featured

  • Comment
    | Open Access

    More and more clinical studies include potentially sensitive human proteomics or metabolomics datasets, but bioinformatics resources for managing the access to these data are not yet available. This commentary discusses current best practices and future perspectives for the responsible handling of clinical proteomics and metabolomics data.

    • Thomas M. Keane
    • , Claire O’Donovan
    •  & Juan Antonio Vizcaíno
  • Article
    | Open Access

    The increasing scale and scope of biomedical data is generating tremendous opportunities for improving health outcomes, but also raises new challenges ranging from data acquisition and storage to data analysis and utilization. To meet these challenges, the authors develop the Personal Health Dashboard, which provides an end-to-end solution for deep biomedical data analytics.

    • Amir Bahmani
    • , Arash Alavi
    •  & Michael P. Snyder
  • Article
    | Open Access

    Lassa Fever is a rodent-borne viral haemorrhagic fever that is a public health problem in West Africa. Here, the authors develop a spatiotemporal model of the socioecological drivers of disease using surveillance data from Nigeria, and find evidence of climate sensitivity.

    • David W. Redding
    • , Rory Gibb
    •  & Chikwe Ihekweazu
  • Article
    | Open Access

    “Protein relocalisation plays a major role in the innate immune response but remains incompletely characterised. Here, the authors combine temporal proteomics with LOPIT, a spatial proteomic workflow, in a fully Bayesian framework to elucidate spatiotemporal proteomic changes during the LPS-induced immune response in THP-1 cells.

    • Claire M. Mulvey
    • , Lisa M. Breckels
    •  & Kathryn S. Lilley
  • Article
    | Open Access

    Small-molecule responsive protein switches are crucial components to control synthetic cellular activities. Here, we present a computational protein design strategy to repurpose drug-inhibited protein-protein interactions into OFF- and ON-switches active in cells.

    • Sailan Shui
    • , Pablo Gainza
    •  & Bruno E. Correia
  • Article
    | Open Access

    Understanding the potential impacts of new variants of SARS-CoV-2 is important for pandemic planning. Here, the authors develop a model incorporating hypothetical new variants with varying transmissibility and immune evasion properties, and use it to project possible future epidemic waves in the UK.

    • Louise Dyson
    • , Edward M. Hill
    •  & Matt J. Keeling
  • Article
    | Open Access

    Studies show the cancer transcriptome correlates poorly with the cancer proteome, questioning the role of chromatin regulation. Here the authors demonstrate proximal-gene-body chromatin elements and transcription predict abundances of differentially expressed proteins in thyroid and breast cancers.

    • Akshay Sanghi
    • , Joshua J. Gruber
    •  & Michael P. Snyder
  • Article
    | Open Access

    A Bayesian framework based on partially observable Markov decision processes (POMDPs) not only predicts subjects’ confidence in a perceptual decision making task but also explains well-known discrepancies between confidence and choice accuracy as arising from incomplete knowledge of the environment.

    • Koosha Khalvati
    • , Roozbeh Kiani
    •  & Rajesh P. N. Rao
  • Article
    | Open Access

    In clinical practice, the continuous progress of image acquisition technology or diagnostic procedures and evolving imaging protocols hamper the utility of machine learning, as prediction accuracy on new data deteriorates. Here, the authors propose a continual learning approach to deal with such domain shifts occurring at unknown time points.

    • Matthias Perkonigg
    • , Johannes Hofmanninger
    •  & Georg Langs
  • Article
    | Open Access

    Developing interpretable models is a major challenge in single cell deep learning. Here we show that the VEGA variational autoencoder model, whose decoder wiring mirrors gene modules, can provide direct interpretability to the latent space further enabling the inference of biological module activity.

    • Lucas Seninge
    • , Ioannis Anastopoulos
    •  & Joshua Stuart
  • Article
    | Open Access

    Differential expression analysis of single-cell transcriptomics allows scientists to dissect cell-type-specific responses to biological perturbations. Here, the authors show that many commonly used methods are biased and can produce false discoveries.

    • Jordan W. Squair
    • , Matthieu Gautier
    •  & Grégoire Courtine
  • Article
    | Open Access

    The vertebrate limb bud is a paradigm to uncover the fundamental mechanisms that govern embryogenesis and evolutionary diversification. Here the authors compare mouse and chicken limb bud development to study the impact of genome evolution on conserved and divergent gene regulatory interactions.

    • Shalu Jhanwar
    • , Jonas Malkmus
    •  & Rolf Zeller
  • Article
    | Open Access

    Glycosyltransferases (GT) are proteins that display extensive sequence and functional variation on a subset of 3D folds. Here, the authors use interpretable deep learning to predict 3D folds from sequence without the need for sequence alignment, which also enables the prediction of GTs with new folds.

    • Rahil Taujale
    • , Zhongliang Zhou
    •  & Natarajan Kannan
  • Article
    | Open Access

    Specific transcriptional changes in microglia associated with Alzheimer’s disease have been reported. Here, the authors show that transcriptional analysis of human hydrocephalus biopsies identifies changes in immune response genes associated with early AD pathology, including cognitive decline.

    • Wenrui Huang
    • , Anne Marie Bartosch
    •  & Andrew F. Teich
  • Article
    | Open Access

    Computer-assisted diagnosis is key for scaling up cervical cancer screening, but current algorithms perform poorly on whole slide image analysis and generalization. Here, the authors present a WSI classification and top lesion cell recommendation system using deep learning, and achieve comparable results with cytologists.

    • Shenghua Cheng
    • , Sibo Liu
    •  & Xiuli Liu
  • Article
    | Open Access

    How to infer transient cells and cell-fate transitions from snap-shot single cell transcriptome dataset remains a major challenge. Here the authors present a multiscale approach to construct single-cell dynamical manifold, quantify cell stability, and compute transition trajectory and probability between cell states.

    • Peijie Zhou
    • , Shuxiong Wang
    •  & Qing Nie
  • Article
    | Open Access

    Determining the quality of localisation microscopy images is currently challenging. Here the authors report use of the Haar wavelet kernel analysis (HAWK) Method for the Assessment of Nanoscopy, termed HAWKMAN, to assess the reliability of localisation information.

    • Richard J. Marsh
    • , Ishan Costello
    •  & Susan Cox
  • Article
    | Open Access

    Rift Valley fever is a zoonotic haemorrhagic fever with complex transmission dynamics influenced by environmental variables and animal movements. Here, the authors develop a metapopulation model incorporating these factors and use it to identify the main drivers of transmission in the Comoros archipelago.

    • Warren S. D. Tennant
    • , Eric Cardinale
    •  & Raphaëlle Métras
  • Article
    | Open Access

    Retinoblastoma is the most frequent intraocular paediatric malignancy whose molecular basis remains poorly understood. Here, the authors perform multi-omic analysis and identify two subtypes; one in a cone differentiated state and one more aggressive showing cone dedifferentiation and expressing neuronal markers.

    • Jing Liu
    • , Daniela Ottaviani
    •  & François Radvanyi
  • Article
    | Open Access

    Classifying cells into unseen cell types remains challenging in scRNA-seq analysis. Here we show that Cell Ontology enables an accurate classification of unseen cell types through considering the cell type relationships in the Cell Ontology graph.

    • Sheng Wang
    • , Angela Oliveira Pisco
    •  & Russ B. Altman
  • Article
    | Open Access

    The high dimensional and complex nature of mass spectrometry imaging (MSI) data poses challenges to downstream analyses. Here the authors show an application of artificial intelligence in mining MSI data revealing biologically relevant metabolomic and proteomic information from data acquired on different mass spectrometry platforms.

    • Walid M. Abdelmoula
    • , Begona Gimenez-Cassina Lopez
    •  & Nathalie Y. R. Agar
  • Article
    | Open Access

    Scalable trajectory inference for multi-omic single cell datasets is challenging in terms of capturing non-tree complex topologies. Here the authors present a method, VIA, that scales to millions of cells across multiple omic modalities using lazy-teleporting random walks.

    • Shobana V. Stassen
    • , Gwinky G. K. Yip
    •  & Kevin K. Tsia
  • Article
    | Open Access

    Comparing changes in behaviour across various species is not always trivial, especial across significantly divergent species. Here, the authors develop a deep learning framework that allows them to map changes in locomotion demonstrated on dopamine-deficient humans, mice and worms.

    • Takuya Maekawa
    • , Daiki Higashide
    •  & Susumu Takahashi
  • Article
    | Open Access

    Drug and target discovery for advanced liver disease are hampered by a lack of suitable models for clinical translation. Here the authors present a human liver cell-based system modeling a clinical prognostic signature allowing to propose nizatidine for treatment of advanced liver fibrosis and hepatocellular carcinoma prevention.

    • Emilie Crouchet
    • , Simonetta Bandiera
    •  & Thomas F. Baumert
  • Article
    | Open Access

    Dual-energy X-ray absorptiometry and the Fracture Risk Assessment Tool are recommended tools for osteoporotic fracture risk evaluation, but are underutilized. Here, the authors present an opportunistic tool to identify fractures, predict bone mineral density and evaluate fracture risk using plain pelvis and lumbar spine radiographs.

    • Chen-I Hsieh
    • , Kang Zheng
    •  & Chang-Fu Kuo
  • Article
    | Open Access

    Peptide-protein interactions play fundamental roles in cellular processes and are crucial for designing peptide therapeutics. Here, the authors present a deep learning framework for simultaneously predicting peptide-protein interactions and identifying peptide binding residues involved in the interactions.

    • Yipin Lei
    • , Shuya Li
    •  & Jianyang Zeng
  • Article
    | Open Access

    Image-based simulation for obtaining physical quantities is limited by the uncertainty in the underlying image segmentation. Here, the authors introduce a workflow for efficiently quantifying segmentation uncertainty and creating uncertainty distributions of the resulting physics quantities.

    • Michael C. Krygier
    • , Tyler LaBonte
    •  & Scott A. Roberts
  • Article
    | Open Access

    Little is known about how human parental relatedness varied across ancient populations. Runs of homozygosity (ROH) in the offspring’s genome can give clues. Here, the authors present a method to identify ROH in ancient genomes and infer low rates of close kin unions across most ancient populations.

    • Harald Ringbauer
    • , John Novembre
    •  & Matthias Steinrücken
  • Article
    | Open Access

    Despite extensive genetic heterogeneity, nearly half of all multiple myeloma (MM) cases are driven by cyclin D2 (CCND2) over-expression. Here the authors dissect the chromatin landscape of MM to provide insights into the transcriptional regulatory landscape driving MM and divergent transcriptomes corresponding to different MM genetic subtypes.

    • Jaime Alvarez-Benayas
    • , Nikolaos Trasanidis
    •  & Anastasios Karadimitris
  • Article
    | Open Access

    The molecular basis of Alzheimer’s Disease has been obscured by heterogeneity and scarcity of brain gene expression data, which limit effectiveness in complex models. Here, the authors introduce a multi-task deep learning framework to learn generalizable and nuanced relationships between gene expression and neuropathology.

    • Nicasia Beebe-Wang
    • , Safiye Celik
    •  & Su-In Lee
  • Article
    | Open Access

    Newly emerged pathogens are inherently difficult to forecast, due to many unknowns about their biology early in an epidemic. Here, the authors assess forecasts of a suite of models during the Zika epidemic in Colombia, finding that the models that performed best changed over the course of the epidemic.

    • Rachel J. Oidtman
    • , Elisa Omodei
    •  & T. Alex Perkins
  • Article
    | Open Access

    Complex biomolecular networks are fundamental to the functioning of living systems, both at the cellular level and beyond. In this paper, the authors develop a systems framework to elucidate the interplay of networks and the spatial localisation of network components.

    • Govind Menon
    •  & J. Krishnan
  • Article
    | Open Access

    Intratumour heterogeneity (ITH) and mutational signatures are typically analysed separately, even though they are not necessarily independent. Here, the authors present CloneSig, a tool for the joint estimation of ITH and mutational signatures, with which they analyse the TCGA and PCAWG datasets.

    • Judith Abécassis
    • , Fabien Reyal
    •  & Jean-Philippe Vert
  • Article
    | Open Access

    Disambiguating abbreviations is important for automated clinical note processing; however, deploying machine learning for this task is restricted by lack of good training data. Here, the authors show novel data augmentation methods that use biomedical ontologies to improve abbreviation disambiguation in many datasets.

    • Marta Skreta
    • , Aryan Arbabi
    •  & Michael Brudno
  • Article
    | Open Access

    Historical interbreeding between Neanderthals and humans should leave signatures of historical demographics in modern human genomes. Analysing the size distribution of Neanderthal fragments in non-African genomes suggests consistent differences in the generation interval across Eurasia, and that this could explain mutational spectrum variation.

    • Moisès Coll Macià
    • , Laurits Skov
    •  & Mikkel Heide Schierup
  • Comment
    | Open Access

    Spatially resolved transcriptomic data demand new computational analysis methods to derive biological insights. Here, we comment on these associated computational challenges as well as highlight the opportunities for standardized benchmarking metrics and data-sharing infrastructure in spurring innovation moving forward.

    • Lyla Atta
    •  & Jean Fan
  • Article
    | Open Access

    Genetic plasticity drives phenotypic differences. Here, the authors develop a framework to quantify the individual and combinatorial contributions of SNPs on a phenotype of interest and use it to identify SNP-SNP interactions associated with variations in bacteria’s response to external changes.

    • Dengcheng Yang
    • , Yi Jin
    •  & Rongling Wu
  • Article
    | Open Access

    Trypanosoma brucei undergoes developmental steps during host infection. Here, using oligopeptide-induced differentiation in vitro, authors model replicative ‘slender’ to transmissible ‘stumpy’ bloodstream forms and identify developmental and cell cycle regulators by single cell transcriptomics.

    • Emma M. Briggs
    • , Federico Rojas
    •  & Thomas D. Otto
  • Article
    | Open Access

    Identifying the molecular mechanisms of response to systemic therapy in prostate cancer remains crucial. Here, the authors apply single cell-ATAC and RNAseq to models of early treatment response and resistance to enzalutamide and identify chromatin and gene expression patterns that can predict treatment response.

    • S. Taavitsainen
    • , N. Engedal
    •  & A. Urbanucci
  • Article
    | Open Access

    The intrinsic disorder of histone tails poses challenges in their characterization. Here the authors apply extensive molecular dynamics simulations of the full nucleosome to show reversible binding to DNA with specific binding modes of different types of histone tails, where charge-altering modifications suppress tail-DNA interactions and may boost interactions between nucleosomes and nucleosome-binding proteins.

    • Yunhui Peng
    • , Shuxiang Li
    •  & Anna R. Panchenko
  • Article
    | Open Access

    The application of polygenic risk scores to individual-level disease susceptibility is challenging, as risk is evaluated at a group-level. Here, the authors describe a machine learning method, Mondrian Cross-Conformal Prediction, that reports disease status conditional probability value at the individual level.

    • Jiangming Sun
    • , Yunpeng Wang
    •  & Kasper Lage
  • Article
    | Open Access

    The global pattern of the mammalian methylome is formed by changes in methylation and demethylation. Here the authors describe a metric methylation concurrence that measures the ratio of unmethylated CpGs inside the partially methylated reads and show that methylation concurrence is associated with epigenetically regulated tumour suppressor genes.

    • Jiejun Shi
    • , Jianfeng Xu
    •  & Wei Li
  • Article
    | Open Access

    Finding a biologically-relevant inductive bias for training DNNs on large fitness landscapes is challenging. Here, the authors propose a method called Epistatic Net that improves DNN prediction accuracy and interpretation speed by integrating the knowledge that higher-order epistatic interactions are usually sparse.

    • Amirali Aghazadeh
    • , Hunter Nisonoff
    •  & Kannan Ramchandran