Featured
-
-
Article
| Open AccessHigh resolution spatial profiling of kidney injury and repair using RNA hybridization-based in situ sequencing
Advancements in spatial transcriptomics technologies have enabled the analysis of gene expression at cellular resolution in situ. The authors applied direct RNA hybridization-based in situ sequencing (dRNA HybISS) and developed a computational tool, CellScopes, to study gene expression in mouse kidneys, identifying cellular changes and interactions during injury and repair.
- Haojia Wu
- , Eryn E. Dixon
- & Benjamin D. Humphreys
-
Article
| Open AccessA deep-learning-based framework for identifying and localizing multiple abnormalities and assessing cardiomegaly in chest X-ray
Accurate localization of abnormalities is crucial in the interpretation of chest X-rays. Here the authors present a deep learning framework for simultaneous localization of 14 thoracic abnormalities and calculation of cardiothoracic ratio, based on large X-ray dataset with bounding boxes created via a human-in-the-loop approach.
- Weijie Fan
- , Yi Yang
- & Dong Zhang
-
Article
| Open AccessGenomic epidemiology reveals geographical clustering of multidrug-resistant Escherichia coli ST131 associated with bacteraemia in Wales
Escherichia coli ST131 is a globally dominant multidrug resistant clone associated with high rates of recurring urinary tract infections. In this genomic epidemiology study, the authors describe the evolution, population structure, and antimicrobial resistance in 142 E. coli ST131 samples from Wales, UK.
- Rhys T. White
- , Matthew J. Bull
- & Scott A. Beatson
-
Article
| Open AccessA general computational design strategy for stabilizing viral class I fusion proteins
The authors present a generalisable computational approach to stabilize class I fusion proteins in the prefusion state. The method was used to stabilize the fusion proteins of RSV, hMPV, and SARS-CoV-2 viruses, with the designs structurally validated and RSV F protein assessed in a neutralization assay.
- Karen J. Gonzalez
- , Jiachen Huang
- & Eva-Maria Strauch
-
Article
| Open AccessPredicting proximal tubule failed repair drivers through regularized regression analysis of single cell multiomic sequencing
A profibrotic, proinflammatory kidney cell population has been identified as a driver of chronic kidney disease. Here, authors generate a human kidney single cell multiomic dataset and apply a regularised regression approach to identify transcription factors underpinning this cell population.
- Nicolas Ledru
- , Parker C. Wilson
- & Benjamin D. Humphreys
-
Article
| Open AccessRegression-based Deep-Learning predicts molecular biomarkers from pathology slides
Cancer biomarkers are often continuous measurements, which poses challenges for their prediction using classification-based deep learning. Here, the authors develop a regression-based deep learning method to predict continuous biomarkers - such as the homologous repair deficiency score - from cancer histopathology images.
- Omar S. M. El Nahhas
- , Chiara M. L. Loeffler
- & Jakob Nikolas Kather
-
Article
| Open AccessMatrin3 mediates differentiation through stabilizing chromatin loop-domain interactions and YY1 mediated enhancer-promoter interactions
Alterations in proteins within nuclear compartments often lead to changes in chromosomal architecture. Here, using acute targeted protein degradation, the authors reveal that the nuclear complex protein Matrin3 directly mediates differentiation through stabilizing chromatin loop domain interactions.
- Tianxin Liu
- , Qian Zhu
- & Stuart H. Orkin
-
Article
| Open AccessA method to estimate the contribution of rare coding variants to complex trait heritability
The contribution of rare variants to complex traits has not been well studied. Here, the authors present RARity, a method to assess rare variant heritability without assuming a particular genetic architecture and enabling both gene-level and exome-wide heritability estimation of continuous traits.
- Nazia Pathan
- , Wei Q. Deng
- & Guillaume Paré
-
Article
| Open AccessPredicting DNA structure using a deep learning method
In this work, the authors report a deep learning method, Deep DNAshape, to predict the influence of flanking regions on three-dimensional DNA structure and in structural readout mechanisms of protein-DNA binding.
- Jinsen Li
- , Tsu-Pei Chiu
- & Remo Rohs
-
Review Article
| Open AccessThe genetic basis of autoimmunity seen through the lens of T cell functional traits
Genetic risk variants for autoimmune diseases are largely enriched in T cell-specific regulatory regions. In this review, Raychaudhuri and colleagues summarise the findings of recent studies evaluating the genetic regulation of T cell molecular and functional traits in these diseases.
- Kaitlyn A. Lagattuta
- , Hannah L. Park
- & Soumya Raychaudhuri
-
Article
| Open AccessAntiviral fibrils of self-assembled peptides with tunable compositions
In this work, the authors report the use of a computationally and rationally designed self-assembling peptide that has robust antiviral capability with demonstrated specificity in binding to SARS-CoV-2 and inhibition of viral entry into human cells.
- Joseph Dodd-o
- , Abhishek Roy
- & Vivek Kumar
-
Article
| Open AccessA multicenter clinical AI system study for detection and diagnosis of focal liver lesions
Early detection and accurate diagnosis of focal liver lesions are crucial for effective treatment and prognosis. Here, the authors present a fully automated diagnostic system that leverages multi-phase CT scans and clinical features, for diagnosing liver lesions.
- Hanning Ying
- , Xiaoqing Liu
- & Xiujun Cai
-
Article
| Open AccessOverlay databank unlocks data-driven analyses of biomolecules for all
In this work, the authors report NMR lipids Databank to promote decentralised sharing of biomolecular molecular dynamics (MD) simulation data with an overlay design. Programmatic access enables analyses of rare phenomena and advances the training of machine learning models.
- Anne M. Kiirikki
- , Hanne S. Antila
- & O. H. Samuli Ollila
-
Article
| Open AccessMAIVeSS: streamlined selection of antigenically matched, high-yield viruses for seasonal influenza vaccine production
Vaccines combat global influenza threats, relying on timely selection of optimal seed viruses. Here, authors introduce MAIVeSS, a machine learning assisted framework to streamline vaccine seed virus selection using genomic sequence, expediting seasonal flu vaccine production and supply.
- Cheng Gao
- , Feng Wen
- & Xiu-Feng Wan
-
Article
| Open AccessDynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model
Proteins often function by changing conformations upon ligand binding. Efficient structural modelling of these interactions, crucial for drug discovery, is limited: here the authors address this with DynamicBind, a diffusion-based deep generative model.
- Wei Lu
- , Jixian Zhang
- & Shuangjia Zheng
-
Article
| Open AccessProtein structure generation via folding diffusion
The ability to engineer novel protein structures has tremendous scientific and therapeutic impact. Here, authors develop a generative model acting upon an angular representation of protein structures to create high quality protein backbones.
- Kevin E. Wu
- , Kevin K. Yang
- & Ava P. Amini
-
Article
| Open AccessUnzipped genome assemblies of polyploid root-knot nematodes reveal unusual and clade-specific telomeric repeats
Telomeres protect the extremities of linear chromosomes and are involved in ageing, senescence and genome stability. Here, the authors have identified peculiar and specific telomeric DNA repeats in the genomes of devastating plant-parasitic nematodes, opening new perspectives for their control.
- Ana Paula Zotta Mota
- , Georgios D. Koutsovoulos
- & Etienne G. J. Danchin
-
Article
| Open AccessOrchestrating chromosome conformation capture analysis with Bioconductor
The Bioconductor project aims to develop R packages for analysis of genomic datasets. Here the authors show the HiCExperiment package suite and its companion online book (https://bioconductor.org/books/OHCA/) which present data structures, computational methods and visualization tools available in Bioconductor to investigate chromatin conformation capture (3C) data in R.
- Jacques Serizay
- , Cyril Matthey-Doret
- & Romain Koszul
-
Article
| Open AccessLogical design of synthetic cis-regulatory DNA for genetic tracing of cell identities and state changes
Descriptive data in biomedical research are expanding rapidly, but functional validation methods lag behind. Here, authors present Logical Synthetic cis-regulatory DNA, a framework to design reporters that mark cellular states and pathways, showcasing its applicability to complex phenotypic states.
- Carlos Company
- , Matthias Jürgen Schmitt
- & Gaetano Gargiulo
-
Article
| Open AccessThe impacts of active and self-supervised learning on efficient annotation of single-cell expression data
Cell type annotation for single-cell data is challenging. Here, authors explore active and self-supervised learning and introduce adaptive reweighting as a tailored heuristic, demonstrating competitive performance and showing that incorporating prior knowledge enhances cell type annotation accuracy.
- Michael J. Geuenich
- , Dae-won Gong
- & Kieran R. Campbell
-
Article
| Open AccessOrientation-invariant autoencoders learn robust representations for shape profiling of cells and organelles
In image analysis, the shape properties of cells/organelles should be unaffected by image orientation. Conventional autoencoder (AE) methods can be sensitive to orientation. Here, the authors develop an unsupervised AE method that learns robust, orientation-invariant representations.
- James Burgess
- , Jeffrey J. Nirschl
- & Serena Yeung-Levy
-
Article
| Open AccessImproving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI
Most polygenic risk score (PRS) methods focus only on individuals with distinct primary continental ancestry, without accommodating recently-admixed individuals. Here, the authors develop a novel penalized regression-based PRS method specifically designed for admixed individuals.
- Quan Sun
- , Bryce T. Rowland
- & Yun Li
-
Article
| Open AccessTigerfish designs oligonucleotide-based in situ hybridization probes targeting intervals of highly repetitive DNA at the scale of genomes
Repetitive DNA intervals play important roles in the nucleus but are difficult to study due to their reiterated nature. Tigerfish introduces a novel computational platform for the design of interval-specific in situ hybridization probes.
- Robin Aguilar
- , Conor K. Camplisson
- & Brian J. Beliveau
-
Article
| Open AccessDetection of senescence using machine learning algorithms based on nuclear features
Identifying senescence is complicated by a lack of universal markers. Here, Duran et al. use nuclear morphology features to devise machine-learning classifiers that detect senescence in cell lines and liver sections of patients and mouse models of aging and disease.
- Imanol Duran
- , Joaquim Pombo
- & Jesús Gil
-
Article
| Open AccessLocal structural preferences in shaping tau amyloid polymorphism
In this work, using a combination of Cryo-EM, in-cell experiments and biophysical analysis, the authors decoded the aggregation propensity of tau, revealing 5 central hot spots in its primary sequence and identify PAM4 as short segment that determines both the structure, as well as the cellular propagation of tau aggregates extracted from Alzheimer’s disease, corticobasal degeneration, and progressive supranuclear palsy patients.
- Nikolaos Louros
- , Martin Wilkinson
- & Joost Schymkowitz
-
Article
| Open AccessA sequence-aware merger of genomic structural variations at population scale
Existing tools for structural variations (SVs) calling and merging often lead to fragmented SVs and the potential of introducing unnecessary errors. Here, the authors report the PanPop pipeline to address these issues by implementing sequence-aware SV merging algorithm to efficiently merge SVs of various types.
- Zeyu Zheng
- , Mingjia Zhu
- & Yongzhi Yang
-
Matters Arising
| Open AccessReply to: Assessing the precision of morphogen gradients in neural tube development
- Roman Vetter
- & Dagmar Iber
-
Article
| Open AccessCongenital heart disease detection by pediatric electrocardiogram based deep learning integrated with human concepts
Congenital heart disease is life threatening, and its screening is complex and costly. Here, authors use AI to detect the disease based on pediatric electrocardiogram, suggesting superior performance over cardiologists.
- Jintai Chen
- , Shuai Huang
- & Huiying Liang
-
Article
| Open AccessDeepFocus: fast focus and astigmatism correction for electron microscopy
High-throughput electron microscopy demands minimal human intervention and high image quality. Here, authors introduce DeepFocus, a data-driven method for aberration correction in electron microscopy, robust for low SNR images, fast and easily adaptable to microscopes and samples. Peer Review Information: Nature Communications thanks Yang Zhang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
- P. J. Schubert
- , R. Saxena
- & J. Kornfeld
-
Article
| Open AccessContScout: sensitive detection and removal of contamination from annotated genomes
It is unclear whether naturally evolved de novo proteins have stable, folded structures. Here, systematic identification and structural modeling of de novo genes, this study reveals that a small subset of these proteins may have well-folded structures, and were likely born with these structures.
- Balázs Bálint
- , Zsolt Merényi
- & László G. Nagy
-
Article
| Open AccessEffect of aging on the human myometrium at single-cell resolution
Age-associated myometrial dysfunction can cause complications during pregnancy and labor. Here, the authors report that aging myometrium is characterized by diminished contractile capillary cells, altered gene expression, and disrupted cellular communication leading to impaired angiogenesis, increased fibrosis and inflammation.
- Paula Punzon-Jimenez
- , Alba Machado-Lopez
- & Aymara Mas
-
Article
| Open AccessscDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data
Here the authors propose a deep learning model that integrates multi-condition, multi-batch single-cell RNA-sequencing datasets. The model disentangles biological variation (condition effect) from technical confounders (batch effect) and overcomes some limitations of existing approaches.
- Ziqi Zhang
- , Xinye Zhao
- & Xiuwei Zhang
-
Article
| Open AccessLongitudinal quantification of Bifidobacterium longum subsp. infantis reveals late colonization in the infant gut independent of maternal milk HMO composition
Here, the authors develop a high-throughput method to quantify Bifidobacterium longum subsp. infantis (BL. infantis), a proficient HMO-utilizer, from metagenomic sequencing, and applied it to a longitudinal cohort consisting of 21 mother-infant dyads, suggesting BL. infantis colonization to start late in the breast-feeding period.
- Dena Ennis
- , Shimrit Shmorak
- & Moran Yassour
-
Article
| Open AccessSingle-cell analysis of psoriasis resolution demonstrates an inflammatory fibroblast state targeted by IL-23 blockade
Single cell profiling of tissue from patients undergoing therapy has the potential to identify drug-induced immune changes. Here the authors show a skin scRNA-seq study of psoriasis patients treated with an IL-23 inhibitor and characterize changes in cell states during early treatment.
- Luc Francis
- , Daniel McCluskey
- & Satveer K. Mahil
-
Article
| Open AccessSemi-supervised integration of single-cell transcriptomics data
Batch effects hinder multi-sample single-cell data analyses. Here, authors present STACAS, a scalable single-cell RNA-seq data integration tool that uses prior cell type knowledge to preserve biological variability, demonstrating robustness to noisy input cell type labels.
- Massimo Andreatta
- , Léonard Hérault
- & Santiago J. Carmona
-
Article
| Open AccessUtility of long-read sequencing for All of Us
Using All of Us pilot data, the authors compared short- and long-read performance across medically relevant genes and showcased the utility of long reads to improve variant detection and phasing in easy and hard to resolve medically relevant genes.
- M. Mahmoud
- , Y. Huang
- & F. J. Sedlazeck
-
Article
| Open AccessSingle-cell multiomics decodes regulatory programs for mouse secondary palate development
Development of the secondary palate is a complex process. Here, the authors profile mouse palatogenesis through single-cell multiome sequencing, revealing dynamic gene regulation across embryonic days (E) 12.5, E13.5, E14.0, and E14.5.
- Fangfang Yan
- , Akiko Suzuki
- & Zhongming Zhao
-
Article
| Open AccessTrajectory inference across multiple conditions with condiments
scRNA-Seq has enabled the study of dynamic systems such as response to a drug at the individual cell and gene levels. Here the authors introduce a framework to interpret differences at the trajectory, cell populations, and individual gene levels.
- Hector Roux de Bézieux
- , Koen Van den Berge
- & Sandrine Dudoit
-
Article
| Open AccessSiFT: uncovering hidden biological processes by probabilistic filtering of single-cell data
Cells simultaneously encode multiple signals, some harder to recover. Here, authors introduce SiFT (Signal FilTering), a kernel-based projection method, revealing underlying biological processes in single-cell data.
- Zoe Piran
- & Mor Nitzan
-
Article
| Open AccessCompositional and temporal division of labor modulates mixed sugar fermentation by an engineered yeast consortium
Synthetic microbial communities are suitable for mixed substrates fermentation and long metabolic pathway engineering. Here, the authors combine fermentation experiments with mathematical modeling to reveal the effect of compositional and temporal changes on division of labor in cellulosic ethanol production using two yeast strains.
- Jonghyeok Shin
- , Siqi Liao
- & Yong-Su Jin
-
Article
| Open AccessAnti-correlated feature selection prevents false discovery of subpopulations in scRNAseq
Typical single-cell RNAseq pipelines will subcluster homogeneous cells. Here, authors present a computational algorithm for accurately identifying cell-type marker genes in single-cell data analysis with a low false discovery rate.
- Scott R. Tyler
- , Daniel Lozano-Ojalvo
- & Eric E. Schadt
-
Article
| Open AccessEmergence of periodic circumferential actin cables from the anisotropic fusion of actin nanoclusters during tubulogenesis
Periodic circumferential cytoskeletons support biological tube formation. Here, the authors show that self-assembled actin nanoclusters undergo biased fusion and develop into periodic cables in response to the membrane anisotropy of the expanding Drosophila tracheal tube.
- Sayaka Sekine
- , Mitsusuke Tarama
- & Shigeo Hayashi
-
Article
| Open AccessRational strain design with minimal phenotype perturbation
No consensus exists on the computationally tractable use of dynamic models for strain design. To tackle this, the authors report a framework, nonlinear-dynamic-model-assisted rational metabolic engineering design, for efficiently designing robust, artificially engineered cellular organisms.
- Bharath Narayanan
- , Daniel Weilandt
- & Vassily Hatzimanikatis
-
Article
| Open AccessHuman whole-exome genotype data for Alzheimer’s disease
The heterogeneity of whole-exome sequencing (WES) data generation methods presents a challenge to joint analysis. Here, the authors present a bioinformatics strategy to generate high-quality data from processing diversely generated WES samples, as applied in the Alzheimer’s Disease Sequencing Project.
- Yuk Yee Leung
- , Adam C. Naj
- & Li-San Wang
-
Article
| Open AccessUltraconserved bacteriophage genome sequence identified in 1300-year-old human palaeofaeces
Bacterial viruses (phages) are generally recognised as rapidly evolving biological entities. Here, Rozwalak et al. analyse DNA sequence datasets generated from ancient palaeofaeces and identify 298 phage genomes from the last 5300 years, including a 1300-year-old phage genome nearly identical to a present-day virus that infects human gut bacteria.
- Piotr Rozwalak
- , Jakub Barylski
- & Andrzej Zielezinski
-
Article
| Open AccessMARS an improved de novo peptide candidate selection method for non-canonical antigen target discovery in cancer
Detection of neoepitopes from tumours is time consuming and requires the integration of genomic and/or RNA sequencing expression data. Here, the authors propose a machine learning method to enable direct identification of additional, tumour-specific sequences using mass spectrometry through integration of de novo peptide sequencing scores, MHC class I binding prediction, and peptide retention time prediction.
- Hanqing Liao
- , Carolina Barra
- & Nicola Ternette
-
Article
| Open AccessSegment anything in medical images
Segmentation is an important fundamental task in medical image analysis. Here the authors show a deep learning model for efficient and accurate segmentation across a wide range of medical image modalities and anatomies.
- Jun Ma
- , Yuting He
- & Bo Wang
-
Article
| Open AccessMolecular quantitative trait loci in reproductive tissues impact male fertility in cattle
Investigating the genetics of male fertility requires comprehensive genotype and phenotype data. Here, the authors characterize the transcriptional complexity of bovine male reproductive tissues to identify loci associated with male fertility.
- Xena Marie Mapel
- , Naveen Kumar Kadri
- & Hubert Pausch
-
Article
| Open AccessUsing big sequencing data to identify chronic SARS-Coronavirus-2 infections
Chronic SARS-CoV-2 infections have been hypothesised to be sources of new variants. Here, the authors use large-scale genome sequencing data to identify mutations predictive of chronic infections, which may therefore be relevant in future variants.
- Sheri Harari
- , Danielle Miller
- & Adi Stern
Browse broader subjects
Browse narrower subjects
- Biochemical reaction networks
- Cellular signalling networks
- Classification and taxonomy
- Communication and replication
- Computational models
- Computational neuroscience
- Computational platforms and environments
- Data acquisition
- Data integration
- Data mining
- Data processing
- Data publication and archiving
- Databases
- Functional clustering
- Gene ontology
- Gene regulatory networks
- Genome informatics
- Hardware and infrastructure
- High-throughput screening
- Image processing
- Literature mining
- Machine learning
- Microarrays
- Network topology
- Phylogeny
- Power law
- Predictive medicine
- Probabilistic data networks
- Programming language
- Protein analysis
- Protein design
- Protein folding
- Protein function predictions
- Protein structure predictions
- Proteome informatics
- Quality control
- Scale invariance
- Sequence annotation
- Software
- Standards
- Statistical methods
- Virtual drug screening