-
-
Article
| Open AccessAn engineered variant of MECR reductase reveals indispensability of long-chain acyl-ACPs for mitochondrial respiration
Mitochondrial fatty acid synthesis (mtFAS) generates the precursor for lipoic acid synthesis, but the role of longer fatty acid products has remained unclear. Here, the authors generated an engineered variant of human 2E-enoyl-ACP reductase (MECR) of mtFAS to study the role of long chain fatty acids.
- M. Tanvir Rahman
- , M. Kristian Koski
- & Kaija J. Autio
-
Article
| Open AccessAlternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders
Alternative polyadenylation (APA) contributes to the post-transcriptional regulation of most human genes, yet the effects of APA are largely overlooked by conventional transcriptome-wide association studies (TWAS). Here, the authors conduct an APA-TWAS for 11 brain disorders, identifying hundreds of APA-linked disease susceptibility genes.
- Ya Cui
- , Frederick J. Arnold
- & Wei Li
-
Article
| Open AccessEstimation of cell lineages in tumors from spatial transcriptomics data
Cell type deconvolution in tumor spatial transcriptomics (ST) data remains challenging. Here, the authors develop Spatial Cellular Estimator for Tumors (SpaCET) to infer cell types and intercellular interactions from ST data in cancer across different platforms, with improved performance over similar methods.
- Beibei Ru
- , Jinlin Huang
- & Peng Jiang
-
Article
| Open AccessReanalysis of ribosome profiling datasets reveals a function of rocaglamide A in perturbing the dynamics of translation elongation via eIF4A
The compound Rocaglamide A (RocA) is known for repressing translation initiation. Here the authors identify a dual mode of action for RocA in blocking translation initiation and elongation via eIF4A using previous datasets and new analyses.
- Fajin Li
- , Jianhuo Fang
- & Xuerui Yang
-
Article
| Open AccessSample multiplexing-based targeted pathway proteomics with real-time analytics reveals the impact of genetic variation on protein expression
Targeted proteomics enables robust hypothesis-driven research. Here, Yu et al. present a multiplexed approach for targeted pathway proteomics and apply it to quantify protein families across 480 fully genotyped Diversity Outbred mice, revealing impacts of genetic variation on protein expression and lipid metabolism.
- Qing Yu
- , Xinyue Liu
- & Steven P. Gygi
-
Article
| Open AccessGenome-wide host-pathogen analyses reveal genetic interaction points in tuberculosis disease
Few genetic loci have been associated with tuberculosis infection, possibly because of the influence of genetic variation in the pathogen. Here, the authors integrate human and Mycobacterium tuberculosis genetics to find genome-genome interactions associated with infection.
- Jody Phelan
- , Paula Josefina Gomez-Gonzalez
- & Taane G. Clark
-
Article
| Open AccessViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data
Metagenomic Hi-C enables genome retrieval in microbial samples. Here, the authors develop an integrative method to recover complete viral genomes and detect virus-host pairs using metagenomic Hi-C data.
- Yuxuan Du
- , Jed A. Fuhrman
- & Fengzhu Sun
-
Article
| Open AccessThe RESP AI model accelerates the identification of tight-binding antibodies
High-affinity antibodies are often identified through directed evolution but deep leaning methods hold great promise. Here the authors report RESP, a pipeline for efficient identification of high affinity antibodies, and apply this to the PD-L1 antibody Atezolizumab.
- Jonathan Parkinson
- , Ryan Hard
- & Wei Wang
-
Article
| Open AccessMapping lesion-specific response and progression dynamics and inter-organ variability in metastatic colorectal cancer
Understanding the heterogeneity of growth, response to therapy and progression dynamics in metastatic colorectal cancer (mCRC) remains critical. Here, the authors analyse lesion-specific response heterogeneity in 4,308 mCRC patients and find that organ-level progression sequence is associated with long-term survival.
- Jiawei Zhou
- , Amber Cipriani
- & Yanguang Cao
-
Article
| Open AccessDecision level integration of unimodal and multimodal single cell data with scTriangulate
Single-cell genomics has expanded to measure diverse molecular modalities within the same cell. Here the authors provide a computational framework called scTriangulate to integrate cluster annotations from diverse independent sources, algorithms, and modalities to define statistically stable populations.
- Guangyuan Li
- , Baobao Song
- & Nathan Salomonis
-
Article
| Open AccessTopological identification and interpretation for single-cell gene regulation elucidation across multiple platforms using scMGCA
A major challenge in analyzing scRNA-seq data arises from challenges related to dimensionality and the prevalence of dropout events. Here the authors develop a deep graph learning method called scMGCA based on a graph-embedding autoencoder that simultaneously learns cell-cell topology representation and cluster assignments, outperforming other state-of-the-art models across multiple platforms.
- Zhuohan Yu
- , Yanchi Su
- & Xiangtao Li
-
Article
| Open AccessEpigenetic and transcriptional regulations prime cell fate before division during human pluripotent stem cell differentiation
Many stem cells exhibit cell division coupled to differentiation, though the changes occurring between consecutive cell divisions have been difficult to study. Here they use synchronized hPSC culture to show that production of transcription factors and epigenetic changes are linked with cell division timing.
- Pedro Madrigal
- , Siwei Deng
- & Siim Pauklin
-
Article
| Open AccessProjected health impact of post-discharge malaria chemoprevention among children with severe malarial anaemia in Africa
Trial data have shown that post-discharge malaria chemoprevention (PDMC) reduces the risk of readmission and death in children previously hospitalised with severe malarial anaemia. Here, the authors use mathematical modelling to estimate the potential epidemiological impacts of PDMC in malaria-endemic countries in Africa.
- Lucy C. Okell
- , Titus K. Kwambai
- & Amani Thomas Mori
-
Article
| Open AccessscMoMaT jointly performs single cell mosaic integration and multi-modal bio-marker detection
Many methods for single cell data integration have been developed, though mosaic integration remains challenging. Here the authors present scMoMaT, a mosaic integration method for single cell multi-modality data from multiple batches, that jointly learns cell representations and marker features across modalities for different cell clusters, to interpret the cell clusters from different modalities.
- Ziqi Zhang
- , Haoran Sun
- & Xiuwei Zhang
-
Article
| Open AccessMiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer
Conventional transcriptome-wide association study (TWAS) approaches predict genetically regulated gene expression at the tissue level. Here, the authors develop a framework for cell-type-aware TWAS that predicts cell-type level expression from genotype data and identifies disease-associated genes with cell-type-specific effects.
- Xiaoyu Song
- , Jiayi Ji
- & Weiva Sieh
-
Article
| Open AccessMolecular characterization of Richter syndrome identifies de novo diffuse large B-cell lymphomas with poor prognosis
Richter syndrome (RS) is the transformation of chronic lymphocytic leukaemia (CLL) into aggressive lymphoma, in most cases diffuse large B-cell lymphoma (DLBCL). Here, the authors characterize the DNA methylation and transcriptomic profiles of RS samples, find a clonally-related CLL epigenetic imprint, and develop classifiers for “RS-type” de novo DLBCLs.
- Julien Broséus
- , Sébastien Hergalant
- & Stephan Stilgenbauer
-
Article
| Open AccessAnnotation of natural product compound families using molecular networking topology and structural similarity fingerprinting
Comparing experimental mass spectra to reference spectra can enable natural product identification, but these spectral libraries are often incomplete and not universally applicable. Here, the authors present SNAP-MS, a tool that allows assigning compound families without experimental or calculated reference spectra.
- Nicholas J. Morehouse
- , Trevor N. Clark
- & Roger G. Linington
-
Article
| Open AccessProbabilistic embedding, clustering, and alignment for integrating spatial transcriptomics data with PRECAST
Methods that perform data integration are needed to analyse spatial transcriptomics data from multiple tissue slides. Here, the authors present PRECAST, an efficient data integration method for multiple spatial transcriptomics datasets with complex batch or biological effects between slides.
- Wei Liu
- , Xu Liao
- & Jin Liu
-
Article
| Open AccessDeciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak
Long-read sequencing is promising for the detection of structural variants (SVs), which requires algorithms with high sensitivity and precision. Here, the authors develop DeBreak, an algorithm for comprehensive and accurate SV detection in long-read sequencing data across different platforms, which outperforms other SV callers.
- Yu Chen
- , Amy Y. Wang
- & Zechen Chong
-
Article
| Open AccessDynamics of CLIMP-63 S-acylation control ER morphology
A key player in the formation of endoplasmic reticulum sheets is CLIMP-63, but mechanistic details remained elusive. Here authors combined cellular experiments and mathematical modelling to show that S-acylation of CLIMP-63 regulates its function by mediating its oligomerisation, turnover, and localisation.
- Patrick A. Sandoz
- , Robin A. Denhardt-Eriksson
- & F. Gisou van der Goot
-
Article
| Open AccessBMI-adjusted adipose tissue volumes exhibit depot-specific and divergent associations with cardiometabolic diseases
Different location of adipose tissue may have different consequences to cardiometabolic risk. Here the authors report that deep learning enabled accurate prediction of specific adipose tissue volumes, and that after adjustment for BMI, visceral adiposity was associated with increased risk of cardiometabolic disease, while gluteofemoral adiposity was associated with reduced risk.
- Saaket Agrawal
- , Marcus D. R. Klarqvist
- & Amit V. Khera
-
Article
| Open AccessComparative analysis of genome-scale, base-resolution DNA methylation profiles across 580 animal species
DNA methylation is involved in regulatory processes throughout the animal kingdom. Here, the authors map DNA methylation in 535 vertebrates and 45 invertebrates, establishing a reference dataset for cross-species analysis and exploring epigenetic variation across vertebrate evolution.
- Johanna Klughammer
- , Daria Romanovskaia
- & Christoph Bock
-
Article
| Open AccessTransformer for one stop interpretable cell type annotation
Developing computational tools for interpretable cell type annotation in scRNA-seq data remains challenging. Here the authors propose a Transformer-based model for interpretable annotation transfer using biologically understandable entities, and demonstrate its performance on large or atlas datasets.
- Jiawei Chen
- , Hao Xu
- & Jing-Dong J. Han
-
Article
| Open AccessGALA: a computational framework for de novo chromosome-by-chromosome assembly with long reads
Genomes usually contain multiple chromosomes. The paper reports on GALA, a computational framework for chromosome-based sequencing data separation and gap-free de novo assembly. It allows integration of different sources of data.
- Mohamed Awad
- & Xiangchao Gan
-
Article
| Open AccessEstimating conformational landscapes from Cryo-EM particles by 3D Zernike polynomials
Conformational heterogeneity is key to understand how macromolecular function and structure converge. Here, the authors propose an algorithm designed to estimate structural landscapes directly from Cryo-EM particle images.
- D. Herreros
- , R. R. Lederman
- & J. M. Carazo
-
Article
| Open AccessThermodynamic architecture and conformational plasticity of GPCRs
GPCRs are integral membrane proteins that serve as attractive drug targets. Here, authors delineate the conformational landscapes of 45 GPCRs using a statistical model, highlighting their malleable native ensembles and providing functional insights.
- Sathvik Anantakrishnan
- & Athi N. Naganathan
-
Article
| Open AccessDirect and indirect effects of the COVID-19 pandemic on mortality in Switzerland
COVID-19-releated public health measures may have indirectly impacted mortality rates by causing or averting deaths. Here, the authors use data from Switzerland until April 2022 and estimate that, after accounting for deaths directly related to COVID-19, mortality was lower than expected, indicating some evidence of an overall positive impact of control measures.
- Julien Riou
- , Anthony Hauser
- & Garyfallos Konstantinoudis
-
Article
| Open AccessBenchmarking commonly used software suites and analysis workflows for DIA proteomics and phosphoproteomics
Many software suites and spectral libraries have been developed for DIA proteomics data analysis. Here, the authors create benchmark data sets to evaluate four commonly used software tools combined with seven spectral libraries in both global proteomics and phosphoproteomics analysis.
- Ronghui Lou
- , Ye Cao
- & Wenqing Shui
-
Article
| Open AccessA Bayesian model for unsupervised detection of RNA splicing based subtypes in cancers
RNA splicing variations could help identify cancer subtypes, but this task is computationally challenging. Here, the authors develop CHESSBOARD, a Bayesian tile finding algorithm for splicing data which identifies patterns in the form of tiles and can discover leukemia subgroups associated with therapeutic response.
- David Wang
- , Mathieu Quesnel-Vallieres
- & Yoseph Barash
-
Article
| Open AccessPrediction of designer-recombinases for DNA editing with generative deep learning
Design of recombinases with new target sites is usually achieved through cycles of directed molecular evolution. Here the authors report Recombinase Generator, RecGen, an algorithm for generation of designer-recombinases; they perform experimental validation to show that this can predict recombinase sequences.
- Lukas Theo Schmitt
- , Maciej Paszkowski-Rogacz
- & Frank Buchholz
-
Article
| Open AccessThousands of human non-AUG extended proteoforms lack evidence of evolutionary selection among mammals
Analysis of a large number of Ribo-seq datasets and genomic alignments led to detection of novel non-AUG proteoforms. Unexpectedly the number of non-AUG proteoforms identified with Ribo-seq greatly exceeds those with strong phylogenetic support.
- Alla D. Fedorova
- , Stephen J. Kiniry
- & Pavel V. Baranov
-
Matters Arising
| Open AccessReply to: A balanced measure shows superior performance of pseudobulk methods in single-cell RNA-sequencing analysis
- Kip D. Zimmerman
- , Ciaran Evans
- & Carl D. Langefeld
-
Matters Arising
| Open AccessA balanced measure shows superior performance of pseudobulk methods in single-cell RNA-sequencing analysis
- Alan E. Murphy
- & Nathan G. Skene
-
Article
| Open AccessBenchmarking tools for detecting longitudinal differential expression in proteomics data allows establishing a robust reproducibility optimization regression approach
Longitudinal proteomics holds great promise for biomarker discovery, but the data interpretation has remained a challenge. Here, the authors evaluate several tools to detect longitudinal differential expression in proteomics data and introduce RolDE, a robust reproducibility optimization approach.
- Tommi Välikangas
- , Tomi Suomi
- & Laura L. Elo
-
Article
| Open AccessGraph-based pangenomics maximizes genotyping density and reveals structural impacts on fungal resistance in melon
The power of pangenomic graphs to improve genetic mapping is still unclear. Here, the authors demonstrate its value in identification of genetic variants associated with disease resistance traits in melon using PanPipes, a pangenome construction and low-coverage genotype-by-sequencing pipeline.
- Justin N. Vaughn
- , Sandra E. Branham
- & William P. Wechter
-
Article
| Open AccessA method to build extended sequence context models of point mutations and indels
The mutation rate at any specific position in the human genome depends on sequence context. Here, the authors develop a method for predicting mutation rates of point mutations and indels based on sequence context; the results can be used to find genes where de novo mutations cause disease and genes under strong selective constraint.
- Jörn Bethune
- , April Kleppe
- & Søren Besenbacher
-
Article
| Open AccessProtein complex prediction using Rosetta, AlphaFold, and mass spectrometry covalent labeling
Covalent labeling (CL) from mass spectrometry experiments provides structural information of higher-order protein structure. Here, the authors develop an algorithm which integrates experimental CL data to predict protein complexes in the Rosetta molecular modeling suite using AlphaFold models.
- Zachary C. Drake
- , Justin T. Seffernick
- & Steffen Lindert
-
Article
| Open AccessProteome-wide 3D structure prediction provides insights into the ancestral metabolism of ancient archaea and bacteria
Previous studies have reconstructed ancestral metabolism using sequence-based approaches. This study uses a high-throughput version of AlphaFold2 to compare proteome-wide 3D structure predictions of two representative strains of ancient archaea and bacteria.
- Weishu Zhao
- , Bozitao Zhong
- & Xiang Xiao
-
Article
| Open AccessCombining genome-wide association studies highlight novel loci involved in human facial variation
Combining multiple related traits can increase power in genetic association studies. Here, the authors develop a method to integrate GWAS statistics for multiple traits and apply it to find genetic loci affecting human facial variation.
- Ziyi Xiong
- , Xingjian Gao
- & Fan Liu
-
Article
| Open AccessEstimating diagnostic uncertainty in artificial intelligence assisted pathology using conformal prediction
Artificial intelligence prediction accuracy can be reduced with new data. Here, the authors utilise conformal prediction to reduce incorrect predictions in histopathological analysis of prostate cancer biopsies.
- Henrik Olsson
- , Kimmo Kartasalo
- & Martin Eklund
-
Article
| Open AccessAccuracy and data efficiency in deep learning models of protein expression
Synthetic biology often involves engineering microbial strains to express high-value proteins. Here the authors build deep learning predictors of protein expression from sequence that deliver accurate models with fewer data than previously assumed, helping to lower costs of model-driven strain design.
- Evangelos-Marios Nikolados
- , Arin Wongprommoon
- & Diego A. Oyarzún
-
Article
| Open AccessA unifying Bayesian framework for merging X-ray diffraction data
Observation of the chemical and conformational dynamics of biomolecules by diffraction methods is impeded by several physical artifacts. The authors present an extensible framework for accurate correction of such data that can keep pace with rapid developments in diffraction methods.
- Kevin M. Dalton
- , Jack B. Greisman
- & Doeke R. Hekstra
-
Article
| Open AccessDiscovery of synthetic lethal interactions from large-scale pan-cancer perturbation screens
Synthetic lethality can be used to identify potential drug targets in cancer based on simultaneous inactivation of two genes through genetic aberrations and gene silencing. Here, the authors develop a statistical framework to identify synthetic lethal pairs from large scale perturbation screens across multiple cancer types.
- Sumana Srivatsa
- , Hesam Montazeri
- & Niko Beerenwinkel
-
Article
| Open AccessClustering of single-cell multi-omics data with a multimodal deep learning method
Single-cell multimodal sequencing technologies are developed to simultaneously profile different modalities of data in the same cell. Here the authors develops a multimodal deep clustering method for the analysis of single-cell multi-omics data that supports clustering different types of multi-omics data and multi-batch data, as well as downstream differential expression analysis.
- Xiang Lin
- , Tian Tian
- & Hakon Hakonarson
-
Article
| Open AccessSpatial transcriptomics landscape of lesions from non-communicable inflammatory skin diseases
Inflammatory skin diseases involve various different immune cells in a localised area. Here the authors use spatial transcriptomics to show that disease relevant cytokine transcripts are sparsely expressed in lesional skin, yet are associated with local amplification cascades that promote skin inflammation.
- A. Schäbitz
- , C. Hillig
- & S. Eyerich
-
Article
| Open AccessInferring time-varying generation time, serial interval, and incubation period distributions for COVID-19
The generation time (interval between successive infections in a transmission chain) is an important parameter for epidemiological modeling. Here, the authors develop a framework for estimating this parameter and how it changes over time and apply it to data from China in the first months of the pandemic.
- Dongxuan Chen
- , Yiu-Chung Lau
- & Sheikh Taslim Ali
-
Article
| Open AccessTumor fractions deciphered from circulating cell-free DNA methylation for cancer early diagnosis
‘Circulating cell-free DNA can be used to predict cancer, but it is more challenging to assess in early stage cancer. Here, the authors created a diagnostic model using tumor fractions deciphered from circulating cfDNA methylation signatures, which exhibited an 86% sensitivity in detecting early-stage cancer.
- Xiao Zhou
- , Zhen Cheng
- & Weibin Cheng
-
Article
| Open AccessDeep transfer learning enables lesion tracing of circulating tumor cells
Liquid biopsy offers great promise for noninvasive cancer diagnostics, while the lack of adequate target characterization and analysis hinders its wide application. Here, the authors design a transfer learning-based algorithm to transfer lesion labels from the primary cancer cell atlas to circulating tumor cells.
- Xiaoxu Guo
- , Fanghe Lin
- & Jia Song
-
Article
| Open AccessBroad misappropriation of developmental splicing profile by cancer in multiple organs
The molecular mechanisms underlying the overlap between oncogenic and embryonic development remain to be explored. Here, the authors use temporal transcriptomic data during development in multiple human organs and suggest the involvement of alternative splicing events, splicing factors, and transcription factors.
- Arashdeep Singh
- , Arati Rajeevan
- & Sridhar Hannenhalli
Browse broader subjects
Browse narrower subjects
- Biochemical reaction networks
- Cellular signalling networks
- Classification and taxonomy
- Communication and replication
- Computational models
- Computational neuroscience
- Computational platforms and environments
- Data acquisition
- Data integration
- Data mining
- Data processing
- Data publication and archiving
- Databases
- Functional clustering
- Gene ontology
- Gene regulatory networks
- Genome informatics
- Hardware and infrastructure
- High-throughput screening
- Image processing
- Literature mining
- Machine learning
- Microarrays
- Network topology
- Phylogeny
- Power law
- Predictive medicine
- Probabilistic data networks
- Programming language
- Protein analysis
- Protein design
- Protein folding
- Protein function predictions
- Protein structure predictions
- Proteome informatics
- Quality control
- Scale invariance
- Sequence annotation
- Software
- Standards
- Statistical methods
- Virtual drug screening