Computational biology and bioinformatics articles within Nature Communications

Featured

  • Article
    | Open Access

    Selective sweeps are events in which beneficial mutations spread rapidly through a population. Here, Sugden et al. develop SWIF(r), a probabilistic classification framework for detecting and localizing selective sweeps, and apply it to genomic data from the ‡Khomani San.

    • Lauren Alpert Sugden
    • , Elizabeth G. Atkinson
    •  & Sohini Ramachandran
  • Article
    | Open Access

    Current methods for prioritization of non-coding genetic risk variants are based on sequence and chromatin features. Here, Gao et al. develop ARVIN, which predicts causal regulatory variants using disease-relevant gene-regulatory networks, and validate this approach in reporter gene assays.

    • Long Gao
    • , Yasin Uzun
    •  & Kai Tan
  • Article
    | Open Access

    Elucidating molecular organisation requires precise localisation and analysis. Here the authors develop SODA software for automatic and quantitative mapping of statistically coupled molecules, and use it to unravel spatial organisation of thousands of synaptic proteins in SIM and 3DSTORM microscopy.

    • Thibault Lagache
    • , Alexandre Grassart
    •  & Jean-Christophe Olivo-Marin
  • Article
    | Open Access

    Sepsis is characterized by deregulated host response to infection. Efficient therapies are still needed but a limitation for sepsis treatment is the heterogeneity in patients. Here Sweeney et al. generate prognostic models based on gene expression to improve risk stratification classification and prediction for 30-day mortality of patients.

    • Timothy E. Sweeney
    • , Thanneer M. Perumal
    •  & Raymond J. Langley
  • Article
    | Open Access

    RNA levels in post-mortem tissue can differ greatly from those before death. Studying the effect of post-mortem interval on the transcriptome in 36 human tissues, Ferreira et al. find that the response to death is largely tissue-specific and develop a model to predict time since death based on RNA data.

    • Pedro G. Ferreira
    • , Manuel Muñoz-Aguirre
    •  & Roderic Guigó
  • Article
    | Open Access

    Mass cytometry is a powerful method of single cell analysis, but potential confounding effects of cell cycle and cell volume are not taken into account. Here the authors present a combined experimental and computational method to correct for these effects and reveal features of TNFα stimulation that are otherwise masked.

    • Maria Anna Rapsomaniki
    • , Xiao-Kang Lun
    •  & María Rodríguez Martínez
  • Article
    | Open Access

    Different experimental and computational approaches can be used to study RNA structures. Here, the authors present a computational method for data-directed reconstruction of complex RNA structure landscapes, which predicts a parsimonious set of co-existing structures and estimates their abundances from structure profiling data.

    • Hua Li
    •  & Sharon Aviran
  • Article
    | Open Access

    B and T cell receptor diversity can be studied by high-throughput immune receptor sequencing. Here, the authors develop a software tool, IGoR, that calculates the likelihoods of potential V(D)J recombination and somatic hypermutation scenarios from raw immune sequence reads.

    • Quentin Marcou
    • , Thierry Mora
    •  & Aleksandra M. Walczak
  • Article
    | Open Access

    Increase in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads.

    • Antonio A. Ginart
    • , Joseph Hui
    •  & David N. Tse
  • Article
    | Open Access

    Long-read sequencing technologies facilitate efficient and high quality genome assembly. Here Michael et al. achieve a fast reference assembly for Arabidopsis thaliana KBS-Mac-74 accession using the handheld Oxford Nanopore MinION sequencer and consumer computing hardware, and demonstrate its usefulness in resolving complex structural variation.

    • Todd P. Michael
    • , Florian Jupe
    •  & Joseph R. Ecker
  • Article
    | Open Access

    Topologically associating domains (TADs) detected by Hi-C technologies are megabase-scale areas of highly interacting chromatin. Here Gong, Lazaris et al. develop a computational approach to improve the reproducibility of Hi-C contact matrices and stratify TAD boundaries based on their insulating strength.

    • Yixiao Gong
    • , Charalampos Lazaris
    •  & Aristotelis Tsirigos
  • Article
    | Open Access

    Innate immunity combines intra- and intercellular signalling to develop responses that limit pathogen spread. Here the authors analyse feedback and feedforward loops connecting IRF3, NF-κB and STAT pathways, and suggest they allow coordinating cell fate decisions in cellular populations in response to the virus-mimicking agent poly(I:C).

    • Maciej Czerkies
    • , Zbigniew Korwek
    •  & Tomasz Lipniacki
  • Article
    | Open Access

    Ageing is associated with a pronounced shift in mortality from cancer to degenerative diseases. Here, the authors show that in concordance with this shift, conserved transcriptional alterations during ageing across four vertebrates align with degenerative diseases but are opposite to those in cancer.

    • Peer Aramillo Irizar
    • , Sascha Schäuble
    •  & Christoph Kaleta
  • Article
    | Open Access

    Resting cortical activity fluctuates, but it is unclear what underlies these variations in activity. Here, the authors show that large-scale fluctuations in fMRI cortical activity are associated with momentary decreases in cortical arousal and opposite activity changes in the basal forebrain and thalamus.

    • Xiao Liu
    • , Jacco A. de Zwart
    •  & Jeff H. Duyn
  • Article
    | Open Access

    Further automation of NMR structure determination is needed to increase the throughput and accessibility of this method. Here the authors present 4D-CHAINS/autoNOE-Rosetta, a complete pipeline that allows rapid and fully automated structure determination from two highly complementary NMR datasets.

    • Thomas Evangelidis
    • , Santrupti Nerli
    •  & Konstantinos Tripsianes
  • Article
    | Open Access

    While gene expression noise in single-celled organisms is well understood, it is less so in the context of tissues. Here the authors show that coupling between cells in tissues can increase or decrease cell-to-cell variability depending on the level of noise intrinsic to the regulatory networks.

    • Stephen Smith
    •  & Ramon Grima
  • Article
    | Open Access

    Single-cell RNA sequencing (scRNA-seq) data provides information on transcriptomic heterogeneity within cell populations. Here, Risso et al develop ZINB-WaVE for low-dimensional representations of scRNA-seq data that account for zero inflation, over-dispersion, and the count nature of the data.

    • Davide Risso
    • , Fanny Perraudeau
    •  & Jean-Philippe Vert
  • Article
    | Open Access

    The isolation of single cells while retaining context is important for quantifying cellular heterogeneity but technically challenging. Here, the authors develop a high-throughput, scalable workflow for microscopy-based single cell isolation using machine-learning, high-throughput microscopy and laser capture microdissection.

    • Csilla Brasko
    • , Kevin Smith
    •  & Peter Horvath
  • Article
    | Open Access

    Although topologically associating domains (TADs) have been extensively investigated, it is not clear to what extent DNA sequence contributes to their formation. Here the authors develop software to identify high-resolution TAD boundaries and reveal their relationship to underlying DNA motifs.

    • Fidel Ramírez
    • , Vivek Bhardwaj
    •  & Thomas Manke
  • Article
    | Open Access

    How gene regulatory pathways control cell fate decisions in single cells is not fully understood. Here the authors present an integrated dual-input microfluidic chip and a linked analysis software, enabling tracking of gene regulatory responses of single bacterial cells to changing conditions.

    • Matthias Kaiser
    • , Florian Jug
    •  & Erik van Nimwegen
  • Article
    | Open Access

    Genetic methods are useful to test whether risk factors are causal for or consequence of disease. Here, Zhu et al. develop a generalized summary-based Mendelian Randomization (GSMR) method which uses summary-level data from GWAS to test for causal associations of health risk factors with common diseases.

    • Zhihong Zhu
    • , Zhili Zheng
    •  & Jian Yang
  • Article
    | Open Access

    Existing pathway design tools make use of existing reactions from databases or successively apply retrosynthetic rules. novoStoic provides an integrated optimization-based framework combining known reactions with novel steps in pathway design allowing for constraints on thermodynamic feasibility, product yield, pathway length and number of novel steps.

    • Akhil Kumar
    • , Lin Wang
    •  & Costas D. Maranas
  • Article
    | Open Access

    The cancer immunoediting hypothesis assumes the immune system sculpts the cancer genome. Here the authors show, in a mouse model, that neutral evolution outweighs the effects of immunoselection and that immune checkpoint blockade potentiates the immunoediting, switching the system to non-neutral evolution.

    • Mirjana Efremova
    • , Dietmar Rieder
    •  & Zlatko Trajanoski
  • Article
    | Open Access

    Deregulation of signalling is responsible for many cancer phenotypes. Leveraging available perturbation data, here the authors assess large-scale pathway activity patterns based on consensus downstream readout genes, enabling accurate prediction of the effects of mutations and small molecules.

    • Michael Schubert
    • , Bertram Klinger
    •  & Julio Saez-Rodriguez
  • Article
    | Open Access

    Extracting kinetic models from high-throughput molecular dynamics (MD) simulations is laborious and prone to human error. Here the authors introduce a deep learning framework that automates construction of Markov state models from MD simulation data.

    • Andreas Mardt
    • , Luca Pasquali
    •  & Frank Noé
  • Article
    | Open Access

    Microbiota is often a complex mixture of multiple coexisting species and strains with high level of phenotypic and genomic variability. Here, Albanese and Donati develop StrainEst for estimating the number and identity of coexisting strains and their relative abundances in mixed metagenomic samples.

    • Davide Albanese
    •  & Claudio Donati
  • Article
    | Open Access

    Network dynamical systems can represent the interactions involved in the collective dynamics of gene regulatory networks or metabolic circuits. Here Casadiego et al. present a method for inferring these types of interactions directly from observed time series without relying on their model.

    • Jose Casadiego
    • , Mor Nitzan
    •  & Marc Timme
  • Article
    | Open Access

    Facioscapulohumeral muscular dystrophy is a myopathy linked to ectopic expression of the DUX4 transcription factor. The authors show that the suppression of targets genes of the myogenesis regulator PAX7 is a signature of FSHD, and might explain oxidative stress sensitivity and epigenetic changes.

    • Christopher R. S. Banerji
    • , Maryna Panamarova
    •  & Peter S. Zammit
  • Article
    | Open Access

    Ligand-induced biased signaling is thought to result in part from ligand-specific receptor conformations that cause the engagement of distinct effectors. Here the authors trace and evaluate the impact of mutations of the β2–adrenergic receptor on multiple signaling outputs to provide structural-level insight into the determinants of GPCR functional selectivity.

    • Anne-Marie Schönegge
    • , Jonathan Gallion
    •  & Michel Bouvier
  • Article
    | Open Access

    It is not known whether the shape of a cell can regulate cellular phenotype independently. Here, the authors show that culturing kidney podocytes or smooth muscle cells on 3-D biomimetic surfaces results in phenotypic changes and that cell shape is sensed by integrin β3 in a tension-independent manner.

    • Amit Ron
    • , Evren U. Azeloglu
    •  & Ravi Iyengar
  • Article
    | Open Access

    Mathematical approaches can be used to assess immune cell composition from the tumour's bulk expression data. Here the authors optimise the CYBERSORT-based deconvolution algorithm by including cell type-specific reference gene expression profiles generated from tumour-derived single-cell RNA sequencing data.

    • Max Schelker
    • , Sonia Feau
    •  & Andreas Raue
  • Article
    | Open Access

    There is a need to understand how mammary epithelial cells respond to changes at various developmental stages. Here, the authors use single-cell RNA sequencing of mammary epithelial cells at different adult developmental stages, identifying different cell types and charting their developmental trajectory.

    • Karsten Bach
    • , Sara Pensa
    •  & Walid T. Khaled
  • Article
    | Open Access

    Rising rates of peanut allergy pose a public health problem. Here, the authors profile blood transcriptomes during double-blind, placebo-controlled oral challenge in peanut-allergic children to identify gene and cell composition changes, and construct causal networks to detect key allergic reaction drivers.

    • C. T. Watson
    • , A. T. Cohain
    •  & S. Bunyavanich
  • Article
    | Open Access

    Reported associations between the human microbiome and disease are often inconsistent. Here, Duvallet et al. perform a meta-analysis of 28 gut microbiome studies spanning ten diseases, and find associations that are likely not disease-specific but potentially part of a shared response to disease.

    • Claire Duvallet
    • , Sean M. Gibbons
    •  & Eric J. Alm
  • Article
    | Open Access

    Centromeres and large-scale structural variants evolve and contribute to genome diversity during vertebrate speciation. Here Ichikawa et al perform de novo long-read genome assembly of three inbred medaka strains, and report long-range structure of centromeres and their methylation as well as correlation of structural variants with differential gene expression.

    • Kazuki Ichikawa
    • , Shingo Tomioka
    •  & Shinich Morishita
  • Article
    | Open Access

    Prioritizing genetic variants is a major challenge in genome-wide association studies. Here, the authors develop FUMA, a web-based bioinformatics tool that uses a combination of positional, eQTL and chromatin interaction mapping to prioritize likely causal variants and genes.

    • Kyoko Watanabe
    • , Erdogan Taskesen
    •  & Danielle Posthuma
  • Article
    | Open Access

    The authors use an integrative clustering approach to identify two laryngeal cancer clusters with distinct prognosis and show that mutations damaging the NSD1 and NSD2 methyltransferases segregate to the cluster with favorable prognosis, and independently predict longer survival in patients with laryngeal, but not other head and neck cancers.

    • Suraj Peri
    • , Evgeny Izumchenko
    •  & Erica A. Golemis
  • Article
    | Open Access

    Eukaryotic genomes are partitioned into self-interacting modules or topologically associated domains (TADs) that exist at the kilo-megabase scale. Here Cattoni et al. combine super-resolution microscopy with DNA-labeling methods to quantify absolute frequencies of interactions within TADs.

    • Diego I. Cattoni
    • , Andrés M. Cardozo Gizzi
    •  & Marcelo Nollmann
  • Article
    | Open Access

    Single cell profiling yields high dimensional data of very large numbers of cells, posing challenges of visualization and analysis. Here the authors introduce a method for analysis of mass cytometry data that can handle very large datasets and allows their intuitive and hierarchical exploration.

    • Vincent van Unen
    • , Thomas Höllt
    •  & Boudewijn P. F. Lelieveldt