Computational biology and bioinformatics articles within Nature Communications

Featured

  • Article
    | Open Access

    Coupling of growth and product synthesis is an important principle in metabolic engineering, but its range of applicability is unclear. Here, the authors use a dedicated computational framework to study the feasibility of coupling the production of metabolites to growth in the genome-scale metabolic models of five production organisms, and show that coupling can be achieved for most metabolites.

    • Axel von Kamp
    •  & Steffen Klamt
  • Article
    | Open Access

    In single-cell RNA sequencing data of heterogeneous cell populations, cell cycle stage of individual cells would often be informative. Here, the authors introduce a computational model to reconstruct a pseudo-time series from single cell transcriptome data, identify the cell cycle stages, identify candidate cell cycle-regulated genes and recover the methylome changes during the cell cycle.

    • Zehua Liu
    • , Huazhe Lou
    •  & Ting Chen
  • Article
    | Open Access

    Accurate quantification of bioimaging data is often confounded by uneven illumination (shading) in space and background variation in time. Here the authors present BaSiC, a Fiji plugin solving both issues. It requires fewer input images and is more robust to artefacts than existing shading correction tools.

    • Tingying Peng
    • , Kurt Thorn
    •  & Nassir Navab
  • Article
    | Open Access

    V2 neurons exhibit complex and diverse selectivity for visual features. Here the authors use a statistical analytical framework to model V2 responses to natural stimuli and find three organizing principles, chief among them is the cross-orientation suppression that increases response selectivity.

    • Ryan J. Rowekamp
    •  & Tatyana O. Sharpee
  • Article
    | Open Access

    Some cancers with DNA mismatch repair deficiency display microsatellite instability. Here the authors analyse twenty three cancer types at the exome and whole-genome level, and identify loci with recurrent microsatellite instability that could be used to identify patients who would benefit from immunotherapy.

    • Isidro Cortes-Ciriano
    • , Sejoon Lee
    •  & Peter J. Park
  • Article
    | Open Access

    Maximum Intensity Projection is a common tool to represent 3D biological imaging data in a 2D space, but it creates artefacts. Here the authors develop Smooth Manifold Extraction, an ImageJ/Fiji plugin, to preserve local spatial relationships when extracting the content of a 3D volume to a 2D space.

    • Asm Shihavuddin
    • , Sreetama Basu
    •  & Auguste Genovesio
  • Article
    | Open Access

    There are no robust methods for systematically identifying mutation-specific synthetic lethal (SL) partners in cancer. Here, the authors develop a computational algorithm that uses pan-cancer data to detect mutation-andcancer-specific SL partners and they validate a novel SL interaction between mutant IDH and loss of ACACA in leukaemia.

    • Subarna Sinha
    • , Daniel Thomas
    •  & David L. Dill
  • Article
    | Open Access

    Tumour expression profiling is currently used for prognostic and predictive purposes without taking into account the intra patient heterogeneity. Here the authors show that cancer cell specific signatures overcome the tumour heterogeneity effect and result in better classification of colorectal cancer patients.

    • Philip D. Dunne
    • , Matthew Alderdice
    •  & Mark Lawler
  • Article
    | Open Access

    Growth-coupled designs for chemical production are limited by native metabolic networks’ optimality for growth. Here, the authors introduce pathway orthogonality as a measure of the independence of biomass and chemical production pathways, identify metabolic valves that allow substrate utilization to be switched between the two, and demonstrate advantages of orthogonal designs.

    • Aditya Vikram Pandit
    • , Shyam Srinivasan
    •  & Radhakrishnan Mahadevan
  • Article
    | Open Access

    There are few methods available that can quantify relationships between cell types in tissue images. Here the authors present a quantitative method to evaluate cellular organization, validated in the mouse thymus and spinal cord, called Multitaper Circularly Averaged Spectral Analysis (MiCASA).

    • Andrew Sornborger
    • , Jie Li
    •  & Nancy R. Manley
  • Article
    | Open Access

    Analysis of molecular quantitative trait loci (molQTL) can help interpret genome-wide association studies and requires efficient approaches to correct for multiple testing. This study describes a bioinformatics toolkit called QTLtool that can handle large data sets and quickly perform multiple types of molQTL analyses.

    • Olivier Delaneau
    • , Halit Ongen
    •  & Emmanouil T. Dermitzakis
  • Article
    | Open Access

    Outsourcing computation for genomic data processing offers the ability to allocate massive computing power and storage on demand. Here, Popic and Batzoglou develop a hybrid cloud aligner for sequence read mapping that preserves privacy with competitive accuracy and speed.

    • Victoria Popic
    •  & Serafim Batzoglou
  • Article
    | Open Access

    Biomphalaria glabrata is a fresh water snail that acts as a host for trematode Schistosoma mansoni that causes intestinal infection in human. This work describes the genome and transcriptome analyses from 12 different tissues of B glabrata, and identify genes for snail behavior and evolution.

    • Coen M. Adema
    • , LaDeana W. Hillier
    •  & Richard K. Wilson
  • Article
    | Open Access

    Plastoquinone (PLQ) shuttles electrons between photosystem II (PSII) and cytochrome b6f. Here the authors perform molecular dynamics simulations and propose that PLQ enters the exchange cavity of PSII by a promiscuous diffusion mechanism whereby three different channels each act as entry and exit points.

    • Floris J. Van Eerden
    • , Manuel N. Melo
    •  & Siewert J. Marrink
  • Article
    | Open Access

    We can often observe only a small fraction of a system, which leads to biases in the inference of its global properties. Here, the authors develop a framework that enables overcoming subsampling effects, apply it to recordings from developing neural networks, and find that neural networks become critical as they mature.

    • A. Levina
    •  & V. Priesemann
  • Article
    | Open Access

    The coevolution of viruses and host cells can be mapped with interactomics. Here the authors identify coupling of human and viral promoters, and show that HIV-reactivation from dormancy is coincident with migration of HIV-infected cells owing to coupling of human CXCR4 and HIV LTR promoters.

    • Kathrin Bohn-Wippert
    • , Erin N. Tevonian
    •  & Roy D. Dar
  • Article
    | Open Access

    Quantifying deformation patterns of curved epithelial sheets is challenging owing to imaging difficulties. Here the authors develop a method to obtain a quantitative description of 3D tissue deformation dynamics from a small set of cell positional data and applied it to chick forebrain morphogenesis.

    • Yoshihiro Morishita
    • , Ken-ichi Hironaka
    •  & Daisuke Ohtsuka
  • Article
    | Open Access

    Large dsDNA viruses either replicate in or disrupt the nucleus to gain access to host RNA polymerases, or they rely on virus-encoded, packaged RNA polymerases. Here, the authors show that Noumeavirus replicates in the cytoplasm and relies on a transient recruitment of nuclear proteins to initiate replication.

    • Elisabeth Fabre
    • , Sandra Jeudy
    •  & Chantal Abergel
  • Article
    | Open Access

    The identification of master regulator genes that may be manipulatedin vitro to regulate reprogramming has been difficult. Here, the authors use a computational systems approach to identify three genes (FoxA1, Nkx3.1and the androgen receptor) that can reprogramme fibroblasts to prostate tissue.

    • Flaminia Talos
    • , Antonina Mitrofanova
    •  & Michael M. Shen
  • Article
    | Open Access

    Some argue that health-related behaviours, such as obesity, are contagious, but empirical evidence of health contagion remains inconclusive. Here, using a large scale quasi-experiment in a global network of runners, Aral and Nicolaides show that this type of contagion exists in fitness behaviours.

    • Sinan Aral
    •  & Christos Nicolaides
  • Article
    | Open Access

    Beta-arrestins play central roles in the mechanisms regulating GPCR signalling and trafficking. Here the authors identify a selective inhibitor of the interaction between β-arrestin and the β2-adaptin subunit of the clathrin adaptor protein AP-2, which they use to dissect the role of the β-arrestin/β2-adaptin interaction in GPCR signalling.

    • Alexandre Beautrait
    • , Justine S. Paradis
    •  & Michel Bouvier
  • Article
    | Open Access

    Genome assembly for many plant species can be challenging due to large size and high repeat content. Here, the authors usein vitroproximity ligation to assemble the genome of lettuce, revealing a family-specific triplication event and providing a comprehensive reference genome for a member of the Compositae.

    • Sebastian Reyes-Chin-Wo
    • , Zhiwen Wang
    •  & Richard W. Michelmore
  • Article
    | Open Access

    Dysregulation of microRNAs is thought to be important for metastasis in castration-resistant prostate cancer (CRPC), but its drivers are unknown. Here, the authors use computational analysis to identify HOXC6 and NKX2-2 in addition to AR as alternative drivers of dysregulated miRNA expression in CRPC.

    • Mengzhu Xue
    • , Haiyue Liu
    •  & Peng Wang
  • Article
    | Open Access

    The impact of chromatin structure on gene expression makes it integral to our understanding of developmental and disease processes. Here, the authors introduce a hierarchical hidden Markov model to systematically annotate chromatin states at multiple length scales, and demonstrate its utility for the elucidation of the role of chromatin structure in gene expression.

    • Eugenio Marco
    • , Wouter Meuleman
    •  & Guo-Cheng Yuan
  • Article
    | Open Access

    Large-scale metabolic models of organisms from microbes to mammals can provide great insight into cellular function, but their analysis remains challenging. Here, the authors provide an approximate analytic method to estimate the feasible solution space for the flux vectors of metabolic networks, enabling more accurate analysis under a wide range of conditions of interest.

    • Alfredo Braunstein
    • , Anna Paola Muntoni
    •  & Andrea Pagnani
  • Article
    | Open Access

    While rare cell subpopulations frequently make the difference between health and disease, their detection remains a challenge. Here, the authors devise CellCnn, a representation learning approach to detecting such rare cell populations from high-dimensional single cell data, and, among other examples, demonstrate its capacity for detecting rare leukaemic blasts in minimal residual disease.

    • Eirini Arvaniti
    •  & Manfred Claassen
  • Article
    | Open Access

    Patient-derived xenografts recapitulate major genomic signatures and transcriptome profiles of their original tumours. Here, the authors, performing proteomic and phosphoproteomic analyses of 24 breast cancer PDX models, demonstrate that druggable candidates can be identified based on a comprehensive proteogenomic profiling.

    • Kuan-lin Huang
    • , Shunqiang Li
    •  & Li Ding
  • Article
    | Open Access

    The gene-battery model posits transposable elements (TEs) may becis-regulatory elements to control gene expression. Here, mouse-specific TEs are shown as binding sites for multiple collaborating transcription factors in embryonic stem cells, and act as cis-regulatory modules in synergistic fashion.

    • Vasavi Sundaram
    • , Mayank N. K. Choudhary
    •  & Ting Wang
  • Article
    | Open Access

    The three-dimensional architecture of genome-reduced bacteria is poorly understood. Here the authors combine Hi-C with super-resolution microscopy inMycoplasma pneumoniaeand provide evidence of how supercoiling and local organization influences gene regulation.

    • Marie Trussart
    • , Eva Yus
    •  & Luís Serrano
  • Article
    | Open Access

    The TIM barrel fold is an evolutionarily conserved motif found in proteins with a variety of enzymatic functions. Here the authors explore the fitness landscape of the TIM barrel protein IGPS and uncover evolutionary constraints on both sequence and structure, accompanied by long range allosteric interactions.

    • Yvonne H. Chan
    • , Sergey V. Venev
    •  & C. Robert Matthews
  • Article
    | Open Access

    Assembling genomes using currently available computational methods can be time consuming. Here, Coin and colleagues describe a bioinformatics tool named npScarf that can scaffold and complete an existing short read assembly in real-time using nanopore sequencing.

    • Minh Duc Cao
    • , Son Hoang Nguyen
    •  & Lachlan J. M. Coin
  • Article
    | Open Access

    Long noncoding-RNAs have been linked to hepatocellular carcinoma (HCC) and some can be used as prognostic markers. Here the authors, by analysing RNA-seq in 60 clinical samples from 20 patients, provide a resource of functional lncRNAs and biomarkers associated with HCC tumorigenesis and metastasis.

    • Yang Yang
    • , Lei Chen
    •  & Zhi John Lu
  • Article
    | Open Access

    Sequencing initiatives have detected multiple types of mutations in cancer. Here the authors, analysing enhancer-targeting sequence data, show that small insertions in transcriptional enhancers are frequently found near oncogenes, and demonstrate how one mutation deregulates expression of LMO2 in leukemia cells.

    • Brian J. Abraham
    • , Denes Hnisz
    •  & Richard A. Young
  • Article
    | Open Access

    Single-cell RNA sequencing has enabled great advances in understanding developmental biology but reconstructing cellular lineages from this data remains challenging. Here the authors develop an algorithm,dpath, which models the lineage relationships of underlying single cells based on single cell RNA seq data and apply it to study lineage progression of Etv2 expressing progenitors.

    • Wuming Gong
    • , Tara L. Rasmussen
    •  & Daniel J. Garry
  • Article
    | Open Access

    The rat is a widely-used model for human biology, but we must be aware of metabolic differences. Here, the authors reconstruct the genome-scale metabolic network of the rat, and after reconciling it with an improved human metabolic model, demonstrate the power of the models to integrate toxicogenomics data, providing species-specific biomarker predictions in response to a panel of drugs.

    • Edik M. Blais
    • , Kristopher D. Rawls
    •  & Jason A. Papin
  • Article
    | Open Access

    Translocation of the tRNA on the ribosome is associated with large-scale molecular movements of the ribosomal L1 stalk. Here the authors identify the key determinants that allow these dramatic movements, and suggest they represent general strategies used to enable large-scale motions in functional RNAs.

    • Srividya Mohan
    •  & Harry F Noller
  • Article
    | Open Access

    Genetic data has led to great advances in our understanding of human evolution and dispersal, but information on more recent events is limited. Here, the authors analyse genotypes from 770,000 US individuals to map the fine-scale population structure of North America after European settlement.

    • Eunjung Han
    • , Peter Carbonetto
    •  & Catherine A. Ball
  • Article
    | Open Access

    Tumours frequently metastasize to multiple anatomical sites and understanding how these different metastases evolve may be important for therapy. Here, the authors develop a method—Treeomics—that can construct phylogenies from multiple metastases from next-generation sequencing data.

    • Johannes G. Reiter
    • , Alvin P. Makohon-Moore
    •  & Martin A. Nowak