Computational biology and bioinformatics articles within Nature Communications

Featured

  • Article
    | Open Access

    TALE proteins are popular tools for genome engineering because they can recognize specific DNA sequences, however off-target effects are a routine problem. Here Rogers and Barrera et al. comprehensively map TALE–DNA interactions to develop a computational model to predict binding specificity.

    • Julia M. Rogers
    • , Luis A. Barrera
    •  & Martha L. Bulyk
  • Article
    | Open Access

    Adverse drug reactions are an important clinical problem. Here the authors combine information about drug-induced gene expression changes and genetic variability of patients with a genome-scale metabolic model to identify drug-induced changes in cellular metabolism that may be linked to drug side effects.

    • Daniel C. Zielinski
    • , Fabian V. Filipp
    •  & Bernhard O. Palsson
  • Article
    | Open Access

    Proteins are sometimes implicated in separate and seemingly unrelated processes, so called moonlighting functions. Here the authors use bioinformatics tools to identify extreme multifunctional proteins and define a signature of extreme multifunctionality.

    • Charles E. Chapple
    • , Benoit Robisson
    •  & Christine Brun
  • Article
    | Open Access

    Planctomycetes appear to differ from all other bacteria in their cellular organization and their apparent lack of a peptidoglycan (PG) cell wall. Here Jeske et al. show that Planctomycetes do possess a typical PG cell wall and that their cellular architecture resembles that of Gram-negative bacteria.

    • Olga Jeske
    • , Margarete Schüler
    •  & Christian Jogler
  • Article |

    There is currently no consensus on how best to identify and delimit biogeographical regions. Here the authors develop a network-based approach incorporating complex presence–absence patterns that can successfully identify commonly recognized biogeographical regions, and apply it to two large-scale data sets of plants and amphibians.

    • Daril A. Vilhena
    •  & Alexandre Antonelli
  • Article
    | Open Access

    Cells constantly integrate information from multiple stimuli. By considering every possible means by which two stimuli can interact, Cappuccio et al. define 10 interaction modes and demonstrate their preferential use by dendritic cells responding to different combinations of microbial and host inflammatory cues.

    • Antonio Cappuccio
    • , Raphaël Zollinger
    •  & Vassili Soumelis
  • Article |

    Artifacts caused by whole-genome amplification bias are a recurrent challenge in single-cell sequencing analysis. Here, the authors develop statistical models and demonstrate an efficient strategy for controlling amplification errors by a joint analysis of single cell genomes.

    • Cheng-Zhong Zhang
    • , Viktor A. Adalsteinsson
    •  & J. Christopher Love
  • Article |

    Sequential segmentation in development is best described in vertebrates, where it relies on cell proliferation and shows regular periodicity. Here, the authors show that in the flour beetle segments are added with irregular rate and their elongation during periods of fast growth relies mostly on cell movements.

    • A. Nakamoto
    • , S. D. Hester
    •  & T. A. Williams
  • Article
    | Open Access

    The key regulators that allow transition from proliferative to invasive phenotype in melanoma cells have not been identified yet. The authors perform chromatin and transcriptome profiling followed by comprehensive bioinformatics analysis identifying new candidate regulators for two distinct cell states of melanoma.

    • Annelien Verfaillie
    • , Hana Imrichova
    •  & Stein Aerts
  • Article
    | Open Access

    Gradients of the secreted morphogen Sonic Hedgehog (Shh) pattern the neural tube in vertebrates. Cohen et al.quantify Shh signalling in developing mice, and by constructing a computational model of the process, identify mechanisms by which the dynamics of Shh signalling are regulated.

    • Michael Cohen
    • , Anna Kicheva
    •  & James Briscoe
  • Article |

    The evolutionary origin of Hippopotamidae, the family of hippos, is poorly understood. Here, the authors describe a new fossil from Kenya that unambiguously roots Hippopotamidae into the group that includes the first large terrestrial mammals to invade Africa, more than 30 million years ago.

    • Fabrice Lihoreau
    • , Jean-Renaud Boisserie
    •  & Stéphane Ducrocq
  • Article
    | Open Access

    The activity of sensory neurons can be correlated with perceptual decisions and this effect may provide insights into how sensory information is processed during perceptual tasks. Here the authors develop a network model of sensory and decision-making areas and propose that the dynamics across the network hierarchy explains the choice probabilities.

    • Klaus Wimmer
    • , Albert Compte
    •  & Jaime de la Rocha
  • Article
    | Open Access

    Motor learning is characterized by diverse cognitive processes, which lack a unified theoretical framework. Here, Takiyama et al.present a model demonstrating that motor learning is determined by prospective errors, which they test in a specially designed visuomotor adaptation task.

    • Ken Takiyama
    • , Masaya Hirashima
    •  & Daichi Nozaki
  • Article
    | Open Access

    Understanding the dynamics of enzyme-substrate complexation provides an insight into potential drugs, but intermediate states are difficult to observe experimentally. Here, the authors use simulations and machine learning to analyse the binding of transition state inhibitors to purine nucleoside phosphorylase.

    • Sergio Decherchi
    • , Anna Berteotti
    •  & Andrea Cavalli
  • Article
    | Open Access

    Understanding the epidemiology of malaria transmission between humans and mosquitoes is crucial for successful disease control. Analysing data from an 18-year malaria control programme, Churcher et al. show that decreased parasite prevalence in humans can be found concurrently with an increase in transmission efficiency.

    • Thomas S. Churcher
    • , Jean-François Trape
    •  & Anna Cohuet
  • Article |

    Body plan complexity is associated with the number of different cell types, yet the processes that create this diversity are unclear. Here the authors use transcriptomics to test the hypothesis that unlike cancer cells, novel normal cell types arise through sub-specialization of an ancestral cell type.

    • Cong Liang
    • , Alistair R.R. Forrest
    •  & Günter P. Wagner
  • Article |

    The structure of insect odorant receptors (ORs) has remained elusive due to their lack of homology to other proteins and the inability to obtain OR crystals. Here, the authors use amino acid evolutionary covariation patterns to fold these proteins de novoand generate the first three-dimensional models of insect ORs.

    • Thomas A. Hopf
    • , Satoshi Morinaga
    •  & Richard Benton
  • Article
    | Open Access

    Cellular imaging studies can generate large volumes of complex phenotypic data; however, presenting this information in a form that quickly conveys trends in the data set remains a challenge. Sailem et al.present a tool which translates such data into easily interpretable cell-like glyphs.

    • Heba Z. Sailem
    • , Julia E. Sero
    •  & Chris Bakal
  • Article
    | Open Access

    The correct assembly of genomes from sequencing data remains a challenge due to difficulties in correctly assigning the location of repeated DNA elements. Here the authors describe GRAAL, an algorithm that utilizes genome-wide chromosome contact data within a probabilistic framework to produce accurate genome assemblies.

    • Hervé Marie-Nelly
    • , Martial Marbouty
    •  & Romain Koszul
  • Article |

    microRNAs are short non-coding RNAs that post-transcriptionally regulate gene expression for which the identification of promoter and primary transcripts (pri-miRNAs) has been difficult. Here the authors describe microTSS, an algorithm that supports the precise identification of intergenic pri-miRNA transcription start sites.

    • Georgios Georgakilas
    • , Ioannis S. Vlachos
    •  & Artemis G. Hatzigeorgiou
  • Article |

    Metabolites produced by the gut microbiota can potentially affect our physiology. Here, the authors present a metabolomics strategy that models microbiota metabolism as a reaction network and uses pathway analysis to facilitate identification and characterization of microbial metabolites.

    • Gautham V. Sridharan
    • , Kyungoh Choi
    •  & Arul Jayaraman
  • Article |

    A large portion of the transcribed genome—such as introns and noncoding RNAs—is believed to not be translated into protein products. Here, the authors provide evidence for the existence of regulated peptide products that are translated from transcribed sequences generally characterized as noncoding.

    • Sudhakaran Prabakaran
    • , Martin Hemberg
    •  & Judith A. Steen
  • Article
    | Open Access

    CpG islands are high GC content DNA elements that surround the majority of transcriptional start sites in eukaryotes. Here, the authors analyse over 200 genomic data sets to provide new insight into global CpG islands-dependent regulatory mechanisms in differentiated and pluripotent stem cells.

    • Samuel Beck
    • , Bum-Kyu Lee
    •  & Jonghwan Kim
  • Article |

    Despite our growing understanding of their complexity, different types of RNA are still classified using technical rather than functional criteria. Andersson et al.show that categorization of RNAs based on stability and direction of transcription is an effective means of functional classification.

    • Robin Andersson
    • , Peter Refsing Andersen
    •  & Albin Sandelin
  • Article |

    The development of software tools to analyse large mass spectrometry data sets lags behind the increase in diversity of the data. Here the authors develop MS-GF+, a database search tool that outperforms other popular tools in identifying peptides from a variety of data sets.

    • Sangtae Kim
    •  & Pavel A. Pevzner
  • Article
    | Open Access

    No experimental evidence exists for intra-helical motion of DNA at the μs timescale, which has been attributed to technical difficulties in observing motion in this time range. Here, the authors demonstrate, using extensive molecular dynamics simulations and experimental analysis, that such motion is effectively absent from a B-DNA duplex.

    • Rodrigo Galindo-Murillo
    • , Daniel R. Roe
    •  & Thomas E. Cheatham III
  • Article
    | Open Access

    Metastasizing tumour cells undergo epithelial-to-mesenchymal transition. Using both bioinformatic and in vivo approaches, Chanrion et al.identify combined Notch activation and p53 inactivation as a potent inducer of this transition, and apply this to create a highly metastatic tumour model in mice.

    • Maia Chanrion
    • , Inna Kuperstein
    •  & Sylvie Robine
  • Article
    | Open Access

    Linear mixed models (LMMs) provide a powerful method for studying genotype–phenotype associations. Here the authors present a LMM application that estimates an optimal transformation from observed data and increases the accuracy of heritability estimation and phenotype prediction.

    • Nicolo Fusi
    • , Christoph Lippert
    •  & Oliver Stegle
  • Article |

    The functional consequences of naturally occurring variation in ribosomal DNA (rDNA) copy number are poorly understood. Here the authors estimate rDNA copy number and mitochondrial DNA abundance in humans using whole-genome short-read DNA sequencing and characterize global regulatory mechanisms for cellular homeostasis and adaptation.

    • John G. Gibbons
    • , Alan T. Branco
    •  & Bernardo Lemos
  • Article
    | Open Access

    Common methods to detect adenosine-to-inosine RNA editing sites rely on mapping short RNA reads to the genome while allowing only a limited number of mismatches. Here, Porath et al. present a novel RNA-seq based approach to identify hyper-edited reads that significantly expands the RNA editome.

    • Hagit T. Porath
    • , Shai Carmi
    •  & Erez Y. Levanon
  • Article
    | Open Access

    Metagenomic studies of microbial communities often report DNA sequences from unidentified viruses. Here, Dutilh et al. analyse metagenomic data to reveal the complete genome of an abundant, ubiquitous virus from human faeces, and predict that the virus infects bacteria of the Bacteroides group.

    • Bas E. Dutilh
    • , Noriko Cassman
    •  & Robert A. Edwards
  • Article
    | Open Access

    Intestinal microbes can have important effects on our health. Here, the authors analyse the gut microbiota composition in 1,000 western adults and find that certain bacteria are either abundant or nearly absent, and that these alternative states are associated with ageing and overweight.

    • Leo Lahti
    • , Jarkko Salojärvi
    •  & Willem M. de Vos
  • Article
    | Open Access

    Some viruses are spherical particles in which protein components are organized with well-defined icosahedral and local symmetries. Here, Gipson et al. describe a unique arrangement of proteins, breaking all expected local symmetries, in particles of a marine bacterial virus.

    • Preeti Gipson
    • , Matthew L. Baker
    •  & Wah Chiu
  • Article
    | Open Access

    Dyslipidemia and obesity have a high prevalence in populations with Amerindian backgrounds, such as Mexican–Americans. Here, the authors design an approach to identify Amerindian risk genes in Mexicans and identify five genomic loci, which include RORA and SIK3that may contribute to the risk of dyslipidemia and obesity in Amerindian populations.

    • Arthur Ko
    • , Rita M. Cantor
    •  & Päivi Pajukanta
  • Article |

    Analyses of genome and transcriptome data are unable to accurately predict protein levels and function in tumour samples. Here, the authors carry out a comprehensive protein analysis in 3,467 samples from the cancer genome atlas, providing a resource to study the prognostic and therapeutic potential of tumour proteins.

    • Rehan Akbani
    • , Patrick Kwok Shing Ng
    •  & Gordon B. Mills
  • Article |

    Model-based part design is a key step in synthetic biology. Here, the authors report a method for tuning nucleosome architecture in order to strengthen native promoters and facilitate synthetic promoter design in yeast.

    • Kathleen A. Curran
    • , Nathan C. Crook
    •  & Hal S. Alper
  • Article
    | Open Access

    Misfolded protein accumulation is a hallmark of many neurodegenerative diseases. Here Budrikis et al. model protein aggregation in the endoplasmic reticulum and show that it is the result of a non-equilibrium phase transition caused by tipping the balance from the rates of protein production to degradation.

    • Zoe Budrikis
    • , Giulio Costantini
    •  & Stefano Zapperi