Protein analysis articles within Nature Communications

Featured

  • Article
    | Open Access

    Reynolds and colleagues examine a biochemically-mediated epistatic interaction between metabolic enzymes involved in folate metabolism and show that biochemical coupling shapes the range of enzyme activities sufficient to rescue cell growth.

    • Thuy N. Nguyen
    • , Christine Ingle
    •  & Kimberly A. Reynolds
  • Article
    | Open Access

    Mass spectrometry-based proteomics faces the challenge of processing vast data amounts. Here, the authors introduce AlphaPept, an open-source, Python-based framework that offers high speed analysis and easy integration for large-scale proteome analysis.

    • Maximilian T. Strauss
    • , Isabell Bludau
    •  & Matthias Mann
  • Article
    | Open Access

    Gene variants can affect folding and stability of the encoded protein. Here, the authors apply deep mutational scanning to provide genotype-phenotype information for 99% of the possible PRKN variants and reveal mechanistic details on how some variants cause loss-of-function and Parkinsons disease.

    • Lene Clausen
    • , Vasileios Voutsinos
    •  & Rasmus Hartmann-Petersen
  • Article
    | Open Access

    Energetic local frustration in proteins may have been positively selected by evolution when related to function such as ligand binding, allostery and other. Here the authors present a methodology to analyze local frustration patterns within protein families and superfamilies.

    • Maria I. Freiberger
    • , Victoria Ruiz-Serra
    •  & Alfonso Valencia
  • Article
    | Open Access

    Many fusion oncoproteins (FOs) form condensates, some form in the nucleus and regulate gene expression while others form in the cytoplasm and promote cell signaling. In this work, the authors report the analysis of physicochemical features to enable prediction of FO condensation behavior.

    • Swarnendu Tripathi
    • , Hazheen K. Shirnekhi
    •  & Richard W. Kriwacki
  • Article
    | Open Access

    Understanding protein dynamics is a complex scientific challenge. Here, authors construct coarse-grained molecular potentials using artificial neural networks, significantly accelerating protein dynamics simulations while preserving their thermodynamics.

    • Maciej Majewski
    • , Adrià Pérez
    •  & Gianni De Fabritiis
  • Article
    | Open Access

    The relevance of non-coding somatic mutations in cancer remains elusive. Here, the combination of mass spectrometry-based proteomics and whole genome sequencing data across multiple cancer types helps to assess the effects of somatic structural variant breakpoint patterns on protein expression of nearby genes.

    • Fengju Chen
    • , Yiqun Zhang
    •  & Chad J. Creighton
  • Article
    | Open Access

    Metabolic labeling is often used to measure protein turnover. Here the authors show that for interconvertible protein species like phosphoforms metabolic labeling does not provide information on turnover differences, but that the relative order of modification can determine the observed dynamics.

    • Henrik M. Hammarén
    • , Eva-Maria Geissen
    •  & Mikhail M. Savitski
  • Article
    | Open Access

    Protein language models taking multiple sequence alignments as inputs capture protein structure and mutational effects. Here, the authors show that these models also encode phylogenetic relationships, and can disentangle correlations due to structural constraints from those due to phylogeny.

    • Umberto Lupo
    • , Damiano Sgarbossa
    •  & Anne-Florence Bitbol
  • Article
    | Open Access

    Most known pathogenic mutations occur in protein-coding regions of DNA and change the way proteins are made. Here the authors analyse the locations of thousands of human disease mutations and their predicted effects on protein structure and show that,while loss-of-function mutations tend to be highly disruptive, non-loss-of-function mutations are in general much milder at a protein structural level.

    • Lukas Gerasimavicius
    • , Benjamin J. Livesey
    •  & Joseph A. Marsh
  • Article
    | Open Access

    Pan-cancer proteomics analysis enables the analysis of protein expression across multiple cancer types. Here, the authors compare proteomics from 14 cancer types and show 11 distinct subtypes across multiple cancer types. Proteome data could link higher pathway activity levels with somatic alteration of specific genes in the pathway.

    • Yiqun Zhang
    • , Fengju Chen
    •  & Chad J. Creighton
  • Article
    | Open Access

    Glycosyltransferases (GT) are proteins that display extensive sequence and functional variation on a subset of 3D folds. Here, the authors use interpretable deep learning to predict 3D folds from sequence without the need for sequence alignment, which also enables the prediction of GTs with new folds.

    • Rahil Taujale
    • , Zhongliang Zhou
    •  & Natarajan Kannan
  • Article
    | Open Access

    The authors present flDPnn, a computational tool for disorder and disorder function predictions from protein sequences. flDPnn was assessed with the data from the “Critical Assessment of Protein Intrinsic Disorder Prediction” experiment and on an independent and low-similarity test dataset, which show that flDPnn offers accurate predictions of disorder, fully disordered proteins and four common disorder functions.

    • Gang Hu
    • , Akila Katuwawala
    •  & Lukasz Kurgan
  • Article
    | Open Access

    Our understanding of the residue-level details of protein interactions remains incomplete. Here, the authors show sequence coevolution can be used to infer interacting proteins with residue-level details, including predicting 467 interactions de novo in the Escherichia coli cell envelope proteome.

    • Anna G. Green
    • , Hadeer Elhabashy
    •  & Debora S. Marks
  • Article
    | Open Access

    Identifying variants capable of causing genetic disease is challenging. The authors use semisupervised learning to predict pathogenic missense variants and their impacts on protein structure and function, enabling a molecular mechanism-driven approach to studying different types of human disease.

    • Vikas Pejaver
    • , Jorge Urresti
    •  & Predrag Radivojac
  • Article
    | Open Access

    Dpr (Defective proboscis extension response) and DIP (Dpr Interacting Proteins) are immunoglobulin-like cell-cell adhesion proteins that form highly specific pairwise interactions, which control synaptic connectivity during Drosophila development. Here, the authors combine a computational approach with binding affinity measurements and find that DIP/Dpr binding specificity is controlled by negative constraints that interfere with non-cognate binding.

    • Alina P. Sergeeva
    • , Phinikoula S. Katsamba
    •  & Barry Honig
  • Article
    | Open Access

    Dimensionality reduction is often used to visualize expression profiling data in order to find relationships among cells. Here, the authors use Uniform Manifold Approximation and Projection (UMAP) on published expression data of gene deletions of S. cerevisiae to find novel protein interactions.

    • Michael W. Dorrity
    • , Lauren M. Saunders
    •  & Cole Trapnell
  • Article
    | Open Access

    Membrane proteins have been implicated in cancers, but studying the downstream effects of their perturbation remains challenging. Here, the authors map the membrane protein-regulated network of 15 cancers, a resource for prognostic biomarker development and druggable target identification.

    • Chun-Yu Lin
    • , Chia-Hwa Lee
    •  & Jinn-Moon Yang
  • Article
    | Open Access

    The directions of most human protein-protein interactions (PPIs) remain unknown. Here, the authors use cancer genomic and drug response data to infer the direction of signal flow in the human PPI network and show that the directed network improves drug target and cancer driver gene prioritization.

    • Dana Silverbush
    •  & Roded Sharan
  • Article
    | Open Access

    Proteoforms arise as protein isoforms or as protein haplotypes, which are the result of genetic variation. Here, the authors develop Haplosaurus, a database that computes protein haplotypes genome-wide from existing genotype data and analyse protein haplotype variability in the 1000 Genomes dataset.

    • William Spooner
    • , William McLaren
    •  & Catherine Chaillan Huntington
  • Article
    | Open Access

    Billions of metagenomic and genomic sequences fill up public datasets, which makes similarity clustering an important and time-critical analysis step. Here, the authors develop Linclust, an algorithm with linear time complexity that can cluster over a billion sequences within hours on a single server.

    • Martin Steinegger
    •  & Johannes Söding
  • Article
    | Open Access

    Mass cytometry is a powerful method of single cell analysis, but potential confounding effects of cell cycle and cell volume are not taken into account. Here the authors present a combined experimental and computational method to correct for these effects and reveal features of TNFα stimulation that are otherwise masked.

    • Maria Anna Rapsomaniki
    • , Xiao-Kang Lun
    •  & María Rodríguez Martínez
  • Article
    | Open Access

    Identification of the function of proteins is difficult when there are no structurally or biochemically characterized homologs. Here, the authors present an approach that allows the prediction of nucleic-acid binding proteins based on sequence alone, and they are able to experimentally validate their method.

    • Sapir Peled
    • , Olga Leiderman
    •  & Yanay Ofran
  • Article
    | Open Access

    The comprehensive bioanalysis of proteins usually requires multi-step surface and mobile phase measurements. Here, the authors use chips functionalized with dynamically actuated nanolevers—DNA strands that can be switched in an electric field—to obtain motional dynamic measurements of proteins on a chip.

    • Andreas Langer
    • , Paul A. Hampel
    •  & Ulrich Rant
  • Article |

    Protein microarrays are useful both in basic research and also in disease monitoring and diagnosis, but their dynamic range is limited. By using plasmonic gold substrates with near-infrared fluorescent enhancement, Tabakman et al. demonstrate a multiplexed protein array with improved detection limits and dynamic range.

    • Scott M. Tabakman
    • , Lana Lau
    •  & Hongjie Dai