Computational biology and bioinformatics

Article
10 June 2021 | Open Access

Machine-learning predicts genomic determinants of meiosis-driven structural variation in a eukaryotic pathogen

Structural variation in genomes of the same species is frequent but what drives the rearrangements remains unclear. Machine-learning of rearrangement patterns among telomere-to-telomere assemblies can accurately identify regions of intrinsic DNA instability in a eukaryotic pathogen.

Thomas Badet
, Simone Fouché
& Daniel Croll

Article
10 June 2021 | Open Access

Rapid detection of identity-by-descent tracts for mega-scale datasets

Traditional methods to identify genomic regions identical-by-descent (IBD) do not scale well to biobank-level datasets. Here, the authors describe a new IBD algorithm, iLASH, which uses LocAlity-Sensitive Hashing to provide rapid IBD estimation when applied to the PAGE and UK Biobank datasets.

Ruhollah Shemirani
, Gillian M. Belbin
& José Luis Ambite

Article
10 June 2021 | Open Access

Cell segmentation-free inference of cell types from in situ transcriptomics data

Inaccurate cell segmentation has been the major problem for cell-type identification and tissue characterization of the in situ spatially resolved transcriptomics data. Here we show a robust cell segmentation-free computational framework (SSAM), for identifying cell types and tissue domains in 2D and 3D.

Jeongbin Park
, Wonyl Choi
& Naveed Ishaque

Article
10 June 2021 | Open Access

Hybrid AI-assistive diagnostic model permits rapid TBS classification of cervical liquid-based thin-layer cell smears

Technical advancements have significantly improved early diagnosis of cervical cancer, but accurate diagnosis is still difficult due to various practical factors. Here, the authors develop an artificial intelligence assistive diagnostic solution to improve cervical liquid-based thin-layer cell smear diagnosis according to clinical TBS criteria in a large multicenter study.

Xiaohui Zhu
, Xiaoming Li
& Yanqing Ding

Article
09 June 2021 | Open Access

R2DT is a framework for predicting and visualising RNA secondary structure using templates

Non-coding RNA function is poorly understood, partly due to the challenge of determining RNA secondary (2D) structure. Here, the authors present a framework for the reproducible prediction and visualization of the 2D structure of a wide array of RNAs, which enables linking RNA sequence to function.

Blake A. Sweeney
, David Hoksza
& Anton I. Petrov

Article
09 June 2021 | Open Access

Time trajectories in the transcriptomic response to exercise - a meta-analysis

Regular exercise promotes overall health and prevents non-communicable diseases, but the adaptation mechanisms are unclear. Here, the authors perform a meta-analysis to reveal time-specific patterns of the acute and long-term exercise response in human skeletal muscle, and identify sex- and age-specific changes.

David Amar
, Malene E. Lindholm
& Euan A. Ashley

Article
09 June 2021 | Open Access

Variant-specific inflation factors for assessing population stratification at the phenotypic variance level

Pooling participant-level genetic data into a single analysis can result in variance stratification, reducing statistical performance. Here, the authors develop variant-specific inflation factors to assess variance stratification and apply this to pooled individual-level data from whole genome sequencing.

Tamar Sofer
, Xiuwen Zheng
& Kenneth M. Rice

Article
08 June 2021 | Open Access

Deep learning connects DNA traces to transcription to reveal predictive features beyond enhancer–promoter contact

Recent advances in super-resolution microscopy have made it possible to measure chromatin 3D structure and transcription in thousands of single cells. Here, authors present a deep learning-based approach to characterise how chromatin structure relates to transcriptional state of individual cells and determine which structural features of chromatin regulation are important for gene expression state.

Aparna R. Rajpurkar
, Leslie J. Mateo
& Alistair N. Boettiger

Article
08 June 2021 | Open Access

Systematic benchmarking of tools for CpG methylation detection from nanopore sequencing

Several existing algorithms predict the methylation of DNA using Nanopore sequencing signals, but it is unclear how they compare in performance. Here, the authors benchmark the performance of several such tools, and propose METEORE, a consensus tool that improves prediction accuracy.

Zaka Wing-Sze Yuen
, Akanksha Srivastava
& Eduardo Eyras

Article
08 June 2021 | Open Access

MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification

Our understanding of human disease can be improved by integrating the abundance of high throughput biomedical data. Here, the authors use deep learning methods successfully used on images to integrate various types of omics data to improve patient classification and identify disease biomarkers.

Tongxin Wang
, Wei Shao
& Kun Huang

Article
08 June 2021 | Open Access

Large variation in anti-SARS-CoV-2 antibody prevalence among essential workers in Geneva, Switzerland

Many job sectors classified as ‘essential’ have continued operating with limited restrictions during the COVID-19 pandemic, potentially placing workers at higher risk of infection. Here, the authors show that seropositivity rates in workers vary widely across and between job sectors in Geneva, Switzerland.

Silvia Stringhini
, María-Eugenia Zaballa
& Idris Guessous

Article
08 June 2021 | Open Access

Optimizing vaccine allocation for COVID-19 vaccines shows the potential role of single-dose vaccination

Most COVID-19 vaccines require two doses but a single dose provides partial protection, so it is unclear how best to prioritize vaccine distribution in the context of limited supply. Here, the authors show that campaigns in which some age groups receive one dose while others receive both doses may be optimal.

Laura Matrajt
, Julia Eaton
& Holly Janes