Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
and JavaScript.
A two-stage learning algorithm is proposed to directly uncover the symbolic representation of rules for skill acquisition from large-scale training log data.
CASTLE, a deep learning approach, extracts interpretable discrete representations from single-cell chromatin accessibility data, enabling accurate cell type identification, effective data integration, and quantitative insights into gene regulatory mechanisms.
This paper introduces an algorithm to uncover laws of skill acquisition from naturally occurring data. By combining deep learning and symbolic regression, it accurately identifies cognitive states and extracts algebraic equations.
Discovering improved semiconductor materials is essential for optimal device fabrication. In this Perspective, data-driven computational frameworks for semiconductor discovery and device development are discussed, including the challenges and opportunities moving forward.
We present a method to alleviate re-identification risks behind sharing haplotype reference panels for imputation. In an anonymized reference panel, one might try to infer the genomes’ phenotypes to re-identify their owner. Our method protects against such attack by shuffling the reference panels genomes while maintaining imputation accuracy.
MISATO, a dataset for structure-based drug discovery combines quantum mechanics property data and molecular dynamics simulations on ~20,000 protein–ligand structures, substantially extends the amount of data available to the community and holds potential for advancing work in drug discovery.
The authors develop the tool RESHAPE to share reference panels in a safer way. The genome–phenome links in reference panels can generate re-identification threats and RESHAPE breaks these links by shuffling haplotypes while preserving imputation accuracy.
MISATO is a database for structure-based drug discovery that combines quantum mechanics data with molecular dynamics simulations on ~20,000 protein–ligand structures. The artificial intelligence models included provide an easy entry point for the machine learning and drug discovery communities.
A method based on a vector-quantized variational autoencoder, called CASTLE, can interpretably extract discrete latent embeddings and quantitatively generate the cell-type-specific feature spectrum for single-cell chromatin accessibility sequencing data.
Multicellular modeling is increasingly being used to understand biological systems. SimuCell3D is a tool that allows mechanically realistic simulations, using the deformable cell model, to be developed and run.
We highlight the vibrant discussions on quantum computing and quantum algorithms that took place at the 2024 American Physical Society March Meeting and invite submissions that notably drive the field of quantum information science forward.
Cooperation is crucial for human prosperity, and population structure fosters it through pairwise interactions and coordinated behavior in larger groups. A recent study explores the evolution of behavioral strategies in higher-order population structures, including pairwise and multi-way interactions to reveal that higher-order interactions promote cooperation across networks, especially when they are formed by conjoined communities.
SANGO efficiently removed batch effects between the query and reference single-cell ATAC signals through the underlying genome sequences, to enable cell type assignment according to the reference data. The method achieved superior performance on diverse datasets and could detect unknown tumor cells, providing valuable functional biological signals.
Approaches are needed to accelerate the discovery of transition metal complexes (TMCs), which is challenging owing to their vast chemical space. A large dataset of diverse ligands is now introduced and leveraged in a multiobjective genetic algorithm that enables the efficient optimization of TMCs in chemical spaces containing billions of them.
Cooperation is not merely a dyadic phenomenon, it also includes multi-way social interactions. A mathematical framework is developed to study how the structure of higher-order interactions influences cooperative behavior.
Wildfires have increased in frequency and intensity due to climate change and have had severe impacts on the built environment worldwide. Moving forward, models should take inspiration from epidemic network modeling to predict damage to individual buildings and understand the impact of different mitigations on the community vulnerability in a network setting.
This study introduces SANGO, a method for accurate single-cell annotation leveraging genomic sequences around accessibility peaks within single-cell ATAC sequencing data. SANGO consistently outperforms existing methods across diverse datasets for identification of cell type and detection of unknown tumor cells. SANGO enables the discovery of cell-type-specific functional insights through expression enrichment, cis-regulatory chromatin interactions and motif enrichment analyses.
A fast and versatile three-dimensional cell-based model, called SimuCell3D, is developed for high-resolution simulations of large and complex biological tissues. SimuCell3D natively integrates intra- and extracellular entities, including extracellular matrix, nuclei and polarized cell surfaces.
A method is developed for the directional optimization of multiple properties without prior knowledge on their nature. Using a large ligand dataset, diverse metal complexes are found along the Pareto front of vast chemical spaces.