Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
and JavaScript.
Genetic variation from many thousands of individuals is needed to confidently detect regions depleted of variation. This cover image draws an analogy between genetic variation and rain falling on a sidewalk. We think of rainfall as being random. But what if it isn’t? We only begin to see non-random patterns with many raindrops.
This perspective presents a primer on deep learning applications for the genomics field. It includes a general guide for how to use deep learning and describes the current tools and resources that are available to the community.
The long noncoding RNA SCHLAP1 has been reported to act by depleting the SWI/SNF complex from genomic sites, but new data show that SWI/SNF remains localized to chromatin in the presence of SCHLAP1, suggesting that SCHLAP1 may act independently of SWI/SNF.
Assembly of a pan-genome from 910 humans of African descent identifies 296.5 Mb of novel DNA mapping to 125,715 distinct contigs. This African pan-genome contains ~10% more DNA than the current human reference genome.
Analysis of mutational asymmetry with respect to the direction of replication and transcription suggests that error-prone damage bypass on the lagging strand has a major role in human germline and cancer mutations.
Individuals with biscuspid aortic valve and ascending aortic aneurysm show enrichment of rare variants in ROBO4. Functional studies suggest that ROBO4 mutations disrupt endothelial cell performance and contribute to pathological remodeling of aortic tissues.
Analysis of blood pressure data from the Million Veteran Program trans-ethnic cohort identifies common and rare variants, and genetically predicted gene expression across multiple tissues associated with systolic, diastolic and pulse pressure in over 775,000 individuals.
A genome-wide association study for attention deficit/hyperactivity disorder (ADHD) identifies 12 loci and implicates neurodevelopmental pathways and conserved regions of the genome as being involved in underlying ADHD biology.
Genome-wide association analyses based on whole-genome sequencing and imputation identify 40 new risk variants for colorectal cancer, including a strongly protective low-frequency variant at CHD1 and loci implicating signaling and immune function in disease etiology.
This study leverages coding variation observed among 123,136 individuals to create a detailed map of constrained coding regions in the human genome. This map may help identify critical regions within genes that, when mutated, cause embryonic lethality or severe developmental phenotypes.
Gain-of-function mutations altering the PWWP domain of DNMT3A are identified as a new cause of microcephalic dwarfism. These mutations abrogate DNMT3A binding to H3K36me2 and H3K36me3 and lead to aberrant DNA methylation of Polycomb-marked regions.
Analysis of ~10,000 cases of developmental delay and autism identifies 253 candidate neurodevelopmental disease genes. Network analysis highlights cell-specific enrichments of disease-related genes in the D1+ and D2+ spiny neurons of the striatum.
Genome-wide analysis of copy number variants in 2,824 cases across the phenotypic spectrum of CAKUT sheds light on the genomic architecture of disease and identifies TBX6 as a driver for CAKUT subphenotypes in the 16p11.2 microdeletion syndrome.
A Bayesian hierarchical approach identifies over 15,000 causal regulatory interactions in the human genome using ATAC-seq data from 100 individuals. The majority of detected interactions were over distances of <20 kb, a range where 3C methods perform poorly.
This study shows that immune-related genes are primed for transcription by proximal lncRNAs. One such lncRNA, UMLILO, directs the WDR5–MLL1 complex to CXCL chemokine promoters, facilitating H3K4me3 deposition.
Integrated analysis of transcriptome, open chromatin region and chromatin conformation capture data from subjects with acute myeloid leukemia (AML) harboring defined transcription factor and signaling molecule alterations provide insights into the subtype-specific regulatory network in AML.
Comparative study of 81 genomes of parasitic and non-parasitic worms identifies gene family births and expanded gene families at key nodes in the phylogeny that are relevant to parasitism and proteins historically targeted for drug development.
Including patient-specific information about nearby somatic and germline alterations improves the accuracy of neoantigen prediction, potentially impacting cancer vaccine design.
StructLMM is a new method to identify genotype–environment interactions (G×E) that involve multiple exposures or environments. When applied to UK Biobank and eQTL data, StructLMM discovers new G×E signals.
Multivariate adaptive shrinkage (mash) is a method for estimating and testing multiple effects in multiple conditions. When applied to GTEx data, mash can be used to analyze sharing of eQTL effects by examining variation in effect sizes.