Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
and JavaScript.
Natsuhiko Kumasaka, Andrew Knights and Daniel Gaffney develop a new statistical approach for association mapping that models genetic effects and accounts for biases in sequencing data in a single probabilistic framework. They apply this method to generate a map of chromatin accessibility QTLs and show how it can be used to fine-map regulatory variants and link distal regulatory elements with genes.
Matthew Stephens and colleagues present a method for visualizing geographic patterns in genetic population structure. They apply this method to data from elephant, human and Arabidopsis thaliana populations and illustrate its potential to highlight barriers and corridors to gene flow.
Kornelia Polyak, Franziska Michor and colleagues report a novel method, STAR-FISH, for combined in situ single-cell analysis of point mutations and copy number alterations in archived tissue samples. They apply STAR-FISH to clinically relevant PIK3CA mutations and HER2 amplifications and observe associations between intratumoral diversity and clinical outcome.
Hae Kyung Im and colleagues report a method for predicting gene expression perturbations from genotype data after training on reference transcriptome data sets. Association of predicted gene expression with disease traits identifies known and new candidate disease genes.
Michael Beer and colleagues report a metric based on a regulatory region annotation method, gkm-SVM, and use this to predict the effects of regulatory variants from sequencing and DNase I–hypersensitive site data. They apply their method to autoimmune disease GWAS data and report several new predictions for causal SNPs.
Mary Fortune, Chris Wallace and colleagues report a new method that allows statistical colocalization of genetic risk variants for related autoimmune diseases in the context of common controls. They apply their method to type 1 diabetes, rheumatoid arthritis, celiac disease and multiple sclerosis and highlight the complexity in genetic variation underlying these distinct autoimmune diseases.
Gil McVean, Alexander Dilthey and colleagues present a graphical model-based method for accurate genomic assembly that uses the diversity present in multiple reference sequences, as represented by a population reference graph. The method is applied to simulated and empirical data from the human MHC region to demonstrate the improved accuracy of genomic inference.
Xiaoming Liu and Yun-Xin Fu present a model-flexible method for inferring changes in population size over time on the basis of the composite likelihood of SNP frequencies. They apply the method to 1000 Genomes Project data to infer changes in human population size on the timescale of 10,000 to 200,000 years ago.
John Storey and colleagues report a statistical test for genetic association for use with data from structured populations. They demonstrate the use of this test on both simulated data and empirical data from the Northern Finland Birth Cohort, from which they identify significant loci not detected by other methods.
Benjamin Neale and colleagues report the LD Score regression method, used to distinguish the relative contributions of confounding bias and polygenicity to inflated test statistics in GWAS. They apply their method to summary statistics from GWAS for over 30 phenotypes, confirm that polygenicity accounts for the majority of inflation in test statistics and demonstrate use of this method as a correction factor.
Alkes Price, Po-Ru Loh and colleagues report the BOLT-LMM method for mixed-model association. They apply their method to 9 quantitative traits in 23,294 samples and demonstrate that it provides improvements in computational efficiency as well as gains in power that increase with the size of the cohort, making it useful for the analysis of large cohorts.
Steven McCarroll and colleagues report an analysis of multiallelic copy number variants (mCNVs). They characterize mCNVs in 849 whole-genome sequences from the 1000 Genomes Project and find that mCNVs give rise to most gene dosage variation in humans.
Adam Siepel and colleagues develop a statistical method, fitCons, which combines comparative and functional genomic data to estimate the probability that a point mutation will influence fitness. They generate fitCons scores for three human cell types from ENCODE data sets and demonstrate improved prediction power for cis regulatory elements in comparison to conventional conservation-based scores.
Noah Zaitlen, Alkes Price and colleagues report a new approach to estimate the narrow-sense heritability of complex traits from unrelated individuals in a recently admixed population. They apply this approach to estimate the heritability for 13 quantitative or case-control phenotypes in 21,497 African-American individuals and suggest the inflation of family-based h2 estimates.
Peter Sorger, Mohammed AlQuraishi and colleagues present a statistical framework for integrating biophysical and genomic data to predict the consequences of cancer-related mutations on protein-protein interactions. They apply their framework to the SH2 phosphoprotein network using publicly available data from The Cancer Genome Atlas.
David Jaffe and colleagues report a new algorithm, DISCOVAR, for variant calling and de novo genome assembly. They test the algorithm on a new reference variant call set and demonstrate improved variant calling, particularly in challenging regions of the genome.
Frank Steemers and colleagues report a new method for genome-wide haplotyping based on contiguity-preserving transposition and combinatorial indexing. They apply this method to assemble over 95% of the heterozygous variants in a human genome into long, accurate haplotype blocks.
Gerton Lunter and colleagues report Platypus software, which combines a haplotype-based multi-sample variant caller with local sequence assembly in a Bayesian statistical framework. They demonstrate applications to exome and whole-genome data sets, to the identification de novo mutations in parent-offspring trios and to the genotyping of HLA loci.
Stephan Schiffels and Richard Durbin report the multiple sequentially Markovian coalescent (MSMC) method for inferring human population size and separation history from multiple genome sequences. Their application to the whole-genome sequences of 34 individuals from 9 populations allows inferences about events in human population history as recent as 2,000 years ago.
Gonçalo Abecasis, Chaolong Wang and colleagues report a new statistical method, implemented in a publicly available software program LASER, to estimate an individual's genetic ancestry directly from off-target sequence reads from targeted sequencing experiments, making use of a reference panel. Their simulations and testing on real data sets show accurate inference of worldwide continental ancestry with whole-genome shotgun coverage as low as 0.001× and of fine-scale ancestry within Europe with coverage as low as 0.1×.