Abstract
Identifying functionally relevant variants against the background of ubiquitous genetic variation is a major challenge in human genetics. For variants in protein-coding regions, our understanding of the genetic code and splicing allows us to identify likely candidates, but interpreting variants outside genic regions is more difficult. Here we present genome-wide annotation of variants (GWAVA), a tool that supports prioritization of noncoding variants by integrating various genomic and epigenomic annotations.
This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
Reduced Vrk2 expression is associated with higher risk of depression in humans and mediates depressive-like behaviors in mice
BMC Medicine Open Access 14 July 2023
-
Parallel functional annotation of cancer-associated missense mutations in histone methyltransferases
Scientific Reports Open Access 02 November 2022
-
Interpretation of the role of germline and somatic non-coding mutations in cancer: expression and chromatin conformation informed analysis
Clinical Epigenetics Open Access 28 September 2022
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout


References
Hindorff, L.A. et al. Proc. Natl. Acad. Sci. USA 106, 9362–9367 (2009).
Cooper, G.M. & Shendure, J. Nat. Rev. Genet. 12, 628–640 (2011).
The ENCODE Project Consortium. Nature 489, 57–74 (2012).
Bernstein, B.E. et al. Nat. Biotechnol. 28, 1045–1048 (2010).
Kumar, P., Henikoff, S. & Ng, P.C. Nat. Protoc. 4, 1073–1081 (2009).
Adzhubei, I.A. et al. Nat. Methods 7, 248–249 (2010).
Schmidt, D. et al. Science 328, 1036–1040 (2010).
Stenson, P.D. et al. Genome Med. 1, 13 (2009).
The 1000 Genomes Project Consortium. Nature 491, 56–65 (2012).
Breiman, L. Mach. Learn. 45, 5–32 (2001).
Maurano, M.T. et al. Science 337, 1190–1195 (2012).
Forbes, S.A. et al. Nucleic Acids Res. 39, D945–D950 (2011).
Schwarz, J.M., Rödelsperger, C., Schuelke, M. & Seelow, D. Nat. Methods 7, 575–576 (2010).
Flicek, P. et al. Nucleic Acids Res. 41, D48–D55 (2013).
Mathelier, A. et al. Nucleic Acids Res. 42, D142–D147 (2014).
Hoffman, M.M. et al. Nucleic Acids Res. 41, 827–841 (2013).
Hoffman, M.M. et al. Nat. Methods 9, 473–476 (2013).
Ernst, J. & Kellis, M. Nat. Methods 9, 215–216 (2012).
Davydov, E.V. et al. PLoS Comput. Biol. 6, e1001025 (2010).
Harrow, J. et al. Genome Res. 22, 1760–1774 (2012).
Church, D. et al. PLoS Biol. 9, e1001091 (2011).
Pedregosa, F. et al. J. Mach. Learn. Res. 12, 2825–2830 (2011).
McLaren, W.M. et al. Bioinformatics 26, 2069–2070 (2010).
Acknowledgements
G.R.S.R. is supported by European Molecular Biology Laboratory and the Sanger Institute via an EBI-Sanger Postdoctoral Fellowship. This work was funded by the Wellcome Trust (098051 and 095908) and by the European Molecular Biology Laboratory. The research leading to these results has received funding from the EU Seventh Framework Programme (FP7/2007-2013) under grant agreement (282510–BLUEPRINT).
Author information
Authors and Affiliations
Contributions
G.R.S.R. implemented the method, performed all analyses and drafted the manuscript. I.D. assisted with access to ENCODE data and suggested how to construct the control sets. E.Z. and P.F. contributed to the development of the method, manuscript writing and jointly directed the work.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Text and Figures
Supplementary Results and Supplementary Figures 1–8 (PDF 1163 kb)
Supplementary Table 1
Summary annotation results for the SORT1 locus. (XLS 23 kb)
Supplementary Table 2
Summary annotation results for the TNFAIP3 locus. (XLS 25 kb)
Supplementary Table 3
Summary annotation results for the TCF7L2 locus. (XLS 18 kb)
Supplementary Table 4
Results from the gene-by-gene analysis of variants from a single individual showing the rank of the spike-in variant for each gene, and the number of background variants, using the classifier trained on variants matched by distance to the nearest TSS. (XLS 18 kb)
Supplementary Table 5
Statistics for enrichment/depletion for all annotations analyzed; methods are described in Online Methods. (XLS 47 kb)
Supplementary Software
Python software implementing the annotation pipeline and classifier, and the variant training sets used to train the classifier (ZIP 2119 kb)
Rights and permissions
About this article
Cite this article
Ritchie, G., Dunham, I., Zeggini, E. et al. Functional annotation of noncoding sequence variants. Nat Methods 11, 294–296 (2014). https://doi.org/10.1038/nmeth.2832
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.2832
This article is cited by
-
Reduced Vrk2 expression is associated with higher risk of depression in humans and mediates depressive-like behaviors in mice
BMC Medicine (2023)
-
Unraveling the Drivers of Tumorigenesis in the Context of Evolution: Theoretical Models and Bioinformatics Tools
Journal of Molecular Evolution (2023)
-
A Combination of BRAF and EZH1/SPOP/ZNF148 Three-Gene Mutational Classifier Improves Benign Call Rate in Indeterminate Thyroid Nodules
Endocrine Pathology (2023)
-
Interpretation of the role of germline and somatic non-coding mutations in cancer: expression and chromatin conformation informed analysis
Clinical Epigenetics (2022)
-
TSABL: Trait Specific Annotation Based Locus predictor
BMC Genomics (2022)