Abstract
Visualization is an integral aspect of genomics data analysis. Algorithmic-statistical analysis and interactive visualization are most effective when used iteratively. Epiviz (http://epiviz.cbcb.umd.edu/), a web-based genome browser, and the Epivizr Bioconductor package allow interactive, extensible and reproducible visualization within a state-of-the-art data-analysis platform.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Bostock, M., Ogievetsky, V. & Heer, J. IEEE Trans. Vis. Comput. Graph. 17, 2301–2309 (2011).
Stolte, C., Tang, D. & Hanrahan, P. Commun. ACM 51, 75–84 (2008).
Lister, R. et al. Cell 133, 523–536 (2008).
Zhou, X. et al. Nat. Methods 8, 989–990 (2011).
Gentleman, R.C. et al. Genome Biol. 5, R80 (2004).
Yi, J.S., Kang, Y.A., Stasko, J. & Jacko, J. IEEE Trans. Vis. Comput. Graph. 13, 1224–1231 (2007).
McCall, M.N., Uppal, K., Jaffee, H.A., Zilliox, M.J. & Irizarry, R.A. Nucleic Acids Res. 39, D1011–D1015 (2011).
Karolchik, D. et al. Nucleic Acids Res. 36, D773–D779 (2008).
Hubbard, T.J.P. et al. Nucleic Acids Res. 37, D690–D697 (2009).
Durinck, S. et al. Bioinformatics 21, 3439–3440 (2005).
Anders, S. & Huber, W. Genome Biol. 11, R106 (2010).
Lawrence, M. et al. PLoS Comput. Biol. 9, e1003118 (2013).
Hansen, K.D. et al. Nat. Genet. 43, 768–775 (2011).
Paulson, J.N., Stine, O.C., Bravo, H.C. & Pop, M. Nat. Methods 10, 1200–1202 (2013).
Lister, R. et al. Nature 462, 315–322 (2009).
Aryee, M.J. et al. Bioinformatics 30, 1363–1369 (2014).
Cancer Genome Atlas Network. Nature 487, 330–337 (2012).
Goecks, J. et al. BMC Genomics 14, 397 (2013).
Miller, C.A., Anthony, J., Meyer, M.M. & Marth, G. Bioinformatics 29, 381–383 (2013).
Acknowledgements
We thank the Bioconductor core team and members of the Bioinformatics and Computational Biology Department of Genentech Research and Early Development for helpful suggestions and comments. This work was partially supported by US National Institutes of Health grants R01 HG006102 to H.C.B. and F.C., R01 HG005220 to H.C.B., an undergraduate internship sponsorship from the Illumina Corporation to L.S., and support from Genentech.
Author information
Authors and Affiliations
Contributions
H.C.B. conceived the project. F.C. and H.C.B. designed the project. F.C., L.S., N.G. and H.C.B. wrote the Epiviz and Epivizr software. F.C., L.S. and H.C.B. analyzed data. H.C.B. and F.C. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Integrated supplementary information
Supplementary Figure 1 The Epiviz architecture.
Presentation, visualizations and data representations are distinct. This allows Epiviz to reuse visualizations regardless of data source (Epiviz sever, or WebSocket connection through Epivizr). Data providers and visualizations can be plugged in on the fly using Epiviz’ plugin API.
Supplementary Figure 2 Chart load times with and without cache.
Average comparison of time taken by ‘add chart’ and ‘navigate’ operations per 1,000 data objects with and without using the predictive cache in the Epiviz data management tier.
Supplementary Figure 3 Chart draw times for different parameter values.
A comparison of draw times when varying specific chart parameters for Scatter Plot and Blocks Track. The parameter for scatter plot is “circle ratio” which splits the chart in a grid of squares of width equal to this parameter, and draws at most one circle in each cell of the grid. All data objects that overlap this point are mapped to the single circle displayed. The parameter for block tracks is the minimum distance in screen pixels between two blocks before they are merged into one display object. Again, all data objects merged are mapped to the single display object. The data to visual object mapping is used for brushing, tooltips and other interactivity actions.
Supplementary Figure 4 A comparison of draw times when varying specific chart parameters for Heatmap Plot and Lines Track.
The parameter for heatmap is the maximum number of columns to be drawn by the heat map before multiple columns are averaged into one. All data objects that are merged are mapped to the single column displayed. The data to visual object mapping is used for brushing, tooltips and other interactivity actions. The parameter for line tracks is the maximum number of points drawn. If the number of data points is greater than this parameter, the required number of points are sampled uniformly.
Supplementary Figure 5 Gene expression analysis of colon cancer methylation loss regions with Epiviz.
A) We used the Epiviz computed columns feature to define an MA plot of colon cancer expression in the MMP gene family region (Figure 1). B) Gene expression barcode data for the same region shows similar expression patterns across multiple cancer types. Both of these plots were saved as pdfs directly from Epiviz.
Supplementary Figure 6 Comparison of hypomethylation block finding methods.
We compare hypomethylation blocks inferred using BSmooth on whole-genome bisuflite sequencing with blocks inferred with minfi on Illumina HumanMethylation450k beadarray data. In this plot we show the regions found along with smoothed bp-level mean methylation (for BSmooth) and probe-level mean methylation (aggregated over CpG clusters for minfi) data. The block-finding method used in minfi ignores methylation measurements in CpG islands by design, so that long blocks of methylation change would span across CpG islands. BSmooth does not use this design so blocks are frequently punctuated by CpG islands. We see this effect in this specific integrative visualization using Epivizr, where the only difference hypomethylation blocks is the punctuation at the CpG island for the BSmooth block.
Supplementary Figure 7 The spatial distribution of genes in correlation with hypomethylated blocks.
Visualizing genes and corresponding exons side by side with methylation levels in normal and cancer tissues using Epiviz confirms that hypo-methylated blocks are gene-poor.
Supplementary Figure 8 Exon-level expression in differentially methylated regions.
The track-based visualization of exon-level expression data, side by side with a view of DNA methylation and one of differentially methylated blocks reveals that at low resolution, exons tend to be silenced within blocks, and highly expressed outside.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–8 and Supplementary Note (PDF 2136 kb)
Rights and permissions
About this article
Cite this article
Chelaru, F., Smith, L., Goldstein, N. et al. Epiviz: interactive visual analytics for functional genomics data. Nat Methods 11, 938–940 (2014). https://doi.org/10.1038/nmeth.3038
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.3038
This article is cited by
-
gEAR: Gene Expression Analysis Resource portal for community-driven, multi-omic data exploration
Nature Methods (2021)
-
Enhanced JBrowse plugins for epigenomics data visualization
BMC Bioinformatics (2018)
-
C-State: an interactive web app for simultaneous multi-gene visualization and comparative epigenetic pattern search
BMC Bioinformatics (2017)
-
VisRseq: R-based visual framework for analysis of sequencing data
BMC Bioinformatics (2015)
-
Epiviz: a view inside the design of an integrated visual analysis software for genomics
BMC Bioinformatics (2015)