Epiviz: interactive visual analytics for functional genomics data

Chelaru, Florin; Smith, Llewellyn; Goldstein, Naomi; Bravo, Héctor Corrada

doi:10.1038/nmeth.3038

Brief Communication
Published: 03 August 2014

Florin Chelaru^1,2, Llewellyn Smith^1,3,4, Naomi Goldstein^1,5 & Héctor Corrada Bravo^1,2

Nature Methods volume 11, pages 938–940 (2014)

Abstract

Visualization is an integral aspect of genomics data analysis. Algorithmic-statistical analysis and interactive visualization are most effective when used iteratively. Epiviz (http://epiviz.cbcb.umd.edu/), a web-based genome browser, and the Epivizr Bioconductor package allow interactive, extensible and reproducible visualization within a state-of-the-art data-analysis platform.

**Figure 1: Screenshot of visualization of chromosome 11 region of colon cancer methylome using Epiviz.**

**Figure 2: Screenshots for integrative analysis of Illumina HumanMethylation450 BeadChip data and exon-level RNA-seq data using Epivizr.**

Acknowledgements

We thank the Bioconductor core team and members of the Bioinformatics and Computational Biology Department of Genentech Research and Early Development for helpful suggestions and comments. This work was partially supported by US National Institutes of Health grants R01 HG006102 to H.C.B. and F.C., R01 HG005220 to H.C.B., an undergraduate internship sponsorship from the Illumina Corporation to L.S., and support from Genentech.

Author information

Authors and Affiliations

Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA
Florin Chelaru, Llewellyn Smith, Naomi Goldstein & Héctor Corrada Bravo
Department of Computer Science, University of Maryland, College Park, Maryland, USA
Florin Chelaru & Héctor Corrada Bravo
Department of Mathematics, Williams College, Williamstown, Massachusetts, USA
Llewellyn Smith
Department of Computer Science, Williams College, Williamstown, Massachusetts, USA
Llewellyn Smith
Department of Mechanical Engineering and Materials Science, Washington University in St. Louis, St. Louis, Missouri, USA
Naomi Goldstein

Contributions

H.C.B. conceived the project. F.C. and H.C.B. designed the project. F.C., L.S., N.G. and H.C.B. wrote the Epiviz and Epivizr software. F.C., L.S. and H.C.B. analyzed data. H.C.B. and F.C. wrote the manuscript.

Corresponding author

Correspondence to Héctor Corrada Bravo.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Integrated supplementary information

Supplementary Figure 1 The Epiviz architecture.

Presentation, visualizations and data representations are distinct. This allows Epiviz to reuse visualizations regardless of data source (the Epiviz server, or a WebSocket connection through Epivizr). Data providers and visualizations can be plugged in on the fly using the Epiviz plugin API.
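The separation described in this legend can be illustrated with a minimal Python sketch. It is not the Epiviz plugin API: the names `Workspace`, `register_provider`, `register_chart`, `make_memory_provider` and the two chart functions are hypothetical, and an in-memory table stands in for both the server and the WebSocket data sources. The point is only that a chart consumes rows without knowing where they came from.

```python
from typing import Callable, Dict, List, Tuple

Region = Tuple[str, int, int]   # (chromosome, start, end), half-open interval
Row = Tuple[int, int, float]    # (start, end, value)
Provider = Callable[[Region], List[Row]]
Chart = Callable[[List[Row]], str]


class Workspace:
    """Hypothetical registry: providers and charts are plugged in at run time."""

    def __init__(self) -> None:
        self._providers: Dict[str, Provider] = {}
        self._charts: Dict[str, Chart] = {}

    def register_provider(self, name: str, provider: Provider) -> None:
        self._providers[name] = provider

    def register_chart(self, name: str, chart: Chart) -> None:
        self._charts[name] = chart

    def render(self, chart: str, provider: str, region: Region) -> str:
        # The chart only sees rows; it never learns which source produced them.
        rows = self._providers[provider](region)
        return self._charts[chart](rows)


def make_memory_provider(rows_by_chrom: Dict[str, List[Row]]) -> Provider:
    """Assumed stand-in for any data source (server backend or live session)."""

    def provider(region: Region) -> List[Row]:
        chrom, start, end = region
        # Keep rows that overlap [start, end).
        return [r for r in rows_by_chrom.get(chrom, []) if r[0] < end and r[1] > start]

    return provider


def blocks_chart(rows: List[Row]) -> str:
    """Text stand-in for a blocks track."""
    return " ".join(f"[{s}-{e}]" for s, e, _ in rows)


def values_chart(rows: List[Row]) -> str:
    """Text stand-in for a value-based chart."""
    return ",".join(f"{v:g}" for _, _, v in rows)


workspace = Workspace()
workspace.register_provider(
    "server", make_memory_provider({"chr11": [(100, 200, 0.8), (500, 600, 0.2)]})
)
workspace.register_provider(
    "session", make_memory_provider({"chr11": [(150, 250, 0.5)]})
)
workspace.register_chart("blocks", blocks_chart)
workspace.register_chart("values", values_chart)
print(workspace.render("blocks", "session", ("chr11", 0, 300)))
```

Because the registry keys charts and providers independently, the same `blocks_chart` function renders rows from either source without modification.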

Supplementary Figure 2 Chart load times with and without cache.

Average comparison of time taken by ‘add chart’ and ‘navigate’ operations per 1,000 data objects with and without using the predictive cache in the Epiviz data management tier.
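The legend does not say how the predictive cache decides what to prefetch. The Python sketch below assumes one plausible strategy: on a miss, fetch the requested window plus one window-width of padding on each side, so that a subsequent pan is served locally. The class name `PredictiveCache`, the `fetch` callback and the padding rule are illustrative assumptions, not the Epiviz implementation.

```python
from typing import Callable, List, Tuple

Row = Tuple[int, int, float]  # (start, end, value)


class PredictiveCache:
    """Hypothetical cache that over-fetches around each requested window."""

    def __init__(self, fetch: Callable[[int, int], List[Row]]) -> None:
        self._fetch = fetch
        self._lo = 0   # cached interval is [lo, hi); empty while lo == hi
        self._hi = 0
        self._rows: List[Row] = []
        self.backend_calls = 0

    def get(self, start: int, end: int) -> List[Row]:
        covered = self._lo <= start and end <= self._hi
        if not covered:
            # Miss: assume the user will pan next, so pad by one window width.
            width = end - start
            self._lo, self._hi = max(0, start - width), end + width
            self._rows = self._fetch(self._lo, self._hi)
            self.backend_calls += 1
        # Serve only the rows that overlap the requested window.
        return [r for r in self._rows if r[0] < end and r[1] > start]


# Toy backend: 100 features of width 50, spaced every 100 bp.
FEATURES: List[Row] = [(i * 100, i * 100 + 50, float(i)) for i in range(100)]


def backend(lo: int, hi: int) -> List[Row]:
    return [r for r in FEATURES if r[0] < hi and r[1] > lo]


cache = PredictiveCache(backend)
first = cache.get(1000, 2000)    # miss: fetches [0, 3000)
panned = cache.get(1500, 2500)   # hit: inside the padded interval
print(cache.backend_calls)
```

Under this assumption, the first request costs one backend call and the pan to the right costs none, which is the kind of saving a "navigate" benchmark would measure.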

Supplementary Figure 3 Chart draw times for different parameter values.

A comparison of draw times when varying specific chart parameters for Scatter Plot and Blocks Track. The parameter for the scatter plot is “circle ratio”, which splits the chart into a grid of squares of width equal to this parameter and draws at most one circle in each cell of the grid. All data objects that fall within a cell are mapped to the single circle displayed there. The parameter for the blocks track is the minimum distance in screen pixels between two blocks before they are merged into one display object. Again, all merged data objects are mapped to the single display object. The data-to-visual-object mapping is used for brushing, tooltips and other interactivity actions.
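Both reductions in this legend can be sketched in a few lines of Python. The function names `bin_scatter` and `merge_blocks` are hypothetical, coordinates are assumed to be screen pixels, and the sketch keeps the list of original object indices behind each drawn object, as the legend requires for brushing and tooltips.

```python
from typing import Dict, List, Tuple


def bin_scatter(
    points: List[Tuple[float, float]], cell: float
) -> Dict[Tuple[int, int], List[int]]:
    """Map each grid cell of width `cell` to the indices of the points inside it.

    One circle would be drawn per key; the value lists the data objects it represents.
    """
    cells: Dict[Tuple[int, int], List[int]] = {}
    for i, (x, y) in enumerate(points):
        key = (int(x // cell), int(y // cell))
        cells.setdefault(key, []).append(i)
    return cells


def merge_blocks(
    blocks: List[Tuple[float, float]], min_gap: float
) -> List[Tuple[float, float, List[int]]]:
    """Merge blocks whose gap to the previous drawn block is below `min_gap` pixels.

    Returns (start, end, member_indices) for each display object.
    """
    order = sorted(range(len(blocks)), key=lambda i: blocks[i][0])
    merged: List[Tuple[float, float, List[int]]] = []
    for i in order:
        start, end = blocks[i]
        if merged and start - merged[-1][1] < min_gap:
            prev_start, prev_end, members = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end), members + [i])
        else:
            merged.append((start, end, [i]))
    return merged


print(bin_scatter([(1, 1), (2, 3), (12, 1)], cell=10))
print(merge_blocks([(0, 10), (12, 20), (50, 60)], min_gap=5))
```

Larger values of either parameter produce fewer display objects, which is the trade-off between draw time and visual detail that the figure measures.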

Supplementary Figure 4 A comparison of draw times when varying specific chart parameters for Heatmap Plot and Lines Track.

The parameter for the heatmap is the maximum number of columns drawn before multiple columns are averaged into one. All data objects that are merged are mapped to the single column displayed. The data-to-visual-object mapping is used for brushing, tooltips and other interactivity actions. The parameter for line tracks is the maximum number of points drawn. If the number of data points is greater than this parameter, the required number of points is sampled uniformly.
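A short Python sketch of the two reductions follows. It rests on two assumptions that the legend leaves open: consecutive heatmap columns are grouped into equal-sized runs before averaging, and "sampled uniformly" is read as evenly spaced indices rather than random draws. The function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def average_columns(
    matrix: List[List[float]], max_cols: int
) -> Tuple[List[List[float]], List[List[int]]]:
    """Average runs of consecutive columns so at most `max_cols` are drawn.

    Returns the reduced matrix and, per drawn column, the original column indices.
    """
    n_cols = len(matrix[0]) if matrix else 0
    if n_cols <= max_cols:
        return [row[:] for row in matrix], [[j] for j in range(n_cols)]
    run = math.ceil(n_cols / max_cols)
    groups = [list(range(j, min(j + run, n_cols))) for j in range(0, n_cols, run)]
    reduced = [[sum(row[j] for j in g) / len(g) for g in groups] for row in matrix]
    return reduced, groups


def sample_points(points: Sequence[float], max_points: int) -> List[float]:
    """Keep at most `max_points` evenly spaced points, including both endpoints."""
    if max_points < 1:
        raise ValueError("max_points must be at least 1")
    n = len(points)
    if n <= max_points:
        return list(points)
    if max_points == 1:
        return [points[0]]
    # Integer arithmetic keeps the chosen indices distinct and in range.
    return [points[k * (n - 1) // (max_points - 1)] for k in range(max_points)]


print(average_columns([[1, 2, 3, 4, 5]], max_cols=2))
print(sample_points(list(range(11)), max_points=3))
```

The `groups` list returned by `average_columns` plays the role of the data-to-visual-object mapping: each drawn column can report which original columns it represents.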

Supplementary Figure 5 Gene expression analysis of colon cancer methylation loss regions with Epiviz.

A) We used the Epiviz computed columns feature to define an MA plot of colon cancer expression in the MMP gene family region (Figure 1). B) Gene expression barcode data for the same region shows similar expression patterns across multiple cancer types. Both of these plots were saved as PDFs directly from Epiviz.
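For readers unfamiliar with MA plots, the two computed columns are conventionally the log ratio M and the mean log intensity A of two conditions. The Python sketch below uses that standard definition; the function name `ma_values`, the tumor/normal pairing and the pseudocount are assumptions made for illustration, and the legend does not state how the columns were defined in Epiviz or whether the inputs were already on a log scale.

```python
import math
from typing import List, Sequence, Tuple


def ma_values(
    tumor: Sequence[float], normal: Sequence[float], pseudocount: float = 1.0
) -> List[Tuple[float, float]]:
    """Return (M, A) per gene for raw-scale expression values.

    M = log2(tumor) - log2(normal)         (log fold change, plotted on the y axis)
    A = (log2(tumor) + log2(normal)) / 2   (mean log expression, plotted on the x axis)
    The pseudocount is an assumed guard against taking the log of zero.
    """
    result: List[Tuple[float, float]] = []
    for t, n in zip(tumor, normal):
        log_t = math.log2(t + pseudocount)
        log_n = math.log2(n + pseudocount)
        result.append((log_t - log_n, (log_t + log_n) / 2))
    return result


print(ma_values([7.0, 3.0], [1.0, 3.0]))
```

A gene with equal expression in both conditions lands on the M = 0 line, so departures from that line highlight differential expression at each expression level A.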

Supplementary Figure 6 Comparison of hypomethylation block finding methods.

We compare hypomethylation blocks inferred using BSmooth on whole-genome bisulfite sequencing with blocks inferred with minfi on Illumina HumanMethylation450k beadarray data. In this plot we show the regions found along with smoothed bp-level mean methylation (for BSmooth) and probe-level mean methylation (aggregated over CpG clusters for minfi) data. The block-finding method used in minfi ignores methylation measurements in CpG islands by design, so that long blocks of methylation change span across CpG islands. BSmooth does not use this design, so its blocks are frequently punctuated by CpG islands. We see this effect in this specific integrative visualization using Epivizr, where the only difference between the hypomethylation blocks is the punctuation at the CpG island for the BSmooth block.

Supplementary Figure 7 The spatial distribution of genes in correlation with hypomethylated blocks.

Visualizing genes and corresponding exons side by side with methylation levels in normal and cancer tissues using Epiviz confirms that hypomethylated blocks are gene-poor.

Supplementary Figure 8 Exon-level expression in differentially methylated regions.

The track-based visualization of exon-level expression data, side by side with a view of DNA methylation and one of differentially methylated blocks, reveals that at low resolution exons tend to be silenced within blocks and highly expressed outside them.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–8 and Supplementary Note (PDF 2136 kb)

About this article

Cite this article

Chelaru, F., Smith, L., Goldstein, N. et al. Epiviz: interactive visual analytics for functional genomics data. Nat Methods 11, 938–940 (2014). https://doi.org/10.1038/nmeth.3038

Received: 08 January 2014
Accepted: 12 June 2014
Published: 03 August 2014
Issue Date: September 2014
DOI: https://doi.org/10.1038/nmeth.3038

This article is cited by

gEAR: Gene Expression Analysis Resource portal for community-driven, multi-omic data exploration
- Joshua Orvis
- Brian Gottfried
- Ronna Hertzano
Nature Methods (2021)
Enhanced JBrowse plugins for epigenomics data visualization
- Brigitte T. Hofmeister
- Robert J. Schmitz
BMC Bioinformatics (2018)
C-State: an interactive web app for simultaneous multi-gene visualization and comparative epigenetic pattern search
- Divya Tej Sowpati
- Surabhi Srivastava
- Rakesh K Mishra
BMC Bioinformatics (2017)
VisRseq: R-based visual framework for analysis of sequencing data
- Hamid Younesy
- Torsten Möller
- Steven JM Jones
BMC Bioinformatics (2015)
Epiviz: a view inside the design of an integrated visual analysis software for genomics
- Florin Chelaru
- Héctor Corrada Bravo
BMC Bioinformatics (2015)