Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Comment
  • Published:

The dynseq browser track shows context-specific features at nucleotide resolution

Subjects

High-throughput experimental platforms have revolutionized the ability to profile biochemical and functional properties of biological sequences such as DNA, RNA and proteins. By collating several data modalities with customizable tracks rendered using intuitive visualizations, genome browsers enable an interactive and interpretable exploration of diverse types of genome profiling experiments and derived annotations. However, existing genome browser tracks are not well suited for intuitive visualization of high-resolution DNA sequence features such as transcription factor motifs. Typically, motif instances in regulatory DNA sequences are visualized as BED-based annotation tracks, which highlight the genomic coordinates of the motif instances but do not expose their specific sequences. Instead, a genome sequence track needs to be cross-referenced with the BED track to identify sequences of motif hits. Even so, quantitative information about the motif instances such as affinity or conservation as well as differences in base resolution from the consensus motif are not immediately apparent. This makes interpretation slow and challenging. This problem is compounded when analyzing several cellular states and/or molecular readouts (such as ATAC-seq and ChIP–seq) simultaneously, as coordinates of enriched regions (peaks) and the set of active transcription factor motifs vary across cell states.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: WashU Epigenome Browser session for deciphering sequence architecture of a cis-regulatory element.

Data availability

Data and models used to create vignettes are available at: https://doi.org/10.5281/zenodo.6582100. See Supplementary Table 1 for browser-specific functionalities

Code availability

Code to reproduce vignettes is available at: https://doi.org/10.5281/zenodo.7019993. The dynseq tracks use the BigWig file format. A tutorial is available at: https://kundajelab.github.io/dynseq-pages/. The dynseq track is supported by:

• UCSC Genome Browser (https://genome.ucsc.edu). Documentation is available at: https://genome.ucsc.edu/goldenpath/help/bigWig.html#dynseq. Source code is available at: https://github.com/ucscGenomeBrowser/kent.

• HiGlass/Resgen (https://higlass.io; https://resgen.io). Dynseq is implemented as a plugin. Source code is available at: https://github.com/kundajelab/higlass-dynseq/.

• WashU Epigenome Browser (https://epigenomegateway.wustl.edu). Source code is available at: https://github.com/lidaof/eg-react. Documentation is available at: https://eg.readthedocs.io/en/latest/tracks.html#dynseq.

References

  1. Eraslan, G., Avsec, Ž., Gagneur, J. & Theis, F. J. Nat. Rev. Genet. 20, 389–403 (2019).

    Article  CAS  PubMed  Google Scholar 

  2. de Almeida, B. P., Reiter, F., Pagani, M. & Stark, A. Nat. Genet. 54, 613–624 (2022).

    Article  PubMed  Google Scholar 

  3. Avsec, Ž. et al. Nat. Methods 18, 1196–1203 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Jaganathan, K. et al. Cell 176, 535–548.e24 (2019).

    Article  CAS  PubMed  Google Scholar 

  5. Bogard, N., Linder, J., Rosenberg, A. B. & Seelig, G. Cell 178, 91–106.e23 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Schneider, T. D. Nucleic Acids Res. 25, 4408–4415 (1997).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Li, D., Hsu, S., Purushotham, D., Sears, R. L. & Wang, T. Nucleic Acids Res. 47, W158–W165 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Li, D. et al. Nucleic Acids Res. 50, W774–W781 (2022).

    Article  PubMed Central  Google Scholar 

  9. Kent, W. J., Zweig, A. S., Barber, G., Hinrichs, A. S. & Karolchik, D. Bioinformatics 26, 2204–2207 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Kent, W. J. Genome Res. 12, 996–1006 (2002).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Kerpedjiev, P. et al. Genome Biol. 19, 125 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  12. Avsec, Ž. et al. Nat. Genet. 53, 354–366 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. ENCODE Project Consortium. Nature 489, 57–74 (2012).

    Article  Google Scholar 

  14. Davis, C. A. et al. Nucleic Acids Res. 46, D794–D801 (2018).

    Article  CAS  PubMed  Google Scholar 

  15. Shrikumar, A., Greenside, P. & Kundaje, A. In Proc. 34th International Conference on Machine Learning 70, 3145–3153 (2017).

  16. Lundberg, S. M. & Lee, S.-I. A. In Advances in Neural Information Processing Systems (eds. Guyon, I. et al.) 30, 4765–4774 (Curran Associates, 2017).

  17. Kellis, M. et al. Proc. Natl. Acad. Sci. USA 111, 6131–6138 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Vierstra, J. et al. Nature 583, 729–736 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Pollard, K. S., Hubisz, M. J., Rosenbloom, K. R. & Siepel, A. Genome Res. 20, 110–121 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Tehranchi, A. K. et al. Cell 165, 730–741 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

This work was supported by National Institutes of Health (NIH) grant numbers U01HG009431 and U01HG012069 to A.K.; R01HG007175, U01CA200060, U24ES026699, U01HG009391, UM1HG011585, U41HG010972 and U24HG012070 to T.W.; 5U41HG002371 to B.J.R., B.T.L. and M.H.

Author information

Authors and Affiliations

Authors

Contributions

S.N. and A.K. conceived the project. S.N., A.B., D.L., B.J.R., B.T.L. and P.K. implemented the software. V.R. and A.P. trained machine learning models. S.N., P.K., F.L., T.W., M.H. and A.K. supervised the software development and/or analyses. S.N. drafted the initial manuscript and revised it with feedback from A.K. All authors approved the final manuscript.

Corresponding author

Correspondence to Anshul Kundaje.

Ethics declarations

Competing interests

A.K. is scientific co-founder of Ravel Biotechnology, is on the scientific advisory board of PatchBio, SerImmune, AINovo, TensorBio and OpenTargets, is a consultant with Illumina and owns shares in DeepGenomics, Immuni and Freenome. All other authors have no competing interests to declare.

Peer review

Peer review information

Nature Genetics thanks Bernardo de Almeida and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Supplementary information

Supplementary Information

Supplementary Note, Supplementary Figures 1-2, Supplementary Table 1

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nair, S., Barrett, A., Li, D. et al. The dynseq browser track shows context-specific features at nucleotide resolution. Nat Genet 54, 1581–1583 (2022). https://doi.org/10.1038/s41588-022-01194-w

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41588-022-01194-w

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research