Global voxel transformer networks for augmented microscopy

Wang, Zhengyang; Xie, Yaochen; Ji, Shuiwang

doi:10.1038/s42256-020-00283-x

Article
Published: 25 January 2021

Nature Machine Intelligence volume 3, pages 161–171 (2021)

A preprint version of the article is available at arXiv.

Abstract

Advances in deep learning have led to remarkable success in augmented microscopy, enabling us to obtain high-quality microscope images without using expensive microscopy hardware and sample preparation techniques. Current deep learning models for augmented microscopy are mostly U-Net-based neural networks and therefore share certain drawbacks that limit their performance. In particular, U-Nets are composed of local operators only and lack dynamic non-local information aggregation. In this work, we introduce global voxel transformer networks (GVTNets), a deep learning tool for augmented microscopy that overcomes intrinsic limitations of the current U-Net-based models and achieves improved performance. GVTNets are built on global voxel transformer operators, which are able to aggregate global information, as opposed to local operators such as convolutions. We apply the proposed methods to existing datasets for three different augmented microscopy tasks under various settings.
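
The contrast between local operators and global information aggregation can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a generic scaled dot-product self-attention over a flattened list of voxel feature vectors, with identity query, key and value projections (a trained operator would normally learn these). The function names `global_attention` and `local_average` are hypothetical and chosen only for this example.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def global_attention(features):
    """Aggregate information from ALL voxels for every output voxel.

    features: list of N voxel feature vectors, each a list of C floats.
    Assumption: queries, keys and values are the raw features themselves.
    The weights depend on the input, so the aggregation is dynamic.
    """
    dim = len(features[0])
    scale = math.sqrt(dim)
    outputs = []
    for query in features:
        # Similarity of this voxel to every voxel in the volume.
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in features]
        weights = softmax(scores)
        # Weighted average of all voxel features.
        outputs.append([sum(w * value[c] for w, value in zip(weights, features))
                        for c in range(dim)])
    return outputs


def local_average(features, radius=1):
    """Fixed-weight local operator for contrast: each output voxel only
    sees neighbours within `radius` along the flattened axis."""
    n = len(features)
    dim = len(features[0])
    outputs = []
    for i in range(n):
        window = features[max(0, i - radius):min(n, i + radius + 1)]
        outputs.append([sum(v[c] for v in window) / len(window)
                        for c in range(dim)])
    return outputs


if __name__ == "__main__":
    voxels = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5],
              [0.0, 0.0], [0.2, 0.1], [1.0, 1.0]]
    print(global_attention(voxels)[0])
    print(local_average(voxels)[0])
```

In this sketch, changing a distant voxel alters the attention output at voxel 0 but leaves the local average at voxel 0 untouched, which is the kind of non-local dependence the abstract says U-Nets lack.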

**Fig. 1: GVTNets architecture, training and inference.**

**Fig. 2: GVTNets on label-free prediction of 3D fluorescence images from transmitted-light microscopy.**

**Fig. 3: GVTNets on content-aware 3D image denoising.**

**Fig. 4: GVTNets on content-aware 3D-to-2D image projection.**

**Fig. 5: Generalization ability of GVTNets.**

Data availability

Datasets for label-free prediction of 3D fluorescence images from transmitted-light microscopy²⁵ can be downloaded from https://downloads.allencell.org/publication-data/label-free-prediction/index.html. Datasets for content-aware 3D image denoising and 3D-to-2D image projection²⁷ can be downloaded from https://publications.mpi-cbg.de/publications-sites/7207.

Code availability

The code for GVTNets training, prediction and evaluation (in Python/TensorFlow) is publicly available at https://github.com/divelab/GVTNets and ref. ⁶⁰.

References

1. Gustafsson, M. G. Surpassing the lateral resolution limit by a factor of two using structured illumination microscopy. J. Microsc. 198, 82–87 (2000).
2. Huisken, J., Swoger, J., Del Bene, F., Wittbrodt, J. & Stelzer, E. H. Optical sectioning deep inside live embryos by selective plane illumination microscopy. Science 305, 1007–1009 (2004).
3. Betzig, E. et al. Imaging intracellular fluorescent proteins at nanometer resolution. Science 313, 1642–1645 (2006).
4. Rust, M. J., Bates, M. & Zhuang, X. Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM). Nat. Meth. 3, 793–796 (2006).
5. Heintzmann, R. & Gustafsson, M. G. Subdiffraction resolution in continuous samples. Nat. Photon. 3, 362–364 (2009).
6. Tomer, R., Khairy, K., Amat, F. & Keller, P. J. Quantitative high-speed imaging of entire developing embryos with simultaneous multiview light-sheet microscopy. Nat. Meth. 9, 755–763 (2012).
7. Chen, B.-C. et al. Lattice light-sheet microscopy: imaging molecules to embryos at high spatiotemporal resolution. Science 346, 1257998 (2014).
8. Belthangady, C. & Royer, L. A. Applications, promises, and pitfalls of deep learning for fluorescence image reconstruction. Nat. Meth. 16, 1215–1225 (2019).
9. Laissue, P. P., Alghamdi, R. A., Tomancak, P., Reynaud, E. G. & Shroff, H. Assessing phototoxicity in live fluorescence imaging. Nat. Meth. 14, 657–661 (2017).
10. Icha, J., Weber, M., Waters, J. C. & Norden, C. Phototoxicity in live fluorescence microscopy, and how to avoid it. Bioessays 39, 1700003 (2017).
11. Selinummi, J. et al. Bright field microscopy as an alternative to whole cell fluorescence in automated analysis of macrophage images. PLoS ONE 4, e7497 (2009).
12. Pawley, J. B. in Handbook of Biological Confocal Microscopy (ed. Pawley, J. B.) 20–42 (Springer, 2006).
13. Scherf, N. & Huisken, J. The smart and gentle microscope. Nat. Biotechnol. 33, 815–818 (2015).
14. Skylaki, S., Hilsenbeck, O. & Schroeder, T. Challenges in long-term imaging and quantification of single-cell dynamics. Nat. Biotechnol. 34, 1137–1144 (2016).
15. LeCun, Y. et al. Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998).
16. Sullivan, D. P. & Lundberg, E. Seeing more: a future of augmented microscopy. Cell 173, 546–548 (2018).
17. Chen, P. et al. An augmented reality microscope with real-time artificial intelligence integration for cancer diagnosis. Nat. Med. 25, 1453–1457 (2019).
18. Moen, E. et al. Deep learning for cellular image analysis. Nat. Meth. 16, 1233–1246 (2019).
19. Johnson, G. R., Donovan-Maiye, R. M. & Maleckar, M. M. Building a 3D integrated cell. Preprint at https://doi.org/10.1101/238378 (2017).
20. Ounkomol, C. et al. Three dimensional cross-modal image inference: label-free methods for subcellular structure prediction. Preprint at https://doi.org/10.1101/216606 (2017).
21. Osokin, A., Chessel, A., Carazo Salas, R. E. & Vaggi, F. GANs for biological image synthesis. In Proc. IEEE International Conference on Computer Vision 2233–2242 (2017).
22. Yuan, H. et al. Computational modeling of cellular structures using conditional deep generative networks. Bioinformatics 35, 2141–2149 (2019).
23. Johnson, G., Donovan-Maiye, R., Ounkomol, C. & Maleckar, M. M. Studying stem cell organization using ‘label-free’ methods and a novel generative adversarial model. Biophys. J. 114, 43A (2018).
24. Christiansen, E. M. et al. In silico labeling: predicting fluorescent labels in unlabeled images. Cell 173, 792–803 (2018).
25. Ounkomol, C., Seshamani, S., Maleckar, M. M., Collman, F. & Johnson, G. R. Label-free prediction of three-dimensional fluorescence images from transmitted-light microscopy. Nat. Meth. 15, 917–920 (2018).
26. Wu, Y. et al. Three-dimensional virtual refocusing of fluorescence microscopy images using deep learning. Nat. Meth. 16, 1323–1331 (2019).
27. Weigert, M. et al. Content-aware image restoration: pushing the limits of fluorescence microscopy. Nat. Meth. 15, 1090–1097 (2018).
28. Wang, H. et al. Deep learning enables cross-modality super-resolution in fluorescence microscopy. Nat. Meth. 16, 103–110 (2019).
29. Rivenson, Y. et al. Deep learning microscopy. Optica 4, 1437–1443 (2017).
30. Ronneberger, O., Fischer, P. & Brox, T. U-Net: convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention 234–241 (Springer, 2015).
31. Falk, T. et al. U-Net: deep learning for cell counting, detection, and morphometry. Nat. Meth. 16, 67–70 (2019).
32. He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 770–778 (2016).
33. He, K., Zhang, X., Ren, S. & Sun, J. Identity mappings in deep residual networks. In European Conference on Computer Vision 630–645 (Springer, 2016).
34. Fakhry, A., Zeng, T. & Ji, S. Residual deconvolutional networks for brain electron microscopy image segmentation. IEEE Trans. Med. Imaging 36, 447–456 (2017).
35. Lee, K., Zung, J., Li, P., Jain, V. & Seung, H. S. Superhuman accuracy on the SNEMI3D connectomics challenge. Preprint at https://arxiv.org/abs/1706.00120 (2017).
36. Çiçek, Ö., Abdulkadir, A., Lienkamp, S. S., Brox, T. & Ronneberger, O. 3D U-Net: learning dense volumetric segmentation from sparse annotation. In International Conference on Medical Image Computing and Computer-Assisted Intervention 424–432 (Springer, 2016).
37. Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. Preprint at https://arxiv.org/abs/1409.1556 (2014).
38. Vaswani, A. et al. Attention is all you need. In Advances in Neural Information Processing Systems 5998–6008 (2017).
39. Wang, X., Girshick, R., Gupta, A. & He, K. Non-local neural networks. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 7794–7803 (2018).
40. Wilson, D. R. & Martinez, T. R. The general inefficiency of batch training for gradient descent learning. Neural Networks 16, 1429–1451 (2003).
41. Wang, Z. et al. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
42. Aigouy, B. et al. Cell flow reorients the axis of planar polarity in the wing epithelium of Drosophila. Cell 142, 773–786 (2010).
43. Etournay, R. et al. Interplay of cell dynamics and epithelial tension during morphogenesis of the Drosophila pupal wing. eLife 4, e07090 (2015).
44. Pan, S. J. & Yang, Q. A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22, 1345–1359 (2009).
45. Blasse, C. et al. PreMosa: extracting 2D surfaces from 3D microscopy mosaics. Bioinformatics 33, 2563–2569 (2017).
46. Cai, L., Wang, Z., Gao, H., Shen, D. & Ji, S. Deep adversarial learning for multi-modality missing data completion. In Proc. 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 1158–1166 (Association for Computing Machinery, 2018).
47. Zhang, Q., Cui, Z., Niu, X., Geng, S. & Qiao, Y. Image segmentation with pyramid dilated convolution based on ResNet and U-Net. In International Conference on Neural Information Processing 364–372 (Springer, 2017).
48. Huang, J. et al. Range scaling global U-Net for perceptual image enhancement on mobile devices. In Proc. European Conference on Computer Vision (ECCV) (Springer, 2018).
49. Oktay, O. et al. Attention U-Net: learning where to look for the pancreas. Preprint at https://arxiv.org/abs/1804.03999 (2018).
50. Zhou, Z., Siddiquee, M. M. R., Tajbakhsh, N. & Liang, J. UNet++: a nested U-Net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support 3–11 (Springer, 2018).
51. Gu, Z. et al. CE-Net: context encoder network for 2D medical image segmentation. IEEE Trans. Med. Imaging 38, 2281–2292 (2019).
52. Goodfellow, I. et al. Generative adversarial nets. In Advances in Neural Information Processing Systems 2672–2680 (MIT Press, 2014).
53. Rivenson, Y. et al. Virtual histological staining of unlabelled tissue-autofluorescence images via deep learning. Nat. Biomed. Eng. 3, 466–477 (2019).
54. Finn, C., Abbeel, P. & Levine, S. Model-agnostic meta-learning for fast adaptation of deep networks. In Proc. 34th International Conference on Machine Learning 70, 1126–1135 (JMLR, 2017).
55. Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 1097–1105 (2012).
56. Kolda, T. G. & Bader, B. W. Tensor decompositions and applications. SIAM Rev. 51, 455–500 (2009).
57. Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. In Proc. 3rd International Conference on Learning Representations (2015).
58. Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning 448–456 (2015).
59. Kendall, A. & Gal, Y. What uncertainties do we need in Bayesian deep learning for computer vision? In Advances in Neural Information Processing Systems 5574–5584 (2017).
60. Wang, Z., Xie, Y. & Ji, S. zhengyang-wang/GVTNets: Code for “Global voxel transformer networks for augmented microscopy” (version v1.0.0). Zenodo https://doi.org/10.5281/zenodo.4285769 (2020).

Acknowledgements

We thank the teams at CARE and the Allen Institute for Cell Science for making their data and tools publicly available. This work was supported in part by National Science Foundation grants DBI-1922969, IIS-1908166 and IIS-1908220, National Institutes of Health grant 1R21NS102828 and Defense Advanced Research Projects Agency grant N66001-17-2-4031.

Author information

These authors contributed equally: Zhengyang Wang, Yaochen Xie.

Authors and Affiliations

Texas A&M University, Department of Computer Science and Engineering, College Station, TX, USA
Zhengyang Wang, Yaochen Xie & Shuiwang Ji

Contributions

S.J. conceived and initiated the research. Z.W. and S.J. designed the methods. Z.W. and Y.X. implemented the training and validation methods. Z.W. and Y.X. designed and developed the software package. S.J. supervised the project. Z.W., Y.X. and S.J. wrote the manuscript.

Corresponding author

Correspondence to Shuiwang Ji.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Machine Intelligence thanks Ruogu Fang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–13, Tables 1–6 and Notes 1 and 2.

About this article

Cite this article

Wang, Z., Xie, Y. & Ji, S. Global voxel transformer networks for augmented microscopy. Nat Mach Intell 3, 161–171 (2021). https://doi.org/10.1038/s42256-020-00283-x

Received: 20 February 2020
Accepted: 14 December 2020
Published: 25 January 2021
Issue Date: February 2021
DOI: https://doi.org/10.1038/s42256-020-00283-x
