Here, we review single-cell sequencing techniques for individual and multiomics profiling in single cells. We mainly describe single-cell genomic, epigenomic, and transcriptomic methods, and examples of their applications. For the integration of multilayered data sets, such as the transcriptome data derived from single-cell RNA sequencing and chromatin accessibility data derived from single-cell ATAC-seq, there are several computational integration methods. We also describe single-cell experimental methods for the simultaneous measurement of two or more omics layers. We can achieve a detailed understanding of the basic molecular profiles and those associated with disease in each cell by utilizing a large number of single-cell sequencing techniques and the accumulated data sets.
Introduction: single-cell sequencing analysis
Recently, single-cell sequencing technologies have been rapidly developed for observing the multilayered status of single cells. Single-cell sequencing has the power to elucidate genomic, epigenomic, and transcriptomic heterogeneity in cellular populations, and the changes at these levels. A large number of reports on this topic have been published worldwide from various regions. Under an international approach, the Human Cell Atlas (HCA; https://www.humancellatlas.org/) has been generating comprehensive molecular maps of all human cells1. The HCA platform utilizes single-cell sequencing techniques to obtain single-cell genomic information from healthy and diseased cells. To study all types of cells and omics layers, we should consider single-cell sequencing methods from both laboratory and clinical perspectives. In this review, we introduce basic information and describe several applications of single-cell sequencing techniques.
Single-cell transcriptome sequencing
Single-cell RNA sequencing (scRNA-seq)2 has been widely utilized worldwide. RNA-seq analysis conventionally measures transcripts in a mixture of cells (called a “bulk”). Bulk RNA-seq analysis allows the measurement of only the average transcript expression in a cell population. For example, in the RNA-seq of cancer tissue, transcripts from various types of cells, including tumor cells, immune cells, fibroblasts, and endothelial cells, are analyzed. To precisely understand the transcriptomic status of such heterogeneous cell populations, we can use scRNA-seq techniques (Table 1). For the analysis of tissues, cell dissociation is the most important step, as the conditions of this step directly affect the molecular profiles of cells, and the impacts of stress and damage depend on the cell type. For the measurement of transcripts in individual cells, reverse transcription (RT) and cDNA amplification must be performed from very small amounts of RNA. Smart-seq3 is a whole-transcriptome amplification (WTA) method that has been developed for full-length cDNA amplification with oligo-dT priming and template switching. Smart-seq24, Quartz-Seq5, and CEL-seq6 have also been developed to stably measure mRNAs from a single cell. RamDa-seq7 detects non-poly(A) transcripts, including long noncoding RNAs and enhancer RNAs, in a single cell. Although diverse WTA methods exist, it is still difficult to perform scRNA-seq because the processing of hundreds to thousands of single cells and small amounts of liquid are conditions inherent to WTA methods. A number of methods for the simple procedure of scRNA-seq library construction have been reported. Several protocols based on microdroplet technology have been reported, such as Drop-Seq8 and DroNc-seq9. In these methods, a cell/nucleus, reaction liquid, and a barcoded bead are included in an oil droplet, and RT is conducted with molecular/cell barcoding within each oil droplet. On the other hand, in the microwell-seq10 approach, a cell and barcoded bead are isolated in a well. Nx1-seq11 and Seq-Well12 are reported to be portable, low-cost microwell-based platforms. These microdroplet- and microwell-based protocols enable the easy handling of thousands of single cells. For higher-throughput and lower-cost scRNA-seq analysis, sci-RNA-seq13 is a combinatorial indexing method (the recent version is sci-RNA-seq314) that has been developed. Vendors have also developed automatic scRNA-seq platforms that can automatically conduct cell isolation, cell lysis, RT, and PCR amplification for each individual cell. The C1 Single-Cell Auto Prep system (Fluidigm) was launched in 2013. This platform enables the automatic isolation of 96 cells, cDNA synthesis, and amplification based on Smart-seq through microfluidics. C1-CAGE can also be conducted using the C1 system, which enables the profiling of the 5′ end of transcripts with strand information in a single cell15. Microdroplet-based systems such as Chromium (10× Genomics), ddSEQ (Bio-Rad/Illumina), Nadia (Dolomite), and inDrop (1CellBio) and microwell-based systems such as Rhapsody (BD) and ICELL8 (Takara) also exist. Researchers can select various methods and platforms for scRNA-seq (Table 1). However, problems such as limited cell capture, low RT efficiency, amplification bias and the requirement for a large number of sequencing reads remain, depending on the platform. Users should select appropriate methods of scRNA-seq for their sample type and research purpose (Fig. 1). For example, although only 96 cells can be analyzed per run with a C1 chip, as determined by its size, the C1/Smart-seq platform can be used to obtain full-length cDNA libraries for each cell separately and can perform additional sequencing of libraries in user-selected wells; therefore, we can obtain in-depth transcriptome information for each cell. Chromium enables the analysis of thousands of cells with simple protocols, while the libraries of selected cells cannot be reanalyzed because libraries are mixed after barcoding. In several studies, multiple methods have been applied to the same samples to complement the weak points of each method. We also conducted two types of scRNA-seq assays, bead-seq16 (involving a small number of cells and providing a large amount of information for each cell) and Chromium (involving many cells and providing less information per cell), to monitor transcriptome changes under anticancer drug treatment, and we extracted cells that exhibited an atypical expression pattern from bead-seq data and validated the cells in a larger population using Chromium data17. Furthermore, as scRNA-seq platforms and their consumables are frequently updated, we should be aware of their different versions. In particular, 10× Genomics updated the reagent kit from v2 to v3 for the Chromium system, and the detection sensitivity was greatly improved. They also recently modified the chip architecture of Next GEM technology. It is important to recognize the differences in the detection limits and dynamic ranges of gene expression levels when we compare or merge data across different versions of machines or kits.
Recent scRNA-seq studies have been conducted in various research fields, such as immunology, developmental biology and oncology. In the field of cancer genomics, researchers have conducted the scRNA-seq of cancer cells and their surrounding stromal cells in the tumor microenvironment. Several groups have reported the scRNA-seq of brain tumors and revealed intratumor transcriptional heterogeneity and diverse evolutionary paths18,19,20. Tirosh et al.21 performed the first large-scale scRNA-seq study of the tumor ‘ecosystem’ and performed the scRNA-seq of CD45+ and CD45− cells in 19 melanoma patients. They specifically elucidated different types of T cell exhaustion programs in each patient, which might be relevant for immunotherapy strategies. Chung et al.22 also focused on tumor and immune cells, including T cells, B cells and macrophages, in 11 breast cancer samples. Tumor-infiltrating lymphocytes in various types of cancers, such as hepatocellular carcinoma23, non-small-cell lung cancer24, and colon cancer, have also been targeted for scRNA-seq. In our group, to elucidate tumor evolution and the mechanism of acquired resistance to anticancer drugs, we conducted the scRNA-seq of lung cancer cell lines stimulated by receptor tyrosine kinase inhibitors. We observed different transcriptional responses to the drug among sensitive and insensitive cells25, and identified distinct transcriptional modules that might be associated with early resistance responses, such as dormancy17. The number of studies utilizing scRNA-seq is continuing to increase rapidly.
For the computational analysis of scRNA-seq data, we cannot simply use the bioinformatics approaches employed for bulk RNA-seq analysis because single-cell sequencing generates sparse multidimensional data. Seurat26 is an R package for scRNA-seq analysis that includes data filtering, normalization, scaling, dimensional reduction, clustering and visualization. To further analyze single-cell transcriptome data, various types of algorithms and tools have been developed, such as MAGIC27, SAVER28, and scImpute29 for imputation, Seurat CCA30 and ZINB-WaVE31 for batch effect removal, Monocle332 and cellTree33 for pseudotime trajectory, and velocyte34 for RNA velocity and NicheNet35 for ligand–receptor interaction determination. There are an increasing number of tools for scRNA-seq analysis; as a result, we should select suitable tools for our own research purposes and data sets.
Single-cell genome sequencing for understanding genetic heterogeneity
Single-cell genome sequencing enables the elucidation of genetic heterogeneity; thus, it can be used for the analysis of de novo germline mutations and somatic mutations in normal and cancer cells (Table 2). To uniformly amplify genomic DNA in individual cells, whole-genome amplification (WGA) methods have been developed2, such as multiple displacement amplification (MDA)36, multiple annealing and looping-based amplification cycles (MALBAC)37 and degenerate oligonucleotide-primed PCR (DOP-PCR)38. WGA is challenging for reasons such as the presence of only two copies of genomic DNA in human cells. This strategy occasionally misses an allele within a large genomic region (allelic dropout) and fails to achieve a uniform sequencing depth because of amplification bias. Care must be taken in the analysis of such genome sequencing data, especially in the detection of point mutations. Bioinformatics methods such as SCcaller39, Monovar40, LiRA41, and Conbase42 have been developed to detect single-nucleotide variants (SNVs) considering allelic dropout and amplification artifacts. For automatic library construction, the C1 system supports single-cell whole-genome and whole-exome sequencing. Furthermore, 10× Genomics recently released a copy number variant (CNV) solution for the Chromium system to profile copy numbers in single cells. The procedure for library construction is simplified, but a large number of sequencing reads are required, and the sequencing cost is very high.
Single-cell genome sequencing reveals genetic heterogeneity. Mutations independently accumulate in cells and cause aging and diseases such as developmental diseases and cancers. Zhang et al.43 reported a single-cell whole-genome sequencing study of somatic mutations in B lymphocytes and observed the accumulation of somatic mutations with age and mutational signatures associated with the carcinogenesis of B cell cancers. They used the MDA method for WGA and obtained whole-genome sequencing data that covered approximately half of the genome regions at 20× and achieved greater sequencing depths. Neurogenerative diseases have also been analyzed through single-cell genome sequencing because most neurons exhibit longevity and cannot be renewed; thus, mutations tend to accumulate44. In a previous report45, a total of 159 single neurons from healthy and diseased individuals were sequenced to evaluate the accumulation of somatic mutations caused by aging or defects in DNA damage repair. Bae et al.46 also conducted the genome sequencing of single neurons from the prenatal brain and detected 200–400 SNVs per cell. In cancers, researchers have attempted to identify intratumor genetic heterogeneity generated during cancer evolution. Dr. Navin’s group reported a series of single-cell genome analyses of cancer cells, focusing on breast cancer cells in particular. They elucidated tumor progression through analyses of punctuated copy number evolution and the gradual evolution of point mutations by conducting single-cell genome sequencing and profiling mutations and CNVs in each individual cancer cell47,48,49. They also reported multiclonal invasion, which is a model of cancer evolution from ductal carcinoma in situ (DCIS), as an early stage in the progression of breast cancer to invasive ductal carcinoma (IDC)50. In another report, the adaptive selection of pre-existing clones was used as a model of chemoresistance to neoadjuvant therapy51. Furthermore, to understand the clonal evolution that leads to the acquisition of resistance to FLT3 inhibitors in acute myeloid leukemia (AML), McMahon et al.52 performed single-cell targeted DNA sequencing using the Tapestri platform (Mission Bio). They found that clones harboring RAS/MAPK mutations were selected after treatment with FLT3 inhibitors.
Single-cell epigenome sequencing for detecting footprints of differentiation of individual cells
Single-cell sequencing technologies for studying epigenomics also exist (Table 3). By elucidating the epigenomic status of cells, such as DNA methylation and chromatin states, we can observe the cell lineage and differentiation state of individual cells. Single-cell DNA methylation profiling can be analyzed by single-cell bisulfite sequencing (scBS-seq)53 and single-cell reduced representation bisulfite sequencing (scRRBS)54.
For the investigation of chromatin status, several methods can be used to measure the patterns of histone modifications in individual cells. Single-cell ChIP-seq can be conducted via a droplet microfluidics-based procedure known as Drop-ChIP55. This study reported the H3K4me2 and H3K4me3 patterns of mouse ES cells, embryonic fibroblasts and hematopoietic progenitors. Grosselin et al.56 recently conducted single-cell chromatin immunoprecipitation followed by sequencing (scChIP-seq) to analyze the H3K27me3 landscapes of patient-derived xenografts (PDXs) of breast cancers. They revealed differences between cells that were sensitive and resistant to chemotherapies and found that a fraction of sensitive tumors already harbored the distinct H3K27me3 patterns observed in resistant cells. Cleavage under targets and tagmentation (CUT&Tag)57 is another method used to profile chromatin components. First, an antibody identifies a target chromatin protein, such as a histone modification. Then, protein A and Tn5 transposase fusion proteins bind to the antibody and are tagged to the genomic regions where the target protein is bound.
Assay for transposase-accessible chromatin using sequencing (ATAC-seq) elucidates open chromatin patterns using a small number of cells. Open chromatin regions are tagged with sequencing adaptors by Tn5 transposase, amplified by PCR and sequenced. Several single-cell platforms, including the C1 and Chromium systems, enable single-cell ATAC-seq (scATAC-seq). In the C1 system, all steps of library preparation, from cell lysis to PCR amplification, are automatically conducted with microfluidics58. For the Chromium Single-Cell ATAC Solution approach, researchers must prepare isolated nuclei and conduct Tn5 tagmentation before separation in droplets. scATAC-seq is useful for analyzing transcriptional regulatory programs in mixed cell populations including various lineages and developmental stages, such as blood cells. Corces et al.59 reported the application of “enhancer cytometry” for the identification of cell types in a mixed population of blood cells using ATAC-seq data, which included the in silico deconvolution of cell types based on enhancer patterns. They constructed a regulatory map of hematopoiesis and elucidated the AML cell population with the projection of scATAC-seq data for validation.
Proteomics analysis at the single-cell level
To comprehensively measure the expression patterns of each protein, researchers generally use mass spectrometry or flow cytometry rather than sequencing. Technical challenges related to factors such as the required sample amounts and detection coverage are encountered in the application of mass spectrometry to single-cell proteomics, such that various study groups are now making an effort to develop methods for measurement of more protein molecules using a lower sample input. In recent single-cell studies, CyToF, which is a method based on mass cytometry, has been used to analyze tens of surface and intracellular proteins by using antibodies tagged with metal labels. For immune cells, in particular, the profiling of cell surface proteins is useful for the classification of cell types. There have been many studies using CyToF, including general and cancer immunology studies, often in combination with scRNA-seq analysis.
Integration of different layers of single-cell data sets
Single-cell sequencing enables the elucidation of the omics features of each layer of genomic, epigenomic and transcriptomic data. Many studies have attempted to integrate single-cell data sets that are independently obtained from multiple layers.
To integrate different layers of single-cell omics data, several computational methods have been developed, such as Seurat Label Transfer60 and LIGER61. To provide an overview multiomics single-cell analysis, we describe a representative case for analysis involving the mouse lung. As shown in Fig. 2, we conducted scRNA-seq and scATAC-seq of mouse lung cells using the Chromium system and tried to integrate the results using Seurat Label Transfer. We generated scRNA-seq data sets using Chromium after the dissociation of mouse lung tissue according to the manufacturer’s protocol. We also extracted nuclei from the mouse lung tissue for scATAC-seq. We used Cell Ranger and Cell Ranger ATAC, which are analytical pipelines provided by 10× Genomics, to extract matrices of RNA expression and open chromatin patterns from each data set for individual cells. For scRNA-seq, we used Seurat v3 and annotated cell subpopulations (clusters) according to known cell type markers, such as Epcam and Cdh1 for epithelial cells and Cd19 for B cells, following the filtering of low-quality data, dimensional reduction and clustering (Fig. 2b). To integrate the scATAC-seq data (Fig. 2c) with scRNA-seq clusters annotated by cell-type markers, we conducted Seurat Label Transfer (Fig. 2d). Briefly, scATAC-seq reads in promoters and gene bodies were counted to represent the open chromatin status of each gene as gene activities. From the gene expression level (scRNA-seq) and gene activity (scATAC-seq) data, the shared characters of the two data sets were extracted as anchors. Using these anchors, scRNA-seq clusters were transferred as a reference into scATAC-seq patterns. This integration ignored several regulatory factors, such as transcription factor binding and enhancers. We suggest that each layer of single-cell data sets is carefully analyzed in detail before different multiple layers are integrated. We may be successful in roughly integrating scRNA-seq and scATAC-seq at the cell type level (i.e., epithelial cells, immune cells) using Seurat and LIGER. However, integration focusing on detailed cellular states including unknown cell subpopulations and transition events that are only determined by the epigenome would be more difficult because these tools use scATAC-seq data as RNA-seq data, ignoring binary patterns of open chromatin data and complicated transcriptional regulation. Methods such as scAI62 have indicated the weakness of gene activity-based integration, and different approaches have been reported to overcome these weaknesses. The mouse lung data sets are provided in our DBKERO database (https://kero.hgc.jp/).
A different group developed single-nucleus droplet-based sequencing (snDrop-seq) for gene expression profiling and single-cell transposome hypersensitive site sequencing (scTHS-seq) for the analysis of chromatin accessibility in more than 60,000 human brain cells and integrated the two data sets63. They used a gradient boosting model (GBM) to associate differential accessibility with differential gene expression and to understand cell-type-specific transcriptional regulation in the human brain. This integration strategy is helpful for annotating cell types from both chromatin accessibility data and gene expression data and for understanding the association between transcriptional regulation and gene expression for each of the cell types. To identify the causes of mixed-phenotype acute leukemia, Granja et al.64 conducted CITE-seq (see below), scATAC-seq and scRNA-seq analysis. They integrated chromatin accessibility and gene expression data by using Seurat CCA and identified responsible transcription factors in leukemia.
Multilayered sequencing from the same cells
Once an individual cell is used for the sequencing analysis of a single omics layer, we cannot profile different layers of omics information from the same cell. Methods that analyze two or more omics layers from a single cell have been reported65 (Fig. 3 and Table 4). G&T-seq66 and DR-seq67 were developed for simultaneously analyzing genomic DNA sequences and mRNA profiles. The copy number profile and expression profile accuracy of these methods is similar to that achieved via conventional WGA and WTA methods, respectively. scDam&T-seq68 measures both protein–DNA interactions and transcriptome profiles in the same cell and can thus couple transcriptional regulation analysis and gene expression analysis in individual cells by focusing on chromatin-associated proteins such as the lamina and Polycomb complex. As described above, the rapid development of scRNA-seq platforms has enabled us to easily obtain single-cell transcriptome profiles. However, it is still difficult to obtain single-cell genome sequences for joint analysis with transcriptome data from the same cell because no automatic platforms have been developed for the simultaneous measurement of the only two copies of genomic DNA and the 0.1–1 million mRNA molecules per cell, let alone for addressing the difficulty of avoiding dropout and detection bias. There are still only a small number of reports of the use of these methods.
Transcript-indexed ATAC-seq (T-ATAC-seq)69 combines open chromatin profiling with the analysis of T cell receptor genes and thus analyzes epigenomic profiles in T cell clones. SNARE-seq70, Paired-seq71, and scCAT-seq72 enable the measurement of chromatin accessibility and whole-transcriptome profiles. In addition, 10× Genomics is scheduled to release the Single-Cell ATAC + Gene Expression platform for the simultaneous profiling of the epigenome and transcriptome by combining scRNA-seq with ATAC-seq (https://www.10xgenomics.com/product-updates/). We can expect that researchers will be able to access a simple protocol and platform for this purpose. Epigenomic landscapes determine the basic characteristics of cells, such as the cell lineage and differentiation state, while the transcriptome status represents the consequences of the cell conditions in a given state. Methods that can measure both transcriptome and open chromatin status in a single cell enable the elucidation of the direct link between transcriptome networks and their regulation, including the epigenome landscape and responsible transcription factors in each cell, resulting in an increasing number of reports and data sets arising from the simultaneous measurement of gene expression and ATAC-seq profiles.
For the simultaneous expression profiling of transcripts and cell surface proteins, CITE-seq73 and REAP-seq74 were developed, which are used mainly in immune cell analysis. Antibodies conjugated to barcode sequences are used to capture target cell surface proteins, and mRNAs and the barcode sequences of antibodies are analyzed for each cell. Feature Barcoding (10× Genomics) enables the combined profiling of targeted cell surface proteins with scRNA-seq via the Chromium system. The protocol is very simple and easily conducted: antibodies conjugated with each Feature Barcode oligo used to mark cell surface protein expression are mixed, single-cell separation, and amplification are conducted via the Chromium platform, and libraries of both cDNA and antibody-derived tags are constructed. Several cell types, especially immune cells, have historically been classified according to patterns of cell surface proteins. For example, naive, memory, and effector T cells are distinguished using CD45 isoform patterns (CD45RA/CD45RO antigens); however, these isoforms are not measured via general 3′ scRNA-seq, which indicates that information on the expression of cell surface markers may support the classification and interpretation of cell subsets. 10× Genomics also announced that they will release a method for the detection of intracellular proteins combined with gene expression profiling in a cell. The application of multilayered single-cell sequencing has expanded to include its combination with proteomics analysis.
In this review, we summarized single-cell sequencing methods applied at the genome, epigenome and transcriptome levels and their combinations, even including proteome-level analysis. An increasing number of experimental and computational methods are being rapidly developed for single-cell analysis, and we need to understand the advantages and disadvantages of each of these methods. We can obtain various omics profiles from each individual cell and should utilize the obtained information to understand the heterogeneity of molecular profiles, their changes in a given population, and the interaction among cells, although the obtained data sets include high-dimensional and mostly sparse data and, thus, are not easy to handle. Multiomics data analysis from the same single cell is more reliable than the integration of single omics layers because less sampling bias and fewer batch effects are involved, as shown by CITE-seq, for example. However, it is still easier to obtain single-layered data from single cells, and their integration may allow more cost-effective and less time-consuming analysis to be achieved by utilizing publicly available data. The data coverage (sequencing depths and the number of detected genes/regions) may be better for single omics data because more sequencing reads are required to cover two or more layers in multiomics sequencing. We can utilize a combination of single and multilayered sequencing depending on the omics layers involved.
Furthermore, the results obtained with single-cell sequencing technologies lack spatial information because a tissue is dissociated into single cells before sequencing analysis. Recently, spatial transcriptome techniques in which gene expression analysis is conducted in tissue sections have been reported, where spatial information is retained via molecular barcoding; these include methods such as the Slide-seq75 and Visium (10× Genomics/Spatial Transcriptomics) approaches76. Using Visium, gene expression profiles from one to tens of cells can be measured in up to 5000 spots (55 μm diameter per spot) on a slide for each tissue section. A frozen tissue section with a 10–20 μm thickness is prepared on the slide with oligos containing spatial barcodes and UMIs. By sequencing the synthesized cDNA libraries, we obtained RNA-seq data for each local spot with spatial information. By comparing these data with an H&E-stained image, we can compare gene expression patterns with histopathological information. Although existing spatial transcriptome techniques are still not available at a single-cell resolution, they enable us to identify differential expression patterns depending on the condition of each local microenvironment within tissues. We need to not only deal with single-cell multiomics information but also integrate temporal and spatial information to understand the diverse omics features of each individual cell.
Regev, A. et al. The human cell atlas. Elife 6, e27041 (2017).
Kashima, Y., Suzuki, A. & Suzuki, Y. An informative approach to single-cell sequencing analysis. Adv. Exp. Med. Biol. 1129, 81–96 (2019).
Ramsköld, D. et al. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat. Biotechnol. 30, 777–782 (2012).
Picelli, S. et al. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nat. Methods 10, 1096–1100 (2013).
Sasagawa, Y. et al. Quartz-Seq: a highly reproducible and sensitive single-cell RNA-Seq reveals non-genetic gene expression heterogeneity. Genome Biol. 14, R31 (2013).
Hashimshony, T., Wagner, F., Sher, N. & Yanai, I. CEL-Seq: single-cell RNA-seq by multiplexed linear amplification. Cell Rep. 2, 666–673 (2012).
Hayashi, T. et al. Single-cell full-length total RNA sequencing uncovers dynamics of recursive splicing and enhancer RNAs. Nat. Commun. 9, 619 (2018).
Macosko, E. Z. et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161, 1202–1214 (2015).
Habib, N. et al. Massively parallel single-nucleus RNA-seq with DroNc-seq. Nat. Methods 14, 955–958 (2017).
Han, X. et al. Mapping the mouse cell atlas by microwell-seq. Cell 172, 1091–1107.e17 (2018).
Hashimoto, S. Nx1-Seq Well based single-cell analysis system. Adv. Exp. Med. Biol. 1129, 51–61 (2019).
Gierahn, T. M. et al. Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput. Nat. Methods 14, 395–398 (2017).
Cao, J. et al. Comprehensive single-cell transcriptional profiling of a multicellular organism. Science 357, 661–667 (2017).
Cao, J. et al. The single-cell transcriptional landscape of mammalian organogenesis. Nature 566, 496–502 (2019).
Kouno, T. et al. C1 CAGE detects transcription start sites and enhancer activity at single-cell resolution. Nat. Commun. 10, 360 (2019).
Matsunaga, H. et al. A highly sensitive and accurate gene expression analysis by sequencing (‘bead-seq’) for a single cell. Anal. Biochem. 471, 9–16 (2015).
Kashima, Y. et al. Combinatory use of distinct single-cell RNA-seq analytical platforms reveals the heterogeneous transcriptome response. Sci. Rep. 8, 3482 (2018).
Patel, A. P. et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science 344, 1396–1401 (2014).
Venteicher, A. S. et al. Decoupling genetics, lineages, and microenvironment in IDH-mutant gliomas by single-cell RNA-seq. Science 355, eaai8478 (2017).
Tirosh, I. et al. Single-cell RNA-seq supports a developmental hierarchy in human oligodendroglioma. Nature 539, 309–313 (2016).
Tirosh, I. et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science 352, 189–196 (2016).
Chung, W. et al. Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nat. Commun. 8, 15081 (2017).
Zheng, C. et al. Landscape of infiltrating T cells in liver cancer revealed by single-cell sequencing. Cell 169, 1342–1356.e16 (2017).
Guo, X. et al. Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing. Nat. Med. 24, 978–985 (2018).
Suzuki, A. et al. Single-cell analysis of lung adenocarcinoma cell lines reveals diverse expression patterns of individual cells invoked by a molecular target drug treatment. Genome Biol. 16, 66 (2015).
Satija, R., Farrell, J. A., Gennert, D., Schier, A. F. & Regev, A. Spatial reconstruction of single-cell gene expression data. Nat. Biotechnol. 33, 495–502 (2015).
Bierie, B. et al. Recovering gene interactions from single-cell data using data diffusion. Cell 174, 716–729.e27 (2018).
Huang, M. et al. SAVER: gene expression recovery for single-cell RNA sequencing. Nat. Methods 15, 539–542 (2018).
Li, W. V. & Li, J. J. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat. Commun. 9, 997 (2018).
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
Risso, D., Perraudeau, F., Gribkova, S., Dudoit, S. & Vert, J. P. A general and flexible method for signal extraction from single-cell RNA-seq data. Nat. Commun. 9, 284 (2018).
Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 32, 381–386 (2014).
duVerle, D. A., Yotsukura, S., Nomura, S., Aburatani, H. & Tsuda, K. CellTree: an R/bioconductor package to infer the hierarchical structure of cell populations from single-cell RNA-seq data. BMC Bioinformatics 17, 363 (2016).
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Browaeys, R., Saelens, W. & Saeys, Y. NicheNet: modeling intercellular communication by linking ligands to target genes. Nat. Methods 17, 159–162 (2019).
Spits, C. et al. Whole-genome multiple displacement amplification from single cells. Nat. Protoc. 1, 1965–1970 (2006).
Zong, C., Lu, S., Chapman, A. R. & Xie, X. S. Genome-wide detection of single-nucleotide and copy-number variations of a single human cell. Science 338, 1622–1626 (2012).
Telenius, H. et al. Degenerate oligonucleotide-primed PCR: general amplification of target DNA by a single degenerate primer. Genomics 13, 718–725 (1992).
Dong, X. et al. Accurate identification of single-nucleotide variants in whole-genome-amplified single cells. Nat. Methods 14, 491–493 (2017).
Zafar, H., Wang, Y., Nakhleh, L., Navin, N. & Chen, K. Monovar: single-nucleotide variant detection in single cells. Nat. Methods 13, 505–507 (2016).
Bohrson, C. L. et al. Linked-read analysis identifies mutations in single-cell DNA-sequencing data. Nat. Genet. 51, 749–754 (2019).
Hård, J. et al. Conbase: a software for unsupervised discovery of clonal somatic mutations in single cells through read phasing. Genome Biol. 20, 68 (2019).
Zhang, L. et al. Single-cell whole-genome sequencing reveals the functional landscape of somatic mutations in B lymphocytes across the human lifespan. Proc. Natl Acad. Sci. USA 116, 9014–9019 (2019).
Lodato, M. A. & Walsh, C. A. Genome aging: somatic mutation in the brain links age-related decline with disease and nominates pathogenic mechanisms. Hum. Mol. Genet. 28, R197–R206 (2019).
Lodato, M. A. et al. Aging and neurodegeneration are associated with increased mutations in single human neurons. Science 359, 555–559 (2018).
Bae, T. et al. Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis. Science 359, 550–555 (2018).
Navin, N. et al. Tumour evolution inferred by single-cell sequencing. Nature 472, 90–95 (2011).
Wang, Y. et al. Clonal evolution in breast cancer revealed by single nucleus genome sequencing. Nature 512, 155–160 (2014).
Gao, R. et al. Punctuated copy number evolution and clonal stasis in triple-negative breast cancer. Nat. Genet. 48, 1119–1130 (2016).
Casasent, A. K. et al. Multiclonal invasion in breast tumors identified by topographic single cell sequencing. Cell 172, 205–217.e12 (2018).
Kim, C. et al. Chemoresistance evolution in triple-negative breast cancer delineated by single-cell sequencing. Cell 173, 879–893.e13 (2018).
McMahon, C. M. et al. Clonal selection with RAS pathway activation mediates secondary clinical resistance to selective FLT3 inhibition in acute myeloid leukemia. Cancer Discov. 9, 1050–1063 (2019).
Smallwood, S. A. et al. Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity. Nat. Methods 11, 817–820 (2014).
Guo, H. et al. Single-cell methylome landscapes of mouse embryonic stem cells and early embryos analyzed using reduced representation bisulfite sequencing. Genome Res. 23, 2126–2135 (2013).
Rotem, A. et al. Single-cell ChIP-seq reveals cell subpopulations defined by chromatin state. Nat. Biotechnol. 33, 1165–1172 (2015).
Grosselin, K. et al. High-throughput single-cell ChIP-seq identifies heterogeneity of chromatin states in breast cancer. Nat. Genet. 51, 1060–1066 (2019).
Kaya-Okur, H. S. et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat. Commun. 10, 1930 (2019).
Buenrostro, J. D. et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 486–490 (2015).
Corces, M. R. et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat. Genet. 48, 1193–1203 (2016).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902.e21 (2019).
Welch, J. D. et al. Single-cell multi-omic integration compares and contrasts features of brain cell identity. Cell 177, 1873–1887.e17 (2019).
Jin, S., Zhang, L. & Nie, Q. scAI: an unsupervised approach for the integrative analysis of parallel single-cell transcriptomic and epigenomic profiles. Genome Biol. 21, 25 (2020).
Lake, B. B. et al. Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brain. Nat. Biotechnol. 36, 70–80 (2018).
Granja, J. M. et al. Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia. Nat. Biotechnol. 37, 1458–1465 (2019).
Zhu, C., Preissl, S. & Ren, B. Single-cell multimodal omics: the power of many. Nat. Methods 17, 11–14 (2020).
Macaulay, I. C. et al. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nat. Methods 12, 519–522 (2015).
Dey, S. S., Kester, L., Spanjaard, B., Bienko, M. & Van Oudenaarden, A. Integrated genome and transcriptome sequencing of the same cell. Nat. Biotechnol. 33, 285–289 (2015).
Rooijers, K. et al. Simultaneous quantification of protein-DNA contacts and transcriptomes in single cells. Nat. Biotechnol. 37, 766–772 (2019).
Satpathy, A. T. et al. Transcript-indexed ATAC-seq for precision immune profiling. Nat. Med. 24, 580–590 (2018).
Chen, S., Lake, B. B. & Zhang, K. High-throughput sequencing of the transcriptome and chromatin accessibility in the same cell. Nat. Biotechnol. 37, 1452–1457 (2019).
Zhu, C. et al. An ultra high-throughput method for single-cell joint analysis of open chromatin and transcriptome. Nat. Struct. Mol. Biol. 26, 1063–1070 (2019).
Liu, L. et al. Deconvolution of single-cell multi-omics layers reveals regulatory heterogeneity. Nat. Commun. 10, 470 (2019).
Stoeckius, M. et al. Simultaneous epitope and transcriptome measurement in single cells. Nat. Methods 14, 865–868 (2017).
Peterson, V. M. et al. Multiplexed quantification of proteins and transcripts in single cells. Nat. Biotechnol. 35, 936–939 (2017).
Rodriques, S. G. et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
Mooijman, D., Dey, S. S., Boisset, J.-C., Crosetto, N. & van Oudenaarden, A. Single-cell 5hmC sequencing reveals chromosome-wide cell-to-cell variability and enables lineage reconstruction. Nat. Biotechnol. 34, 852–856 (2016).
Ku, W. L. et al. Single-cell chromatin immunocleavage sequencing (scChIC-seq) to profile histone modification. Nat. Methods 16, 323–325 (2019).
Nagano, T. et al. Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature 502, 59–64 (2013).
Angermueller, C. et al. Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity. Nat. Methods 13, 229–232 (2016).
The publication of this review was supported by JSPS KAKENHI 16H06279 and 17H06306.
Conflict of interest
The authors declare that they have no conflict of interest.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Kashima, Y., Sakamoto, Y., Kaneko, K. et al. Single-cell sequencing techniques from individual to multiomics analyses. Exp Mol Med 52, 1419–1427 (2020). https://doi.org/10.1038/s12276-020-00499-2
Genome Biology (2022)
Co-varying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
Nature Biotechnology (2022)
Nature Reviews Neuroscience (2022)
Clinical & Experimental Metastasis (2022)
Clinical Epigenetics (2021)