Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas

Hou, Yu; Guo, Huahu; Cao, Chen; Li, Xianlong; Hu, Boqiang; Zhu, Ping; Wu, Xinglong; Wen, Lu; Tang, Fuchou; Huang, Yanyi; Peng, Jirun

doi:10.1038/cr.2016.23

Download PDF

Original Article
Open access
Published: 23 February 2016

Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas

Yu Hou¹^na1,
Huahu Guo^2,3,4^na1,
Chen Cao¹,
Xianlong Li¹,
Boqiang Hu¹,
Ping Zhu^1,6,
Xinglong Wu^1,6,
Lu Wen¹,
Fuchou Tang^1,5,6,7,
Yanyi Huang^1,6,8 &
…
Jirun Peng^2,3,4

Cell Research volume 26, pages 304–319 (2016)Cite this article

38k Accesses
382 Citations
70 Altmetric
Metrics details

Subjects

Abstract

Single-cell genome, DNA methylome, and transcriptome sequencing methods have been separately developed. However, to accurately analyze the mechanism by which transcriptome, genome and DNA methylome regulate each other, these omic methods need to be performed in the same single cell. Here we demonstrate a single-cell triple omics sequencing technique, scTrio-seq, that can be used to simultaneously analyze the genomic copy-number variations (CNVs), DNA methylome, and transcriptome of an individual mammalian cell. We show that large-scale CNVs cause proportional changes in RNA expression of genes within the gained or lost genomic regions, whereas these CNVs generally do not affect DNA methylation in these regions. Furthermore, we applied scTrio-seq to 25 single cancer cells derived from a human hepatocellular carcinoma tissue sample. We identified two subpopulations within these cells based on CNVs, DNA methylome, or transcriptome of individual cells. Our work offers a new avenue of dissecting the complex contribution of genomic and epigenomic heterogeneities to the transcriptomic heterogeneity within a population of cells.

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Srinivas Niranj Chandrasekaran, Beth A. Cimini, … Anne E. Carpenter

Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis

Article Open access 25 March 2024

Wenpin Hou & Zhicheng Ji

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Qiuyue Yuan & Zhana Duren

Introduction

The development of single-cell genome, DNA methylome, and transcriptome sequencing technologies in recent years has greatly aided dissection of the heterogeneity within a population of cells^1,2. We and others have developed single-cell RNA-seq methods, such as scRNA-seq, Smart-seq/Smart-seq2, CEL-seq, MARS-seq, STRT-seq, and Quartz-seq^3,4,5,6,7,8, and applied these techniques to analyze gene expression dynamics during mammalian embryonic development or tumor heterogeneity^{9,10,11,12,13}. Single-cell genome-sequencing technologies have been used to reveal recombination patterns and aneuploidies in single human germ cells^14,15, and genomic heterogeneities in tumors and circulating tumor cells^16,17,18. Recently, we and others have developed single-cell DNA methylome sequencing techniques, such as single-cell reduced representation bisulfite sequencing (scRRBS) and single-cell bisulfite sequencing (scBS)^19,20. We have applied scRRBS in analyzing DNA methylome dynamics during mammalian early embryonic development²¹. Combined genome and transcriptome analyses of a single cell based on either microarray or next-generation sequencing have also been successfully used to analyze tumor heterogeneity^22,23,24,25. However, to directly analyze the mechanisms by which genetic and epigenetic factors regulate gene expression in an individual cell, the genome, epigenome, and transcriptome need to be simultaneously analyzed in a single cell. This approach is especially desirable for cancer, which displays strong heterogeneity in all of these three omics^26,27,28.

Here we report the development of a single-cell triple omics sequencing technique, single-cell triple omics sequencing (scTrio-seq), and the application of this technique for analyzing the relationship between the genome (copy-number variations, CNVs), DNA methylome, and transcriptome of a single mammalian cell. We demonstrate that CNVs can be reliably identified using single-cell RRBS data produced from the scTrio-seq assay. We observed a negative correlation between promoter methylation and RNA expression, and a positive correlation between gene body methylation and RNA expression, in a single cell. Furthermore, a strong positive correlation between the DNA copy number and gene expression within the affected genomic region was found. In contrast, the DNA copy number does not affect DNA methylation level of the region. Finally, we used scTrio-seq to analyze 25 single cells derived from a human hepatocellular carcinoma (HCC) tissue sample and found two subpopulations distinct in DNA copy numbers, DNA methylation, or RNA expression levels. By comparing the multi-omic differences between two HCC subpopulations, we found that the subpopulation I, accounting for a minor part in tumor tissues, harbored more copy-gain CNVs, expressed more invasive cell markers, and were more likely to evade immune surveillance.

Results

Development of the scTrio-seq method

First, we developed a mild lysis protocol with which we only lysed the cytoplasm of an individual cell to release the mRNAs into the solution while keeping the nucleus intact. We next centrifuged the lysis product to separate the mRNA-containing supernatant from the nucleus-containing precipitate, and each was transferred to a different tube. The supernatant was subjected to the scRNA-seq method we previously developed²⁹, whereas the precipitate was subjected to DNA methylome sequencing using the scRRBS method we recently developed¹⁹. This approach simultaneously yielded genomic (in term of CNVs), DNA methylomic, and transcriptomic information from the same cell. We have named this new method the scTrio-seq technique (Figure 1A).

To test the method, we sequenced six single HepG2 cells (a human hepatoblastoma-derived cell line) and six mouse embryonic stem cells (mESCs) using scTrio-seq; we also subjected HepG2 cells and mESCs to scRNA-seq and scRRBS as technique controls. The DNA methylome data obtained from scTrio-seq yielded an average of 1.5 million CpG sites from a single HepG2 cell and 0.8 million CpG sites from a single mESC cell, which is comparable to the detection efficiency of the standard scRRBS method (Table 1 and Supplementary information, Table S1). Compared with scRRBS data, the scTrio-seq technique did not result in a significant loss of DNA segments, even at a resolution of 1 kb (Figure 1B). This result demonstrates that the physical separation of the nucleus from the supernatant can retain all the chromosomes of an individual cell. Moreover, the DNA methylation levels of individual HepG2 cells analyzed by scTrio-seq and those analyzed using standard scRRBS technique were also comparable (Figure 1C and Supplementary information, Figure S1A). We were able to detect the hypomethylation valleys around transcription start sites (TSSs) as well as hypermethylation patterns of the gene bodies at the single cell level (Figure 1D and 1E, Supplementary information, Figure S1B). These results indicate that the measurement of the DNA methylome by scTrio-seq is as accurate as that obtained by the standard scRRBS approach.

Table 1 Number of the detected CpG sites, genes, and MspI-digested fragments in single HepG2 cells

Full size table

We next tested the ability of scTrio-seq to accurately measure the gene expression pattern. We found that scTrio-seq detected an average of 6 179 genes in a single HepG2 cell. When we merged the transcriptome sequencing data from only six individual cells analyzed by scTrio-seq, a total of 10 390 genes were detected; this detection efficiency is comparable to that of a standard RNA-seq for bulk cells (Supplementary information, Figure S1C and S1D). The correlation between the scTrio-seq data and the standard scRNA-seq data was quite high (Pearson correlation coefficient = 0.96; Supplementary information, Figure S1E). To confirm the accuracy of quantitative gene detection using scTrio-seq, we quantified the relative gene expression using real-time quantitative PCR (qPCR) as previously described³⁰. The results showed that scTrio-seq quantified the expression of genes highly accurately when only half of the single-cell lysis product was used for cDNA amplification (Supplementary information, Figure S1F).

CNV deduction using scTrio-seq

Next, we attempted to identify global CNV patterns using the standard RRBS data of bulk cells, which could cover 0.57 million unique DNA fragments after MspI digestion at CCGG......CCGG. This corresponds to ∼1 900 unique MspI-digested DNA fragments per 10-Mb bin that can support the copy-number measurement. We observed strong correlations between the sequencing depth and GC content, especially the number of MspI-digested DNA fragments in each 10-Mb bin (Supplementary information, Figure S2A). We also performed both RRBS and whole-genome sequencing of the normal human liver tissues as a normal control. The same CNV patterns were observed in both the HepG2 whole-genome sequencing data and in the bulk RRBS data at a resolution of 10 Mb, consistent with that of the published SNP array data (Cancer Cell Line Encyclopedia)³¹ and whole-genome bisulfite sequencing data³² (accession number SRX332734; Figure 1F). We also calculated the sensitivity and specificity at different resolutions using the whole-genome sequencing data as a standard reference (Supplementary information, Figure S2B and S2C).

As a single mammalian cell has only two copies of genomic DNA, it is not feasible to simultaneously perform single-cell whole-genome DNA amplification and bisulfite conversion of the genome. Because more than one hundred thousand MspI-digested DNA fragments were recovered in the DNA methylome data of scTrio-seq (Table 1), we tested the ability to measure CNVs using the RRBS data of scTrio-seq in an individual cell. We also observed strong correlations between the sequencing depth and the number of MspI-digested DNA fragments for each 10-Mb bin in the scTrio-seq and scRRBS data (Supplementary information, Figure S3A and S3B). After normalization by the normal liver tissue, scTrio-seq can also be used to accurately deduce almost all the CNVs in a single HepG2 cell at a 10-Mb resolution (Figure 1F).

We verified the diploid nature of our mESC cell line by normalizing it with the published normal diploid mESC data (accession number SRX673789, Figure 1F and Supplementary information, Figure S3C). Using the bulk mESC cell data as a control, we observed a 95.3% specificity and 98% sensitivity of CNV deductions at a 10-Mb resolution using scTrio-seq data (Supplementary information, Figure S3D). Furthermore, we also fitted the normalized copy numbers to integer values for scTrio-seq data and scRRBS data using a hidden Markov model (HMM)^16,33. As the RRBS reads are not uniformly distributed in the genome, we tried to raise the resolution of CNV deduction of scTrio-seq by focusing on the highly covered genomic regions, and found that shorter CNV segments and more accurate breakpoints of CNVs can be identified (Supplementary information, Figure S3E). Together, these results demonstrate that the scTrio-seq technique can simultaneously and accurately analyze the genome (CNVs), DNA methylome, and transcriptome in a single cell.

Application of scTrio-seq to explore the relationship between the genome (CNVs), DNA methylome, and transcriptome in a single cell

Epigenetic modifications are important for regulating chromatin status and gene expression. They are potentially heterogeneous within a cell population, especially in cancer³⁴. Many studies have indicated that DNA methylation in the promoter regions often negatively correlates with gene expression, whereas DNA methylation in the gene body positively correlates with gene expressions³⁵. We also observed same correlations in bulk cell data (Supplementary information, Figure S4A). We next used scTrio-seq data to explore the relationship between methylation and gene expression in individual cells, and found a similar negative correlation between promoter DNA methylation and the expression level of the corresponding gene in each HepG2 cell (Figure 2 and Supplementary information, Figure S4B). This correlation indicates that high DNA methylation in promoters may repress the expression of corresponding genes within a HepG2 cell. Moreover, DNA methylation in the gene body (excluding promoter regions) showed a positive correlation with gene expression (Figure 2). Furthermore, this positive correlation increased when moving to the 3′-end of gene body, indicating that gene body DNA methylation may promote the transcription of these genes. To our knowledge, this is the first global demonstration of relationship between DNA methylation and RNA expression in single cells.

The CNV patterns in the scTrio-seq data from six HepG2 cells showed that these six cells shared most of their CNVs. However, some CNVs were unique to only one of the samples (Figure 3A). For example, four copies of chromosome 2 were identified in sample scTrio-HepG2-#6, but only three copies of this region were identified in each of the other five single cells. Furthermore, we calculated the relative expression levels within each 10-Mb window of scTrio-seq data by normalizing the data with the RNA-seq data from normal liver tissues (Figure 3B). We compared the CNV patterns with the RNA expression patterns and found that expression of genes within the genomic regions with extra copies also increased proportionally. Similarly, the expression of genes within genomic regions with lost copies proportionally decreased (Figure 3A-3C). We observed a Pearson correlation coefficient of 0.68 ± 0.07 (mean ± SD) between the digital DNA copy number values and the gene expression levels in the same cell at a 10-Mb resolution, which is consistent with the bulk cell data (Figure 3D and Supplementary information, Figures S4C and S5A). These results indicate that CNVs contribute to the changes of gene expression by changing the copy numbers and dosages of genes within these genomic regions.

In contrast, the DNA methylation level within the genomic region that gained or lost copies showed no alteration (Figure 3C). The correlation coefficient between the digital DNA copy number values and DNA methylation levels was 0.05 ± 0.02 (mean ± SD) for single cells (Figure 3D and Supplementary information, Figure S5B). For the bulk HepG2 data, there was no correlations between DNA copy numbers and DNA methylation levels even at a 0.5-Mb resolution (Supplementary information, Figure S4C), indicating that large-scale CNV patterns generally did not directly affect the DNA methylation levels within the corresponding genomic regions. Therefore, at the single-cell resolution, CNVs affect gene expression mainly by dosage effect but do not markedly change the DNA methylation patterns of the corresponding genes.

Application of scTrio-seq to explore the genome (CNVs), DNA methylome, and transcriptome relationships in HCC

Single-cell analyses have provided new insights into the evolution, therapeutic responses, and drug resistance of cancer^16,36. Single-cell genome and transcriptome sequencing analyses have accelerated studies of tumor heterogeneity³⁷. Changes in DNA methylation also have a critical role in tumorigenesis^38,39. Global DNA hypermethylation or hypomethylation has been observed in many types of cancers, and drugs that regulate DNA methylation, such as 5-azacitidine and decitabine, have been used in cancer therapy^40,41. However, the heterogeneity of DNA methylation in tumors in vivo has not been well characterized over the entire genome at single-cell resolution, and the relationships between the genome, epigenome, and transcriptome in single cancer cells have not been directly elucidated.

The RRBS data obtained from bulk HCC cells indicated global hypomethylation compared with the adjacent normal liver cells (Supplementary information, Figure S6), which is consistent with results from previous studies^42,43. We next analyzed 26 single cells isolated from a HCC sample from one patient using the scTrio-seq technique. As expected, these HCC cells showed global hypomethylation patterns (Figure 4A and 4B), except for one cell (HCC-sc#26; Supplementary information, Figure S7). Unlike the other 25 cells, this cell lacked significant aneuploidies (Supplementary information, Figure S8), indicating that it was likely to be noncancerous cell. After excluding this cell, we focused on the remaining 25 cancer cells in further analyses.

As observed in HepG2 cells, the DNA copy number and expression profile also showed strong correlations in HCC cells, with a Pearson correlation coefficient of 0.73 ± 0.04 (mean ± SD) between the digital copy-number values and the gene expression levels in individual HCC cells. However, the DNA copy number did not significantly affect the DNA methylation at the 10-Mb scale (Pearson correlations, 0.025 ± 0.035; Supplementary information, Figure S7C).

Differences in triple-omics between two subpopulations of HCC cells

We then performed an unsupervised hierarchical clustering analysis of these 25 hepatocellular carcinoma cells based on their CNV patterns, and this separated these cells into two subpopulations. All the 25 HCC cells harbored extra copies of Chr. 7 and the q arm of Chr. 1; these extra copies were also detected in several previously analyzed HCC samples⁴⁴. Furthermore, subpopulation I harbored several unique CNVs including gained copies of Chr. 8, Chr. 11 and Chr. 20. Conversely, subpopulation II lost copies of Chr. 4 and Chr.16 (Figure 4C and Supplementary information, Figures S8 and S9A). We also identified similar patterns and obtained similar clustering results using RNA expression values of the genes within each 10-Mb window (Figure 4D). At a 10-Mb resolution, a comparison of the CNV patterns between subpopulation I and subpopulation II defined 164 10-Mb bins with CNV differences between the two subpopulations as “differential CNV regions” and 158 bins without CNV differences as “shared CNV regions”.

Next, we examined the heterogeneity in global DNA methylation level among these 25 cancer cells. We found that cells of the same subpopulation (I or II) had higher correlations compared with the normal liver cells, while there was noticeable heterogeneity among two subpopulations (Figure 4E). We then performed unsupervised clustering analysis for these cells based on methylation level of all detected CpG sites. Notably, this analysis also separated these cells into two subpopulations exactly identical to those identified by the CNV patterns (Figure 5A and Supplementary information, Figure S9B). Of these two subpopulations, subpopulation I displayed slightly higher level of global DNA methylation than subpopulation II, but the differentially methylated regions (DMRs) were not associated with the regions of different CNV (Supplementary information, Figure S9C and S9D).

To analyze the DMRs between and within two HCC subpopulations, we calculated the methylation level and variance with a 3-kb sliding window across all the 25 HCC cells²⁰. After ranking the windows with their DNA methylation variances among the 25 HCC cells, we found the top variable windows were significantly enriched in the CpG island (CGI) region (Fisher's exact test, FDR = 2 × 10⁻¹⁵; Supplementary information, Figure S10A). Moreover, the CGI region also had higher DNA methylation variances within each subpopulation. A total of 140 and 200 out of the 300 most variable windows were located in CGI regions in subpopulation I and subpopulation II, respectively (Supplementary information, Figure S10B and S10C). We next compared the DNA methylation level of each CGI and identified the CGIs with significant methylation level difference (difference > 0.3; Fisher's exact test P value < 0.05) as differentially methylated CGIs (dmCGIs) between the two HCC subpopulations. We found 69 CGIs were hypermethylated in subpopulation I, and 33 were hypermethylated in subpopulation II (Figure 5B).

To analyze gene expression differences between the two subpopulations in HCC, we performed a principal component analysis using the gene expression data from 25 HCC cells and this again notably distinguished two cell subpopulations consistent with the results from the analyses on CNV patterns and DNA methylation data (Figure 5C). Subpopulation I expressed significantly higher levels of 245 genes and significantly lower levels of 350 genes than subpopulation II (FDR < 0.05; Figure 5D). The 245 genes with higher expression levels in subpopulation I were not significantly enriched in Gene Ontology (GO) terms. Interestingly, the 350 genes expressed significantly lower in subpopulation I were clearly enriched in several GO terms, such as acute inflammatory response, innate immune response, and complement activation, as well as complement and coagulation cascades in the KEGG pathway analysis (Figure 5E and Supplementary information, Figure S11). Complement activation has been considered a biomarker of many tumors⁴⁵, and the protein AIM has been identified as the complement activator that initiates HCC necrotic death^46,47. Thus, the data suggest that the cells in subpopulation I are less responsive to the immune recognition than those in subpopulation II.

Although DNA copy-number difference between two HCC subpopulations may contribute to differential RNA expression on a large scale of genomic region, the expression of individual genes is still regulated by DNA methylation in DMRs in a context-dependent manner. For example, both ANO1 and S100A11 have been reported to have important roles in tumorigenesis and cancer metastasis^{48,49,50,51,52}. We found that for HCC cells in subpopulation I, both ANO1 and S100A11 had lower DNA methylation levels. However, the hypomethylation occurred in the gene body of ANO1, whereas in S100A11 it was the promoter that was hypomethylated, and the expression level of ANO1 in these HCC cells is suppressed, whereas the expression of S100A11 is elevated in subpopulation I (Figure 6A and 6B).

Taken together, these results indicate that the DNA copy number, DNA methylome, and transcriptome significantly differ between subpopulations I and II. The differential CNV regions, DMRs, and differentially expressed genes regulated each other. HCC cells in subpopulation I, which harbor more copy-gain CNVs, are likely to escape the immune recognition and are more invasive compared with the cells in subpopulation II. It also should be noted that these single cells account for a minor part in the tumor tissue, and thus their unique genomic, epigenomic, and transcriptomic characters will be concealed in bulk analysis.

Discussion

Cancer development and metastasis involve various aspects of genomic alternations, including but not limited to changes of genomic DNA, epigenetic modifications, gene expression, and complex interplays between them. The intrinsically strong intratumoral heterogeneity makes it difficult to define an accurate regulatory relationship among genome, epigenome, and transcriptome using bulk cells. Well-established single-cell methods were typically optimized to examine only one aspect of regulatory hierarchy, hence losing the possibility to probe the inter-omics regulations at the single-cell level. Recently reported single-cell dual-omics sequencing methods (e.g., DR-seq and G&T-seq)^24,25 can depict regulatory relationships between genome and transcriptome, but are incapable to provide epigenetic information (methylome especially), which is critical to RNA expression regulation.

In this study, we have developed a novel method called scTrio-seq that can, for the first time, simultaneously acquire genome (CNVs), DNA methylome, and transcriptome information of the single cells. We have shown that this method can accurately deduce CNV patterns at a 10-Mb resolution, obtain the methylation patterns of 1.5 million CpG sites, and detect the expression levels of 6 179 genes on average in a single mammalian cell. Correlations between genomic (CNVs), methylomic, and transcriptomic data have also been analyzed in the same individual cells for the first time. Changes in gene dosages due to CNVs proportionally affect the RNA expression levels of the corresponding regions, whereas they do not significantly affect the DNA methylation levels in these regions.

In scTrio-seq, we physically separate the DNA and RNA molecules before amplifications and primer binding, eliminating the possible genomic DNA cross-contamination in scRNA-seq. Moreover, our mild lysis condition and separation procedure are compatible with conventional single-cell methods. For example, the DNA in the lysate can be used for single-cell whole-genome amplification, scRRBS or scBS analysis, while the RNA can be processed using Smart-seq or CEL-seq pipelines in parallel. Admittedly, to avoid disturbing the nuclear DNA precipitate, some RNA-containing supernatant is left in the tube after separation, leading to a slight loss of RNA transcripts. However, we found that using half of the lysate did not compromise the whole-transcriptome analysis. Further improvement may be achieved by increasing compactness of DNA pellet to optimize the separation of DNA and RNA.

Using scTrio-seq, we can detect subpopulations of cancer cells according to the genome (CNVs) information, and infer malignancy and metastasis potentials of the subpopulations based on triple-omic information. Moreover, we are also able to explore the relationships between differential CNV regions, differentially expressed genes, and DMRs. After filtering out the differences between subpopulations, we can unveil the heterogeneity existing within each subpopulation. Our work paves the way for deciphering the heterogeneity and complexity of cell populations in development and cancer by simultaneously interrogating the genome, methylome, and transcriptome of their constituents at the single-cell level.

Materials and Methods

Cancer sample collection and single-cell isolation

This study was approved by the Ethics Committee of Beijing Shijitan Hospital, Capital Medical University. Surgically removed HCC specimens were collected from a 51-year-old male patient who had provided written informed consent. All the clinicopathologic results of specimen are accordant with hepatocellular carcinoma. The pathological report shows that the tumor has extensive degeneration and necrosis, the surrounding tissue of the tumor has nodular cirrhosis, and the pathological features are accordant with hepatitis B-associated cirrhosis. The IHC result of the specimen is AFP (±), GPC3 (−), ki-67 (−), and CD34 (+). The tissues were mechanically dissociated into small pieces on ice and then digested into an HCC cell suspension using a Tumor Dissociation Kit (Miltenyi Biotec 130-095-929); a part of the cancer samples was retained for bulk genome, transcriptome, and methylome sequencing. Three normal liver cells were obtained from the adjacent normal tissue of another HCC patient. The cell viability of digested HCC cells were tested with Propidium iodide (PI) staining. Among the digested HCC cells, 76.5% were PI negative (live cells) and CD45 negative (non-leukocyte cells) analyzed by FACS. The HepG2 cells were cultured in RPMI 1640 medium (Corning, 10-040-CVR) containing 10% fetal bovine serum under 5% CO₂ at 37 °C. Before the single-cell study, HepG2 cells were digested with 0.5% trypsin into a single-cell suspension and picked with a mouth pipette.

Purification of HCC cells by MACS

The digested HCC cell suspension was passed through a 70-μm strainer (BD Biosciences) and then passed through a 40-μm strainer (BD Biosciences) to obtain a single-cell suspension. The HCC cells were purified by magnetic-activated cell sorting (MACS) using MS columns in MACS buffer (2 mM EDTA, 0.5% BSA in PBS). Red blood cells and inflammatory infiltrate cells were depleted using CD45 and CD71 MACS beads (Miltenyi Biotec) and MS columns (Mitenyi Biotec).

Single-cell segregation for DNA methylome and transcriptome sequencing

Single cells were individually transferred into 200-μl tubes using a mouth pipette. The single cells were lysed in 7 μl of soft buffer (500 mM KCl, 100 mM Tris-HCl (pH = 8.3), 1.35 mM MgCl₂, 4.5 mM DTT, 0.45% Nonidet P-40 (Roche, 11332473001), 0.18 U SUPERase-In (Applied Biosystems, AM2694), and 0.36 U RNase-inhibitor (Applied Biosystems, AM2682) for 30 min at 4 °C, and then the lysate was vortexed for 1 min at room temperature. All RNAs were released, whereas the nucleus remained intact. The lysed single cell was then centrifuged at 1 000× g for 5 min to leave the nucleus at the bottom of the tube. The 4 μl of lysis product supernatant was carefully removed and added to another 200-μl tube containing spike-in RNA (ERCC, Ambion) and reverse transcriptase. This fraction was used for transcriptome analyses, whereas the remaining 3 μl of lysis solution (containing the nucleus) was used for genome (CNVs) and methylome analyses. The upper 4 μl of lysis solution was reverse-transcribed with poly T primers, and the cDNA was amplified as previously described²⁹. Protease was added to the bottom 3 μl lysis solution, and the DNA was added with 60 fg of unmethylated lambda DNA (Fermentas). The released naked DNA was then digested and bisulfite-converted using the scRRBS method¹⁹.

Sample quality control and library construction for sequencing

The cDNA amplicons of each single cell were quantified with qPCR of two housekeeping genes (GAPDH and ACTB). Amplified single-cell cDNA was purified with the DNA Clean & Concentrator 5 Kit (VisTech, HLLCTech, DC2005). The amplification primers were removed by selecting 500-3 000 bp cDNA products on a 2% agarose gel; the product was recovered from the gel using the VisClean Gel DNA Recovery kit (VisTech, HLLCTech, PC0313). The purified cDNAs were then sonicated with a Covaris S2 system to generate 150-250 bp fragments. The cDNA libraries were barcoded and amplified using NEBNext Ultra DNA Prep Kit for Illumina (New England Biolabs, E7370). Single-cell RRBS libraries were constructed according to previously published protocols¹⁹, and two genomic loci were checked in RRBS libraries with qPCR before the high-throughput sequencing to ensure that DNA was present. Only the libraries in which the two genomic loci were detected were sequenced. Bulk cDNA libraries and RRBS libraries were constructed according to previously published protocols⁵³. All constructed libraries were used for 100-bp pair-end high-throughput sequencing on an Illumina HiSeq2000 or HiSeq 2500 Sequencer. The qPCR primers for checking the bisulfite-converted DNA were

Chr3_Forward: GTTAGGGAAGAGTTGGTTAGAG

Chr3_Reverse: TCTAAAACCAAATCTAAATCCTAAA

Chr17_Forward: GGTTTTTGGTGAGTTTTTTTT

Chr17_Reverse: AACCTACACAAACCCAAAAT

For the HepG2 and mESC cells, we picked 10 cells from each cell line. All 20 cells showed high RNA quality and DNA quality in qPCR quality control experiments. Then 6 out of the 10 cells of each cell line were sequenced using scTrio-seq technique, 2 cells were sequenced using scRNA-seq and 2 cells were sequenced using scRRBS. For the digested HCC cells, we picked 37 single cells, among which 9 cells showed low quality of cDNA and 2 cells showed low DNA quality in qPCR quality control experiments. We thus sequenced and analyzed the remaining 26 (70.3%) HCC single cells.

Sequencing quality control and data processing

Single-cell RNA seq data The raw sequencing reads were trimmed to remove low-quality read ends, library construction adapters, and amplification primers. The clean reads were aligned to the human genome (hg19) or the mouse genome (mm9) with Tophat and the gene expression levels were calculated with Cufflinks⁵⁴. Mapped reads, mapped ratio, and detected gene numbers are shown in the Supplementary information, Table S2. The number of detected RefSeq gene and NONCODE^55,56 gene were calculated separately for each single-cell RNA-seq data, bulk cell data, as well as the published RNA-seq data of HepG2 cell line (ENCODE, ENCLB257SKY)⁵⁷.

Single-cell RRBS data The raw sequencing reads were trimmed to remove low-quality read ends and library construction adapters. We then aligned the trimmed reads to human or mouse C-T (G-A) genomes with the Bismark software⁵⁸. The bisulfite conversion rate for each sample, which is shown in the Supplementary information, Table S1, was calculated using a spike-in of unmethylated lambda DNA. The methylation level of each CpG site was then calculated by counting the methylated reads and unmethylated reads. Only the CpG sites with a depth of = 3 and a methylation level of ≥ 0.9 or ≤ 0.1 were used for further analyses of single-cell RRBS sequencing data.

Calculation of the correlation between gene expression and DNA methylation at the single-cell level

The gene body was defined as the region from TSS to TES of each gene. Considering that the promoter regions and CGI regions in gene bod may influence the DNA methylation calculation of gene body, we excluded the promoter regions (from TSS to 2 000-bp downstream of TSS) and the CGI regions from each gene body region. The promoter region of each gene was defined as the region from 1 000-bp upstream to 500-bp downstream of the TSS. Only the genes with more than five CpG sites detected in the gene body region or gene promoter region were used to analyze the relationship with gene expression, and each CpG site used for analysis was required to be sequenced at depth of = 3. The DNA methylation level in the gene body or promoter region was calculated based on the mean methylation level of detected CpGs of each region. The gene expression level was the FPKM value calculated with Cufflinks program. The genes were then arranged according to their expression levels. The Pearson correlation coefficients (r) between gene body methylation or promoter methylation and the corresponding gene expression level (log₂ (FPKM + 1)) were calculated as previously described²¹.

Unique mappable MspI-digested fragments of RRBS data

We searched the reference genome (hg19 or mm9) for all possible MspI-digested fragments (CCGG......CCGG) except for the ones from random chromosomes. We then generated a simulated paired-end RRBS data using the sequences from two ends of these MspI-digested fragments. The simulated RRBS reads were mapped to the reference genome in the same manner as the experimental data were. We discarded the alignments that yielded multiple hits or that could have been mismatched by reads from elsewhere. After filtering, we defined 627 448 unique mapped fragments from 727 620 candidates in the human hg19 reference genome, and 339 101 unique fragments from 427 854 candidates in the mouse mm9 reference genome. Unique fragment counts in each genomic bin were calculated using BEDTools⁵⁹.

Correlations between DNA methylation and RNA expression

The gene body region (from TSS to TES) of each gene was divided into 20 equal fractions and the 15-kb upstream (or downstream) flanking regions were divided into five fractions. The mean DNA methylation level of CpG sites in each fraction of each gene was computed by Pearson correlation analysis with the corresponding genes. The genes with FPKM < 0.001 were reset to 0.001, and then the relative gene expressions (log₂ (FPKM + 1)) were used for correlation analysis with DNA methylation levels of different genomic regions. The correlation of adult liver cells in Supplementary information, Figure S6C was calculated using the published whole-genome bisulfite sequencing data of adult liver cells (GSM916049) and the published RNA-seq data of human liver (accession number: ERX011229).

CNV deduction with whole-genome sequencing data or RRBS data

Samtools depth was used to count the depth of each position across the genome. The total sequence depth of each window was counted, and then normalized using the total depth of each sample. The windows with low mappability such as centromere and telomere were not included in our analyzed windows. Because the systematic coverage bias in the RRBS data is too much to allow the deduction of copy numbers, we then normalized the sequence depth of each window by dividing it by the normalization factor. The normalization factor of each window was calculated by averaging the depth value of normal liver bulk RRBS data. The normalized copy number from each window was then used to cluster the human HCC cells with average-linkage hierarchical clustering.

To ensure that the mouse mESCs we used were normal diploid cells, we deduced the CNVs of bulk whole-genome sequencing data by normalizing it with the published normal diploid mESC data (accession number: SRX673789). For the single-cell RRBS data, the normalized copy numbers were further fitted to integer copy-number values using the hidden Markov model (HMM) as described for CNV deductions of circulating tumor cells¹⁶. The integer copy-number values were then used to calculate the Pearson correlation with gene expression and DNA methylation.

CNV deduction with RNA-seq data

Approximately 6 000 genes, whose average relative expression level (calculated as log₂ (FPKM + 1)) exceeded 1.5 across all single-cell samples, were used to measure CNVs according to a previous published method¹¹. The CNV value for each gene was defined as the mean expression level (FPKM) of the 100 genes around the gene (50 upstream genes and 50 downstream genes). The CNV values were then centered to zero by subtracting the average CNV value for each cell¹¹. Furthermore, the relative CNV value of a given 10-Mb window was calculated by averaging the values of all the genes within the window. We performed a bulk RNA-seq analysis of liver tissues near the HCC tissue and also obtained normal liver bulk RNA-seq data from NCBI as the normal reference (accession number: ERX011229). Single-cell data sets were then normalized to the normal reference, and hierarchical clustering was performed to discriminate between samples based on the severity of copy-number abnormality.

Sensitivity and specificity of CNV deductions

We used the whole-genome sequencing data of bulk HepG2 cells as a standard reference for the bulk HepG2 RRBS data. We then calculated the sensitivity and specificity of CNV deductions at different resolution levels (from 0.1 to 10 Mb) for bulk RRBS data. We assessed the specificity and sensitivity of CNV deduction in scTrio-seq data using a normal diploid cell line (mouse mESCs). The normalized copy-number value of each window was expected to be within the range of (1.5-2.5) for autosomes in specificity calculation, and (0.5-1.5) for X chromosome in sensitivity calculation.

DNA methylation variance among single cells

We estimated the cell-to-cell variance with a 3 000-bp window as described in a previously published study²⁰. First, the mean methylation rate of each window in each cell was calculated, and the reciprocal of the SEM for each sample was set as the weight value for calculating the variance among different cells. The lower bound of the chi-squared confidence interval of the variance estimator with a confidence level of 0.95 was then used to calculate the variances in each genome element. The variable windows were then ranked with their variable values. Distribution enrichment of each genomic element in the top 300 variable windows were calculated and the significance was checked using Fisher's exact test.

Identification of dmCGIs between two HCC subpopulations

For the following analysis, we selected definitively methylated CpG (mCG) sites or unmethylated (umCG) sites that were covered at least three times in a sample. We counted the mCG and umCG in each CGI and determined its methylation level by calculating the ratio between mCG and total CpGs. Only the CGIs that had at least five CpGs detected in a single-cell sample were considered as qualified. We then added the number of mCG and umCG sites in a CGI across samples in the same subpopulation if a CGI is qualified in 3 out of 7 cells in subpopulation I and in 5 out of 18 cells in subpopulation II. Sites that were differentially methylated at a significance level of 0.05 as determined by the Fisher's exact test and had a minimum methylation difference of 0.3 between two subpopulations were considered dmCGIs.

Data access

All sequencing data have been submitted to the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE65364.

Author Contributions

YuH, FT, YanH and JP designed the study, interpreted data and wrote the manuscript; YuH, HG, XL, XW and LW performed experiments and analyzed data; the bioinformatics analysis was performed by YuH, XL, CC, BH and PZ.

Competing Financial Interests

The authors declare no conflict of interest.

Accession codes

Accessions

GenBank/EMBL/DDBJ

Gene Expression Omnibus

Swiss-Prot

ERX011229

References

Wen L, Tang F . Reconstructing complex tissues from single-cell analyses. Cell 2014; 157:771–773.
Article CAS Google Scholar
Shapiro E, Biezuner T, Linnarsson S . Single-cell sequencing-based technologies will revolutionize whole-organism science. Nat Rev Genet 2013; 14:618–630.
Article CAS Google Scholar
Ramskold D, Luo S, Wang YC, et al. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat Biotechnol 2012; 30:777–782.
Article Google Scholar
Hashimshony T, Wagner F, Sher N, Yanai I . CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Rep 2012; 2:666–673.
Article CAS Google Scholar
Jaitin DA, Kenigsberg E, Keren-Shaul H, et al. Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types. Science 2014; 343:776–779.
Article CAS Google Scholar
Tang F, Barbacioru C, Wang Y, et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat Methods 2009; 6:377–382.
Article CAS Google Scholar
Sasagawa Y, Nikaido I, Hayashi T, et al. Quartz-Seq: a highly reproducible and sensitive single-cell RNA sequencing method, reveals non-genetic gene-expression heterogeneity. Genome Biol 2013; 14:R31.
Article Google Scholar
Islam S, Kjallquist U, Moliner A, et al. Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq. Genome Res 2011; 21:1160–1167.
Article CAS Google Scholar
Yan L, Yang M, Guo H, et al. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat Struct Mol Biol 2013; 20:1131–1139.
Article CAS Google Scholar
Dalerba P, Kalisky T, Sahoo D, et al. Single-cell dissection of transcriptional heterogeneity in human colon tumors. Nat Biotechnol 2011; 29:1120–1127.
Article CAS Google Scholar
Patel AP, Tirosh I, Trombetta JJ, et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science 2014; 344:1396–1401.
Article CAS Google Scholar
Treutlein B, Brownfield DG, Wu AR, et al. Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seq. Nature 2014; 509:371–375.
Article CAS Google Scholar
Durruthy-Durruthy R, Gottlieb A, Hartman BH, et al. Reconstruction of the mouse otocyst and early neuroblast lineage at single-cell resolution. Cell 2014; 157:964–978.
Article CAS Google Scholar
Hou Y, Fan W, Yan L, et al. Genome analyses of single human oocytes. Cell 2013; 155:1492–1506.
Article CAS Google Scholar
Lu S, Zong C, Fan W, et al. Probing meiotic recombination and aneuploidy of single sperm cells by whole-genome sequencing. Science 2012; 338:1627–1630.
Article CAS Google Scholar
Ni X, Zhuo M, Su Z, et al. Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients. Proc Natl Acad Sci USA 2013; 110:21083–21088.
Article CAS Google Scholar
Beroukhim R, Getz G, Nghiemphu L, et al. Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma. Proc Natl Acad Sci USA 2007; 104:20007–20012.
Article CAS Google Scholar
Beroukhim R, Mermel CH, Porter D, et al. The landscape of somatic copy-number alteration across human cancers. Nature 2010; 463:899–905.
Article CAS Google Scholar
Guo H, Zhu P, Wu X, et al. Single-cell methylome landscapes of mouse embryonic stem cells and early embryos analyzed using reduced representation bisulfite sequencing. Genome Res 2013; 23:2126–2135.
Article CAS Google Scholar
Smallwood SA, Lee HJ, Angermueller C, et al. Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity. Nat Methods 2014; 11:817–820.
Article CAS Google Scholar
Guo H, Zhu P, Yan L, et al. The DNA methylation landscape of human early embryos. Nature 2014; 511:606–610.
Article CAS Google Scholar
Klein CA, Seidl S, Petat-Dutter K, et al. Combined transcriptome and genome analysis of single micrometastatic cells. Nat Biotechnol 2002; 20:387–392.
Article CAS Google Scholar
Guzvic M, Braun B, Ganzer R, et al. Combined genome and transcriptome analysis of single disseminated cancer cells from bone marrow of prostate cancer patients reveals unexpected transcriptomes. Cancer Res 2014; 74:7383–7394.
Article CAS Google Scholar
Dey SS, Kester L, Spanjaard B, Bienko M, van Oudenaarden A . Integrated genome and transcriptome sequencing of the same cell. Nat Biotechnol 2015; 33:285–289.
Article CAS Google Scholar
Macaulay IC, Haerty W, Kumar P, Li YI . G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nat Methods 2015; 12:519–522.
Article CAS Google Scholar
Garraway LA, Lander ES . Lessons from the cancer genome. Cell 2013; 153:17–37.
Article CAS Google Scholar
Swanton C . Intratumor heterogeneity: evolution through space and time. Cancer Res 2012; 72:4875–4882.
Article CAS Google Scholar
Marusyk A, Almendro V, Polyak K . Intra-tumour heterogeneity: a looking glass for cancer? Nat Rev Cancer 2012; 12:323–334.
Article CAS Google Scholar
Tang F, Barbacioru C, Nordman E, et al. RNA-Seq analysis to capture the transcriptome landscape of a single cell. Nat Protoc 2010; 5:516–535.
Article CAS Google Scholar
Janes KA, Wang CC, Holmberg KJ, Cabral K, Brugge JS . Identifying single-cell molecular programs by stochastic profiling. Nat methods 2010; 7:311–317.
Article CAS Google Scholar
Barretina J, Caponigro G, Stransky N, et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 2012; 483:603–607.
Article CAS Google Scholar
Ziller MJ, Gu H, Muller F, et al. Charting a dynamic DNA methylation landscape of the human genome. Nature 2013; 500:477–481.
Article CAS Google Scholar
Zong C, Lu S, Chapman AR, Xie XS . Genome-wide detection of single-nucleotide and copy-number variations of a single human cell. Science 2012; 338:1622–1626.
Article CAS Google Scholar
Jaenisch R, Bird A . Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 2003; 33 Suppl:245–254.
Article Google Scholar
Smith ZD, Meissner A . DNA methylation: roles in mammalian development. Nat Rev Genet 2013; 14:204–220.
Article CAS Google Scholar
Navin N, Kendall J, Troge J, et al. Tumour evolution inferred by single-cell sequencing. Nature 2011; 472:90–94.
Article CAS Google Scholar
Kreso A, O'Brien CA, van Galen P, et al. Variable clonal repopulation dynamics influence chemotherapy response in colorectal cancer. Science 2013; 339:543–548.
Article CAS Google Scholar
Rodriguez-Paredes M, Esteller M . Cancer epigenetics reaches mainstream oncology. Nat Med 2011; 17:330–339.
Article CAS Google Scholar
Gal-Yam EN, Saito Y, Egger G, Jones PA . Cancer epigenetics: modifications, screening, and therapy. Ann Rev Med 2008; 59:267–280.
Article CAS Google Scholar
Dawson MA, Kouzarides T . Cancer epigenetics: from mechanism to therapy. Cell 2012; 150:12–27.
Article CAS Google Scholar
Suva ML, Riggi N, Bernstein BE . Epigenetic reprogramming in cancer. Science 2013; 339:1567–1570.
Article CAS Google Scholar
Torano EG, Petrus S, Fernandez AF, Fraga MF . Global DNA hypomethylation in cancer: review of validated methods and clinical significance. Clin Chem Lab Med 2012; 50:1733–1742.
Article CAS Google Scholar
Hernandez-Vargas H, Lambert MP, Le Calvez-Kelm F, et al. Hepatocellular carcinoma displays distinct DNA methylation signatures with potential as clinical predictors. PloS One 2010; 5:e9749.
Article Google Scholar
Xu H, Zhu X, Xu Z, et al. Non-invasive analysis of genomic copy number variation in patients with hepatocellular carcinoma by next generation DNA sequencing. J Cancer 2015; 6:247–253.
Article CAS Google Scholar
Ajona D, Pajares MJ, Corrales L, et al. Investigation of complement activation product c4d as a diagnostic and prognostic biomarker for lung cancer. J Natl Cancer Inst 2013; 105:1385–1393.
Article CAS Google Scholar
Maurer AJ, Bonney PA, Toho LC, et al. Tumor necrosis-initiated complement activation stimulates proliferation of medulloblastoma cells. Inflamm Res 2015; 4:185–192.
Article Google Scholar
Maehara N, Arai S, Mori M, et al. Circulating AIM prevents hepatocellular carcinoma through complement activation. Cell Rep 2014; 9:61–74.
Article CAS Google Scholar
Qu Z, Yao W, Yao R, et al. The Ca(2+) -activated Cl(-) channel, ANO1 (TMEM16A), is a double-edged sword in cell proliferation and tumorigenesis. Cancer Med 2014; 3:453–461.
Article CAS Google Scholar
Jia L, Liu W, Guan L, Lu M, Wang K . Inhibition of calcium-activated chloride channel ANO1/TMEM16A suppresses tumor growth and invasion in human lung cancer. PloS One 2015; 10:e0136584.
Article Google Scholar
Sui Y, Sun M, Wu F, et al. Inhibition of TMEM16A expression suppresses growth and invasion in human colorectal cancer cells. PloS Oon 2014; 9:e115443.
Article Google Scholar
Shiwarski DJ, Shao C, Bill A, et al. To “grow” or “go”: TMEM16A expression as a switch between tumor growth and metastasis in SCCHN. Clin Cancer Res 2014; 20:4673–4688.
Article CAS Google Scholar
Jaiswal JK, Lauritzen SP, Scheffer L, et al. S100A11 is required for efficient plasma membrane repair and survival of invasive cancer cells. Nat Commun 2014; 5:3795.
Article CAS Google Scholar
Gu H, Smith ZD, Bock C, et al. Preparation of reduced representation bisulfite sequencing libraries for genome-scale DNA methylation profiling. Nat Protoc 2011; 6:468–481.
Article CAS Google Scholar
Trapnell C, Roberts A, Goff L, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc 2012; 7:562–578.
Article CAS Google Scholar
Xie C, Yuan J, Li H, et al. NONCODEv4: exploring the world of long non-coding RNA genes. Nucleic Acids Res 2014; 42:D98–D103.
Article CAS Google Scholar
Zhao Y, Li H, Fang S, et al. NONCODE 2016: an informative and valuable data source of long non-coding RNAs. Nucleic Acids Res 2015.
Gerstein MB, Rozowsky J, Yan KK, et al. Comparative analysis of the transcriptome across distant species. Nature 2014; 512:445–448.
Article CAS Google Scholar
Krueger F, Andrews SR . Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 2011; 27:1571–1572.
Article CAS Google Scholar
Quinlan AR, Hall IM . BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010; 26:841–842.
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge the staff of the BIOPIC sequencing facility at Peking University for their assistance. This project was supported by grants from the National Natural Science Foundation of China (31322037, 81521002, 21327808, 81561138005, 81372604, and 31271543) and grants from the Ministry of Sciences and Technology of China (2012CB966704).

Author information

Yu Hou and Huahu Guo: These two authors contributed equally to this work.

Authors and Affiliations

Biodynamic Optical Imaging Center, College of Life Sciences, Peking University, 100871, Beijing, China
Yu Hou, Chen Cao, Xianlong Li, Boqiang Hu, Ping Zhu, Xinglong Wu, Lu Wen, Fuchou Tang & Yanyi Huang
Department of Surgery, Beijing Shijitan Hospital, Capital Medical University, 100038, Beijing, China
Huahu Guo & Jirun Peng
Ninth School of Clinical Medicine, Peking University, 100038, Beijing, China
Huahu Guo & Jirun Peng
School of Oncology, Capital Medical University, 100038, Beijing, China
Huahu Guo & Jirun Peng
Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Peking University, 100871, Beijing, China
Fuchou Tang
Peking-Tsinghua Center for Life Science, 100084, Beijing, China
Ping Zhu, Xinglong Wu, Fuchou Tang & Yanyi Huang
Center for Molecular and Translational Medicine (CMTM), 100101, Beijing, China
Fuchou Tang
College of Engineering, Peking University, 100871, Beijing, China
Yanyi Huang

Authors

Yu Hou
View author publications
You can also search for this author in PubMed Google Scholar
Huahu Guo
View author publications
You can also search for this author in PubMed Google Scholar
Chen Cao
View author publications
You can also search for this author in PubMed Google Scholar
Xianlong Li
View author publications
You can also search for this author in PubMed Google Scholar
Boqiang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Ping Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Xinglong Wu
View author publications
You can also search for this author in PubMed Google Scholar
Lu Wen
View author publications
You can also search for this author in PubMed Google Scholar
Fuchou Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yanyi Huang
View author publications
You can also search for this author in PubMed Google Scholar
Jirun Peng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Fuchou Tang, Yanyi Huang or Jirun Peng.

Additional information

( Supplementary information is linked to the online version of the paper on the Cell Research website.)

Supplementary information

Supplementary information, Table S1

Sequencing information of DNA methylome data. (PDF 43 kb)

Supplementary information, Table S2

Sequencing information of transcriptome data. (PDF 33 kb)

Supplementary information, Figure S1

Sensitivity and reliability of DNA methylome and transcriptome analysis in scTrio-seq data. (PDF 576 kb)

Supplementary information, Figure S2

CNV deduction using bulk RRBS data of HepG2 cells (PDF 3639 kb)

Supplementary information, Figure S3

CNV deduction using scRRBS data and scTrio-seq data (PDF 5170 kb)

Supplementary information, Figure S4

Correlations between DNA methylation, gene expression and DNA copy number in HepG2 cells. (PDF 892 kb)

Supplementary information, Figure S5

Correlations between DNA copy number and gene expression (or DNA methylation) in scTrio-seq data. (PDF 646 kb)

Supplementary information, Figure S6

DNA methylome differences between HCC bulk cells and liver bulk cells. (PDF 523 kb)

Supplementary information, Figure S7

DNA methylome of single HCC cells. (PDF 540 kb)

Supplementary information, Figure S8

Copy number variations of HCC cells. (PDF 4174 kb)

Supplementary information, Figure S9

Differences between subpopulation I and subpopulation II HCC cells. (PDF 564 kb)

Supplementary information, Figure S10

DNA methylome heterogeneity among 25 HCC cells. (PDF 751 kb)

Supplementary information, Figure S11

Subpopulation I HCC cells lack complement and coagulation cascades pathway. (PDF 1018 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 Unported License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Hou, Y., Guo, H., Cao, C. et al. Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas. Cell Res 26, 304–319 (2016). https://doi.org/10.1038/cr.2016.23

Download citation

Received: 24 September 2015
Revised: 05 January 2016
Accepted: 28 January 2016
Published: 23 February 2016
Issue Date: March 2016
DOI: https://doi.org/10.1038/cr.2016.23

Keywords

This article is cited by

GBAP1 functions as a tumor promotor in hepatocellular carcinoma via the PI3K/AKT pathway
- Rong Chen
- Meng Zhao
- Qiusha Tang
BMC Cancer (2023)
Methylomics and cancer: the current state of methylation profiling and marker development for clinical care
- Chengyin Liu
- Han Tang
- Tianbao Li
Cancer Cell International (2023)
Downregulation of KEAP1 in melanoma promotes resistance to immune checkpoint blockade
- Douglas B. Fox
- Richard Y. Ebright
- Daniel A. Haber
npj Precision Oncology (2023)
HBV genome-enriched single cell sequencing revealed heterogeneity in HBV-driven hepatocellular carcinoma (HCC)
- Wenhui Wang
- Yan Chen
- Jun Zhu
BMC Medical Genomics (2022)
Systematic evaluation of colorectal cancer organoid system by single-cell RNA-Seq analysis
- Rui Wang
- Yunuo Mao
- Fuchou Tang
Genome Biology (2022)