Single-cell RNA sequencing of human kidney

Abstract

A comprehensive cellular anatomy of normal human kidney is crucial to address the cellular origins of renal disease and renal cancer. Some kidney diseases may be cell type-specific, especially renal tubular cells. To investigate the classification and transcriptomic information of the human kidney, we rapidly obtained a single-cell suspension of the kidney and conducted single-cell RNA sequencing (scRNA-seq). Here, we present the scRNA-seq data of 23,366 high-quality cells from the kidneys of three human donors. In this dataset, we show 10 clusters of normal human renal cells. Due to the high quality of single-cell transcriptomic information, proximal tubule (PT) cells were classified into three subtypes and collecting ducts cells into two subtypes. Collectively, our data provide a reliable reference for studies on renal cell biology and kidney disease.

Measurement(s) transcriptome • RNA
Technology Type(s) RNA sequencing
Factor Type(s) type of kidney cell
Sample Characteristic - Organism Homo sapiens

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.11365796

Background & Summary

The kidney is a highly complex organ with many different functions, and consists of several functionally and anatomically discrete segments1. The glomerulus and renal tubules are important components of the nephron. The functional complexity of these structures appears to be associated with different cell types. Along with the glomerular endothelial cells, podocytes synthesize the glomerular basement membrane, which is the final filtration barrier, representing an important seal that prevents the loss of proteins into the urine2. Parietal epithelial cells (PECs) are another common glomerular cell type that might contribute to glomerulosclerosis, crescent and pseudocrescent formation3. The proximal tubule (PT) plays an important role in regulating systemic acid-base balance by controlling Na+-H+ and HCO3 transport, while the distal convoluted tubule is more involved in electrolyte transport4,5,6. In previous studies, researchers have performed bulk RNA sequencing of different components of the kidney, providing a reference for understanding the transcriptome of different segments7,8,9. However, bulk RNA sequencing cannot reflect the transcriptome at the single-cell level, but only the overall average RNA expression.

With the development of next-generation sequencing, high-throughput single-cell analysis10 and the Human Cell Atlas11, scRNA-seq of the kidney became feasible. Park, J et al.1 characterised 57,979 cells from healthy mouse kidneys using unbiased single-cell RNA sequencing and revealed potential cellular targets of kidney disease. Later, Young, M. D et al.12 studied 72,501 single-cell transcriptomes of human renal carcinomas and normal tissue from foetal, paediatric and adult kidneys. These studies provided transcriptome maps of mouse and human kidneys. At the same time, it was reported that some renal diseases may be cell type-specific. For example, chronic kidney disease (CKD) is associated with PT cells13. Thus, PT cells have attracted extensive attention. However, the number of human PT cells obtained in the abovementioned studies was relatively small, i.e. approximately 1,000 high-quality PT cells which highly expressed marker genes. It is difficult to classify a subpopulation of PT cells. Moreover, in previous studies1,13 that considered kidney disease associated with specific cell types, results were obtained based on mouse kidney transcriptome data.

To address this problem, we set out to obtain a single-cell suspension of the human kidney and performed scRNA-seq with a high throughput droplet-mediated scRNA-seq platform (10x Genomics Chromium14, Fig. 1a). We obtained a single-cell transcriptome dataset of 23,366 high-quality human kidney cells from three donors (kidneys 1, 2 and 3), including 20,308 PT cells. Considering the important role of PT cells in renal disease, this considerable individual cell transcriptome information may validate previously reported susceptibility genes for kidney disease. In addition, monogenic disease genes and complex trait genes identified by genome-wide association study (GWAS) may be associated with precise cell types. With the unbiased classification of cells, we can discover new genes with specific expression in some cell types. Taken together, the generated data provide more abundant transcriptomic information on renal tubular cells, representing an important reference for the accurate classification of renal tubular cells and the study of the relationship between renal tubular cells and diseases.

Fig. 1
figure1

scRNA-seq reveals the cell populations of the human kidney. (a) Overview of the scRNA-seq process using human kidney tissue samples. (b) Uniform manifold approximation and projection (UMAP) plot showing the unbiased classification of renal cells. (c) Pie chart showed the proportion of each kidney cell type. (d) Heat map showing the marker genes of each cluster, highlighting the selected marker genes for each cluster.

Methods

We present an overview of the kidney scRNA-seq method. The whole process included the acquisition of human kidney tissue, the preparation of a single-cell suspension and 10x Genomics sample processing (Fig. 1a).

Ethical approval

We received approval from the Institution Review Board (IRB) from the First Affiliated Hospital of Guangxi Medical University, and signed informed consent was obtained from all patients.

Human kidney tissue procurement and isolation

Fresh human kidney samples (Supplementary Table S1) were collected at the First Affiliated Hospital of Guangxi Medical University and Affiliated Tumour Hospital of Guangxi Medical University. Two out of the three samples were obtained from patients undergoing radical nephrectomy, and the remaining sample was from a patient undergoing radical nephroureterectomy. Normal kidney tissues were obtained at least 2 cm away from tumour tissue.

First, fresh samples were taken from the operating room and placed in a solution containing Hank’s balanced salt solution (HBSS; WISENT, 311-512-CL) and 1% Antibiotic-Antimycotic (Gibco, 15240062) on ice, which was transported to the laboratory within 20 minutes. Then, 0.5–1 g full-thickness sections of kidney tissue were cut lengthwise using surgical scissors. Subsequently, the tissue was washed twice with cold Dulbecco’s phosphate-buffered saline (DPBS; WISENT, 311-425-CL). We placed the tissue on a stainless steel cell filter, crushed the tissue with the plunger of a syringe and washed it with DPBS. We added flushing fluid to a centrifuge tube, and collected the kidney fragments into the centrifuge tube, which was then spun at 350 g for 5 min at 4 °C; we then repeated this step. After discarding the supernatant, we used TrypLE™ Express Enzyme (1X, Gibco, 12605010) to further digest the sticky clumps of cells for 5–10 min at 37 °C, then terminated digestion using Dulbecco’s modified eagle medium (DMEM; WISENT, 319-006-CL) containing 10% fatal bovine serun (FBS; Gibco, 10099141). The digested cells were centrifuged at 350 g for 5 min at 4 °C. After discarding the supernatant, the cells were resuspended in 5 ml of DPBS and filtered through a 100 μm cell strainer. Next, we removed red blood cells using 1X RBC lysis buffer (10X diluted to 1X,BioLegend, B250015) for 5 min and centrifuged the cells at 300 g for 5 min at 4 °C. After discarding the supernatant, the cells were suspended in DPBS and centrifuged again. After discarding the supernatant, the cells were resuspended in cold DPBS and passed through a 40 μm cell strainer. Live cells were counted using trypan blue (0.4%, Gibco, 420301) staining. If the cell viability was above 80%, we perform 10x Genomics sample processing.

10x Genomics sample processing and cDNA library preparation

The 10x Genomics Chromium Single Cell 3′ Reagents Kit v2 user guide (https://support.10xgenomics.com/single-cell-gene-expression/index/doc/user-guide-chromium-single-cell-3-reagent-kits-user-guide-v2-chemistry) was used to prepare the single cell suspension. The single cell samples were passed through a 40 μm cell strainer and counted using a haemocytometer with trypan blue. Then, the appropriate volume of each sample was diluted to recover 10,000 kidney cells. Subsequently, the single cell suspension, Gel Beads and oils were added to the 10x Genomics single-cell A chip. We checked that there were no errors before running the assay. After droplet generation, samples were transferred into PCR tubes and we performed reverse transcription using a T100 Thermal Cycler (Bio-Rad). After reverse transcription, cDNA was recovered using a recovery agent, provided by 10x Genomics, followed by silane DynaBead clean-up as outlined in the user guide. Before clean-up using SPRIselect beads, we amplified the cDNA for 10 cycles. The cDNA concentration was detected by a Qubit2.0 fluorometer (Invitrogen). The kidney cDNA libraries were prepared referring to the Chromium Single Cell 3′ Reagent Kit v2 user guide.

Single-cell RNA-seq details and preliminary results

Samples were sequenced by Hiseq Xten (Illumina, San Diego, CA, USA) with the following run parameters: read 1 for 150 cycles, read 2 for 150 cycles, index for 14 cycles. Preliminary sequencing results (bcl files) were converted to FASTQ files with CellRanger (version 3.0, https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/what-is-cell-ranger). We followed the 10x Genomics standard seq protocol by trimming the barcode and unique molecular identifier (UMI) end to 26 bp, and the mRNA end to 98 bp. Then, the FASTQ files were aligned to the human genome reference sequence GRCh38. Subsequently, we applied CellRanger for preliminary data analysis and generated a file that contained a barcode table, a gene table and a gene expression matrix. We carried out preliminary quality control (QC) on the FASTQ files to ensure high quality scRNA-seq data. We also made a comparison between three different methods (Cell Ranger V2.1 or 2.2 with 150 bp × 2, Cell Ranger V3.0 with 150 bp × 2, Cell Ranger V3.0 with trimming the FASTQ data to 26 bp × 98 bp). We found that more single cells were actually identified using Cellranger V3.0 compared with Cellranger V2.0 or 2.1 (Tables 1 and 2). At the same time, we obtained some basic information about sequencing by a website, such as the number of cells, the median number of detected genes, sequencing saturation and sequencing depth (Table 2). The strategy of using CellRanger V3.0 and trimming the FASTQ data to 26 bp × 98 bp was used to pre-process the scRNA-seq data and perform downstream analysis.

Table 1 Detailed QC of FASTQ files.
Table 2 Information and sequencing statistics of three samples and comparison between different sequencing strategies. NA, after pre-processing the data, some of them didn’t use for downstream analysis.

Using Seurat for quality control (QC) and data second analysis after mitigating the batch effect

We used the R (version 3.5.2, https://www.r-project.org/) and Seurat15,16 R package (version 3.1, https://satijalab.org/seurat/). We used the MergeSeurat function to merged the three kidney data sets. The filter criteria of cells were determined after reference to previous studies1,12. According to the median number of genes and the percentage of mitochondrial genes in the kidney samples (Fig. 2a), cells with <200 and >2,500 genes (potential cell duplets) and a mitochondrial gene percentage of >30% were filtered. After QC, 23,366 high quality kidney cells were obtained. The relationship between the percentage of mitochondrial genes and the mRNA reads were detected and visualised, together with the relationship between the number of mRNAs and the reads of mRNA (Fig. 2b).

Fig. 2
figure2

Quality control (QC) of human kidney single cell data. (a) Scatterplot illustrating the number of genes, unique molecular identifiers (UMIs) and the percentage of mitochondrial genes in each cell of three kidney samples. (b) The relationship between the percentage of mitochondrial genes and the mRNA reads, together with the relationship between the amount of mRNA and the reads of mRNA. (c) We detected the batch effect between three different kidney samples. (d) UMAP plot showing the cell cycle status of each cell. (e) Violin plot illustrating the number of genes, unique molecular identifiers (UMIs) and the percentage of mitochondrial genes in previous kidney single-cell data from GSE1075851.

After data normalisation, all highly variable genes in single cells were identified after controlling for the relationship between average expression and dispersion. All variable genes (n = 16471) were used in the downstream analysis, i.e. principal-component analysis (PCA).

Since this data came from three different samples, in order to avoid the batch effect affecting downstream analysis, we adopted a strategy of mitigating the batch effect, called ‘Harmony’. The R package Harmony focuses on scalable integration of scRNA-seq data for batch correction and meta analysis17 (version 0.99.9, https://github.com/immunogenomics/Harmony). To ensure the reliability of Harmony, we also applied another approach based on mutual nearest neighbours (MNN)18, which was implemented in the R package scran19 (the “fastMNN” function). We found that these two methods of eliminating batch effects produced similar results (Fig. 2c, Supplementary Fig. S1a). However, Harmony may be slightly better than fastMNN for our data. Harmony identified collecting duct intercalated cells with the same reduced resolution, but fastMNN could not (Fig. 1b, Supplementary Fig. S1b,c). Therefore, we used Harmony to eliminate batch effects and continued downstream analysis.

Subsequently, we used PCA with variable genes as the input and identified significant principal components (PCs) based on the jackStraw function. Twenty PCs were selected as the input for uniform manifold approximation and projection (UMAP) and t-distributed stochastic neighbour embedding (tSNE) when statistically significant. We detected the batch effect between three different kidney samples (Fig. 2c and Supplementary Fig. S2a). With a resolution of 0.25, cells were clustered by the FindClusters function and classified into 10 different cell types. Next, we used the FindAllMarkers function to find differentially expressed genes between each type of cell (Supplementary Table S2).

Cell cycle analysis

Cell cycle analysis was performed by using the Seurat program. We used a previously defined core set of 43 G1/S and 54 G2/M cell cycle genes20. Cells were classified by the maximal average expression (‘cycle score’) in these two gene sets. When the cycle scores of G1/S and G2/M were both less than 2, we considered these cells to be non-cycling. Otherwise, we considered cells to be proliferative. After cell cycle analysis, no bias induced by cell cycle genes was observed (Fig. 2d and Supplementary Fig. S2b).

Cell type markers

Cell type assignment was performed based on the marker genes reported in previous studies1,3,7,8,9,12,21,22,23 (Online-only Table 1).

Reconstructing PT cells differentiation trajectories by Monocle2

PT cells fate decisions and pseudotime trajectories were reconstructed by the Monocle224 R package (version 2.10.1, http://cole-trapnell-lab.github.io/monocle-release/). First of all, the three types of PT cells were selected by Seurat. The PT cell data, which included 20,308 PT cells, were imported into Monocle2. We used genes that were expressed in at least 10 cells and in greater than 5% of cells. Subsequently, we used thresholds on the cell local density (rho) and nearest distance (delta) to determine the number of clusters. Then, we performed differential gene expression analysis as before, but across all cell clusters. We used the top 1,000 most significantly differentially expressed genes as the set of ordering genes and performed dimension reduction and trajectory analysis. Once we established a trajectory, we used the differential GeneTest function to find genes that had an expression pattern that varied according to pseudotime.

Data Records

All kidney sequencing data have been uploaded to the NCBI GEO database. It is possible to access these data through the project accession number GSE13168525. These data include barcodes.tsv, features.tsv and gene expression matrix (*.mtx) files. The raw data in the bam files have been deposited in the NCBI Sequence Read Archive (SRA) and are accessible through the project accession number SRP19929426.

Technical Validation

Kidney specimens were collected fresh, dissected and digested into single cells from organ donors (two males and one female) aged 57–65 years (Supplementary Table S1, Methods). We used Seurat for QC, in which the number of genes, the number of UMI and the percentage of mitochondrial genes in each cell were calculated (Fig. 2a). We made a comparison with the previous kidney single-cell data from GSE1075851 on the number of genes detected per cell (Fig. 2e). We found that the median genes per cell were 941. This result was close to our scRNA-seq results. In general, the proportion of mitochondrial genes in the kidney cells was greater than in other organs, such as the liver, prostate, testis and peripheral blood mononuclear cell (PBMC)1,12,23,27,28,29. Since the proportion of mitochondrial genes reflects the state of cells, the exclusion criteria are controversial. Some researchers have suggested that kidney cells should discarded if their mitochondrial gene percentage is over 50%1, while other researchers remove any cells that have greater than 20% expression originating from mitochondrial genes12. In this study, we were conservative, in that cells with a mitochondrial gene percentage of >30% were filtered.

After QC, 23,366 high quality kidney cells were further analysed. We could identify 10 cell clusters that consisted of cells in the range of 79–11,539 cells per cluster (Fig. 1c). We visualised cell clustering using two different approaches (UMAP and tSNE), and the results were the same (Fig. 1b and Supplementary Fig. S2c). According to the marker genes (Online-only Table 1), we classified cells into clusters 1–10, corresponding to proximal convoluted tubule cells, proximal tubule cells, proximal straight tubule cells, NK-T cells, monocytes, glomerular parietal epithelial cells, distal tubule cells, collecting duct principal cells, B cells and collecting duct intercalated cells (Fig. 1d).

Our results show that PT cells were very abundant, with 20,308 PT cells. PT cells can be classified into three different clusters according to their markers, including the proximal convoluted tubule, proximal straight tubule and PT cells of no accurate classification (Figs. 1b and 3a, Online-only Table 1). Furthermore, we applied Monocle2 to perform pseudotime trajectories of all PT cells and showed the fate decisions between them (Fig. 3b–e). At the same time, we discovered the top six genes that influenced fate decisions (Fig. 3f). To show more detailed gene information, we presented the top 50 genes that affect fate decisions (Fig. 3g).

Fig. 3
figure3

Subpopulations of PT cells and reconstructing the developmental trajectory of PT cells. (a) Violin plots representing the expression of marker genes in PT cells. Clusters 1, 2 and 3 refer to the proximal convoluted tubule, proximal tubule and proximal straight tubule, respectively. (b) Monocle2-generated pseudotemporal trajectory of three PT cell types (n = 20,308); imported Seurat data are coloured according to the cell name designation. (c) Pseudotime was coloured in a gradient from dark to light blue, and the start of pseudotime is dark. (d) The pseudotime trajectory was divided into three different states by Monocle2. (e) The trajectory showing the distribution of cells from three samples. (f) The top six genes influencing fate decisions are shown as line plots displayed as the expression level over pseudotime by Monocle2. (g) Heat map for clustering the top 50 genes that affected cell fate decisions. These 50 genes were divided into three clusters (cluster 1, cluster 2 and cluster 3), showing genes at the beginning stage, the transitory stage and the end stage of the developmental trajectory, respectively.

Initially the collecting duct was described as having a role only in water reabsorption, while in recent years the understanding of the function of the collecting duct has become greatly enhanced and has led to a new model for how the distal segments of the kidney tubule integrate salt and water reabsorption, potassium homeostasis and acid-base status30. Interestingly, our data also provide transcriptomic information for collecting duct cells (Fig. 1b, cluster 8, 10). We classified collecting duct cells into principal cells (cluster 8) and intercalated cells (cluster 10), according to marker expression (Fig. 4a, Online-only Table 1). Given that the three ‘healthy’ kidney samples were collected from renal cancer patients, we had to confirm their universal representativeness. Previous studies on human kidney scRNA-seq12 have provided us with many marker genes for proximal, distal and collecting tubule cells. We found that almost all the genes for PT cells were highly expressed in our PT cells (Supplementary Fig. S3a). Most of these genes for distal and collecting tubules cells were expressed in our data (Supplementary Fig. S3b). Thus, we consider these results to be reliable.

Fig. 4
figure4

Detailed classification of collecting duct cells and NK-T cells by scRNA-seq. (a) Violin plot showing the expression of the collecting duct principal cell marker AQP2 and the collecting duct intercalated cell markers ATP6V1B1, ATP6V0D2 and ATP6V1G3. Clusters 8 and 10 are collecting duct principal cells and collecting duct intercalated cells, respectively. (b) UMAP plot showing the spatial location of NKT cells and T cells after dimensionality reduction. The red dots represent NKT cells and the green dots represent T cells. (cg) Violin plots showing the expression of the NK cells markers GNLY and NKG7. Violin plots showing the expression of the T cell markers CD3D, CD3E and IL7R.

Finally, we present a method for the detailed classification of cell subsets. Initially, the parameters of 20 PCs and 0.25 resolution were selected to identify 10 cell types (Fig. 1b). We found that cluster 4 highly expressed marker genes of both NK cells and T cells, designated as NK-T cells (Fig. 1d, Supplementary Table S2). Interestingly, cluster 4 can be further classified into two subtypes (Fig. 4b). By modifying the parameters to 20 PCs and 0.8 resolution, we could accurately distinguish NKT cells (CD3D+CD3E+GNLY+NKG7+) and T cells (CD3D+CD3E+IL7R+) (Fig. 4c–g), which can be used for downstream analysis.

Taken together, we provide a transcriptomic map of human kidney cells that will help us to study renal cell biology and the relationship between cell types and diseases.

Code availability

The R code used in the analysis of the scRNA-seq data is available on GitHub (https://github.com/lessonskit/Single-cell-RNA-sequencing-of-human-kidney). This R code is also available at figshare31.

References

  1. 1.

    Park, J. et al. Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease. Science (New York, N.Y.) 360, 758–763, https://doi.org/10.1126/science.aar2131 (2018).

    CAS  Article  Google Scholar 

  2. 2.

    Brunskill, E. W., Georgas, K., Rumballe, B., Little, M. H. & Potter, S. S. Defining the molecular character of the developing and adult kidney podocyte. PloS one 6, e24640, https://doi.org/10.1371/journal.pone.0024640 (2011).

    ADS  CAS  Article  PubMed  PubMed Central  Google Scholar 

  3. 3.

    Shankland, S. J., Smeets, B., Pippin, J. W. & Moeller, M. J. The emergence of the glomerular parietal epithelial cell. Nat. Rev. Nephrol. 10, 158–173, https://doi.org/10.1038/nrneph.2014.1 (2014).

    CAS  Article  PubMed  Google Scholar 

  4. 4.

    Aronson, P. S. Mechanisms of active H+ secretion in the proximal tubule. Am. J. Physiol. 245, F647–659, https://doi.org/10.1152/ajprenal.1983.245.6.F647 (1983).

    CAS  Article  PubMed  Google Scholar 

  5. 5.

    Guo, Y. M. et al. Na(+)/HCO3(−) Cotransporter NBCn2 Mediates HCO3(−) Reclamation in the Apical Membrane of Renal Proximal Tubules. J. Am. Soc. Nephrol. 28, 2409–2419, https://doi.org/10.1681/asn.2016080930 (2017).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    McCormick, J. A. & Ellison, D. H. Distal convoluted tubule. Compr. Physiol. 5, 45–98, https://doi.org/10.1002/cphy.c140002 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Chabardes-Garonne, D. et al. A panoramic view of gene expression in the human kidney. Proc. Natl. Acad. Sci. USA 100, 13710–13715, https://doi.org/10.1073/pnas.2234604100 (2003).

    ADS  CAS  Article  PubMed  Google Scholar 

  8. 8.

    Lee, J. W., Chou, C. L. & Knepper, M. A. Deep Sequencing in Microdissected Renal Tubules Identifies Nephron Segment-Specific Transcriptomes. J. Am. Soc. Nephrol. 26, 2669–2677, https://doi.org/10.1681/asn.2014111067 (2015).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  9. 9.

    Habuka, M. et al. The kidney transcriptome and proteome defined by transcriptomics and antibody-based profiling. PloS one 9, e116125, https://doi.org/10.1371/journal.pone.0116125 (2014).

    ADS  CAS  Article  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Macosko, E. Z. et al. Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets. Cell 161, 1202–1214, https://doi.org/10.1016/j.cell.2015.05.002 (2015).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  11. 11.

    Regev, A. et al. The Human Cell Atlas. eLife 6, https://doi.org/10.7554/eLife.27041 (2017).

  12. 12.

    Young, M. D. et al. Single-cell transcriptomes from human kidneys reveal the cellular identity of renal tumors. Science (New York, N.Y.) 361, 594–599, https://doi.org/10.1126/science.aat1699 (2018).

    ADS  CAS  Article  Google Scholar 

  13. 13.

    Qiu, C. et al. Renal compartment-specific genetic variation analyses identify new pathways in chronic kidney disease. Nat. Med. 24, 1721–1731, https://doi.org/10.1038/s41591-018-0194-4 (2018).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  14. 14.

    Zheng, G. X. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049, https://doi.org/10.1038/ncomms14049 (2017).

    ADS  CAS  Article  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Satija, R., Farrell, J. A., Gennert, D., Schier, A. F. & Regev, A. Spatial reconstruction of single-cell gene expression data. Nat. Biotechnol. 33, 495–502, https://doi.org/10.1038/nbt.3192 (2015).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Stuart, T. et al. Comprehensive Integration of Single-Cell Data. Cell 177, 1888–1902.e1821, https://doi.org/10.1016/j.cell.2019.05.031 (2019).

    CAS  Article  Google Scholar 

  17. 17.

    Korsunsky, I. et al. Fast, sensitive, and flexible integration of single cell data with Harmony. BioRxiv, https://doi.org/10.1101/461954 (2018).

  18. 18.

    Haghverdi, L., Lun, A. T. L., Morgan, M. D. & Marioni, J. C. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat. biotechnol. 36, 421–427, https://doi.org/10.1038/nbt.4091 (2018).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  19. 19.

    Lun, A. T. L., McCarthy, D. J. & Jc, M. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. F1000Research 5, 2122, https://doi.org/10.12688/f1000research.9501.2 (2016).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  20. 20.

    Tirosh, I. et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science (New York, N.Y.) 352, 189–196, https://doi.org/10.1126/science.aad0501 (2016).

    ADS  CAS  Article  Google Scholar 

  21. 21.

    Nakagawa, J. et al. Molecular characterization of mammalian dicarbonyl/L-xylulose reductase and its localization in kidney. J. Biol. Chem. 277, 17883–17891, https://doi.org/10.1074/jbc.M110703200 (2002).

    CAS  Article  PubMed  Google Scholar 

  22. 22.

    Schnapp, D., Reid, C. J. & Harris, A. Localization of expression of human beta defensin-1 in the pancreas and kidney. J. Pathol. 186, 99–103, doi: 10.1002/(sici)1096-9896(199809)186:1<99::aid-path133>3.0.co;2-# (1998).

  23. 23.

    Chen, J., Cheung, F., Shi, R., Zhou, H. & Lu, W. PBMC fixation and processing for Chromium single-cell RNA sequencing. J. Transl. Med. 16, 198, https://doi.org/10.1186/s12967-018-1578-4 (2018).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. biotechnol. 32, 381–386, https://doi.org/10.1038/nbt.2859 (2014).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  25. 25.

    Gene Expression Omnibus, https://identifiers.org/geo:GSE131685 (2019).

  26. 26.

    NCBI Sequence Read Archive, http://identifiers.org/ncbi/insdc.sra:SRP199294 (2019).

  27. 27.

    MacParland, S. A. et al. Single cell RNA sequencing of human liver reveals distinct intrahepatic macrophage populations. Nat. Commun. 9, 4383, https://doi.org/10.1038/s41467-018-06318-7 (2018).

    ADS  CAS  Article  PubMed  PubMed Central  Google Scholar 

  28. 28.

    Henry, G. H. et al. A Cellular Anatomy of the Normal Adult Human Prostate and Prostatic Urethra. Cell. Rep. 25, 3530–3542.e3535, https://doi.org/10.1016/j.celrep.2018.11.086 (2018).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Lukassen, S., Bosch, E., Ekici, A. B. & Winterpacht, A. Single-cell RNA sequencing of adult mouse testes. Sci. Data. 5, 180192, https://doi.org/10.1038/sdata.2018.192 (2018).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Roy, A., Al-bataineh, M. M. & Pastor-Soler, N. M. Collecting duct intercalated cell function and regulation. Clin. J. Am. Soc. Nephrol. 10, 305–324, https://doi.org/10.2215/cjn.08880914 (2015).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  31. 31.

    Liao, J. et al. Single-cell RNA sequencing of human kidney. figshare. https://doi.org/10.6084/m9.figshare.8131328.v2 (2019).

Download references

Acknowledgements

The authors thank the lab members for their helpful advices and technical assistance. This work was supported by grants from the National Natural Science Foundation of China (81770759), the National Natural Science Foundation of China (81370857), National Key R&D Program of China (2017YFC0908000), Guangxi Natural Science Fund for Innovation Research Team (2013GXNSFFA019002).

Author information

Affiliations

Authors

Contributions

J. Liao. performed RNA-seq experiments, made cDNA library and wrote the paper; Z.Y. performed single-cell RNA-seq analyses, made figures, and wrote the paper; Y.C. wrote the paper; M.B. and C.Z. dissected human kidney tissues, performed RNA-seq experiments; J.C., D.L., T.L. and Q.Z. provided and dissected human kidney tissues; H.Z discussed the draft paper, and critically reviewed the manuscript; J. Li. offered bioinformatics help. J.C. and Z.M. conceived of and supervised the project, analyzed data, made figures, and wrote the paper; and all authors read and commented on the manuscript.

Corresponding authors

Correspondence to Jiwen Cheng or Zengnan Mo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Online-only Table

Online-only Table 1 Cell type assignment based on the marker genes reported in previous studies.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Liao, J., Yu, Z., Chen, Y. et al. Single-cell RNA sequencing of human kidney. Sci Data 7, 4 (2020). https://doi.org/10.1038/s41597-019-0351-8

Download citation

Further reading