Chromatin accessibility identifies diversity in mesenchymal stem cells from different tissue origins

Ho, Yen-Ting; Shimbo, Takashi; Wijaya, Edward; Ouchi, Yuya; Takaki, Eiichi; Yamamoto, Ryoma; Kikuchi, Yasushi; Kaneda, Yasufumi; Tamai, Katsuto

doi:10.1038/s41598-018-36057-0

Download PDF

Article
Open access
Published: 10 December 2018

Chromatin accessibility identifies diversity in mesenchymal stem cells from different tissue origins

Yen-Ting Ho¹^na1,
Takashi Shimbo¹^na1,
Edward Wijaya ORCID: orcid.org/0000-0002-1234-0761^1,2,
Yuya Ouchi^1,2,
Eiichi Takaki^1,2,
Ryoma Yamamoto^1,2,
Yasushi Kikuchi¹,
Yasufumi Kaneda³ &
…
Katsuto Tamai¹

Scientific Reports volume 8, Article number: 17765 (2018) Cite this article

5664 Accesses
25 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Mesenchymal stem cells (MSCs), which can differentiate into tri-lineage (osteoblast, adipocyte, and chondrocyte) and suppress inflammation, are promising tools for regenerative medicine. MSCs are phenotypically diverse based on their tissue origins. However, the mechanisms underlying cell-type-specific gene expression patterns are not fully understood due to the lack of suitable strategy to identify the diversity. In this study, we investigated gene expression programs and chromatin accessibilities of MSCs by whole-transcriptome RNA-seq analysis and an assay for transposase-accessible chromatin using sequencing (ATAC-seq). We isolated MSCs from four tissues (femoral and vertebral bone marrow, adipose tissue, and lung) and analysed their molecular signatures. RNA-seq identified the expression of MSC markers and both RNA-seq and ATAC-seq successfully clustered the MSCs based on their tissue origins. Interestingly, clustering based on tissue origin was more accurate with chromatin accessibility signatures than with transcriptome profiles. Furthermore, we identified transcription factors potentially involved in establishing cell-type specific chromatin structures. Thus, epigenome analysis is useful to analyse MSC identity and can be utilized to characterize these cells for clinical use.

Single-cell RNA sequencing analysis of human bone-marrow-derived mesenchymal stem cells and functional subpopulation identification

Article Open access 01 April 2022

Mesenchymal stromal cells in the bone marrow niche consist of multi-populations with distinct transcriptional and epigenetic properties

Article Open access 04 August 2021

Characterization of mesenchymal stem cells in human fetal bone marrow by single-cell transcriptomic and functional analysis

Article Open access 31 March 2023

Introduction

Mesenchymal stem cells (MSCs) have unique differentiation potential toward three mesenchymal lineages including osteoblast, adipocyte, and chondrocyte^1,2. It was also shown that the differentiation potential of MSCs is not limited to the mesenchymal lineage; specifically, MSCs can also differentiate into ectodermal and endodermal lineages³. In addition to their marked differentiation potential, MSCs exert immunosuppressive activity by secreting cytokines such as TSG-6⁴. These unique features make MSCs a promising tool for regenerative medicine for intractable diseases. We previously showed that MSCs have critical roles in tissue regeneration in the damaged skin and can be utilized for the treatment of the intractable genetic skin disease epidermolysis bullosa (EB)⁵. Others have also shown that these cells are potentially useful for various diseases including ischemic stroke and graft-versus host disease^6,7,8,9. Thus, MSCs are anticipated to be effective for treating intractable diseases that require tissue regeneration and immunosuppression.

MSCs are known to have phenotypic diversity¹⁰. Different factors including tissue origin, gender, age, or culture conditions, can affect the characteristics of these cells¹¹. Among these, tissue origin is a key determinant of cell phenotype. MSCs were originally isolated from the bone marrow, but recently it was shown that they can be isolated from multiple tissues such as adipose, lung, umbilical cord, or dental pulp^12,13,14. These MSCs with different tissue origins are unique in terms of growth rate, differentiation potential, or cell morphology^15,16. However, the gold standard to identify the molecular identity of MSCs has not been established. Although cell surface marker analysis using FACS is a standard method to confirm the identity of MSCs, this assay cannot fully account for such diversity, probably because FACS is limited by the number of proteins that it can analyse. Thus, it is essential to uncover the detailed molecular mechanisms that establish MSC diversity.

Recently, the epigenome, a set of information regarding chemical modifications to DNA and DNA-associated proteins, has been extensively analysed to understand the molecular signatures that specify cell identity¹⁷. Each cell type has a unique epigenome that is used to establish cell-type specific gene expression programs. These cell-type specific gene expression programs are dependent on the well-organized deposition of regulatory proteins such as transcription factors, RNA polymerase, or chromatin remodellers^18,19. Regulatory proteins dynamically control the 3D structure of the genome and modulate the accessibility of chromatin to establish cell-type specific gene expression programs²⁰. To efficiently analyse chromatin accessibility, the assay for transposase-accessible chromatin using sequencing (ATAC-seq) has been recently developed²¹. As ATAC-seq requires fewer cells and handlings compared to conventional techniques, it has been used for various types of cells and has successfully identified their chromatin accessibility profiles²².

Here, we describe the comprehensive analysis of MSC signatures to reveal the molecular mechanisms underlying their diversity. Using cells isolated from different tissues as a model to analyse MSC diversity, we simultaneously assessed chromatin accessibility and the transcriptome of MSCs and showed that compared to transcriptome analysis, chromatin accessibility is a superior indicator for cell type identification. We also mapped the regulatory landscape of transcription factors in MSCs to establish cell-type specific gene expression programs.

Results

Isolation and validation of MSCs

We first isolated MSCs from four different tissues (femoral bone marrow, vertebral bone marrow, adipose, and pulmonary) using well established protocols (see Experimental Procedures section). We chose these four tissues based on our rationale as follows. In general, the tissue source for MSCs can be categorized as bone marrow or non-bone marrow. First, we chose femoral bone marrow and adipose tissue because these are major sources of bone marrow-type and non-bone marrow MSCs, respectively. To facilitate comparisons within bone marrow type MSCs or non-bone marrow type MSCs, we additionally chose vertebral bone marrow and pulmonary tissue, as the MSCs from these tissues were previously shown to have unique properties^23,24. Because tissue origin has been implicated in affecting the phenotypes or characteristics of MSCs, we used these cells to model the diversity of MSCs²⁵. We collected the four tissues from identical mice and to minimize biological variability, we collected MSCs from three mice (Fig. S1a). To validate the cell isolation and culture protocols, we analysed surface markers on the isolated MSCs by FACS (Fig. 1a and Fig. S1b). All MSCs were positive for MSC makers (CD29, CD44, Sca-1, and CD106) but were negative for non-MSC markers (CD31, CD34, and CD45). In addition, the expression levels of each surface protein were highly consistent among the biological replicates (Fig. 1b). These data confirmed that MSCs were successfully isolated from four different tissues.

Tissue origin markedly affects MSC transcriptome

Next, to further validate the isolated MSCs, we performed whole transcriptome analysis using RNA-sequencing (RNA-seq). We constructed a total of 12 RNA-seq libraries (RNA from the MSCs of four different origins, with three biological replicates for each) using the Smart-seq2 method²⁶. First, to visualize differences in the transcriptomes among the four MSC groups, we performed principle component analysis (PCA) using all detected genes (Fig. 2a). The PCA plot clearly separated all four MSC groups, whereas biological replicates from identical tissues clustered together, suggesting that MSC tissue origin is strongly correlated with cell-type specific gene expression patterns. Pearson correlation analysis also confirmed the high correlation between tissue origin and gene expression patterns and confirmed that the biological replicates were in good agreement (Fig. 2b). To further analyse transcriptional differences among the four MSCs, we performed hierarchical clustering (see Experimental Procedures section) and again found that the samples clustered well depending on tissue origin, with all biological replicates clustering together (Fig. 2c). Interestingly, the two bone marrow-derived MSC groups (femoral and vertebral) clustered relatively closely, as compared to the other MSCs, suggesting relatively similar, albeit distinct, transcriptomes between these groups. To further clarify cell type-specific gene expression profiles, we tried to identify the predominant gene expression signatures using the Iterative Clustering and Guide-gene Selection (ICGS) algorithm²⁷. ICGS clustered the MSCs based on 12 gene clusters that are uniquely regulated among the cell types (Fig. 2d). To understand the molecular functions of the genes in each cluster, we performed pathway analysis using Ingenuity Pathway Analysis (IPA; Fig. 2e). The pathways related to bone physiology (e.g., role of osteoblasts, osteoclasts, and chondrocytes in rheumatoid arthritis or osteoarthritis pathway) and retinoid X receptor (RXR) activation were predicted for the clusters (cluster 1 to 6) uniquely expressed in bone marrow-derived MSCs (fBM-and vBM-MSCs, femoral bone marrow MSCs and vertebral bone marrow MSCs, respectively). Genes related to mismatch repair were predicted in the lung derived MSCs (P-MSC)-specific cluster (cluster 9) and genes related to melatonin/serotonin degradation were enriched in the adipose tissue derived MSCs (A-MSC)-specific cluster (cluster 12). To investigate how filtering genes into subsets affects the ICGS results, we tried a number of filtering options for RNA-seq using different cut-offs to obtain comparable ATAC-seq clustering results. We found that the filtering could improve the efficiency of clustering using RNA-seq results (Fig. 3). These data indicated that MSCs from different tissues have distinct and unique transcriptomes.

Chromatin assembly is better at distinguishing MSC origin than transcriptome analysis

Next, we sought to analyse the epigenomes of the isolated MSCs, which were found to have distinct gene expression profiles. For this, we profiled chromatin accessibility using ATAC-seq. In this technique, sequencing adaptors loaded with Tn5 transposase fragments are used to tag the genome (named “tagmentation”). Because the tagmentation preferentially occurs in the open chromatin regions (relatively protein-free regions), DNA fragments from open chromatin regions can be easily recovered by PCR targeting the adaptor sequence inserted by the transposase. With its robustness and high sensitivity, ATAC-seq has been widely used to profile the epigenome. We constructed 12 ATAC-seq libraries (similar to the RNA-seq experimental design; four different tissues with three biological replicates each). The exemplar locus and basic quality control scores are shown (Fig. S2a). Consistent with other ATAC-seq data, metagene analysis identified the enrichment of ATAC-seq reads that are associated with transcription start sites (Fig. S2b)^28,29,30. To obtain information regarding open chromatin regions, we analysed densely-sequenced regions (which represent open chromatin regions) by performing peak call analysis using MACS2. To validate peak calling, we assigned each peak into known genomic features. The majority of called peaks from ATAC-seq data from all four MSCs were located at promoter or genic regions (exons and introns), consistent with other published ATAC-seq data (Fig. S2cd)^28,29,30. To further characterize the differences among the MSCs, we selected peaks with high confidence by using the false discovery rate (FDR); peaks with FDR <1e-7 left more than 65000 peaks (approximately 65% of the total peaks) in each group. We then binned the genome using 5-kb sized bins and assigned the read counts to bins where peaks were located. PCA using the assigned score resulted in clear separation of the MSCs depending on tissue origin, whereas biological replicates clustered together, indicating that ATAC-seq data could also predict the identity of MSCs (Fig. 4a). The Pearson correlation heatmap showed significantly low correlations among MSCs from different tissues compared to those based on RNA-seq data, indicating that the differences among MSCs were more pronounced with ATAC-seq data (Fig. 4b). To further confirm this, we performed hierarchical clustering using all ATAC-seq samples and again obtained better separation among tissues compared to that using RNA-seq data, suggesting that ATAC-seq data contains more cell type-specific information to detect the diversity in MSCs (Fig. 4c). In these comparisons, we used RNA-seq and ATAC-seq data without any filtering (except for basic filtering to exclude unreliable signals) to directly compare the nature of the data themselves. To assess whether filtering with RNA-seq data could improve the clustering results, we performed hierarchical clustering using only the differentially expressed genes (DEGs) identified in Fig. 2 and found that RNA-seq could obtain a similar level of clustering power compared to ATAC-seq by performing gene filtering. These data imply that RNA-seq contains features (genes) shared among different cell types compared to ATAC-seq. To predict cell type-specific cis-regulatory programs, we analysed ATAC-seq data using cisTopic³¹. cisTopic identified 15 topics (group of genomic regions that are accessible in a cell type-specific manner) and peaks in each cluster were enriched in unique motifs, suggesting differential molecular mechanisms to regulate each cluster (Fig. 4d). To infer the molecular functions related to each topic, we assigned peaks to the nearest genes and performed pathway analysis (Fig. 4e). Pathway analysis showed unique enrichment of the integrin related pathways (FAK signalling and integrin signalling) in ATAC-seq peaks specific for A-MSCs. Pathways related to cholesterol synthesis (superpathway of cholesterol biosynthesis and lanosterol biosynthesis) and the acyl-CoA hydrolysis pathway were enriched in bone marrow-type MSCs, particularly vBM-MSCs.

Identification of transcription factors associated with chromatin accessibility among MSC groups

To gain a better understanding of the molecular mechanism associated with chromatin accessibility, we searched for transcription factors that are potentially involved in the establishment of cell-type specific chromatin structures in MSCs. To this end, we used chromVAR, which has been specifically developed for ATAC-seq data, to extract transcription factor binding motifs that are accessible in a cell-type specific manner³². Using ChromVAR, we identified 414 motifs with differential chromatin accessibility among the MSC groups (p ≤ 0.001) and the top 15 transcription factors are shown in Fig. 5a. To understand the cell-type specific use of the identified transcription factors, we visualized and clustered the accessibility of the top 15 identified variable transcription factor motifs (Fig. 5b). Interestingly, unlike RNA-seq and ATAC-seq, the two MSC groups with a bone marrow origin clustered differently; here femoral bone marrow-derived MSCs and lung MSCs clustered closely. In addition, GATA factors, representing a family known to initiate changes in chromatin structure, were specifically available in adipose tissue-derived MSCs³³. Similarly, RUNX1, known to be important for haematopoiesis and haematological malignancies, was selectively accessible in femoral bone marrow MSCs³⁴. Although most transcription factors identified here were expressed in at least single cell types (except for GATA1 and NFE2 with almost undetectable levels of expression), the expression patterns were not similar to the predicted cell type-specific availability, suggesting that the expression level is not the only determinant factor for the usage of transcription factors in the cells (Fig. 5c). To clarify the functional relationships between RNA-seq and ATAC-seq, we investigated chromatin accessibility at the promoters of the DEGs. The enrichment patterns of the ATAC-seq signals were similar to those for the DEGs, indicating that the DEG promoters were accessible in a manner similar to DEG expression (Fig. 5d).

Discussion

Here, we demonstrated a strategy to profile the molecular signatures associated with the establishment of the diversity of MSCs. We comprehensively analysed MSCs by RNA-seq and ATAC-seq and found that each group has a distinct transcriptome and epigenome. Our study directly addressed the fundamental question of how each MSC acquires unique features. Applying this strategy to mouse MSCs with different tissue origins identified many known regulators that are potentially involved in the establishment of cell-type specific chromatin structures and gene expression programs.

By comparing RNA-seq and ATAC-seq data without any prior filtering, we found that ATAC-seq contains more information to identify cell type-specific features. As we showed, the expression of cell type-specific genes are well correlated with the chromatin accessibility of the corresponding promoters; further, the gene expression pattern (identified by RNA-seq) and the chromatin accessibility (ATAC-seq) are in a good agreement. In addition to promoter regions, ATAC-seq can identify the open chromatin regions of intergenic regions, which presumably identify potential enhancer regions²². Our comparison indicated that this intergenic information is useful to identify cell diversity.

Our data are consistent with previous studies showing a connection between the diversity of MSCs and tissue origin. Although studies addressing the molecular signatures of MSCs at the omics level are limited, Cho et al. performed whole transcriptome analysis using human MSCs isolated from bone marrow, adipose tissue, and tonsil and found that MSCs have tissue origin-specific gene expression programs³⁵. However, because most studies addressing the diversity of MSCs have used human-derived samples, which inevitably contain variations caused by genetic and/or environmental factors, it was difficult to precisely evaluate the influence of tissue origin on MSC diversity. In this study, by choosing a mouse model to control sample-to-sample variation, we successfully concluded that the diversity of MSCs is influenced by tissue origin. In addition, we analysed cultured MSCs, rather than primary MSCs, in this study because we aimed to characterize cultured MSCs that are being used for many pre-clinical and clinical studies³⁶. However, although the isolation of primary MSCs is challenging due to the lack of suitable cell surface markers and the paucity of MSCs in the body, it would be of interest to perform similar analyses using primary MSCs³⁷.

In this study, we used the Smart-seq2 protocol, which was developed for RNA-seq using small amounts of RNA (minimum requirement is approximately 10 pg of RNA), to perform whole transcriptome analysis. Although Smart-seq2 is a very powerful method to analyse the full-length transcriptome, there are a few limitations associated with this technology, as compared to other RNA-seq methods. First, because Smart-seq2 is dependent on oligo dT primers to convert RNA into cDNA, it can only analyse polyadenylated RNA and not non-polyadenylated RNA such as histone mRNAs, long non-coding RNAs, nascent RNAs, and enhancer RNAs³⁸. In addition, the use of oligo dT primers, instead of random hexamers, can be biased towards the 3′ end of transcripts. Second, Smart-seq2 is not a strand-specific method. Further studies with multiple RNA-seq methods would be helpful to comprehensively compare the transcriptomes and epigenomes of MSCs.

Our initial efforts to characterize the molecular signatures of MSCs using ATAC -seq have led to the identification of potential transcription factors that might be useful to distinguish MSCs with different characteristics. We speculate that this approach, namely profiling MSCs by focusing on chromatin accessibility, could also by useful to characterize human MSCs. Considering the clinical use of human MSCs, it is important to fully characterize these cells to assure treatment safety and efficacy. Profiling MSCs using ATAC-seq might help to standardize preparation protocols for clinical use.

Methods

Mice

C57BL6/J mice (8-week-old) were purchased from CLEA Japan (Tokyo). All mice were housed under a 12-h light-dark cycle and were provided solid food and filtered water. All animals were handled in accordance with the approved guidelines of the Animal Committee of Osaka University Graduate School of Medicine. The protocols were approved by the Animal Committee of the Osaka University Graduate School of Medicine.

Isolation and culture of MSCs from tissues

Mouse bone marrow-derived MSCs were isolated from the bone marrow of the femur and vertebrae by crushing the bone with a mortar and pestle in MesenCult medium (STEM CELL technologies)³⁹. For the isolation of lung and adipose-derived MSCs, tissues were minced into pieces and digested with MesenCult medium containing 0.2% collagenase (Wako) at 37 °C for 30 min^13,40. The collagenase was removed by washing twice with 1× PBS. The cell suspension was filtered through a cell strainer (40-μm) and collected in a 50-ml tube. Red blood cells were removed by incubating cells in 1× RBC lysis buffer (BioLegend) for 5 min at room temperature. Then, 2 × 10⁷ cells were seeded onto a collagen I-coated, 10-cm dish using MesenCult medium containing 1× MesenPure and 10 nM of a Rock inhibitor. MSCs were cultured for up to three passages for experiments. All MSCs were used within three passages to reduce any artefacts potentially introduced by long-term culture⁴¹.

Flow cytometric analysis

One million cells were aliquoted in 100 μl of cell staining buffer (PBS with 2% FBS) and incubated with purified rat anti-mouse CD16/32 antibody for 10 min at 4 °C, followed by the fluorophore-conjugated antibodies of cell surface markers for 20 min at 4 °C as follows: FITC anti-mouse/rat CD29 (clone HMβ1-1, BioLegend), PE anti-mouse CD31 (clone 390, BioLegend), PE anti-mouse CD34 (clone MEC14.7, BioLegend), PE anti-mouse/human CD44 (clone IM7,BioLegend), PE anti-mouse CD45 (clone 30-F11, BioLegend), PE anti-mouse Sca-1 (clone D7, BioLegend), PE anti-mouse CD106 (clone 429, BioLegend), and PE anti-mouse IgG (isotype control) antibodies. Stained cells were analysed using the FACSCantoII system (BD Bioscience).

RNA-seq library preparation

RNA was extracted using the RNeasy Plus Micro kit (QIAGEN) and RNA concentration was determined using a Qubit 3.0 Fluorometer (Thermo Fisher Scientific). RNA quality was assessed on an Agilent 4200 tapestation instrument (Agilent Technologies). RNA-seq libraries were constructed following the Smart-seq2 protocol²⁶.

ATAC-seq

ATAC-seq libraries were constructed in accordance with a previously published protocol with minor modifications. Tagmentation was conducted with the Nextera DNA Library Prep kit (Illumina). Five thousand cells were resuspended in 50 μl of transposase mixture (25 μl of 2× TD buffer, 2.5 μl of TDE1, 1 μl of 5% IGEPAL-CA600, and 22.5 μl of nuclease-free water) at 37 °C for 30 min. DNA was extracted using the Zymo clean & concentrator kit (Zymo research). Transposed DNA was combined with the PCR mix (5 μl of 25 μM Nextera PCR primer, 25 μl of NEB High Fidelity 2× PCR master mix, 10 μl of nuclease-free water) and amplified using the following conditions: 72 °C for 5 min; 98 °C for 30 s; five cycles of amplification at 98 °C for 10 s, 63 °C for 30 s and 72 °C for 1 min. To enrich for transposed DNA of optimal sizes (150 to 500 bp), dual-size selection was performed using AMpure XP beads (BeckmanCoulter). Size-selected transposed DNA was combined with PCR mix (0.5 μl of 25 μM Nextera PCR primer, 0.06 μl of 100× SYBR green1, 7.5 μl of NEB High Fidelity 2×PCR master mix, 1.94 μl of nuclease-free water) and amplified by performing additional PCR cycles determined by qPCR, which was followed by purification with AMPure XP beads. The concentration of purified DNA was determined using the Qubit 3.0 Fluorometer (Thermo Fisher Scientific) and DNA size was analysed with a Bioanalyzer High sensitivity DNA chip (Agilent Technologies).

Analysis of RNA-Seq data

The mouse reference genome (mm10) was obtained from the iGenomes repository (https://support.illumina.com/sequencing/sequencing_software/igenome.html.). Fastq data for RNA-seq were aligned using RSEM with the following parameters (rsem-calculate-expression –paired-end –star –output-genome-bam)⁴². The clustering procedure is based on two unsupervised methods. First, the hierarchical clustering method was made with the R heatmap.2 function using ward.D2 and the Pearson correlation as a measure of distance. Second, PCA-based clustering were processed using the R function prcomp with default parameters. These steps were applied to all genes as features, with TPM-based expression. The analyses of differential gene expression were performed using AltAnalyze using the following parameters (–runICGS yes –column_method hopach –rho 0.4 –SamplesDiffering 3 –excludeCellCycle conservative –ExpressionCutoff 4 –FoldDiff 3)²⁷. In Fig. 3 we show the results with FoldDiff: 1.5, 2, 4, 5, 6.

We applied IPA for pathway analysis of the DEGs focusing on canonical pathways. In Fig. 2e we highlight the top five pathways ranked according the −log(Pvalue) of IPAs.

ATAC-Seq data analysis

Fastq data for ATAC-seq data were processed using TrimGalore (https://github.com/FelixKrueger/TrimGalore) with the following parameters (–stringency 5 –paired –trim1 –length 30 -q 0 –a CTGTCTCTTATACACATCT) to trim adaptor sequences and then aligned using Bowtie⁴³ with the following parameters (-m 1 -v 2 -S -I 0 -X 2000). Duplicated reads were removed using Picard’s MarkDuplicates (http://broadinstitute.github.io/picard/) with default settings. To account for the Tn5 transposon cleavage position in the mapped reads, we shifted the mapped reads by +4/−5 bp depending on the strand. The reads mapped to the blacklist features as defined in the ENCODE project were removed⁴⁴. Peak calling was performed using MACS2 with the following parameters (macs2 callpeak –nomodel –nolambda –keep-dup all –call-summits)⁴⁵. Based on this output, we created the bigWig file using UCSC bedGraphToBigwig to visually inspect the data. For clustering and correlation analysis, we filtered the significant peaks (FDR <1e-7) from the MACS2 output peak list and then the read density within a peak was calculated using featureCounts⁴⁶. We conducted quantile normalization and GC content correction using the R package CQN⁴⁷. For further downstream analysis, we binned the genome into 5-kbp windows and assigned the normalized read density to the corresponding bins (the peaks that spanned more than two windows were reassigned according the ratio of the base pair overlap). All feature intersections were done using BEDTools⁴⁸. Peak annotation to given genomic features was performed using Homer⁴⁹. Transcription factor motif analyses were performed using chromVar³² with JASPAR’s vertebrates’ motif set⁵⁰. We analysed the differentially accessible regions in ATAC-seq data (Fig. 4d) using cisTopic³¹. We explored six possibilities of models (2, 5, 10, 15, 20, 25). Our analysis showed that the topic with 15 models was most suitable with the lowest log-likelihood. The pathway analysis of these topics (Fig. S5) was performed using IPA, wherein the genes were obtained from the TSS region that overlapped with topic region (BED) within +/− 1 kbp distance. The genes of the corresponding transcription factors were derived from RNA-seq data processed according to the description of the previous section (Fig. 5c). We constructed the heatmap (Fig. 5d) by first obtaining the TSS region (+/− 1 kbp) of corresponding DEGs and performed the read counting using ATAC-seq data. We further performed the GC correction on these read counts with CQN⁴⁷.

Statistical analysis

Statistical analyses were performed using Prism7 (GraphPad software). Data are shown as the mean ± S.E.M and statistical significance was defined at *P < 0.05 based on a one-way ANOVA followed by a Turkey’s multiple comparison test.

Accession numbers

All sequencing data reported here are deposited at GEO and available under the Accession Number GSE116558.

References

Caplan, A. I. Mesenchymal stem-cells. J. Orthop. Res. 9, 641–650 (1991).
Article CAS PubMed Google Scholar
Prockop, D. J. Marrow stromal cells as stem cells for nonhematopoietic tissues. Science 276, 71–74 (1997).
Article CAS PubMed Google Scholar
Wislet-Gendebien, S. et al. Plasticity of cultured mesenchymal stem cells: switch from nestin-positive to excitable neuron-like phenotype. Stem Cells 23, 392–402 (2005).
Article CAS PubMed Google Scholar
Nauta, A. J. & Fibbe, W. E. Immunomodulatory properties of mesenchymal stromal cells. Blood 110, 3499–3506 (2007).
Article CAS PubMed Google Scholar
Tamai, K. et al. PDGFRalpha-positive cells in bone marrow are mobilized by high mobility group box 1 (HMGB1) to regenerate injured epithelia. Proc. Natl. Acad. Sci. USA 108, 6609–6614 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Introna, M. & Rambaldi, A. Mesenchymal stromal cells for prevention and treatment of graft-versus-host disease: successes and hurdles. Curr. Opin. Organ Transplant. 20, 72–78 (2015).
Article CAS PubMed Google Scholar
Le Blanc, K. et al. Mesenchymal stem cells for treatment of steroid-resistant, severe, acute graft-versus-host disease: a phase II study. Lancet 371, 1579–1586 (2008).
Article CAS PubMed Google Scholar
Moon, G. J. et al. Serum-mediated activation of bone marrow-derived mesenchymal stem cells in ischemic stroke patients: A novel preconditioning method. Cell Transplant. 27, 485–500 (2018).
Article PubMed PubMed Central Google Scholar
Weng, J. Y. et al. Mesenchymal stem cell as salvage treatment for refractory chronic GVHD. Bone Marrow Transplant. 45 (2010).
Article CAS PubMed PubMed Central Google Scholar
Mo, M., Wang, S., Zhou, Y., Li, H. & Wu, Y. Mesenchymal stem cell subpopulations: phenotype, property and therapeutic potential. Cell. Mol. Life Sci. 73, 3311–3321 (2016).
Article CAS PubMed Google Scholar
Hass, R., Kasper, C., Bohm, S. & Jacobs, R. Different populations and sources of human mesenchymal stem cells (MSC): A comparison of adult and neonatal tissue-derived MSC. Cell. Commun. Signal. 9, 12 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lv, F. J., Tuan, R. S., Cheung, K. M. C. & Leung, V. Y. L. Concise review: The surface markers and identity of human mesenchymal stem cells. Stem Cells 32, 1408–1419 (2014).
Article CAS PubMed Google Scholar
Summer, R., Fitzsimmons, K., Dwyer, D., Murphy, J. & Fine, A. Isolation of an adult mouse lung mesenchymal progenitor cell population. Am. J. Respir. Cell. Mol. Biol. 37, 152–159 (2007).
Article CAS PubMed PubMed Central Google Scholar
Wu, J. et al. Umbilical cord blood-derived non-hematopoietic stem cells retrieved and expanded on bone marrow-derived extracellular matrix display pluripotent characteristics. Stem Cell Res. Ther. 7 (2016).
Bakopoulou, A. et al. Isolation and prolonged expansion of oral mesenchymal stem cells under clinical-grade, GMP-compliant conditions differentially affects “stemness” properties. Stem cell Res. Ther. 8, 247 (2017).
Article PubMed PubMed Central Google Scholar
Viero Nora, C. C. et al. Molecular analysis of the differentiation potential of murine mesenchymal stem cells from tissues of endodermal or mesodermal origin. Stem Cells Dev. 21, 1761–1768 (2011).
Article CAS PubMed Central Google Scholar
Kelsey, G., Stegle, O. & Reik, W. Single-cell epigenomics: Recording the past and predicting the future. Science 358, 69–75 (2017).
Article ADS CAS PubMed Google Scholar
Chen, F. X., Smith, E. R. & Shilatifard, A. Born to run: control of transcription elongation by RNA polymerase II. Nat. Rev. Mol. Cell. Biol. 19, 464–478 (2018).
Article CAS PubMed Google Scholar
Lambert, S. A. et al. The human transcription factors. Cell 172, 650–665 (2018).
Article CAS PubMed Google Scholar
Bonev, B. & Cavalli, G. Organization and function of the 3D genome. Nat. Rev. Genet. 17, 661 (2016).
Article CAS PubMed Google Scholar
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
Article CAS PubMed PubMed Central Google Scholar
Corces, M. R. et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat. Genet. 48, 1193–1203 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lama, V. N. et al. Evidence for tissue-resident mesenchymal stem cells in human adult lung from studies of transplanted allografts. J. Clin. Invest. 117, 989–996 (2007).
Article CAS PubMed PubMed Central Google Scholar
Foronjy, R. F. & Majka, S. M. The potential for resident lung mesenchymal stem cells to promote functional tissue regeneration: understanding microenvironmental cues. Cells 1, 874 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kwon, A. et al. Tissue-specific differentiation potency of mesenchymal stromal cells from perinatal tissues. Sci. Rep. 6, 23544 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Picelli, S. et al. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protoc. 9, 171–181 (2014).
Article CAS PubMed Google Scholar
Olsson, A. et al. Single-cell analysis of mixed-lineage states leading to a binary cell fate choice. Nature 537, 698 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Davie, K. et al. Discovery of transcription factors and regulatory regions driving in vivo tumor development by ATAC-seq and FAIRE-seq open chromatin profiling. PLoS Genet. 11, e1004994 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bao, X. M. et al. A novel ATAC-seq approach reveals lineage-specific reinforcement of the open chromatin landscape via cooperation between BAF and p63. Genome Biol. 16, 284 (2015).
Article CAS PubMed PubMed Central Google Scholar
Su, Y. et al. Neuronal activity modifies the chromatin accessibility landscape in the adult brain. Nat. Neurosci. 20, 476–483 (2017).
Article CAS PubMed PubMed Central Google Scholar
González-Blas, C. B. et al. Cis-topic modelling of single cell epigenomes. bioRxiv, 370346 (2018).
Schep, A. N., Wu, B., Buenrostro, J. D. & Greenleaf, W. J. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods 14, 975–978 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cirillo, L. A. et al. Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. Mol. Cell 9, 279–289 (2002).
Article CAS PubMed Google Scholar
Sood, R., Kamikubo, Y. & Liu, P. Role of RUNX1 in hematological malignancies. Blood 129, 2070–2082 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cho, K. A., Park, M., Kim, Y. H., Woo, S. Y. & Ryu, K. H. RNA sequencing reveals a transcriptomic portrait of human mesenchymal stem cells from bone marrow, adipose tissue, and palatine tonsils. Sci. Rep. 7, 17114 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Galipeau, J. & Sensebe, L. Mesenchymal stromal cells: Clinical challenges and therapeutic opportunities. Cell Stem Cell 22, 824–833 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ghazanfari, R. et al. Human primary bone marrow mesenchymal stromal cells and their in vitro progenies display distinct transcriptional profile signatures. Sci. Rep. 7, 10338 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Hayashi, T. et al. Single-cell full-length total RNA sequencing uncovers dynamics of recursive splicing and enhancer RNAs. Nat. Commun. 9, 619 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Soleimani, M. & Nadri, S. A protocol for isolation and culture of mesenchymal stem cells from mouse bone marrow. Nat. Protoc. 4, 102 (2008).
Article CAS Google Scholar
Yamamoto, N. et al. Isolation of multipotent stem cells from mouse adipose tissue. J. Dermatol. Sci. 48, 43–52 (2007).
Article CAS PubMed Google Scholar
Bortolotti, F. et al. In vivo therapeutic potential of mesenchymal stromal cells depends on the source and the isolation procedure. Stem Cell Rep. 4, 332–339 (2015).
Article CAS Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article CAS PubMed PubMed Central Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57 (2012).
Article ADS CAS Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Article CAS PubMed PubMed Central Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article CAS PubMed Google Scholar
Hansen, K. D., Irizarry, R. A. & Wu, Z. J. Removing technical variability in RNA-seq data using conditional quantile normalization. Biostatistics 13, 204–216 (2012).
Article PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Article CAS PubMed PubMed Central Google Scholar
Khan, A. et al. JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic Acids Res. 46, D260–D266 (2017).
Article CAS PubMed Central Google Scholar

Download references

Acknowledgements

We thank for Machika Kawamura for helpful comments on the manuscript. This study was supported by AMED under Grant Number JP18bk0104055 and JP18lm0203018, and JSPS KAKENHI Grant Number JP16H05369.

Author information

Yen-Ting Ho and Takashi Shimbo contributed equally.

Authors and Affiliations

Department of Stem Cell Therapy Science, Graduate School of Medicine, Osaka University, Suita, Osaka, Japan
Yen-Ting Ho, Takashi Shimbo, Edward Wijaya, Yuya Ouchi, Eiichi Takaki, Ryoma Yamamoto, Yasushi Kikuchi & Katsuto Tamai
StemRIM Co., Ltd., Ibaraki, Osaka, Japan
Edward Wijaya, Yuya Ouchi, Eiichi Takaki & Ryoma Yamamoto
Division of Gene Therapy Science, Graduate School of Medicine, Osaka University, Suita, Osaka, Japan
Yasufumi Kaneda

Authors

Yen-Ting Ho
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Shimbo
View author publications
You can also search for this author in PubMed Google Scholar
Edward Wijaya
View author publications
You can also search for this author in PubMed Google Scholar
Yuya Ouchi
View author publications
You can also search for this author in PubMed Google Scholar
Eiichi Takaki
View author publications
You can also search for this author in PubMed Google Scholar
Ryoma Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Yasushi Kikuchi
View author publications
You can also search for this author in PubMed Google Scholar
Yasufumi Kaneda
View author publications
You can also search for this author in PubMed Google Scholar
Katsuto Tamai
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.T.H. performed experiments, collected and interpreted data, and wrote the manuscript. T.S. conceived and designed the study, analysed and interpreted data, and wrote the manuscript. E.W. analysed and interpreted data and wrote the manuscript. Y.O., E.T. and R.Y. analysed and interpreted the data. Y. Kikuchi analysed data and supplied materials. Y. Kaneda interpreted data and helped write the manuscript. K.T. designed the study, interpreted data and helped write the manuscript. T.S. and K.T. approved the final draft. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Takashi Shimbo or Katsuto Tamai.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary info

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ho, YT., Shimbo, T., Wijaya, E. et al. Chromatin accessibility identifies diversity in mesenchymal stem cells from different tissue origins. Sci Rep 8, 17765 (2018). https://doi.org/10.1038/s41598-018-36057-0

Download citation

Received: 27 July 2018
Accepted: 14 November 2018
Published: 10 December 2018
DOI: https://doi.org/10.1038/s41598-018-36057-0

Keywords

This article is cited by

LINC01638 sustains human mesenchymal stem cell self-renewal and competency for osteogenic cell fate
- Jonathan A. R. Gordon
- Coralee E. Tye
- Jane B. Lian
Scientific Reports (2023)
Alterations in chromatin accessibility during osteoblast and adipocyte differentiation in human mesenchymal stem cells
- Jianyun Liu
- Lijun Gan
- Jianjun Xiong
BMC Medical Genomics (2022)
Optimized assay for transposase-accessible chromatin by sequencing (ATAC-seq) library preparation from adult Drosophila melanogaster neurons
- Collin B. Merrill
- Miguel A. Pabon
- Adrian Rothenfluh
Scientific Reports (2022)
Subtype-Independent ANP32E Reduction During Breast Cancer Progression in Accordance with Chromatin Relaxation
- Garrett L. Ruff
- Kristin E. Murphy
- Patrick J. Murphy
BMC Cancer (2021)
Mesenchymal stromal cells in the bone marrow niche consist of multi-populations with distinct transcriptional and epigenetic properties
- Sanshiro Kanazawa
- Hiroyuki Okada
- Kazuto Hoshi
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.