RNA-seq of peripheral blood mononuclear cells of congenital generalized lipodystrophy type 2 patients

Huang, Yen-Hua; Su, Tzu-Chien; Wang, Chung-Hsing; Wong, Siew-Lee; Chien, Yin-Hsiu; Wang, Yu-Tai; Hwu, Wuh-Liang; Lee, Ni-Chung

doi:10.1038/s41597-021-01040-4

Download PDF

Data Descriptor
Open access
Published: 13 October 2021

RNA-seq of peripheral blood mononuclear cells of congenital generalized lipodystrophy type 2 patients

Scientific Data volume 8, Article number: 265 (2021) Cite this article

2902 Accesses
4 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Illumina RNA-seq analysis was used to characterize the whole transcriptomes of peripheral blood mononuclear cells (PBMCs) from patients with congenital generalized lipodystrophy. RNA-seq information for seven patients with type 2 congenital generalized lipodystrophy (CGL2; Berardinelli-Seip congenital lipodystrophy, BSCL2) was obtained and compared with similar information for seven age- and sex-matched healthy control subjects. All seven CGL2 patients carried biallelic pathogenic mutations affecting the BSCL2 gene and had clinical symptoms of varying severity. The findings provide the whole-transcriptome signatures of PBMCs of CGL2 patients, allowing further exploration of gene expression patterns/signatures associated with the various clinical symptoms of patients with this disease.

Measurement(s)	RNA-Seq • RNA
Technology Type(s)	Illumina HiSeq. 2500 • RNA sequencing
Sample Characteristic - Organism	Homo sapiens

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.15022521

Diagnosing Cornelia de Lange syndrome and related neurodevelopmental disorders using RNA sequencing

Article 08 January 2020

Single-cell RNA sequencing in cardiovascular development, disease and medicine

Article 30 March 2020

Application of second-generation sequencing in congenital pulmonary airway malformations

Article Open access 28 November 2022

Background & Summary

RNA sequencing (RNA-seq) is a term that describes the use of next-generation sequencing (NGS)-based methodology to target whole transcriptomes. This approach has gained wide popularity in biomedical research, in which profiling of the gene expression levels in samples can be very useful. Compared with other high-throughput transcriptome profiling technologies, such as DNA microarrays, the RNA-seq approach has the advantage of a wider dynamic range of measurement, which enables more sensitive detection of the global gene expression signatures of a given cell population or tissue¹. Thus, RNA-seq allows for a more precise identification of the gene expression differences of cells when normal and abnormal tissues are compared. Indeed, RNA-seq has become a ubiquitous tool for the study of human diseases, including metabolic disorders, cancers, and infectious diseases². Overall, RNA-seq-identified changes in the transcriptome that are associated with a disease can reveal dysregulation affecting associated biological pathways. The approach also aids in the identification of molecular markers reflecting disease progression.

Congenital generalized lipodystrophy (CGL; CGL1, MIM #608594; CGL2, MIM #269700; CGL3, MIM#612526; CGL4, MIM#613327) is a group of rare autosomal recessive disorders characterized by defects in the biogenesis of lipid droplets; individuals with these genetic diseases lack adipose tissue from early in life due to mutations in AGPAT2, BSCL2, CAV1, or CAVIN1^3,4,5,6,7,8. Metabolic disorders in CGL patients are the main factors causing morbidity and mortality. Deficiency in adipose mass results in leptin deficiency⁹, leading to the main clinical features of these diseases, including hepatomegaly, muscular hypertrophy, cardiomyopathy, insulin resistance, and hypertriglyceridemia³. Furthermore, leptin deficiency might be associated with the lowered immunity frequently observed in CGL patients, as leptin is an important modulator of both the innate and adaptive immune systems. For example, leptin activates polymorphonuclear neutrophils (PMNs), exerts proliferative and antiapoptotic activities on T lymphocytes, affects cytokine production and regulates activation of monocytes and macrophages^10,11,12. To date, approximately 500 patients with CGL have been reported in the literature. In Taiwan, type 2 disease (CGL2), which is caused by BSCL2 mutation, is most common based on surveillance across Asia (73% to 100%)^{13,14,15,16,17}.

Compared to CGL1, CGL2 presents with a more severe phenotype, including extensive fat loss, cardiomyopathy, intellectual impairment (50% to 78%), and premature death (15%)^13,18. Hypertrophic cardiomyopathy (25% of CGL2 patients) often leads to death in the third decade of life. In addition, early onset of liver cirrhosis, renal failure, and recurrent bacterial infection are potential lethal complications. Diagnosis of CGL is based on the patient’s clinical and biochemical phenotype and is usually confirmed by molecular testing¹⁹. Therapeutic strategies include restriction of total fat intake (20–30% of total dietary energy), treatment with fibric acid derivatives, prescription of n-3 polyunsaturated fatty acids to control hypertriglyceridemia, and standard glucose-lowering treatments²⁰. In addition, leptin replacement therapy has been approved since 2014 by the Food and Drug Administration (FDA) for the treatment of severe metabolic abnormalities that result in generalized forms of lipodystrophy²¹.

Clinically, we have observed that some CGL2 patients develop hyperactivity and seizures in late childhood but that others develop metabolic syndrome that is difficult to control with glucose-lowering drugs. To elucidate the processes underlying these different outcomes, the aim in this study was to explore differences in RNA signatures between CGL2 patients and matched control subjects.

Methods

Subjects and blood sample preparation

This study investigated differentially expressed genes that might be associated with various subcategories of CGL2 patients and aimed to identify distinct regulatory pathways involved in these various phenotypes. The overall workflow is presented in Fig. 1.

Seven patients (3 female patients and 4 male patients) who had been diagnosed with CGL2 (Table 1) and seven healthy individuals age- and sex-matched with the patients participated in this study; the latter group was used as a control. All enrolled CGL2 patients had been shown to carry BSCL2 mutation by Sanger sequencing after their initial diagnosis of CGL. Each was further categorized into various subgroups based on age, symptoms, and individual biochemical profiles, namely, child or young adult, normal blood lipid or hyperlipidemia, intellectual disability or not, and diabetes mellitus (DM) or not. Additionally, patients with severe DM or hyperlipidemia, namely, patients 1, 4, 6, and 7, were assigned together into another subgroup, the metabolic syndrome subgroup (presence of insulin resistance or a diagnosis of DM); patients 2, 3, and 5 comprised the “nonmetabolic syndrome” subgroup. Peripheral blood samples were collected after obtaining written informed consent from the participants themselves or one of their parents if the individual was younger than 18 years old. The study protocol was approved by the Institutional Review Board of National Yang-Ming University (IRB number: YM108113E) and the Institutional Review Board of National Taiwan University Hospital (IRB number: 201901028RINB). Informed consent for these data to be openly shared was obtained from all subjects or their guardians, and the participants were warned that RNA-seq data carry some inherent risk of reidentification. To reduce variability, each blood sample was drawn in the morning before the participant broke their fast. Each blood sample was collected into a BD Vacutainer® EDTA tube, and soon after collection, an aliquot from each sample was used for RNA-seq transcriptomic profiling of peripheral blood mononuclear cells (PBMCs).

Table 1 Sex, age, and categories of the participants with CGL2 and their BSCL2 mutant alleles.

Full size table

Preparation of total RNA from PBMCs and RNA-seq analysis

Each whole-blood sample was heparinized, and the samples were used to isolate PBMCs by density gradient centrifugation using LymphoprepTM medium according to the manufacturer’s instructions. Specifically, a sample was diluted with phosphate-buffered saline (PBS, pH 7.4) to double its volume, layered on top of 5 mL of Ficoll-Paque Plus (GE Healthcare, Cat. No. 17-1440-02) and centrifuged for 30 minutes at 400 × g. The PBMC layer was aspirated and centrifuged for collection. The PBMC pellet was treated for 15 minutes with RBC lysis buffer in the absence of light and then centrifuged again followed by washing twice with PBS. Total RNA was extracted using a miRNeasy Kit (Qiagen Cat. No. 74004), followed by DNase I treatment to avoid contamination with genomic DNA. All procedures were according to the manufacturer’s instructions. The fourteen samples containing total RNA from the subject’s PBMCs were submitted to The Genomics Center for Clinical and Biotechnological Applications of National Core Facility for Biopharmaceuticals, which is located at National Yang-Ming University, Taipei. At this facility, RNA quality assessment, RNA integrity assessment, and whole-transcriptome sequencing were carried out. The sequencing library was prepared using a TruSeq Stranded Total RNA Library Ribo-Zero™ Globin Kit (Illumina, San Diego, CA, USA). As the amount of input RNA for the preparation of sequencing libraries was ~1 μg, which is approximately the highest level suggested by the manufacturer’s manual, no positive control, such as SMARTer® Ultra™ Low RNA Kit for Illumina® Sequencing (Takara Bio USA, Mountain View, CA, USA), was used. Paired-end RNA-seq (2 × 100 bases) was carried out using the Illumina HiSeq. 2500 platform.

RNA-seq data analysis

RNA-seq data analysis was performed to identify differentially expressed genes (DEGs) by assessing the number of reads mapped to the individual gene present in the human reference genome. The workflow of the RNA-seq data analysis was as follows: (1) preprocessing of raw RNA-seq reads and quality validation; and (2) read mapping and normalization.

Preprocessing of raw RNA-seq reads and quality validation

During RNA-seq data preprocessing, the quality of the raw RNA-seq reads was evaluated using FastQC²² and MultiQC²³, providing per-base and per-sequence assessments of the sample’s read quality, overall GC contents, and presence of adaptors, overrepresented k-mers and duplicated reads. In addition, Trimmomatic was used to discard low-quality reads, to trim adaptor sequences and to eliminate poor-quality bases²⁴. The quality control steps and read trimming were iteratively performed to ensure that all low-quality reads and adaptor sequences were removed as much as possible because they would seriously interfere with the read-mapping step. The quality of the cleaned reads was further assessed by generating per-base box plots using FastQC. Next, MultiQC was applied to create a visualization of the output across the various different samples (Fig. 2), which was used to identify any global trends and/or biases that might affect the sequence quality metrices²³.

Read mapping and normalization

Cleaned reads were mapped to the human reference genome (GRCh38) using the splice-aware alignment tool STAR²⁵, followed by BAM file sorting using SAMtools²⁶. The RNA-seq read mapping results are summarized in Table 2.

Table 2 RNA-seq read statistics. All RNA samples had RIN >7.0 and 260/280 > 1.8.

Full size table

The read counts mapped to each gene were further normalized by edgeR²⁷ using the TMM (trimmed mean of M values) normalization method to calculate various effective library sizes. Boxplots and a multidimensional scaling (MDS) plot were generated to determine whether any unusual expression-pattern similarities between the samples due to batch effects might be present²⁸. The boxplots suggested that the variation in the distributions of normalized gene expression levels across different samples was small and that their median values and IQRs (interquartile ranges) were highly similar (Fig. 3). In addition, no batch effect was noted based on the MDS plot, in which the distributions of data points did not reveal any obvious clustering between sibling pairs, between patients, or between normal controls (Fig. 4).

Downstream analysis

Statistical tests were carried out using edgeR based on negative binomial differential expression methods and were performed to identify differentially expressed genes (DEGs) when CGL2 patients were compared to age- and sex-matched controls²⁷. Based on the instructions provided in the user’s guide of edgeR, we used a paired-sample design to detect DEGs while adjusting for any confounding effects caused by differences in age and sex among the patients.

Data Records

The FASTQ files for the raw RNA-seq reads in this study have been deposited in NCBI Sequence Read Archive (SRA) under the study accession SRP281919²⁹. The raw read count dataset was deposited in the NCBI Gene Expression Omnibus (GEO) under accession number GSE159337³⁰ (Table 2). The DEGs of CGL2 patients’ PBMC were deposited in Excel file format in figshare³¹.

Technical Validation

RNA integrity assessment

Total RNA was quantified using a Nanodrop ND-1000 spectrophotometer, and its quality was then assessed using an Agilent 2100 Bioanalyzer according to the manufacturer’s instructions. Acceptable quality values are in the range of 1.8−2.2 for A260/A280 ratios and with an RNA integrity number (RIN) of >7.0.

RNA-seq data quality assessment

The quality of the raw and cleaned RNA-seq reads was evaluated using FastQC²², which ensured that the adaptors were removed from the raw reads. This program also verified that the quality of the cleaned RNA-seq reads was suitable for downstream analyses (Fig. 2).

Code availability

To finish the RNA-seq data preprocessing, quality validation, and mapping to the human reference genome, only public-domain software but no other custom code was used. These software tools and their versions are listed as follows:

1. FastQC v0.11.8 was used for quality assessment of the raw reads and the trimmed reads of RNA-sequencing data: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.

2. Trimmomatic 0.38 was used to remove adaptors and do quality trimming: http://www.usadellab.org/cms/?page=trimmomatic.

3. MultiQC 1.0 was used to perform cross-sample quality assessment of the RNA-sequencing reads: https://multiqc.info/.

4. STAR 2.7 was used to map the cleaned RNA-seq reads to the human reference genome assembly, GRCh38: https://github.com/alexdobin/STAR.

5. edgeR 3.30.3 was used to carry out trimmed mean of M values (TMM) normalization for gene expression quantification, and to find the differentially expressed genes (DEGs) between sample groups: https://bioconductor.org/packages/release/bioc/html/edgeR.html.

References

Zhao, S., Fung-Leung, W. P., Bittner, A., Ngo, K. & Liu, X. Comparison of RNA-Seq and microarray in transcriptome profiling of activated T cells. PLoS One 9, e78644 (2014).
Article ADS Google Scholar
Stark, R., Grzelak, M. & Hadfield, J. RNA sequencing: the teenage years. Nat Rev Genet 20, 631–656 (2019).
Article CAS Google Scholar
Seip, M. & Trygstad, O. Generalized lipodystrophy, congenital and acquired (lipoatrophy). Acta Paediatr Suppl 413, 2–28 (1996).
Article CAS Google Scholar
Berardinelli, W. An undiagnosed endocrinometabolic syndrome: report of 2 cases. J Clin Endocrinol Metab 14, 193–204 (1954).
Article CAS Google Scholar
Agarwal, A. K. et al. AGPAT2 is mutated in congenital generalized lipodystrophy linked to chromosome 9q34. Nat Genet 31, 21–23 (2002).
Article CAS Google Scholar
Magre, J. et al. Identification of the gene altered in Berardinelli-Seip congenital lipodystrophy on chromosome 11q13. Nat Genet 28, 365–370 (2001).
Article CAS Google Scholar
Kim, C. A. et al. Association of a homozygous nonsense caveolin-1 mutation with Berardinelli-Seip congenital lipodystrophy. J Clin Endocrinol Metab 93, 1129–1134 (2008).
Article CAS Google Scholar
Hayashi, Y. K. et al. Human PTRF mutations cause secondary deficiency of caveolins resulting in muscular dystrophy with generalized lipodystrophy. J Clin Invest 119, 2623–2633 (2009).
Article CAS Google Scholar
Lightbourne, M. & Brown, R. J. Genetics of Lipodystrophy. Endocrinol Metab Clin North Am 46, 539–554 (2017).
Article Google Scholar
Matarese, G., Moschos, S. & Mantzoros, C. S. Leptin in immunology. J Immunol 174, 3137–3142 (2005).
Article CAS Google Scholar
Naylor, C. & Petri, W. A. Jr. Leptin Regulation of Immune Responses. Trends Mol Med 22, 88–98 (2016).
Article CAS Google Scholar
Fernandez-Riejos, P. et al. Role of leptin in the activation of immune cells. Mediators Inflamm 2010, 568343 (2010).
Article Google Scholar
Agarwal, A. K. et al. Phenotypic and genetic heterogeneity in congenital generalized lipodystrophy. J Clin Endocrinol Metab 88, 4840–4847 (2003).
Article CAS Google Scholar
Van Maldergem, L. et al. Genotype-phenotype relationships in Berardinelli-Seip congenital lipodystrophy. J Med Genet 39, 722–733 (2002).
Article Google Scholar
Magre, J. et al. Prevalence of mutations in AGPAT2 among human lipodystrophies. Diabetes 52, 1573–1578 (2003).
Article CAS Google Scholar
Ebihara, K. et al. Gene and phenotype analysis of congenital generalized lipodystrophy in Japanese: a novel homozygous nonsense mutation in seipin gene. J Clin Endocrinol Metab 89, 2360–2364 (2004).
Article CAS Google Scholar
Huang, H.-H. et al. A Taiwanese boy with congenital generalized lipodystrophy caused by homozygous Ile262fs mutation in the BSCL2 gene. The Kaohsiung Journal of Medical Sciences 26, 615–620 (2010).
Article CAS Google Scholar
Bhayana, S. et al. Cardiomyopathy in congenital complete lipodystrophy. Clin Genet 61, 283–287 (2002).
Article CAS Google Scholar
Hsu, R. H. et al. Congenital generalized lipodystrophy in Taiwan. J Formos Med Assoc (2018).
Patni, N. & Garg, A. Congenital generalized lipodystrophies–new insights into metabolic dysfunction. Nat Rev Endocrinol 11, 522–534 (2015).
Article CAS Google Scholar
Meehan, C. A., Cochran, E., Kassai, A., Brown, R. J. & Gorden, P. Metreleptin for injection to treat the complications of leptin deficiency in patients with congenital or acquired generalized lipodystrophy. Expert Rev Clin Pharmacol 9, 59–68 (2016).
Article CAS Google Scholar
Andrews, S. FASTQC. A quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2010).
Ewels, P., Magnusson, M., Lundin, S. & Kaller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Article CAS Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Article CAS Google Scholar
Chen, Y. & Meltzer, P. S. Gene Expression Analysis via Multidimensional Scaling. Current Protocols in Bioinformatics 10, 7.11.11–17.11.19 (2005).
Article Google Scholar
Huang, Y. et al. mRNA-seq read files of peripheral blood mononuclear cells from congenital generalized lipodystrophy patients and their gender/aged-matched controls. NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP281919 (2021).
Huang, Y. et al. mRNA-seq read counts of peripheral blood mononuclear cells from congenital generalized lipodystrophy patients and their gender/aged-matched controls. Gene Expression Omnibus https://identifiers.org/geo:GSE159337 (2021).
Huang, Y. et al. Differentially expressed genes found in the mRNA-seq of CGL2 patients’ peripheral blood mononuclear cells. figshare https://doi.org/10.6084/m9.figshare.c.5514768 (2021).
American Psychiatric Association. Diagnostic and statistical manual of mental disorders 4th edn, text rev. (American Psychiatric Association, 2000).
Greenspan, S. Borderline intellectual functioning: an update. Curr Opin Psychiatry 30, 113–122 (2017).
Article Google Scholar

Download references

Acknowledgements

This work was supported by a grant, “Prevention and treatment of rare diseases, 2019”, from the Health Promotion Administration, Ministry of Health and Welfare (MOHW), and Ministry of Science and Technology (MOST 110-2321-B-002-011 -), Taiwan. We thank the Genomics Center for Clinical and Biotechnological Applications of National Core Facility for Biopharmaceuticals, Taiwan (MOST 109-2740-B-010-002), for conducting the high-throughput sequencing. We are grateful to the National Center for High-performance Computing (NCHC), Taiwan, for the computer time and facilities.

Author information

Authors and Affiliations

Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan
Yen-Hua Huang & Tzu-Chien Su
Center for Systems and Synthetic Biology, National Yang-Ming University, Taipei, Taiwan
Yen-Hua Huang
Department of Pediatrics, Children’s Hospital, China Medical University, Taichung, Taiwan
Chung-Hsing Wang
School of Medicine, China Medical University, Taichung, Taiwan
Chung-Hsing Wang
Department of Pediatrics, Ditmanson Medical Foundation Chia-Yi Christian Hospital, Chia-Yi, Taiwan
Siew-Lee Wong
Department of Pediatrics and Medical Genetics, National Taiwan University Hospital, Taipei, Taiwan
Yin-Hsiu Chien, Wuh-Liang Hwu & Ni-Chung Lee
National Center for High-performance Computing, National Applied Research Laboratories, Hsinchu, Taiwan
Yu-Tai Wang

Authors

Yen-Hua Huang
View author publications
You can also search for this author in PubMed Google Scholar
Tzu-Chien Su
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Hsing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Siew-Lee Wong
View author publications
You can also search for this author in PubMed Google Scholar
Yin-Hsiu Chien
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Tai Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wuh-Liang Hwu
View author publications
You can also search for this author in PubMed Google Scholar
Ni-Chung Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.H.H. and N.C.L. designed this study and drafted the manuscript. N.C.L., C.H.W., S.L.W., Y.H.C. and W.L.H. performed the diagnoses, the follow-ups and the sample collection. T.C.S. performed the experiments and the data analysis. Y.H.H. and N.C.L. supervised the experiments. Y.H.H., N.C.L. and Y.T.W. supervised the data analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ni-Chung Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Huang, YH., Su, TC., Wang, CH. et al. RNA-seq of peripheral blood mononuclear cells of congenital generalized lipodystrophy type 2 patients. Sci Data 8, 265 (2021). https://doi.org/10.1038/s41597-021-01040-4

Download citation

Received: 08 January 2021
Accepted: 26 August 2021
Published: 13 October 2021
DOI: https://doi.org/10.1038/s41597-021-01040-4

This article is cited by

Impaired signaling pathways on Berardinelli–Seip congenital lipodystrophy macrophages during Leishmania infantum infection
- Viviane Brito Nogueira
- Carolina de Oliveira Mendes-Aguiar
- Selma Maria Bezerra Jeronimo
Scientific Reports (2024)