Homozygous duplication identified by whole genome sequencing causes LRBA deficiency

Merico, Daniele; Pasternak, Yehonatan; Zarrei, Mehdi; Higginbotham, Edward J.; Thiruvahindrapuram, Bhooma; Scott, Ori; Willett-Pachul, Jessica; Grunebaum, Eyal; Upton, Julia; Atkinson, Adelle; Kim, Vy H. D.; Aliyev, Elbay; Fakhro, Khalid; Scherer, Stephen W.; Roifman, Chaim M.

doi:10.1038/s41525-021-00263-z

Download PDF

Case Report
Open access
Published: 18 November 2021

Homozygous duplication identified by whole genome sequencing causes LRBA deficiency

npj Genomic Medicine volume 6, Article number: 96 (2021) Cite this article

2776 Accesses
3 Citations
8 Altmetric
Metrics details

Subjects

Abstract

In more than one-third of primary immunodeficiency (PID) patients, extensive genetic analysis including whole-exome sequencing (WES) fails to identify the genetic defect. Whole-genome sequencing (WGS) is able to detect variants missed by other genomics platforms, enabling the molecular diagnosis of otherwise unresolved cases. Here, we report two siblings, offspring of consanguineous parents, who experienced similar severe events encompassing early onset of colitis, lymphoproliferation, and hypogammaglobulinemia, typical of lipopolysaccharide-responsive and beige-like anchor (LRBA) or cytotoxic T lymphocyte antigen 4 (CTLA4) deficiencies. Gene-panel sequencing, comparative genomic hybridization (CGH) array, and WES failed to reveal a genetic aberration in relevant genes. WGS of these patients detected a 12.3 kb homozygous tandem duplication that was absent in control cohorts and is predicted to disrupt the reading frame of the LRBA gene. The variant was validated by PCR and Sanger sequencing, demonstrating the presence of the junction between the reference and the tandem-duplicated sequence. Droplet digital PCR (ddPCR) further confirmed the copy number in the unaffected parents (CN = 3, heterozygous) and affected siblings (CN = 4, homozygous), confirming the expected segregation pattern. In cases of suspected inherited immunodeficiency, WGS may reveal a mutation when other methods such as microarray and WES analysis failed to detect an aberration.

The impact of rare and low-frequency genetic variants in common variable immunodeficiency (CVID)

Article Open access 15 April 2021

Atil Bisgin, Ozge Sonmezler, … Mustafa Yilmaz

Whole-genome sequencing of a sporadic primary immunodeficiency cohort

Article 06 May 2020

James E. D. Thaventhiran, Hana Lango Allen, … Kenneth G. C. Smith

Assessment of the gene mosaicism burden in blood and its implications for immune disorders

Article Open access 21 June 2021

Manuel Solís-Moruno, Anna Mensa-Vilaró, … Ferran Casals

Introduction

The LRBA gene encodes the lipopolysaccharide-responsive and beige-like anchor (LRBA) protein, which is highly conserved across species and widely expressed in human tissues^1,2. Mutations in the LRBA gene cause an immunodeficiency encompassing autoimmune and lymphoproliferative features as well as antibody deficiency^2,3. Commonly, these patients present in infancy or childhood with colitis, lymphadenopathy, and recurrent infections⁴. Most mutations described so far are localized throughout the gene and no correlation was found to the clinical presentation⁴. While most mutations resulted in a complete loss of LRBA protein, some had residual expression⁵. LRBA colocalizes with cytotoxic T lymphocyte antigen 4 (CTLA4) in endosomal vesicles and appears to control its turnover⁵.

CTLA4 is expressed on activated and regulatory T cells⁶ and provides an inhibitory proliferative signal by competing with the T cell co-stimulatory receptor CD28⁷. CTLA4 molecules are recycled by trafficking from the membrane to the cytoplasm, and back, through activation cycles⁸. Mutations in LRBA result in reduced CTLA4 expression, thus allowing for unchecked immune dysregulation, providing a plausible explanation for the autoimmune/lymphoproliferative nature of the disorder. Indeed, mutations in CTLA4 result in a similar clinical spectrum to LRBA deficiency⁹. Moreover, patients with LRBA deficiency improve clinically when treated with the CTLA4–immunoglobulin fusion drug abatacept¹⁰.

In a large published cohort with suspected LRBA deficiency, genetic analysis of the LRBA gene including whole-exome sequencing (WES) failed to show a mutation in a significant number of patients⁴, suggesting this technique may fall short on identifying some genetic aberrations. We report here a similar case where whole-genome sequencing (WGS) was used in an attempt to define the diagnosis of LRBA deficiency, while gene-panel sequencing, comparative genomic hybridization (CGH) array, and WES failed to do so.

WGS can be effectively used to detect copy number variants (CNVs) and other structural variants that would be missed by exome sequencing or by genotyping or genome hybridization arrays^11,12, offering a tremendous opportunity to identify a molecular diagnosis for otherwise unresolved cases. Specifically, a pipeline that combines different methods based on read depth (CNVnator¹³ and Estimation by Read Depth with Single-nucleotide variants (ERDS)¹⁴) was shown to be able to detect copy number gains and losses ranging from megabase to kilobase size range with high sensitivity and specificity¹¹.

Results

Patients

Case 1 was born at term to consanguineous parents of Iraqi descent. Chronic watery diarrhea started at the age of 3 months and continued in spite of dietary changes, antibiotic therapy, and periodic management of Clostridium difficile when identified. An endoscopy performed at 18 months revealed complete villus atrophy, focal cryptitis, and chronic lamina propria inflammation. Symptoms improved transiently with corticosteroid treatment, only to resurface at 2 years of age. This time, cytomegalovirus (CMV) was detected in the gut requiring treatment with ganciclovir for 3 months. In parallel, he developed chronic thrombocytopenia and suffered repeated episodes of pneumonia leading to chronic interstitial lung disease. Since the age of 6 years he received intravenous immunoglobulin (IVIG) for hypogammaglobulinemia. At the age of 8 years he developed severe multiorgan serositis with pericardial and pleural effusions and ascites. In addition to vast effusions, imaging detected markedly enlarged lymph nodes in the neck and chest. Enteritis worsened and he eventually required parenteral nutrition or G-tube feeding. He subsequently developed protracted fever, neutropenia, and thrombocytopenia suggestive of hemophagocytic lymphohistiocytosis (HLH). The diagnosis was confirmed by the laboratory findings of elevated CD163 and IL-2, as well as hemophagocytosis identified in a bone marrow biopsy. CMV and pseudomonas were cultured from his blood samples. In spite of immunosuppressive treatment, HLH continued to deteriorate causing liver failure and ultimately leading to his death due to multiorgan failure. While the potential diagnosis of LRBA or CTLA4 deficiency was raised, repeated attempts to reach a genetic diagnosis, including WES, have failed. With no definitive diagnosis, the family refrained from considering a hematopoietic stem cell therapy (HSCT).

Case 2 is the younger sibling of case 1. Like his older brother, he developed chronic diarrhea, thrombocytopenia, and marked generalized lymphadenopathy and splenomegaly by the age of 3 years. Endoscopy and colonoscopy performed at 7 years showed marked pan enteritis with flattened villi in the duodenum and colon, and diffuse inflammatory infiltrates. He too was diagnosed with hypogammaglobulinemia and received IVIG replacement but had consistent CMV and Epstein-Barr virus (EBV) viremia. He subsequently developed chronic liver disease and a low-grade HLH which gradually worsened at the age of 9 years. Following a partial response to treatment with sirolimus he received a HSCT from an unrelated donor, in the absence of a HLA-matched related donor. After myeloablative conditioning, he was fully engrafted but developed severe stage 4 graft versus host disease (GvHD) at 1 month post-transplant leading to hepatocellular damage and vanishing bile duct syndrome. He died at the age of 10 years due to severe gut and liver GvHD and uncontrolled gastrointestinal bleeding.

Whole-genome sequencing variant identification

The two affected siblings, both male, underwent WGS on the Illumina HiSeqX platform, with PCR-free library preparation and 150 bp paired-end reads, resulting in an average genome coverage of 35x and with over 98% of reads aligning to the reference genome.

First, we prioritized substitutions and small insertion/deletion (indel) variants that were rare (frequency <5%) and that impacted exonic sequence directly, or that could impact it indirectly by altering splicing or other regulatory sequences¹⁵. This resulted in 15,536 and 15,839 variants in the two siblings, of which 7897 were shared. We then sorted rare variants into five groups based on mode of inheritance (homozygous, X-chromosome hemizygous, potentially compound heterozygous, autosomal dominant, and potentially dominant constrained genes) and flagged genes implicated in primary immunodeficiency (PID, 400 genes) or predicted to have potential implication in PID (2402 additional genes at a 80% recall cutoff). Focusing on variants present in both siblings and of high quality, we identified 10 homozygous variants, 4 X-chromosome hemizygous variants with allele frequency <0.0001, and 4 variants in two genes forming potential compound heterozygous sets with frequency product <0.0001. None of these variants impacted a PID gene or a gene predicted to be implicated in PID. We also identified 11 variants of high quality, absent from the Genome Aggregation Database (gnomAD), occurring in autosomal dominant genes, and present in both siblings; none occurred in a PID gene, and only one occurred in a gene predicted to be potentially implicated in PID, but closer inspection revealed a mismatched disorder (RUNX2, Cleidocranial or metaphyseal dysplasia). In addition, we identified 6 variants of high quality, absent from gnomAD, and present in both siblings occurring in constrained genes that could act as dominant, of which none was predicted to be implicated in PID (see Supplementary Data 1). Low-quality variants, variants with higher frequencies, and variants not shared by the two siblings were also inspected, but no candidate was found. In conclusion, no rare substitutions or small indel provided an explanation of the patients’ immune condition.

CNVs of size ≥1 kb were detected using WGS read depth¹¹ and prioritized based on frequency, gene impact, and gene annotations. The two siblings shared only one rare variant impacting the exonic sequence of a gene implicated in PID or predicted to be potentially implicated in PID. In both siblings, the variant was detected on chromosome 4, with start position 151,516,001 and end position 151,529,000 (hg19/b37 coordinates) and was estimated to have copy number 4 (see Supplementary Data 2). This duplication was not observed in over 2500 unrelated parents of autism probands, sequenced by an Illumina platform and with CNV calls based on same read depth pipeline¹⁶, or in the Database of Genomics Variants (DGV) gold-standard dataset (23,300 subjects of various ethnicities)¹⁷, whereas Genome Aggregation Database - Structural Variants (gnomAD-SV) v2.1 (14,891 subjects)¹² includes only a larger (131 kb) and ultra-rare duplication overlapping this locus (allele frequency ~0.00005). We additionally assessed the occurrence of any structural variation disrupting the LRBA gene in an internal database of 6941 genomes of predominantly Arab/Middle Eastern ethnicities, sequenced to a minimum of 30x depth¹⁸, but could not detect any. Inspection of the read alignments (BAM file) suggested more accurate coordinates for the duplication, demarcated by sharp transition of read depth, and that the duplication occurs in tandem (see Fig. 1). A 3 bp microhomology (ACT) was present at the start and at the end of the duplicated sequence, suggesting a mechanistic explanation for the duplication;¹⁹ Sanger sequencing later revealed that the microhomology is included only at the start but not at the end of the duplication, and thus the correct duplication coordinates are chr4:151,516,307-151,528,645 (length 12,339 bp; see next section for details). Since the duplication overlaps exons 38–39 (of 58 total exons) and surrounding intronic sequence of the LRBA transcript NM_006726.4 (which is predicted to be the principal transcript by APPRIS²⁰), and since exon 39 is in-frame (length 33 bp) whereas exon 38 is out-of-frame (length 125 bp), we expect this tandem duplication to shift the LRBA reading frame and result in complete loss of function. It is noteworthy that exon 39 is not present in the Ensembl transcript ENST00000651943 (also predicted to be principal by APPRIS), and that GTEx junctional counts suggest a very low inclusion percentage (see Supplementary Fig. 1). However, the inclusion or exclusion of exon 39 does not alter the frameshift effect. In addition, the presence of >300 bp intronic sequence around the exons suggests that they will be spliced correctly in the tandem-duplicated region²¹. Finally, we observed that the ±10 kb region around the duplicated sequence is homozygous in both siblings (97–100% homozygous variants), in contrast to the overall rates for chromosome 4 (40–43%). This suggested that both the maternal and paternal allele present a tandem duplication and that both copies of LRBA are disrupted, which is consistent with the autosomal recessive mode of inheritance reported for LRBA and the disorder “Immunodeficiency, common variable, 8, with autoimmunity” (OMIM ID 614700). In conclusion, this LRBA multi-exonic duplication was considered a very compelling candidate to explain the immune disorder in the two siblings. We then proceeded to experimentally validate this variant and to confirm its correct segregation in the family.

**Fig. 1: Read alignments at the copy number gain locus of chromosome 4.**

Variant validation and family segregation analysis

We performed several experiments to demonstrate the presence of the tandem duplication and its correct segregation in the family. First, we performed PCR to demonstrate the presence of the junction between the reference sequence and the tandem-duplicated sequence in the two affected siblings and in their parents. Primers were designed to generate a product only in the presence of the tandem duplication (see Fig. 2a); an amplification product of the expected size was present in all four family members but not in unrelated controls (see Fig. 2b).

**Fig. 2: Validation and segregation experiments.**

Then, we confirmed the sequence of the PCR-amplified junction by Sanger sequencing. Whereas WGS read alignments suggested that the duplicated sequence spans position 151,516,307–151,528,648 and includes the ACT microhomology sequence at both ends, Sanger sequencing of the junction showed the presence of the ACT sequence only once (see Fig. 2d), thus the correct coordinates are chr4:151,516,307–151,528,645. This discrepancy is the consequence of WGS reads being aligned to the reference sequence as opposed to the alternate sequence with the tandem duplication; therefore, reads spanning the tandem duplication junction extend into the ACT sequence after the end of the duplicated sequence, rather than being split-aligned to the beginning of the duplicated sequence where they belong (see Fig. 2c). Finally, we performed droplet digital PCR (ddPCR) with the TaqMan copy number assay to demonstrate that the duplication locus has copy number 2 in control samples, copy number 3 in the parents, and copy number 4 in the affected sibling (see Fig. 2e). In combination with the PCR and Sanger results, this conclusively proves that the tandem duplication alters both LRBA alleles in the affected siblings, but not in the parents.

Detectability by other genomics platforms

Based on probe coverage reported by the DGV¹⁷, this duplication may be detected by Affymetrix CytoScan HD, but not by other single-nucleotide polymorphisms (SNP) or CGH array platforms (Agilent 244k, Affymetrix SNP Array 5.0, Affymetrix SNP Array 6.0, Illumina HumanHap 300, Illumina HumanHap 550, Illumina 610 Quad, Illumina HumanHap 650Y, Illumina Human 660 W, Illumina HumanHap 1 M) (see Supplementary Fig. 2). It is also noteworthy that, if detected by Cytoscan HD, it would pass previously established “research-grade” but not “clinical-grade” quality thresholds²². Reliable detection of CNVs from WES can be accomplished when CNVs span at least three exons^23,24, thus this duplication would not be detectable. In addition, for both WES and array platforms, follow-up experiment would be required to determine its tandem configuration, whereas this is readily evident from WGS read alignment.

Discussion

Mutations in the LRBA gene located on 4q31.3 and encoding the LRBA proteins are associated with an autosomal recessive immunodeficiency (OMIM#614700). The hallmarks of this deficiency consist of hypogammaglobulinemia, lymphoproliferation, and autoimmunity. Patients frequently present early in infancy with recurrent infections, lymphadenopathy, and enlarged spleen and liver, as well as a variety of autoimmune features including inflammatory bowel disease, autoimmune cytopenias, diabetes, and autoimmune hepatitis².

LRBA, a cytoplasmic protein, interacts with CTLA4, and defects in LRBA interfere with its expression, thus mimicking CTLA4 deficiency⁹. Indeed, the two conditions share clinical manifestations and respond well to treatment with abatacept, the CTLA4–immunoglobulin fusion drug¹⁰.

The gold standard of diagnosing LRBA deficiency is demonstrating a genetic aberration. Indeed, definitive diagnosis in a large cohort of phenotypical LRBA deficiency was attained by demonstrating biallelic mutations in the LRBA gene⁴. Deleterious mutations were located throughout the whole gene and most variants were missense, indels, splice site, or nonsense mutations³ with rare cases of uniparental isodisomy²⁵.

Evaluation of LRBA and CTLA4 protein expression may aid in the evaluation, but are themselves insufficient to establish a diagnosis²⁶. Clinical tests are not widely available, and frequently the results are inconclusive, limiting their use in clinical practice.

We report here two siblings who had a similar disease course which was typical of previously reported cases of LRBA/CTLA4 deficiency. They had early onset colitis, recurrent infections associated with hypogammaglobulinemia, and enlarged liver, spleen and lymph nodes. In spite of this classic LRBA deficiency-associated phenotype, Sanger sequencing and WES failed to identify a mutation in either the LRBA or CTLA4 gene.

We have subsequently performed WGS that revealed a novel 12.3 kb length tandem duplication on chromosome 4 impacting the exonic sequence of the LRBA gene. Both brothers were homozygous for this variant while parents carried the change only on one allele. This duplication overlaps exons 38–39 and surrounding intronic sequence of the LRBA principal transcript. Since exon 39 is in-frame whereas exon 38 is out-of-frame, this tandem duplication was expected to shift the LRBA reading frame, resulting in loss of function. Moreover, the region adjacent to the duplicated sequence was homozygous in both siblings suggesting both maternal and paternal copies are equally disrupted, which is expected in an autosomal recessive condition.

We then went on to validate the variant by demonstrating the presence of the junction between reference sequence and the tandem duplication. This was confirmed by Sanger sequencing of the PCR-amplified junction. Copy number was then examined by performing ddPCR, verifying that while 2 copies existed in wild-type controls, 3 and 4 copies were deleted in parents and patients, respectively. Together, these experiments show conclusively that the tandem duplication alters both alleles in the affected siblings and correctly segregates in the family.

Intragenic exon duplications often lead to loss or alteration of function, as suggested by the depletion observed in the gnomAD-SV dataset for this type of variants in genes under constraint for heterozygous protein-truncating variants. While this depletion is more modest than for copy number losses, the ultimate effect of intragenic exon duplication depends on the sequence context and presence or absence of frameshift¹². Intragenic homozygous duplications have been previously reported to cause recessive disorders, although more rarely than other types of structural variants²⁷. Intragenic duplications expected to shift the reading frame or to cause other profound reading frame alterations have been previously reported as (likely) pathogenic for immune^28,29,30 and non-immune disorders^31,32.

This specific duplication could not be found among individuals of European descent or among a large database of WGS individuals of Arab and Middle Eastern ancestries, and thus may be extremely rare or restricted to one kin.

These results explain why we were unable to detect the mutation by the CGH array platform, as deduced from DGV probe coverage, nor by WES, which is able to detect larger CNV’s spanning at least 3 exons.

Together, the typical clinical features coupled with the convincing genomic analysis provided a compelling argument for the identified duplication causing LRBA deficiency in these patients. The diagnosis of LRBA deficiency is critically important in order to chart a course of treatment. Both innovative CTLA4–immunoglobulin construct as well as curative HSCT are effective in preventing severe outcome if applied early. In this report we have shown that the delay in diagnosis due to failure to detect a pathogenic LRBA variant by WES and Sanger sequencing likely contributed to the demise of the patients. This clearly highlights the need to make WGS a clinically accessible test. Identifying the mutation by WGS aided in genetic counseling as well as treatment planning for future offspring in the affected family.

Methods

Patients

All patient data and samples were obtained in accordance with the Research Ethics Board at The Hospital for Sick Children. Patient data was compiled from medical records and entered into the Primary Immunodeficiency Registry and Tissue Bank (REB protocol no. 1000005598). Written informed consent was obtained from all participants for genetic testing, including WGS.

Whole-genome sequencing

First, 6 μg of genomic DNA were submitted to TCAG (Toronto, Canada) for genomic library preparation and WGS. TCAG quantified DNA samples using Qubit High Sensitivity Assay and checked sample purity using Nanodrop OD260/280 ratio. Then, 700 ng of DNA was used as input material for library preparation using the Illumina TruSeq PCR-free DNA Library Prep Kit following the manufacturer’s recommended protocol. In brief, DNA was fragmented to 400 bp on average using sonication on a Covaris LE220 instrument; fragmented DNA was then end-repaired, A-tailed, and indexed TruSeq Illumina adapters with overhang-T were added to the DNA; libraries were validated on a Fragment Analyzer Using High Sensitivity NGS Kit to check for size and absence of primer dimers, and quantified by qPCR using Kapa Library Quantification Illumina/ABI Prism Kit protocol (KAPA Biosystems). Validated libraries were pooled in equimolar quantities and paired-end sequenced on an Illumina HiSeq X platform following Illumina’s recommended protocol to generate paired-end reads of 150 bases in length.

Whole-genome read alignment and variant calling

Base calling was performed using the HiSeq Analysis Software. Reads were mapped to the b37 reference sequence using the BWA-MEM algorithm³³. Duplicate reads were marked using Picard Tools. Local realignment and base quality score recalibration were performed using GATK 3.7³⁴. Variants were called using HaplotypeCaller (GATK 3.7).

CNVs, comprising losses and gains with size ≥1 kb, were called using a pipeline based on the read depth callers ERDS and CNVnator¹¹. Gains with size 1–5 kb supported only by ERDS were included.

Variant annotation and prioritization

For substitutions and small indels, variants were defined as 5% rare when they had allele frequency ≤5% in all gnomAD 2.1.1 ethnic populations³⁵. High-quality variants were defined as having GATK filter PASS, DP ≥ 6 and not overlapping a segmental duplication. In addition, heterozygous single-nucleotide substitutions were required to have FisherStrand ≤ 60.0, mapping quality ≥ 40.0, MQRankSum ≥ −12.5, and ReadPosRankSum ≥ −8.0; homozygous single-nucleotide substitutions were required to have FisherStrand ≤ 60.0 and mapping quality ≥ 40.0; heterozygous indels were required to have FisherStrand ≤ 200 and ReadPosRankSum ≥ −20; homozygous indels were required to have FisherStrand ≤ 200. Unless explicitly indicated, all variants called by GATK haplotype caller were used. Annovar (April 2018 version)³⁶ was used to determine the variant effect on coding and non-coding genes, using the RefSeq database (Annovar refGene database, downloaded August 2019). We initially selected 5% rare variants impacting coding exons, non-coding exons, 5′ or 3′ UTRs, regions 1 kb upstream of transcription starts site or 1 kb downstream of transcription end site, variants within 100 bp of a splice site or predicted to alter splicing by Spidex (absolute dPSI > 2)²¹, dbscSNV (ADA or RF score > 0.6)³⁷, or SpliceAI³⁸. When considering specific modes of inheritance, coding and splicing variants were further prioritized based on their impact scores³⁹. Constrained genes were defined as having gnomAD observed/expected LOF variants < 0.25 or observed/expected missense variants < 0.7 (where LOF indicates variants predicted to result in complete loss of function)³⁵.

For CNVs, high-quality CNVs¹¹, including gains with size 1–5 kb supported only by ERDS, were deemed rare if they had frequency <1% with respect to CNVs with >50% reciprocal overlap detected in parents of individuals with autism spectrum disorder in the MSSNG whole-genome sequencing project and sequenced by HiSeq 2000 or HiSeq X¹⁶.

Known PID were derived from the Genomics England (GE) primary immunodeficiency panel v2.368 (https://panelapp.genomicsengland.co.uk/panels/398/). Potential implication in PID was predicted using a generalized boosted regression model. The model was trained using the R package gbm version 2.1.8 and 4-fold cross-validation. The PID GE panel was used as known labels for supervised classification, and the features for prediction were constructed from the Human Protein Atlas RNA expression consensus dataset⁴⁰, immune-related Gene Ontology annotations⁴¹ and pathways (KEGG⁴², Reactome⁴³) and MGI immunity-related phenotypes⁴⁴. At 80% recall, 2402 non-PID genes were predicted to be potentially implicated in PID.

APPRIS 2020_06.v32 was used to determine principal transcripts²⁰.

We used the 2016-05-15 release of the DGV gold-standard, which includes 10,451 subjects of undetermined ethnicity, 9022 European, 1378 African, 1030 East Asian, 569 South Asian, 339 Latin American, 178 Middle Eastern, and 333 of other ethnicities.

Variant validation and segregation

The following primers were designed to amplify the sequence spanning the tandem duplication junction: LRBA-Junction-FW primer sequence, ACACGGCAGCAACATACA (hg19/b37 coordinates chr4:151528419-151528436); LRBA-Junction-RV primer sequence, CTAGGGATGACAGATCATGTAAAG (hg19/b37 coordinates chr4:151516554-151516577). The PCR reaction was run using 20 ng of genomic DNA per sample in a 20 μl PCR reaction, with Qiagen HotStarTaq polymerase. Samples were run for 15 min at 95 °C (initial denaturation); followed by 36 cycles of (1) 30 s at 95 °C (denaturation), (2) 30 s at 60 °C (annealing), (3) 1 min at 70 °C (extension); followed by 10 min at 70 °C (final extension). The PCR product was visualized on a 2% agarose gel. All gels were derived from the same experiment and were processed in parallel.

ddPCR with TaqMan assays was used to determine copy number within the duplication in the four family members, and specifically using the probe Hs00518898_cn (hg19/b37 probe coordinate chr4:151520163, overlaps exon 38, https://www.thermofisher.com/order/genome-database/details/cnv/Hs00518898_cn). The TaqMan copy number reference assay based on human RNase P was used as an endogenous control for calibration. The assay was performed with one biological replicate per subject and results were analyzed using QuantaSoft Version 1.7.4.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Whole-genome sequencing data that support the findings of this study have been deposited in dbGaP with the accession code “phs002557”.

References

Wang, J. W., Howson, J., Haller, E. & Kerr, W. G. Identification of a novel lipopolysaccharide-inducible gene with key features of both A kinase anchor proteins and chs1/beige proteins. J. Immunol. 166, 4586–4595 (2001).
Article CAS PubMed Google Scholar
Lopez-Herrera, G. et al. Deleterious mutations in LRBA are associated with a syndrome of immune deficiency and autoimmunity. Am. J. Hum. Genet. 90, 986–1001 (2012).
Article CAS PubMed PubMed Central Google Scholar
Habibi, S. et al. Clinical, immunologic, and molecular spectrum of patients with LPS-responsive beige-like anchor protein deficiency: a systematic review. J. Allergy Clin. Immunol. Pract. 7, 2379–2386.e2375 (2019).
Article PubMed Google Scholar
Gamez-Diaz, L. et al. The extended phenotype of LPS-responsive beige-like anchor protein (LRBA) deficiency. J. Allergy Clin. Immunol. 137, 223–230 (2016).
Article CAS PubMed Google Scholar
Lo, B. et al. AUTOIMMUNE DISEASE. Patients with LRBA deficiency show CTLA4 loss and immune dysregulation responsive to abatacept therapy. Science 349, 436–440 (2015).
Article CAS PubMed Google Scholar
Takahashi, T. et al. Immunologic self-tolerance maintained by CD25⁺CD4⁺ regulatory T cells constitutively expressing cytotoxic T lymphocyte-associated antigen 4. J. Exp. Med. 192, 303–310 (2000).
Article CAS PubMed PubMed Central Google Scholar
Walker, L. S. K. & Sansom, D. M. The emerging role of CTLA4 as a cell-extrinsic regulator of T cell responses. Nat. Rev. Immunol. 11, 852–863 (2011).
Article CAS PubMed Google Scholar
Linsley, P. S. et al. Intracellular trafficking of CTLA-4 and focal localization towards sites of TCR engagement. Immunity 4, 535–543 (1996).
Article CAS PubMed Google Scholar
Alkhairy, O. K. et al. Spectrum of phenotypes associated with mutations in LRBA. J. Clin. Immunol. 36, 33–45 (2015).
Article PubMed Google Scholar
Kiykim, A. et al. Abatacept as a long-term targeted therapy for LRBA deficiency. J. Allergy Clin. Immunol. Pract. 7, 2790–2800.e2715 (2019).
Article PubMed PubMed Central Google Scholar
Trost, B. et al. A comprehensive workflow for read depth-based identification of copy-number variation from whole-genome sequence data. Am. J. Hum. Genet. 102, 142–155 (2018).
Article CAS PubMed PubMed Central Google Scholar
Collins, R. L. et al. A structural variation reference for medical and population genetics. Nature 581, 444–451 (2020).
Article CAS PubMed PubMed Central Google Scholar
Abyzov, A., Urban, A. E., Snyder, M. & Gerstein, M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 21, 974–984 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhu, M. et al. Using ERDS to infer copy-number variants in high-coverage genomes. Am. J. Hum. Genet. 91, 408–421 (2012).
Article CAS PubMed PubMed Central Google Scholar
Merico, D. Whole exome and genome sequencing for Mendelian immune disorders: from molecular diagnostics to new disease variant and gene discovery. LymphoSign J 3, 135–158 (2016).
Google Scholar
Yuen, R.K.C. et al. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat. Neurosci. 20, 602–611 (2017).
Article CAS PubMed Central Google Scholar
MacDonald, J. R., Ziman, R., Yuen, R. K., Feuk, L. & Scherer, S. W. The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 42, D986–D992 (2014).
Article CAS PubMed Google Scholar
Rossi, N. et al. Ethnic-specific association of amylase gene copy number with adiposity traits in a large Middle Eastern biobank. npj Genom. Med 6, 8 (2021).
Article CAS PubMed PubMed Central Google Scholar
Vissers, L. E. et al. Rare pathogenic microdeletions and tandem duplications are microhomology-mediated and stimulated by local genomic architecture. Hum. Mol. Genet. 18, 3579–3593 (2009).
Article CAS PubMed Google Scholar
Rodriguez, J. M. et al. APPRIS: annotation of principal and alternative splice isoforms. Nucleic Acids Res. 41, D110–D117 (2013).
Article CAS PubMed Google Scholar
Xiong, H. Y. et al. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science 347, 1254806 (2015).
Article PubMed Google Scholar
Uddin, M. et al. A high-resolution copy-number variation resource for clinical and population genetics. Genet. Med. 17, 747–752 (2015).
Article PubMed Google Scholar
Krumm, N. et al. Copy number variation detection and genotyping from exome sequence data. Genome Res. 22, 1525–1532 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pfundt, R. et al. Detection of clinically relevant copy-number variants by exome sequencing in a large cohort of genetic disorders. Genet. Med. 19, 667–675 (2017).
Article CAS PubMed Google Scholar
Soler-Palacin, P. et al. LRBA deficiency in a patient with a novel homozygous mutation due to chromosome 4 segmental uniparental isodisomy. Front. Immunol. 9, 2397 (2018).
Article PubMed PubMed Central Google Scholar
Gamez-Diaz, L. et al. Rapid flow cytometry-based test for the diagnosis of lipopolysaccharide responsive beige-like anchor (LRBA) deficiency. Front. Immunol. 9, 720 (2018).
Article PubMed PubMed Central Google Scholar
Yuan, B. et al. CNVs cause autosomal recessive genetic diseases with or without involvement of SNV/indels. Genet. Med. 22, 1633–1641 (2020).
Article CAS PubMed PubMed Central Google Scholar
Martin-Rodriguez, S. et al. Two novel variants in the ATM gene causing ataxia-telangiectasia, including a duplication of 90 kb: utility of targeted next-generation sequencing in detection of copy number variation. Ann. Hum. Genet. 83, 266–273 (2019).
Article CAS PubMed Google Scholar
Buckley, R. M. et al. Assisted reproduction mediated resurrection of a feline model for Chediak-Higashi syndrome caused by a large duplication in LYST. Sci. Rep. 10, 64 (2020).
Article CAS PubMed PubMed Central Google Scholar
Roth, I. L. et al. Novel NCF2 mutation causing chronic granulomatous disease. J. Clin. Immunol. 40, 977–986 (2020).
Article CAS PubMed Google Scholar
Schwaibold, E. M. et al. Intragenic duplication of EHMT1 gene results in Kleefstra syndrome. Mol. Cytogenet 7, 74 (2014).
Article PubMed PubMed Central Google Scholar
Miller, D. E., Squire, A. & Bennett, J. T. A child with autism, behavioral issues, and dysmorphic features found to have a tandem duplication within CTNND2 by mate-pair sequencing. Am. J. Med. Genet. A 182, 543–547 (2020).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Article PubMed PubMed Central Google Scholar
McKenna, A. et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Article PubMed PubMed Central Google Scholar
Jian, X., Boerwinkle, E. & Liu, X. In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res. 42, 13534–13544 (2014).
Article CAS PubMed PubMed Central Google Scholar
Jaganathan, K. et al. Predicting splicing from primary sequence with deep learning. Cell 176, 535–548.e524 (2019).
Article CAS PubMed Google Scholar
Merico, D. et al. Compound heterozygous mutations in the noncoding RNU4ATAC cause Roifman Syndrome by disrupting minor intron splicing. Nat. Commun. 6, 8718 (2015).
Article CAS PubMed Google Scholar
Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article PubMed Google Scholar
Gene Ontology, C. et al. Gene Ontology annotations and resources. Nucleic Acids Res. 41, D530–D535 (2013).
Article Google Scholar
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).
Article CAS PubMed Google Scholar
Fabregat, A. et al. The Reactome pathway Knowledgebase. Nucleic Acids Res. 44, D481–D487 (2016).
Article CAS PubMed Google Scholar
Eppig, J. T., Blake, J. A., Bult, C. J., Kadin, J. A., Richardson, J. E. & Mouse Genome Database Group. The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease. Nucleic Acids Res. 43, D726–D736 (2015).
Article CAS PubMed Google Scholar
Robinson, J. T. et al. Integrative Genomics Viewer. Nat. Biotechnol. 29, 24–26 (2011).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Whole-genome sequencing was performed by TCAG (The Centre for Applied Genomics, Toronto) with support by the University of Toronto McLaughlin Centre Whole Genome Sequencing Initiative. C.M.R. is supported by Immunodeficiency Canada’s Distinguished Professorship in Immunology, the Program for Immunogenomics and Canadian Centre for Primary Immunodeficiency, and the Jeffrey Modell Foundation. S.W.S. and K.F. are supported by the Qatar National Research Fund awards PPM1-1229-15002 and NPRP10-0202-170320.

Author information

Authors and Affiliations

The Centre for Applied Genomics (TCAG), Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, M5G 0A4, ON, Canada
Daniele Merico, Mehdi Zarrei, Edward J. Higginbotham, Bhooma Thiruvahindrapuram & Stephen W. Scherer
Deep Genomics Inc., Toronto, M5G 1M1, ON, Canada
Daniele Merico
Canadian Center for Primary Immunodeficiency and the Jeffrey Modell Research Laboratory for the Diagnosis of Primary Immunodeficiency, Toronto, M5G1X8, ON, Canada
Yehonatan Pasternak, Ori Scott & Chaim M. Roifman
Division of Immunology and Allergy, Department of Paediatrics, The Hospital for Sick Children, Toronto, M5G 1×8, ON, Canada
Yehonatan Pasternak, Ori Scott, Jessica Willett-Pachul, Eyal Grunebaum, Julia Upton, Adelle Atkinson, Vy H. D. Kim & Chaim M. Roifman
University of Toronto, Toronto, M5S 1A8, ON, Canada
Yehonatan Pasternak, Ori Scott, Eyal Grunebaum, Julia Upton, Adelle Atkinson, Vy H. D. Kim & Chaim M. Roifman
Department of Human Genetics, Sidra Medicine, Doha, Qatar
Elbay Aliyev & Khalid Fakhro
Department of Genetic Medicine, Weill-Cornell Medical College, Doha, Qatar
Khalid Fakhro
Department of Molecular Genetics, University of Toronto, Toronto, M5S 1A8, ON, Canada
Stephen W. Scherer
McLaughlin Centre, University of Toronto, Toronto, M5G 0A4, ON, Canada
Stephen W. Scherer

Authors

Daniele Merico
View author publications
You can also search for this author in PubMed Google Scholar
Yehonatan Pasternak
View author publications
You can also search for this author in PubMed Google Scholar
Mehdi Zarrei
View author publications
You can also search for this author in PubMed Google Scholar
Edward J. Higginbotham
View author publications
You can also search for this author in PubMed Google Scholar
Bhooma Thiruvahindrapuram
View author publications
You can also search for this author in PubMed Google Scholar
Ori Scott
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Willett-Pachul
View author publications
You can also search for this author in PubMed Google Scholar
Eyal Grunebaum
View author publications
You can also search for this author in PubMed Google Scholar
Julia Upton
View author publications
You can also search for this author in PubMed Google Scholar
Adelle Atkinson
View author publications
You can also search for this author in PubMed Google Scholar
Vy H. D. Kim
View author publications
You can also search for this author in PubMed Google Scholar
Elbay Aliyev
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Fakhro
View author publications
You can also search for this author in PubMed Google Scholar
Stephen W. Scherer
View author publications
You can also search for this author in PubMed Google Scholar
Chaim M. Roifman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.M. led the variant prioritization and interpretation, and co-led the manuscript writing; Y.P., O.S., J.W.P., E.G., J.U., V.K., and A.A. made substantial contributions to primary clinical data collection; M.Z. and E.J.H. designed and performed the experiments for variant validation; B.T. contributed to the variant interpretation and coordinated the whole-genome alignment and variant calling; E.A. and K.F. contributed to the variant interpretation; S.W.S. supervised the whole-genome analysis and variant validation; C.M.R. oversaw the clinical care of the patients and co-led the manuscript writing. All authors provided critical review of the paper, have approved the submission and accept responsibility for their contribution.

Corresponding author

Correspondence to Chaim M. Roifman.

Ethics declarations

Competing interests

D.M. is a full-time employee and a shareholder of Deep Genomics Inc. S.W.S. is Editor-in-Chief of npj Genomic Medicine. All other authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Data 1

Supplementary Data 2

Reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Merico, D., Pasternak, Y., Zarrei, M. et al. Homozygous duplication identified by whole genome sequencing causes LRBA deficiency. npj Genom. Med. 6, 96 (2021). https://doi.org/10.1038/s41525-021-00263-z

Download citation

Received: 04 January 2021
Accepted: 21 October 2021
Published: 18 November 2021
DOI: https://doi.org/10.1038/s41525-021-00263-z

This article is cited by

Beyond IBD: the genetics of other early-onset diarrhoeal disorders
- Lorraine Stallard
- Iram Siddiqui
- Aleixo Muise
Human Genetics (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Patients

Whole-genome sequencing variant identification

Variant validation and family segregation analysis

Detectability by other genomics platforms

Discussion

Methods

Patients

Whole-genome sequencing

Whole-genome read alignment and variant calling

Variant annotation and prioritization

Variant validation and segregation

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links