Main

Pulmonary lymphangioleiomyomatosis is a rare, systemic disease that has been classified anywhere from benign, non-neoplastic or hamartomatous, to a low grade, destructive and even metastasizing neoplasm.1, 2, 3 Regardless of its classification, lymphangioleiomyomatosis progressively destroys lung function of afflicted women primarily of childbearing age1, 2 with an estimated median transplant-free survival of 29 years and 10-year survival of 86%.4 Lymphangioleiomyomatosis is a cystic lung disease associated with proliferation of abnormal smooth muscle cells showing coexpression of muscle and melanocytic markers. Lymphangioleiomyomatosis has been included in the wide group of proliferative lesions named perivascular epithelioid cell tumors, given the overlapping morphologic and immunophenotypic findings shared by lymphangioleiomyomatosis cells and perivascular epithelioid cells. Lymphangioleiomyomatosis is characterized by infiltration and proliferation of atypical smooth muscle cells (lymphangioleiomyomatosis cells) into lung structures including the lymphatics, airway walls and interstitial spaces, impairing lung function through the formation of cysts.1, 2 Studies describe lymphangioleiomyomatosis as a systemic disease, with lymphangioleiomyomatosis cells originating from sources outside of the lung and potentially utilizing the lymphatic system to travel to the lung.1, 2, 3, 4, 5

Lymphangioleiomyomatosis occurs both sporadically or in patients with the hereditary disease tuberous sclerosis complex, and is primarily associated with pathogenic alterations in the TSC1 (hamartin) or TSC2 (tuberin) genes.6, 7, 8 These two tumor suppressors, together with TBC1D7, form the tuberous sclerosis complex trimer that negatively regulates mechanistic target of rapamycin signaling, a pathway involved in maintaining normal cell growth/proliferation.9 Germline (hereditary) pathogenic alterations in TSC1 and TSC2 have been identified in ~75–90% of individuals who meet clinical diagnostic criteria for tuberous sclerosis complex.8 Approximately 40% of female patients with tuberous sclerosis complex also meet criteria for lymphangioleiomyomatosis on computed tomographic (CT) scans, a prevalence which was reported to increase to 80% with advancing age, with the majority of pulmonary manifestations of tuberous sclerosis complex matching those of sporadic lymphangioleiomyomatosis.10, 11 The incidence of lymphangioleiomyomatosis is estimated at between 3 and 9 cases per million women with about 15% of them reportedly having tuberous sclerosis complex.12, 13 The average age of diagnosis is in the early 40s with individuals with tuberous sclerosis complex lymphangioleiomyomatosis being diagnosed slightly earlier than those with sporadic Lymphangioleiomyomatosis.12, 13 Individuals with tuberous sclerosis complex lymphangioleiomyomatosis also tend to develop symptoms at a slightly earlier age.13 Tuberous sclerosis complex lymphangioleiomyomatosis primarily encompasses germline alterations in TSC2 and less frequently in TSC1.6, 7, 8, 10, 11, 14, 15, 16 Sporadic lymphangioleiomyomatosis, however, is primarily considered a TSC2 disease.6, 7, 8

Individuals with sporadic lymphangioleiomyomatosis (that is, no clinical diagnosis of tuberous sclerosis complex and no germline alterations in TSC1/2) frequently harbor two somatic TSC2 alterations in their lymphangioleiomyomatosis tissue, consistent with the two-hit tumor suppressor model.17 Biallelic inactivation of either TSC1 or TSC2 is also frequent in tuberous sclerosis complex lymphangioleiomyomatosis, with the first a tuberous sclerosis driving germline mutation and the second a somatic tuberous sclerosis complex lymphangioleiomyomatosis driving mutation.18 The distinction between sporadic and tuberous sclerosis complex lymphangioleiomyomatosis is fundamentally the diagnosis of tuberous sclerosis complex disease, however, this diagnosis is sometimes complex. Firstly, TSC1mutation driven tuberous sclerosis frequently results in a milder form of disease than TSC2 mutation driven tuberous sclerosis.19 Secondly, somatic and germline mosaicism for TSC1/2 mutations have been described in a sub-population of tuberous sclerosis complex patients and their parents, respectively.20

Although the majority of lymphangioleiomyomatosis tissues are characteristically TSC1 or TSC2 mutated, a significant number of cases (10–15%) still present with no mutations in these genes suggesting undetected mutagenic events.20 While large deletions and duplication in TSC1 and TSC2 have been reported in tuberous sclerosis complex,21 the involvement of somatic large genomic rearrangements have not been well studied in lymphangioleiomyomatosis disease. Large genomic rearrangements frequently encompass key driver events of tumor initiation and progression in tumors,22, 23, 24, 25 including noninvasive adenocarcinoma in situ lesions of the lung.25 Thus, we initially hypothesized that large genomic rearrangements can occur somatically in the lymphangioleiomyomatosis cell genome and could play a role in disease pathogenesis through their impact on genes in the mechanistic target of rapamycin or related pathways. We also hypothesized that these somatic rearrangements could have been missed in previous lymphangioleiomyomatosis cell genomic studies based on point mutations and/or local copy-number changes in the tuberous sclerosis complex genes and thus could define somatic variations in cases where no mutations were previously detected. Through the partnering of laser capture microdissection and whole-genome amplification techniques we have developed robust protocols to enable genome-wide profiling of structural variation in defined cell populations with limited cell numbers.23, 25, 26, 27 Through the application of whole-genome mate pair sequencing, we investigated the genomic landscape of enriched populations of lymphangioleiomyomatosis cells and adjacent control histologically normal smooth muscle cells from the lung tissues of five sporadic lymphangioleiomyomatosis patients.

Materials and methods

Tissue Immunohistochemistry

Fresh frozen lung tissues from five sporadic lymphangioleiomyomatosis cases were obtained from the national disease research interchange tissue bank. No tuberous sclerosis complex was detailed for any of the patients in the national disease research interchange tissue bank clinical notes (Supplementary Table 1). No national disease research interchange tissue bank tissue codes for tuberous sclerosis complex were listed for any patient upon tissue banking. In the absence of available formalin fixed tissue from the five cases, five-micron sections were cut from these fresh frozen lung tissue samples and stained with hematoxylin eosin stain to evaluate the morphologic features and verify the diagnosis of lymphangioleiomyomatosis. Plump spindle-shaped myoid cells with pale eosinophilic cytoplasm characterize the morphologic features of lymphangioleiomyomatosis cells. As the majority of lymphangioleiomyomatosis cells are positive for the melanocytic marker HMB-4528 each case also underwent a standard immunohistochemical staining with the HMB-45 anti-human monoclonal mouse antibody (Dako, Carpinteria, CA, USA; 1:100 dilution) using 3,3′-diaminobenzidine as chromogen and hematoxylin as counterstaining.

DNA Isolation and Sequencing

Guided by the hematoxylin eosin and HMB-45 staining, laser capture microdissection was used to independently isolate enriched populations of lymphangioleiomyomatosis cells and control histologically normal smooth muscle cells from sequential 10-micron fresh frozen tissue sections of each patient. Laser capture microdissection of the control histologically normal smooth muscle cells was performed on the same laser capture microdissection slide as the lymphangioleiomyomatosis cells, but in areas where no abnormal lymphangioleiomyomatosis cell pathology was observed. Limitations and the potential for lymphangioleiomyomatosis cell contamination in this control are presented in discussion. DNA was amplified directly from the captured cells by a single-step procedure using a modified Qiagen Repli-g protocol and 1 μg used to assemble indexed Illumina Nextera Mate Pair libraries, as previously described.23, 24, 25, 26, 27 Libraries were sequenced two per lane on the HiSeq 2000 (2 × 101 bp). Paired reads were mapped to the Hg38 reference genome as previously described.21, 29 Discordant mate pairs mapping >15 kb apart or in different chromosomes were selected for further analysis. Genomic breakpoint junctions detected with 7 or greater mate pair read associates were considered to have a high confidence and low probability of being false positives. A mask was used to eliminate common variants and discordant mate pairs from experimental or algorithmic errors.29 Primers spanning detected fusion junctions were used in PCR validations on lymphangioleiomyomatosis, smooth muscle cells and benign epithelial tissue (when available) and an independent human Genomic DNA control (G304A; Promega, Madison, WI, USA). TSC1/2 mutation panel: DNA from LAM tissues for each case was analyzed for mutations in the TSC1 and TSC2 genes using a gene panel. DNA from the control smooth muscle cell tissues for each case was also tested to support the clinically reported absence of tuberous sclerosis disease in these patients. Next-generation sequencing libraries were prepared using 50 ng of whole-genome amplified DNA and a GeneRead DNAseq Custom Panel V2 kit (Qiagen) encompassing the coding regions of 50 cancer-associated genes: AKT1, ATRX, BRAF, CDKN2A, CDKN2B, CIC, CTNNB1, DAXX, EGFR, FGFR1, FGFR2, FGFR3, FUBP1, GNA11, GNAQ, GNAS, H3F3A, IDH1, IDH2, JAK2, MYBL1, MYC, MYCN, NF1, NF2, NOTCH1, NOTCH2, PDGFRA, PIK3CA, PIK3R1, PIK3R2, PTCH1, PTEN, RB1, SDHA, SDHB, SDHC, SDHD, SMARCA4, SMARCB1, SMO, STAT3, SUFU, TERT, TET1, TET2, TP53, TSC1, TSC2 and WT1. PCR products were further processed using the TruSeq Nano DNA Library Preparation kit, final libraries were pooled equimolar and underwent 2 × 101 bp sequencing on HiSeq 2500. Raw sequencing data was processed by a CLC Bio Genomics Server using custom-built bioinformatics pipelines. Variants with ≥10% mutant allele frequency were reviewed to determine pathogenicity status following the current standards and guidelines for the interpretation of sequence variants.30

Results

Histological Assessment of Lymphangioleiomyomatosis Tissues

Abnormal smooth muscle cells highly indicative of lymphangioleiomyomatosis cells were observed in each case by pathology review of hematoxylin eosin stained tissues (Figure 1; Table 1). HMB-45 immunostaining was positive in these abnormal smooth muscle cells in all but one case (LAM2). There was strong HMB-45 staining in LAM3, LAM4 and LAM5, with weaker staining in LAM1. A representative hematoxylin eosin stain from the fresh frozen tissue is presented in Figure 1f. Sufficient numbers of lymphangioleiomyomatosis cells and control histologically normal smooth muscle cells were successfully isolated from each tissue to yield genomic DNA through the direct whole-genome amplified methodology.

Figure 1
figure 1

Histological review of lymphangioleiomyomatosis tissues. Immunohistochemical staining images of lymphangioleiomyomatosis tissues from cases LAM1-LAM5 (a–e) with the HMB-45 antibody. HMB-45 staining was absent from LAM2 (b). Representative hematoxylin eosin staining of LAM5 from fresh frozen tissue (f).

Table 1 Summary of histological review of LAM tissues and results of a gene sequencing panel for TSC1 and TSC2 mutations

TSC1 and TSC2 Mutation Profiling

While no TSC1 mutations were observed, somatic mutations in the TSC2 gene were observed in three lymphangioleiomyomatosis tissues, LAM1, LAM2 and LAM5 (Table 1), which were not detected in the germline control normal smooth muscle cells. LAM1 and LAM2 each contained two damaging mutations in the TSC2 gene, supporting the two-hit tumor suppressor model. A pathogenic TSC2 splice variant mutation with additional frame shift and premature stop mutations were identified in LAM1 and LAM2, respectively. A single somatic frame shifting pathogenic mutation was observed in TSC2 of LAM5. Just one further pathogenic mutation in the additional genes in the panel was observed in TET2 of LAM5, which was not present in the associated smooth muscle cells (Table 1). No germline pathogenic variations were detected in the TSC1 or TSC2 genes. Thus, both the clinical data and genomic panel analysis supports the reported absence of tuberous sclerosis disease in these patients.

The variant frequency for the observed somatic mutations in LAM1, LAM2 and LAM5 predicted lymphangioleiomyomatosis cell purity in each sample in the range of 35%, 30% and 20%, respectively (Table 1). Visual inspection of the sequencing raw read data for the remaining two cases (LAM3 and LAM4) yielded no evidence of additional low frequency TSC1 or TSC2 mutations.

Genomic Landscape of Lymphangioleiomyomatosis Tissues

In order to assess the involvement of large genomic rearrangements in lymphangioleiomyomatosis disease, DNA from each lymphangioleiomyomatosis tissue and three selected smooth muscle cell tissues were used for structural variance analysis. The Illumina mate pair sequencing protocol was used to elucidate genome-wide structural variations within the tumors. Mate pair sequencing spans the genome with larger spanning genomic fragments than conventional genomic sequencing, increasing the potential of detecting discordantly mapping breakpoints from rearrangements.29 Mate pair sequencing data generated an average of 90 million reads per sample, with high mapping efficiency to the human reference genome (Supplementary Table 2). An average bridged coverage across the genomes of 80 × was observed, enabling high confidence profiling of structural variations from the reference human genome.

Genome plots present the landscape of structural variation within the lymphangioleiomyomatosis cell, displaying coverage frequency across each chromosome29 (Figure 2a–e). No significant evidence of aneuploidy was observed in any of the lymphangioleiomyomatosis samples. Small regions of gains and loss were evident, but the majorities of these copy variations involved normal natural variation from the reference genome and were shared in associated smooth muscle cell samples (Supplementary Figure 1). Figure 2f exemplifies a typical genome plot of a metastatic lung adenocarcinoma, to demonstrate the extensive aneuploidy, focal gains, losses and chromosomal translocations frequently observed in neoplastic tissues.

Figure 2
figure 2

Genome plots of lymphangioleiomyomatosis cells. Genome plots presenting coverage across the 22 chromosomes and the X-chromosomes of the lymphangioleiomyomatosis cells from LAM1-LAM5 (a–e) and a representative lung adenocarcinoma (AD) (f). Coverage frequency (y-axis) for 30 kb windows across each chromosome (x-axis) are presented horizontally. The central gray coverage indicates normal heterozygous two copy of a chromosome. Gains and losses are observed as darker regions of raised or dropped coverage. Inter-chromosomal large genomic rearrangement translocations are presented as black lines linking different chromosomes. Intra-chromosomal rearrangements are presented as black lines above the specific chromosomal positions involved. A full color version of this figure is available in the online version of this paper.

Structural genomic rearrangements result in discordant mapping fusion junctions absent from the reference genome, which can be efficiently mapped from mate pair sequencing data. Each lymphangioleiomyomatosis tissue contained very few structural variations, with genome profiles very similar to the normal smooth muscle cells. In LAM1, LAM2 and LAM5, with detected TSC2 mutations, no somatic fusion junctions were determined (Table 2). Just three events passed bioinformatics filters in LAM1 and LAM2, which were confirmed germline through shared presence in the associated smooth muscle cells (Table 2). Five events passed filters for LAM5, which in the absence of mate pair sequencing data on the associated smooth muscle cells, were not automatically determined as germline variations. Further PCR validation of three selected events confirmed them as germline through their presence in the associated SMC5 DNA (Figure 3a). One of these events involved PPP2R2B, a gene implicated in mechanistic target of rapamycin signaling.31, 32

Table 2 Genomic rearrangements in S-LAM tissues
Figure 3
figure 3

Somatic rearrangements in lymphangioleiomyomatosis tissues. (a) PCR validations using primers spanning breakpoint junctions determined in lymphangioleiomyomatosis, smooth muscle cells or benign epithelium from LAM3, LAM4 and LAM5. The genes and chromosomal loci are presented beside each gel image (ng; no gene). (b–d) Junction plots of the two deletions spanning TSC1 in LAM4 and rearrangement on chromosome 5 of LAM3. The upper and lower panels of the junction plots present the two breakpoint locations on chromosome 9 or 5 with the associated mate pair reads mapping at each region imaged as red (forward strand mapping) and blue (reverse strand mapping) dots in the upper and lower panels, respectively, linked by black lines. Genes located in each breakpoint region are indicated with exon numbers. (e) Schematic of the two TSC1 spanning deletions in LAM4 showing the position of each breakpoint and the genes involved. (f,g) Focused local coverage at the chromosome 9 TSC1 locus for LAM4 and SMC4 and the KDM6A chromosome X locus of LAM3. The small blue arched images in the upper region for the LAM4 and LAM3 samples present the span of the two detected rearrangements in each sample, which are outlined by the orange rectangles with orange shaded areas demonstrating the comparative loss of heterozygosity.

In LAM3 and LAM4, where no somatic or germline tuberous sclerosis complex gene mutations were detected, somatic rearrangements were detected by the mate pair sequencing and selected events successfully validated by PCR (Table 2; Figure 3a–d). A total of eight events passed bioinformatics filters for LAM4, with just one confirmed germline event being present in SMC4. For the seven somatic events, no reads were detected in the associated smooth muscle cells, and selected events were confirmed somatic by PCR validation (Figure 3a). Most significantly, two events predicted nested deletions across the TSC1 gene locus, indicative of a double loss (Figure 3b, c and e). Copy-number variation coverage confirmed a definite loss of heterozygosity at the TSC1 locus not present in the smooth muscle cell tissue (Figure 3f). One of these deletions additionally predicted an ABO-DDX31 gene fusion. Other genes directly hit included; GFI1B, AK8 and SERPINA11 (Table 2).

Somatic fusion junctions were also verified in LAM3 (Figure 3a). Three fusion junctions were initially predicted to be somatic (Table 2) including a PCR validated event on chromosome 5 (Figure 3d) and two events located on the X chromosome. While PCR validation of the X chromosome events were impeded by the repetitive nature of the DNA region involved, copy-number variation did concur with the rearrangement site (Figure 3g). The overlapping rearrangements predicted both deletion and gain within the 0.8 Mb region of Xp11.3 (Figure 3g). An additional translocation event between chromosomes 1 and 5, which did not initially pass filters due to just four mate pair reads spanning the junction, was also PCR validated as somatic in LAM3 (Figure 3a).

No evidence of loss of heterozygosity was apparent at the TSC2 locus on chromosome 16p13.3 or TBC1D7 at 6p24.1 in any case. Loss of heterozygosity at the TSC1 locus on chromosome 9q34.13 was only evident in LAM4, supporting the deletions reported. No additional loss of heterozygosity was observed at other major genes implicated in the mechanistic target of rapamycin signaling pathway (data not shown).

Discussion

We report here the first in-depth genome-wide study of structural variation within sporadic lymphangioleiomyomatosis cell genomes. While loss of heterozygosity at the TSC1/2 loci have been reported in select studies,7, 14 the overall involvement of large genomic rearrangements in lymphangioleiomyomatosis disease has not been well studied. Previous studies on genomic structural variations in lymphangioleiomyomatosis cells have focused primarily on the tuberous sclerosis complex gene loci and primer-based amplicon sequencing techniques.7, 14 Specifically, Kozlowski and colleagues utilized multiplex ligation-dependent probe amplification, which required multiple probe sets specific to each TSC1/2 exon. The mate pair sequencing technique employed in this study enabled a simpler genome-wide evaluation of structural variation within laser capture microdissection enriched populations on lymphangioleiomyomatosis cells and comparison with normal smooth muscle cells from the same patient. Significantly, somatic rearrangements were observed as active in sporadic lymphangioleiomyomatosis disease, being present in two cases lacking any evidence of TSC2 driver mutations. Selected events were validated by PCR and confirmed as absent from adjacent normal smooth muscle cells from the same patients. As the control, histologically normal smooth muscle cells were collected from adjacent tissue to the lymphangioleiomyomatosis histology for each case, we cannot rule out the possibility of contamination of small numbers of lymphangioleiomyomatosis cells in these controls. However, for cases LAM1, LAM2 and LAM5 where TSC2 point mutations were detected in the lymphangioleiomyomatosis tissues, no supporting reads were detected in the smooth muscle cell controls. Similarly, for LAM4, no supporting mate pair reads were detected for the two TSC1 deletion events in SMC4. While LAM3 did not contain a somatic TSC1 or TSC2 event to assess contamination of lymphangioleiomyomatosis cells in the SMC3 control, rearrangement events present in LAM3 failed to validate by PCR in SMC3. Thus, these observations provided confidence that laser capture microdissection of histologically normal smooth muscle cells from the same tissue sections where the lymphangioleiomyomatosis cells were isolated did not significantly contaminate with lymphangioleiomyomatosis cells.

Biallelic inactivation of either TSC1 or TSC2 has been well established as the principal drivers of lymphangioleiomyomatosis disease; however, the complex nature of this disease still presents many unanswered questions. Although the majority of lymphangioleiomyomatosis show genomics supporting the two-hit tumor suppressor inactivation model, a subset of lymphangioleiomyomatosis still lack identifiable mutations. Although more recent deep sequencing studies in previously described no mutation identified cases identified low-prevalence somatic exonic mutations18 or non-coding intronic splice effector mutations20 in tuberous sclerosis complex genes, cases are still reported in these studies with no defined driver events. Additionally, while sporadic lymphangioleiomyomatosis disease is characterized as solely a TSC2 mutation driven disease, with no reported involvement of TSC1 mutations, diagnosis of sporadic lymphangioleiomyomatosis in absence of tuberous sclerosis complex is complicated by potential mosaicism and lower clinical presentation of TSC1–driven disease in a small subset of tuberous sclerosis complex cases.19, 20 A small sub-population of patients with this rare sporadic lymphangioleiomyomatosis disease is thus subsequently diagnosed with sub-clinical tuberous sclerosis complex lymphangioleiomyomatosis.

The presence of lymphangioleiomyomatosis cells in each case was confirmed by histological review of hematoxylin eosin and HMB-45 stained tissue sections by experienced pathologists. Both available clinical data and genomic analysis on control smooth muscle cell tissue from each patient inferred an absence of tuberous sclerosis complex in all of these patients, supporting the diagnosis of sporadic lym-phangioleiomyomatosis. The presence of somatic point mutations in LAM1, LAM2 and LAM5 from the mutational panel, made evaluation of the proportion of lymphangioleiomyomatosis cells captured possible. Even after enrichment by laser capture microdissection, the proportion of lymphangioleiomyomatosis cells still ranged from 20–35% in LAM1, LAM2 and LAM5, emphasizing the limitation of genomic studies on lymphangioleiomyomatosis tissues in the absence of enrichment protocols. The absence of somatic mutations from the panel in LAM3 and LAM4, however, made lymphangioleiomyomatosis cellularity predictions more subjective. Strong positive HMB-45 staining in both LAM3 and LAM4, however, confirmed the presence of lymphangioleiomyomatosis cells in these two cases. Additionally, germline events in mate pair sequencing data often easily distinguish through their characteristic features, which allow us to predict the nature of events that passed filters (Table 2). While the mutation panel failed to detect any evidence of tuberous sclerosis complex gene mutations in LAM3 or LAM4 the validation of somatic structural variations in these tissues supports the presence of adequate levels of lymphangioleiomyomatosis cells for mutational analysis. The supporting loss of heterozygosity at the TSC1 locus of LAM4 and Xp11.3 locus of LAM3 (Figure 3) also support the presence of adequate lymphangioleiomyomatosis cells for evaluation of copy-number variation and aneuploidy. Aneuploidy, involving gains and losses of whole chromosomes or large regions of chromosomes is characteristic of systemic tumors,24 but was not evident of any of the cases in this study. While whole-genome amplified instills some noise in the copy-number variation analysis, bioinformatics algorithms were designed to minimize this noise and effectively assess copy gains and losses.33

The observed double hits on the TSC2 gene in LAM1 and LAM2 were consistent with the two-hit tumor suppressor driver model.17 The homozygous deletion of TSC1 in LAM4 was also consistent with this model and confirms a role of large genomic rearrangements in lymphangioleiomyomatosis disease. LAM5, however, with just a single point mutation detected on TSC2, could infer an undetected mutation on the second allele of TSC2. With the mechanistic target of rapamycin signaling pathway as the major driver of the proliferative lymphangioleiomyomatosis disease, many patients benefit from sirolimus or more recently everolimus treatment; however, the drugs are sometimes poorly tolerated and aimed at cytostatic rather than cytoablative therapy.14, 15, 34 No alternative effective clinical protocols currently exist for lymphangioleiomyomatosis, with advanced patients often benefitting from lung transplantation.4, 16 There is therefore a need to identify specific molecular targets to offer patients additional therapeutic strategies. While additional genes were hit through the large genomic rearrangements detected (Table 2) the roles as drivers in sporadic lymphangioleiomyomatosis are not well-supported by this study, as such statements would require more in-depth supporting mechanistic studies. However, it is clear that additional genes are mutated in sporadic lymphangioleiomyomatosis disease adding large genomic rearrangements to the complex genetic story of drivers of the lymphangioleiomyomatosis phenotype. One of these events involved PPP2R2B, implicated in mechanistic target of rapamycin signaling and reported as a biomarker for rapamycin responsiveness.31, 32 Other novel genes hit by rearrangements with links to mechanistic target of rapamycin signaling included DDX31, GFI1B, KDM6A, TET2, CRLF2 and CRAC2RA31, 32, 35, 36, 37, 38, 39, 40, 41 While driver roles are not predicted for these genes, it could be hypothesized that such mutations could affect disease presentation and/or potential response to treatment therapies. The additional validation of a fusion junction with just four associated mate pair reads in LAM3 also indicates a potential lower concentration of LAM cells or heterogeneity within this specimen. In samples with low cellularity or with fusion junctions in repetitive/complex regions of the genome, true positive events can appear with lower numbers of supporting associate mate pair reads. Generally with mate pair sequencing data conservative filtering is applied to minimize false positives, however, no significant mechanistic target of rapamycin signaling pathway genes were elucidated even in the lower level associate events (data not shown). Therefore, while somatic tuberous sclerosis complex gene driver mutations were discernible in four of the five cases, LAM3 fell into the category of undetermined driver.

We believe mate pair sequencing will be instrumental in aiding detection of somatic variations in a subset of lymphangioleiomyomatosis cases where no TSC1 or TSC2 mutations were detected by standard genomic testing. These cases are the most interesting in defining additional pathways/mechanisms which drive lymphangioleiomyomatosis disease. In our study LAM3 and LAM4 fell under this category, and mate pair was able to efficiently detect the driver TSC1 mutations in LAM4. The driver in LAM3, however, remained undetermined; this case would greatly benefit from further in-depth integrated genomic analysis, including whole exome and transcriptome sequencing, to further dissect somatic variations in this and similar undefined cases. While care must be taken in over interpreting novel mutated genes in this small study, commonality in expanded case studies would aid in distinction between passenger and driver events.

In this study, we potentially report for the first time a sporadic lymphangioleiomyomatosis case driven by a homozygous TSC1 deletion. This event was predicted somatic by PCR, being absent in the germline control smooth muscle cell tissue, with coverage data also corroborating the copy loss. However, we acknowledge that care must be taken in this conclusion. Firstly, the requirement of fresh frozen tissue necessitated us to utilize lung tissue obtained from the NDRI tissue bank, which negated us access to full clinical records or additional patient tissues. While we can confirm the homozygous TSC1 deletion absent from the surrounding normal tissue, we cannot rule out the potential for mosaicism. Secondly, although consultation with NDRI strongly indicated no tuberous sclerosis complex disease in any of the patients studied, as detailed in Supplementary Table 2, the fact that TSC1 mutation driven tuberous sclerosis complex can present with sub-clinical tuberous sclerosis complex, additionally prevents us from categorically ruling out mosaicism in this patient. Furthermore, the fact that TSC1 mutations are less common in tuberous sclerosis complex than TSC2-driven, and that TSC1-dirven mutations could potentially present with sub-clinical sporadic lymphangioleiomyomatosis disease, together with the rarity of this disease, may also explain the lack of reporting of a TSC1-driven sporadic lymphangioleiomyomatosis to date.

In summary, we report here an analysis of structural variation in sporadic lymphangioleiomyomatosis cells. Results definitively demonstrated the involvement of large genomic rearrangements/deletions in a subset of sporadic lymphangioleiomyomatosis cases lacking the well reported TSC2 driver mutations. Most significantly, we observed a homozygous deletion of TSC1 in a wild type TSC2 background, potentially demonstrating for the first time a case of sporadic lymphangioleiomyomatosis driven by two-hits on the TSC1 gene. More importantly, these somatic mutations would have been missed with most technologies used to screen sporadic lymphangioleiomyomatosis tissues. The numbers of somatic rearrangements was, however, limited and not present in all cases. Nevertheless, somatic genomic rearrangements are active in sporadic lymphangioleiomyomatosis and should not be overlooked in the genomic analysis, specifically in no mutation identified cases, for which mate pair sequencing is a useful tool to evaluate all structural variations in one single sequencing procedure.