Recent advances in Bionano optical mapping (BOM) provide a great insight into the determination of structural variants (SVs), but its utility in identification of clinical likely pathogenic variants needs to be further demonstrated and proved. In a family with two consecutive pregnancies affected with ventriculomegaly, a splicing likely pathogenic variant at the LAMA1 locus (NM_005559: c. 4663 + 1 G > C) inherited from the father was identified in the proband by whole-exome sequencing, and no other pathogenic variant associated with the clinical phenotypes was detected. SV analysis by BOM revealed an ~48 kb duplication at the LAMA1 locus in the maternal sample. Real-time quantitative PCR and Sanger sequencing further confirmed the duplication as c.859-153_4806 + 910dup. Based on these variants, we hypothesize that the fetuses have Poretti-Boltshauser syndrome (PBS) presenting with ventriculomegaly. With the ability to determine single nucleotide variants and SVs, the strategy adopted here might be useful to detect cases missed by current routine screening methods. In addition, our study may broaden the phenotypic spectrum of fetuses with PBS.
Accurate molecular diagnosis of genetic disorders is crucial to families with history of diseases and couples with consecutive abnormal fetuses showing similar phenotypes. The identification of pathogenic variants has a great implication for disease diagnosis, treatments, and reproduction guidance. In the past few years, whole-exome sequencing (WES) has greatly facilitated the identification of candidate disease-causing variants in coding regions1,2. Based on the next-generation sequencing, WES is capable of identifying single nucleotide variants (SNVs) and small variants. Variants in genetic disorders are constantly discovered, yet ~60% of cases remain undiagnosed3,4. One reason is that some structural variants (SVs) are challenging to detect using short reads-based technologies5. SVs are known to be implicated in a wide range of genetic diseases6, including copy number variants (CNVs), inversions, translocations, and so on. Currently, chromosome microarray (CMA) remains the most common method to determine CNVs7. However, it is not able to identify small or balanced SVs, nor does it provide information about the location and orientation8. Hence, when common molecular tests cannot identify any potential pathogenic variants for families with a clear indication of genetic problems, new techniques are required to fill the gaps.
Advances in genome mapping technologies continue to provide great insights into the determination of SVs9,10. On the basis of nanochannel-based genome mapping technology, Bionano optical mapping (BOM) utilizes enzymes to label high molecular weight DNA without breaking the DNA or polymerase chain reaction (PCR), and fluorescently tagged mega base size DNA are obtained11. These long fragments of DNA are linearized through pillars and imaged in nanochannels. A de novo genome assembly is derived from many copies of a native DNA molecule, which allows BOM to detect CNVs, translocations and inversions, as well as other complex SVs10,12. It also offers the opportunity to understand the size, location, and orientation of SVs. So far, BOM has been successfully applied in cancer studies13, and Duchenne and Becker muscular dystrophy diagnosis14, but its utility in determination of clinical likely pathogenic variants need to be further demonstrated and proved.
Poretti-Boltshauser syndrome (PBS; MIM: 615960)15,16 is an autosomal recessive disorder characterized by cerebellar dysplasia, cerebellar cysts, and abnormally sized fourth ventricle. Most patients present with ataxia, language delay, cognitive impairment, developmental delay, and ocular motor apraxia. High myopia, abnormal eye movements, and retinal dystrophy are also seen in some patients. PBS is associated with biallelic pathogenic variants in the LAMA1 gene located on 18p11.31. This gene encodes one of the alpha 1 subunits of laminin, which plays an essential role in many biological processes, involving cell adhesion, differentiation, migration, signaling, neurite outgrowth, and metastasis17. Besides SNVs, three multi-exon deletions16,18,19 and a gross insertion including the entire gene20 have been reported in patients with PBS. However, some potential disease-causing SVs may be missed by using WES and CMA.
In this study, we identified two likely pathogenic variants at the LAMA1 locus in a family with two consecutive pregnancies affected with ventriculomegaly by using BOM together with WES, CMA, and Sanger sequencing. Based on these variants, we hypothesize that the fetuses are affected with PBS. With the ability to determine SNVs and SVs, the strategy adopted here might be useful to detect cases missed by current routine screening methods. Our data further demonstrated the clinical utility of BOM in genetic disorders.
WES was carried out on the second fetal and parental DNA samples, and a splicing pathogenic variant at the LAMA1 locus (NM_005559: c. 4663 + 1 G > C) inherited from the father was detected in the proband. No other pathogenic variant associated with the clinical phenotypes was identified. Interestingly, the splicing variant was present in 27.3% of all reads covering this position in the proband and 46.3% in the father, indicating mosaicism or an extra copy of this region in the proband. Given that SVs might be causal variants of PBS, we speculated that there might be another pathogenic variant on the other allele that is beyond the detection ability of WES. By further reviewing the WES data, we noticed a slightly increased coverage of exon 8–32 at the LAMA1 locus in the proband’s and maternal samples (Fig. 1a), indicating a possible duplication in trans in the proband. However, the WES signals of this duplication was equivocal. As there was no fetal DNA sample left after WES, subsequent studies were carried out on parental samples. To determine whether there was an SV, CMA was performed on the maternal sample. No significant abnormality was identified by CMA. Notably, when we lowered the cutoff value (gains ≥ 25 kb and losses ≥ 25 kb) and looked into the LAMA1 gene, a suspected gain encompassing exons 6–33 was observed (Fig. 1b). At this point, both WES and CMA implied a possible and undetectable duplication in the LAMA1 gene. In this case, it would be questionable whether this duplication really exists; if so, where it is and how it duplicates. To solve these questions, other technologies such as BOM with the capability of detecting long DNA molecules would be necessary.
BOM and SV detection
In attempt to confirm and characterize the SV, BOM was performed on the maternal sample. An average mapped coverage depth of 157× was achieved. A total of 2,076,484 molecules longer than 150 kb were captured and a de novo genome map assembly was created from 389 consensus genome maps (Fig. 1c, Table 1). The average consensus genome map is 11.338 Mb in length. More than 85% of the genome maps could be aligned to the human reference hg19, achieving a breadth of coverage of 91.8%. A total of 4280 clustered SVs relative to hg19 were detected (Fig. 1d, Table 1). The density of labels was ~15.14 times per 100 kilo base pairs (kb) on average.
SV analysis revealed an ~48 kb heterozygous duplication at the LAMA1 locus (Fig. 1e). More importantly, high-resolution fluorescence imaging of linearized DNA passing through nanochannels directly shed light on the duplication as a tandem duplication, i.e., the duplication was present adjacent to the 3' of the original copy in the same orientation at the LAMA1 locus. According to the fluorescent labels, the 5' breakpoint was located within exons 6–15, while the 3' breakpoint within exons 30–37. BOM provided a remarkable insight into the presence of an intrachromosomal duplication.
Confirmation of variants via RT-qPCR and/or Sanger sequencing
To validate the duplication and define the breakpoints at single nucleotide level, we performed real-time quantitative PCR (RT-qPCR) and Sanger sequencing on the maternal sample. RT-qPCR revealed two copies of exons 6 and 34, and three copies of exons 7 and 33 (data not shown). Figure 1f displayed the amplification of the junction region, where part of intron 33 was followed by part of intron 6 in the same orientation. Sanger sequencing confirmed the carrier status of the mother, and further demonstrated the breakpoints in intron 6 and intron 33. Thus, the duplication was reported as c.859-153_4806 + 910dup, encompassing 48,161 base pairs (bps). Considering the WES data, the mother transmitted the duplicated allele to the proband. The splicing variant was also confirmed by Sanger sequencing. As shown in Fig. 1g, the father carried the splicing variant at the LAMA1 locus (NM_005559: c. 4663 + 1 G > C).
Genetic analysis described above revealed that the proband carried compound heterozygous variants of c.859-153_4806 + 910dup and c. 4663 + 1 G > C in the LAMA1 gene. The variant c.859-153_4806 + 910dup inherited from the mother has not been reported. It spanned from exons 7 to 33, which encoded several Laminin EGF-like domains and partial domain I. Moreover, it was absent from the controls in Exome Aggregation Consortium (ExAC) database. The multi-exon duplication was predicted to affect the domain I, which is thought to interact with other laminin chains to form a coiled-coil structure and thus destroy the protein structure. There was no reported case in the DECIPHER database involving the duplicated region. Therefore, this duplication was classified as likely pathogenic. The variant c. 4663 + 1 G > C transmitted from the father had been reported in a patient with PBS previously18. It was predicted to cause a splice defect at the mRNA level and was not found in the ExAC database. Thus, it was also classified as likely pathogenic.
For the past few years, diagnoses of genetic disorders mainly focus on the identification of SNVs by WES and CNVs by CMA, but a large portion of cases remain undiagnosed21. It is probably due to the inabilities of such platforms to characterize certain kinds of SVs22, which represent an important class of genetic variations and are usually causative for many genetic disorders23. In our study, these conventional platforms could neither provide a definite determination of the duplication nor reveal the location and orientation. Moreover, WES screens only exons, leaving 99% genome undetected. In addition, short-read sequences are typically aligned to a reference sequence that limits the identification of large SVs. Furthermore, SVs might be missed when references are incorrect or incomplete. Recent advances in single-molecule sequencing and whole-genome mapping are showing promising utility in comprehensive structural variation analysis, particularly for duplications, insertions, inversions, and translocations24. They are able to produce long-read-based de novo assemblies with remarkable quality and are capable of reconstructing the entire human genome, which is impossible by short-read sequencing. However, single-molecule sequencing technologies, such as Pacific Biosciences and Oxford Nanopore Technologies are either too expensive to obtain sufficient coverage, or suffered from poor accuracy25. Taking the cost, time, and throughput into account, whole-genome mapping-based BOM may be a good alternative for SVs detection. Although it is unable to offer base-level sequence information, its abilities to visualize DNA structure and to reveal a wide spectrum of SVs make it a good complement to short-read sequencing26. Since the resolution of breakpoints is limited to label density, more enzymes can be used to improve the performance of SVs characterization. Alternatively, as shown in our study, the combination with other technologies could provide more comprehensive understandings of the SVs. In addition, the consistent results from different methods ensure the accurate determination of variants.
Here, we successfully identified one duplication (c.859-153_4806 + 910dup) and one splicing variant (c. 4663 + 1 G > C) at the LAMA1 locus in a family with two consecutive pregnancies affected with ventriculomegaly. Mild ventriculomegaly has been reported in three children with PBS16,18. Based on the biallelic-predicted pathogenic LAMA1 variants, we hypothesize that the fetuses have PBS presenting with ventriculomegaly. As far as we know, all the cases with PBS described in the literatures are children and adults16,18,19,27,28,29. We reported the characterizations presented in the second trimester. The different phenotypes are likely caused by the variants’ types and locations. Our study showed that the onset of PBS might be as early as the gestation period. These findings may broaden the phenotypic spectrum of fetuses with PBS, but additional studies are needed to better understand the fetal phenotypes. It should be noted that ventriculomegaly, if present, is nonspecific for PBS. However, it highlights that attention should be given to the patients with ventriculomegaly and their LAMA1 gene could be screened. Specific reproduction advices can then be offered to the couple based on the results.
In this study, a strategy involving BOM, WES, CMA, RT-qPCR, and Sanger sequencing for disease-causing variants detection was employed. It might be useful for cases missed by current routine screening methods. We successfully applied it to a family with consecutive abnormal pregnancies and accurately identified two likely pathogenic variants in the LAMA1 gene. BOM further characterized the location and orientation of the SV, which could not be revealed by WES or CMA, proving its values in determination of likely pathogenic SVs.
Patients and biological samples
A 29-year-old healthy woman who had two consecutive pregnancies with ventriculomegaly was referred to the Department of Reproductive Genetics, Women’s Hospital, School of Medicine, Zhejiang University, Zhejiang, China. Ultrasound scans showed dilated bilateral ventricles, dilated third ventricle, and aqueductal stenosis at the 17th week of gestation for the first fetus. For the second fetus, MRI scan indicated dilated bilateral ventricles at the 22th gestational week, and ultrasound scans showed dilated bilateral ventricles and brain parenchyma compression at the 31th week. For both fetuses, no other abnormalities were observed and nuchal translucency thickness were normal. Their CMV results and toxoplasmosis test results were both negative. Cell-free fetal DNA tests also showed negative results. Both pregnancies were terminated. Autopsy was not performed on either fetus. This family has no other conditions or affected family members, and the couple are not consanguineous. Peripheral blood samples from the couple were obtained. Genomic DNA samples were extracted with QIAamp® DNA Blood Mini Kit (QIAGEN, Germany) according to the manufacturer’s protocol. In addition, fetal DNA sample from the second fetus was collected for WES test. Signed informed consent was obtained from the couple. This study was approved by the Ethics Committee at Women’s Hospital, School of Medicine, Zhejiang University. This study is compliant with the ‘Guidance of the Ministry of Science and Technology for the Review and Approval of Human Genetic Resources’, which requires formal approval for the export of human genetic material or data from China.
The genomic DNA samples were captured by Agilent SureSelect Human All Exon V5 (Agilent, USA) and sequenced on Illumina HiSeq2000 platform (Illumina, USA). Data were mapped to the hg19 reference human genome using Sentieon BWA (Sentieon, USA). Average sequencing coverage was 90-fold with >95% of the region-of-interest covered at least 20-fold. Variants were filtered through population databases, annotated by Variant Effect Predictor software, and online databases, such as ClinVar, OMIM, and HGMD. CNVs were analyzed by NxClinical 4.1 (BioDiscovery, USA).
Genotypes and CNVs were detected by Affymetrix CytoScan™ HD array (Affymetrix, USA) following the manufacturer’s instructions. These HD arrays contained ~2,600,000 markers, including 750,000 single nucleotide polymorphic probes and 1,900,000 non-polymorphic probes. Data were visualized and analyzed with the Chromosome Analysis Suite software (Affymetrix, USA) based on hg19 assembly. Gains ≥ 500 kb and losses ≥ 200 kb, containing a minimum of 50 markers in the region were considered as significant.
Bionano optical mapping
High molecular weight DNA was extracted from whole blood by using Bionano Prep Blood and Cell Culture DNA Isolation Kit (Bionano Genomics, USA). Briefly, blood was subjected to two rounds of red blood cell lysis. After quantitation, white blood cells were immobilized in agarose plugs (Bio-Rad, USA). Plugs were digested with proteinase K (Qiagen, Germany), and washed with wash buffer (Bionano Genomics) and TE buffer (Thermo Fisher Scientific, USA). DNA was recovered, dialyzed, homogenized, and quantitated subsequently. DNA labeling was processed with Bionano Prep™ DLS DNA Labeling kit (Bionano Genomics) according to the kit protocol. In brief, DNA was labeled with DL-Green fluorophores using DLE-1 enzyme at 37 °C for 2 h, digested with proteinase K at 50 °C for 30 min, and cleaned up with 1× DLE-1 buffer. Subsequently, DNA backbone was stained with DNA stain, 5× DTT, and 4× flow buffer for 1 h and homogenized overnight at 4 °C. Labeled DNA was loaded on a Saphyr chip (Bionano Genomics) and run on a Saphyr instrument (Bionano Genomics). The de novo genome map assembly was performed using Bionano Solve™ (Bionano Genomics) and SVs were called against the public Genome Reference Consortium hg19 human assembly. Data were analyzed with Bionano Access™ and Bionano Tools™ on Saphyr Compute Servers (Bionano Genomics).
RT-qPCR and Sanger sequencing
The newly identified duplication was validated by RT-qPCR. Primers were designed to amplify exons of interest and the β-globin gene was chosen as the reference gene. The reaction was carried out on a LightCycler 480 II system (Roche, Switzerland) in 20 µl volume containing 10 µl of SYBR Premix Ex Taq Polymerase (TaKaRa, Japan), 0.4 µl of each primers (10 µmol/l), and 2 µl of the genomic DNA (20 ng). Each sample was tested in triplicate. An initial denaturation step at 95 °C for 10 s was followed by 40 cycles of 95 °C for 5 s and 58 °C for 30 s, and then a final extension step at 72 °C for 10 min. The dosage ratio of the tested samples was calculated by the ΔΔCt method30. Target regions flanking variant sites were amplified (F: 5ʹ-CTGCCTGTCTTTAGGTGCATC-3ʹ and R: 5ʹ-GAGGAAAACTGTACTGACCTTC-3ʹ) and Sanger sequencing was performed on an ABI 3500xL Dx Genetic Analyzer (Applied Biosystems, USA).
CNVs and variants were investigated for their clinical relevance using the following databases: DECIPHER (http://decipher.sanger.ac.uk), OMIM (http://omim.org/), PubMed (http://www.ncbi.nlm.nih.gov/pubmed), Exome Aggregation Consortium (ExAC, http://exac.broadinstitute.org/), ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/), and HGMD (http://www.hgmd.cf.ac.uk/ac/index.php).
Further information on experimental design is available in the Nature Research Reporting Summary linked to this article.
The data that support the findings of this study have been deposited in SRA database with the accession code PRJNA636598.
Bamshad, M. J. et al. Exome sequencing as a tool for Mendelian disease gene discovery. Nat. Rev. Genet. 12, 745–755 (2011).
Gilissen, C., Hoischen, A., Brunner, H. G. & Veltman, J. A. Unlocking Mendelian disease using exome sequencing. Genome Biol. 12, 228 (2011).
Sun, Y. et al. Targeted next-generation sequencing as a comprehensive test for Mendelian diseases: a cohort diagnostic study. Sci. Rep. 8, 11646 (2018).
Ewans, L. J. et al. Whole-exome sequencing reanalysis at 12 months boosts diagnosis and is cost-effective when applied early in Mendelian disorders. Genet. Med. 20, 1564–1574 (2018).
Alkan, C., Sajjadian, S. & Eichler, E. E. Limitations of next-generation genome sequence assembly. Nat. Methods 8, 61–65 (2011).
Stankiewicz, P. & Lupski, J. R. Structural variation in the human genome and its role in disease. Annu. Rev. Med. 61, 437–455 (2010).
Miller, D. T. et al. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am. J. Hum. Genet. 86, 749–764 (2010).
Alkan, C., Coe, B. P. & Eichler, E. E. Genome structural variation discovery and genotyping. Nat. Rev. Genet. 12, 363–376 (2011).
Cao, H. et al. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology. GigaScience 3, 34 (2014).
Mak, A. C. et al. Genome-wide structural variation detection by genome mapping on nanochannel arrays. Genetics 202, 351–362 (2016).
Lam, E. T. et al. Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly. Nat. Biotechnol. 30, 771–776 (2012).
Jaratlerdsiri, W. et al. Next generation mapping reveals novel large genomic rearrangements in prostate cancer. Oncotarget 8, 23588–23602 (2017).
Dixon, J. R. et al. Integrative detection and analysis of structural variation in cancer genomes. Nat. Genet. 50, 1388–1398 (2018).
Barseghyan, H. et al. Next-generation mapping: a novel approach for detection of pathogenic structural variants with a potential utility in clinical diagnosis. Genome Med. 9, 90 (2017).
Poretti, A. et al. Ataxia, intellectual disability, and ocular apraxia with cerebellar cysts: a new disease? Cerebellum 13, 79–88 (2014).
Aldinger, K. A. et al. Mutations in LAMA1 cause cerebellar dysplasia and cysts with and without retinal dystrophy. Am. J. Hum. Genet. 95, 227–234 (2014).
Yao, Y. Laminin: loss-of-function studies. Cell. Mol. life Sci. 74, 1095–1115 (2017).
Micalizzi, A. et al. Clinical, neuroradiological and molecular characterization of cerebellar dysplasia with cysts (Poretti-Boltshauser syndrome). Eur. J. Hum. Genet. 24, 1262–1267 (2016).
Masson, R. et al. Early-onset head titubation in a child with Poretti-Boltshauser syndrome. Neurology 88, 1478–1479 (2017).
Giordano, M., Muratore, V., Babu, D., Meazza, C. & Bozzola, M. A 18p11.23-p11.31 microduplication in a boy with psychomotor delay, cerebellar vermis hypoplasia, chorioretinal coloboma, deafness and GH deficiency. Mol. Cytogenet. 9, 89 (2016).
Strauss, K. A. et al. Genomic diagnostics within a medically underserved population: efficacy and implications. Genet. Med. 20, 31–41 (2018).
Tattini, L., D’Aurizio, R. & Magi, A. Detection of genomic structural variants from next-generation sequencing data. Front. Bioeng. Biotechnol. 3, 92 (2015).
Weischenfeldt, J., Symmons, O., Spitz, F. & Korbel, J. O. Phenotypic impact of genomic structural variation: insights from and for human disease. Nat. Rev. Genet. 14, 125–138 (2013).
Chaisson, M. J. et al. Resolving the complexity of the human genome using single-molecule sequencing. Nature 517, 608–611 (2015).
Sedlazeck, F. J. et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods 15, 461–468 (2018).
Pendleton, M. et al. Assembly and diploid architecture of an individual human genome via single-molecule technologies. Nat. Methods 12, 780–786 (2015).
Marlow, E., Chan, R. V. P., Oltra, E., Rusu, I. & Gupta, M. P. Retinal avascularity and neovascularization associated with LAMA1 (laminin1) mutation in Poretti-Boltshauser syndrome. JAMA Ophthalmol. 136, 96–97 (2018).
Vilboux, T. et al. Cystic cerebellar dysplasia and biallelic LAMA1 mutations: a lamininopathy associated with tics, obsessive compulsive traits and myopia due to cell adhesion and migration defects. J. Med. Genet. 53, 318–329 (2016).
Cali, T. et al. A novel mutation in isoform 3 of the plasma membrane Ca2+ pump impairs cellular Ca2+ homeostasis in a patient with cerebellar ataxia and laminin subunit 1alpha mutations. J. Biol. Chem. 290, 16132–16141 (2015).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) method. Methods 25, 402–408 (2001).
The work was supported by the Key Research and Development Program of Zhejiang Province (2019C03025), the National Natural Science Foundation of China (81901382), and Zhejiang Provincial Natural Science Foundation (LQ19H090019 and LQ17H040001).
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Chen, M., Zhang, M., Qian, Y. et al. Identification of a likely pathogenic structural variation in the LAMA1 gene by Bionano optical mapping. npj Genom. Med. 5, 31 (2020). https://doi.org/10.1038/s41525-020-0138-z