Abstract
Genome sequencing (GS) can identify novel diagnoses for patients who remain undiagnosed after routine diagnostic procedures. We tested whether GS is a better first-tier genetic diagnostic test than current standard of care (SOC) by assessing the technical and clinical validity of GS for patients with neurodevelopmental disorders (NDD). We performed both GS and exome sequencing in 150 consecutive NDD patient-parent trios. The primary outcome was diagnostic yield, calculated from disease-causing variants affecting exonic sequence of known NDD genes. GS (30%, nā=ā45) and SOC (28.7%, nā=ā43) had similar diagnostic yield. All 43 conclusive diagnoses obtained with SOC testing were also identified by GS. SOC, however, required integration of multiple test results to obtain these diagnoses. GS yielded two more conclusive diagnoses, and four more possible diagnoses than ES-based SOC (35 vs. 31). Interestingly, these six variants detected only by GS were copy number variants (CNVs). Our data demonstrate the technical and clinical validity of GS to serve as routine first-tier genetic test for patients with NDD. Although the additional diagnostic yield from GS is limited, GS comprehensively identified all variants in a single experiment, suggesting that GS constitutes a more efficient genetic diagnostic workflow.
Introduction
Despite the diagnostic improvement of routine exome sequencing (ES) in clinical diagnostic laboratories, only about a third of the patients with neurodevelopmental disorders (NDD) of presumed genetic origin receive a definitive diagnosis [1, 2]. This diagnostic uncertainty can have considerable impact on patients, parents and families [3], which can be extenuated by providing more definitive molecular diagnoses.
A major advantage of genome sequencing (GS) over ES is the uniform sequence coverage across the entire genome [4] due to the elimination of enrichment artefacts [5], resulting in improved variant calling, especially for structural variation (SV) and short tandem repeats (STR) [6, 7]. This accurate and comprehensive analysis of all different variant types has great potential for further increasing the diagnostic yield. In rare disease patients in whom standard of care (SOC) testing failed to provide a molecular diagnosis, GS has been shown multiple times to reveal pathogenic variants leading to genetic diagnoses that were not identified at time of routine diagnostic work-up [8,9,10,11,12,13,14]. However, many of these studies were conducted in highly selected patient cohorts, making it difficult to draw firm conclusions on the potential of increased diagnostic yields when GS is employed in a prospective routine care setting [8,9,10]. While other studies have concluded that GS is technically suitable as first genetic test based on randomized control trials with standard of care [11,12,13,14,15,16], those studies have not allowed for a systematic comparison at the individual patient level for (type of) diagnoses gained or missed. As such, a prospective parallel study where patients receive both SOC and GS to determine the concordance between these two pathways and to address the question whether GS has added value over current SOC, has, to our knowledge, not yet been performed.
Here, we tested the hypothesis whether GS provides a higher diagnostic yield for patients with NDD when compared to current ES-based SOC. To this end, we prospectively studied 150 patient-parent trios with NDD, for whom we, in parallel to our (ES-based) SOC, performed GS. This not only allowed for a direct comparison of the clinical validity [17] of both genetic diagnostic pathways based on diagnostic yield, but also provided insights into the additive diagnoses made by either pathway, providing an unbiased evaluation of GS as first-tier genetic test for patients with NDD.
Materials and methods
Patient recruitment, selection and counseling
The departments of Human Genetics of the Radboudumc and MUMC+āare tertiary referral centers for patients with NDD in the Netherlands. A total of 150 consecutive index patients (nā=ā96 Radboudumc, nā=ā54 MUMC+) with neurodevelopmental delay of suspected genetic origin were included between October 1st 2018 and July 1st 2019. The only inclusion criterion was that the clinical geneticist requested a genetic diagnostic test to identify the molecular defect underlying the patientās phenotype. Patients with a clinically recognizable syndrome (requiring genetic confirmation by a molecular genetic test) were not excluded from this study, making this cohort representative of the types of NDD patients seen in a tertiary referral center.
Board-certified clinical geneticists counseled all 150 patients and their parents for the ES-based SOC/GS procedures. All participants or their legal representatives gave written informed consent. This study was approved by the Medical Review Ethics Committee Arnhem-Nijmegen under 2011/188 and 2020-7142.
Collection of family history and phenotypic information
Family history was negative for 145 cases, whereas for 5 patients, a recessive disorder or parental germline mosaicism was expected based on an affected sibling. Of note, if the genetic testing for both siblings started at the same time, making both sibs eligible for inclusion in this study, only one of the sibs was randomly selected and included. For all patients, phenotypic information was collected from the electronic health records and captured by standardized Human Phenotypic Ontology (HPO [18], release 2018-12-21) using PhenoTips (https://github.com/phenotips/phenotips, version 1.4.1).
Standard of care procedures
All patients received two diagnostic pathways in parallel, i.e., the (ES-based) SOC diagnostic pathway and the GS pathway (Fig.Ā 1).
Requests for genetic diagnostic testing in the SOC pathway were left to the discretion of the clinical geneticist. The ES procedure was performed as described previously and included single nucleotide variant (SNV) and CNV analysis [19,20,21]. In brief, all DNA samples were sequenced with paired-end reads on an Illumina HiSeq 4000 or Illumina NextSeq 500 instrument in combination with Agilent version 5 enrichment kit to an anticipated median target coverage of 100-fold (Supplementary TableĀ 1). ES could be supplemented with additional genetic tests, such as genomic microarray-based CNV profiling [22] (irrespective of CNV calling from ES), or targeted single gene-based testing for variants known to escape detection by ES (e.g., methylation assays for Prader Willi (OMIM 176270) or Angelman syndrome (OMIM 105830)), mitochondrial DNA testing, Sanger sequencing of individual genes not well-covered in ES, or repeat expansion analysis. Genetics tests performed prior to inclusion in this study were also considered part of the SOC pathway. Independent of the underlying genetic assay, genetic variants were prioritized and interpreted in the context of the patientās phenotypic presentation, using routine procedures [2].
GS procedure and bioinformatic analysis
GS was performed as described by the manufacturer (Illumina, San Diego, CA, USA). In brief, 1000āng DNA, isolated from whole blood, was used for library preparation using the Illumina TruSeq DNA PCR-free protocol. Insert size was set at an average of 450ābp (Supplementary Fig.Ā 1A) by shearing DNA using a Covaris E210. For efficient use of sequence capacity, barcoded indexing was included in the library preparation. Samples were equimolarly pooled, six samples on an Illumina S2 flowcell (nā=ā5 flowcells), 12 samples on a S4 flowcell (nā=ā3 flowcells), or 15 samples on a S4 flowcell (nā=ā27 flowcells), prior to sequencing using an Illumina NovaSeq6000TM instrument to an anticipated genome-wide coverage of 50-fold (with a confirmed minimum coverage of 45-fold after mapping) (Supplementary Fig.Ā 1B).
After sequencing, FASTQ files were processed through our in-house developed GS-pipeline. In short, reads were mapped to the human reference genome (GRCh37/hg19) using BWA (v.0.78) and the quality of the resulting BAM file was assessed using Qualimap (v.2.2.1) (Supplementary TableĀ 2) [23]. Variant calling was performed per variant type using dedicated tools to optimize sensitivity, subsequently followed by variant annotation to facilitate variant interpretation.
For small variants, including SNVs and insertion/deletion events (indels), xAtlas (v.0.1) [24] was used and variants were annotated using an in-house developed pipeline. This variant annotation was performed using the Variant Effect Predictor (VEP V.91) and Gencode (V.34lift37) basic gene annotations. Frequency information was added from GnomAD (V.2.1.1) [25] and from an in-house database. In-house gene panel information, composed by multidisciplinary expert teams consisting of a clinical laboratory geneticist, a molecular geneticist, and a clinical specialist, was added for those genetic variants within a known disease gene. Additional annotations included CADD score (V.1.6), SpliceAI, OMIM and KEGG pathways. Runs of homozygosity were detected using PLINK (v.1.07) [26] with the following parameters: homozyg-window-het=3, homozyg-snp=50 and homozyg-kb=300.
SVs were called using Manta Structural Variant Caller (v.1.1.0; Illumina) [27], following a paired-end and split-read evidence approach for SVs identification. CNVs were called using Canvas Copy Number Variant Caller (v.1.40.0; Illumina) [28], which detects copy number gains or losses based on read depth. SVs and CNVs were annotated using an in-house developed pipeline based on ANNOVAR and Gencode (V.34lift37) basic gene annotations, and included frequency information from GnomAD (V.2.1) [25], 1000G (V.8) [29] and the GoNL SV database [30].
Short tandem repeats (STRs) were analyzed using ExpansionHunter (v.3.1.2) [31] with default settings, by extracting the relevant reads from a binary alignment file and realign them to the locus to genotype the STR site. As input loci, a standard diagnostic variant catalog containing 39 well-described disease-causing STR loci was used (Supplementary TableĀ 3).
GS variant filtering, prioritization and interpretation
We developed a strategy for the filtering, prioritization, and interpretation of genomic variants, based on variant type and mode of inheritance (Supplementary TablesĀ 4, 5). Clinical interpretation of nuclear DNA was limited to exonic variants and splice site variants up to ±8ābp (Supplementary Fig.Ā 2), thus including non-coding exons and UTRs but excluding deep intronic sequences.
SNVs
We first optimized our SNV prioritization strategy based on quality control (QC) metrics by analyzing coverage, percentage variant allele frequency, and QC scores for ten childāparent trios unrelated to this project (Supplementary Fig.Ā 3). Subsequently, we applied this strategy to our cohort of 150 patient-parent trios. We then extracted the coding SNVs (Supplementary Fig.Ā 2) based on mode of inheritance, by filtering for de novo, homozygous, compound heterozygous and hemizygous variants (Supplementary TableĀ 4). After filtering, the variants were prioritized (Supplementary TableĀ 5) and evaluated in the clinical context of the patient, first focusing on prior disease-gene relationships for NDD, but also including variants in genes for which this was to be established (e.g., ācandidate disease genesā). Variants considered clinically relevant were validated by routine Sanger sequencing.
CNVs
Canvas and MANTA output files were filtered for de novo, homozygous and hemizygous CNVs. Cohort metrics were used to establish CNV filter strategies (Supplementary Fig.Ā 4). CNVs were assessed in three categories 1) all variants in the NDD gene panel, 2) exon-containing variants in all disease-associated genes, or 3) all exon-containing variants >10ākb. In addition, autosomal recessive genes with de novo or heterozygous CNVs were checked for compound heterozygosity with SNVs. CNVs were validated with a 6.85 million probe CytoScan XON array (Applied BiosystemsTM, Thermo Fisher Scientific).
STRs
Clinical interpretation of STRs was limited to AFF2 (Fragile XE syndrome; OMIM 309548), ARX (early infantile epileptic encephalopathy; OMIM 308350), CSTB (progressive myoclonic epilepsy 1A; OMIM 254800), EIF4A3 (Richeri-Costa-Pereira syndrome; OMIM 268305), DMPK (myotonic dystrophy; OMIM 160900) and FMR1 (Fragile X syndrome; OMIM 300624) because of their NDD-associated phenotypes. Allele sizes for both repeats were extracted from the ExpansionHunter data for all patients. Genotypes were then compared to the locus-specific expansion thresholds (Supplementary TableĀ 3). Alignments of the calls exceeding the thresholds were manually curated for sequencing coverage and read mapping quality (GitHub-Illumina/GraphAlignmentViewer). All remaining likely aberrant expansion alleles were confirmed using PCR followed by fragment length analysis and repeat-primed PCR.
Mitochondrial DNA variants
Variants called on āchromosome Mā (for mapping to mitochondrial DNA sequence) were extracted from the variant call files and subsequently compared to confirmed pathogenic mutations reported in MITOMAP (https://www.mitomap.org/foswiki/bin/view/MITOMAP/ConfirmedMutations).
Definition and comparisons of diagnostic yield
The diagnostic outcome was determined at the individual patient level in both testing pathways independently (i.e., GS and SOC). The outcomes were based on guidelines from the Association for Clinical Genetic Science (ACGS), the Dutch Society of Clinical Genetic Laboratory Specialists (VKGL) [32], and European guidelines for constitutional cytogenomic analysis [33]:
-
1.
No conclusive diagnosis: no obvious pathogenic variant(s) (of either variant type) were detected (in either genetic test) that could potentially explain disease.
-
2.
Possible diagnosis obtained: a variant(s) of unknown significance was identified in a previously established disease gene that could explain the patientās phenotype. Or alternatively, a pathogenic variant(s) in a candidate disease-gene(s) was identified with a potential relationship to (part of) the patientās phenotype.
-
3.
Conclusive diagnosis obtained: a (likely) pathogenic variant in a disease gene associated with the patientās phenotype was detected, which explains the phenotype observed.
Data on diagnostic tests were independently analyzed by different teams and outcomes were not exchanged between the two pathways prior to reaching a conclusion in each of them separately. Diagnostic endpoints for each of the care pathways were compared and discussed after completion of the diagnostic quest in regular meetings with a multidisciplinary team including molecular geneticists, clinical genetic laboratory specialists, clinical geneticists, and pediatric neurologists.
Results
Cohort demographics
All 150 patients (101 males, 49 females) with complex NDD were included between October 2018 and July 2019 (Supplementary Fig.Ā 5A). At the time of inclusion, the median age of patients in our cohort was 9 years and 6 months (range 1 year and 10 monthsā42 years and 7 months; Supplementary TableĀ 1 and Supplementary Fig.Ā 5B). Clinical phenotypes of the patients were captured using HPO terms, with a median number of 12 HPO terms per patient (range 2ā44; Supplementary TableĀ 6 and Supplementary Fig.Ā 5C). To gain insight into the representativeness of this cohort versus patients referred to our tertiary referral center, we observed the broad phenotypic spectrum known to be associated with NDD (Supplementary TableĀ 7) but recognized a depletion for epilepsy (pā=ā0.01, Fisherās exact test with Bonferroni correction).
SOC diagnostic yield
The SOC pathway consisted of ES for all 150 patients and their parents. In 105 patients, this was supplemented with additional genetic tests such as genomic microarray profiling (nā=ā63), FMR1 expansion detection (nā=ā66), or other targeted gene-based testing (nā=ā25). On average, 2.3 genetic tests were performed per patient (range 1ā8 tests; Supplementary TableĀ 8).
A conclusive genetic diagnosis was obtained for 43 (28.7%) of 150 patients (TablesĀ 1ā3 and Supplementary TableĀ 9). These diagnoses included 26 SNVs, 13 small InDels, 3 CNVs (ranging in size from 600ākb to 1.3āMb), and one STR expansion in FMR1 causing fragile X syndrome (OMIM 300624). All SNVs, InDels, and CNVs were detected using ES, while the FMR1 repeat was detected through a gene-specific tandem repeat assay requested in addition to ES. In addition to the conclusive diagnoses, 31 patients (20.7%) received a possible genetic diagnosis, and for 76 patients (50.7%), no genetic cause was identified (TableĀ 1 and Supplementary TableĀ 9).
GS diagnostic yield
Prospectively, and in parallel to the ES-based SOC pathway, all patients (and their parents) received GS, which was analyzed independently of the SOC. A total of 45 patients (30.0%) received a conclusive genetic diagnosis, consisting of 26 SNVs, 13 small InDels, 5 CNVs (ranging in size from 5ākb to 1.3āMb), and one STR expansion in FMR1 (TablesĀ 1ā3 and Supplementary TableĀ 9). In addition, 35 patients (23.3%) received a possible diagnosis, and for 70 patients (46.7%), the genetic cause remained elusive (TableĀ 1 and Supplementary TableĀ 9).
Comparison of GS to SOC
To objectively determine the value of GS compared to ES-based SOC for NDD patients, we subsequently determined the concordance between the diagnosis obtained at individual patient level via the ES-based SOC and GS pathways (TableĀ 1). In total, 45 (30%) unique patients received a conclusive diagnosis through either the SOC pathway or the GS pathway (TablesĀ 2, 3 and Supplementary TableĀ 9). For 43 of the 45 conclusive diagnoses, both pathways obtained the same diagnosis, while two patients received a conclusive diagnosis through the GS pathway only. There was no statistically significant difference between the diagnostic yield of the ES-based SOC pathway and the GS pathway (nā=ā43 vs. nā=ā45, respectively; pā=ā1.0 Fisherās exact test). The genetic diagnoses that were not detected using the ES-based SOC pathway included a small 5ākb deletion covering exons 14 and 15 of CHD2, and a 36ākb deletion including the 3ā²UTR (non-coding) exon 9 of AHDC1 (Fig.Ā 2 and TableĀ 3). Of note, for one additional patient, GS revealed a more complex rearrangement consisting of a deletion-duplication event, of which the 32ākb duplication was only detected by GS (TableĀ 3).
Visualization of genomic alignment of GS data in the Integrative Genomics Viewer (IGV) software for two CNVs that were detected by GS only. A 5ākb deletion in CHD2 at 15q26.1(93494184-93499518)x1 (patient 33), encompassing exon 14 and 15. B 36ākb deletion in AHDC1 at 1p36.11(27833526-27869757)x1 (patient 47), including the (non-coding) exon 9 (3ā²UTR). CNVs are visible from multiple lines of evidence, including read depth (Aā+āB; arrow) and read pairs with larger than expected insert size (A; reads in red).
Similar concordance between both pathways was obtained at the level of possible diagnoses: GS identified a possible diagnosis for 35 patients, of which 31 were also identified by the ES-based SOC pathway (Supplementary TableĀ 10). Interestingly, all four possible diagnoses only detected by GS involved CNVs (Supplementary TableĀ 11).
There were no variants of (potential) diagnostic relevance that were identified in the ES-based SOC pathway, but failed detection in GS.
Discussion
For the last decade, molecular diagnosis of patients with rare genetic disorders has increasingly relied on the use of ES, resulting in ES being a first-line diagnostic test [1]. Using ES, on average 1 in 3 patients receives a genetic diagnosis [1, 2]. For patients in whom the genetic etiology remains elusive, it has been shown that GS can help to provide a diagnosis, as it allows for comprehensive variant calling of all types of variation without the technical biases and limitations observed in ES [4,5,6, 8,9,10,11,12,13]. From our prospective parallel study including 150 patient-parent trios, we conclude that GS does not result in a significantly higher diagnostic yield (30.0%) than an ES-based SOC pathway (28.7%). Whereas 1.3% additional conclusive diagnostic yield in GS is not statistically significant from SOC, we do show that even with variant interpretation limited to ālow hanging fruitā (i.e., those affecting exonic sequence and previously reported (likely) pathogenic non-coding variants), GS revealed more diagnoses than current SOC. Of note, to reach statistical significance for a 1.3% difference in diagnostic yield, our study would have required a cohort size of 32,039 patients, hence it may be a matter of time to prove diagnostic increase in GS. Our observations agree with recent randomized controlled studies where the diagnostic yield was also not significantly different from SOC [14, 15], although these studies were not able to show the direct comparisons of diagnoses obtained.
The overall diagnostic yield in our cohort (30%) is also in line with recent meta-analyses, reporting conclusive diagnostic yields between 25% and 36% for routine NDD cohorts [1, 19]. The expectation might have been that the overall conclusive diagnostic yield of GS would be 40ā50% when based on cumulative diagnostic yields of each individual diagnostic strategy alone [9]. A possible explanation for not observing the cumulative yield is likely the absence of stringent clinical preselection of our cohort, which allowed the inclusion of any patient with NDD for whom ES-based genetic diagnostic testing was requested. Consequently, patients with phenotypes featuring the entire spectrum of NDD from mild, to moderate, to severe intellectual disability (ID), and with or without additional congenital anomalies, were included. The previous cumulative estimate of GS was, however, based on severe ID [9]. Another explanation for the seemingly lower than initially expected GS diagnostic yield may be the limited scope of variant interpretation, i.e., only focusing on variants affecting coding sequence using an āexome-from-genomeā approach. Whereas examples of non-coding disease-causing variants most certainly exist [8, 34,35,36], the contribution of such non-coding variants for NDD have so far been estimated to be rare [12, 37, 38]. In addition, interpretation and establishing pathogenicity of variants in non-coding space is not straightforward [39], and often involves labor-intensive and time-consuming (functional) follow-up experiments to determine putative clinical relevance [34, 38, 40]. Such experiments are mostly performed in a research setting, where time is less of an issue than in routine diagnostic settings.
In the GS data, we observed a 100% concordance for clinically relevant variants identified by ES-based SOC (i.e., GS detected all 43 conclusive diagnoses as well as all 31 possible diagnoses identified in ES-based SOC). The 84 variants underlying these 74 diagnoses included SNVs (67.9%), InDels (20.2%), CNVs (10.7%) and repeat expansions (1.2%). Interestingly, however, the two conclusive and four possible diagnoses only identified through GS, all represented CNVs, being a significant overrepresentation from expectation (Pā=ā7.823e-06, Fisherās exact test), suggesting that CNVs might have an important role in the missed diagnoses of rare NDDs. The CNVs were not identified in ES due to limitations in sensitivity and specificity of CNV variant calling in ES (nā=ā4) or due to the absence of targets in the ES enrichment procedure (nā=ā2). Our results thus emphasize the value of equal coverage of the entire human genome in GS for structural variant calling and breakpoint determination at base-pair resolution. It is therefore within reason to expect that improved structural variant calling, filtering and prioritization tools will increase the diagnostic yield of GS.
Our results show that GS comprehensively identified all variant types in a single laboratory experiment. The limited additional diagnostic yield of GS indicates that the ES-based SOC was already of high quality. However, for 70% of the patients, ES was supplemented with additional genetic assays. In order to replace the SOC pathway, GS must not only replace ES, but also most (if not all) genetic assays performed. Such a āone-test-approachā could benefit clinical laboratories where it is common that more than one test is being performed in NDD patients. A āone-test-approachā could also reduce the time to diagnosis for patients as these tests are not always performed in parallel, but in a stepwise (consecutive) fashion. Exceptions to this assumption are variants that cannot be currently captured by GS, such as methylation defects.
Besides the advantages of GS over the ES-based workup, performing GS is also associated with certain challenges that could possibly hamper the implementation of GS in routine diagnostic genetic testing. Two of these challenges are the data interpretation of large numbers of variants and the cost of sequencing. We show that the difficulty of interpreting GS data can be easily overcome by a tiered approach for interpretation where first variants in the āexome from genomeā, CNVs and other ālow hanging fruitā can be analyzed. When this does not yield a genetic diagnosis, the variants in non-coding space can be analyzed subsequently without having to perform novel sequencing. The cost of performing GS, although not assessed in detail in this study, are higher than ES, making wide-spread implementation of GS, as replacement for ES-based SOC, difficult despite the obvious advantages [41]. A detailed cost analysis that also factors in the possible cost reductions from assays that are no longer performed in a GS-first approach is therefore warranted. In addition, such analysis may provide insights into the tipping point under which circumstances GS can be implemented cost-neutrally. Alternatively, an elevation in diagnostic yield from GS due to increasing knowledge of pathogenic variants in non-coding space may also help justify the perceived increase in healthcare-associated costs.
In conclusion, our data demonstrate the technical and clinical validity of GS to serve as routine first-tier genetic test for patients with NDD. Despite the similar diagnostic yields observed between GS and SOC, GS successfully identified all clinically relevant variants in a single test. This contrasts with current routine testing which uses multiple tests to reach the same conclusions while potentially taking more time, suggesting that GS is a more efficient workflow. Whether this increased efficiency, in absence of a significantly increased diagnostic yield, can compensate for the expected increase in costs when considering wide-spread implementation of GS, remains to be determined.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Srivastava S, Love-Nichols JA, Dies KA, Ledbetter DH, Martin CL, Chung WK, et al. Meta-analysis and multidisciplinary consensus statement: exome sequencing is a first-tier clinical diagnostic test for individuals with neurodevelopmental disorders. Genet Med. 2019;21:2413ā21.
Vissers L, van Nimwegen KJM, Schieving JH, Kamsteeg EJ, Kleefstra T, Yntema HG, et al. A clinical utility study of exome sequencing versus conventional genetic testing in pediatric neurology. Genet Med. 2017;19:1055ā63.
Graungaard AH, Skov L. Why do we need a diagnosis? A qualitative study of parentsā experiences, coping and needs, when the newborn child is severely disabled. Child Care Health Dev. 2007;33:296ā307.
Meienberg J, Bruggmann R, Oexle K, Matyas G. Clinical sequencing: is WGS the better WES? Hum Genet. 2016;135:359ā62.
Ross MG, Russ C, Costello M, Hollinger A, Lennon NJ, Hegarty R, et al. Characterizing and measuring bias in sequence data. Genome Biol. 2013;14:R51.
Hehir-Kwa JY, Pfundt R, Veltman JA. Exome sequencing and whole genome sequencing for the detection of copy number variation. Expert Rev Mol Diagn. 2015;15:1023ā32.
Tankard RM, Bennett MF, Degorski P, Delatycki MB, Lockhart PJ, Bahlo M. Detecting expansions of tandem repeats in cohorts sequenced with short-read sequencing data. Am J Hum Genet. 2018;103:858ā73.
Elsner J, Mensah MA, Holtgrewe M, Hertzberg J, Bigoni S, Busche A, et al. Genome sequencing in families with congenital limb malformations. Hum Genet. 2021;140:1229ā39.
Gilissen C, Hehir-Kwa JY, Thung DT, van de Vorst M, van Bon BW, Willemsen MH, et al. Genome sequencing identifies major causes of severe intellectual disability. Nature. 2014;511:344ā7.
Palmer EE, Sachdev R, Macintosh R, Melo US, Mundlos S, Righetti S, et al. Diagnostic yield of whole genome sequencing after nondiagnostic exome sequencing or gene panel in developmental and epileptic encephalopathies. Neurology 2021;96:e1770āe82.
Clark MM, Stark Z, Farnaes L, Tan TY, White SM, Dimmock D, et al. Meta-analysis of the diagnostic and clinical utility of genome and exome sequencing and chromosomal microarray in children with suspected genetic diseases. NPJ Genom Med. 2018;3:16.
Smedley D, Smith KR, Martin A, Thomas EA, McDonagh EM, Cipriani V, et al. 100,000 genomes pilot on rare-disease diagnosis in health care ā Preliminary Report. N Engl J Med. 2021;385:1868ā80.
Lionel AC, Costain G, Monfared N, Walker S, Reuter MS, Hosseini SM, et al. Improved diagnostic yield compared with targeted gene sequencing panels suggests a role for whole-genome sequencing as a first-tier genetic test. Genet Med. 2018;20:435ā43.
Brockman DG, Austin-Tse CA, Pelletier RC, Harley C, Patterson C, Head H, et al. Randomized prospective evaluation of genome sequencing versus standard-of-care as a first molecular diagnostic test. Genet Med. 2021;23:1689ā96.
Kingsmore SF, Cakici JA, Clark MM, Gaughran M, Feddock M, Batalov S, et al. A randomized, controlled trial of the analytic and diagnostic performance of singleton and trio, rapid genome and exome sequencing in Ill infants. Am J Hum Genet. 2019;105:719ā33.
Guo H, Duyzend MH, Coe BP, Baker C, Hoekzema K, Gerdts J, et al. Genome sequencing identifies multiple deleterious variants in autism patients with more severe phenotypes. Genet Med. 2019;21:1611ā20.
Burke W. Genetic tests: clinical validity and clinical utility. Curr Protoc Hum Genet. 2014;81:9.15.1ā8.
Kƶhler S, Gargano M, Matentzoglu N, Carmody LC, Lewis-Smith D, Vasilevsky NA, et al. The human phenotype ontology in 2021. Nucleic Acids Res. 2021;49:D1207ād17.
Kaplanis J, Samocha KE, Wiel L, Zhang Z, Arvai KJ, Eberhardt RY, et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature 2020;586:757ā62.
de Ligt J, Willemsen MH, van Bon BWM, Kleefstra T, Yntema HG, Kroes T, et al. Diagnostic exome sequencing in persons with severe intellectual disability. N Engl J Med. 2012;367:1921ā9.
Pfundt R, Del Rosario M, Vissers L, Kwint MP, Janssen IM, de Leeuw N, et al. Detection of clinically relevant copy-number variants by exome sequencing in a large cohort of genetic disorders. Genet Med. 2017;19:667ā75.
Pfundt R, Veltman JA. Structural genomic variation in intellectual disability. Methods Mol Biol. 2012;838:77ā95.
Okonechnikov K, Conesa A, GarcĆa-Alcalde F. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics. 2016;32:292ā4.
Farek J, Hughes D, Mansfield A, Krasheninina O, Nasser W, Sedlazeck FJ, et al. xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments. bioRxiv. 2018:295071.
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alfƶldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434ā43.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559ā75.
Chen X, Schulz-Trieglaff O, Shaw R, Barnes B, Schlesinger F, KƤllberg M, et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics. 2016;32:1220ā2.
Roller E, Ivakhno S, Lee S, Royce T, Tanner S. Canvas: versatile and scalable detection of copy number variants. Bioinformatics. 2016;32:2375ā7.
Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, et al. A global reference for human genetic variation. Nature. 2015;526:68ā74.
Francioli LC, Menelaou A, Pulit SL, van Dijk F, Palamara PF, Elbers CC, et al. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet. 2014;46:818ā25.
Dolzhenko E, van Vugt J, Shaw RJ, Bekritsky MA, van Blitterswijk M, Narzisi G, et al. Detection of long repeat expansions from PCR-free whole-genome sequence data. Genome Res. 2017;27:1895ā903.
Wallis Y, Payne S, McAnulty C, Bodmer D, Sister-mans E, Robertson K, et al., Practice guidelines for the evaluation of pathogenicity and the reporting of sequence variants in clinical molecular genetics. Association for Clinical Genetic Science & Dutch Society of Clinical Genetic Laboratory Specialists. 2013:1ā16.
Silva M, de Leeuw N, Mann K, Schuring-Blom H, Morgan S, Giardino D, et al. European guidelines for constitutional cytogenomic analysis. Eur J Hum Genet. 2019;27:1ā16.
Fadaie Z, Whelan L, Ben-Yosef T, Dockery A, Corradi Z, Gilissen C, et al. Whole genome sequencing and in vitro splice assays reveal genetic causes for inherited retinal diseases. npj Genom Med. 2021;6:97.
Wright CF, Quaife NM, Ramos-HernĆ”ndez L, Danecek P, Ferla MP, Samocha KE, et al. Non-coding region variants upstream of MEF2C cause severe developmental disorder through three distinct loss-of-function mechanisms. Am J Hum Genet. 2021;108:1083ā94.
Flƶttmann R, Kragesteen BK, Geuer S, Socha M, Allou L, SowiÅska-Seidler A, et al. Noncoding copy-number variations are associated with congenital limb malformation. Genet Med. 2018;20:599ā607.
Short PJ, McRae JF, Gallone G, Sifrim A, Won H, Geschwind DH, et al. De novo mutations in regulatory elements in neurodevelopmental disorders. Nature. 2018;555:611ā6.
Turner TN, Eichler EE. The role of de novo noncoding regulatory mutations in neurodevelopmental disorders. Trends Neurosci. 2019;42:115ā27.
Ellingford JM, Ahn JW, Bagnall RD, Baralle D, Barton S, Campbell C, et al. Recommendations for clinical interpretation of variants found in non-coding regions of the genome. Genome Med. 2022;14:73.
Starita LM, Ahituv N, Dunham MJ, Kitzman JO, Roth FP, Seelig G, et al. Variant interpretation: functional assays to the rescue. Am J Hum Genet. 2017;101:315ā25.
Incerti D, Xu XM, Chou JW, Gonzaludo N, Belmont JW, Schroeder BE. Cost-effectiveness of genome sequencing for diagnosing patients with undiagnosed rare genetic diseases. Genet Med. 2022;24:109ā18.
Rosenfeld JA, Coe BP, Eichler EE, Cuckle H, Shaffer LG. Estimates of penetrance for recurrent pathogenic copy-number variations. Genet Med. 2013;15:478ā81.
Acknowledgements
We thank the Radboudumc Genome Technology Center for GS data generation, and Illumina for supporting part of the reagents. This study contributes toward the goals of the Solve-RD project that has received funding from the European Unionās Horizon 2020 research and innovation program under grant agreement No 779257 (to HGB and LELMV). Multiple authors (DAK, MS, CTRMS, TK, BBAdV, RP, TR, NdL, APS, SJS, HGB and LELMV) of this publication are members of the European Reference Network on Rare Congenital Malformations and Rare Intellectual Disability ERN-ITHACA [EU Framework Partnership Agreement ID: 3HP-HP-FPA ERN-01-2016/739516].
Funding
Financial support was obtained from grants from the Netherlands Organization for Health Research and Development (ZonMw; 843002608, 846002003 and 015014066 to LELMV) and from the Netherlands Organization for Scientific Research (917-17-353 to CG).
Author information
Authors and Affiliations
Contributions
Conceptualization: BPGHvdS, GS, HGB, HGY, MRN, LELMV; Data curation: BPGHvdS, GS, JCG, CG, LELMV; Formal analysis: BPGHvdS, GS, JCG, JvR, MRV, NL, MK, RD, HS, AdO, RP, TR, NdL, APS, SJS; Funding acquisition: MRN, CG, LELMV; Investigation: BPGHvdS, GS, JCG, JvR, RD, HS, AdO; Methodology: JCG, CG; Project administration: LELMV; Resources: DAK, MS, CTRMS, TK, BBAdV, AvdW, HGB, HGY, CG, MRN, LELMV; Software: JCG, CG; Supervision: HGB, HGY, CG, MRN, LELMV; Validation: JCG, JvR, MRV, NL, MK, RD, HS, AdO, RP, TR, NdL, APS, SJS; Visualization: BPGHvdS, GS; Writing-original draft: BPGHvdS, GS, LELMV; Writing-review and editing: BPGHvdS, GS, LELMV.
Corresponding author
Ethics declarations
Competing interests
Part of the reagents required for the study were kindly provided by Illumina. However, Illumina was not involved in study design, the analyses and/or conclusions drawn from the data. The authors declare no conflict of interest.
Ethical approval
All participants or their legal representatives gave written informed consent. This study was approved by the Medical Review Ethics Committee Arnhem-Nijmegen under 2011/188 and 2020-7142.
Additional information
Publisherās note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
van der Sanden, B.P.G.H., Schobers, G., Corominas Galbany, J. et al. The performance of genome sequencing as a first-tier test for neurodevelopmental disorders. Eur J Hum Genet 31, 81ā88 (2023). https://doi.org/10.1038/s41431-022-01185-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41431-022-01185-9
This article is cited by
-
New year, new genes
European Journal of Human Genetics (2023)
-
ADNP in reverse gear
European Journal of Human Genetics (2023)
-
Genome sequencing as a single comprehensive test in molecular diagnosis
European Journal of Human Genetics (2023)
-
From precision diagnosis to precision treatment in epilepsy
Nature Reviews Neurology (2023)
-
GenIDA: an international participatory database to gain knowledge on health issues related to genetic forms of neurodevelopmental disorders
Journal of Neural Transmission (2023)