Clinical whole-exome sequencing for the diagnosis of rare disorders with congenital anomalies and/or intellectual disability: substantial interest of prospective annual reanalysis

Nambot, Sophie; Thevenon, Julien; Kuentz, Paul; Duffourd, Yannis; Tisserant, Emilie; Bruel, Ange-Line; Mosca-Boidron, Anne-Laure; Masurel-Paulet, Alice; Lehalle, Daphné; Jean-Marçais, Nolwenn; Lefebvre, Mathilde; Vabres, Pierre; El Chehadeh-Djebbar, Salima; Philippe, Christophe; Tran Mau-Them, Frederic; St-Onge, Judith; Jouan, Thibaud; Chevarin, Martin; Poé, Charlotte; Carmignac, Virginie; Vitobello, Antonio; Callier, Patrick; Rivière, Jean-Baptiste; Faivre, Laurence; Thauvin-Robinet, Christel

doi:10.1038/gim.2017.162

Download PDF

Original Research Article
Published: 02 November 2017

Clinical whole-exome sequencing for the diagnosis of rare disorders with congenital anomalies and/or intellectual disability: substantial interest of prospective annual reanalysis

Sophie Nambot MD^1,2,3,4,
Julien Thevenon MD, PhD^1,3,4,
Paul Kuentz MD, PhD^2,3,4,
Yannis Duffourd MSc^3,4,
Emilie Tisserant PhD^3,4,
Ange-Line Bruel PhD^3,4,
Anne-Laure Mosca-Boidron MD^2,3,4,
Alice Masurel-Paulet MD^1,3,
Daphné Lehalle MD¹,
Nolwenn Jean-Marçais MD^1,3,
Mathilde Lefebvre MD^1,2,
Pierre Vabres MD, PhD^3,4,
Salima El Chehadeh-Djebbar MD¹,
Christophe Philippe MD, PhD^2,4,
Frederic Tran Mau-Them MD^2,4,
Judith St-Onge DEC^2,4,
Thibaud Jouan BSc^2,3,4,
Martin Chevarin HNC^2,3,4,
Charlotte Poé BSc^2,3,4,
Virginie Carmignac PhD⁴,
Antonio Vitobello PhD^2,3,4,
Patrick Callier MD, PhD^2,3,4,
Jean-Baptiste Rivière PhD^2,3,4,
Laurence Faivre MD, PhD^1,3,4,
Christel Thauvin-Robinet MD, PhD^1,2,3,4 &
Orphanomix Physicians' Group

Genetics in Medicine volume 20, pages 645–654 (2018)Cite this article

7178 Accesses
117 Citations
33 Altmetric
Metrics details

Subjects

Abstract

Purpose

Congenital anomalies and intellectual disability (CA/ID) are a major diagnostic challenge in medical genetics—50% of patients still have no molecular diagnosis after a long and stressful diagnostic “odyssey.” Solo clinical whole-exome sequencing (WES) was applied in our genetics center to improve diagnosis in patients with CA/ID.

Methods

This retrospective study examined 416 consecutive tests performed over 3 years to demonstrate the effectiveness of periodically reanalyzing WES data. The raw data from each nonpositive test was reanalyzed at 12 months with the most recent pipeline and in the light of new data in the literature. The results of the reanalysis for patients enrolled in the third year are not yet available.

Results

Of the 416 patients included, data for 156 without a diagnosis were reanalyzed. We obtained 24 (15.4%) additional diagnoses: 12 through the usual diagnostic process (7 new publications, 4 initially misclassified, and 1 copy-number variant), and 12 through translational research by international data sharing. The final yield of positive results was 27.9% through a strict diagnostic approach, and 2.9% through an additional research strategy.

Conclusion

This article highlights the effectiveness of periodically combining diagnostic reinterpretation of clinical WES data with translational research involving data sharing for candidate genes.

Increased diagnostic and new genes identification outcome using research reanalysis of singleton exome sequencing

Article 23 June 2019

Exome and genome sequencing for pediatric patients with congenital anomalies or intellectual disability: an evidence-based clinical guideline of the American College of Medical Genetics and Genomics (ACMG)

Article 01 July 2021

A single center experience with publicly funded clinical exome sequencing for neurodevelopmental disorders or multiple congenital anomalies

Article Open access 27 September 2021

Introduction

Congenital anomalies and intellectual disability (CA/ID) comprise a vast, heterogeneous group of disorders, encompassing more than 3,000 different clinical entities, individually rare but collectively frequent. Most CA/ID are of genetic origin and incurred via Mendelian inheritance. Because the prevalence of each disorder is low and a large portion of the molecular bases of CA/ID are still unresolved, their diagnosis remains challenging. These chronic, early-onset disorders contribute significantly to morbidity, mortality, and health-care expenditure,¹ and their etiologic diagnosis is essential for genetic counseling, prenatal testing, accurate follow-up, prevention of complications, and personalized treatment.² The current standard of care for the diagnosis of CA/ID includes multiple clinical evaluations by specialized physicians, and countless paraclinical investigations such as imaging, metabolic, and biological tests, which are potentially invasive for patients. The genetic investigations include cytogenetic tests and successive single-gene testing, and more recently gene panels. This long and tedious traditional approach leaves approximately half of the families with no diagnosis.³

Next-generation sequencing (NGS) has revolutionized medical genetics by improving the chances of obtaining a molecular diagnosis for rare genetic diseases. NGS was initially applied in research, and different strategies were considered to implement NGS for diagnostic purposes. Whole-exome sequencing (WES) has shown an unprecedented success rate in the identification of disease-causing genes in projects ranging from tailored sequencing used to discover the molecular bases of a recognizable syndrome in a homogeneous group of patients, to the systematic application of pan-genomic sequencing in large heterogeneous cohorts.⁴ The usefulness of an unbiased sequencing approach has been highlighted in various heterogeneous disorders, including categories of CA/ID, such as syndromic ID,⁵ developmental delay (DD),⁶ autism,⁷ epilepsy,⁸ and congenital heart defects.⁹

Later, a more accurate interpretation of the data and a reduction in sequencing costs enabled its widespread implementation in clinical practice. Between 2010 and 2015, about 555 genes implicated in Mendelian phenotypes were discovered using NGS. This has resulted in WES becoming the current standard of care for the diagnosis of highly heterogeneous rare disorders with suspected Mendelian inheritance,¹⁰ thus blurring the line between diagnosis and research. The widespread application of this test in cohorts of patients with undefined CA/ID allows a diagnostic yield ranging from 25 to 32% (refs. 11, 12, 13, 14. This diagnostic yield corresponds to the identification of a disease-causing variant in a gene previously implicated in a human disorder and with a published compatible phenotype. The sequencing strategy may vary from center to center with trio-based or proband-based WES. Although the diagnostic yield should not vary, the likelihood of identifying a candidate variant for a new disorder may depend on the phenotype and the chosen sequencing strategy.^{13, 15}

This recent acceleration in the discovery of disease-causing genes makes it difficult for physicians to remain up to date with genetic medical knowledge. Initially, the routine use of WES demonstrated the limitations of usual phenotype-driven strategies, based on the clinical expertise of physicians in reference centers for rare diseases, especially in the following situations: (i) atypical presentations of known diseases making it hard to make the diagnosis at first sight; (ii) ultrarare diseases described in very few cases and therefore unknown to most specialists, and (iii) patients exhibiting a specific but only recently discovered phenotype. International data sharing is an efficient solution that overcomes these limitations. By catalyzing the identification of additional patients with similar phenotypic and genotypic profiles, initiatives such as the Matchmaker Exchange project¹⁶ allow fast and accurate phenotype matching to assess the clinical relevance of candidate variants and genes. Reanalyzing and reinterpreting clinical WES data from large research cohorts is also proving to be an effective way to reveal new disease-causing variants. In a clinical context, only three articles have assessed the relevance of reanalyzing data. The first focused on data reanalysis and reported an additional diagnostic yield of 10% in 40 patients.¹⁷ The second, in a series of 2,000 sequential cases submitted to Ambry Genetics for testing prior to 2016, showed that 5.6% of cases that initially received negative or candidate results were upgraded to positive/likely positive or uncertain in a characterized gene.¹⁸ The third one reported seven changes in the result for 14 reanalyzed cases performed by the molecular laboratory 12 to 18 months after the initial report, of which four resulted in a new definitive diagnosis.¹⁹ These observations led us to adapt our WES-based clinical practice and diagnostic process by setting up systematic reanalysis and international data sharing.

This retrospective study reports the results and consequences of implementing clinical WES in our current diagnostic practice, and of introducing a systematic reanalysis strategy of unsolved results combined with translational research for candidate genes in a cohort of 416 consecutive patients with CA/ID.

Materials and methods

Patients

From June 2013 to June 2016, WES was performed in 323 patients referred to the Reference Center for Congenital Anomalies and Malformative Syndromes in Dijon, France, for an etiological diagnosis and in 93 patients referred to the Orphanomix service (http://www.orphanomix.com/index.html) by other centers in France.

The inclusion criteria were (i) signs of ID or DD when the age of the patient (<6 years) did not permit a diagnosis of ID or the presence of at least one congenital anomaly with or without ID of suspected genetic origin; (ii) a negative prior diagnostic workup; and (iii) informed consent of the patient or parents/guardians for inclusion. Fetuses with multiple malformations were not included in this study. Array-comparative genomic hybridization was systematically performed before WES in patients with DD, isolated or syndromic ID (associated with dysmorphism or one congenital anomaly), autism spectrum disorders, or pre- or postnatal malformations (two or more), as well as for the characterization of an anomaly detected by another cytogenetic method. In most patients with a convincing diagnostic etiology, a targeted genetic test (single-gene or gene panel) was first ordered. The prescription of WES or gene panel testing was discussed weekly by a group of trained physicians and depended on (i) the clinical and genetic heterogeneity of the suspected disorder and (ii) the availability, turnaround time, and cost of a targeted approach. The conduct of the pretest consultation has been detailed elsewhere.¹² The local ethics committee approved this study.

Standardized deep phenotyping

Patients were separated into three major phenotype groups according to the clinical indication: neurodevelopmental disorders, CA without ID/DD, and neuromuscular disorders. The neurodevelopmental disorders group was divided into four subgroups: nonsyndromic ID, syndromic ID (defined as ID with CA and/or dysmorphism), epileptic encephalopathy (EE), and syndromic DD (Figure 1b). The proportion of patients with EE was higher during the first year. This initial overrepresentation of patients with EE can be explained by the work done in our center during the first year of inclusion, which focused on the diagnosis of EE by WES12. Detailed phenotypic data were anonymously collected in the PhenomeCentral database (https://phenomecentral.org/) using the standardized Human Phenotype Ontology terms.

Whole-exome sequencing

Sequencing and bioinformatics analysis

In all index cases, libraries of genomic DNA samples were prepared using the Agilent Sureselect Human All Exon v5 kit (Agilent Technologies, Santa Clara, CA), and were sequenced on a HiSeq instrument (Illumina, San Diego, CA) according to the manufacturer’s recommendations for paired-end 76-bp reads. The bioinformatics pipeline, alignment processes, and quality procedures have been described elsewhere.¹² Version 3.4–46 of the Genome Analysis Toolkit was used for this study. Among the 416 patients, 82 (19.7%) were analyzed during year 1, 119 (28.6%) during year 2, and 215 (51.7%) during year 3 (Figure 1a).

Copy-number variant detection

The in-house pipeline for copy-number variant (CNV) detection was developed in November 2015. CNV analysis was retrospectively applied to all patients. The procedure is detailed elsewhere¹² and in the Supplementary Data online.

Variant interpretation strategy

The diagnostic interpretation of the filtered variants was done according to the American College of Medical Genetics and Genomics (ACMG) recommendations of 2008 and 2015 (refs. 20, 21) during the first 2 years and the third year of the study, respectively. The detailed diagnostic interpretation procedure has been reported elsewhere¹² and is described in the Supplementary Data. The familial segregation study is also detailed in the Supplementary Data. The 56 genes on the list of medically actionable secondary findings defined by the ACMG were also studied and interpreted according to the ACMG recommendations available at the period of reanalysis.²² The results were returned to the patient when consent had been given.

Annual reanalysis

Negative and uncertain results were reanalyzed from the raw sequencing data stored as compressed fastq files (Supplementary Data). All variants of the final analysis file were interpreted. The interpretation first focused on variants previously and newly reported as pathogenic/probably pathogenic in public databases of clinical interest (ClinVar, http://www.ncbi.nlm.nih.gov/clinvar; DECIPHER, https://decipher.sanger.ac.uk/) or as affecting well-established human disease genes. The interpretation was then extended to all of the other variants, namely those not meeting the diagnostic interpretation criteria. For relevant variants presenting a good genotype–phenotype correlation, but reported in an insufficient number of patients (only one family, one single isolated population) or in several patients of a large cohort without clinical details, we actively searched for additional patients carrying variants in the same gene with a similar phenotype through national collaborations or international data sharing to confirm the genotype–phenotype relationship. This strategy was also used for atypical presentations or new phenotypes linked to an already known gene, but reported only once in the literature or presented in congresses. Reverse phenotyping and data sharing were widely used in these cases to compare and gather patients with the same mutated gene, and look for common clinical features, and thus increase the recurrence. For variants in genes never associated with human disease, the ACMG interpretation criteria were partially applicable. We based on the evidence proposed by the ACMG guidelines with particular attention to the encoded protein function, functional studies, animals models, and an intensive search of new patients with a similar phenotype carrying a variant in the same gene through a translational research approach²³ (Figure 2).

Time to diagnosis

The time to diagnosis was calculated only for patients seen in our local center with a positive diagnostic result obtained after the first analysis. It corresponded to the overall duration of the diagnostic process, from the first consultation in our center to the date of the WES report.