Whole exome sequencing reveals inherited and de novo variants in autism spectrum disorder: a trio study from Saudi families

Al-Mubarak, Bashayer; Abouelhoda, Mohamed; Omar, Aisha; AlDhalaan, Hesham; Aldosari, Mohammed; Nester, Michael; Alshamrani, Hussain. A.; El-Kalioby, Mohamed; Goljan, Ewa; Albar, Renad; Subhani, Shazia; Tahir, Asma; Asfahani, Sultana; Eskandrani, Alaa; Almusaiab, Ahmed; Magrashi, Amna; Shinwari, Jameela; Monies, Dorota; Al Tassan, Nada

doi:10.1038/s41598-017-06033-1

Download PDF

Article
Open access
Published: 18 July 2017

Whole exome sequencing reveals inherited and de novo variants in autism spectrum disorder: a trio study from Saudi families

Bashayer Al-Mubarak^1,2,
Mohamed Abouelhoda^2,3,
Aisha Omar¹,
Hesham AlDhalaan^4,5,
Mohammed Aldosari⁴,
Michael Nester⁵,
Hussain. A. Alshamrani⁶,
Mohamed El-Kalioby ORCID: orcid.org/0000-0001-7655-4198^2,3,
Ewa Goljan ORCID: orcid.org/0000-0001-6235-3387^2,3,
Renad Albar²,
Shazia Subhani^2,3,
Asma Tahir¹,
Sultana Asfahani⁴,
Alaa Eskandrani⁴,
Ahmed Almusaiab¹,
Amna Magrashi¹,
Jameela Shinwari¹,
Dorota Monies^2,3 &
…
Nada Al Tassan ORCID: orcid.org/0000-0001-9076-0334^1,2

Scientific Reports volume 7, Article number: 5679 (2017) Cite this article

8331 Accesses
61 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder with genetic and clinical heterogeneity. The interplay of de novo and inherited rare variants has been suspected in the development of ASD. Here, we applied whole exome sequencing (WES) on 19 trios from singleton Saudi families with ASD. We developed an analysis pipeline that allows capturing both de novo and inherited rare variants predicted to be deleterious. A total of 47 unique rare variants were detected in 17 trios including 38 which are newly discovered. The majority were either autosomal recessive or X-linked. Our pipeline uncovered variants in 15 ASD-candidate genes, including 5 (GLT8D1, HTATSF1, OR6C65, ITIH6 and DDX26B) that have not been reported in any human condition. The remaining variants occurred in genes formerly associated with ASD or other neurological disorders. Examples include SUMF1, KDM5B and MXRA5 (Known-ASD genes), PRODH2 and KCTD21 (implicated in schizophrenia), as well as USP9X and SMS (implicated in intellectual disability). Consistent with expectation and previous studies, most of the genes implicated herein are enriched for biological processes pertaining to neuronal function. Our findings underscore the private and heterogeneous nature of the genetic architecture of ASD even in a population with high consanguinity rates.

Genome-wide association studies

Article 26 August 2021

Emil Uffelmann, Qin Qin Huang, … Danielle Posthuma

Exome-wide analysis implicates rare protein-altering variants in human handedness

Article Open access 02 April 2024

Dick Schijven, Sourena Soheili-Nezhad, … Clyde Francks

Genomic data in the All of Us Research Program

Article Open access 19 February 2024

The All of Us Research Program Genomics Investigators

Introduction

Delineating the genetic architecture of Autism Spectrum Disorder (ASD) is like finding the way in a labyrinth. Despite the rapid advancements in genetic methods, our ability to identify a common pathway for neuropsychiatric disorders with a therapeutic or diagnostic potential is lagging behind¹. This is best exemplified by ASD, for which delivering a near complete genetic picture is hindered by many factors. Examples of such factors include: (i) extensive clinical and genetic heterogeneity (in excess of 600 genes have been implicated in ASD, thus far)²; (ii) the absence of generalizable genetic risk factors as most of the mutations are extremely rare or private in nature; (iii) the variability in diagnosis and data analysis methodology; and finally (iv) the precise function of many of ASD-candidate genes and therefore their possible impact on the central nervous system remains largely undetermined^{3, 4}. All these factors present a substantial challenge, yet incremental progress has and continues to be made towards identifying ASD-risk loci.

Over the past 15 years, efforts were centered on interrogating the role of structural alterations through the study of chromosomal abnormalities and copy number variations (CNVs) in ASD individuals. Key advances have emerged from such studies, for instance, the identification of cytogenetic abnormalities at multiple loci with chromosomes 15q11–13 and 16p11.2 being the most frequent⁵, as well as recurrent CNVs involving genes like, NRXN1, SHANK3 and PARK2 ⁶. However, a causal relationship is often hard to prove for a number of reasons: (i) such events normally span large regions making it difficult to discern the role of single genes in the disorder; (ii) apparently identical alterations have variable phenotypic outcomes or expressivity; (iii) some of these changes are observed in typically developing individuals; (iv) very low rate of replication, as recurrent individual CNVs were observed only in less than 1% of the cases⁷.

In the more recent years and with the advent of next-generation sequencing (NGS) technologies, the focus shifted towards investigating the role of inherited and de novo point mutations. This move was triggered by the observation that potentially risk-conferring structural alterations occurred only in (1–20%) of the cases⁷ and that they probably require other genetic or non-genetics factors for the development of the disease. The increasing availability of whole-genome and –exome sequencing platforms to researchers have accelerated the identification of additional risk factors in previously recognized as well as unrecognized ASD-associated genes^{8,9,10,11,12,13,14,15,16,17,18}. However, most of these studies have focused on the role of either de novo or inherited variants^{3, 8, 9, 11,12,13,14,15,16, 19, 20} and much fewer were designed to interrogate both types^{10, 18, 21, 22}. As there is a growing appreciation for the important part transmitted and de novo mutations play in ASD, more studies assessing both types of events within a given cohort are needed to understand their precise role.

In this study, we implemented whole exome sequencing (WES) in an attempt to identify risk genes/rare variants in a cohort of 19 case-parent trios from singleton families with ASD from Saudi Arabia. Enrichment for inherited causes are predicted in highly inbred populations¹⁷ as are sporadic (de novo) mutations in families with apparently “unaffected” parents^{8,9,10, 12, 13, 18, 20, 23, 24}. We therefore, devised an analysis pipeline that enabled us to detect de novo as well as inherited variants in autosomal or X-linked genes (Fig. 1). Using this approach, rare variants were ascertained in 17 of the 19 trios (~90%).We found that most of the probands carried at least 2 unique rare variants occurring in 47 different genes. To the best of our knowledge 15 of the identified genes were not implicated before in ASD and the remaining were previously reported in ASD or other neuropsychiatric/neurodevelopmental disorders. Examples include PRODH2 and KCTD21 (implicated in schizophrenia), USP9X and SMS (implicated in intellectual disability), TRIM9 and KDM5B (known-ASD genes), and others such as AGL and MOGS (candidate genes involved in energy metabolism and immune responses, respectively).

Results

Sequencing and Alignment Quality

Supplementary Table S1 summarizes the quality of the sequencing and the read mapping steps. It shows that the sequencing achieved good coverage of the target regions (98% average total coverage at 1X and 95% average total coverage at 20X) with enough depth (223X; i.e. each base in the target region is covered by 223 reads on average). We detected an average of 29120.333 total variants comprised of 27466.807 Single Nucleotide Variants (SNVs) and 1653.526 indels per exome (Supplementary Table S2). After comparing the trios, a list of candidate variants was sent to Sanger sequencing for confirmation. The frequency of the confirmed variants in international databases (ExAC, 1000 Genomes Project) and the local the Saudi Human Genome Program Database (SHGP; 2379 local ethnically matching exomes) was also considered. We counted 29 false positives of which 20 were SNVs and 9 were indels as a result of low quality calls.

Reported familial relations history was assessed with three different relatedness tests (described in the methods). The results confirm correct parenthood for all our trios (Supplementary Table S3) as higher scores for parent-child pair were observed by all three methods. For our dataset, the results using the A _jk statistics and shared homozygosity were more confirmative than the ones generated by KING program.

Detected SNVs and Indels

The analysis framework (described in methods) developed in this study allows for comprehensive investigation of the genetic architecture of ASD (Fig. 1). It is designed to capture not only de novo variants, but also those with autosomal or X-linked mode of transmission in unknown or previously described ASD-linked genes. This analysis approach is more suited for complex disorders with high genetic heterogeneity and it may offer much needed explanation for the increased ASD heritability¹⁵.

Using the 4 main inheritance models (refer to methods) a number of variants was detected in each sample except two (ASD-52 and ASD-55). From the variants that have passed all filtering steps, 44 were considered to be deleterious by at least one prediction tool and were confirmed as being true positives for presence of the variant by Sanger sequencing. The majority of the validated variants were missense, while the remaining were either frameshifts, splice-site changes or nonsense and only one small deletion was detected (Fig. 2A). As for the mode of transmission, 3 were de novo, 4 were autosomal dominant, 21 were found in X-linked genes, and the remaining 19 were inherited in an autosomal recessive manner (Fig. 2B). Only 9 of the verified variants were previously reported (in dbSNP with MAF < 1%) (MOGS, rs370842409; ITIH2, rs748626881; OR6C6, rs748626881; GPKOW, rs782015404, TRIM9, rs748897524; NEB rs375412223; BOC, rs752313669; SSTR3,rs577113986; PRODH2,rs370842409), while the remaining 38 variants were novel. Altogether, we have identified 47 rare variants in 47 different genes including 15 ASD-candidate genes (Tables 1 and 2).

Table 1 Summary of the genes with confirmed rare variants detected in this study.

Full size table

Table 2 Rare variants identified in each trio.

Full size table

De novo variants

De novo germline mutations arising spontaneously during meiosis have long been known to confer risk of ASD. This knowledge was inferred largely from studies assessing de novo CNVs in ASD cases from simplex or multiplex families^{25,26,27,28,29,30,31}, however, structural variants seem to account for a small fraction of the cases^{28, 32,33,34}, shifting the focus to de novo point mutations and their contribution to risk, which is a line of enquiry we and others have perused^{8, 9, 11, 13, 20, 24}.

Among the 17 probands, 3 (17.6%) were found to carry rare de novo SNVs. These confirmed de novo events were identified in different genes including one with previous association with ASD: two of these events were predicted to result in a premature truncation, and the remaining was a missense rare variant (p.G209R) affecting PRODH2, found in proband ASD-58. The two nonsense variants were: (i) (p.Y755X) affecting KDM5B identified in ASD-19 and (ii) a frameshift insertion of two adenine bases creating a premature stop codon (p.K278Efs*5) in MOGS that was detected in proband ASD-24 (Table 2).

Autosomal variants

Despite the excess and well-recognized importance of de novo mutations, only a fraction of ASD cases could be accounted for by these mutational events^{8, 17}, while cases bearing inherited events couldn’t be fully explained by this class of genetic alteration. The high heritability of ASD¹⁵ underscores the need to explore transmitted variants that are often overlooked in many of the existing exome studies, which are mainly focused on de novo variants and not designed to capture inherited variants^{8, 9, 13, 20, 23, 24}. Investigating both de novo and inherited variants has become a favored approach in recent next-generation sequencing based ASD studies^{10, 11, 14, 16,17,18}.

Considering the high rate of consanguinity in the Saudi population, a recessive model was implemented under which a total of 19 rare variants were identified in different genes (Table 2).

In addition to recessive variants, we interrogated changes transmitting in a heterozygous manner (autosomal dominant). By only considering rare variants that were transmitted to the proband from one heterozygous parent, 4 rare variants were confirmed in 4 different genes (APC2, AGL, NEB and SEMG2). Two of the variants were observed in ASD-9 occurring within a splice site region of APC2 and AGL. One missense variant affecting NEB was found in ASD-39 and a nonsense variant within SEMG2 was found in ASD-66 (Table 2).

X-linked variants

A strong male bias in ASD has been consistently observed over time and across numerous studies in different populations^35,36,37. This provides a strong clue to the potential involvement of sex chromosomes in the etiology of the disorder, which has been supported by the identification of multiple risk loci, particularly on the X-chromosome^{10, 16, 17, 19}.

By setting our analysis approach to identify inherited variants in X-chromosome genes, we detected and confirmed a total of 21 rare variants (19 missense, 1 small deletion and 1 splice site change) and each was found in a different gene (Table 2). Of these variants, 4 were found in genes not previously implicated in ASD nor listed in Autism databases (DDX26B, HTATSF1, ITIH6 and PLP1). The remaining variants existed in genes already reported in ASD or other neurological disorders.

Description of confirmed rare variants per trio

By tailoring our WES analysis pipeline to capture both de novo and inherited variants, the analysis retained at least 2 rare variants affecting different genes for most probands. The variants are discussed in detail in the supplementary data and Supplementary Table S4.

Biological processes over-represented in our study

IPA functional analysis detected significant enrichment for biological processes such as ‘Cell Signaling’, ‘Cell Morphology’ and ‘Cellular Assembly and Organization’ across the entire genes set. As for Diseases and Disorders, the analysis discovered ‘Neurological Disease’, ‘Organismal Injury and Abnormalities’ and ‘Hereditary Disorder’, among other human diseases, to be significantly correlated. Furthermore, ‘Nervous System Development and Function’, ‘Tissue Development’ and ‘Embryonic Development’ were the 3 top ranking functions under the ‘Top Physiological System Development and Function’ category (Supplementary Table S5).

Discussion

Dissecting the genetic architecture of an otherwise highly complex disorder such as ASD, comprising diverse forms of genetic alterations (from SNVs to chromosomal aberrations affecting numerous genes/loci), requires employing different approaches. Ideal approaches would be those permitting genetic investigation of ASD cases, in which a polygenic model is assumed (or strongly suspected)⁷. Therefore, capturing more classes of genetic variation, especially with the emergence of studies identifying multiple mutations in individual ASD patients of which all are predicted to contribute to the etiology of the disease, has become the preferred approach by many researchers. We, as others¹⁰, took advantage of NGS to perform comprehensive analysis in parent-child trios designed to detect both de novo and transmitted genetic variants. By considering rare variants with predicted damaging effect, we validated variants in 17 of 19 trios (~90%). We could not comment on whether our detection yield is comparable with other studies or not, due to the lack of published ASD trio exome studies of similar design using the same platform. However, Jiang and colleagues conducted a relatively similar study (utilizing an Illumina platform) in which they have reported successfully detecting medically relevant variants in 50% of their ASD trio families using whole-genome sequencing (WGS)¹⁰.

Missense variants constitute the most frequently encountered type of variation (77%) in this study followed by loss of function variants [nonsense (6%), frameshifts (6%) and splice site (9%)] (Fig. 2A). Of the 19 probands only 3 were found to carry de novo events, whereas rare variants in autosomal or X-linked genes were detected in equal number of probands. While most of the previous studies concerned with sporadic autism have focused on assessing the role of de novo variants under the hypothesis that such lesions are more likely to confer risk to ASD than inherited events^{9, 12, 13, 20, 24, 38}, we, on the contrary, observe enrichment for inherited variants. This is unsurprising given the highly inbred nature of the Saudi population in which common ancestry is suspected even in apparently non-consanguineous unions as parents are often unaware of consanguinity in previous generations. Such observation is common in Middle Eastern and North African populations³⁹.

In our study, 32 of the detected variants were observed in genes previously reported in ASD (either in Autism databases or in literature) (Table 1). For instance, SUMF1 ⁴⁰, HSPBP1 ⁴¹, TRIM9 ⁴², DUSP3 ²⁴ and BOC ⁴³. WES detected de novo variants in three genes (Table 2). One missense variant affecting a schizophrenia susceptibility gene (PRODH2)^{44, 45} and the other two were protein-truncating variants. One was identified in KDM5B. This gene was previously associated with ASD^{8, 9} and non-syndromic intellectual disability (ID)⁴⁶ whose function is related to cell cycle control and neural cells differentiation⁴⁷. The other was a frameshift insertion of two adenine bases creating a premature stop codon in MOGS. This gene encodes an endoplasmic reticulum glucosidase involved in normal immune function⁴⁸.

By applying the autosomal recessive model, we validated variants in 19 genes (Table 2). One missense variant was observed in MAN1B1, a gene causally linked to non-syndromic intellectual disability^{49, 50}, and another was found in DNAJC13 previously reported to be mutated in patients with Parkinson’s disease^{51, 52}. Two variants were identified in genes with possible, but not confirmed association with mood disorders: CRY1 ^53,54,55 and NT5DC1 ^{56, 57}. Interestingly, patient (ASD-21) harboring NT5DC1 missense variant displayed symptoms of depression and poor appetite around the age of 5 years. Amongst the autosomal transmitted variants, six were predicted to disrupt a group of genes involved in brain development and/or function namely TRIM9, BOC, SSTR3, NGF, FGF5 and CELSR2 ^{58,59,60,61,62,63,64,65}, two were found in genes related to protein homeostasis (KCTD21 ⁶⁶ and HSPBP1 ⁶⁷) and the remaining variants were identified in genes involved in diverse biological process including cell signaling (DUSP3)⁶⁸, inflammatory responses (NLRP2)⁶⁹, olfaction (OR6C65)⁷⁰, maintenance of DNA integrity (CEP152)⁷¹, and stabilization of the extracellular matrix (ITIH2)⁷².

The X-linked model revealed 21 different variants (Table 2). For instance a missense variant was detected in MXRA5, a gene that was reported to be mutated in three multiplex ASD families¹⁹. A deletion of 12 nucleotides was detected within exon 2 of AVPR2, defects in this gene cause X-linked congenital nephrogenic diabetes insipidus (NDI) and where detected in dizygotic twins with NDI and ID⁷³. One putative splice site variant was found in PDK3, a gene that encodes an enzyme involved in the shift of energy production site from the mitochondria (oxidative phosphorylation) to the cytoplasm (glycolysis) reported in cancer cells⁷⁴. The same gene was recently reported to be mutated in a single family with Charcot–Marie–Tooth disease⁷⁵. However, the patient (ASD-38) reported here to carry PDK3 splice site variant is not affected with Charcot-Marie-Tooth disease, nor does he suffer from any type of neuropathy. Moreover, of the variants detected on the X-chromosome, four were located in genes involved in brain function/development (PLP1 ⁷⁶, MAOB ⁷⁷, USP9X ⁷⁸, and FLNA ^{79, 80}) and two were present in genes regulating splicing and/or transcription (HTATSF1 ⁸¹ and GPKOW ^{82, 83}). Additional rare variants were identified in genes involved in: (i) the hydrolysis of angiotensin II (ACE2)⁸⁴ or sulfate esters (IDS)⁸⁵ and (ARSH)⁸⁶; (ii) or in the ubiquitination and proteasomal degradation of creatine kinase-B (ASB-9)⁸⁷. Furthermore, two missense variants were detected: one in ATP2B3 and another in RPS6KA6. Both genes participate in signal transduction pathways, however only ATP2B3 was linked to a human disease^{88, 89}. The remaining variants were located in genes of largely unknown function such as (CFAP47, ITIH6 and ZNF630) or in cancer-associated genes namely; INTS6L ^{90, 91} and SSX3 ⁹².

With regard to the biological functions that might be influenced by the identified genetic changes, we have compared those highlighted by the IPA with the results of the manual literature mining. Cell signaling was among the common biological themes. Typically, categories pertaining to CNS function such as those found by IPA “Nervous System Development and Function” or manual search “Neuronal function/Development” were over-represented. These categories comprised genes serving diverse functions in the CNS such as neurite outgrowth, synaptic function and formation, and neuronal migration (Dataset 1). Aside from the disruption of neuron-specific functions, defects involving basic cellular processes (presented here and elsewhere) represent another commonly encountered theme in ASD, reflecting the clinical and genetic complexity and variability of this condition¹. It is noteworthy, that several of the detected rare variants were observed in genes without currently recognized functions/roles making it difficult to propose a functional link to ASD, something that might be revealed by future studies.

One of the interesting findings emerging from this study is the identification of two genes (NLRP2 and MOGS) with established roles in the immune system. The former encodes for NLRP2 protein that plays a vital role in astrocytes innate immunity⁶⁹ and was found to be mutated in a rare type of imprinting disorder known as Beckwith-Wiedemann syndrome⁹³. The latter gene (MOGS), on the other hand, encodes an endoplasmic reticulum glucosidase involved in normal immune function and is causally linked to congenital disorders of glycosylation type II B clinically characterized by developmental and neurological defects⁴⁸. It is worth mentioning that immune dysfunction has been documented in ASD and emerging studies propose a role for the immune system in the pathophysiology of autism. In support of this notion, typical behavior and neuropathological symptoms of human ASD were successfully recapitulated in mice born to immune-activated (infected) mothers⁹⁴.

Another key finding demonstrated here, is the genetic variability not only across trios but within each proband. Defects in genes involved in diverse biological processes were observed in most probands and only few (ASD-40, 43, 58 and 73) that were enriched for variants affecting neuronal function/development. Our findings are in keeping with a growing number of reports suggesting a genetic model of autism whereby cumulative contribution of multiple inherited and de novo variants (multiple hit) in different genes (genetic heterogeneity), including genes associated with other neuropsychiatric disorders (pleiotropy) or involved in diverse biological processes beyond synaptic function (molecular diversity), shape the risk to ASD^{1, 10, 11, 16, 95}

Conclusions

Apart from Yu et al. exome sequencing study, aimed at identifying inherited SNVs in a cohort of consanguineous/multiplex families with ASD including two Middle Eastern multiplex families (one from Saudi Arabia and one from Kuwait)¹⁷, the genetic characteristics of ASD in Arab populations remain largely unexplored. This work represents, to the best of our knowledge, a comprehensive exome analysis of trios with ASD from an Arab population. Our primary goal was to identify de novo or rare inherited coding variants of potential clinical relevance through applying WES on case-parent trios from singleton families. Among the advantages of implementing this approach over WGS is that it generates fewer results and therefore is relatively less analytically challenging. Although this approach enabled us to generate a prioritized shortlist of potential deleterious variants, association and extensive functional studies are necessary to identify the disease causing ones with more certainty. Also it is important to consider the possibility of missing candidate variants within coding regions and the fact that neither common variants nor non-coding regions were considered here. In spite of the small number of cases investigated here; our current study is useful in terms of contributing a small, albeit rich data set, revealing new ASD candidate genes that may shed light on potential diagnostic and therapeutic targets.

Methods

Ethics statement

Patient recruitment and all experimental protocols used in this study were in compliance with the Declaration of Helsinki and were approved by the institution’s relevant committees; Internal Review Board (IRB), Research Ethics committee and Basic Research Committee at King Faisal Specialist Hospital and Research Center. Written informed consent was obtained from subjects before enrollment in an IRB-approved protocol (King Faisal Specialist Hospital and Research Center (KFSHRC) RAC#2080001).

ASD trios selection

A total of 19 simplex (singleton) families were selected for the study (Trio’s analysis). Venous blood samples from parents and affected child were obtained for both DNA and RNA extraction. Of the recruited families, 9 were from consanguineous marriages and 10 were not. Non-consanguineous families were included based on the rationale that in populations with high levels of consanguinity/endogamy (tribal or religious)⁹⁶, a common lineage is suspected even when couples regard themselves as unrelated as they are often unaware of consanguinity in distant generations^{39, 97}. Diagnosis of ASD was based on the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) criteria (American Psychiatric Association [APA], 2013) and ADI-R, ADOS (which are not yet validated in Arabic). Both parents were present for the interviews and assessment. All selected cases did not have symptomatic ASD secondary to known genetic or metabolic disorders (such as Fragile X syndrome, Tuberous Sclerosis, Rett syndrome, Angelman, Prader-Willi syndrome or Phenylketonuria). Available clinical and demographic information is summarized in Supplementary Table S6.

All probands were negative for any copy number variants (CNV) in previously reported ASD-associated loci/genes listed in Supplementary Table S7. CNV analysis was performed in probands using the Cyto Scan HD array (Affymetrix, Santa Clara, CA,USA). Targeted analysis was carried out using the Chromosome Analysis Suite version Cyto 3.0 using GRC 38/hg19 of the UCSC Genome Browser and the recommended threshold of log2 ratios of more than 0.58 for CNV gains and less than -1 for CNV losses was used.

Whole exome sequencing, data processing and bioinformatics analysis

WES, alignment of reads, and variant discovery

The exomes of the parents and affected children were sequenced on the Ion Proton platforms, using the whole exome AmpliSeq kit. Briefly, 12 separate Exome Primer Pools, AmpliSeq HiFi mix (Life Technologies, Carlsbad, CA, USA) and 100 ng DNA from each sample were used in the amplification step for 10 cycles. The resultant PCR products were then pooled in preparation for primer digestion using FuPa reagent (Life Technologies, Carlsbad, CA, USA). This was followed by a ligation step using Ion P1 and Ion Xpress Barcode adapters. The libraries were then purified and quantified using qPCR and the Ion Library Quantification Kit (Life Technologies, Carlsbad, CA, USA) prior to emulsion on an Ion OneTouch System. The templated Ion Sphere particles were enriched using Ion OneTouch ES (Life Technologies, Carlsbad, CA, USA) and processed for sequencing on the Ion Proton instrument (Life Technologies, Carlsbad, CA, USA). Approximately 15–17 Gb of DNA sequence was generated per sequencing run/sample. Reads were mapped to UCSC Human reference genome (hg19) (http://genome.ucsc.edu/) and variants were identified using the Ion Torrent pipeline (Life Technologies, Carlsbad, CA, USA).

The sequencing targets 293903 amplicons, covering about 19,000 genes (https://www.ampliseq.com/tmpl/view.action?tmplDesignId=50055032). The analysis pipeline is a multi-step process (Fig. 1). First, the NGS reads were subjected to quality control checks for removing any low quality reads. Then the reads were mapped (aligned) to hg19. For the mapping step, we used the program tmap, which is part of the Torrent Suite package. This program is the same as BWA but it is tuned more to the Ion Torrent technology, by including flow signal information in the alignment process. Next, variant calling was performed using the Torrent Suite Variant Caller, which is a GATK-like variant caller, but more tuned to the Ion NGS data. It takes flow signals into account and also recognizes types of sequencing errors common to this platform.

As part of a tertiary data analysis, each trio was further interrogated using four possible inheritance models; autosomal recessive, autosomal dominant, de novo and X-linked. To that end, we compared the variants detected in the affected child to the corresponding positions in the parents DNA. Basically, we grouped the variants into model-specific subsets according the mode of inheritance. For instance, a variant was considered to be de novo if it was present uniquely in the proband and absent in both parents, while if two copies of the variant were inherited by heterozygous parents, the variant was considered to be autosomal recessive (with consideration of compound heterozygosity). On the other hand, the variant was considered autosomal dominant only if it was transmitted to the proband from one heterozygous parent. Moreover, for a variant to be considered as X-linked, it had to occur on the X-chromosome in either a hemizygous (male) or a homozygous (female) state in the proband in reference to the parents.

Variants annotation

ANNOVAR (http://annovar.openbioinformatics.org) software was used to functionally annotate detected variants as previously described⁹⁸. Briefly, the software was used to perform two types of functional annotations; gene-based and filter-based. The former one identifies the genomic region containing the variant (e.g exonic or intronic), and the latter assesses the frequency of the variant in widely used databases in addition to providing functional prediction scores from a number of tools.

Variants prioritization and Sanger validation

The resulting model-specific set of variants was trimmed by omitting the ones not falling within coding exons or exon/intron boundaries, or non-functional. Then validation by Sanger sequencing was carried out on fresh DNA aliquots prior to assessing the frequency and predicted functional effect of the detected variants. The purpose of this was to avoid overlooking any interesting changes that may have been masked by variant calling/annotation errors. The list of validated variants was further trimmed by omitting those reported in dbSNP, or present in international databases (1000 Genomes project, ExAC) or in the local ethnically matching normal controls database (SHGP 2379 exomes) with a MAF ≥ 1%. Variant deleteriousness was predicted using four different web-based tools: Polyphen-2, SIFT, MutationTaster and CADD as described^{99,100,101,102}. For predicting splice-site variants effect, we utilized PredictSNP2 web interface for its improved performance compared to CADD¹⁰³. In addition, available public Autism genes/variants databases (SFARI Gene; https://gene.sfari.org/autdb/Welcome.do, and AutismKB; http://autismkb.cbi.pku.edu.cn/) were checked for reported SNVs and CNVs in our list of validated genes or bands harboring these genes in case of CNV^{104, 105}. Finally, segregation analysis was carried out in families where DNA from unaffected family members (siblings) was available (ASD-17,-24, -37 and -69).

Relatedness assessment

Three different relationship inference algorithms were employed in this study to verify the self-reported pedigree information. The first is Yang et al. method, which is based on the A _jk statistics¹⁰⁶. For this method we used the implementation in the VCFtools package (with option –relatedness 1). In the second method, we computed shared homozygosity by comparing the homozygous variants (MAF > 1%) between each pair of individuals and counting the number of overlapping variants. Relatedness was denoted as “Not confirmed” if the number of shared homozygous variants was less than 25 SD above the mean (123 ± 25). The third method estimates kinship as described by Manichaikul and colleagues¹⁰⁷ using KING program (http://people.virginia.edu/~wc9c/KING/) and the implementation in VCFtools (option –relatedness2).

Biological functions analysis

In an attempt to gain insight into the biological processes that may potentially be affected by the variants identified herein, we analyzed our final list of genes with validated variants using the core analysis function of Ingenuity Pathway Analysis software (IPA) (IPA®, version 01–07, QIAGEN, Redwood City, https://analysis.ingenuity.com/pa/installer/select). The software identifies the diseases and biological functions that are significantly associated with our genes list. The resulting categories are ranked by p-values calculated using right-tailed Fisher’s exact test.

In parallel, we have grouped gene-sets on the basis of their function (by means of manual literature research) into different categories including: “Neuronal function/Development”, “Mitochondrial function/Energy metabolism”, “Protein quality control”, “Gene regulation”, “Cell signaling cascade”, “Cell division or differentiation”, and “Immune responses” (Table 1).

References

Geschwind, D. H. & Flint, J. Genetics and genomics of psychiatric disease. Science 349, 1489–1494, doi:10.1126/science.aaa8954 (2015).
Article CAS PubMed PubMed Central ADS Google Scholar
Warrier, V., Chee, V., Smith, P., Chakrabarti, B. & Baron-Cohen, S. A comprehensive meta-analysis of common genetic variants in autism spectrum conditions. Molecular autism 6, 49, doi:10.1186/s13229-015-0041-0 (2015).
Article PubMed PubMed Central CAS Google Scholar
Hadley, D. et al. The impact of the metabotropic glutamate receptor and other gene family interaction networks on autism. Nature communications 5, 4074, doi:10.1038/ncomms5074 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schaefer, G. B. & Mendelsohn, N. J. Clinical genetics evaluation in identifying the etiology of autism spectrum disorders: 2013 guideline revisions. Genetics in medicine: official journal of the American College of Medical Genetics 15, 399–407, doi:10.1038/gim.2013.32 (2013).
Article CAS Google Scholar
Nakai, N., Otsuka, S., Myung, J. & Takumi, T. Autism spectrum disorder model mice: Focus on copy number variation and epigenetics. Science China. Life sciences 58, 976–984, doi:10.1007/s11427-015-4891-7 (2015).
Article CAS PubMed Google Scholar
Sener, E. F. Association of Copy Number Variations in Autism Spectrum Disorders: A Systematic Review. Chinese Journal of Biology 2014, 9, doi:10.1155/2014/713109 (2014).
Article CAS Google Scholar
Devlin, B. & Scherer, S. W. Genetic architecture in autism spectrum disorder. Current opinion in genetics & development 22, 229–237, doi:10.1016/j.gde.2012.03.002 (2012).
Article CAS Google Scholar
Iossifov, I. et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature 515, 216–221, doi:10.1038/nature13908 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Iossifov, I. et al. De novo gene disruptions in children on the autistic spectrum. Neuron 74, 285–299, doi:10.1016/j.neuron.2012.04.009 (2012).
Article CAS PubMed PubMed Central Google Scholar
Jiang, Y. H. et al. Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing. American journal of human genetics 93, 249–263, doi:10.1016/j.ajhg.2013.06.012 (2013).
Article CAS PubMed PubMed Central Google Scholar
Krumm, N. et al. Excess of rare, inherited truncating mutations in autism. Nature genetics 47, 582–588, doi:10.1038/ng.3303 (2015).
Article CAS PubMed PubMed Central ADS Google Scholar
O’Roak, B. J. et al. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nature genetics 43, 585–589, doi:10.1038/ng.835 (2011).
Article PubMed PubMed Central CAS Google Scholar
O’Roak, B. J. et al. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485, 246–250, doi:10.1038/nature10989 (2012).
Article PubMed PubMed Central ADS CAS Google Scholar
Shi, L. et al. Whole-genome sequencing in an autism multiplex family. Molecular autism 4, 8, doi:10.1186/2040-2392-4-8 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sullivan, P. F., Daly, M. J. & O’Donovan, M. Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nature reviews. Genetics 13, 537–551, doi:10.1038/nrg3240 (2012).
Article CAS PubMed PubMed Central Google Scholar
Toma, C. et al. Exome sequencing in multiplex autism families suggests a major role for heterozygous truncating mutations. Molecular psychiatry 19, 784–790, doi:10.1038/mp.2013.106 (2014).
Article CAS PubMed Google Scholar
Yu, T. W. et al. Using whole-exome sequencing to identify inherited causes of autism. Neuron 77, 259–273, doi:10.1016/j.neuron.2012.11.002 (2013).
Article CAS PubMed PubMed Central Google Scholar
Yuen, R. K. et al. Whole-genome sequencing of quartet families with autism spectrum disorder. Nature medicine 21, 185–191, doi:10.1038/nm.3792 (2015).
Article CAS PubMed Google Scholar
Nava, C. et al. Analysis of the chromosome X exome in patients with autism spectrum disorders identified novel candidate genes, including TMLHE. Translational psychiatry 2, e179, doi:10.1038/tp.2012.102 (2012).
Article CAS PubMed PubMed Central Google Scholar
Neale, B. M. et al. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485, 242–245, doi:10.1038/nature11011 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
De Rubeis, S. et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515, 209–215, doi:10.1038/nature13772 (2014).
Article PubMed PubMed Central CAS Google Scholar
Jimenez-Barron, L. T. et al. Genome-wide variant analysis of simplex autism families with an integrative clinical-bioinformatics pipeline. Cold Spring Harbor molecular case studies 1, a000422, doi:10.1101/mcs.a000422 (2015).
Article PubMed PubMed Central Google Scholar
Michaelson, J. J. et al. Whole-genome sequencing in autism identifies hot spots for de novo germline mutation. Cell 151, 1431–1442, doi:10.1016/j.cell.2012.11.019 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sanders, S. J. et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485, 237–241, doi:10.1038/nature10945 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Christian, S. L. et al. Novel submicroscopic chromosomal abnormalities detected in autism spectrum disorder. Biological psychiatry 63, 1111–1117, doi:10.1016/j.biopsych.2008.01.009 (2008).
Article CAS PubMed PubMed Central Google Scholar
Jacquemont, M. L. et al. Array-based comparative genomic hybridisation identifies high frequency of cryptic chromosomal rearrangements in patients with syndromic autism spectrum disorders. Journal of medical genetics 43, 843–849, doi:10.1136/jmg.2006.043166 (2006).
Article CAS PubMed PubMed Central Google Scholar
Levy, D. et al. Rare de novo and transmitted copy-number variation in autistic spectrum disorders. Neuron 70, 886–897, doi:10.1016/j.neuron.2011.05.015 (2011).
Article CAS PubMed Google Scholar
Marshall, C. R. et al. Structural variation of chromosomes in autism spectrum disorder. American journal of human genetics 82, 477–488, doi:10.1016/j.ajhg.2007.12.009 (2008).
Article CAS PubMed PubMed Central Google Scholar
Poultney, C. S. et al. Identification of small exonic CNV from whole-exome sequence data and application to autism spectrum disorder. American journal of human genetics 93, 607–619, doi:10.1016/j.ajhg.2013.09.001 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sebat, J. et al. Strong association of de novo copy number mutations with autism. Science 316, 445–449, doi:10.1126/science.1138659 (2007).
Article CAS PubMed PubMed Central ADS Google Scholar
Szatmari, P. et al. Mapping autism risk loci using genetic linkage and chromosomal rearrangements. Nature genetics 39, 319–328, doi:10.1038/ng1985 (2007).
Article CAS PubMed PubMed Central Google Scholar
Miles, J. H. Autism spectrum disorders–a genetics review. Genetics in medicine: official journal of the American College of Medical Genetics 13, 278–294, doi:10.1097/GIM.0b013e3181ff67ba (2011).
Article MathSciNet Google Scholar
Walsh, K. M. & Bracken, M. B. Copy number variation in the dosage-sensitive 16p11.2 interval accounts for only a small proportion of autism incidence: a systematic review and meta-analysis. Genetics in medicine: official journal of the American College of Medical Genetics 13, 377–384, doi:10.1097/GIM.0b013e3182076c0c (2011).
Article Google Scholar
Weiss, L. A. et al. Association between microdeletion and microduplication at 16p11.2 and autism. The New England journal of medicine 358, 667–675, doi:10.1056/NEJMoa075974 (2008).
Article CAS PubMed Google Scholar
Gillberg, C., Cederlund, M., Lamberg, K. & Zeijlon, L. Brief report: “the autism epidemic”. The registered prevalence of autism in a Swedish urban area. Journal of autism and developmental disorders 36, 429–435, doi:10.1007/s10803-006-0081-6 (2006).
Article PubMed Google Scholar
Skuse, D. H. Imprinting, the X-chromosome, and the male brain: explaining sex differences in the liability to autism. Pediatric research 47, 9–16 (2000).
Article CAS PubMed Google Scholar
Werling, D. M. & Geschwind, D. H. Recurrence rates provide evidence for sex-differential, familial genetic liability for autism spectrum disorders in multiplex families and twins. Molecular autism 6, 27, doi:10.1186/s13229-015-0004-5 (2015).
Article PubMed PubMed Central Google Scholar
Sanders, S. J. et al. Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism. Neuron 70, 863–885, doi:10.1016/j.neuron.2011.05.002 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ben Halim, N. et al. Consanguinity, endogamy, and genetic disorders in Tunisia. Journal of community genetics 4, 273–284, doi:10.1007/s12687-012-0128-7 (2013).
Article MathSciNet PubMed Google Scholar
Gai, X. et al. Rare structural variation of synapse and neurotransmission genes in autism. Molecular psychiatry 17, 402–411, doi:10.1038/mp.2011.10 (2012).
Article CAS PubMed Google Scholar
Nishimura, Y. et al. Genome-wide expression profiling of lymphoblastoid cell lines distinguishes different forms of autism and reveals shared pathways. Human molecular genetics 16, 1682–1698, doi:10.1093/hmg/ddm116 (2007).
Article CAS PubMed Google Scholar
Butler, M. G., Rafi, S. K., Hossain, W., Stephan, D. A. & Manzardo, A. M. Whole exome sequencing in females with autism implicates novel and candidate genes. International journal of molecular sciences 16, 1312–1335, doi:10.3390/ijms16011312 (2015).
Article CAS PubMed PubMed Central Google Scholar
Quintela, I. et al. Female patient with autistic disorder, intellectual disability, and co-morbid anxiety disorder: Expanding the phenotype associated with the recurrent 3q13.2-q13.31 microdeletion. American journal of medical genetics. Part A 167, 3121–3129, doi:10.1002/ajmg.a.37292 (2015).
Article CAS Google Scholar
Chakravarti, A. A compelling genetic hypothesis for a complex disease: PRODH2/DGCR6 variation leads to schizophrenia susceptibility. Proceedings of the National Academy of Sciences of the United States of America 99, 4755–4756, doi:10.1073/pnas.092158299 (2002).
Article CAS PubMed PubMed Central ADS Google Scholar
Liu, H. et al. Genetic variation at the 22q11 PRODH2/DGCR6 locus presents an unusual pattern and increases susceptibility to schizophrenia. Proceedings of the National Academy of Sciences of the United States of America 99, 3717–3722, doi:10.1073/pnas.042700699 (2002).
Article CAS PubMed PubMed Central ADS Google Scholar
Athanasakis, E. et al. Next generation sequencing in nonsyndromic intellectual disability: from a negative molecular karyotype to a possible causative mutation detection. American journal of medical genetics. Part A 164A, 170–176, doi:10.1002/ajmg.a.36274 (2014).
Article PubMed CAS Google Scholar
Dey, B. K. et al. The histone demethylase KDM5b/JARID1b plays a role in cell fate decisions by blocking terminal differentiation. Molecular and cellular biology 28, 5312–5327, doi:10.1128/mcb.00128-08 (2008).
Article CAS PubMed PubMed Central Google Scholar
Lyons, J. J., Milner, J. D. & Rosenzweig, S. D. Glycans Instructing Immunity: The Emerging Role of Altered Glycosylation in Clinical Immunology. Frontiers in pediatrics 3, 54, doi:10.3389/fped.2015.00054 (2015).
Article PubMed PubMed Central Google Scholar
Najmabadi, H. et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature 478, 57–63, doi:10.1038/nature10423 (2011).
Article CAS PubMed ADS Google Scholar
Rafiq, M. A. et al. Mutations in the alpha 1,2-mannosidase gene, MAN1B1, cause autosomal-recessive intellectual disability. American journal of human genetics 89, 176–182, doi:10.1016/j.ajhg.2011.06.006 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gustavsson, E. K. et al. DNAJC13 genetic variants in parkinsonism. Movement disorders: official journal of the Movement Disorder Society 30, 273–278, doi:10.1002/mds.26064 (2015).
Article CAS Google Scholar
Vilarino-Guell, C. et al. DNAJC13 mutations in Parkinson disease. Human molecular genetics 23, 1794–1801, doi:10.1093/hmg/ddt570 (2014).
Article PubMed CAS Google Scholar
Hua, P. et al. Cry1 and Tef gene polymorphisms are associated with major depressive disorder in the Chinese population. Journal of affective disorders 157, 100–103, doi:10.1016/j.jad.2013.11.019 (2014).
Article CAS PubMed Google Scholar
Rosenwasser, A. M. Circadian clock genes: non-circadian roles in sleep, addiction, and psychiatric disorders? Neuroscience and biobehavioral reviews 34, 1249–1255, doi:10.1016/j.neubiorev.2010.03.004 (2010).
Article PubMed Google Scholar
Soria, V. et al. Differential association of circadian genes with mood disorders: CRY1 and NPAS2 are associated with unipolar major depression and CLOCK and VIP with bipolar disorder. Neuropsychopharmacology: official publication of the American College of Neuropsychopharmacology 35, 1279–1289, doi:10.1038/npp.2009.230 (2010).
Article CAS Google Scholar
Bigdeli, T. B. et al. Association study of 83 candidate genes for bipolar disorder in chromosome 6q selected using an evidence-based prioritization algorithm. American journal of medical genetics. Part B, Neuropsychiatric genetics: the official publication of the International Society of Psychiatric Genetics 162B, 898–906, doi:10.1002/ajmg.b.32200 (2013).
Article CAS Google Scholar
Nurnberger, J. I. Jr. et al. Identification of pathways for bipolar disorder: a meta-analysis. JAMA psychiatry 71, 657–664, doi:10.1001/jamapsychiatry.2014.176 (2014).
Article CAS PubMed PubMed Central Google Scholar
Berry, A., Bindocci, E. & Alleva, E. NGF, brain and behavioral plasticity. Neural plasticity 2012, 784040, doi:10.1155/2012/784040 (2012).
Article PubMed PubMed Central Google Scholar
Courchet, J. & Polleux, F. Sonic hedgehog, BOC, and synaptic development: new players for an old game. Neuron 73, 1055–1058, doi:10.1016/j.neuron.2012.03.008 (2012).
Article CAS PubMed Google Scholar
Green, J. A., Gu, C. & Mykytyn, K. Heteromerization of ciliary G protein-coupled receptors in the mouse brain. PloS one 7, e46304, doi:10.1371/journal.pone.0046304 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Guadiana, S. M. et al. Arborization of dendrites by developing neocortical neurons is dependent on primary cilia and type 3 adenylyl cyclase. The Journal of neuroscience: the official journal of the Society for Neuroscience 33, 2626–2638, doi:10.1523/jneurosci.2906-12.2013 (2013).
Article CAS Google Scholar
Qu, Y. et al. Genetic evidence that Celsr3 and Celsr2, together with Fzd3, regulate forebrain wiring in a Vangl-independent manner. Proceedings of the National Academy of Sciences of the United States of America 111, E2996–3004, doi:10.1073/pnas.1402105111 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Winkle, C. C. et al. A novel Netrin-1-sensitive mechanism promotes local SNARE-mediated exocytosis during axon branching. The Journal of cell biology 205, 217–232, doi:10.1083/jcb.201311003 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lindholm, D. et al. Fibroblast growth factor-5 promotes differentiation of cultured rat septal cholinergic and raphe serotonergic neurons: comparison with the effects of neurotrophins. The European journal of neuroscience 6, 244–252 (1994).
Article CAS PubMed Google Scholar
Reuss, B., Dono, R. & Unsicker, K. Functions of fibroblast growth factor (FGF)-2 and FGF-5 in astroglial differentiation and blood-brain barrier permeability: evidence from mouse mutants. The Journal of neuroscience: the official journal of the Society for Neuroscience 23, 6404–6412 (2003).
CAS Google Scholar
Liu, Z., Xiang, Y. & Sun, G. The KCTD family of proteins: structure, function, disease relevance. Cell & bioscience 3, 45, doi:10.1186/2045-3701-3-45 (2013).
Article Google Scholar
Rogon, C. et al. HSP70-binding protein HSPBP1 regulates chaperone expression at a posttranslational level and is essential for spermatogenesis. Molecular biology of the cell 25, 2260–2271, doi:10.1091/mbc.E14-02-0742 (2014).
Article PubMed PubMed Central CAS Google Scholar
Patterson, K. I., Brummer, T., O’Brien, P. M. & Daly, R. J. Dual-specificity phosphatases: critical regulators with diverse cellular targets. The Biochemical journal 418, 475–489 (2009).
Article CAS PubMed Google Scholar
Minkiewicz, J., de Rivero Vaccari, J. P. & Keane, R. W. Human astrocytes express a novel NLRP2 inflammasome. Glia 61, 1113–1121, doi:10.1002/glia.22499 (2013).
Article PubMed Google Scholar
Olender, T., Lancet, D. & Nebert, D. W. Update on the olfactory receptor (OR) gene superfamily. Human genomics 3, 87–97 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kalay, E. et al. CEP152 is a genome maintenance protein disrupted in Seckel syndrome. Nature genetics 43, 23–26, doi:10.1038/ng.725 (2011).
Article CAS PubMed Google Scholar
Hamm, A. et al. Frequent expression loss of Inter-alpha-trypsin inhibitor heavy chain (ITIH) genes in multiple human solid tumors: a systematic expression analysis. BMC cancer 8, 25, doi:10.1186/1471-2407-8-25 (2008).
Article PubMed PubMed Central CAS Google Scholar
Huang, L., Poke, G., Gecz, J. & Gibson, K. A novel contiguous gene deletion of AVPR2 and ARHGAP4 genes in male dizygotic twins with nephrogenic diabetes insipidus and intellectual disability. American journal of medical genetics. Part A 158A, 2511–2518, doi:10.1002/ajmg.a.35591 (2012).
Article PubMed CAS Google Scholar
Zhao, Y., Butler, E. B. & Tan, M. Targeting cellular metabolism to improve cancer therapeutics. Cell death & disease 4, e532, doi:10.1038/cddis.2013.60 (2013).
Article CAS Google Scholar
Kennerson, M. L. et al. A new locus for X-linked dominant Charcot-Marie-Tooth disease (CMTX6) is caused by mutations in the pyruvate dehydrogenase kinase isoenzyme 3 (PDK3) gene. Human molecular genetics 22, 1404–1416, doi:10.1093/hmg/dds557 (2013).
Article CAS PubMed PubMed Central Google Scholar
Nave, K. A. Myelination and the trophic support of long axons. Nature reviews. Neuroscience 11, 275–283, doi:10.1038/nrn2797 (2010).
Article CAS PubMed Google Scholar
Fagervall, I. & Ross, S. B. A and B forms of monoamine oxidase within the monoaminergic neurons of the rat brain. Journal of neurochemistry 47, 569–576 (1986).
Article CAS PubMed Google Scholar
Homan, C. C. et al. Mutations in USP9X are associated with X-linked intellectual disability and disrupt neuronal cell migration and growth. American journal of human genetics 94, 470–478, doi:10.1016/j.ajhg.2014.02.004 (2014).
Article CAS PubMed PubMed Central Google Scholar
Feng, Y. et al. Filamin A (FLNA) is required for cell-cell contact in vascular development and cardiac morphogenesis. Proceedings of the National Academy of Sciences of the United States of America 103, 19836–19841, doi:10.1073/pnas.0609628104 (2006).
Article CAS PubMed PubMed Central ADS Google Scholar
Zhang, L. et al. MEK-ERK1/2-dependent FLNA overexpression promotes abnormal dendritic patterning in tuberous sclerosis independent of mTOR. Neuron 84, 78–91, doi:10.1016/j.neuron.2014.09.009 (2014).
Article CAS PubMed PubMed Central Google Scholar
Miller, H. B., Robinson, T. J., Gordan, R., Hartemink, A. J. & Garcia-Blanco, M. A. Identification of Tat-SF1 cellular targets by exon array analysis reveals dual roles in transcription and splicing. RNA (New York, N.Y.) 17, 665–674, doi:10.1261/rna.2462011 (2011).
Article CAS Google Scholar
Aksaas, A. K. et al. G-patch domain and KOW motifs-containing protein, GPKOW; a nuclear RNA-binding protein regulated by protein kinase A. Journal of molecular signaling 6, 10, doi:10.1186/1750-2187-6-10 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zang, S. et al. GPKOW is essential for pre-mRNA splicing in vitro and suppresses splicing defect caused by dominant-negative DHX16 mutation in vivo. Bioscience reports 34, e00163, doi:10.1042/bsr20140142 (2014).
Article PubMed PubMed Central CAS Google Scholar
Guimond, M. O. & Gallo-Payet, N. The Angiotensin II Type 2 Receptor in Brain Functions: An Update. International journal of hypertension 2012, 351758, doi:10.1155/2012/351758 (2012).
Article PubMed PubMed Central Google Scholar
Wilson, P. J. et al. Hunter syndrome: isolation of an iduronate-2-sulfatase cDNA clone and analysis of patient DNA. Proceedings of the National Academy of Sciences of the United States of America 87, 8531–8535 (1990).
Article CAS PubMed PubMed Central ADS Google Scholar
Diez-Roux, G. & Ballabio, A. Sulfatases and human disease. Annual review of genomics and human genetics 6, 355–379, doi:10.1146/annurev.genom.6.080604.162334 (2005).
Article CAS PubMed Google Scholar
Debrincat, M. A. et al. Ankyrin repeat and suppressors of cytokine signaling box protein asb-9 targets creatine kinase B for degradation. The Journal of biological chemistry 282, 4728–4737, doi:10.1074/jbc.M609164200 (2007).
Article CAS PubMed Google Scholar
Cargnello, M. & Roux, P. P. Activation and function of the MAPKs and their substrates, the MAPK-activated protein kinases. Microbiology and molecular biology reviews: MMBR 75, 50–83, doi:10.1128/mmbr.00031-10 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zanni, G. et al. Mutation of plasma membrane Ca2+ ATPase isoform 3 in a family with X-linked congenital cerebellar ataxia impairs Ca2+ homeostasis. Proceedings of the National Academy of Sciences of the United States of America 109, 14514–14519, doi:10.1073/pnas.1207488109 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Bohm, M. et al. Genetic Variants of DICE1/INTS6 in German Prostate Cancer Families with Linkage to 13q14. Urologia internationalis 95, 386–389, doi:10.1159/000366229 (2015).
Article PubMed CAS Google Scholar
Filleur, S. et al. INTS6/DICE1 inhibits growth of human androgen-independent prostate cancer cells by altering the cell cycle profile and Wnt signaling. Cancer cell international 9, 28, doi:10.1186/1475-2867-9-28 (2009).
Article PubMed PubMed Central CAS Google Scholar
Zendman, A. J., Ruiter, D. J. & Van Muijen, G. N. Cancer/testis-associated genes: identification, expression profile, and putative function. Journal of cellular physiology 194, 272–288, doi:10.1002/jcp.10215 (2003).
Article CAS PubMed Google Scholar
Meyer, E. et al. Germline mutation in NLRP2 (NALP2) in a familial imprinting disorder (Beckwith-Wiedemann Syndrome). PLoS genetics 5, e1000423, doi:10.1371/journal.pgen.1000423 (2009).
Article PubMed PubMed Central CAS Google Scholar
Hsiao, E. Y., McBride, S. W., Chow, J., Mazmanian, S. K. & Patterson, P. H. Modeling an autism risk factor in mice leads to permanent immune dysregulation. Proceedings of the National Academy of Sciences of the United States of America 109, 12776–12781, doi:10.1073/pnas.1202556109 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Huguet, G., Ey, E. & Bourgeron, T. The genetic landscapes of autism spectrum disorders. Annual review of genomics and human genetics 14, 191–213, doi:10.1146/annurev-genom-091212-153431 (2013).
Article CAS PubMed Google Scholar
El Mouzan, M. I., Al Salloum, A. A., Al Herbish, A. S., Qurachi, M. M. & Al Omar, A. A. Consanguinity and major genetic disorders in Saudi children: a community-based cross-sectional study. Annals of Saudi medicine 28, 169–173 (2008).
Article PubMed Google Scholar
Bittles, A. H. & Black, M. L. Evolution in health and medicine Sackler colloquium: Consanguinity, human evolution, and complex diseases. Proceedings of the National Academy of Sciences of the United States of America 107(Suppl 1), 1779–1786, doi:10.1073/pnas.0906079106 (2010).
Article CAS PubMed PubMed Central ADS Google Scholar
Saudi Mendeliome Group. Comprehensive gene panels provide advantages over clinical exome sequencing for Mendelian diseases. Genome biology 16, 134, doi:10.1186/s13059-015-0693-2 (2015).
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nature genetics 46, 310–315, doi:10.1038/ng.2892 (2014).
Article CAS PubMed PubMed Central Google Scholar
Adzhubei, I., Jordan, D. M. & Sunyaev, S. R. Predicting functional effect of human missense mutations using PolyPhen-2. Current protocols in human genetics/editorial board, Jonathan L. Haines. [et al.] Chapter 7, Unit7 20, doi:10.1002/0471142905.hg0720s76 (2013).
Schwarz, J. M., Rodelsperger, C., Schuelke, M. & Seelow, D. MutationTaster evaluates disease-causing potential of sequence alterations. Nature methods 7, 575–576, doi:10.1038/nmeth0810-575 (2010).
Article CAS PubMed Google Scholar
Sim, N. L. et al. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic acids research 40, W452–457, doi:10.1093/nar/gks539 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bendl, J. et al. PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions. PLoS computational biology 12, e1004962, doi:10.1371/journal.pcbi.1004962 (2016).
Article PubMed PubMed Central CAS Google Scholar
Basu, S. N., Kollu, R. & Banerjee-Basu, S. AutDB: a gene reference resource for autism research. Nucleic acids research 37, D832–836, doi:10.1093/nar/gkn835 (2009).
Article CAS PubMed Google Scholar
Xu, L. M. et al. AutismKB: an evidence-based knowledgebase of autism genetics. Nucleic acids research 40, D1016–1022, doi:10.1093/nar/gkr1145 (2012).
Article CAS PubMed Google Scholar
Yang, J. et al. Common SNPs explain a large proportion of the heritability for human height. Nature genetics 42, 565–569, doi:10.1038/ng.608 (2010).
Article CAS PubMed PubMed Central Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics (Oxford, England) 26, 2867–2873, doi:10.1093/bioinformatics/btq559 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

All NGS library building, sequencing and bioinformatics analysis was performed by the Saudi Human Genome Program (SHGP) at King Abdulaziz City for Science and Technology (KACST) and at KFSHRC. In addition, we would like to thank sequencing and genotyping core facilities in the department of Genetics at KFSHRC for performing sequencing and HDCytoscan.

Author information

Authors and Affiliations

Behavioral Genetics unit, Department of Genetics, King Faisal Specialist Hospital & Research Center, P.O. Box 3354, Riyadh, 11211, Saudi Arabia
Bashayer Al-Mubarak, Aisha Omar, Asma Tahir, Ahmed Almusaiab, Amna Magrashi, Jameela Shinwari & Nada Al Tassan
Saudi Human Genome Program, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
Bashayer Al-Mubarak, Mohamed Abouelhoda, Mohamed El-Kalioby, Ewa Goljan, Renad Albar, Shazia Subhani, Dorota Monies & Nada Al Tassan
Department of Genetics, King Faisal Specialist Hospital & Research Centre, P.O. Box 3354, Riyadh, 11211, Saudi Arabia
Mohamed Abouelhoda, Mohamed El-Kalioby, Ewa Goljan, Shazia Subhani & Dorota Monies
Center for Autism Research, King Faisal Specialist Hospital & Research Centre, P.O. Box 3354, Riyadh, 11211, Saudi Arabia
Hesham AlDhalaan, Mohammed Aldosari, Sultana Asfahani & Alaa Eskandrani
Department of Neurosciences, King Faisal Specialist Hospital & Research Centre, P.O. Box 3354, Riyadh, 11211, Saudi Arabia
Hesham AlDhalaan & Michael Nester
Pediatric Department, King Faisal Specialist Hospital & Research Centre, P.O. Box 3354, Riyadh, 11211, Saudi Arabia
Hussain. A. Alshamrani

Authors

Bashayer Al-Mubarak
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Abouelhoda
View author publications
You can also search for this author in PubMed Google Scholar
Aisha Omar
View author publications
You can also search for this author in PubMed Google Scholar
Hesham AlDhalaan
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed Aldosari
View author publications
You can also search for this author in PubMed Google Scholar
Michael Nester
View author publications
You can also search for this author in PubMed Google Scholar
Hussain. A. Alshamrani
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed El-Kalioby
View author publications
You can also search for this author in PubMed Google Scholar
Ewa Goljan
View author publications
You can also search for this author in PubMed Google Scholar
Renad Albar
View author publications
You can also search for this author in PubMed Google Scholar
Shazia Subhani
View author publications
You can also search for this author in PubMed Google Scholar
Asma Tahir
View author publications
You can also search for this author in PubMed Google Scholar
Sultana Asfahani
View author publications
You can also search for this author in PubMed Google Scholar
Alaa Eskandrani
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Almusaiab
View author publications
You can also search for this author in PubMed Google Scholar
Amna Magrashi
View author publications
You can also search for this author in PubMed Google Scholar
Jameela Shinwari
View author publications
You can also search for this author in PubMed Google Scholar
Dorota Monies
View author publications
You can also search for this author in PubMed Google Scholar
Nada Al Tassan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception of the study and experiment design: N.A.T. Whole exome sequencing data analysis and variant validation: A.O., J.S., A.M., A.T. and A.A. Whole exome sequencing: E.G. and R.A. Supervision of the whole exome sequencing and Sanger sequencing: D.M. Bioinformatics data analysis: M.A., M.E. and S.S. Data interpretation: B.A. and N.A.T. Patients recruitment and assessment: H.A., M.A., M.N., S.A., H.A.A. and A.E. Manuscript drafting: B.A. and N.A.T. D.M. and M.A. contributed to manuscript writing.

Corresponding authors

Correspondence to Bashayer Al-Mubarak or Nada Al Tassan.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Dataset1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Al-Mubarak, B., Abouelhoda, M., Omar, A. et al. Whole exome sequencing reveals inherited and de novo variants in autism spectrum disorder: a trio study from Saudi families. Sci Rep 7, 5679 (2017). https://doi.org/10.1038/s41598-017-06033-1

Download citation

Received: 16 February 2017
Accepted: 06 June 2017
Published: 18 July 2017
DOI: https://doi.org/10.1038/s41598-017-06033-1

This article is cited by

Evaluation of AI tools for healthcare networks at the cloud-edge interaction to diagnose autism in educational environments
- Yue Pan
- Andia Foroughi
Journal of Cloud Computing (2024)
An immunogenomic exome landscape of triple positive primary antiphospholipid patients
- A. Guffroy
- L. Jacquel
- A. S. Korganow
Genes & Immunity (2024)
Whole exome sequencing revealed variants in four genes underlying X-linked intellectual disability in four Iranian families: novel deleterious variants and clinical features with the review of literature
- Atefeh Mir
- Yongjun Song
- Mohammad Amin Tabatabaiefar
BMC Medical Genomics (2023)
Celsr2 regulates NMDA receptors and dendritic homeostasis in dorsal CA1 to enable social memory
- Bailing Chen
- Laijian Wang
- Yibo Qu
Molecular Psychiatry (2022)
Role of CNTNAP2 in autism manifestation outlines the regulation of signaling between neurons at the synapse
- Swati Agarwala
- Nallur B. Ramachandra
Egyptian Journal of Medical Human Genetics (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.