Main

Canavan disease (CD) or Canavan-Van Bogaerrt-Bertrand disease (OMIM 271900) is a spongiform encephalopathy caused by the deficiency of the cytosolic enzyme aspartoacylase (ASPA, EC 3.5.1.15), which subsequently causes the accumulation of N-acetylaspartic acid (NAA) in urine and in the central nervous system.1 This disease is common in Jews of Ashkenazi origin2 and is one of the most common metabolic errors in Saudi Arabia.3

Human ASPA is located on the short arm of chromosome 17 (17p13ter) and spans approximately 23,405 bp, comprises six exons, and encodes 313 amino acids.4 ASPA is a metalloprotease; a zinc containing carboxypeptidase. Studies performed by Herga et al. and Bitto et al. 5,6 have identified the zinc binding ligands and the putative catalytic site. The triad of His21, Glu24, and His116 perfectly form a niche where zinc is located.6 A Glu178 is located in the vicinity of the zinc metal site and has been considered to be the active site of the enzyme. The inhibition of the native enzyme by two zinc-chelating agents EDTA and 1,10-phenanthroline, and by the histidine specific reagent DEPC, strongly confirms that ASPA is a zinc metalloenzyme.6

ASPA shows remarkable similarity to carboxypeptidase A and probably functions in a similar manner that is by a “promoted-water pathway.” It is hypothesized that GLU178 behaves as a general base and activates a water molecule kept appropriately in coordination with zinc at the active site.5 The structure of its carboxy-terminal does not alter other protein degradation pathways except for N-acetylaspartate.5 This is important since ASPA is a cytosolic enzyme and apparently acts only as a specific peptidase. The C-domain also forms a pocket that can accommodate only NAA.5 In the mouse gene knockout model, it has been shown that ASPA does not hydrolyze another abundant brain peptide N-acetyl-aspartyl-glutamate nor does it compete with glutamate carboxypeptidase II activity.7

Bitto et al.5 conducted careful crystallographic protein analysis in rat and human ASPA. They have classified the ASPA mutations into five groups: those that affect (1) zinc coordination, (2) hydrogen bonding, (3) hydrophobic core, (4) premature termination, and (5) destabilization of the dimer surface. Also, a model-based study by Hershfield et al.8 indicated that the mutations at the putative zinc-binding residues (H121G, E24/D/G, and H116G0), the general proton donor (E178A), and mutants that switch over the order of zinc-binding residues (H121E/E24H and E24H/H16E) produced full proteins with no ASPA activity. Mutations at the carboxyl terminal (R71N) also lead to undetectable ASPA activity. Most mutations found in the patients lead to a severe phenotype and undetectable ASPA activity. Only rare mutations such as G212A lead to a milder phenotype compatible with a prolonged life span.9

CD in Ashkenazi Jews is associated with two point mutations: E285A and Y231X, whereas different point mutations are found in non-Jewish patients of European descent.4,10,11 In both ethnic groups, certain patients have unusual mutations. In a non-Jewish population in New York, Zeng et al.12,13 found 14 novel mutations including three deletions 923delT, 33del13, and 244delA. In non-Jewish patients, Kaul et al.11 found a 1 bp deletion (32delta T) and 4 bp deletion (876δAGAA).

Various studies to elucidate the pathophysiology of CD have been performed, and it has been reported that the NAA is synthesized in neurons probably serving as a rapidly functioning water pump that effluxes water against a gradient. The NAA then enters into oligodendrocytes where ASPA deacetylases it. When NAA is present in excess, mitochondrial swelling occurs.14,15 However, it is still not clear why this mitochondrial changes occurs. Demyelination is a constant feature of this disease and accounts for the clinical appearance of early megalencephaly, severe pyramidal tract lesions, blindness, and early death.

In this study we describe mutation analysis of CD patients in Saudi Arabia and present genome-wide gene expression profiles performed on total RNA isolated from cultured fibroblasts of two Saudi CD patients. We compared our gene expression findings with the previous studies performed in the brain of CD knockout mouse by Surendran et al.1618 Our genome-wide expression studies showed many other critical changes in genes involved in glutamate and aspartate metabolism, cytoskeletal processes, cell motility-related activities, different growth factor signaling pathways, axon guidance pathways, muscle contraction and development, and mitochondrial oxidation were significantly affected. We report in the Saudi population the presence of two previously identified single point mutations (splice site and missense mutations), a novel exon 3 deletion (3346 bp), and a previously reported 92 kb large deletion completely abolishing the presence of ASPA and most of the adjacently located TRVP3 gene.

PATIENTS AND METHODS

Patients

Five patients from different parts of Saudi Arabia were included in this study. The patients were from unrelated families and were clinically and biochemically diagnosed with CD using standard criteria confirming this disorder. The project was approved by King Faisal Specialist Hospital and Research Centre's Internal Review Board and informed consent was given by all individuals participating in this study.

Blood collection and fibroblast culture

The collection of blood and establishment of fibroblasts from skin biopsies were as previously described.3

DNA isolation

Five milliliters of whole blood from consented patients was collected into EDTA tubes for genomic DNA isolation. The DNA was isolated from the blood using the PureGene DNA Purification Kit according to the manufacturer's instructions (Gentra Systems, Inc., Minneapolis, MN). QIAGEN′s Mini DNA isolation Kit (QIAGEN, Valencia, CA) was used to isolate DNA from cultured cells. The DNA was checked on a standard agarose gel and then quantified spectrophotometrically. Good quality DNA was stored in aliquots at −20°C until further use.

Long polymerase chain reaction

About 100–200 ng DNA was used as a template for long-polymerase chain reaction (PCR) amplification using Extensor High-Fidelity PCR Enzyme and PCR master mix (ABgene Inc., Rochester, NY). The long-PCR was performed according to the manufacturer's recommendations (ABgene Inc.). Several primer sets spanning the regions from intron 2 to 3 were designed to detect the deletion spanning exon 3 and cover the breakpoints of the deletion.

Polymerase chain reaction

DNA was amplified by PCR using intronic primers designed to amplify the coding exons of ASPA and TRPV3. We used the UCSC Genome Browser (http://genome.ucsc.edu), NCBI Entrez (http://www.ncbi.nlm.nih.gov/sites/gquery), and Ensembl database (http://www.ensembl.org) to obtain exon and intron boundaries. ExonPrimer (http://ihg.gsf.de/ihg/ExonPrimer.html) and Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3www.cgi) were used to design primers for each exon including exon/intron boundaries. PCR was performed using the DNA Engine PTC-200 (MJ Research, Watertown, MA) in a final volume of 20 μL containing approximately 10 ng of genomic DNA, 50 mM KCl, 10 mM Tris-HCl (pH 8.3), 1.5 mM MgCl2, 100 μM deoxyribonucleotide triphosphates (dNTPs), 1 unit of HotStar Taq polymerase (QIAGEN), and 50 ng of each primer. Thermocycling consisted of an initial denaturation at 95°C for 15 minutes followed by 35 cycles of PCR. Each cycle of PCR consisted of denaturation at 94°C for 60 seconds, annealing at 62–68°C for 60 seconds, and extension at 72°C for 60 seconds. A final extension step of 10 minutes at 72°C was added. All primer sequences used for PCR, long-PCR, RT-PCR, and sequencing experiments are available on request.

Mutation detection and analysis

Direct sequencing was performed on either ABI 3100 Automated DNA Sequence Analyzer (Applied Biosystems, Foster City, CA) or MegaBACE 1000 DNA Analysis System (Molecular Dynamics, Sunnyvale, CA) according to the manufacturer's recommendations. Briefly, purified PCR products or long-PCR products were used for sequencing reactions using DNA Dye Terminator Cycle Sequencing Kit (Applied Biosystems) and purified with Dye-Ex protocol before it was run on ABI 3100 Automated DNA Sequence Analyzer. For the MegaBACE 1000, sequencing reactions were performed using an Amersham ET Dye Terminator Cycle Sequencing Kit (Amersham Biosciences, Piscataway, NJ) following the manufacturer's instructions. Sequencing reactions were desalted and unincorporated nucleotides removed using ethanol precipitation and resuspended in a formamide EDTA solution for injection on a MegaBACE 1000 DNA Analysis System (Molecular Dynamics).

Sequence data were blasted against the NCBI database and aligned with the reference sequences. Sequencing analysis on all coding exons, exon/intron junctions of ASPA, and contigs assembled using Lasergene-SeqMan version 6.1 (DNA Star, Inc., WI) and ChromasPro 1.31 (Technelysium Pty Ltd., Australia) softwares. Genbank (accession no. AF376770) was used as reference sequence. Numbering commenced with the A of the ATG initiation codon as +1.

mtDNA sequencing

DNA isolated from cultured fibroblasts or whole blood was used for PCR amplification of mtDNA. The entire coding region of the mitochondrial genome was amplified in 24 separate PCRs using common cycling conditions as detailed elsewhere19 for the patients, certain family members, and controls. Each successfully amplified fragment was directly sequenced using the BigDye Terminator V3.1 Cycle Sequencing kit (Applied Biosystems), and samples were run on the ABI Prism 3100 Sequencer (Applied Biosystems).

Sequence analysis of the mitochondrial DNA coding region

Sequencing results were compared with the corrected Cambridge reference sequence.20 All fragments were sequenced in both forward and reverse directions at least twice for confirmation of any detected variant. Sequence results were compared with the MITOMAP database.20

Multiplex ligation-dependent probe amplification

The multiplex ligation-dependent probe amplification (MLPA) technique (SALSA® P025-Canavan MLPA; MRC-Holland, Amsterdam, Holland) was performed in patient samples for which mutations were not identified using standard sequencing analysis.21 Briefly, in each reaction 250–300 ng of genomic DNA was denatured and hybridized with the SALSA® probe-mix. After probe hybridization, a ligation reaction was performed using ligase-65 enzyme and subsequent products were amplified by 35 cycles of PCR; 30 seconds at 95°C, 30 seconds at 60°C, 1 minute at 72°C, and a final extension step of 20 minutes at 72°C. The PCR products were separated and analyzed using an ABI-3130xl Genetic Analyzer (Applied Biosystems). Peak heights for each fragment were visually and quantitatively compared with those of a normal control sample and deletions were suspected when the peak intensities differed by more than 30%. To ensure accurate analysis of suspected deletions of ASPA, normalization of peak areas of each amplification product was performed by normalization methods described in the manufacturer's protocol.

aCGH experiments and data analysis

We used the Human Genome CGH Microarray Kit 244A (Agilent Technologies, Santa Clara, CA) according to the manufacturer's instructions with minor modifications. Briefly, 3 μg of genomic DNA was digested overnight with AluI and RsaI followed by purification. A universal human male genomic DNA (Promega, Madison, WI) was used as reference in the hybridization. The Bioprime Labeling Kit (Invitrogen Inc., Carlsbad, CA) was used to label all of the digested patient's DNA and reference DNA separately with Cy5-dUTP and Cy3-dUTP (Perkin-Elmer, Waltham, MA), respectively. Labeled patient DNA and reference DNA were mixed, purified, and prepared for hybridization. After 38 hours of hybridization, repeated wash cycles were performed according to the manufacturer's instructions. The slides were scanned on an Agilent DNA Microarray Scanner (Agilent Technologies) and images were extracted using the manufacturer's extraction software. The generated QC report was analyzed to ensure proper hybridization and grid placement. The images and data generated by the feature extraction tool were loaded into CGH Analytics 3.4.40 software (Agilent Technologies) to allow visualization of the data. The default settings of the CGH Analytics for the identification of chromosomal aberrations were applied. Z-score algorithm with a threshold of 5.0 was chosen with the 60 pt window to see the distribution of data points and to identify chromosomal aberrations.

RNA isolation

Cultured fibroblasts were used for total RNA isolation using a QiaAmp RNA Blood Mini Kit (QIAGEN). Total RNA quality and quantity was determined by measuring absorbance spectra on a UV/Vis spectrophotometer (NanoDrop® ND-1000; Nanodrop Inc., Wilmington, DE) and further analyzed using the RNA 6000 Nano Assay and 2100 Bioanalyzer (Agilent Technologies). The undegraded, high-quality RNA was either immediately used in the experiments or stored at −80°C for further use.

ABI 1700 microarray experiments

Total RNA (2 μg) was subjected to first strand cDNA synthesis, cycled for 10 minutes at 25°C, 2 hours at 42°C, and 15 minutes at 0°C. The newly synthesized first strand cDNA was subjected to a second strand synthesis using RNAse-H and Escherichia coli DNA polymerase by incubating the cDNA at 16°C for 2 hours. Double-stranded cDNA was purified and approximately 150 μL of cDNA was used in in vitro transcription reaction. During the in vitro transcription reaction DIG-UTP was incorporated into cRNA. Labeled cRNA (10 μg) was purified, quantified, and hybridized to the Human Genome Array V.2 for 16 hours (Applied Biosystems). After washing and rinsing of the arrays, chemiluminescent enhancement was performed on the arrays. Each array was read using the ABI 1700 Chemiluminescent Microarray Analyzer (Applied Biosystems). Applied Biosystems' Expression Analysis Software was used for primary analysis. Two technical replicates were performed on each sample and controls.

Microarray data analysis and functional annotation of genes

Images were autogridded and the chemiluminescent signals were quantified, background subtracted, and finally, spot- and spatially normalized using the Applied Biosystems 1700 Chemiluminescent Microarray Analyzer software v 1.1.1. For transcriptome analysis, detection thresholds were used following the manufacturer's recommendations. Detection threshold was set as S/N >3 and quality flag <5000. Two technical replicates were performed on each sample for the controls, Patients 4 and 5, for a total of eight microarrays.

To determine gene expression differences, the open source and open development software project R together with the Bioconductor packages were used.22 Probability (P) values were adjusted for multiple comparisons using false discovery rate approach. The t test was performed if the probe showed S/N >3 in 50% or more of the samples in either group. It has been shown that for low sample sizes, using the fold change to identify differentially expressed genes has been shown to outperform other t statistic methods,23 which are due to the fact that these methods provide less reliable estimate of the variance because of the small sample size. Hence, to determine the differentially expressed genes we used a stringent criteria of absolute fold change ≥4 between the disease and normal control for the probes showing S/N >3 in 50% or more of the samples in either group. Kmeans clustering to group differentially expressed genes and hierarchical clustering samples using Eucledean distance with average linkage clustering was performed by using TIGR Multi Experiment Viewer (MeV4.0) (available at www.tm4.org).

Functional annotation and biological-term enrichment analysis was performed by using DAVID Bioinformatics Resources,24 Expression Analysis Systematic Explorer (EASE),25 and Protein ANalysis THrough Evolutionary Relationships (PANTHER) Classification System.26,27 Over-representation was defined by P <0.05.

RESULTS

Patient descriptions

Patient 1

This patient presented at 9 months of age with progressive neuroregression and macrocephaly. Currently, at the age of 3 years 5 months, he has spastic quadreperesis, central hypotonia, oropharyngeal dysplasia, macrocephaly, and mild seizures. The diagnosis of CD was established based on this clinical picture and presence of highly elevated NAA in the urine and was confirmed by molecular testing.

Patient 2

This one-year-old boy was diagnosed at birth with CD because of a family history. He had repeated aspiration pneumonias during the first year of life and showed severe psychomotor retardation and macrocephaly. From early in his life he had severe difficulties in sucking and was found to have moderate oral phase dysphagia but severe upper pharyngeal dysphagia. He received fundoplication and gastrostomy. His muscle strength in extremities was mildly decreased, i.e., 4/5 and had spasticity. The patient had seizure disorder for which he received Rivotril. The diagnosis of CD was established based on this clinical picture and presence of elevated NAA in the urine and confirmed by molecular testing.

Patient 3

This boy was diagnosed at 6 months of age with macrocephaly and severe spasticity. He was followed in the clinic for 7 years. The patient presented to our hospital at the age of 2 years with a history of progressive neuroregression of infantile onset, macrocephaly, and seizure disorder. The patient also had a history of recurrent aspiration pneumonia, dental caries, and gastroesophageal reflux for which he had fundoplication and gastrostomy tube insertion. He is severely developmentally delayed, has never vocalized, nor achieved any motor milestones and has remained at a developmental age of 6 months. He developed early and severe spasticity and showed increased deep tendon reflexes. MRI of the brain revealed mild enlargement of posterior fossa and supratentorial CSF spaces indicating diffused cerebellar and cerebral atrophy. Cerebral white matter revealed diffuse abnormal increased signal intensity sparing the internal capsules and the corpus callosum. Central areas of the cerebellar white hemispheres, globi pallidi, and the thalami were all severely involved. MR spectroscopy revealed increased N-acetyl-aspartate peak in conjunction with decreased choline and creatine peaks. Currently, the patient is 10 years old, is wheel chair bound, and has static encephalopathy with no visual fixation, spastic quadriparesis, and central hypotonia. His head circumference is 60 cm (above the 95% centile for age). He has seizure disorder and is on antiseizure medications. The diagnosis of CD was established based on this clinical picture and presence of elevated NAA in the urine and confirmed by molecular testing. A brother presumed to have the same disease died at 18 months of age.

Patient 4

At 4 months of age after normal delivery, the parents noted delayed development, failure to control head movements, no eye contact, and gross macrocephaly in the patient and his twin brother. Initially, both boys had central hypotonia and widely open anterior fontanel. By 11 months of age, both started to show fluctuating muscle tone, with periods of severe spasticity of upper and lower extremity. Neither patient developed gross or fine motor control. They both had repeated aspiration and had to receive oxygen therapy at home. The diagnosis of CD was established in both twins based on this clinical picture and the presence of highly elevated NAA in the urine and confirmed by molecular testing for only Patient 4.

Patient 5

This male patient developed neonatal seizures at 20 days of age and from then on achieved no developmental milestones. His parents were first cousins. The patient presented to our hospital at the age of 8 months with neonatal onset seizures, progressive neuroregression, and macrocephaly. The seizures were tonic with fixed gazing or blinking of the eyes. At 8 months he had no head control with severe head-lag, no traction response, no response to visual threat, could not roll over, and never smiled. He had had difficulty swallowing since 4-weeks-of age. He was treated with Carbamazepine. He started to babble at 2 years and showed some head control at 3 years of age. At 5 years of age he is wheel chair bound and macrocephalic with spastic quadriparesis and central hypotonia. There is no visual contact and flash visual-evoked potentials are absent bilaterally. The diagnosis of CD was established based on this clinical picture and presence of highly elevated NAA in the urine and confirmed by molecular testing.

Mutation analysis

Each exon and exon/intron boundary of ASPA were screened by direct sequencing. In Patient 1, a previously reported homozygous splice donor site mutation (IVS4 + 1G>T) was identified.28 Patient 2 was homozygous for the missense mutation, G27R.11 Heterozygous status of obligate and other carriers in consented members of the associated families was also confirmed.

Initially in Patients 3 and 4, we were unable to amplify the region encompassing exon 3 of ASPA using PCR. Similarly, amplification of exons 1, 2, 3, 4, 5, and 6 of the ASPA gene in Patient 5 was also not possible. To detect prospective large deletions and the associated breakpoints, we used a combination of MLPA assay, long-PCR, and aCGH. For Patients 3 and 4, a long-PCR strategy was designed in which the primers spanned from introns 2 to 3 and encompassed exon 3. Combined results from long PCR (Supplemental Fig. 1A), MLPA (Supplemental Fig. 1B), and subsequent data analysis confirmed the presence of a novel large deletion in Patients 3 and 4. Consequently, we designed PCR primers targeting different parts of introns 2 and 3 separately and performed standard PCR experiments to narrow down the breakpoints. The results of these experiments allowed us to pinpoint the precise position of breakpoints in Patients 3 and 4 (Supplemental Fig. 1C). The deletion starts in intron 2 (chr17: 3,332,550 position) and ends at intron 3 (chr17: 3,335,897 position) encompassing a 3346 bp region. The region encompassing exon 3 on 3-D structure of ASPA is highlighted on the recently published x-ray crystal structure of human ASPA (Supplemental Fig. 1D).5 The carrier status of the consented family members was also determined (Supplemental Fig. 1A).

Finally, in Patient 5, none of the six exons of ASPA could be successfully amplified using the standard PCR methodology. Recently, there have been reports detailing the presence of gross deletions for this gene and this raised the possibility of a similar gross deletion in this patient. Results of MLPA on this patient confirmed the presence of such a deletion encompassing all six exons of ASPA and beyond. Therefore, aCGH experiments using Agilent's 244K Human Genome CGH Microarray Kit were performed on this patient's DNA to detect the breakpoints. The aCGH results confirmed the presence of a large deletion and its approximate breakpoints (Supplemental Fig. 2). To further characterize the deletion, we designed primers covering the exon/intron boundaries and associated exons for the neighboring gene, TRPV3. The deletion extended to the exon 1 of TRPV3 and size of the deletion is approximately 92 kb. After performing PCRs using several intronic primers spanning different regions of intron 1, the deletion was narrowed down to the breakpoint, which was nearly 4.2 kb far away downstream the exon 1 of TRPV3. Therefore, we concluded that this large deletion is the same deletion as the recently reported large deletion in two patients,13 where the proximal deletion point was 7.16 kb upstream the ASPA start site and the distal deletion point extended to the 3′ sequence of the adjacent TRPV3 gene resulting in deletion of exons 2–17.

Genetic signature, functional annotation, and gene enrichment analysis

Because CD is a rare disease and as fibroblasts were only available from two patients, RNA samples on each patient were run on duplicate arrays (technical replicates) and two control RNAs from the control fibroblasts (reference), for a total of eight microarrays. Overall, of 33,096 probes, an average of 23,780 probes were expressed (“present”) based on the manufacturer's recommendation of signal to noise ratio exceeding 3, a value that indicates a 99.9% confidence that the signal is above background. The dataset showed high correlation within patients and control (technical replicates), and between the patients; with all correlations >0.95.

The differentially expressed genes specific to CD were obtained by overlapping genes lists, i.e., common among the differentially expressed genes for each patient compared with the normal control. Compared with normal fibroblasts (control), 1440 genes were commonly significantly modulated (1116 down-regulated [78%] and 324 up-regulated [22%]) in both CD patients (with absolute fold change ≥4). The hierarchical clustering of samples and genes is shown in Supplemental Figure 3 (the top 100 genes commonly dysregulated in both patients were chosen to provide a readable list in the figure). There were more genes with decreased expression than genes with increased expression. Therefore, we selected the genes in proportion to decreased (78%) and increased (22%) expression in CD for visualization (Supplemental Fig. 3). Samples are denoted in columns and genes are denoted in rows. The expression level of each gene across the samples is scaled to [−3, 3] interval. These mapped expression levels are depicted using a color scale as shown at the top of the figure, as such highly expressed genes are indicated in red, intermediate in black, and weakly expressed in green. From the differentially expressed genes in both CD patients, we identified 34 genes and their functional categories, including cell growth/signal transduction/neurogenesis, aspartate/glutamate transporter, apoptosis and inflammation, muscle contraction and development, and compare with the previous study performed by Surendran et al.1618 in the brain of the knockout mouse for CD (Table 1). Among others, neuropilin 1, which plays a role in axon guidance and cell survival,2931 was strongly induced in CD patients. GABA(A) receptor-associated protein (GABARAP), baculoviral IAP repeat-containing 1 (BIRC1), serpine peptidase inhibitor, clade B (ovalbumin) member 2 and 7 (SERPINB2 and SERPINB7), bradykinin receptor B2 (BDKRB2), and capping protein (actin filament) muscle Z-line, alpha1 were also found to be dysregulated in CD patients. Surendran et al.1618 also reported similar observations in CD mouse such as GABA receptor activity (GABRA6), neurogenic differentiation factor (NEUROD1), serine-type endopaptidase inhibitor (Spi2), and caspase-11 (apoptosis).

Table 1 Identification of 34 genes induced in our CD patients and comparison with CD knock-out mouse study by Surendran et al.1618

To identify the biological processes, molecular functions, and pathways activated by the disease, we used DAVID Bioinformatics Resources,24 EASE software,25 and the PANTHER Classification system.26 By using these bioinformatics tools, we were able to gain greater biological insights into functional processes activated or repressed by the disease much more efficiently and reliably than we could by using the list of differentially expressed genes.3234 We identified overrepresented gene categories among the differentially expressed genes changing >4-fold with the disease. We found that genes involved in “cell growth and/or maintenance,” “lysosome,” “cell proliferation,” “Golgi apparatus,” “metabolism,” “actin binding,” and “TGFbeta receptor signaling pathway” were significantly overrepresented (with EASE score >0.02) in differentially expressed genes (data not shown). The PANTHER Classification system determined that genes related to “Protein modification,” “Intracellular signaling cascade,” “Cell structure and motility,” “Actin binding cytoskeletal protein,” “Signal transduction,” “Mitosis,” “Nonmotor and motor actin binding proteins,” “Cell cycle,” “Kinase modulator,” “Developmental processes,” “Translation factor,” “Chaperone,” and “Apoptosis” were the most significantly overrepresented (P < 10−3).

Table 2 shows the top 10 most significantly modulated pathways identified for the 1440 differentially expressed genes that were related to inflammation mediated by chemokine and cytokine signaling pathway (P = 3.60 × 10−7), angiogenesis (P = 5.61 × 10−7), FGF signaling pathway (P = 1.83 × 10−6), apoptosis signaling pathway (P = 5.83 × 10−5), muscarinic acetylcholine receptor 2 and 4 signaling pathway (P = 1.64 × 10−4), and Wnt signaling pathway (P = 2.67 × 10−4). These analyses results not only validate our transcriptome analysis when compared with the candidate gene analysis already reported in the mouse but more importantly, shed new light into a large number of biological processes that take place in human CD.

Table 2 Top 10 significantly over-represented pathways among genes differentially expressed in CD patients

Confirmation of microarray results using real-time RT-PCR

Because the validation experiments to confirm the ABI 1700 microarray experiments using TAGMAN and SYBR Green assays were performed by different groups previously and high concordance were reported, and our results were in concordance with the previous knockout studies, we did not perform real-time RT-PCR experiments to confirm microarray studies.3537

DISCUSSION

We screened CD patients from five different families from Saudi Arabia to identify likely mutations in the ASPA gene. We identified two novel large deletions and two previously known mutations (a splice donor-site mutation IVS4 + 1G>T and a missense mutation, G27R) in the patients. We also performed global gene expression profiling on the fibroblasts of patients and control samples using ABI Human Genome Survey V.2 arrays. Compared with normal (control) samples, 1440 genes were significantly modulated (up or down) in the Canavan patients (absolute fold change [FC] ≥4). Even though the Patient 5 had the TRPV3 deletion, the expression abnormalities that are discussed were concordant in the two patients thus minimizing the possibility for TRPV3 deficiency interference. Global gene expression profiling results indicated that some genes, involved in apoptosis, muscle contraction and development, mitochondrial oxidation, inflamation and glutamate, and aspartate metabolism, were significantly dysregulated.

Patient 1 has a homozygous splice donor-site mutation, that IVS4 + 1G>T, is similar to that described by Rady et al. 28 in a Turkish family. They used the sequencing technique and showed a G>T substitution at the donor site of intron-4, which resulted in a GT → TT change in the patient.

The mutation of Patient 2 was G27R, first reported by Kaul et al.11 The mutation is located in a region of the hydrophobic core of the enzyme. G27R is likely to cause local unfolding problems due to the positioning of a massive, charged residue into the core and could conceivably cause a severe phenotype. Because the mutation is likely to result in an unstable protein, it possibly undergoes degradation in vivo.5 The effect of the mutation on the enzyme activity was tested by in vitro mutagenesis and expression of mutant cDNA in COS1 cells. G27R mutant ASPA showed a reduced residual activity (3%) compared with the native enzyme.11

The mutation found in Patients 3 and 4 is a novel deletion extending from introns 2 and 3 comprising the full exon 3 and extending 3346 bp. The presence of such a deletion is likely to result in an enzyme having affected zinc coordination, substrate binding, and local unfolding problems eventually leading to an unstable protein that will result in degradation in vivo.5

In Patient 5, none of the six exons of ASPA could be successfully amplified using the standard PCR methodology. Using MLPA and aCGH techniques, we confirmed the presence of recently reported large deletion (92 kb) in Patient 5. Previously, the deletion was found in two patients; Italian and German/British, respectively.13 The TRPV3 gene belongs to a family of nonselective cation channels. Its function is unknown; however, it has been implicated in temperature sensation and vasoregulation. This gene is in close proximity to another family member TRPV1, which is a receptor for capsaicin. The neurologically terminal patient in this study could not be tested for temperature sensation.

We compared our gene expression results found in the patients with those findings previously reported.7,1618,3841 The genetic signature studies indicated significant similarities between our study and those found in the CD mouse model. The genes related to apoptosis/antiapoptosis, cell growth, signal transduction genes, cell death, and inflammatory genes were significantly dysregulated as reported in CD mouse brain.18 In fact our analysis of biological pathways represented by the differentially expressed genes in CD patients revealed that “inflammation mediated by chemokine and cytokine signaling pathways” and “apoptosis signaling pathways” were the first and sixth most significantly over-represented pathways. Surprising findings in our study included dysregulation of the genes related to muscle development and contraction, cytoskeleton-related genes, apoptosis, and mitochondrial activity-related genes.

The genes related to apoptosis/antiapoptosis activities such as BIRC1 and HTRA3 were upregulated in the patients compared with the normals, whereas SERPINB2, TNFRSF19, SOCS3, and RHOB were significantly downregulated in CD patients. It has been reported that BIRC1 is linked to spinal muscular atrophy, which occurs in children with degeneration of the motor neurons and leads to progressive muscular weakness and atrophy.42 BIRC1 was also proposed to participate in antiapoptosis pathways together with some of the caspases such as CASP3, CASP1, and CASP9. DFFA that triggers DNA fragmentation and functions together with CASP3 was also decreased 4-fold.43,44

Surendran et al.17,18,40 suggest that the abnormal expression of some genes such as the GABA-A receptor GABRA6, and neurogenic differentiation factor may have some effect on hypotonia and muscle weakness. It was also suggested that these observations needed to be documented in humans. It is interesting to find that some of the related genes were significantly affected in Saudi CD patients. The genes possibly related to muscle weakness and muscle contraction, such as NEBL, MBNL2, SNTB1, BDKRB2, ASPH, PRKG1 and SVIL, were significantly downregulated in our patients. Some other genes found to be affected and related to muscle contraction and development and myosin activities were MYBPHL, GAL, GLMN2, GSR, MYH2, and MYH11. These observations support muscle-related functions being significantly affected. In fact, the functional category enrichment analysis of the differentially expressed genes also identified “cell structure and motility” and “actin binding cytoskeletal protein” as being significantly enriched.

Significant mRNA changes related to mitochondrial function and oxidoreductase activities were also observed. It is well established that mitochondrial morphology is different in CD. In this regard, PPRC1, a gene responsible for mitochondrial biogenesis was increased whereas NRF1, a nuclear respiratory factor, directly interacting with PPRC1 to activate mitochondrial biogenesis was downregulated. The mRNA of genes that import various compounds into mitochondria, such as MTX1, SLC25A32, SLC25A24, and MCFP, were affected in different manners.

The mRNAs of genes that directly influence oxidative phosphorylation such as SDHA, SDHAL2, Citrate synthase, FECH, and F1 subunit of ATPAF1 were significantly decreased. In a key step in mitochondrial ATP synthesis, the F1 subunit of ATPAF1 acts like a maintenance gene and is required for the assembly of beta (F(1)-beta) and alpha (F(1)-alpha) subunits into mitochondrial ATPase. These mRNA changes were paralleled by changes in the mRNA of a large number of oxidoreductases; some of which are located in the mitochondria, others being in the cytosol and endoplasmic reticulum. The list includes short chain acyl-CoA dehydrogenase, LOXL2, ZADH2, AKR1C1, GSR, and ADH1B. It is interesting to note that a subunit of Cytochrome P450 (CYP51A1), the main detoxifying enzyme, was severely decreased.

Glutamate- and aspartate-related genes such as GPRC5B, GLUL, PGCP, PCMTD1, and ASPH were greatly dysregulated suggesting that glutamate and aspartate metabolic pathways are likely to be affected in these patients. The mRNA of GPR51, GABARAPL2, and GPRC5B that are related to GABA activity were also decreased whereas similar genes such as GABRE, GPRC5C, and GABBR1 were upregulated. In fact, the “metabotropic glutamate receptor group II and group III pathways” and “glutamine glutamate conversion pathway” were among the significantly impacted pathways represented by the differentially expressed genes in CD patients.

The mRNA of clusterin was also increased around 4-fold in the Saudi CD patients; this protein binds extracellular proteins and targets them for endocytosis and degradation. However, expression of various metalloproteases such as some of the ADAM group was significantly decreased. MAP7 that promotes microtubule stability was also downregulated.

The mRNA of ATF5 essential for neuronal differentiation was increased significantly whereas that of HAND2 another morphogenesis protein was decreased considerably. The mRNAs of 68 genes related to various aspects of nucleic acid metabolism were variably altered.

Because we found critically modulated genes implicated in mitochondrial function, mtDNA sequencing was performed according to standard methods. No point mutations and deletions were detected for the following mitochondria-associated diseases: MELAS, MERRF, NARP, cardiomyopathies, LHON, Kearns-Sayre, CPEO, and Pearson Marrow/Pancreas Syndrome. Therefore, association of any of these diseases to Canavan was excluded.

In summary, we have screened Saudi CD patients from five families for putative mutations and identified four mutations including two novel large deletions. Our whole genome gene expression analysis indicated that the activities related to apoptosis, inflammation, cell death, mitochondrial process, protein metabolism, muscle development and contraction, and cytoskeleton-related activities were critically affected. Therefore, mitochondrial oxidative phosphorylation and drug detoxification should be impaired. Because we sequenced complete mtDNA and found no point mutations or deletions, the changes related to mitochondrial activities should be related to CD but not to any known mitochondrial diseases. Also, our results shed some light on questions related to the presence of muscle weakness and hypotonia in CD patients by indicating that the genes related to cell motility, muscle contraction and development, actin binding, and cytoskeletal-related activities were significantly dysregulated. Overall, these observations are in accordance with previous studies performed in a knockout mouse model. To the best of our knowledge, this is the first genome-wide gene expression study performed on CD patients' samples and should shed further light on the pathogenesis of this disease in humans.