Introduction

Ornithine transcarbamylase (OTC) deficiency (OMIM no. 311250) is the most common inherited disorder of the urea cycle and is transmitted as an X-linked trait.

The locus of the gene encoding OTC is on the short arm of the X chromosome within band Xp21.1.1 The gene spans 74 kb with an open reading frame of 1062 nucleotides distributed into 10 exons and 9 introns.2, 3 The phenotypes of females heterozygous for a mutant OTC allele vary from asymptomatic carrier state to overt, even fatal disease, depending first on the nature of the gene mutation, second on X-inactivation pattern and third on other genes and environmental factors. In contrast, in hemizygous male patients, the phenotype is determined by the nature of mutation and other yet unknown factors (other genes/environment). Such male patients most commonly develop symptoms of hyperammonemia in the neonatal period or in early infancy and their disease is often fatal.4 However, there are some male patients in whom the onset of the disease is delayed until the preschool age period5 through to adulthood.6, 7 Some affected males within the same families may remain asymptomatic for life.8 Their condition is now recognized as ‘late-onset OTC deficiency in male patients’, accounting for 30% of male patients.9 Such male patients reproduce at a fitness value of 0.49.10 Although the majority of mutations at human OTC locus are ‘private’, being observed in single families only,11 several mutations have been observed repeatedly in discrete families,12, 13, 14 mainly affecting CpG dinucleotides. Among those, the c.119G>A (p.R40H) and c.829C>T (p.R277W) mutations have been repeatedly reported in multiple ethnicities. In our previous series of Japanese families, the c.119G>A (p.R40H) mutation was encountered in a cluster.7, 10, 15 In addition, we identified another novel mutation, c.163T>G (p.Y55D) in two discrete families.10, 16 It is not known whether or not these mutations share a common ancestral origin or have arisen recurrently.

Polymorphic sites in the human OTC gene having potential for family tracking were reported previously. These include single nucleotide substitutions, insertions and short tandem repeats.9, 17, 18, 19, 20, 21, 22 After completion of the HapMap Project, single nucleotide polymorphisms (SNPs) have become available for haplotype analysis on a given gene.

However, SNP-based haplotype analysis of the human OTC locus has not been reported previously. We aimed to determine the haplotypes of these mutant OTC alleles to explore the origins of these mutations.

Materials and methods

Families

A total of 14 families with the c.119G>A (p.R40H) mutation, 5 families with the c.829C>T (p.R277W) mutation and 2 families with the c.163T>G (p.Y55D) mutation were studied. The 14 families with the p.R40H mutation consisted of 9 families from Japan, 1 family each from Spain, Australia and Austria, and 2 families from the United States. The families with p.R277W included 1 family from Japan, 1 family from Korea and 3 families from US. Two families with p.Y55D were both Japanese. The probands were all male, except the proband in family 11, a 13-year-old symptomatic girl who carried the p.R40H mutation, as did her asymptomatic mother. Families 1–9, 15, 20 and 21 were Japanese; family 16, Korean; and families 10–12, Caucasians.

None of the families in the present series were known to be related to each other. Demographic information, including ethnic background, of the US families (13, 14, 17–19) was not available because of the US Personal Data Protection Act. Haplotype analysis was performed on the proband from each family and their relatives, when specimens were available from them. In family 11, the DNA specimens from parents of the proband were also analyzed to determine the mutation-bearing allele. In family 12, the DNA specimen of the proband was prepared from the liver tissue obtained after it had been inadvertently transplanted to a woman.23

Haplotype analysis

A total of 9 tagged SNPs were selected on the Haploview24 with r2 of 0.80 and minor-allele frequency of 0.05 (Table 1). The haplotype frequencies of Japanese in Tokyo (JPT), Utah residents with Northern and Western European ancestry from CEPH collection (CEU) and Yoruban in Ibadan, Nigeria (YRI) were available on http://hapmap.ncbi.nlm.nih.gov/index.html.en. The nucleotide combinations in Pan troglodytes (chimpanzee) were obtained on http://www.ensembl.org/index.html to estimate the human ancestral alleles. The nucleotide in the polymorphic site 6 (rs5963421 for human) for P. troglodytes was represented by that in Pongo pygmaeus abelii (orangutan), because this nucleotide was not available for P. troglodytes.

Table 1 Haplotypes generated by single-nucleotide polymorphism in human ornithine transcarbamylase locus

The SNPs were determined by the TaqMan probe-based real-time PCR on LightCycler LC-480 (Roche Diagnostics GmbH, Mannheim, Germany). The PCR was carried out in a total volume of 20 μl, containing 20 ng genomic DNA, 10 μl LC-480 Probe Master (Roche) and 1 μl 20 × Probe/Primer Mix (Applied Biosystems, LLC, Foster City, CA, USA). The probes used were labeled as either FAM or VIC. The temperature was programmed as follows: a pre-incubation step at 95 °C for 10 min, followed by 40 cycles of denaturation at 95 °C for 10 s, annealing at 60 °C for 60 s and extension at 72 °C for 1 s. At the end of the annealing step, the fluorescence signal was measured. After the PCR reaction, the temperature was decreased to 40 °C as a cooling step.

Ethical considerations

The Ethical Committee of Kurume University approved this project and specimens were obtained in accordance with respective institutional bioethical standards and relevant bioethical regulations or guidelines in each country.

Results

Haplotypes defined in the families and their allelic frequency and heterozygosity

The data on the nine SNPs employed and the haplotypes determined in the families by the use of these SNPs are summarized in Table 1. Six discrete haplotypes (HT's) (1–6) were found among the probands. The parents of the female proband in family 11 carried two additional HT's, 7 and 8.

Association of the three mutant alleles and particular haplotypes

The results of the haplotype analysis of the 21 families are summarized in Table 2. Among the nine Japanese probands carrying the p.R40H mutation, those belonging to families 1–8, who resided in an area within a radius of 140 km, all had HT1, whereas family 9, who lived 650 km away from that area, had HT2. One Spanish family and one Australian family had HT3. In this Australian family, the proband (a girl) bore HT3/8; her mother, 3/7; and father, HT8. Previous analysis had shown that mother carried the mutant allele. Therefore, it was determined that the mutation was linked with the HT3 allele. The Austrian family and one US family shared HT4. The proband in the remaining one US family had HT2. Thus, in the 14 families carrying the p.R40H mutation, this mutation was associated with four different HT's, indicating that it had appeared anew at least four times in the Japanese and the Caucasian populations. Similarly, the p.R277W mutation was associated with three different HT's in our five families carrying this mutation. Three families that carried p.R277W, one Japanese, Korean and US family, shared HT5 each. One of the remaining two US families carried HT6 and the other, HT2. The two Japanese families that carried the p.Y55D mutation shared HT1.

Table 2 Haplotypes of mutant alleles carrying the three mutations

Evolutional order of the haplotypes

To estimate the evolutional order of these haplotypes, nucleotides in each polymorphic site were compared with those in P. troglodytes (Table 1). HT8 differed only in polymorphic site 7 (T-to-G) from that of P. troglodytes. HT1, HT5, HT6 and HT7 exhibited nucleotide changes in two polymorphic sites, whereas in HT2, HT3 and HT4, there were nucleotide changes in three polymorphic sites.

Discussion

Although the majority of mutations at the OTC locus are ‘private’ in patients with neonatal and infantile presentation,11 some mutations have been found to recur in unrelated families.12 Among mutations associated with late-onset OTCD in male patients, the two mutations, p.R40H and p.R277W, have been most frequently reported in multiple different families.7, 12, 13, 14, 15, 25 The p.Y55D mutation has been found only in two unrelated Japanese families.10, 16 It was not known whether or not the recurring mutations shared a common ancestral origin or had arisen independently. The present study suggests that the p.R40H mutation occurred at least four times, or even five times, if the family 14 is non-Japanese American, in the Japanese and in the Caucasian populations. It appears very likely that those eight Japanese families (1–8) share a common ancestral origin and the mutant allele had been retained in the population in this small area. The other Japanese family (family 9) had a distinct haplotype, however, suggesting this mutation had arisen recurrently. Families 10 and 11, both Caucasian, may also have a common ancestral origin. Although the Australian family has no known Spanish ancestry, the proband's maternal grandfather (not studied) was of North Italian descent and could conceivably share ancestry with the Spanish family. It remains possible that the mutation occurred recurrently, but it is noteworthy that the allele in these families has a low haplotype frequency (0.033) among Caucasians. Families 12 and 13 share a haplotype, but there is insufficient information to say whether this represents the effect of common ancestry or of recurrence on the same haplotype. HT4 is the most common in Caucasians (allele frequency 0.356 in the CEPH Caucasian samples) and there is no information about the ethnicity of family 13.

Among the families with the p.R277W mutation, the Japanese and the Korean families (families 15 and 16) could have stemmed from a common ancestor. Family 18 bore HT6, confirming recurrent origin of this mutation.

The families that harbored the p.Y55D mutation (families 20 and 21) shared HT1. This mutation has been found among the Japanese population alone to date.10, 16 In contrast to the p.R40H and p.R277W mutations, which involve CpG dinucleotides and hence are likely to recur, this transversion (T-to-G) is not common among single base substitutions in X chromosome genes in general26 and in the OTC gene in particular.12, 14, 27 This characteristic of the p.Y55D mutation would again support that the affected families share a common ancestral origin.

The search for polymorphic markers and the determination of their allelic frequency is required for exploring the origin of an allele. As expected, different types of polymorphic markers in the OTC gene or in its vicinity are known and include SNPs in the coding region,9, 17, 20 in the 5′ untranslated region21 and extragenic microsatellite markers.18, 19, 22 Four SNPs, A-to-G in codon 46 in exon 2 (E2 46), A-to-T in intron 3 (IVS3-8nt), A-to-G in intron 4 (IVS4-7nt) and A-to-G in codon 270 of exon 8 (E8 270), were found to be informative among a US population,17 though their informativity varied somewhat in the Iberian population and in the Mozambiquan population.18 One additional SNP (IVS3-39_insT) was informative in a Spanish population,20 indicating that informativity of these SNPs varied between populations. The nine tagged SNPs we used in the present study were all highly informative, except rs5963428, which presented 0.095 heterozygosity in JPT (Table 1). These tagged SNPs were all informative in CEU and YRI also, with the exception of rs12557315 in YRI. We thus considered it appropriate to apply these tagged SNPs to define haplotypes in other ethnicities also. There are several advantages of using these tagged SNPs in determining haplotypes. First, these SNPs generally show high heterozygosity frequency, allowing high informativity. Second, the tagged SNPs are located over the entire span of the OTC gene, reducing misdiagnosis due to an extragenic recombination event when diagnosis by linkage analysis is necessary in the absence of an identifiable mutation. Third, it permits easy differentiation between intragenic recombination and mutation of one particular marker single nucleotide (both rare events). Finally, the analytical efficiency of this technique is high as amplification and data acquisition of up to seven specimens can be completed simultaneously within 90 min under the conditions employed.

Both the p.R40H and p.R277W mutations arise in CpG dinucleotides, which represent mutational hot spots, consistent with our finding that they have recurred in different populations. Recurrent point mutations occur evenly among most CpG dinucleotides in the OTC gene.9, 12, 14, 27 These two mutations arise in arginine codons. It is not surprising that these single nucleotide changes have a higher chance to recur than others, because four of six codons that encode for arginine contain CpG dinucleotides. Indeed, scrutiny of a recent OTC mutation update9 reveals that 20 discrete mutations have occurred in a total of 15 arginine codons (ratio: 20/15=1.33) in the human OTC gene, whereas this ratio is less than 1.0 in the vast majority of codons encoding for other amino acids.

Haplotype 8 differed by one nucleotide from that in P. troglodytes, whereas the other seven HT's exhibited two or three nucleotide differences. This possibly indicates that HT8 is the oldest among the eight HT's identified in the present series. HT's 3 and 4 shared segments consisting of identical nucleotides (polymorphic sites 5–9) and HT's 4 and 7 had jointly another identical segment (polymorphic sites 1–3) (Table 1). It is thus possible that HT3 and HT7 were generated by a recombination of the HT4 allele with another allele, or conversely HT4 was generated by a recombination of HT3 with HT7.