Introduction

In the Giemsa-dark-staining Xq21 band, numerous large male-viable deletions have been described which encompass up to 15 megabases (Mb) of DNA, or 8–10% of the estimated size of the human X chromosome [1, 2]. These deletions often give rise to contiguous gene syndromes including choroideremia (CHM), X-linked deafness (DFN3) and nonspecific mental retardation (MR) [3–7]. Sizeable deletions have also been found in patients with nonsyndromic CHM or DFN3 [4–6, 8, 9].

Employing positional cloning strategies, the genes underlying CHM and DFN3 have been isolated. Analysis of patients with Xq21 deletions and syndromic and nonsyndromic forms of MR has enabled us to map the gene for X-linked MR to a region between CHM and DFN3 [4, 7, 10–13].

As a prerequisite for the isolation of this and other genes in Xq21, we have set out to generate a ladder of overlapping yeast artificial chromosomes (YACs) spanning the entire Xq21 region. To this end, we first employed numerous X;autosome translocations and deletions to subdivide the Xq21 band into 24 different intervals [8, 14; Philippe, unpubl. results]. Moreover, we have previously described a 850-kb YAC and cosmid contig encompassing the DFN3 locus and a 350-kb YAC encompassing the CHM gene [9,12,13]. Here, we report on three YAC contigs spanning approximately 7 Mb of the Xq13.3-q21.31 region and on the localization of a novel deletion associated with CHM and MR.

Materials and Methods

Patients

All patients except patient AP have been described elsewhere and are listed in table 1.

Table 1 Patients included in this study

YAC Clones

YAC clones were initially identified by PCR amplification of YAC DNA pools or by hybridization of probes to filters containing gridded YAC DNA. YACs were isolated from previously described CEPH [15] and ICRF YAC libraries [16]. All relevant markers used in this study are listed in table 2.

Table 2 Markers used in this study

Positive YAC clones were plated on NZYM agar [17], and three yeast clones each were picked and cultured as described elsewhere [18]. Yeast cells were embedded in low-melting agarose, incubated for 2 h in the presence of zymolyase (Seikagu) and incubated overnight with pronase (Boehringer Mannheim). Yeast chromosomes were separated using pulsed-field gel electrophoresis (PFGE) and blotted to a GeneScreen Plus membrane (NEN Dupont). YAC blots were hybridized with 32P-labelled total human DNA to estimate the size of the human insert. Subsequently, the identity of the YACs was checked with the marker originally used for their isolation. YACs were further characterized by screening of adjacent markers and through the generation of left- and right-end fragments by ligation-mediated PCR (LM-PCR; see below).

In addition, we screened the CEPH YAC data base with new DNA markers from this region and for YAC clones which overlap with previously mapped YACs.

Southern Blot Analysis

Hybridization of 32P-labelled DNA fragments to nylon filters containing DNA from patients or from YAC clones was essentially done as described elsewhere [9].

LM-PCR and Sequence Analysis

The LM-PCR method was employed essentially as described by Kere et al. [19]. Yeast DNA was digested with AluI, EcoRV, RsaI or PvuII prior to ligation of adaptors. LM-PCR end fragment products of the YAC clones were separated on an agarose gel and purified with the Qiaquick gel extraction kit (Quiagen). Purified DNA was sequenced using fluorescent dideoxynucleotides on an Applied Biosystems 373A DNA sequencer. Sequencing reactions were done with a Taq DyeDeoxy Terminator Cycle Sequencing Kit (Applied Biosystems) according to the manufacturer’s instructions. The oligonucleotides of end fragment 753e10EV/L (ATTTATTCTTCCAGGCCCATA and CGGAATTCTGAAAAAGAGATC) were designed (Isogen Bioscience, The Netherlands) to generate a product of 94 bp. For end fragment 5045H5, the oligonucleotides TGGATAAAGTAAAAAGCACACAAG and TGGTAGTTGTCTCATAGCTCTTG amplify a product of 260 bp. Finally, a 152-bp PCR product (A0822EV/L) can be amplified from the left end of YAC A0822 with the oligonucleotides CCTAATTTGGGAAGAACATATC and AATTTAAGAGAGAAATTATTCATATT.

Results

Construction of a YAC Contig

Several markers from the Xq21 region were employed to screen the ICRF and CEPH YAC libraries [15, 16]. All YAC clones were extensively screened for the presence of X-linked markers and for the absence of autosomal sequences. In this way, a YAC map was generated consisting of three nonoverlapping contigs encompassing the Xq13.3-q21.31 region. From these contigs, YAC 4893 and YAC 5045 have been published elsewhere [9]. To establish novel sequence-tagged sites (STSs) and to check the integrity of the YACs, we employed the LM-PCR technique [19]. LM-PCR products were hybridized to adjacent YACs and to DNAs of a large number of patients with deletions in Xq21 to localize and orient the YACs (fig. 1). End fragments mapping to the Xq21 region were sequenced, and oligonucleotides were designed for PCR amplification to generate new STSs.

Fig. 1
figure 1

Physical map of the Xq13.3-q21.3 region. On the upper horizontal bar, the previously and newly mapped DNA loci are indicated. The YACs are represented by solid bars for X chromosomal sequences and wavy bars for autosomal sequences. Vertical bars indicate the presence of the markers on the YACs and in the DNA of patients with deletions. Open circles are right-end fragments while solid circles indicate left-end fragments of the YACs. Solid and open diamonds represent the outermost X-chromosomal cosmid clones derived from their respective YACs used for direct-visual-hybridization analysis. The identification numbers of the YACs are depicted adjacent to the YACs, while their estimated sizes are depicted underneath in parentheses. The gaps between the three YAC contigs are indicated with double arrowheads. The deletions found in several patients used in this study are depicted at the bottom of the figure. The solid bar represents the chromosomal segments present in the patients. The molecular characterization of all patients has been reported elsewhere except for patient AP (indicated with an arrow), who suffers from MR and CHM. The mapping results of the YAC end fragments are only indicated for overlapping YACs. For purposes of clarity, the region between POU3F4 and DXS233 is not true to scale but expanded twofold. At the bottom, the intervals from this region as defined by Philippe et al. [14; unpubl. data] are depicted.

The middle and most distal YAC contigs span approximately 3 and 2.5 Mb, respectively. Although the size of the most proximal contig has not been determined precisely, we estimate that together, the resulting YAC contigs encompass a chromosomal segment of approximately 7 Mb, extending from PGK1 proximally to DXS1002 distally with a gap between DXS986 and DXS738 and between end fragment 5045H5 and the DXS232 locus. The most proximal contig is linked to a previously published YAC contig spanning the PGK1 locus [20, 21]. The most distal contig overlaps a previously reported YAC contig in the Xq21.3 region [22].

Identification and Fine Mapping of a Deletion Associated with CHM and MR

Patient AP is a 40-year-old mentally retarded man with CHM. Ophthalmologic findings suggest that two of his sisters (II-1 and II-4) and his mother (I-2) are carriers of CHM. Apart from a microdeletion in the Xq21 band (see below) patient AP carries a balanced translocation 46,XY,t(10;15) (q26.3;q24.3), which is also present in his mentally normal sister (II-4) and father, and apparently is not associated with a clinical phenotype (fig. 2). II-1 and II-2 have not been investigated for the presence of the translocation.

Fig. 2
figure 2

Pedigree of kindred with CHM and MR. Patient AP (II-3) was ascertained via his sister (II-4) who was prenatally counselled. Two of his sisters (II-1 and II-4) and his mother (I-2) are carriers of the deletion. AP, his father (I-1) and one of his sisters (II-4) also carry a balanced translocation t(10;15)(q26.3;q24.3). II-1 and II-2 have not been tested for chromosomal abnormalities.

Patient AP was investigated for the presence of a deletion by PCR amplification of several exons from the CHM gene. All exons tested were absent, which prompted us to investigate the extent of the deletion in greater detail. On the proximal side, the markers DXS26 and DXS995 were found to be present in patient AP, while DXS232 and DXS121 were absent. Also, the POU3F4 gene, previously shown to be involved in DFN3, appeared to be present in this patient [13]. These findings located the proximal breakpoint of patient AP between the previously defined intervals 7 and 8 of the Xq21 band (fig. 1) [14; Philippe, unpubl. data]. Subsequently, several cosmids from a previously published 850-kb cosmid contig [9] were hybridized to precisely map the proximal end-point of this deletion. The breakpoint is located in cosmid 5045H3, which detects a novel restriction fragment carrying the deletion junction (fig. 3). A cosmid located proximal to this breakpoint (cosmid 5045A6) is entirely-present in patient AP, while the first cosmid (5045E7) distal to cosmid 5045H3 is completely absent. These results confirm the initial localization of the breakpoint in cosmid 5045H3 (fig. 3). It is noteworthy that the distal breakpoint of another deletion, detected previously in a patient with DFN3 (TD) [9], is also located in cosmid 5045H3, approximately 10 kb from the proximal deletion breakpoint in patient AP (data not shown).

Fig. 3
figure 3

Mapping of the proximal deletion breakpoint in patient AP. Hybridization of the cosmids 5045E7 (a), 5045H3 (b) and 5045A6 (c) to a Southern blot of EcoRI-digested DNA of patients with deletions and a female control. In patient AP, cosmid 5045H3 detects a novel restriction fragment indicative of the proximal breakpoint of the deletion.

Several markers from the Xq21.2-q21.31 region, i.e. DXS1002, DXS95, DXS262, DXS110 and DXS472, appeared to be absent in patient AP, thus the distal breakpoint of the deletion in patient AP could be assigned to the Xq21.32-q21.33 segment. To localize this breakpoint more precisely, several markers from a previously defined interval (interval 20) [14; Philippe, unpubl. data] were employed. PCR analysis revealed that DXS3 is absent, while DXS1170 and DXS990 are present in the DNA of AP. Conventional Southern blot analysis showed that the DXS112 locus is also located proximal to this deletion breakpoint (data not shown). Therefore, the distal breakpoint of the AP deletion subdivides this interval [Philippe, unpubl. data] in two distinct intervals: 20A and 20B. The loci DXS3 and DXS112 are located in interval 20A, whereas interval 20B harbors DXS1170and DXS990.

Mapping of New Markers with Respect to Known Markers

These data and the previously established deletion interval map [14; Philippe et al., unpubl. data] have enabled us to define the physical order for several markers. Four markers have been mapped to interval 2 based on their presence in patient XL62 and in C56N, a human-rodent cell hybrid containing the Xq21.1-qter segment [Philippe et al., unpubl. data]. DXS1225 is present on YAC 910h1 and absent from YAC 752e10. Therefore, this marker is located proximal to DXSH97 and DXS986 which are both present on YAC 753e10. Since DXS1197 is present on both above-mentioned YACs, the order must be Xcen-DXS1225-DXS1197-DXS986-Xqter. DXS346 also maps to interval 2 and is absent in both YACs. Since the left-end fragment of YAC 753e10 (753e10EV/L) is also located in this interval (data not shown), DXS346 can be placed distal to this end fragment.

DXS738 is located in interval 3, based on its presence in patient 1/10 and absence from patient XL62 (fig. 1) [Philippe et al., unpubl. data]. DXS738 is absent from both YAC 753e10 and 321f8, which positions it proximal to DXS72. Since the end fragment A0822EV/L of YAC A0822 is also present in YAC 321f8 while DXS72 is absent from YAC A0822, the end fragment is positioned proximal from DXS72 in interval 3. By combining these results with previous mapping data [14; Philippe et al., unpubl. data] we propose the following order: Xcen-PGK1-DXS566-DXS1225-DXS1197-DXS896-753e10EV/L-DXS346 — DXS738 — A0822EV/L — DXS72 — DXS169-Xqter.

A third relevant end fragment, 5045H5, maps distal to POU3F4 and proximal to DXS232. This end fragment is located ouside the deletions found in patients TD and LGL2905, which assigns this marker to interval 8. 753e10EV/L, A0822EV/Land 5045H5 were sequenced and oligonucleotides were designed for PCR amplification. In this way, three new STSs could be generated (see Materials and Methods).

More distally, DXS1167 bould be placed between DXS326 and DXS165 because of its localization on YAC 949e11 and its absence from YAC 4887. This marker has been mapped previously to interval 10 [14; Philippe et al., unpubl. data] (fig. 1). DXS1209 and ZNF6, a gene containing a zinc finger motif, which has been mapped to Xq21 [23] could be localized in the same way. DXS95, DXS349, DXS364 and DXS1168 could be placed distal to DXS1002 on the basis of their absence from YAC 949e11. A comprehensive overview of the Xq21 region, including the YAC contigs, all above-mentioned markers and the relevant deletions is given in figure 1.

Discussion

In this study we have established three YAC contigs encompassing the Xq13.3-q21.31 region except for a gap between DXS986 and DXS738, and a gap between POU3F4 and DXS232. To estimate the size of the latter gap, cosmids from the proximal segment of YAC 4886 and the most distal cosmid of YAC 5045 were employed as probes for direct-visual-hybridization analyses [24] (fig. 1). These experiments revealed that the size of the gap does not exceed 100 kb [Merkx and van der Maarel, unpubl. data]. We are currently screening for cosmids from the ICRF Cosmid Reference Library [25] and Chromosome X Cosmid Library LL0XNC01 ‘U’ to fill this gap. Despite extensive screening of the ICRF and CEPH YAC libraries [15, 16], we have not yet been able to fill the remaining two gaps. The generation of the end clone 753e10EV/L will hopefully lead to the identification of new YACs between DXS986 and DXS738.

We confirmed the localization of 19 DNA markers and 4 genes from the Xq13.3-Xq21.31 region and were able to map them in the correct order. In addition, three STS clones were generated and positioned in the contigs.

We assume that the DXS566 locus maps to an internal deletion in YAC 910h1, because in this YAC, the respective primer pair has failed to yield a PCR product. We cannot rule out the possibility that DXS346 also maps to this deletion interval. If so, DXS346 should be located between DXS566 and DXS1225.

Many of the microscopically detectable deletions in the Xq21 region are associated with DFN3, MR and CHM [2, 6, 7, 26, 27]. The identification of submicroscopic deletions associated with either CHM [28] or DFN3 [6,9] has enabled the subsequent cloning of the relevant genes [10–13]. Characterization of these deletions has revealed that the physical order of these disease genes is: Xcen-DFN3-MR-CHM-qter. Some patients with deletions in Xq21 show clinical features which are not observed in others and which do not seem to fit the contiguous gene syndrome model. The most striking additional features are obesity [29], hypogonadism [6] and cleft lip and palate [30]. The presence of a gene in Xq21 involved in palate closure is also suggested by linkage studies in three families with X-linked cleft palate and/or ankyloglossia (CPX). The critical region for CPX was demarcated proximally by DXS1002 and distally by DXYS1 [Stanier, pers. commun.; 31–35]. In view of the large number of deletions known in Xq21, it is striking that thus far no small deletions have been found with nonspecific mental retardation or with a combination of MR with either DFN3 or CHM.

Here, we are the first to report on a deletion associated with MR and CHM. The deletion encompasses the CHM gene and extends into the chromosomal region between the DFN3 and CHM genes confirming the initial localization of a MR locus in Xq21.1. Since the deletion in AP encompasses a large segment of Xq21, one could speculate that the deletion disrupts a MR gene in Xq21.33, distal to the CHM gene. There are two arguments against this hypothesis. First, the MR gene would be located distal to several deletions associated with a complex phenotype including MR, e.g. XL45, SD, and DM. Second, two patients with nonsyndromic CHM have been described [8], in whom deletions extend into or beyond interval 20 (patient LUN3 and LUN1). Together, these data strongly suggest that a locus for XLMR is situated in the chromosomal region defined by intervals 8, 9 and 10. At the proximal side, this region is demarcated by the deletions found in DFN3 patients II/7, 1/10, and TD [9]. The proximal deletion breakpoint in AP does not significantly narrow down the localization of the MR locus, since it is located only 10 kb from the distal deletion breakpoint in TD. At the distal side, deletions associated with CHM in patients C759, 3.5, 25.6, MS, and 7.6 [8] commence between ZNF6 and DXS165 and extend telomerically. The deletion in patient LGL2905 who has CHM but no MR, restricts the critical region for MR to interval 8.

It is remarkable that apart from DFN3, patient TD shows hypogonadism and mild MR. Five affected family members are not mentally retarded but show marked antisocial and immature behavior and possibly mild learning deficits [36]. Since the deletion in patient TD is close to the MR locus [7], it could affect the proper transcription of the MR gene. This might explain the complex clinical findings in TD and his family.

So far, we found no evidence for microdeletions in the critical region for MR (interval 8) in 40 patients with XLMR [van der Maarel, unpubl. results], whereas in patients with CHM or DFN3, deletions were detected in approximately 25% of cases [8, 9; de Kok and Cremers, unpubl. results]. It is very likely that the main reason for this discrepancy is the genetic heterogeneity of XLMR [37]. Indeed, the Xq21.1 locus may only be involved in a small proportion of cases with XLMR. Thus far, several syndromic forms of MR, like Allan-Herndon-Dudley syndrome, Juberg-Marsidi, and α-thalassemia/MR (ATR-X) have been mapped to Xq13-q21 [38–43], but recent linkage studies of our group in families with nonspecific XLMR point to a clustering in the Xp11 region [44]. Recently, mutations in a putative global transcriptional regulator (XH2) have been shown to be causative for ATR-X [45].

Definite proof for the existence of a gene for MR in Xq21.1 can only be obtained by cloning of candidate genes from the relevant region and identification of mutations in syndromic or nonsyndromic cases of XLMR. These studies will be greatly facilitated by the Xq13.3-q21.31 YAC contigs reported here.