Introduction

Glaucoma is a complex of ocular disease entities and is the second most common cause of blindness in industrialized countries [1]. The most frequently encountered form of the disease is primary open-angle glaucoma (POAG; MIM 137760 [McKusick 1994, pp 559–560]). POAG is characterized by irreversible lesions of the optic nerve, a normal-appearing morphology of the anterior chamber of the eye and is usually associated with increased intraocular pressures. POAG has been subclassified according to age at onset and clinical progression as (1) juvenile open-angle glaucoma (JOAG), which appears between the age of 3 and early adulthood and manifests an aggressive course, and (2) chronic open-angle glaucoma (COAG), which is distinguished by its mid- to late-age onset and a slow, insidious progression. A dominant JOAG locus was first described on chromosome 1q21–q31 by Sheffield et al. [2]. This locus is now known as GLC1A. More recently, it was shown that both JOAG and COAG were found to be linked to the GLC1A locus within a large French-Canadian family. In this pedigree, ages at onset were distributed over several decades displaying a clinical continuum and all patients, either JOAG or COAG, shared a common ‘disease’ haplotype encompassing GLC1A [3]. As COAG is a complex non-mendelian disorder, COAG patients within this pedigree may show variable expression of the JOAG phenotype, and/or expression of the disease may be altered by modifier genes. Furthermore, evidence for genetic heterogeneity for JOAG was observed in a large Swedish family which did not show linkage to 1q [4]. To date, the most refined localization of the GLC1A putative gene is restricted to an approximately 9-cM interval between loci D1S445 and D1S416/D1S480, in 1q23–q25 [3]. The physiological and molecular bases of POAG have not yet been defined. A better understanding of this disease requires identification of the causative gene(s). We describe here a yeast artificial chromosome (YAC) contig, spanning the 8- to 10-cM POAG interval flanked by D1S445 and D1S416/D1S480. After the completion of our work, a YAC contig spanning the same locus was released by the Whitehead Institute/MIT Genome Research Center (WI/MIT), Human Genetic Mapping Project, Release 9, December 1995. However, the contig we report also includes a number of additional polymorphic and nonpolymorphic STSs as well as several YACs not previously described. Most of the YACs are now characterised by fluorescence in situ hybridization (FISH) and/or their insertion ends.

Materials and Methods

YAC Library Screening

All YACs were isolated by PCR screening [6] of the CEPH megabase size insert library [7, 8]. All PCR markers are shown in table 1.

Table 1a STSs used for the contig
Table 1b STSs generated during this study and not assigned to the GLC1A locus

PCR was performed in a 25-µl reaction volume, containing 10 mM Tris-HCl, pH 8.8, 50 mM KCl, 1.5 mM MgCl2, 0.1% Triton X-100, 200 µM of each dNTP (Pharmacia), 0.5 µM of each primer (synthesized by Genset, France) and 1 unit of Taq DNA polymerase (Cetus). For amplification, 5 ng of YAC DNA were denatured for 5 min at 94°C before adding the Taq polymerase, and the mixture was subjected to 30 cycles as follows: 40 s at 94°C, 30 s annealing at the indicated temperatures (table 1) and 30 s at 72°C. Positive control amplifications were routinely performed on 20 ng human genomic DNA and negative controls on AB1380 yeast DNA and no DNA. Amplification products were analysed on ethidium bromide-stained 2 or 4% agarose gels, depending on their size.

Characterization of YAC Ends

YAC ends were isolated either by direct sequencing of total reactions of inter-Alu-vector PCR [Clépet et al., in preparation], or by a modification of the inverse PCR (IPCR) procedure described by Abderrahim et al. [9]. For IPCR, 200–300 ng of YAC DNA prepared in microtiter dishes [6] were independently digested with 2 units of EcoRV, HaeIII, HhaI, NlaIV, Sau3A, MspI, or TaqI, in 40-µl volumes for 7 h. Heat-sensitive enzymes were then denatured for 20 min at 70°C. After ethanol precipitation, DNA pellets were washed, dried at room temperature and carefully resuspended in 40 µl of H2O.10 µl of digested DNA was denatured 10 min at 65°C and kept on ice before circularization. Ligation was performed in a 40-µl final volume with 0.5 U ligase (Promega) overnight at 4°C, and heat stopped for 15 min at 72°C. To characterize the left end of a YAC insert, EcoRV, HaeIII, NlaIV, Sau3A, MspI and TaqI circularization products were PCR amplified with primers 5U (5′GTTGGTTTAAGGCGCAAGACTT3′) and 5R (5′TGAAGAAAGAGTATACTACATAACA3′). To amplify the right end of a YAC, HaeIII, HhaI and NlaIV self-ligated DNAs were used with primers 3U (5′CCCGATCTCAAGATTACGGAAT3′) and 3R (5′CATTCACTTCCCAGACTTGCAA3′). PCRs were performed on 10 µl of ligation product, in a 45-µl final volume, with 3 units of Taq DNA polymerase, and the same buffer conditions as described above. In order to improve specificity, hot-start and touch-down PCR were performed: 94°C 5 min denaturation before adding the Taq polymerase; then 3 cycles: 94°C 40 s, 61°C 30 s, 72°C 60 s; followed by 30 cycles 94°C 40 s, 58°C (for 5R/5U) or 59°C (for 3R/3U) 30 s, 72°C 60 s. Final extension was performed at 72°C for 2 min. 10 µl of the PCR reaction was controlled on 2% agarose gel, the rest was filtered through Microcon devices (Amicon Inc., USA) and subjected to dye-dideoxy cycle sequencing with same primers as for PCR, at their respective annealing temperature (using the ABI sequencing system; Foster City, Calif., USA). PCR primers were designed with the Oligo 4.0 program [10]. Sequences were screened for repeated elements using the FASTA program [11] and a local database containing a compilation of known human repeat sequences. Chromosomal assignments of YAC end sequence-tagged sites (STSs) were determined by PCR assay on the somatic cell hybrid panel 2 and/or panel 1 [12], purchased from the National Institute of General Medical Sciences (Camden, Pa., USA).

Isolation of Other STSs

Isolation of microsatellite markers AFMGLC20, 21, 22, and AFMGLC23 from YAC 792d9 is described in Morissette et al. [in preparation]. Plasmid pHBI40 (D1S66) was purchased from ATCC. This RFLP probe was partially sequenced in order to generate an STS (STS-HBI40) and make its use more convenient.

Pulsed-Field Gel Electrophoresis and Southern Blotting

Pulsed-field gel electrophoresis (PFGE) was performed in 1% SeaKem agarose gels (Tebu, France), with a CHEF DRIII or a CHEF Mapper apparatus (Bio-Rad, USA). Standard conditions used for analysis of YACs were as follows: 22 h, linear ramp from 60 to 120 s, 6 V/cm, 14°C, TBE 0.5 ×. Gels were depurinated and transfered onto Hybond N+ membranes (Amersham, UK) for 2 h using a vacuum blotter (Appligene, France). Total human DNA was labeled with horseradish peroxidase (using the ECL labeling kit RPN 3001, Amersham, UK) and hybridized at 10 ng/ml final concentration to PFGE filters according to the ECL protocol.

Fluorescent in situ Hybridization

FISH experiments were performed according to Driesen et al. [13] and Dauwerse et al. [14].

Results

From the most recent results [3], the genetic localization of GLC1A is as follows: (D1S196-D1S431, D1S445)-{GLC1A, D1S433-(D1S2851, D1S452, D1S210)-D1S2634-(D1S2814, D1S242, D1S218)}-(D1S416, D1S480). GLC1A is thus located in a region of 8–10 cM which could be subdivided into subintervals with no markers (fig. 1). Subsequently, 10 additional AFM micro-satellite markers [15] were added to this map. During the course of this study, some of the data released on public electronic servers by various genome centers could either help, confirm, or conflict with our own data.

Fig. 1
figure 1

Genetic map of the GLC1A locus. a From Morissette et al. [3]; the gray box shows the boundaries of the GLC1A interval. b Genetic map as in Dib et al. [15]. (*) The position of these markers conflict with physical data (table 3) but their order was determined with odds < 1,000:1 on this genetic map. ** This marker was not retained in the final map [15]. CEN = Centromere; TEL = telomere.

Isolation of YAC End STSs

A total of 67 YAC clones containing at least one STS from the GLC1A interval were identified by PCR screening. Eighteen of these YACs have not been previously reported to be linked to an STS of the GLC1A interval. Most of the 67 YACs were used as templates for isolation of sequences corresponding to the insert ends using inter Alu vector PCR and inverse PCR methods (see Methods). Table 1 summarizes the characterization of YAC ends. In total, 38 YAC insert end sequences were characterized; 22 of them were obtained by inverse PCR and 16 by direct sequencing of Alu vector PCR reactions [Clépet et al., in preparation]. Out of the 38 YAC end sequences, 8 (21%) corresponded to part of a repeated element and could not be used for designing primers (table 2). Overall, 28 YAC end STSs have been generated. Most of these were assigned to a chromosome by PCR assays on a somatic cell hybrid panel [12]. When a YAC end STS was shared by at least 3 different YACs on the contig, it was considered as part of the interval. Of the 28 YAC end STSs, only 16 (55%) could be used to construct the contig; the other 12 eventually proved to originate from chimeric parts of the YACs. As an example, 2 cases of ‘chimeric STSs’ 765h10L and 955c7R are described in more detail: 765h10L has been assigned to chromosome 1 but is absent from all other YACs from the contig except 765h10. Similarly, AFM114xa5 is a chromosome-1-specific marker present on 765h10 (WI/MIT, Release 7) and absent from all other YACs from the GLC1A contig. Therefore YAC 765h10 appears as an intrachromosomal chimera.

Table 2 Characterization of YAC clones from the GLC1A region

955c7 is D1S2814 positive and has its right end (955c7R) on chromosome 4 (tables 1, 3). 928a5 shows the same STS content as 955c7, and contains 955c7R in particular (data not shown). It seems very likely that both YACs derive from the same initial clone; however, their sizes are different (1,170 kb for 955c7 versus 390 kb for 928a5). One possibility is that both YACs derive from the same initial clone 955c7, and that a deletion occurred subsequently in a derived cell, giving rise to 928a5.

Table 3
figure Tab3

STA Content of YAC clones from the GLC1A region

Characterization of the YACs

Most of the YACs (a total of 41) were subjected to FISH analysis on metaphase chromosomes to ascertain their DNA content. Eighteen YACs gave hybridization signals outside chromosome 1q, suggesting chimerism (table 2). Two YACs in particular (857c8 and 765g6) did not show any hybridization signal in 1q, despite their STS content (D1S2815 and D1S2814, respectively), indicating that the chromosome 1 portion of both of these YACs only represents a minor part of the insert. In addition to FISH data, left-end and right-end STSs were characterized

YAC sizes are in kilobases. An asterisk (*) indicates when size or FISH data are from the CEPH-Généthon database [16]. Left end and right end characterizations: (+) insert-end STS mapping in the GLC1A contig; (1,+) assigned to chromosome 1 and to the contig; (!!) chromosome-1 chimera; (KpnA/I) sequences containing part of a repeat element; other indications correspond to chromosomal assignments.

(table 2). Overall, 49 YACs were analysed either by FISH or insert end chromosomal assignment or both; 25 of these YACs appear to be chimeric.

Construction of a YAC Contig

A contig of the proximal part of the GLC1A region (interval A) was derived from published information and data obtained in this study. The CEPH Généthon map release (version 1.1 [16]) on which we initially relied for contig construction proposed one level-6 contig between microsatellites D1S431 and D1S210. This path, based on fingerprint and Alu-PCR hybridization data, comprised the following 6 YACs: 903f4, 875b1, 930a12, 895f3, 985e4, 809c4. To confirm this contig, a series of YAC end STSs were generated from several YACs including those listed in Quickmap. Ten YAC end STSs were characterized in that region and a contig containing 3 YACs (table 3) was sufficient to span interval A, thus reducing the initial Quickmap contig. In the meantime, this contig has been reduced to level 4 (CEPH-Généthon map, Version 2; [5]).

The contig was extended to the distal part of the GLC1A region (interval B) using a YAC walk initiated from four starting points: the flanking markers D1S210 and D1S2814 and internal markers D1S2815 and D1S2790. On average, 10 mega YACs were isolated from each landmark. Overlaps between YACs were established with YAC end STSs except for junction D1S2790-D1S2814, for which all 11 isolated YAC ends were irrelevant (8 being from other parts of the genome) (see tables 1, 3). The link between D1S2790 and D1S2814 was shown by hybridization of inter Alu-PCR products (data not shown), and confirmed with an expressed sequence tag (EST) marker mapped to this region using radiation hybrids [17].

Observation of the presence or absence of each marker in each YAC enabled marker ordering as shown in table 3. This order assumes minimal breakage and rearrangement of the YACs. However, a few deletions still had to be assumed for some YACs: 928h2 and 928g11 show internal deletions encompassing several adjacent STSs (5 and 4, respectively); 3 other YACs show single-STS deletions. Moreover, this physical map is not in total agreement with the genetic map [18] where D1S433 was given as 1 cM centromeric to D1S445; such an order would require gap introduction into two different YACs. In contrast, the reverse order (centromere-D1S445-D1S433) is consistent with the STS content of all 5 YACs spanning this region and with genetic data obtained more recently [3]. However this more recent map also shows a discrepancy with the present physical map for marker D1S2634 which we now place distal to D1S2814. (D1S2634 was not retained on the new Généthon genetic map [15].)

The physical map also resolves markers that could not be separated by linkage analysis or that were ordered with odds of less than 1,000:1 as in the case of the proximal group of markers, containing D1S2799 and D1S2658 (fig. 1, table 3). In addition some other markers were also integrated into the contig: D1S66 located in 1q21–q31 [19] is present in the most proximal YACs (see STS-HBI40, table 3). Indication that D1S1569 (a marker with low polymorphism) mapped in the contig was at first obtained from the WI/MIT (Release 3, July 1994). This marker was then localized inside the centromeric interval A (table 3).

Two ESTs (D1S2067E and D1S2049E) previously assigned to the GLC1A locus using radiation hybrids [17] were mapped to the contig. Both of them belong to the same ‘cDNA families’ as 2 ESTs (respectively D1S2359 and D1S14662) that were reported independently on contigs (WC-1.33 and WC-215) of the GLC1A region by the WI/MIT (Release 6, March 1995).

A tentative estimate of the physical size of the contig was assessed from nonchimeric YACs. The upper size limit was estimated from the smallest path of overlapping YACs. Conversely, the lower limit was calculated by adding the size of the largest nonchimeric YACs showing no STS overlaps. As shown in figure 2 (see also tables 2, 3), the length of the 10-cM GLC1A interval should be comprised between 7.5 Mb (D1S2658-D1S2691) and 9.4 Mb (D1S196-D1S416). This size estimate is consistent with the average of 0.85 Mb per centimorgan observed in the human genome [15].

Fig. 2
figure 2

Tentative estimate of the physical size of the GLC1A YAC contig. a Upper size limit: the smallest path of overlapping YACs found between markers D1S196 and D1S416 is shown. Nonchimeric YACs were chosen according to the present results (FISH and STS content). The deletion (including 4 STSs) of YAC 928 g11 is compensated by 928c8. The 3-STS overlap between 933h12 and 928g11 was subtracted from the overall length as it can be estimated as being roughly equivalent to the (360 kb) size difference between 766e4 (690 kb) and 653b3 (330 kb). The overall length of this path is 9,430 kb (850 kb + 1,580 kb + 1,600 kb + 1,500 kb + 1,180 kb − 360 kb + 950 kb + 900 kb + 1,230 kb). b The lower size limit is estimated between markers D1S2658 and D1S2691. A minimal length of 7470 kb for this interval was calculated using the largest nonchimeric YACs showing no STS overlaps (1,790 kb + 1,600 kb + 1,270 kb + 1,440 kb+ 1,370 kb).

Discussion

We have established a YAC contig spanning the 8–10 cM Iq23-q25 region containing GLC1A. This continuum of YACs was achieved after a series of bidirectional walks initiated from AFM genetic markers which mapped in this region. All YAC clones were derived from the CEPH-mega-YAC library whose large-insert size enabled contig completion with a minimal number of steps. D1S196 and D1S416 are connected by a minimal set of 6 YACs (table 3). This YAC contig is entirely STS based. In total, 41 STSs are physically linked to this region including 23 genetic markers, 16 YAC ends and 2 ESTs. Sixty-seven YACs were used to construct this contig. The chimerism of most of the YACs was assessed by FISH analysis on metaphase spreads, as well as by STS content. Overall, 50% of the YACs that were characterized appear to be chimeric. To reduce the risk of integrating false data from chimeric YACs, the depth of the contig was kept as large as possible. There are on average 10 YACs per STS with a minimal depth of 3 clones. Assessed from the size and STS content of nonchimeric YACs, the physical length of the 10-cM GLC1A interval was tentatively estimated to be between 7.5 and 9.4 Mb. Initially and during the course of this study, data released from Human Genome Mapping Centers were used to some extent. The following groups of markers are integrated at level 1 on YACs in the CEPH-Généthon map (Quickmap, Version 2; [5]), {D1S196, D1S431, D1S445} {D1S433} {D1S210, D1S452, D1S2851} {D1S2814} {D1S218, D1S242} {D1S416}. Based on fingerprint and Alu-PCR hybridization data, a few contigs spanning the proximal part of the GLC1A interval flanked by D1S196 and D1S210 (interval A, fig. 1) are available on Quickmap. The shortest Quickmap contig shown is of the 4th order, and is anchored on the marker in between, D1S433. Although this 4-YAC contig is close to the real situation, this path can actually be shortened to 3 YACs. This part of our contig was mostly assembled from YACs listed in Quickmap. In the more telomeric interval, running from D1S210 to D1S218/D1S242 (interval B, fig. 1), no contigs are shown on the Quickmap server. After the completion of this work, a YAC contig (WC1.23) spanning the whole GLC1A region was released by the WI/MIT (Release 9, December 1995). The GLC1A portion of WC1.23, flanked by D1S196 and D1S416, is based on 30 STSs and more than 69 YACs. Contig WC1.23 shares 14 STSs and 38 YACs with our contig, two ESTs from WC1.23 (D1S2359 and D1S14662) belong to the same cDNA families as D1S2067E and D1S2049E which we mapped independently, using radiation hybrids. The order of the shared markers is consistent except for the doublet (D1S242, D1S2814) which is reversed. The order proposed on WC1.23 for these 2 STSs appears very unlikely as it requires introducing a gap within 7 YACs in our contig. Moreover, in the MIT data, YACs 756c3 and 775e10 are given as negative for D1S2814, and YACs 945g6 and 948g10 as negative for D1S242 whereas we verified that all four YACs were positive for both markers. Therefore this small discrepancy regarding the order of the markers most probably results from false-negative results in the MIT data set, and we believe the order we have determined is more likely. About 14 STSs and 31 YACs from WC1.23 were not used in the present study. Conversely, our YAC continuum was generated with 24 STSs and 18 YACs not previously described (see table 3). In addition, each YAC/STS hit was confirmed by individual PCR assay and most of the YACs (49 in total) are now also characterized either by FISH or insert-end chromosomal assignment or both.

At the present time, the disease locus interval is still very large and this might hamper the gene search. Refinement of the GLC1A localization has been attempted by assaying all microsatellite markers mapped in this contig on our families of patients (data not shown). Although some recombination events were found among unaffected members (confirming the order of some markers), it has not yet been possible to narrow the GLC1A interval. Recruitment of new patients and linkage analysis remain a priority for identifying the GLC1A gene.