Introduction

Down syndrome (trisomy 21) is the commonest birth defect and afflicts 1 in 700 live-born infants. It is characterized by a specific phenotype and mental retardation. In most cases, it results from the presence in all cells of an extra copy of chromosome 21 [1]. Karyotype analyses of rare partial trisomy 21 cases have indicated that only the distal part of chromosome 21, band 21q22, is involved in the pathogenesis of the syndrome [2]. However, the proximal part [3] and the distal part [4] of band 21q22 have independently been shown to be involved in the expression of Down syndrome.

More recently, the molecular study of two different patients with partial trisomy (FG and IG) have highlighted a region, around D21S55, involved in the pathogenesis of short stature, various facial, hand and foot anomalies characteristic of Down syndrome, muscular hypotonia and mental retardation [5, 6]. This region has been named DCR for Down syndrome chromosome region [7]. Using patient IG, the proximal border of DCR has been mapped between D21S17 and D21S55 and using patient FG the distal border of this region has been mapped between D21S55 and ETS2. Analysis of the genotype-phenotype correlation in other patients with partial trisomy 21 has led to a more detailed Down syndrome phenotypic map of chromosome 21: the D21S55 region or DCR is associated with the expression of mental retardation, muscular hypotonia, joint hyperflexibility and 9 morphological features. A larger region from D21S55 to MX1, which includes the DCR, is associated to the DCR features plus 6 other morphological features [8]. These data are consistent with and improve the accuracy of the previously reported DS phenotype maps [9, 10].

The size of the DCR has been estimated to range between 400 and 3,000 kilobases (kb) [5, 6]. The physical map of distal chromosome 21 [11] suggests that the D21S55-MX1 region is between 4.5 and 6 megabases (Mb). Only a preliminary physical map of the DCR is available and no gene has yet been identified in this region.

To improve the physical map of this region, we screened yeast artificial chromosomes (YAC) libraries [12] which allow the study of large DNA fragments. Two YAC libraries were screened by polymerase chain reaction (PCR) and selected YAC clones were analysed by pulsed-field gel electrophoresis (PFGE), AluPCR and in situ hybridization. A 1.2-Mb contig, encompassing the protooncogenes ETS2 and ERG, was identified.

The genomic map of patient FG, which defines the distal boundary of the DCR [13], was compared with the normal genomic map and the YACs maps. The 1.2 Mb contig appears to contain the distal boundary and part of the DCR.

Materials and Methods

Genomic PFGE

High-molecular-weight DNA from human lymphocytes or fibroblasts was prepared in low-melting-point agarose plugs as in [14].

Digestion of the plugs, separation by PFGE and DNA transfer were performed as in [15].

Southern Blot

We used human DNA, YAC DNA or DNA from a panel of somatic cell hybrids containing the totality (WA17) [16] or various segments of chromosome 21 (ACEM, GA9-3, 1881c-13b and 9542-5c) [17] as single human chromosome. To study YAC DNA, the total yeast DNA was used.

DNAs were digested, separated on a 0.8% agarose gel and blotted onto Zetabind (AMF Cuno) or Biodyne B (Pall) by the alkaline procedure.

Gene Dosage Assay

Unique DNA sequence copy numbers were determined by a slot blot method previously described [18].

PCR Conditions

PCR was carried out in a Hybrid TR1 thermal cycler, in a volume of 50 µl containing 500 ng of DNA, 10 mM Tris-HCl pH 8.3, 50 mM KCl, 2 mM MgCl2, 0.01% gelatin, 200 µM of each dNTP, 0.5–1 µM of one of each primer and two units Taq polymerase (Perkin-Elmer Cetus), with a 100-µl mineral oil overlay. Samples were first denatured 5 min at 95°C in the thermal cycler and then submitted to 30 cycles of amplification.

YAC Libraries and Screening

Two YAC libraries were screened by PCR for ETS2 and ERG: the CEPH library [19] and the Chromosome 21 Joint YAC Effort (CHR21JYE) library [20].

For ETS2, the primers ETS2-1 (5′AGATTCTGACTGTGACTCATGCCAC3′) and ETS2-2 (5′AGGTGCTTGAGTCCATGGCTGTTTG3′) correspond to positions 201–225 and 501–525 of the promoter sequence [21]. The cycle of amplification leading to the 325-bp product was: annealing 30 s at 62°C, elongation 30 s at 72°C, and denaturation 20 s at 92°C. For the CHR21JYE library, other primers, determined by Brennan [22] were used.

For ERG, the primers A185 (5′GAAGGCACCAACGGGGAGTTCAA3′) and 68C (5′GACCTTGGTCATGATGTTCTTGTC3′) correspond to positions 1208–1232 and 1332–1355 of the cDNA sequence [23]. The cycle of amplification leading to the 148-bp product was: annealing 2 min at 60°C, elongation 90 s at 70°C, and denaturation 20 s at 92°C.

Preparation of YAC DNA

Yeast was grown in SD medium [24] at 30°C for 30 h. High-molecular-weight yeast DNA plugs were prepared by the method described by Carle and Olson [25] with some modifications [15]. Each plug (22 µl) contained 7.5 × 106 cells.

Total yeast DNA was obtained as follows: the cells were pelleted, incubated for 30 min at 37°C in SCE (1 M sorbitol, 0.1 M sodium citrate, 0.06 M EDTA-citric acid pH 7) containing 171 mM betamercaptoethanol and 25 U/ml of zymolyase. After centrifugation (10 min at 10,000 rpm) the pellet was resuspended in 50 mM Tris-HCl pH 8, 50 mM EDTA, 1% sarkosyl, 3 M urea, 500 µg/ml proteinase K and incubated for 1.5 h at 55°C. DNA was then extracted once with phenol, once with phenol: chloroform:isoamylalcohol (24:24:1[v/v]) and once with chloroform:isoamylalcohol (24:1[v/v]). The last upper phase was ethanol precipitated, and DNA was recovered by spooling and was dissolved in 10mM Tris-HCl, 1 mM EDTA, pH 7.5.

Isolation and Identification of YAC Ends

Insert-terminal YAC segments were isolated by vector AluPCR. The primers for the right arm (1091) or the left arm (1089) [26] were used in combination with the Alu 3′ primer (AluIV) [27] or with the Alu 5′ primer (PDJ33) [28]. The conditions of amplification are the same as with AluIV primer alone or PDJ33 primer alone, as described below. The YAC ends amplified by PCR were fractionated in a 1.5% agarose gel, melted in H2O and frozen at −20°C for 1 h; the mixture was heated to 65°C for 5 min, then labelled by random priming [29] and used as probes.

AluPCR

This technique allows the amplification of unique DNA sequences localized between two Alu sequences [27, 30]. The PCR conditions for the AluIV primer were: annealing 2 min at 57°C, elongation 4 min at 72°C and denaturation 1 min at 93°C, and for the PDJ33 primer: annealing 2 min at 60°C, elongation 4 min at 72°C, and denaturation 1 min at 93°C. The PCR products were fractionated on an 1.5% agarose gel. The fragments shared by several YACs were prepared as described above and used as probes.

Restriction Mapping of the YACs

For total digestion, plugs were incubated overnight with 50 units of enzyme according to the manufacturer’s recommendations.

For partial digestion, plugs were incubated for 2 h with 1 or 5 units of enzyme in the appropriate reaction buffer (Biolabs or Promega).

Digests were fractionated by PFGE [15]. Gels were blotted onto Zetabind (AMF Cuno) or Biodyne B (Pall) using the alkaline procedure. Filters were first hybridized with the pBR322-derivative DNA probes corresponding to each of the pYAC4 arms [12]. The left arm containing the yeast Trp gene was visualized by the 2.7-kb PvuII-BamHI restriction fragment of PBR322 (pBR2.7) and the right arm, containing the yeast Ura gene, was visualized by the 1.6-kb PvuII-BamHI restriction fragment of pBR322 (pBR1.6) [31]. Filters were then hybridized with each of the available internal probes.

Consensus restriction maps of each YAC were constructed from the resulting autoradiographs.

In situ Hybridization

Metaphasic chromosomes were obtained from blood lymphocytes from patient (FG) and from healthy donors. Cells were cultured by a standard protocol and synchronized using thymidine [32]. The probes were labelled by nick translation using biotin-14 ATP (BRL), according to manufacturer’s protocol and purified by passage through G50 Sephadex, precipited with 3 M sodium acetate, in the presence of salmon sperm DNA (100 µg) and ethanol. The amount of labeled total yeast DNA depends on the size of the YAC. To obtain 50 ng of YAC labelled, 3 µg of yeast DNA were used for a YAC size from 500 to 800 kb. Probes were dissolved in 10 µl hybridization buffer with 300-fold human DNA as competitor, and denatured at 100°C for 10 min.

In situ hybridization was performed according to Zhang et al. [33], and antibiotin FITC-conjugated antibody (Sigma) was used for detection. Chromosome preparations were counterstained with propidium iodide (Sigma) and examined with a Zeiss Axiaphot microscope.

Probes

The D21S55 probe used was the 3.9-kb EcoRI genome fragment [34]. The ETS2 cDNA probe was the 2.3-kb EcoRI fragment of the cDNA [35]. The ETS2 promoter probe was the 325-pb fragment amplified by PCR.

ERG cDNAs have been isolated by the screening of a human fetal liver cDNA library. These cDNAs correspond to alternative forms of the already described cDNAs [23], and differ in their 5′ region. Two fragments of the longer cDNA have been used as probes: the ERG cDNA 5′ probe (5′ERG) is 1.7 kb, contains the ATG start codon and corresponds to the whole coding region; the ERG cDNA 3′ probe (3′ERG) is 1-kb and corresponds to the noncoding region. These two ERG probes overlap on 300 pb [M.D.-C., unpubl. data].

The probe for HMG14 is the PstI genomic fragment of 0.7 kb [36].

Results

Genomic Map of the D21S55-ETS2 Region

A partial physical map of this region in lymphocyte DNA has already been published [37] showing a physical linkage between D21S55 and ERG. We describe here a physical linkage between D21S55, ERG and ETS2, established by complete and partial digestion of fibroblast DNA [15].

Fragment sizes, detected after hybridization with these three probes, for complete or partial digestion with the enzymes NotI, NruI, SfiI, KspI, BssHII and MluI, were analyzed. No common fragment was found with SfiI, KspI, and BssHII with the three probes. With MluI, we identified two MluI fragments for ETS2 (360 and 800 kb), and one fragment of 1,650 kb for D21S55 and ERG. With NotI, we identified a 700-kb ETS2 fragment, a 1,450-kb ERG-D21S55 fragment and a 2,100-kb partial NotI fragment common to the three probes (data not shown). With NruI, these three probes hybridized to different complete restriction fragments: 600 kb ETS2, < 1,000 kb ERG and 1,200 kb D21S55. Figure 1 shows the partial NruI restriction fragments for these three probes. Two fragments from partial digests were common to ERG and D21S55 (1,900 and 2,400 kb), whereas no fragments from partial digests and carrying ETS2 contained ERG or D21S55. These fragments presumably contained contiguous NruI fragments distal to ETS2.

Fig. 1
figure 1

Sequential hybridizations of a PFGE blot with D21S55, ERG cDNA5′, and ETS2 cDNA. Fibroblast DNA was digested with NruI. 1 = 10 mM Mg2+; 2 = 0.5 mM Mg2+.

These data established the physical linkage between ERG, D21S55 and ETS2 but are insufficient to map the ERG gene with respect to D21S55 and ETS2.

YAC Contig

The YAC libraries were screened by PCR for ETS2 and ERG. The characteristics of the positive clones are summarized in table 1.

Table 1 Characterization of YAC clones

AluPCR. The AluPCR method allows the identification of unique sequences localized between two Alu sequences in human DNA.

The PCR conditions were determined for the amplification of a few fragments. By varying the Mg2+ concentration, the number of visible bands could be increased but with a higher background signal as well [data not shown, and 37]. As PCR fragments were cut out of the gels and used as probes with no further purification, we performed AluPCR in conditions giving the lowest background.

With the primer AluIV (fig. 2), we obtained fragments of the same size in different YACs. To determine their position on chromosome 21, their identity and the overlap between the different YAC clones, these fragments were used as probes on Southern blots of YAC (data not shown), human, and hybrid DNAs. Figure 3 shows an example of such an analysis: the two PCR products n1, namely AluIV: 1 from 259H11 and AluIV:1 from 448D5 hybridized with the same genomic fragment and are therefore identical, indicating that these two YACs overlap in a region containing this AluPCR product. YACs B19C12, 259H11 and A125B12 overlap as do 259H11, A125B12 and 448D5. As 448D5 and A196B6 both hybridize with the ETS2 cDNA probe, they are also overlapping. These data allow the construction of a contig between the ERG and the ETS2 YACs. More bands were amplified with the primer PDJ33 than with the primer AluIV. A similar analysis of the results with the primer PDJ33 confirmed the YAC overlaps identified with AluIV (data not shown).

Fig. 2
figure 2

Ethidium-bromide-stained gel of inter-AluIV PCR products. 1 = 259H11; 2 = A125B12; 3 = 448D5; 4 = A196B6; 5 = B19C12; 6 = yeast Saccharomyces cerevisiae AB972; nl indicates the inter-Alu IV fragments used as AluIV: 1 probes (fig. 3, 4).

Fig. 3
figure 3

Sequential hybridizations of EcoRI restricted DNAs. 1 = Human fibroblast; 2 = hybrid Acem; 3 = hybrid GA9–3; 4 = hybrid 1881c–13b; 5 = hybrid 9542c–5a; 6 = Chinese hamster; 7 = WA17, and 8 = mouse with AluIV: 1 probes from 448D5 and 259H11 identifying the same 15-kb fragment.

YAC Ends. YAC ends were identified by vector-AluPCR and the results are summarized in table 2. Only 50% of the ends could be isolated by this method. The amplification products were used as probes on Southern blots to determine their localizations on chromosome 21 and the overlap between YAC clones (table 2). The right arm of 259H11 and that of A196B6 did not hybridize with any other YACs, indicating that these ends map at each extremity of the contig. A125B12 (left arm) hybridized with A125B12 and 448D5, and 448D5 (left arm) with A125B12, 259H11 and 448D5. Thus these three YACs overlap and can be oriented with respect to each other. The left arm of B19C12 hybridized with 259H11, but not with the other YACs. This confirms the overlap between B19C12 and 259H11, as determined by AluPCR, and orients B19C12 with respect to 259H11 and A125B12.

Table 2 Isolation and characterization of YAC ends

Probing YAC Southern blots with the left YAC end from 448D5 (probe LE448) visualized an EcoRI fragment of 2.4 kb in the two clones from the CEPH library (259H11 and 448D5) and an EcoRI fragment of 2.7 kb in the clone from the CHR21JYE library (A125B12) (fig. 4A). This size difference may be due to polymorphism. To test this hypothesis, we tested this probe on hybrid DNAs (data not shown) and on 9 different human DNAs (fig. 4B). In hybrid DNAs, we found either the 2.4- or the 2.7-kb fragment. Similarly, using human DNAs we found the 2.4-kb fragment in 5 individuals, the 2.7-kb fragment in 1 individual and both fragments (2.4 kb + 2.7 kb) in 3 individuals. These results confirm that probe LE448 detects an EcoRI polymorphism.

Fig. 4
figure 4

EcoRI polymorphism detected by LE448 probe. A YAC DNAs from 259H11 (1), A125B12 (2), 448D5 (3), and A196B6 (4). B Nine different human DNAs.

Restriction Map of the YACs. The five YACs were completely digested each with BssHII, MluI, SalI, NruI, NotI, SfiI and KspI. 448D5 and 259H11 (fig. 5) were partially digested by NruI, KspI, SfiI and BssHII and by NruI, SalI, BssHII and KspI, respectively. The resulting blots were hybridized with both YAC vector arms (pBR2.7 and pBR1.6) and the internal probes 5′ERG, 3′ERG, ETS2 promoter and ETS2 cDNA. The resulting data allowed the construction of the restriction map for the five YACs (fig. 6).

Fig. 5
figure 5

Sequential hybridizations of a PFGE blot with pBR1.6, ERG cDNA5′ and pBR2.7 probes. 259H11 YAC DNA was partially digested with 5 U SalI (0), 1 U SalI (1), 5 U NruI (2), 1 U NruI (3), 5 U BssHII (4), 1 U BssHII (5), 5 U KspI (6), and 1 U KspI (7).

Fig. 6
figure 6

Contig map of overlapping YAC clones. Right arm (R), left arm (L) and enzymes used for the restriction map are indicated on the figure. Dashed lines correspond to maximum size of the regions hybridizing with the ERG and the ETS2 probes. (?) = Restriction site not found.

The probes 5′ and 3′ERG overlap for 300 bp. The orientation of the ERG gene in the contig was deduced from the following data (shown in fig. 6): in 259H11 and A125B12, these two probes mapped on the same fragment (15 kb NruI, 30 kb SalI and 90 kb partial SalI fragments). The 5′ERG probe mapped on a 60-kb SalI and on a 170-kb NruI fragment. In A125B12, pBR1.6 mapped on the same 30-kb SalI fragment as the two ERG probes, whereas the 5′ERG 60-kb SalI fragment was not visualized. In B19C12, the two ERG probes and the pBR1.6 probe mapped on a 20-kb SalI and on a < 10-kb NruI fragment. The 5′ERG 60-kb SalI and 170-kb NruI fragments are not found in this YAC. Thus, the use of two ERG probes allows the orientation of this gene in the YACs 259H11 and A125B12, and indicates that the YAC B19C12 does not contain the entire gene.

To orient the ETS2 gene in the YACs 448D5 and A196B6, we used two internal probes: the ETS2 cDNA probe hybridized with the two YACs and the ETS2 promoter probe hybridized only with 448D5 (shown in fig. 6). In 448D5, the probes pBR1.6, ETS2 cDNA and ETS2 promoter were on different NruI fragments (40, 48 and 42 kb, respectively), but on single SalI (150 kb) and MluI (120 kb) fragments. Probes pBR1.6 and ETS2 cDNA were on the same 90 kb KspI, 90 kb BssHII and 88 kb partial NruI fragments. Probes ETS2 promoter and ETS2 cDNA were on the same 65 kb SfiI fragment. In A196B6, pBR2.7 and ETS2 cDNA mapped on 45 kb SfiI, 50 kb NruI, 130 kb SalI, 140 kb KspI, 140 kb BssHII and 180 kb MluI fragments. Thus the probes for the ETS2 gene can be ordered as follows in 448D5: right arm — ETS2 cDNA — ETS2 promoter; and in A196B6: left arm — ETS2 cDNA. This shows that the overlapping of these two YACs involves the coding region of the ETS2 gene.

An overlapping contig map (fig. 6) was constructed by comparing the restriction maps of the five YACs. There were a few differences between the maps of these YACs: one MluI site and one SalI site, which were not digested to completion in 259H11, were not found in A125B12. Similarly one NruI site which was only partially cleaved in 259H11 was not found in B19C12. This contig contains the two protooncogenes ERG and ETS2 in a head-to-head orientation. Based on the genomic map, ERG is localized between D21S55 and ETS2.

The presence of HMG14 in this contig was tested by hybridization on YAC Southern blots. The human fragment specific for HGM14 was not found in the YACs of the contig.

In situ Hybridization. All the five YACs were used for in situ hybridization assays on metaphasic lymphocyte chromosomes of normal individuals, to check the absence of coligation artifacts in these five YACs, and their localization on chromosome 21. All the YACs gave a hybridization signal in the same region, the 21q22.2–21q22.3 junction, and no signal on other chromosomes. Thus there are no coligation artifacts in these YACs.

The 3 ERG YACs (259H11, B19C12 and A125B12) were hybridized with the metaphasic lymphocyte chromosomes of patient FG. The karyotype of this patient is 46 XY, dir dup (21) (pter→q22.300:ql 1.205→qter). Previous data obtained by in situ hybridization with probes close to the borders of the duplicated segments were consistent with a true tandem duplication of chromosome 21 [13]. Only 259H11 gave 2 signals on the duplicated chromosome, showing that a large part of this YAC is duplicated in this patient (fig. 7). These data indicate that part of the 259H11 YAC is within the DCR.

Fig. 7
figure 7

In situ hybridization on metaphasic chromosomes of the patient FG with the YAC 259H11. Spots of hybridization on the normal and rearranged chromosomes are indicated by white arrows.

Localization of the ERG Gene with Respect to the DCR. To test if the ERG gene is within the DCR, we estimated the copy number of the 5′ERG sequence in blood DNA from patient FG by the slot blot method [18]. No duplication of this gene in patient FG was found (data not shown).

Previous PFGE analysis of the patient FG, with D21S55 and D21S16, indicated an abnormal NotI restriction fragment which contains the distal boundary of the DCR [13]. PFGE analysis with the ERG probes did not reveal this abnormal NotI restriction fragment (data not shown). This confirms that the ERG gene is distal to D21S55 and shows that it is outside the DCR.

Discussion

In this study, a YAC contig of 1.2 Mb in the DCR, and encompassing the two protooncogenes ETS2 and ERG, has been established by studying five YACs positive with either ERG or ETS2. The overlaps between these YACs were first determined by AluPCR. The extent of the overlapping regions were assessed by AluPCR, ETS2 and ERG PCR and by restriction mapping. 50% of the YAC ends were isolated by vector AluPCR (table 2) and localized on chromosome 21, especially the ends of the contig (259H11 right arm and A196B6 right arm). In two cases, B19C12 right arm and A196B6 left arm, the extremities hybridized with ERG or ETS2 cDNAs respectively, indicating their localization on chromosome 21. YAC ends for 259H11 left arm, A125B12 right arm, and 448D5 right arm were not isolated, but showed identical restriction maps with at least one other YAC. For these three ends, a minor rearrangement cannot be excluded. The AluPCR fragments isolated from these YACs allowed the identification of new potentially polymorphic markers on chromosome 21: the left YAC end of 448D5 (probe LE448) detects an EcoRI polymorphism. These new markers therefore constitute powerful tools for the systematic physical and genetic mapping of chromosome 21 and for the analysis of partial trisomy 21.

Previous PFGE mapping data reported a restriction fragment common to D21S55 and ERG [36]. Our genomic map shows a physical linkage between D21S55, ERG and ETS2, but whether ERG is distal or proximal to D21S55 cannot be determined. Comparison of the genomic and YAC maps indicates that D21S55 is more proximal than ERG to ETS2. This was confirmed by the analysis of patient FG. The distance between ERG and ETS2 is about 400 kb. The genomic NotI map indicates that the distance between D21S55, which is not present in this contig, and ERG is about 1,000 kb. Two other sequences, HMG14 and D21S3, previously mapped in this region of the chromosome [17], can be excluded from the contig: HMG14 did not hybridize on YAC Southern blots, and D21S3 PCR screening [CHR21JYE and I. Chumakov, personal commun., for the CEPH library] was found negative with these YACs.

There are more rare cutter restriction sites in YAC DNA than appear on the genome map. This is likely due to differential DNA methylation since DNA cloned in YACs does not undergo cytosine methylation [38]. Only NotI digestion did not reveal any additional sites. It has been suggested that multiple sites containing CpG identify HTF islands (HpaII tiny fragment islands) associated with many mammalian genes [39]. At least three putative islands (BssHII + KspI) were identified in the contig: one is located in the ETS2 promoter region [21], which is also identified on the genomic map [40]; two others, one distal to ETS2 and one proximal to ERG at the extremity of the contig (right end of 259H11) may be landmarks of unknown genes.

ERG (Ets-related gene) and ETS2 are two related protooncogenes that are 400 kb apart and in opposite orientations on the chromosome. They encode proteins which are specific DNA-binding molecules acting as sequence-specific transcriptional activators [4143 and M.D.C., unpubl. data]. ETS2 is expressed in various adult cells, particularly those with dividing potentiality [44, 45], and the presence of a HTF island in its promoter region is consistent with the characteristics of an ubiquitous gene. Conversely, no HTF was found in the 5′ region of the ERG gene, the expression of which was not observed in a variety of normal tissues [46], except in transformed cells [23]. Thus, ERG might be expressed in a cell- or tissue-specific way during embryogenesis and play a role in development.

As tissue-specific genes lacking HTF islands have been described [39], the search for genes in this region will be carried out after subcloning in phages or cosmids by several alternative approaches: conservation of sequences by hybridization on Zooblots indicating putative exonic sequences, exon trapping, and screening of cDNA libraries. Moreover, chromosome walking from the extremity of 259H11 is in progress to extend the contig towards D21S55, and to identify the gene content of the DCR.