Introduction

Sugarcane (Saccharum spp.) belongs to the genus Saccharum, family Gramineae, and is an important energy crop. Saccharum consists of six species, namely S. officinarum, S. sinense, S. bareri, S. edule, S. robustum, and S. spontaneum. Saccharum has a close genetic relationship with Miscanthus, Sclerostachya, Erianthus, and Narenga, which constitute an interbreeding group called the “Saccharum complex”1,2. In 1996, D’Hont et al.3 reported that modern sugarcane cultivars possess approximately 120 chromosomes, with 70–80% derived from S. officinarum, 10–20% derived from S. spontaneum, and a few chromosomes derived from interspecific recombination.

Commercial sugarcane cultivars are derived from interspecific hybridization of different Saccharum species. Sugarcane has low heterogeneity and a narrow genetic base, which limits its yield, quality, and resistance4. However, another genus, Erianthus, if introduced into sugarcane, could overcome these limitations. E. arundinaceus has many desirable agronomic traits for sugarcane breeding, such as broad adaptability, disease resistance, drought resistance, and a high biomass5. The intergeneric F1 progeny between Saccharum spp. and E. arundinaceus cannot be developed using the “nobilization” hybridization strategy to increase cane yield and restore high sugar content because pollen from the hybrid clones are sterile6. In 2001, hybridization between an F1 clone as the female parent and a sugarcane cultivar as the male parent at the Hainan sugarcane breeding station in China was achieved and produced intergeneric BC1 progenies. BC2 progenies were produced in two years later. Meanwhile, sugarcane breeders found that the pollen fertility of the BC2 progenies was recovered7. Thus, a new hybridization strategy using a sugarcane cultivar as the female parent and a BC2 clone as the male parent has since been used8. Based on this strategy, our group also successfully created intergeneric BC3 and BC4 progeny.

Repetitive DNA sequences represent a large part of the eukaryotic genome, have different number of copies in genome and possess various specific features9. Consequently, repetitive sequences are sources of molecular markers that are useful in plant genetic studies, and have been cloned from many higher plant genera for use in phylogenetic studies or introgression breeding programs10,11,12,13. They diverged rapidly during evolution and are constantly homogenized, giving rise to sequences that are species-specific, genus-specific, and even chromosome-specific14,15. Moreover, their localization by situ hybridization provide important information on chromosome structure16. Accordingly, these repetitive DNA sequences have provided new molecular tools to investigating genetic diversity and phylogenetic relationships in the Saccharum complex and improving the efficiency of modern molecular breeding of sugarcane.

The genomic in situ hybridization (GISH) technique is used to study chromosomal structure, exchange, and mode of transmission and inheritance between parent and filial generations17. The mode of chromosome inheritance has been studied through the processes of nobilization breeding between S. officinarum and wild germplasm18. This work has indicated that chromosome inheritance occurs through n + n, 2n + n, n + 2n, and 2n + 2n transmission19,20,21,22. Wu et al.23 studied the chromosomal inheritance of hybrid progeny generated between Saccharum spp. and E. arundinaceus and found that the mode of chromosome inheritance was n + n transmission in the hybrid F1 progenies, 2n + n transmission in 9 of 13 hybrid BC1 progenies, and more than 2n + n transmission in the 4 of 13 hybrid BC1 progenies. However, previous studies have not reported the pattern of E. arundinaceus chromosome transmission from parents to progeny in the BC4 generation.

Distant hybrid utilization of E. arundinaceus in sugarcane has made great progress in recent years. Unfortunately, the genomics of Saccharum spp. and E. arundinaceus are far behind those of cereal crops since Saccharum spp. and E. arundinaceus are polyploid plants with large genomes and many homologous sequences24,25. However, researchers can use the genome sequencing strategy for wheat, which is called “break up the whole into parts,” or “BAC BY BAC,” for Saccharum spp. or E. arundinaceus. In this strategy, the wheat genome is broken down into a single chromosome or a single chromosome arm, which can be used to build the BAC library and physical map26. Therefore, sorting and sequencing E. arundinaceus chromosomes can be achieved by screening BC4 generation clones that contain 1 or 2 E. arundinaceus chromosomes.

Here, we aimed to determine whether EaHN92, the PCR product of AGRP52/53 as a primer, is an E. arundinaceus-specific tandem sequence, and whether it can hybridize with every E. arundinaceus chromosome through FISH. We also selected 3 cross combinations of intergeneric hybrid BC4 (1511, 1514, and 1625) as research materials to identify true hybrids in a BC4 population using PCR with an AGRP52/53 primer pair. We then used GISH to clarify the pattern of E. arundinaceus chromosome transmission to determine the presence of intergeneric chromosomal translocation and to screen clones that contained approximately 1 or 2 E. arundinaceus chromosomes to sort and sequence the E. arundinaceus chromosomes. This work will provide a basis for subsequent genome research in E. arundinaceus and Saccharum spp.

Materials and Methods

Plant materials

A total of 74 clones from three intergeneric BC4 populations, named 1511, 1514, and 1625 respectively, were selected in this study. These clones were from a crossing combination between a sugarcane cultivar (♀) and the clone YCE06–61 (♂). The latter was a clone in the BC3 generation derived from E. arundinaceus containing 7 E. arundinaceus chromosomes8. There were 20 clones from population 1511, 28 clones from population 1514 and 26 clones from population 1625 (Table 1). A further 17 clones from Saccharum, E. arundinaceus and progenies of E. arundinaceus were also included (Table 2).

Table 1 BC4 progenies between Saccharum spp. and E. arundinaceus.
Table 2 Saccharum, Erianthus and the hybrids used in the study.

Genomic DNA extraction

Young leaves of different individual plants were cut, ground with liquid nitrogen and genomic DNA was extracted using the traditional CTAB method following the method of Mace et al.27.

Polymerase chain reaction (PCR)

PCR identification of true hybrid progeny was conducted. A PCR reaction mixture was prepared on ice (Table 3) and carried out in a thermal cycler (ABI, USA) using the primer pairs AGRP52 and AGRP53 (AGRP52: 5′-AGGAAGTTATGGTGGAGTAT-3′; AGRP53: 5′-CGCCATTCCTATTGC-3′) following the method of Alix et al.28. The PCR program was performed as follows: pre-denaturation at 95 °C for 3 min, 30 cycles of 95 °C for 20 s, 55 °C for 20 s, 72 °C for 5 s, and 72 °C for 3 min. PCR products were tested using 1.5% agarose gel electrophoresis.

Table 3 PCR reaction mixture.

Using HN92–105 as a template, positive PCR products were purified via using the OMEGA EZNA Gel Extraction Kit (Omega, USA). The purified products were stored at −20 °C, as standby, and designated EaHN92. EaHN92 was then cloned into a pMD19-T vector (Takara, Japan) and transformed into an E. coli DH5α competent cell (Takara, Japan). The recombinant clones were grown in LB culture medium with ampicillin (100 μg/mL). Five clones were randomly selected for sequencing by the Sangon Biotech Company (Shanghai, China).

Chromosome preparation and slide preparation

Plants of the clone HN92–105 were grown in a pot with sandy loam soil. After growing for 60 days in summer root tips were cut at 9:00am every three days. Cane stalks of YCE06–61, and clones from populations 1511 (19 clones), 1514 (27 clones) and 1625 (24 clones) were cut into single eye setts, planted into trays which were kept at 25°C and covered by gauze to keep moist. After 5 days when the length of the roots were about 2–3 cm root tips were cut at 9:00 am. The root tips were pretreated with saturated dichlorobenzene solution at room temperature for 2.0 h to accumulate metaphase cells, placed into a fixation solution with a ratio of ethanol to acetic acid of 3:1 (v/v) for 18 h, and then stored at −20 °C in 75% ethanol solution until use. Chromosomal slide preparation was the same for FISH and GISH. The fixed roots were washed in water and digested at 37 °C for 150 min in an enzyme solution containing 8% Onozuka R10 cellulose (Yakult, Tokyo, Japan), 2% pectinase (Sigma, USA) and 1% pectolyase Y-23 (Yakult, Tokyo, Japan). The meristematic cells of root tips were squashed on a clean slide in a drop of fixation solution, then air-dried and stored at −20 °C until use.

Fluorescence in situ hybridization (FISH) and genomic in situ hybridization (GISH) procedures

EaHN92 sequence was labeled with Biotin-16-dUTP (Roche) as the FISH probe, carried out using the procedures of PCR identification of true hybrid progeny. FISH experiments were performed according to Panwar et al.29 with some modifications. The 50 μL hybridization mixture containing 5 μL of the labeled FISH probe, 25 μL deionized formamide, 5 μL dextran sulfate, 10 μL 20 × SSC and 5 μL ddH2O was denatured at 97 °C for 10 min, and then placed immediately in ice-water for 10 min. Each chromosomal slide was denatured at 80 °C for 3 min in the denaturation solution containing 70% deionized formamide and 2 × SSC, dehydrated in a series of precool ethanol solutions (75%, 95%, and 100% ethanol), and incubated in a humid box with 2 × SSC at 37 °C for 20 h. Post-hybridization washes were performed sequentially, once in 2 × SSC at 42 °C for 5 min, twice in 20% deionized formamide and 2 × SSC at 42 °C for 5 min, twice in 2 × SSC at 42 °C for 5 min, once in 2 × SSC at room temperature for 5 min, and once in 4 × SSC and 0.2% Tween-20 for 5 min at room temperature.

Genomic DNA from HN92–105 (E. arundinaceus) was labeled with Biotin-16-dUTP (Roche) as probes for GISH. Genomic DNA from Badila (S. officinarum) were not labeled with a biomarker as the blocking agent in GISH. GISH was performed in accordance with D’Hont et al.30 with some modifications. The 50 μL hybridization mixture containing 2 μL of the 100 ng/μL labeled genomic probe of HN92–105, 3 μL of the 100 ng/μL unlabeled genomic probe of Badila, 25 μL deionized formamide, 5 μL dextran sulfate, 10 μL 20 × SSC and 5 μL ddH2O was denatured at 97 °C for 10 min. The following procedures were the same as with FISH except for post-hybridization washing, which was twice in 20% deionized formamide and 2 × SSC at 42 °C for 8 min.

The Biotin-labeled probe was detected by Avidin D, Rhodamine 600 (XRITC), and a Biotinylated anti-avidin antibody (Vector Laboratories, Burlingame, CA). Finally, chromosomes were counterstained with 30 μL/slide Vectashield antifade solution (Vector Laboratories, concentration of 10 μg/mL DAPI) and mounted with a coverslip. FISH and GISH signals were captured using an AxioScope A1 Imager fluorescent microscope (Carl Zeiss, Gottingen, Germany), of which the blue and red fluorescence signal were excited in DAPI and Texas-red channel respectively. Images were processed using AxioCam MRc5 and AxioVision v.4.7 software (Carl Zeiss, Gottingen, Germany). For each sample, the number of E. arundinaceus chromosomes was calculated as a range from observations of 10 to 15 cells in metaphase.

Results

Analysis of EaHN92 repeated sequence

The repeat units of EaHN92 tandem repeat sequences was 364 bp satellite DNA sequence, which was cloned from E. arundinaceus HN92–105. The EaHN92 specific repeated sequence was submitted to NCBI database (Accession number: MH133205). Its homology was estimated using a nucleotide blast tool in the NCBI database, which showed 93% homology with EaCIR1 cloned from E. arundinaceus (Accession number: Y13453.1), 81% homology with SSCIR2 cloned from S. spontaneum (Accession number: Y13452.1) and 79% homology with SOCIR1 cloned from S. officinarum (Accession number: Y13451.1).

Analysis of fluorescence FISH results

The FISH experiment using the repeat sequence EaHN92 as a probe identified EaHN92 hybridization sites in sub-telomeric regions at one or both ends of 60 chromosomes in HN92–105 (Fig. 1A). EaHN92 hybridization sites were also detected in sub-telomeric regions at both ends of 7 chromosomes in YCE06–61 (Fig. 1B). These results indicated that EaHN92 was a 364 bp E. arundinaceus-specific sequence and that AGRP52/53 was an E. arundinaceus specific primer pair, which could be used through PCR to identify true hybrid progeny, generated between Saccharum spp. and E. arundinaceus (Figs 1B2).

Figure 1
figure 1

FISH analysis of EaHN92 labeled on mitotic chromosomes. (A) HN92–105 (2n = 6x = 60); (B) YCE06–61 contained 7 E. arundinaceus chromosomes. Red represents hybridization sites of EaHN92. All chromosomes are shown in blue. Scale bars: 5 μm.

Figure 2
figure 2

Electrophoretogram of 1625 progenies and sugarcane germplasm. Marker: 100 bp DNA ladder.

PCR identification of true hybrids between Saccharum spp. and E. arundinaceus

A preliminarily evaluation of hybrid progeny of BC4 crosses was conducted using PCR and detected two electrophoretic bands (364 bp, 728 bp, respectively) (Figs 2, 3). These results indicated that the two electrophoretic bands clones without E. arundinaceus chromosomes were not amplified. The true hybrid rate of the BC4 populations 1511, 1514, and 1625 were 30.0%, 14.3%, and 92.3%, respectively (Figs 2, 3; Supplementary information dataset 1).

Figure 3
figure 3

Electrophoretogram of 1511 and 1514 progenies. Marker: 100 bp DNA ladder.

Analysis of GISH results

The results of GISH detecting experiment indicated that the clones which were identified as being true hybrids via PCR identification contained 2, 4, and 6 E. arundinaceus chromosomes in the case of population 1511, 4 and 6 E. arundinaceus chromosomes in the case of population 1514, and approximately 1–6 E. arundinaceus chromosomes in the case of population 1625 (Table 4). The percentage of approximately 1–6 E. arundinaceus chromosomes of true hybrids appeared as Gaussian distribution in the 3 populations, of which 4 E. arundinaceus chromosomes (36.67%) made up the largest proportion, followed by 3 E. arundinaceus chromosomes (23.33%), 5 E. arundinaceus chromosomes (16.67%), 2 and 6 E. arundinaceus chromosomes (10.00%) and 1 E. arundinaceus chromosome (3.33%) (Table 4). These results revealed that the number of E. arundinaceus chromosome in transmission of BC3 to BC4 progenies were approximately reduced by half, but there would were special cases in this transmission where reduction was more or less than half (Fig. 4; Table 4; Supplementary information dataset 2). Intergeneric chromosomal translocation occurred between Saccharum spp. and E. arundinaceus in the clones 1625–4, 1625–7, and 1625-22; only one chromosome translocated in 1625-4 and 1625-7 and two chromosomes translocated in 1625-22 (Fig. 4H–J). The chromosomal translocation in 1625-4 occurred in the terminal regions (Fig. 4H). The chromosomal translocations in 1625-7 and 1625-22 occurred in the centromeric regions (Fig. 4I, J).

Table 4 Chromosome transmission in BC4 progenies of Saccharum spp. × E. arundinaceus.
Figure 4
figure 4

GISH analysis of BC4 progenies between Saccharum spp. and E. arundinaceus. E. arundinaceus chromosomes are shown in red and Saccharum spp. chromosomes are shown in blue. (A) 1625-13: 1 E.; (B) 1625-10: 2 E.; (C) 1625-19: 3 E.; (D) 1625-25: 3 E.; (E) 1625-23: 4 E.; (F) 1625-26: 5 E.; (G) 1514-8: 6 E.; (H) 1625-4: 4 E. + 1 E./S.; (I) 1625-7: 4 E. + 1 E./S.; (J) 1625-22: 4 E. + 2 E./S. Arrowheads in (H) (I) and (J) show translocated chromosomes. E. and S. indicate E. arundinaceus chromosomes and Saccharum spp. chromosomes, respectively. E./S. indicates translocation of the E. arundinaceus chromosome and Saccharum spp. chromosome. Scale bars: 5 μm.

Discussion

In eukaryotes, a significant fraction of the genome is comprised of repetitive DNA sequences and this component is often greater than the coding sequence component31. Researchers have shown that the repetitive DNA sequences play an important role in numerous cell processes and species evolution32. Consequently, understanding the contents and origins of repetitive DNA sequences represents an important step towards completely deciphering the organization and function of the genome sequence33. Satellite DNA sequence is a type of repetitive DNA sequence used as molecular marker to assist breeding. In 1998, Alix et al.28 developed an E. arundinaceus-specific primers pair, AGRP52/53, based on the E. arundinaceus-specific satellite DNA sequence EaCIR1. However, earlier workers believed that the AGRP52/53 cannot be used as molecular marker for identifying true intergeneric hybrids of Saccharum spp. and E. arundinaceus because two of the E. arundinaceus chromosomes could not hybridize with EaCIR1 in the FISH experiment. Our FISH results were different from that of Alix et al.28 in indicating that the 364 bp sequence EaHN92 is actually an E. arundinaceus-specific sequence and hybridized with all E. arundinaceus chromosomes in the subtelomeric regions. Therefore, the EaHN92 sequence is an E. arundinaceus-specific tandem repeat sequence.

Intergeneric hybrid populations between S. officinarum and E. arundinaceus often turn out to be of false hybrids due to the high selfing rate34. Therefore, in 2002, Deng et al.35 used isozyme markers to identify true hybrid progeny generated between E. arundinaceus and Saccharum spp. However they could not identify the true hybrids because there were many similar bands in different parents. Following this work, SSR and 5 S rDNA markers were used to identify the true hybrid progeny36,37. In 2004, Zheng et al.38 identified the true hybrid progeny of E. arundinaceus through PCR using the primers, EF1/ER1 and EF2/ER2, which were based on the internal transcribed spacer (ITS) sequence of E. arundinaceus. However, in 2010, Deng et al.22 found that it was necessary to combine the results of PCR using EF1/ER1 or EF2/ER2 as primers to identify the true hybrid BC2 progenies generated from E. arundinaceus and Saccharum spp. In our study, true intergeneric hybrids between Saccharum spp. and E. arundinaceus could be rapidly and precisely identified using PCR with an AGRP52/53 primer pair. This approach could have useful wide application.

In recent years, much research has been carried out on chromosome transmission of different generations derived from hybrids between Saccharum spp. and E. arundinaceus. Wu et al.23 reported the mode of chromosome transmission was “n + n” in the F1 generation between E. arundinaceus and S. officinarum and the mode of chromosome transmission was “2n + n” in most BC1 generations produced between F1 and sugarcane cultivars. However, in some cases the chromosomes transmitted were more than “2n + n” in the BC1 generation. Huang et al.8 and Piperidis et al.39,40 reported that the mode of chromosome transmission was “n + n” in the BC2 and BC3 generation. In fact, previous studies reported the number of E. arundinaceus chromosomes was approximately 28–29, 22–31, 8–17 and 4–8 in the F1, BC1, BC2 and BC3 generation respectively8,23. In this study, the number of E. arundinaceus chromosomes was approximately 1–6 in the BC4 populations of Saccharum spp. and YCE06–61 with most having 3–4 E. arundinaceus chromosomes. YCE06–61 contains 7 E. arundinaceus chromosomes, and therefore the number of E. arundinaceus chromosomes of true BC4 hybrids appeared as Gaussian distribution in 3 cross combinations. In distant hybridization of plants, the complete and partial elimination of chromosome from one parent has been observed in crosses covering Gramineae and other species41. Such elimination appeared to be a common and nonradom event. Based on observation on meiosis behavior of pollen mother cells, Lin et al.42 found that chromosomes unevenly separated to new cells during meiosis in the F1 generation of S. officinarum and Erianthus rockii because E. rockii chromosomes lagged and lost. Unfortunately, the exact mechanism of this phenomenon is still not clear and need to be investigated further.

Translocated chromosomes are stable sources for transmitting hereditary information to the progeny in plant distant hybridization. In this study, we found intergeneric translocated chromosomes between Saccharum and E. arundinaceus in three clones of the BC4 generation, of which one occurred in the terminal regions and two occurred in the centromeric regions. Earlier studies had not reported intergeneric translocated chromosome in YCE06–618. However, an intergeneric chromosomal translocations were reported within the BC1, BC2 and BC3 generation generated between Saccharum spp. and E. arundinaceus8,23,39. This indicates that intergeneric translocated chromosomes can arise randomly in different generations.

In this study, we identified clones in the BC4 generation with 1 or 2 E. arundinaceus chromosomes that did not appear to be translocated for sorting and sequencing of the E. arundinaceus chromosomes. Researchers around the world have pursued S. spontaneum and S. officinarum genome sequencing but have not achieved success due to the presence of many homologous sequences due to high level polyploidy and very complex sequence data. Garsmeur et al.43 achieved a mosaic monoploid reference sequence for sugarcane from R570 BAC clones, but missed many sequences because the BAC clones mostly distributed in sorghum gene-rich distal chromosomal regions. In 2018, Zhang et al.44 had achieved allele-defined genome of the autopolyploid S. spontaneum L. successfully, which it was an important finding for sugarcane. However, this genome map of S. spontaneum L. missed telomere sequences, many homologous sequences and repeated sequences. Therefore, in order to achieve a high-accuracy genome sequence of E. arundinaceus, we suggest that those performing E. arundinaceus genome research use the “BAC BY BAC” sequencing strategy since E. arundinaceus is a hexaploid plant with a large and complex genome. Although genome research of Saccharum and its relative genus polyploid plants is very difficult, this study provides new techniques to aid researchers.

Conclusion

True hybrid progenies could be rapidly and precisely identified using PCR with an E. arundinaceus-specific primer pair, AGRP52/53. EaHN92, a 364 bp E. arundinaceus-specific sequence, was cloned from HN92–105 with AGRP52/53 and was localized on sub-telomeric regions of all E. arundinaceus chromosomes using FISH. According to the results of GISH, the number of E. arundinaceus chromosome in BC4 population ranged from 1 to 6 and appeared as Gaussian distribution. Identifying true hybrid clones with approximately 1–2 E. arundinaceus chromosomes without translocation using GISH could be used for sorting and sequencing E. arundinaceus chromosomes.