Introduction

Cytoplasmic male sterility (CMS) is a robust, easy-to-maintain and produces satisfactory hybrid seed yield which are essential features for a genetic mechanism to use in commercial hybrid seed production. It is the maternal inheritance of the inability of a plant to produce a viable pollen grain and is governed by mitochondrial genome1,2. Introgression of sterile cytoplasm from different species or genus creates new nulceo-cytoplasmic interactions in the converted CMS lines. However, the level of impact needs to be investigated which may vary depending upon donor source (species or genus) and recipient backgrounds (different genotypes).

There are different male sterile cytoplasms available for Brassica crops, namely ā€˜Oguā€™ from wild Japanese radish3,ā€˜Canā€™ of Erucastrum canariense Webb and Berthel4, ā€˜Tourā€™ of Brassica tournefortii Goun5, napā€™ of Brassica napus L.6 ,ā€˜kosenaā€™ of ā€˜Raphanus sativus L.7 , ā€˜murā€™ of Diplotaxis muralis (L.) DC8 , ā€˜Polā€™ of B. napus9,ā€˜hauā€™ of Brassica juncea (L.) Czern10 ,ā€˜moriā€™ of Moricandia arvensis L.11 ,ā€˜frutiā€™ of Brassica fruticulosa Cirillo12 ,ā€˜inapā€™ of Brassica napus L.ā€‰+ā€‰Isatis indigotica Fort.13 and Tour-Stiewe of B. tournefortii Goun (somatic cell fusion)14. The present study elaborates on alloplasmic CMS lines developed using two sets of combinations: first, two species of the same genus i.e. cytoplasm of Brassica tournefortii (Tour cytoplasm) and nuclear genome of cauliflower (Brassica oleracea var. botrytis L.) and second, two different genera either ā€˜Raphanus sativus L.ā€™ (Ogura cytoplasm) or E. canariense (Can cytoplasm) and nuclear genome of cauliflower. The Ogura cytoplasm is the only commercially exploited male sterility in Brassica oleracea L. for hybrid breeding.

Cauliflower (Brassica oleracea var. botrytis L.; 2nā€‰=ā€‰2xā€‰=ā€‰18) is one of the most important vegetable crops worldwide. It is being grown in an area of 1.36 million ha with production of 25.53 million tonnes15. With a contribution of around 33% in area and 34.5% in global production, India ranks second behind China. In India, area and production of cauliflower are 0.473 million ha and 9.22 million tonnes, respectively16. Cauliflower is being grown for white curds which have unique texture, flavour and taste. Fresh curd portion has 1.9% protein, 5% carbohydrate, 95% water, 48.2Ā mg vitamin C, and 199Ā mg potassium17. It contains health-beneficial glucosinolates particularly sinigrin, glucobrassicin and 4 hydroxy-glucobrassicin18.

Cauliflower originated in the Island of Cyprus and travelled to Europe via trade routes. The European type varieties of cauliflower were first introduced to the Indian sub-continent in 1822 at Company Bagh, Saharanpur in United Province (currently Uttar Pradesh, India) by a British botanist named Dr. Jemson19. The newly introduced cauliflower could only be grown in winter due to Indiaā€™s higher temperature during summer and rainy seasons20. However, with staggered planting, human and natural selection pressure and crossing-recombination occurrences, the introduced cauliflowers evolved into ā€˜local typesā€™ that were better adapted to high temperature and humidity. It formed curd at higher temperature (20ā€“27Ā Ā°C) than the source European type varieties (10ā€“16Ā Ā°C or less). The ā€˜localā€™ or ā€˜Indianā€™ type cauliflower produced satisfactory seeds with a lesser requirement of cool temperature which was prevalent during winters in North Indian plains19,21,22. Based on temperature requirement for curd formation, the Indian cauliflower is grouped into Early (20ā€“27Ā Ā°C), Mid-early (16ā€“20Ā Ā°C) and Mid-late (12ā€“16Ā Ā°C) maturity groups23. Mid-September to mid-November, mid-November to mid-December and mid-December to mid-January, respectively are the contiguous seasons for these temperatures. Late or snowball cauliflower (10ā€“16Ā Ā°C) is also common for harvesting during Februaryā€“March in North Indian plains; nevertheless, due to temperature considerations, its seed production is feasible in the middle to higher altitude mountainous regions24,25.

The F1 hybrids have been crucial in increasing yield of cauliflower while also improving crop uniformity and product quality, both of which attracted farmers and retailers. The F1 hybrids are advantageous over the open-pollinated varieties (OPV) for uniformity, productivity, quality and tolerance to various biotic and abiotic stresses26. Therefore, the primary breeding goal for cauliflower is to develop F1 hybrids using genetic mechanisms for economic hybrid seed production. The earlier efforts were confined to self-incompatibility (SI) for hybrid development due to lack of alternative usable genetic mechanisms. In India, the SI system has been utilized for breeding three hybrids namely Pusa Hybrid-2 (1998), Pusa Kartik Sankar (2004) and Pusa Cauliflower Hybrid-101 (2021). However, the SI system has issues of poor stability at higher temperature, sib seeds and cumbersome maintenance26,27. The breeders tried for alternative genetic mechanism(s) with robust phenotype and easy maintenance. Genic male sterility (GMS) was identified in Brassica oleracea crops such as cauliflower (ms-4, ms-5) and other Cole vegetables namely ms-128,29, ms-2, ms-4, ms-5 and ms-630, DGMs79-399-331 in broccoli and Ms-cd1 in cabbage32, but unfit for commercial use in hybrid breeding25. The identification of Ogura sterile cytoplasm in wild Japanese radish3, its deployment in Brassica oleracea L.33 and refinements for chlorosis defects34 has opened a new era of hybrid breeding. The cytoplasmic male sterility (CMS) owing to its robustness and easy maintenance was deployed in different nuclear backgrounds of cauliflower including genotypes from all the maturity groups22. The Ogura system imparts complete male sterility which facilitated its use in commercial hybrid seed production. E. canariense (Can system) and B. tournefortii (Tour system) are two other sterile cytoplasms deployed in cauliflower4,35. Cauliflower breeders at ICAR-IARI, New Delhi transferred these three CMS systems in different genetic backgrounds in cauliflower36. Singh et al.22 studied the Ogura, Can and Tour CMS cytoplasm deployed CMS lines in Indian cauliflower for floral and yield traits and indicated for better prospect of Ogura- and Can- based CMS systems in hybrid breeding. They reported that Tour cytoplasm-based CMS lines of cauliflower have ā€˜gripping phenomenaā€™ wherein stigma gets trapped in a sepal which hinders insect pollination. Ogura cytoplasm has been well investigated for its interaction with different genetic backgrounds22,25,37,38 and for their heterotic potential7,39,40. It impact floral traits in snowball cauliflower41 and Indian cauliflower22. The Can cytoplasm is the most recently deployed sterile cytoplasm in different genotypes of the early group of Indian cauliflower at ICAR-IARI, New Delhi. These Can-based CMS lines need to be investigated for the impact of Can cytoplasm on agro-morphological and floral traits in nuclear backgrounds of cauliflower genotypes and also their potential in F1 hybrid seed production.

Materials and methods

Plant material and morphological observations

Seventeen CMS lines (11 Can CMS; 3 Ogura CMS; 3 Tour CMS) and their 11 maintainers were developed and being maintained by Division of Vegetable Science, IARI, New Delhi (Table 1). The original introgression Can CMS line ā€˜NIPB-1ā€™ was developed and maintained by ICAR-NIPB, New Delhi and maintainer ā€˜PMā€™ by ICAR-IARI, New Delhi. Three maintainers namely DC 98-4, DC-41-5 and DC-67 were kept common for all three sterile cytoplasms (i.e. Can, Ogura and Tour) carrying CMS lines. The experiment was carried out at Vegetable Research Farm, Division of Vegetable Science, ICAR-IARI, New Delhi (28.08 0N and 77.12 0E, above mean sea level being 228.61Ā m). The seedlings were raised on nursery beds of 3Ā mā€‰Ć—ā€‰0.5Ā mā€‰Ć—ā€‰0.20Ā m size. Sowing was done on 20.07.2021 and transplanted on 25.08.2021 at 45ā€‰Ć—ā€‰45Ā cm spacing in complete randomized block design (RBD) with 3 replications. Standard crop management practices and plant protection measures were adopted for raising the trial crop23.

Table 1 List of Can, Ogura and Tour CMS lines and their maintainers used for study on agro-morphological and molecular characterization of Indian cauliflower.

The CMS lines and their maintainers were observed for 12 agro-morphological and curd yield traits, 18 floral traits and four seed yield traits. Plant height (PH), plant spread (PS), leaf length (LL), leaf width (LW), number of leaves per plants (NLP), gross plant weight (GPW), marketable curd weight (MCW), net curd weight (NCW), root length (RL) and root width (RW) were recorded from five random plants in each plot at curd maturity or harvesting stage. Marketable curd yield (MCY, t/ha) was calculated from plot yield. The harvest index (HI) was obtained by using the flowing formula: HIā€‰=ā€‰MCW (g)/GPW (g).

The 18 floral traits were recorded each of 10 random plants at the flowering stage in CMS lines and their maintainers grown in replicated trials with three replications. The observations were taken using 25 to 30 fully developed floral buds (for floral bud traits) or naturally opened flowers (for flower traits) from each plant. Flower length (FL), flower diameter (FD), stalk length (StaL), petal length (PL), petal width (PW), sepal length (SL), sepal width (SW), bud length (BL) and bud width (BW) were recorded using standard rural scale. Stigma length (StiL), long stamen length (LSL), short stamen length (SSL), long filament length (LFL), short filament length (SFL), anther length (AL), nectary length (NL) and nectary width (NW) were taken using Discovery.18 SteREO Zoom Microscope (Carl Zeiss Microimaging Gmbh, Germany) in Division of Floriculture and Landscaping, IARI, New Delhi. Fully opened flowers from already covered inflorescence branches were taken and nectar was collected in calibrated capillary tubes (Sigma-Aldrich, Missouri, USA). The nectar volume (NV, Ī¼l/10 flowers) was calculated by dividing the length of the nectar column in the capillary tubes by the total length of the capillary tube42.

For seed parameters, all the 17 CMS lines were grown alongwith fertile pollen-parent in 1:1 ratio in three replications at two isolation blocks namely Mela Gate Farm (MG) and Vegetable Research Farm (VRF) located ā‰ˆ1000Ā m apart in ICAR-IARI, New Delhi. Pollen parent DC-67 and DC-98-4 were planted at MG and VRF site, respectively. The seed parameters namely siliqua length (SqL), number of seeds (NS) per siliqua, seed yield (SY) per plant and 1000-seed weight or test weight (TW) were recorded from five randomly tagged plants.

DNA extraction and PCR analysis

The DNA of CMS lines and their maintainers was extracted from fresh mature leaf samples from healthy young plants by using CTAB protocol43 and stored at āˆ’80Ā Ā°C for further molecular work. The sampling for DNA extraction was done from five plants of each genotype (CMS and maintainers). The DNA quality was checked using agarose gel and DNA concentration was measured using Nanodrop (Eppendorf). The DNA was diluted with 0.1 X TE buffer (10Ā mM Tris, 0.1Ā mM EDTA, pH 8.0) to yield a working concentration of 30Ā ng/Ī¼l. An equal volume of this working DNA sample from each of the five plants was mixed and made a representative sample for the respective genotype.

A set of 25 mt-DNA primers available in the public domain were synthesised by Integrated DNA Technologies, (IDT, Coraville, Iowa, USA) and screened for amplification using polymerase chain reaction (PCR) (Biometra Tone 96G, Analytik Jena, Germany). The details of the primers and their sequence information are given in Table S1. These were synthesized from Integrated DNA Technologies, Inc (IDT, USA). All the primers were amplified using 10Ā Ī¼l PCR mixture volume with 2X Master mix (GeneDireXĀ® OnePCRā„¢, Forward and Reverse Primer 1.0 Ī¼L (0.5 Ī¼L each, 25Ā Ī¼M), Template DNA (1.0 Ī¼L, 30Ā ng/Ī¼L) and Sterile double distilled water (3.0 Ī¼L, 75Ā ng/Ī¼L) using thermocycler. The PCR steps were: Initial denaturation at 94Ā Ā°C for 5Ā min followed by 35 cycles of denaturation at 94Ā Ā°C for 1Ā min, annealing at 55ā€“65Ā Ā°C for 1Ā min (varied with primer), extension at 72Ā Ā°C for 1Ā min; final extension at 72Ā Ā°C for 10Ā min and hold at 4Ā Ā°C for 30Ā min followed by storing at 4Ā Ā°C for further processing.

Electrophoresis and fragment detection

The amplified PCR products were resolved on 3% agarose gel in 100Ā mL 1X TAE buffer using 1Ā Ī¼g/mL of ethidium bromide florescent dye in gel electrophoresis (BioRad, USA). DNA bands were visualized and photographed under UV light in gel documentation unit (Alpha imager Cell Biosciences, Santa Clara, CA). The molecular markers showing variation in banding pattern among the genotypes were considered as polymorphic while rest were counted as monomorphic. Amplicon size was measured using the DNA ladder (GeneDireX Inc., 50Ā bp) as a reference.

Nucleotide sequencing

The PCR products were obtained from selected eight genotypes representing three different sterile cytoplasms namely Can, Ogura and Tour. These included five CMS lines, namely Can (DC-23), Can (DC-98-4), Can (DC-41-5), Ogu(DC-41-5) and Tour (DC-41-5) and three maintainers, namely DC-23, DC-98-4 and DC-41-5. The PCR products from five selected primers (i.e., ITS5a-ITS4, atpF-atpH, trnL, P16 and rbeL) were custom sequenced at Agri-Genome Pvt. Ltd., Hyderabad. The forward and reverse sequences, thus, obtained were assembled to obtain contigs using Bioedit sequence alignment editor version 7.0.5.3. The contigs were aligned using ClustalW multiple alignment option of BioEdit software and output was obtained under graphic view and exported as rich text with 50 residues per row and in blocks of ten residues.

Statistical analysis

The data from agro-morphological, floral and seed yielding traits were recorded and subjected "doebioresearch" package of R software for ANOVA (analysis of variance). Standard deviation and per cent change due to CMS introgression were calculated over the fertile (maintainer) line for agro-moprhological and floral traits using Microsoft Excel 2019. K-means clustering was implemented using ā€œclusterā€ package of R software. The optimal number of clusters was determined by average silhouette width. Average silhouette width was computed employing ā€œfactoextraā€ package of R software. The "Hmisc" R package was utilized to perform correlation analysis on both agro-morphological and floral traits. Non-significant correlations, determined at a significance level of 5%, have been marked with a strikeout. Principal Component Analysis (PCA) has been employed using the ā€œfactominerā€ package of R Software. Factorial RBD design has been employed using the ā€œdoebioresearchā€ package of R software.

Statement regarding plant guidelines

The use of plant parts in the study complies with international, national, and/or institutional guidelines.

Results

Analysis of variance (ANOVA) for traits

Table 2 provides the results of the analysis of variance for agro-morphological features observed in early cauliflower CMS lines and their maintainers. There were significant differences among the groups (CMS lines and maintainers) for the observed agro-morphological traits except PH. No significant difference observed between the groups.

Table 2 Analysis of variance (ANOVA) for agro-morphological and curding traits observed from CMS lines carrying Can, Ogura and Tour cytoplasmā€™s and their maintainers in Indian cauliflower.

Variation in agro-morphological and curding traits

The CMS lines and their maintainers showed significant differences for all the observed agro-morphological traits (Table 3). Box plot of these traits from the CMS lines (Fig.Ā 1a) and their maintainers (Fig.Ā 1b) depicts minimum, first quartile (Q1), median, third quartile (Q3) and maximum values of the traits. The box plot analysis showed wide variation in both CMS and maintainer lines for PH, PS, MCW, NCW and GPW. PH in CMS lines was ranged from 53.5Ā cm in Can (DC-94-2) to 63.6Ā cm in Ogu (DC-98-4) and in maintainers from 58.1Ā cm (DC-41-5) to 64.8Ā cm (DC-63). PS ranged from 55.7Ā cm in Can (DC-41-5) to 89.8Ā cm in Tour (DC-98-4) in CMS lines. LL, LW and NLP also showed significant variation in CMS lines and maintainers. LW ranged from 40.0Ā cm in Can (DC-41-5) to 54.8Ā cm in Can (DC-63) and LL from 19.1Ā cm in Can (DC-67) to 23.9Ā cm in Ogu (DC-98-4), respectively. NLP were maximum in Tour (DC-41-5) (20.3) in CMS lines and DC-63 (22.4) in testers. GPW ranged from 1215.0Ā g in Can (DC-98-4) to 1766.3Ā g in Can (DC-209). Among maintainers, the GPW was highest in DC-41-5 (1677.9Ā g). Can (DC-903) (690.9Ā g; 567.0Ā g) in CMS lines and DC-63 (673.8Ā g; 551.7Ā g) in testers had maximum MCW and NCW, respectively. MCY was highest in Can (DC-903), ranging from 17.7 to 24.2 t/ha. It was the greatest among the testers in DC-63 (23.6 t/ha). HI ranged from 0.34 to 0.53 with the highest in Can (DC-41-5) in CMS lines. Among maintainers, it was highest in DC-8 (0.42). RL (18.6Ā cm) and RW (19.1Ā cm) were the greatest in CMS Can (DC-8) while in maintainers these were maximum in DC-94-2 (20.2Ā cm) and DC 41-5 (19.4Ā cm), respectively.

Table 3 Agro-morphological and curding traits observed from CMS lines carrying Can, Ogura and Tour cytoplasmā€™s and their maintainers in Indian cauliflower.
Figure 1
figure 1

Box plot of 12 agro-morphological traits of Canarianse based CMS lines (a) and their maintainers (b) in early group of Indian cauliflower. Box plot of 18 floral traits of E. canarianse based CMS lines (c) and their maintainers (d) in early group of Indian cauliflower.

Variation in CMS lines and fertile maintainers for floral traits

The ANOVA for 18 floral traits in CMS lines and their maintainers is Table 4. Table 5 displays average values of 18 floral traits observed in CMS lines and their maintainers. The box plot of floral traits showed wide range (minimum, maximum), median, first quartile (Q1) and third quartile (Q3) in CMS lines (Fig.Ā 1c) and their maintainers (Fig.Ā 1d). Box plot analysis also revealed large variation in floral traits such as FL, FW, PL, PW, BW, StiL and NL in CMS lines. The cytoplasmic sterility reduced overall flower size and size of reproductive organs (stigma and stamens) in CMS lines (Fig.Ā 2a). The flower parts (sepals, petals, stigma and anthers) of CMS lines and their maintainers of early cauliflower are presented in Fig.Ā 2b. All the observed floral traits showed a significant difference in the CMS lines and maintainers. The tiniest and narrowest flowers were observed in all the three Tour-based CMS lines. FD ranged from 1.23Ā cm in Tour (DC-67) to 1.76Ā cm in Can (DC-94-2) in CMS lines and DC-41-5 (2.13Ā cm) in maintainers. Can (DC-41-5) had the shortest StaL (1.61Ā cm) in CMS lines while DC-7 had the longest StaL (1.93Ā cm) in maintainers. PL and PW were ranged from 1.12Ā cm [Can (DC-18)] to 1.59Ā cm [Can (DC-7)] and 0.62Ā cm [Can (DC-209)] to 0.79Ā cm [Can (DC-67)], respectively. DC-63 (1.62Ā cm) and DC-98-4 (1.93Ā cm) had the highest values of these two petal parameters. SL and SW were ranged from 0.69 to 0.78Ā cm and 0.31 to 0.40Ā cm in CMS lines. DC-67 had maximum SL (0.91Ā cm) while DC-63 had the widest sepals (0.47Ā cm). BW ranged from 0.39 to 0.56Ā cm and BL from 0.53Ā cm to 0.70Ā cm.

Table 4 Analysis of variance (ANOVA) for floral traits observed from CMS lines carrying Can, Ogura and Tour cytoplasmā€™s and their maintainers in Indian cauliflower.
Table 5 Floral traits observed from CMS lines carrying Can, Ogura and Tour cytoplasms and their maintainers in Indian cauliflower.
Figure 2
figure 2

(a) Flowers of fertile maintainers (top side) and CMS lines (bottom side) of cauliflower. (b) Flower floral parts (sepals, petals, stigma and anthers) of CMS lines and maintainers in early group of Indian cauliflower.

StiL in CMS lines was ranged from 0.65 to 1.03Ā cm which was significantly less than the maintainers (0.69ā€“1.05Ā cm). Observations on length of stamens (i.e. LSL and SSL) and anther filament (LFL and SFL) showed significant differences in CMS lines and their maintainers. NL was maximum in Ogu(DC-67) (1.77Ā mm) and NW in Ogu (DC-41-5) (1.83Ā mm) while in maintainers, the observed maximum values were 2.06Ā mm (DC-63) and 1.95Ā mm (DC-8), respectively. NV also varied significantly in CMS lines (0.34ā€“0.46 ĀµL/10 flowers) and maintainers (0.48ā€“0.56 ĀµL/10 flowers).

Can cytoplasm-associated changes in agro-morphological and floral traits

In comparison to their maintainers, the introgression of nuclear genome of Indian cauliflower genotypes in sterile cytoplasms had significant impact on all 12 of the observed morphological traits (Fig.Ā 3). PL changed in range of āˆ’16.1% in Can (DC-94-2) to 9.1% in Can (DC-98-4) and PS reduced the greatest in Can (DC-8) (āˆ’20.8%). LL and LW also influenced by sterile cytoplasms in the ranges of -26.2% to 6.1% and -17.7% to 11.2%, respectively. Except for Can (DC-23), all Can based CMS lines showed reduction in NLP. Impact of Can cytoplasm was noticed on GPW (-21.7 to 16.6%), MCW (-19.6 to 12.0%) and NCW (up to 13.4%). The changes in RL and RW ranged from -23.3% to 15.0% and -22.1% to 19.1%, respectively. The sterile cytoplasms also affected MCY, which was decreased in eight CMS lines while slightly increased in 3 CMS lines by 9.6 to 14.0%. Can (DC-98-4) experienced the greatest decline (19.5%). HI was also influenced in both the directions.

Figure 3
figure 3

Per cent change in agro-morphological traits due to ingression of Can sterile cytoplasm in genotypes of early group of Indian cauliflower.

Contrary to agro-morphological traits, the sterile cytoplasms had highly significant and negative impact on all the 18 observed floral traits (Fig.Ā 4). Eight floral parameters namely SL and SW, StaL, LSL, SSL, AL, NL, NW and NV showed negative impact only. The reduction in FL ranged from -36.5% in Can (DC-67) to 11.4% in Can (DC-7). Similar trend was observed for FD and StaL. Notable impact was observed on petals (PL: āˆ’30.2 to 14.3%; PW: āˆ’36.0 to 13.4%), sepals (SL: āˆ’1.2 to āˆ’27.4%; SW: āˆ’31.1 to 2.7%) and floral buds (BL: āˆ’29.4 to 25%; BW: āˆ’41.5 to 21.7%). Significant reduction was observed in StiL (āˆ’1.9 to āˆ’35.8%), LSL (āˆ’24.7 to -33.70%) and SSL (āˆ’4.8 to āˆ’23.3%). The CMS introgression reduced NL, NW and NV in range of āˆ’18.9 to āˆ’8.2%, āˆ’16.0 to āˆ’8.2% and āˆ’32 to āˆ’6.2%, respectively.

Figure 4
figure 4

Per cent change in floral traits due to ingression of Can sterile cytoplasm in genotypes of early group of Indian cauliflower.

Impact of sterile cytoplasms on agro-morphological traits

Nine CMS lines developed through introgression of Tour, Ogura and Can cytoplasms in nuclear backgrounds of DC-98-4, DC-67 and DC-41-5 were observed for SqL, NS/siliqua, SY (g/plant) and TW (g). The sterile cytoplasms influenced seed traits and the greatest reduction was observed in Tour-based CMS lines (Fig.Ā 5aā€“d). While, the TW was found to be in the same range, SqL and NS/siliqua showed significant different. SY was significantly affected due to CMS introgression particularly due to Tour sterile cytoplasm.

Figure 5
figure 5

(aā€“d) Seed parameters from selected CMS lines and their maintainers in genotypes of early group of Indian cauliflower.

Principal component, bi-plot and correlation analyses

Principal component analysis (PCA) for agro-morphological traits (Fig.Ā 6a, b) and floral traits (Fig.Ā 6c, d) explains prominence of PCA1 (65.3%) and PCA2 (22.5%) combining of 87.8% in explaining variances in CMS lines and PCA1 (36.5%), PCA2 (28.3%) and PCA3 (17.3%) in maintainers (Fig. S1a, b). For floral traits, PCA1 (38.3%) and PCA2 (23.8%) together explained maximum part of variance in CMS lines while only PCA1 could explain 59.7% variance in maintainers (Fig. S1c, d).

Figure 6
figure 6

Principal component analysis (PCA) of agro-morphological traits (a, b) and floral traits (c, d) in CMS lines and maintainers in early group of Indian cauliflower using "FactoMiner" R Package.

Contribution of variables of agro-morphological traits in variation in CMS lines and maintainers is presented in (Fig. S2a, b). and their floral traits in (Fig. S2c, d). RL, GPW, HI and PS are important determinant show similarity in CMS lines while GPW, HI, NCW, MCY and MCW in maintainers .SL, FL, StaL, SW, NL, LFL, StiL, PL, FW and BL in CMS lines and SSL, LSL, LFL, SFL, FW, BW, NV, NL and StaL were contributors in variance in maintainers.

Bi-plot analysis for agro-morphological traits (Fig.Ā 6a, b) and floral traits (Fig.Ā 6c, d) in CMS lines and maintainers in early group of Indian cauliflower using "FactoMiner" R Package. This plot interpreted the similarity in most of the agro-morphological traits. HI, RL and GPW showed similarity in CMS lines whereas all the traits showed similarity in maintainers except GPW. For floral traits, LSL and SSL showed similarity in CMS lines whereas all the traits showed similarity in maintainers except SL. Cluster analysis of CMS lines and their maintainers for agro-morphological trait (Fig.Ā 7a, b) and floral traits (Fig.Ā 7c, d) could reveal three and two clusters, respectively.

Figure 7
figure 7

(aā€“d) Cluster analysis of CMS lines and their maintainers for agro-morphological trait (a, b) and floral traits (c, d) in early group of Indian cauliflower.

Correlation analysis between agro-morphological traits in CMS lines and maintainers is presented in Fig.Ā 8a, b and their floral traits in Fig.Ā 8c, d. A significant positive correlation was observed among the traits such as NCW with MCW and MCY. RL positively correlated with HI and NL positively correlated with RW while GPW negatively correlated with PS and PS negatively correlated with LL in CMS lines. In maintainers MCW positively correlated with MCY and NCW positively correlated with MCW while PS negatively correlated with HI. In CMS lines, RL was strongly correlated with NLP, RW and HI. In floral traits, LFL has strong correlation with FW and NW. StiL significantly correlated with StaL and SL in CMS lines. In maintainers, LFL was strongly correlated with NV, SSL, SFL and FW (Fig.Ā 8c, d). SSL significantly correlated with NV and SFL and BW with NL. These correlation studies are significant at 5% level of significance.

Figure 8
figure 8

(aā€“d) Correlation analysis was done for agro-morphological (a, b) as well as floral traits (c, d) in genotypes of early group of Indian cauliflower using the ā€œHmiscā€ R package.

Variation in CMS lines for F1 hybrid seed traits

Table 6 lists the performance of 17 CMS lines of Indian cauliflower for F1 hybrid seeds production in two isolations using DC-67 and DC-98-4 fertile inbred lines as male parents. The SqL was ranged from 4.7Ā cm [Tour (DC-41-5)] to 5.9Ā cm [Tour (DC-67)]. Can (DC-23) had maximum NS (14.5/siliqua) while Tour-based CMS lines namely Tour (DC-41-5) and Tour (DC-67) had minimum NS (4.8/siliqua). Among CMS lines, SY was maximum in Ogu (DC-98-4) (28.0Ā g/plants) followed by Can (98-4) (24.2Ā g/plant) and minimum in Tour (DC-67) (1.5Ā g/plant). TW was ranged from 1.9Ā g [Can (DC-18)] to 3.6Ā g [Tour (DC-41-5)]. Satisfactory flowering and seed settings were observed in all the Can and Ogura cytoplasm-based CMS lines.

Table 6 Observations on F1 hybrid seed produced from Can, Ogura and Tour cytoplasm carrying CMS lines in Indian cauliflower.

Mitochondrial specific DNA markers for CMS lines

The observations for amplification and level of polymorphism in CMS lines with 25 mtDNA markers are presented in Fig. S3. Two markers, namely P14, P15 and P16 were polymorphic in CMS and fertile maintainers. The marker P14 amplified fragments of 200Ā bp in CMS lines (lane 1ā€“6, 8,9) and that of 250Ā bp in fertile maintainers (lane 15ā€“20), whereas P15 amplified distinct bands of 200Ā bp in CMS lines (lane 3ā€“8) and 250Ā bp in fertile maintainers (lane 13ā€“18), respectively. P14 showed polymorphism between CMS lines carrying Ogura (lane 21, 22, 24) and canarianse (lane 1ā€“6, 8, 9) CMS systems and their fertile counterparts. It also generated a unique band (175Ā bp) for Tour cytoplasm (lane 23), separating it from Ogura and canarianse cytoplasms. P15 could distinguish canarianse (lane 3ā€“8) from Ogura (lane 21, 22, 24)) and Tour (lane 23) cytoplasms, however, it could not differentiate between the latter two systems. Notably, intra-CMS system variation was also observed among the Can cytoplasm carrying CMS lines such as Can (DC-209) (lane 7) and Can (DC-63) (lane 10) which amplified an amplicon of 250Ā bp while rest of the eight Can based CMS lines generated 200Ā bp with P14 primers. Similar observations were made from P15 which could generate three different bands in Can based CMS lines. P16 revealed a polymorphic band (450Ā bp) for Ogura (DC-45) (lane 24) while its maintainer DC-41-5 (lane 18) amplified 500Ā bp amplicon.

Sequence analysis

The results of alignment of amplicons produced by the eight genotypes employed in the study using ITS 5a-ITS 4 primer are shown in Fig. S3a. The aligned sequences were filtered from both the ends to remove the gaps and the central 614 residues were finally analyzed. In the aligned sequences, the sequences bearing similarity are shaded with the same colour e.g. A is shaded green, T red, C blue and G black. The unshaded residues are indicating that the residue in question is variable in the compared genotypes. There was not much variation in the analysed sequence up to 460 bases. In this portion, the sequences were observed to be same in all the genotypes except DC-23 and DC-98-4. It was observed that there was C in place of A at 95th position, G in place of T at 163rd position, C in place of T at 402nd, C in place of T at 416th position in DC-23 and DC-98-4. At the same time genotype Can (DC-41-5) showed presence of G at this position (416th). At position 432nd there was C in Can (DC-41-5) while G in rest of the genotypes. Similar analysis of the sequences revealed that only DC-23, DC-98-4 and Can (DC-41-5) showed different pattern as compared to the rest of the genotypes. The results of alignment of amplicons obtained using atpF-atpH primer from the eight genotypes (5 CMS lines and 3 fertile maintainers), two maintainers were common for four CMS lines (Fig. S3b). The central 429 residues were finally analysed. In the aligned sequences, the sequences bearing similarity are shaded with the same colour as in case of ITS 5a-ITS4. The lines showed variation at different positions for nucleotides. Can (DC-23) had deletions of A at two positions i.e. at 166th position A is deleted and T is deleted at 299th position. Five conversions were also observed at 178th, 258th, 318th, 370th and 429th positions. The sequence analysis revealed variation among the sterile cytoplasms and also in fertile inbreds for the observed sequences. The deletion was also observed in Can (DC-98-4) at 299th position but not in Can (DC-41-5) indicating change in sequences. At 323th position, the sequence alignment of atpF-atpH showed difference in the CMS lines namely Can (DC-41-5), Ogu (DC-41-5) and Tour (DC-41-5) and their maintainers DC-41-5. These CMS lines had T, while maintainer DC-41-5 had A. The sequence alignment of amplicons obtained using P16 primer from the eight genotypes (Fig. S3c) revealed no variation between Can (DC-23) and DC-23, while Can (DC-98-4) and Can (DC-41-5) showed variation due to addition and deletion of nucleotides at different positions. In case of rbeL primer (Fig. S3d). The central 663 residues were finally analyzed. In the aligned sequences no variation was detected between Can (DC-23) and DC-23 while Can (DC-98-4) and Can (DC-41-5) showed variation due to the addition and deletion of nucleotides at different positions. The sequence alignment of amplicons obtained using trnL primer from eight genotypes (Fig. S3e) could show addition and deletion of nucleotides in Can (DC41-5). It has additions at 14th, 15th, 60th, 74th, and 80th positions. Can (DC-41-5) had one deletion of nucleotides at 22nd, 32nd, 42nd, 48th, 50th and 57th positions.

Discussion

In terms of agro-morphological features, the CMS lines carrying Can and Tour sterile cytoplasms are on par with the CMS lines carrying Ogura sterile cytoplasm. However, there was a significant difference in floral traits between the cauliflower CMS lines carrying these three sterile cytoplasms. Tour cytoplasm-based CMS lines namely Tour (DC-41-5) and Tour (DC-67) showed a considerable reduction in floral traits.

Variations in agro-morphological, floral and seed traits

Cauliflower is a crop that was introduced to the Indian subcontinent, and the original genotypes had a temperate flowering habit meaning that they needed a protracted period of cold temperature for proper bolting and flowering for seed production. Newly evolved cauliflower had varied morphologies in terms of stalk length, leaf orientation, leaf size and colour, curd size and colour and plant frame19. ā€˜Indian cauliflowerā€™ or ā€˜tropical cauliflowerā€™ was the name given to the evolved ecotype particularly to the early and mid-early groups. The cytoplasmic male sterility is widely used genetic mechanism for hybrid breeding in Brassica crops. It arises as a result of a genomic conflict between the mitochondrial and the nuclear genomes. The Ogura sterile cytoplasm was first deployed in European cauliflowers35 and later on introgressed in Indian cauliflowers26,38. Still it is a robust and stable genetic system that ensures complete cross-pollination across the temperature range. The modification or alteration in the mitochondrial genome that causes male sterility may substantially impair CMS system performance. This has been documented in the ā€˜Polimaā€™ cytoplasm where partial restoration of fertility has occurred as a result of higher temperature44. As hybrid breeding in cauliflower currently solely depends on the Ogura CMS system, this will be a major calamity, if a Polima-like situation occurs. Thus, two additional sterile cytoplasms namely E. canariense (Can) and B. tournefortii Goun (Tour)4,35. This CMS introgression process might had an impact on other non-target but important traits such as agro-moprphological and floral traits. Because, normal size of flowers, properly developed petals, normal stigma and style and well-developed functional nectaries are required for drawing honey bees for pollination activities. The previous researchers have showed the negative effects of Ogura sterile cytoplasm on floral traits including flower size, petal length and width, stamens, stigma and nectary size as well as nectar volume37,45,46,47. These defects/deformities included reduced flower size, petaloid anthers, carpeloid stamens, malformed nectary, twisted stigma and split anthers in Brassica crops. A reduction in floral traits in CMS lines of Indian cauliflower carrying Ogura, Can and Tour sterile cytoplasm was also reported22,38. Size of functional nectaries is correlated with nectar volume38,48. And, the pollinator insects in Brassica crops are attracted to flowers based on the amount of nectar they produce 49,50,51.

In current investigation, the pattern of the per cent change in plant morphology, curd yield and floral traits revealed varied results. The parameters of PH, PS and leaf attributes showed a variable pattern. Maximum reduction in PH was recorded in DC 98-4 (āˆ’16.1%), while overall gain was seen in Ogu (DC-98-1) and Can (DC-7) in Ogura and Can cytoplasm-based CMS lines, respectively. Three CMS lines namely Can (DC-903) (12.0%), Can (DC-209) (7.5%) and Can (DC-18) (3.5%), which were considerably greater than other two CMS systems, demonstrated good gains in MCW. Dey et al.37 and Singh et al.38 both made similar observations. The MCY from Can-based CMS lines (21.8 t/ha) was nearly equal to that of the CMS lines carrying Ogura (21.8 t/ha) and Tour (21.4 t/ha) cytoplasm. This might be explained by nuclearā€“cytoplasmic interactions and cooperative functions between particular nuclear backgrounds of cauliflower and alien cytoplasms (i.e. sterile cytoplasm from other species). These alloplasmic lines, which carry cytoplasms namely Can, refined Ogura and Tour from three different sourcesā€”E. canariense (complete cytoplasm), Raphanus sativus L. (only mitochondria) and B. tournifortti (complete cytoplasm), respectively. Refined Ogura based CMS lines carry chloroplast of B. oleracea, a unique functional combination in plant system.

The conversion procedure from fertile inbred lines to CMS lines had impact on floral features. Both FL, FD, PL and PW were decreased significantly. The findings are consistent with earlier studies in broccoli46, snowball cauliflower37,48 and Indian cauliflower22. The conversion process significantly decreased the size of the nectaries (NL, NW). The findings align with earlier reports22,48. NV also affected due to CMS trait. This might be the reason for a negative effect on seed yield in honey bee pollinated crops. The reduction in floral characteristics agreed with the findings of Dey et al.41 In comparison to their fertile lines, they reported a considerable decline in floral characteristics in refined Ogura based CMS lines of snowball cauliflower. However, all the Can and refined Ogura carrying CMS lines had decent size of flowers for honey bee visits (based on field observations of the senior author). Among the Tour-based CMS lines, only Tour (DC98-4) had a proper flower while the remaining Tour (DC-67) and Tour (DC-41-5) had most malformed flowers with ā€˜gripping phenomenonā€™ as also noted by Singh et al.22 The CMS phenotype is associated with abnormal recombination of mitochondria genome and CMS genes (i.e. orf138 in Ogura CMS) encodes transmembrane proteins (ORF138) which bind to the mitochondrial membrane, affecting the hydrogen ion concentration gradient, thus affecting ATP synthesis. This leads to energy deficiency and excess accumulation of reactive oxygen species (ROS). The ROS play key role in cell signaling and their excess accumulation causes damage to lipids, proteins and DNA, inhibit enzyme activity, activate the programmed cell death (PCD) pathway and ultimately affect plant growth and development, particularly reproductive parts52.

The high seed yield in natural crossing conditions is essential for economic use of CMS system in hybrid breeding. The CMS trait impact most of floral traits which reflects in reduced seed yield also, however, in comparison, certain CMS systems have better performance than others53. The seed yield per plant in the refined Ogura-based CMS lines of snowball cauliflower namely Ogu1A, Ogu2A, and Ogu3A was reported to be 30.9Ā g, 21.4Ā g and 26.8Ā g per plant, respectively54. They reported that the seed yield per plant for these CMS lines was 24.3%, 29.3% and 10.1% less than its maintainer, respectively. Kucera et al.55 reported a reduction in seed yield per plant by 79.5% and 85.7% in FT CMS and BR CMS, respectively. It was attributed to fewer siliqua per plant and a smaller number of seeds per siliqua than their maintainers. The observed decrease in seed yield due to CMS trait in present study was consistent with the earlier findings37,55. However, this is the first in-depth account of changes brought on by Can sterile cytoplasm in agro-morphological and floral features of cauliflower. Similar to this, it was found that 11 Can-based CMS lines with the two pollen parents DC-67 and DC-98-4 produced hybrid seeds on average at rates of 19.5Ā g and 12.8Ā g/plant, respectively. Three Ogura based CMS lines with same pollen parents produced hybrid seeds 14.7Ā g and 18.5Ā g/plant, respectively. The Tour-based CMS lines, particularly Tour (DC-41-5) and Tour (DC 67) were worst performers for hybrid seed yield, mainly due to gripping phenomena of stigma38. High yield of hybrid seeds in present study could be due to genotype, open condition and nicking of CMS and pollen parents. Kucera et al.55 obtained only 2.3Ā g/plant hybrid seeds from BR CMS line with pollen parent FT13 grown in isolation cages provided with bumble bees. Thus, the study demonstrates the potential of these novel Can sterile cytoplasm carrying CMS lines in production of hybrid seeds.

The PCA and cluster analysis identified key agro-morphological and floral traits which contribute major share of variance in the CMS lines. The potential for trait comprehension for breeding purposes is indicated by the strong association between root and leaf parameters, curd weight and yield characteristics, and stamen length and nectary traits.

mtDNA level molecular variation in CMS lines

At flowering stage, the CMS phenotype is persistent and simple to distinguish from fertile analogues. It is challenging to distinguish them at earlier stages, though. Further, the deployment of several sterile cytoplasms in cauliflower makes it challenging to distinguish these cytoplasms carrying CMS lines at a morphological level. Breeders will therefore benefit from the creation or discovery of DNA markers to differentiate between these several sterile cytoplasms. Additionally, because these markers are stage-independent, they can be applied to any specious mixture during the seedling stage or before flowering. Mitochondrial markers are thought to be a reliable method for identifying sterile and fertile cells as well as the type of cytoplasm that CMS lines carry56,57.

A dominant marker called ISSR3 was used by Wang and Song58 to distinguish fertile counterparts from the CMS lines. According to them, it is possible to distinguish between fertile and sterile lines using an amplicon of 313Ā bp size using a primer pair P9/P10 for mtDNA marker specific to the CMS cauliflower knxd612. Only two of the 25 mtDNA markers namely P14 and P15 were polymorphic between CMS and male fertile phenotypes. P16 also distinguished Ogura cytoplasm-based CMS line Ogura (DC-41-5) from its maintainer (DC-41-5). Notably, P14 distinguished Tour cytoplasm from Ogura and canarianse CMS systems which was in line with previous reports of Singh et al.22 Additionally, the polymorphism was found among the Can-based CMS lines. This might be the result of an adaptive interaction between the mitochondrial genome of E. canerianse and the nuclear genome of B. oleracea var. botrytis L. (cauliflower), which has changed the genetic makeup of the CMS lines. Such occurrences are uncommon, hence warrant for further studies because the modifications to the mitochondrial DNA do not occur so frequently (i.e. 6 to 7Ā years of introgression in case of Canā€”oleracea cytoplasm interaction) as observed from Oguraā€”Brassica interaction (nearly 25ā€“30Ā years in existence). Also, further research is warranted to understand the variability among the CMS lines carrying same Can cytoplasm. This could be explained by the lower levels of polymorphism of target locus in the four cytoplasms studied (Oleracea, Eru, Can and Tour). Since these species share a common ancestor and except a few minor variations, their cytoplasmic genomes are still conserved to great extent.

The sequence analysis from five CMS lines including three Can-based [Can (DC-23), Can (DC-98-4), Can (DC-41-5)] and one each with Ogura [Ogu (DC-41-5)] and Tour cytoplasms [Tour (DC-41-5] along with their maintainers revealed insertions and deletions. This could explain the variations among these lines. Sequence analysis of P16 amplicon from Can (DC-98-4) and Can (DC-41-5) CMS lines had distinct bases from their maintainers. Further, sequence analysis of Tour (DC-41-5) CMS line revealed that P16 amplicon at 243rd position and atpF-atpH amplicon at 138th position had T while rest of the lines had C and A, respectively. This indicates that flower deformity (i.e. gripping phenomenonā€™ could be due to this change in ORF region. However, it needs further investigation to establish the association. Primer atpF-atpH amplicon sequence revealed unique bases at manifold position indicating its distinctness from the other three cytoplasms. Earlier reports also suggest variation among the Ogura-cytoplasm carrying CMS lines of cauliflower25 and broccoli31. Deletions of nucleotide in the ORF coding region of broccoli cytolines was suggested to be associated with carpelloid stamen phenotype31, however, such abnormality was not reported in investigated CMS lines.

Conclusion

The present work provides the first in-depth information on the effect of this sterile cytoplasm in new nucleo-cytoplasmic combination (oleracea-canerianse) lines of Indian cauliflower. Erucastrum canarianse Webb and Berthel is a new CMS system deployed in cauliflower. These lines contain the cauliflower nuclear genome, whereas E. canarianse provides the cytoplasm. Although most agro-morphological features and floral attributes changed as anticipated due to the novel cytoplasm-nuclear interaction, the impactā€™s size was not averse to the level which can affect pollination drastically. The introgression of Can CMS system in the tropical Indian cauliflower reduced flower parameters equal to those of the Ogura CMS line, but seed-setting capacity is adequate, emphasizing the potential for hybrid seed production. P14 and P15, two polymorphic co-dominant mtDNA markers that were also discovered for Can CMS system, will be helpful for early detection, tracking of change(s) in mtDNA sequences and spotting variation between CMS lines and their maintainers. Similarly, P16 was found to be useful to distinguish Ogu (DC-41-5) from its maintainer DC-41-5. In the instance of sequence analysis, the Can, Ogu and Tour cytoplasm harbouring CMS lines displayed variation. These novels Can CMS lines of Indian cauliflower creates the possibility for immediate use in hybrid breeding or further introgression in other members of B. oleracea group through backcrossing.