Introduction

Cotton is considered the most important fiber crop in the world, with Egyptian cotton (Gossypium barbadense L.) being renowned as the best fiber crop globally and remaining significant crops in Egypt1,2,3,4. Cotton breeders worldwide aim to enhance early maturity, high yield, and high-quality cotton varieties. However, selecting for high yield in cotton breeding is challenging due to its complexity and susceptibility to environmental and genetic factors. Direct selection based solely on yield is difficult and inefficient5,6,7, because of genotype x environment interactions, low heritability, polygenic nature, and non-additive gene action8. To address these challenges, the selection index technique was proposed to simultaneously improve multiple traits simultaneously, achieving superior results compared to direct selection. This method effectively select elite genotypes what superior yield and fiber traits9,10,11,12,13. In contrast, the pedigree selection method may less effective in identifying superior genotypes14. Timely maturity, known as earliness, cotton cultivation is crucial to prevent damage from frost, insects, diseases, conserve soil moisture and prevent weathering of open cotton. Cotton can also be used in a multi-crop system15,particularly in Egypt, where, it is rotated with winter crops like wheat. Studies have shown that a rotation system of wheat and cotton can be effective16. Early maturity offers various benefits, including better development in favorable weather, fitting into a double-cropping pattern with minimal impact on yield, higher water use efficiency in irrigated areas, and reduced damage from boll worms17. Although various selection methods have been used to enhance cotton traits, pedigree selection is the most popular among plant breeders due to its versatility, speed, and compatibility with genetic studies18. Most Egyptian cotton varieties were produced using pedigree selection. Studies have shown decreased Genotypic Coefficient of Variance (GCV) and Phenotypic Coefficient of Variance (PCV) after two cycles of pedigree selection17. Researchers have reported significant genotypic differences for various cotton traits in different generations and strains, with high to moderate broad-sense heritability estimates for all traits in direct selection for yield and its components. Broadsense heritability estimates for seed cotton yield/plant, boll weight, and number of bolls/plant have been reported as 0.98, 0.96, and 0.96, respectively16,19,20.

Directly selecting cotton plants based on yield alone is a challenging task due to the complex nature of the trait. Yield is greatly influenced by environmental conditions, as well as genotype x environment interactions, which diminishes the effectiveness of using yield as the only selection criterion11,21,22. Moreover, other factors like the polygenic nature of the trait, low heritability, linkage, and non-additive gene action may hinder the effectiveness of selection, especially in the early segregating generations. To overcome these challenges, the selection index method is proposed as a more efficient alternative approach to simultaneously improve multiple traits that are more heritable and correlated4,23,24,25,26. The selection index method has been shown to be more efficient than pedigree selection in isolating superior elite families in most studied traits. , increasing the probability of identifying desirable genotypes. However, the reliability of the parameter estimates used in constructing the selection index is crucial for its efficiency and may be subject to large sampling errors, including biases due to genotype by environment interaction27,28,29,30. To improve the efficiency of selection, it is necessary to consider heritability estimates simultaneously with genetic advances expressed as a percentage of the mean. Understanding heritability helps plant breeders predict the behavior of succeeding generations, make desirable selections, and assess the magnitude of genetic improvement through selection. Several studies have reported genetic variability for yield traits in cotton31,32,33,34. However, the estimates of broad-sense heritability for the characters studied were low to moderate, indicating that heritability alone is not a sufficient indication of the genetic progress that can be achieved through selection. Therefore, other traits and selection criteria should be considered when breeding cotton plants to improve yield and other desirable characteristics16,17,35,36,37,38. The primary objective of this research was to (i) identify cotton genotypes suited to semi-arid climates that produce high yields and fiber quality meeting textile industry standards, and (ii) evaluate predicted and actual improvements in key economic traits through the application of different selection indices during the breeding process.

Materials and methods

Plant materials

The study was conducted at Sakha Agricultural Research Station for three consecutive growing seasons (2017, 2018, and 2019). The materials used for the study were intraspecific cotton (Gossypium barbadense L.) cross (Giza 94 × Suvin) from the F2, F3, and F4 generations. Giza 94 is known for its high yield, high lint percentage, and earliness, while Suvin is known for its high yield and earliness. The F1 generation was self-pollinated, and the resulting seeds were used for the F2 generation. In the 2017 season, the F2 population and their parents were grown with a spacing of 70 cm between rows and plants, and one plant per hill was maintained. The plants were self-pollinated by covering the flowers with craft paper bags, and 375 selected plants were harvested separately. The plants were evaluated based on the first opening flower, boll weight, seed cotton yield, lint percentage, and lint index, with a selection intensity of 5%. The plants with the highest performance for each criterion were maintained, resulting in 65 selected progenies for the F3 generation. In the 2017 season, 65 self-selected plants from the F2 generation were used to raise the F3 generation. The F3 families were planted in plots represented by two rows for each plant, with recommended cultural practices carried out throughout the growing season. The best 30 plants out of the 65 F3 generation were chosen based on a selection intensity of 5% for the five selection criteria. In the 2018 season, the selfed seeds of the 30 F3 families, as well as the two parents and a commercial variety Giza 86, were planted to represent the F4 family. Giza 86 was used as a check variety due to its high yield and good fiber properties. A randomized complete block design with three replications was used; one row represented each plot for each plant, 4.5 m. long and 0.7 m. wide with 70 cm. hill spacing. Hills were thinned to one plant per hill. Normal cultural practices for cotton production were performed at the proper time. The studied characters were the first opening flower, first fruiting node, boll weight (g), seed cotton yield, lint cotton yield, lint percentage, seed index and lint index.

Statistical and genetic analysis

The PCV and GCV were calculated using the method described by Kearsey and Pooni39. The variance and covariance components obtained from the analysis of a regular randomized complete block design were used to estimate the phenotypic and genotypic variances and covariances.

Heritability in a broad sense (h2b) was calculated as follows:

$${\text{h}}^{{2}} {\text{b }}\left( {{\text{in F}}_{{2}} {\text{generation}}} \right) \, = \, \left( {{\text{VF}}_{{2}} - \left( {{\text{VP}}_{{1}} + {\text{VP}}_{{2}} } \right)/{\text{VF}}_{{2}} } \right) \, \times { 1}00$$
$${\text{h}}^{2} {\text{b }}\left( {{\text{in F}}_{2} {\text{ F}}_{{3{ }}} {\text{and F}}_{4} {\text{ generations}}} \right){ } = { } h_{b}^{2} = \frac{{\delta^{2} g}}{{\delta^{2} ph}} \times 100$$

Heritability in the broad sense was obtained as described by Warner40.

where VF2 is the phenotypic variance of the F2 generation, VP1 and VP2 are the variances of the first and second parents, respectively, σ2g is the genotypic variance of the F3 and F4 generations, and σ2ph is the phenotypic variance of the F3 and F4 generations.

The phenotypic and genotypic correlation coefficients between the studied characters in the F2, F3, and F4 generations were estimated using the method outlined by Dewey and Lu41 and Millar et al.42.

The appropriate index weights (b, s) were calculated from the following formula. (b) = (P)−1(G)(a)43,44.

where (b) = Vector of relative index coefficients, (P)−1 = Inverse phenotypic variance–covariance matrix, (G) = Genotypic variance–covariance matrix and (a) = Vector of relative economic values based on equally important, i.e., (a)w = (a)1 = (a)2 = (a)3 = 1.

To calculate various selection indices, we used the following formula:

I = b1x1 + b2x2 +  + bnxn43,44.

The formula for estimating the predicted improvement in lint yield based on an index is:

ΔY Selection advance (SA) = SD × Σ(bi × σgiw)1/245.

where SD is the selection differential in standard units, bi represents the index weights for the characters considered in the index, and σgiw represents the genotypic covariances of the characters with yield.

The formula for estimating the predicted genetic advance in lint yield based on direct selection is ΔGwdue to selection for Xi = K·σgwi/σpi46.

The predicted response in any selected and unselected character was calculated based on the methodology proposed by Walker45 Robinson and Comstock 48. The realized gains were then determined by computing the deviation of the generation mean for each character from the procedure mean of that character. An ordinary analysis of variance was performed for the randomized complete block design according to Kearsey and Pooni39, and estimates were obtained for the phenotypic (PCV %) and genotypic (GCV%) coefficients of variations, phenotypic (σ2ph) and genotypic (σ2g) variances. The calculation of phenotypic and genotypic correlation coefficients was done using the methodology outlined by Hussain et al.47. Heritability in the broad sense was obtained as described by Borojević et al.48, and as described by Warner40.

Ethics approval and consent to participate

This article does not contain any studies with human or animal subjects. The current experimental research and field study including the collection of plant material, is complying with relevant institutional, national, and international guidelines and legislation and used for research and development.

Results

Range, means, phenotypic and genotypic coefficients of variation (PCV and GCV), as well as broad-sense heritability (H2b %), for the studied traits across F2, F3, and F4 generations, are presented in Table 1. The results indicate that the mean values for the first opening flower (FOF) were 72.05, 71.98, and 70.54 days, and the PCV values were 4.62%, 3.93%, and 3.19%, while the GCV values were 3.83%, 3.76%, and 3.02%, respectively. Similarly, for the first fruiting node (FFN), the mean values were 7.66, 7.17, and 6.80, and the PCV values were 15.05%, 8.64%, and 6.30%, while the GCV values were 10.78%, 7.78%, and 5.85%, respectively, across F2, F3, and F4 generations. As for boll weight (BW) (g), the mean values were 3.19 g, 3.25 g, and 3.81 g, and the PCV values were 9.88%, 8.20%, and 8.10%, while the GCV values were 7.40%, 7.14%, and 6.63%, respectively, across F2, F3, and F4 generations. Regarding seed cotton yield per plant (SCY/P), the mean values across F2, F3, and F4 generations were 237.01 g, 298.63 g, and 304.21 g/plant, with corresponding PCV values of 35.75%, 17.62%, and 16.36%, and GCV values of 26.85%, 16.16%, and 15.37%. For lint yield per plant (LY/P), the mean values were 88.06 g, 117.15 g, and 126.89 g, with PCV values of 37.15 g, 19.29 g, and 18.58 g, and GCV values of 28.79%, 17.76%, and 17.62%, respectively. As for lint percentage (L%), the mean values were 37.06%, 39.08%, and 41.67%, with PCV values of 5.55%, 5.50%, and 5.19%, and GCV values of 5.26%, 5.42%, and 4.57%. For seed index (SI), the mean values were 11.10 g, 10.29 g, and 12.17 g, with PCV values of 8.13%, 7.38%, and 4.27%, and GCV values of 7.04%, 6.83%, and 4.03%. Concerning the lint index, the mean values were 6.55, 6.63, and 8.73, with PCV values of 11.26%, 11.02%, and 10.46%, and GCV values of 10.26%, 10.00%, and 8.98%. Overall, the results suggest that heritability in broad sense (h2b%) values in F4 were higher than in F3, which were in turn higher than in F2 for all the traits analyzed.

Table 1 Estimates of broad-sense heritability (h2b), phenotypic (PCV), and genotypic (GCV) coefficients of variation, range and means, for the eight studied characters in F2, F3, and F4 generations.

Selection criteria

Table 2 presents the mean values, as well as the phenotypic and genotypic coefficients of variation for traits in the five different selection criteria from F4 families. The data revealed that selection criterion I1 had the highest value for the first opening flower at 67.14, as well as the highest PCV and GCV values for the first fruiting node at 7.33% and 6.74%, respectively. Selection criterion I2 yielded the highest mean value for both seed cotton yield (122.93) and seed index (12.58). Additionally, selection criterion I3 had the highest PCV and GCV values for the first opening flower, boll weight, seed index, and lint index, with values of 2.48%, 2.18%, 11.11%, 10.39%, 5.86%, 5.43%, and 13.59%, 13.33%, respectively. Selection criterion I4 showed the highest mean values for boll weight, lint yield, lint percentage, and lint index, with values of 4.11, 51.94, 42.60, and 9.27, respectively. Finally, selection criterion I5 gave the best mean value for the first fruiting node at 6.66, and the highest values for PCV and GCV for seed cotton yield and lint yield with values of 19.07%, 18.22%, and 21.81%, 20.95%, respectively. The highest PCV and GCV estimates suggest that the first fruiting node, seed cotton yield, boll weight, first flower opening, seed index, lint yield, and lint percentage are mainly under genetic control and less affected by the environment.

Table 2 Mean, phenotypic (PCV) and genotypic coefficient of variations (GCV) for earliness, yield, and its components for the five different selection criteria parameters of F4 selected families.

Mean performance of the thirty selected families of F4 generation

Table 3 displays the average performance of thirty selected families from the F4 generation of the Giza 94X Suvin population for all traits studied. Significant differences were noted among various progenies compared to the better parent and commercial variety. For the first opening flower trait, progenies No. 11, 14, and 21 showed significant differences from the superior parent, while progenies No. 12 and 24 exhibited highly significant differences. When compared to the commercial variety, progenies No. 2, 18, 27, and 29 differed significantly, and progenies No. 11, 12, 13, 14, 15, 21, 24, and 25 differed highly significantly. In terms of the first fruiting node trait, progenies No. 11 and 12 differed significantly from the superior parent, while progeny No. 2 showed highly significant differences. Progenies No. 3, 5, 10, 17, and 20 also exhibited significantprogenies No. 1, 2, 6, 11, 12, 14, 18, 24 , 27, and 30 differing significantly compared to the commercial variety. For boll weight, progenies No. 16 and 29 significantly and highly significantly differed from the superior parent, respectively. Progenies No. 3 and 7 showed significant differences and progenies No. 6, 16, and 29 differed significantly compared to the commercial variety. The seed cotton yield ranged from 90.85 g (progeny No. 1) to 167.06 g (progeny No. 20). Progenies No. 6, 17, and 22 significantly and highly significantly differed from the superior parent, respectively. Progenies No. 17, 22, and 23 also shwoed significant and highly significant differences from the commercial variety, respectively. For lint yield, progenies No. 6, 9, 13, 17, 19, 20, and 28 highly differed from the better parent, with progenies No. 21, 22, and 29 showing significant differences. Progenies No. 6, 9, 13, 17, 19, 20, and 28 also differed highly significantly from the commercial variety, with progenies No. 21, 22, and 29 showing significant differences. Regarding the lint percentage trait, progenies No. 25 and 27 exhibited significant and highly significant differences from the better parent, respectively. Meanwhile, twenty families significantly differed from the commercial variety. For the seed index trait, progenies No. 1, 2, 6, 13, 16, and 29 showed highly significant differences from the superior parent. Progenies No. 14, 26, and 28 exhibited significant and highly significant differences from the commercial variety, respectivelyIn terms of the lint index trait, progenies No. 9, 15, and 20 significantly and highly significantly differed from the superior parent, respectively. Progenies No. 6 , 7, 16, 17, 18, 19, 28, 29, and 30 also showed significant and highly significant differences from the superior parent, respectively. Progenies No. 4, 5, 13, 24, and 27 exhibited significantly and highly significantly differed from the commercial variety, respectively.

Table 3 Mean performance of thirty selected families from the F4 generation of the Giza 94X Suvin population for all studied traits.

Phenotypic and genotypic correlation

Phenotypic and genotypic correlation coefficients were calculated for all studied traits in the F4 selected families (Table 4). Positive and highly significant correlation values wereobserved for first opening flower and first fruiting node (0.445* and 0.521**), first fruiting node vs seed index (0.410* and 0.591**), seed cotton yield with both lint yield and seed index (0.848** and 1.000**) and (0.809** and 0.912**) respectively. Additionally, the correlation values between boll weight with bothlint percentage and lint index were also highly significant and positive (0.560** and 0.664**) and (0.685** and 0.851**), respectively. Conversely,a negative correlation was observed between boll weight (-0.349*) and seed index (-0.472**). Highly significant positive correlation values were observed between seed cotton yield and both lint yield and seed index (0.848** and 1.000**) and (0.809** and 0.912**). Furthermore, for lint yield, positive highly significant correlation values (0.503** and 0.532**), (0.725** and 0.747**) and (0.516** and 0.531**) were observed between lint yield and both lint percentage, seed index and lint index (0.503** and 0.532**), (0.725** and 0.747**) and (0.516** and 0.531**).

Table 4 Coefficient of phenotypic and genotypic correlations among different character combinations of 8 quantitative traits for the F4 selected families.

Density plots for the studied traits in the three generations

Figure 1 displays density plots displays the studied traits in the three generations using a smoothed kernel density function to represent the probability of the trait values. The area under the curve represents the distribution of values, and the Y-axis represents their probability. The X-axis value corresponding to the peak of each density plot is the average of the trait. The peaks of each density plot show a higher concentration of values and therefore a higher probability, while the tails demonstrate a lower concentration of values and a lower probability. The figure indicates that the peak value of LI for F2 is different from F3 and F4, with the means higher than the mean of LI for F3 and F4. A similar trend is observed for SI, while the opposite is true for other traits. The density plots demonstrate that selection from F2 improved the means of LI and SI but decreased the variation of all studied traits.

Figure 1
figure 1

The density plot of six studied traits: First Opening Flower (FOF), First Fruiting Node (FFN), Boll Weight (BW), Seed Cotton Yield (SCY), Lint Percentage (L%), Seed Index (SI), and Lint Index (LI). The plot uses a smoothed kernel density function to represent the probability of trait values, with the area under the curve indicating the distribution of values and the Y-axis representing the probability.

Principal component analysis (PCA)

In biplot analysis (Fig. 2), the sharp angle (below 90 degrees) and the obtuse angle (above 90 degrees) between the variables indicated positive and negative correlation between variables, respectively. Positive correlations were observed among the indices of BW, LI, L, LY, SI, and SCY. These indices were highly positively correlated with F2, F3 and F4.

Figure 2
figure 2

The PCA biplot for the three generations and traits (with a sample size of 8) displays the contribution of individuals to each variable. Individuals on the same side as a variable are seen as having a high impact on it. The strength of contribution to each principal component (PC) is shown by the magnitude of the vectors (lines). The direction of the vectors indicates the correlation between variables, with vectors pointing in similar directions showing positive correlation, vectors pointing in opposite directions indicating negative correlation, and vectors at proximately right angles indicating low or no correlation. The colored concentration ellipses represent the observations grouped by mark class, with the size determined by a 0.95 probability level.

Path analysis

The study utilized path analysis, a statistical method for examining causal relationships between variables, using the "sem" function in the "lavaan" package of R software. The results were displayed in path diagrams (Figs. 3, 4 and 5) that illustrated the effects of five factors (lint index, seed index, lint percentage, first opening flower, and first fruiting nod) on three outcomes (boll weight, lint yield, and seed cotton yield) across three generations. Three types of arrows were used in the path diagrams: single-headed arrows indicating causal relationships, double-headed arrows indicating covariance between two factors, and double-headed arrows pointing to a single factor representing the variance of that factor. The findings revealed that the lint index had a strong positive effect on both lint yield and boll weight in all three generations, with R2 values of 0.93, 0.87, and 1.51 for lint yield and 1.07, 1.24, and 1.7 for boll weight in F2, F3, and F4, respectively. Moreover, lint yield had a direct and significant effect on seed cotton yield in all three generations, with R2 values of 1.03, 1.05, and 1.04 for F2, F3, and F4, respectively. Boll weight had a stronger direct effect on lint yield than both lint percentage and boll weight themselves, as evidenced by a more substantial direct effect of boll weight on lint yield than the direct effect of lint percentage and boll weight on lint yield. The indirect effect of the lint index and seed index on lint yield through lint percentage had the same pattern across all three generations, where the lint index had a positive effect and the seed index had a negative effect. Meanwhile, the seed index had no indirect effect on lint yield through boll weight, whereas the lint index and seed index had a positive indirect effect on lint yield through boll weight in all three generations. Finally, the direct and indirect effects on lint yield were more pronounced in F4 than in F2 and F3, possibly due to selection that strengthened the relationship between lint yield and its components, resulting in a more stable progeny of that generation. In summary, the path analysis provided valuable insights into the causal relationships among different factors and their effects on lint yield, boll weight, and seed cotton yield across three generations.

Figure 3
figure 3

Path diagram that illustrates both the direct and indirect effects of the studied traits in the F2 generation. Bidirectional arrows show correlation between the variables, and unidirectional arrows indicate a direct effect on the direction of the arrow, blue and red arrows represent positive and negative effects. Solid arrows indicate P < 0.05 and dashed arrows indicate P > 0.05. First Opening Flower (FOF), First Fruiting Node (FFN), Boll Weight (BW), Seed Cotton Yield (SCY), Lint Percentage (L%), Seed Index (SI), and Lint Index (LI). The plot uses a smoothed kernel density function to represent the probability of trait values, with the area under the curve indicating the distribution of values and the Y-axis representing the probability.

Figure 4
figure 4

Path diagram that illustrates both the direct and indirect effects of the studied traits in the F2 generation. Bidirectional arrows show correlation between the variables, and unidirectional arrows indicate a direct effect on the direction of the arrow, blue and red arrows represent positive and negative effects. Solid arrows indicate P < 0.05 and dashed arrows indicate P > 0.05. First Opening Flower (FOF), First Fruiting Node (FFN), Boll Weight (BW), Seed Cotton Yield (SCY), Lint Percentage (L%), Seed Index (SI), and Lint Index (LI). The plot uses a smoothed kernel density function to represent the probability of trait values, with the area under the curve indicating the distribution of values and the Y-axis representing the probability.

Figure 5
figure 5

Path diagram that illustrates both the direct and indirect effects of the studied traits in the F2 generation. Bidirectional arrows show correlation between the variables, and unidirectional arrows indicate a direct effect on the direction of the arrow, blue and red arrows represent positive and negative effects. Solid arrows indicate P < 0.05 and dashed arrows indicate P > 0.05. First Opening Flower (FOF), First Fruiting Node (FFN), Boll Weight (BW), Seed Cotton Yield (SCY), Lint Percentage (L%), Seed Index (SI), and Lint Index (LI). The plot uses a smoothed kernel density function to represent the probability of trait values, with the area under the curve indicating the distribution of values and the Y-axis representing the probability.

Discussion

The results indicate that genetic factors play a significant role and that there is a possibility of achieving positive outcomes by selecting early segregating generations in the presence of water deficit stress. These findings are consistent with earlier research32,49,50. The F2 and F3 generations showed greater phenotypic and genotypic coefficients of variation for all traits studied compared to the F4 generation. This decrease in genetic variability and heterozygosity may have resulted from various selection methods that reduced a significant portion of the variability30,51,52. Although the mean performance of all traits studied was higher in the F4 generation than in the F3 generation, the favorable alleles that accumulated due to effective selection procedures resulted in a lower desirable MR in the F4 generation. The study suggests that selecting for these traits is possible, and other researchers have reported similar results5,49,50,53. Plant breeders need to consider all the economic traits and not just focus on one. Correlation analysis is a useful tool for predicting how a change in one trait will affect another. The genotypic correlations were generally higher than the phenotypic correlations, which could be attributed to the relative stability of genotypes under certain selection pressures. This finding is consistent with other studies16,37,43,51,54. The study showed that most traits exhibited an increase in mean values across generations in the population due to additive gene action. Additionally, high heritability values were observed, possibly due to the close relationship between PCV and GCV values. All traits under study showed a very high to high degree of heritability, indicating a low or negligible influence of the environment on their expression, making them suitable for selection and improvement. The findings are in agreement with Ali et al.'s research53 which reported a persistent correlation between GCV and PCV values, leading to high estimates of broad-sense heritability. The results also indicated a minor disparity between GCV and PCV values for most traits, suggesting that environmental factors have a minimal impact on their expression. The variation in the values of genotypic and phenotypic coefficients was narrow for all studied characters which indicated less influence of environment in the expression of these characters35. The high values of PCV and GCV together with high values of heritability are a good indicator for genetic advance in this population through the amount of genetic variance to be expected from the selection. It was suggested that combining the genetic coefficient of variation with heritability would provide the most accurate estimation of the expected genetic variance resulting from selection8,14. The three most influential factors in genetic progress in the three populations were high selection intensity, genotypic coefficient of variation, and heritability1,10,32,37,43,53,55. The differences between generations may be due to the various genotypes scored by each selection criteria. The study found that selection criteria I2, I4, and I5 were effective in improving most yield and its component traits, while criterion I1 was efficient in improving earliness traits. The narrow range between PCV and GCV values for different selection criteria suggests that environmental factors have a minimal impact on these traits. The analysis of variance revealed significant differences among families of F4 for all studied traits, indicating that selection criteria would be effective. The increase in selected families was greater than the better parent and commercial variety. The results also showed variations in the mean performance of most selected families, which could be attributed to gene expression. Therefore, selecting adaptable families with higher means than the better parent and commercial variety would help produce highly yielding and earlier crops, particularly in winter cropping, without a decrease in yield23,27,31,39,56. It appeared that most of the yield and its component traits were positive and either significant or highly significant correlated with each other, indicating that these traits are important components of cotton yield. A positive association between major yield components is very significant to the breeder because component breeding would be very effective under such a situation. Such correlations indicate that selection for improving one or more of these traits would improve other traits17,36,57. The increases in yield and its components for this population were higher in family numbers (9, 13, 19, 20 and 21). These families could be continued to further generations as breeding genotypes for developing higher yield and its components. Nazmey et al.58 studied cotton yield and yield components in relation to the relative contribution and found that seed cotton yield was significantly and positively correlated with boll weight and lint yield. The same relationship was found between lint yield and each boll weight and seed cotton yield59,60,61.Which reported that positive significant correlation values were found between seed cotton and lint yields with boll weight and lint percentage. It was indicated that seed cotton yield was highly significantly genetically correlated with boll weight (r = 0.99), lint yield (r = 0.88) and lint index (r = 0.96). With respect to, the phenotypic and genotypic correlation values between lint percentage and lint index there were positive highly significant correlation values (0.906** and 0.938**)62.

Conclusion

Based on our findings, it can be concluded that the studied cotton traits, including first opening flower, first fruiting node, boll weight, seed cotton yield, lint yield, lint percentage, seed index, and lint index, are significantly influenced by genetic factors. The broad-sense heritability values indicate that the traits are heritable, with higher heritability values observed in the F4 generation compared to F3 and F2. The results of the different selection criteria indicate that the traits under genetic control are suitable for breeding programs. The mean performance of selected families from the F4 generation of the Giza 94X Suvin population showed significant differences in various progenies compared to the better parent and commercial variety, indicating the potential for further selection and improvement of cotton traits. There are several prospects for future research and breeding programs aimed at improving cotton traits. Firstly, the traits found to be under genetic control and suitable for breeding programs, including first fruiting node, seed cotton yield, boll weight, first flower opening, seed index, lint yield, and lint percentage, can be further studied to identify specific genetic markers and genes responsible for these traits. This information can then be used to develop more efficient breeding strategies, such as marker-assisted selection, to improve these traits in cotton varieties. Secondly, the selected families from the F4 generation with significant differences in progenies compared to the better parent and commercial variety can be further evaluated for their agronomic performance and adaptability to different growing conditions. This information can then be used to identify the best-performing families and varieties, which can be released for commercial cultivation. Lastly, the heritability values observed in the F4 generation suggest that this generation may be suitable for developing new cotton varieties through the application of different breeding methods, such as recurrent selection or hybridization with other elite varieties. The resulting new varieties can then be evaluated for their performance and suitability for cultivation in different regions.