Introduction

Various breeding programs (GIFT, GET‐EXCEL, FaST, and GST) have utilized hybridization between different Oreochromis species and strains to improve growth rate. Followed by selection, these interspecific crosses have demonstrated a wide range of genetic variation in growth rate (Bolivar 1998; Tayamen 2004; Zimmermann and Natividad 2004). The Thai-Chitralada O. niloticus strain is derived from a strain of 50 fish from Japan whose lineage originated in Egypt (Mair et al. 2004); and which was introduced to the Royal Chitralada Palace in Thailand in 1965. Chitralada has been considered a potential leading strain in Asian countries, as it has desirable traits for aquaculture (Macaranas et al. 1997). Based on mitochondrial and genomic analyses, it has been demonstrated that O. aureus, O. niloticus (Egypt), and O. mossambicus contributed to the formation of Thai-Chitralada (Curzon et al. 2019). Effective farming of O. niloticus requires all-male culture, characterized by uniformity and high growth rate (Shirak et al. 2018). This can be achieved by deciphering the genetic regulation of sex determination (SD), and by developing the manipulation thereof, for the production of an all-male population (Chen et al. 2018).

A gene whose action is sufficient to initiate an entire developmental program with a binary switch that determines whether an individual will develop as a male or a female is designated a “master key regulator” (MKR; Harrison 2007). Different MKR genes such as sox3, dmrt1, irf9, Anti-Müllerian hormone (amh), amhrII, and gsdf are involved in SD of vertebrate and insect taxa (Volff et al. 2003, 2007; Mei and Gui 2015; Li and Gui 2018). Many MKR genes of SD in animals demonstrate copy number variation (CNV) between sexes (Volff and Schartl 2001). The sex-determining region Y (sry) is an MKR that initiates testis development in most eutherian mammals, and may have evolved from sox3 through duplication and regulatory mutation in the proto-Y chromosome allele (Gubbay et al. 1990; Sinclair et al. 1990). A sex-specific copy of dmrt1 has been proposed as the prime candidate for initiating GSD in several species (Nanda et al. 2002; Matsuda et al. 2002; Yoshimoto et al. 2008; Smith et al. 2009; Chen et al. 2014; Cui et al. 2017). In the case of rainbow trout (Oncorhynchus mykiss), it has been suggested that sdY, a male-specific MKR universal in salmonids, evolved through a duplication event (Yano et al. 2013). amh is a member of the TGF-β super family, which has been suggested as an MKR for GSD in O. niloticus (Shirak et al. 2006; Eshel et al. 2012), as well as in other distant fish species (Hattori et al. 2012; Yamamoto et al. 2014; Rondeau et al. 2016; Bej et al. 2017). In O. niloticus, amhΔy is located near the regular gene of amh, and differs from the sequence of amh by a 233 bp deletion in exon 7 (Eshel et al. 2014) (Fig. 1). A more recent study shows an insertion of 5 bp in exon 6 of amhΔy, but not in amh; furthermore, the Y-linked amh (amhy) differs from the X-linked amh by a deletion in the promoter region, and by a missense mutation in exon 2 (Li et al. 2015).

Fig. 1: Schematic representation of the structure of the X and Y copies of the amh gene in O. niloticus.
figure 1

Adapted from Eshel et al. (2014) and Li et al. (2015). Black vertical lines and their nucleotide position relative to the beginning of the ORF, on exon 1 of each gene copy, represent alteration of the DNA sequence (deletion, insertion, and SNP). Relative positions of pairs of primers for PCR are marked by identical arrows in opposing directions. The PCR based on forward primer on E2 has a different reverse primer on E6 for amhΔy than that for amh and amhy. 100 bp size ladder is indicated.

Three SD loci on linkage groups (LGs) 1, 3, and 23 have been detected by several independent groups for different Oreochromis species and their hybrids (Shirak et al. 2002; Lee et al. 2003, 2004; Cnaani et al. 2008; Eshel et al. 2011; Lühmann et al. 2012; Liu et al. 2013; Palaiokostas et al. 2013; Sun et al. 2014; Lin et al. 2016). Other LGs that are associated with SD have been reported, e.g., LGs 2 and 6 (Shirak et al. 2002), LG12 (Shirak et al. 2008), and LG14 (Gammerdinger et al. 2019). It has also been demonstrated that the effect on GSD might be mimicked by embryonic or adult sex-specific mortality (Shirak et al. 2002, 2008). Environmental temperature may also influence sex, as high incubation temperature (over 32 °C) during weeks 1–3 post hatching causes masculinization of O. niloticus (Baroiller et al. 2009). GSD in Oreochromis species was proposed to be monofactorial (Wohlfarth and Wedekind 1991). The XX/XY SD system in O. mossambicus, the WZ/ZZ system in O. aureus and O. urolepis hornorum, and the XX/XY system in O. niloticus were mapped to LGs 1, 3, and 23, respectively (Lee et al. 2003, 2004; Ezaz et al. 2004; Liu et al. 2013; Eshel et al. 2014; Zhu et al. 2016). Consequently, amh was mapped as the causative gene for SD on LG23 (Eshel et al. 2014; Li et al. 2015).

The objectives of this study were: (1) to evaluate the possible role of identified indels and SNPs in the amh gene on GSD, based on association with sex of different O. niloticus strains; (2) to develop an efficient assay for different O. niloticus strains capable of discriminating between XX, XY, and YY genotypes, in order to develop and maintain an all-male population; and (3) to determine additional genomic regions affecting GSD in the Thai-Chitralada hybrid strain.

Materials and methods

Genetic strains

A collection of six strains was assembled, comprising the Chitralada hybrid, and five O. niloticus strains that included a local strain of Ghana at Dor Research Station (Shirak et al. 2018), plus strains from the University of Swansea, and from Canada, Egypt, and Ghana (Shirak et al. 2009). The complete data on genetic strains, full-sib progeny, and number of samples from both sexes are provided in Table 1. The strains and full-sib progeny were bred and maintained in Dor research station (Israel). All O. niloticus strains were previously analyzed for COI sequences that corresponded to the respective O. niloticus sequences in the BOLD database (Shirak et al. 2009).

Table 1 Genetic strains and samples used for the different experimentsa.

Fish rearing and sexing

Three of the strains described above, Chitralada, Swansea, and Ghana, are currently bred in Dor station. A family for each of these strains was produced by artificial insemination at the Dor research station. Two additional families of Thai-Chitralada, consisting of 130 and 50 full-sib progeny, were reared at Dor Research Station (Tables 5 and 6). Fertilized eggs were incubated in Zuger bottles (28–29 °C). Fry of each strain was transferred into separate rearing tanks (28–29 °C), and sex was determined at the age of 4 months by microscopic examination of squashed gonads after fry dissection (Guerrero and Shelton 1974).

Fin samples, DNA extraction, and purification of PCR products

Fin samples were preserved in alcohol. DNA was extracted from 100 to 200 mg of caudal fin using MasterPure DNA Purification Kit (Madison, USA), and PCR was performed using Super-Therm Taq DNA polymerase (JMR Holding Inc, London, UK) at 60° for annealing (Table 2). Reactions were performed in a 10-μl volume containing: 1 × PCR buffer with 2 mM MgCl, 1 U DNA polymerase, 100 μM dNTP, 10 pM primer, and 10–30 ng of genomic DNA. The band of relevant size was excised from the gel, purified with DNA Montage Gel Extraction Kit (Millipore, Bedford, MA, USA), and used as template for sequencing.

Table 2 Primers for detection and analysis of amh copies.

Genotyping of microsatellite markers on LGs 1 and 3

PCR for microsatellite markers used dye-labeled forward and unlabeled reverse primers for BYL18 and UNH168 markers on LGs 1 and 3, respectively (Shirak et al. 2018). The amplified products were separated on an ABI3130 DNA sequencer, and sized using the GeneMapper software v. 4.0 (Applied Biosystems) with GeneScan-500 LIZ size standard (Applied Biosystems) (Dor et al. 2014).

amh sequencing and detection of polymorphism

Polymorphism in exon 2 of amh was tested by sequencing four males and females from each strain (except for Egypt, where only three females were present). A forward primer in the beginning of exon 2 and two reverse primers in exon 6 (Table 2) were designed to investigate polymorphism in exon 2 of amh, amhΔy, and amhy (Eshel et al. 2014; Li et al. 2015). Orientation and position of primers are presented in Fig. 1 and Table 2, respectively. As intronic regions were not conserved, sequencing was analyzed only for exon 2 for the different strains. Presence or absence of 5 bp insertion in exon 6 (Ins6) was analyzed for all available strain samples by using PCR primer pair Ins6F/R (Table 2 and Fig. 2). Presence or absence of 233 bp deletion in exon 7 (ILL2) was analyzed following Eshel et al. (2014).

Fig. 2: Missense mutations in exon 2 of amh copies of different O. niloticus strains.
figure 2

Sequencing results of amh for XY and YY genotypes of different strains (Sw Swansea, Ch Chitralada, OnG Ghana) based on ratio of peak height of 5 bp insertion on amhΔy exon 6 (312 bp) designated as “Y,” and its absence (307 bp) designated as “X.” The PCR for sequencing included E2 forward primer and either both reverse primers on E6 for all amh versions (a) or copy specific reverse primer on E6 for amh and amhy lacking the 5 bp insertion (b) or copy specific reverse primer for amhΔy harboring the 5 bp insertion (c). A summary of deduced mutations is given based on the structure of the amh gene (Fig. 1) (d). Each single nucleotide change is denoted by its position (in bp) relative to the beginning of the ORF, on exon 1, and the prevalent and novel nucleotides before and after the dividor symbol (/), respectively. Novel nucleotides for the different gene copies and strains are in gray. Each single nucleotide change capable of encoding an amino acid is denoted by its position (in aa) and by the single-letter amino acid codes of the prevalent and novel amino acids before and after the asterisk, respectively.

Statistics

Association of marker genotypes and sex in O. niloticus strains was tested using a chi-square test. The JMP® statistical package (Pro 13, SAS Institute, Cary, NC) was used for logistic regression of sex on markers on LGs 1 and 3, and on amh gene on LG23. Separate analysis was used for each one of the two families of Chitralada consisting of 130 and 50 full-sib progeny, respectively. Fisher’s exact chi-squared test was applied for association study of individual genetic markers and sex.

Results

Exon 2 sequencing of amh, amhΔy, and amhy

Allele-specific amplification for different copies of the amh gene identified three SNPs in the different Oreochromis strains. The schematic representation of the gene following our previous findings (Eshel et al. 2014) and that of Li et al. (2015), and the identified polymorphism, is presented in Fig. 1. The sequencing results for exon 2 in different Oreochromis strains, and the nucleotide changes capable of encoding amino acid substitutions, are presented in Fig. 2. Three non-synonymous SNPs in positions 310, 364, and 393 nucleotides of exon 2, relative to the beginning of the open reading frame (ORF) in exon 1, were detected in the Ghana and Swansea strains of O. niloticus and in the hybrid of Chitralada. None of the polymorphisms was conserved in all strains. Individuals of the O. niloticus strain of Ghana at Dor station showed the same profile of SNPs as the original Ghana strain. Polymorphism was not detected in any of the individuals of Egypt and Canada strains. Fluorescent peak ratios in SNP positions of XY individuals were 2:1, as compared to 1:1 for YY individuals (Fig. 2). This reflects the number of different amh copies underlying the chromatograms of Sanger sequencing (Seroussi et al. 2013).

Development of a marker for sex based on 5 bp insertion in amhΔy exon 6

Following the observation of 5 bp insertion on exon 6 (Li et al. 2015), we developed a fluorescent assay, using primers Ins6F/R, for the detection of the insertion in amhΔy, representing exclusively the Y chromosome; and for its absence in amh and amhy copies, representing both the X and Y chromosomes (Fig. 3). The resulting genotypes 307/307, 307/312, and 312/312 were in complete concordance with XX, XY, and YY genotypes, respectively (Fig. 3a). Full-sib progeny of Thai-Chitralada XY × XY cross was analyzed for calibration of the 5 bp insertion marker using the fluorescent assay (Fig. 3b). Twelve XX females exhibited a single product of 307 bp. The realized ratios between the 307 and 312 bp peak height alleles for 22 XY males varied from 1.87 to 2.03, with a mean of 1.947 ± 0.042. The respective ratios for 11 YY males varied from 0.959 to 1.002, with a mean of 0.980 ± 0.015. Thus, the realized peak height ratios in XX, XY, and YY individuals were 1:0, 2:1, and 1:1 based on the fluorescent fragment analysis, this being compatible with the expected dosage of X and Y chromosomes.

Fig. 3: Development of a marker for sex identification based on a 5 bp insertion in exon 6 of amhΔy.
figure 3

Analysis of a 5 bp insertion on amhΔy exon 6 allowed detection of fragments with lengths of 312 and 307 bp that are associated with sex. The insertion (312 bp) is designated as “Y,” and its absence (307 bp) as “X.” To the left, alleles’ patterns of three individuals with XX, XY, and YY are shown (a). To the right, based on peak height ratios, determination of XY and YY genotypes of 32 Chitralada individuals is exemplified (b). X-axis represents the individuals ordered by their genotypes (21 XY males followed by 10 YY males). Y-axis represents peak height ratios between these allele fragments that were calculated based on their fluorescence values.

Associations of 5 bp insertion in amhΔy exon 6 with sex

The concordance rate between the 5 bp insertion in amhΔy and sex varied in samples of six Oreochromis strains from 85 to 100% (Table 3). Significant associations between the marker and sex were found for Chitralada and Ghana strains (p < 0.001): 170 out of 200 Chitralada individuals (85%) showed the expected sex, whereas complete concordance was obtained for the Ghana strain (n = 90). Other O. niloticus strains had small samples of individuals that were under the minimum threshold required for statistical analysis by chi-square test. Nevertheless, the overall association of this marker and sex for 323 individuals over all strains was highly significant (p < 0.0001) (Table 3). Full-sib progeny of the current strains of Chitralada, Swansea, and Ghana showed significant associations with sex, with concordance rates of 91%, 92%, and 100%, respectively (Table 4).

Table 3 Associations of 5 bp insertion in exon 6 of amhΔy and sex distribution in six O. niloticus strains.
Table 4 Associations of 5 bp insertion in exon 6 of amhΔy and sex distribution in three families of different O. niloticus strainsa.

Analysis of the 233 bp deletion in amhΔy exon 7

Genotyping exon 7 of amh revealed 209/434 and 434/434 genotypes. By sequencing of the different alleles, we found several SNP differences between species, while maintaining the size and location of the 233 bp deletion in exon 7. However, the peak height ratio of the 209–434 alleles was not stable, with significant bias toward amplification of the smaller 209 bp allele. This may be expected in view of the twofold difference in allele fragment length. It should be noted, however, that the observed size difference between these alleles was only 225 bp, in comparison to the expected 233 bp. Sequencing of the different alleles validated the expected size difference, and the size calling of the two alleles may have been affected by its altered migration on the acrylamide gel. Nevertheless, complete correspondence was obtained between the 233 bp deletion on exon 7 and the 5 bp insertion on exon 6, and between both markers and sex in Chitralada full-sib progeny, as well as in all other strains.

Effects of markers in LGs 1, 3, and 23 (amhΔy) on sex in Chitralada

Genotypes of full-sib progeny of Chitralada XY × XY mating (Family 1) were analyzed for the effects of BYL018 (LG1) and UNH168 (LG3) markers, and of amhΔy (LG23) on sex (Table 5). Dam and sire genotypes of Family 1, for markers on LGs 1, 3 and 23, formed 24 possible genotypic combinations (4 × 2 × 3). Genotype effects of the main combinations that fully explained sex are presented in Table 5 (logistic probability of 1022). All 99 individuals bearing the Y chromosome (amhΔy) on LG23 are males. In the absence of the Y chromosome on LG23 (amhΔy), alleles of 245 bp on LG1 and 159 bp on LG3 were associated with females. An XX genotype on LG23 without a 159 bp allele on LG3, and with a 245 bp allele on LG1, resulted in males, whereas all other allele combinations were females. The genotypes for the three loci and sex of Chitralada XY × XX full-sib progeny of Family 2 are presented in Table 6 and Supplementary File 1. Markers on LGs 1 and 23 showed significant effects by Fisher’s exact test, with R squared values of 0.10 and 0.37, respectively. However, genotypes of both markers were highly correlated, despite segregation in different LGs (p < 0.02), thus indicating an equivocal effect on LG1.

Table 5 Genotype effects of BYL018 (LG1), UNH168 (LG3) markers, and amhΔy (LG23) on sex distribution (female:male ratio) of Chitralada full-sibs progeny of Family 1.
Table 6 Genotype effects of BYL018 (LG1), UNH168 (LG3) markers, and amhΔy (LG23) on sex distribution (female:male ratio) of Chitralada full-sibs progeny of Family 2.

Discussion

SD mechanisms are ancient and conserved, yet, there is diversity in primary sex-determining signals that trigger male or female development (Bachtrog et al. 2014). O. niloticus demonstrates different regulators of GSD in closely related strains, accordingly, comparative genomic analysis may reveal the different regulators that affect GSD in the different strains.

Conservation of amhΔy and its potential role in GSD

A novel male-specific duplication of amh, denoted amhΔy, lacking the TGF-β domain was identified and mapped to the QTL region on LG23 for SD, thus indicating its potential role in GSD of O. niloticus (Eshel et al. 2014, Li et al. 2015). Li et al. (2015) found that amhΔy is located upstream to another copy of amh on the Y chromosome (amhy). The schematic representation of the structure of the X and Y versions of the amh gene in O. niloticus is presented in Fig. 1. Knockout of amhy by CRISPR/Cas9 resulted in male to female sex reversal in XY fish, while amhy, but not amh, overexpression resulted in female to male sex reversal (Li et al. 2015). A missense SNP (C/T) in the ORF of amh (exon 2) has been suggested as regulating male SD (Li et al. 2015). However, the results of our study showed that, although in all strains sex was associated with polymorphism in amh, this SNP in exon 2 was found only in a single strain (Swansea). In a recent study all GIFT individuals were found to have the base C in this position, indicating absence of the suggested SNP (Taslima et al. 2020). Thus, the candidacy of this SNP for the causal variation of SD is excluded. Expression analysis previously showed that both amhΔy and amhy genes are transcribed, but amhΔy is probably not translated (Li et al. 2015). However, interestingly, some nonfunctional genomic pseudogenes which are in-tandem duplicates of the intact gene may act as regulators of gene expression (Han et al. 2011). As amhΔy is not translated to a protein, it may act by regulating amh gene expression. The amh gene was recently discovered as an MKR in additional distant teleost species, e.g., lingcod, Ophiodon elongates (Rondeau et al. 2016) and silverside, Hypoatherina tsurugae (Bej et al. 2017). Similar to O. niloticus, both males and females share one copy of amh, however, a second duplicate copy appears solely in male individuals, suggesting that convergent evolution of an additional copy of amh gene may have occurred in multiple teleost lineages, thus adopting an MKR role. Sequence analysis of different strains of O. niloticus for amhΔy clearly shows the high association between the occurrence of an additional copy of amh and maleness in strains originating from Eastern and Western Africa; and thus supports a putative conserved function of amh. Other polymorphisms between amhΔy and amhy, such as the large deletion in amhy promoter, have been suggested as being involved in GSD (Li et al. 2015). Thus, further study is needed to test this association and to explore the causative region in amh for SD.

Development of an accurate assay for genotyping of amhΔy

We developed a sensitive fluorescent assay for detecting the occurrence of a 5 bp insertion on exon 6 of amhΔy, and its absence in amh and amhy copies, yielding complete discrimination between XX, XY, and YY genotypes. Complete concordance was evident between the 233 bp deletion on exon 7 and the 5 bp insertion on exon 6, with increased accuracy for the latter. The realized fluorescent peak height ratios in XX, XY, and YY individuals were compatible with the expected dosage of X and Y chromosomes. Similar assays demonstrated identification of CNV (Seroussi et al. 2013). Thus, we used the 5 bp insertion on exon 6 of amhΔy as the marker of choice for GSD. Although the assay provides high but not complete prediction of GSD, it can be used for selection of individuals with desired amh genotypes for mating toward the production of all-male populations in different O. niloticus strains.

Concordance between amhΔy polymorphism and sex

The concordance rate between the 5 bp insertion on exon 6 of amhΔy and sex varied in samples from six O. niloticus strains and three strains that are currently maintained in Dor station (Table 4). Interestingly, Ghana strains showed full concordance, but all strains that originated from Egypt showed incomplete concordance. Unbalanced sex ratios in progeny of wild and domesticated O. niloticus, have been attributed to environmental effects such as temperature, or to polygenic SD (Bezault et al. 2007; Baroiller et al. 2009; Palaiokostas et al. 2015; Baroiller and D’Cotta 2016, 2018; Sissao et al. 2019). Additional studies suggested that specific variants of amh are sensitive to thermal effects (Wessels et al. 2014, 2017). Thus, it is possible that O. niloticus from an Egyptian origin carries variants of amh that are sensitive to temperature. In Israeli growing conditions, temperatures above 29 °C seem to prevent spawning, and thus may not be responsible for sex reversal (unpublished data). Furthermore, although Oreochromis strains have been observed even at extreme temperatures (Bezault et al. 2007; Nyingi et al. 2009), they were found to be genetically distant and morphologically different from those analyzed in the current study (Ndiwa et al. 2016).

Multiple loci affecting GSD in the Thai-Chitralada hybrid strain

In our study, we examined the effects of markers on LGs 1, 3, and 23 (amhΔy) in Thai-Chitralada families. Interestingly, these markers fully explained sex distribution in a Thai-Chitralada family (R square of unity). Nevertheless, only the effects of markers on LGs 1 and 23 were significant in a second Thai-Chitralada family, with partial R-squared values of 0.10 and 0.37, respectively, indicating the existence of additional unidentified loci that regulate SD. However, the significant correlation between the genotypes of both markers, although segregating in different LGs, undermines the validity of the statistical analysis, thus indicating that, in this family, the effect of LG1 is probably associated through its correlation to amhΔy on LG23. Previously, such co-segregation of markers has been attributed to sex-specific mortality (Shirak et al. 2002; Cnaani et al. 2008), or nonrandom segregation of chromosomes during meiosis, resulting in pseudo-linkage (Woram et al. 2004). The two alleles of 245 and 159 bp of genetic markers on LGs 1 and 3, respectively, seem to be linked to the genetic contributors of the effects on sex of Family 1 (Table 5), whereas absence of allele of 159 bp for UNH168 on LG3 in Family 2 may explain the loss of effect of this locus, or the distant location of the marker from the causative polymorphism (Table 6 and Supplementary File 1).

We have recently shown that O. mossambicus, O. aureus, and O. niloticus contributed to the formation of the Thai-Chitralada hybrid (Curzon et al. 2019). Thus, involvement of LGs 1, 3, and 23 in GSD of Thai-Chitralada could be explained by its progenitor species harboring these segregating loci (Lee et al. 2004; Liu et al. 2013; Shirak et al. 2018). Recent studies support the major role of amh in GSD of farmed O. niloticus (Cáceres et al. 2019; Taslima et al. 2020), although incomplete linkage between amhΔy and the deletion in the amhy promoter was also reported, suggesting that multiple segregating loci may be found in wild O. niloticus strains (Sissao et al. 2019). Nevertheless, the purity of these wild strains may be suspect in view of mitochondrial DNA heterogeneity (Rognon and Guyomard 2003), indicating a contamination of West African strains, possibly through the escape of Oreochromis hybrids from farms (Elder and Garrod 1961; Pruginin et al. 1975). Currently, breeding strains can be tested for their origin using the COI gene, and other assays developed specifically for Oreochromis species (Syaifudin et al. 2019; Lind et al. 2019; Curzon et al. 2019).

Conclusions

A sensitive fluorescent assay for the detection of a 5 bp insertion in amhΔy, and its absence in amh and amhy copies, efficiently discriminated between XX, XY, and YY genotypes. The association of amhΔy with sex was conserved in all tested O. niloticus strains, and thus supports its key role in GSD. However, the previously identified missense SNP (C/T) in amh exon II was found only in the Swansea strain, thus excluding its candidacy for the causal variation of SD across all strains. Thus, amhΔy on LG23 is associated with GSD, either as a single causal gene in different Oreochromis strains, or in combination with segregating genes on LGs 1 and 3 in the Thai-Chitralada hybrid strain.