Genomic selection for heterobothriosis resistance concurrent with body size in the tiger pufferfish, Takifugu rubripes

Lin, Zijie; Hosoya, Sho; Sato, Mana; Mizuno, Naoki; Kobayashi, Yuki; Itou, Takuya; Kikuchi, Kiyoshi

doi:10.1038/s41598-020-77069-z

Download PDF

Article
Open access
Published: 17 November 2020

Genomic selection for heterobothriosis resistance concurrent with body size in the tiger pufferfish, Takifugu rubripes

Zijie Lin¹,
Sho Hosoya¹,
Mana Sato¹,
Naoki Mizuno¹,
Yuki Kobayashi²,
Takuya Itou² &
…
Kiyoshi Kikuchi¹

Scientific Reports volume 10, Article number: 19976 (2020) Cite this article

2329 Accesses
12 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Parasite resistance traits in aquaculture species often have moderate heritability, indicating the potential for genetic improvements by selective breeding. However, parasite resistance is often synonymous with an undesirable negative correlation with body size. In this study, we first tested the feasibility of genomic selection (GS) on resistance to heterobothriosis, caused by the monogenean parasite Heterobothrium okamotoi, which leads to huge economic losses in aquaculture of the tiger pufferfish Takifugu rubripes. Then, using a simulation study, we tested the possibility of simultaneous improvement of parasite resistance, assessed by parasite counts on host fish (HC), and standard length (SL). Each trait showed moderate heritability (square-root transformed HC: h² = 0.308 ± 0.123, S.E.; SL: h² = 0.405 ± 0.131). The predictive abilities of genomic prediction among 12 models, including genomic Best Linear Unbiased Predictor (GBLUP), Bayesian regressions, and machine learning procedures, were also moderate for both transformed HC (0.248‒0.344) and SL (0.340‒0.481). These results confirmed the feasibility of GS for this trait. Although an undesirable genetic correlation was suggested between transformed HC and SL (r_g = 0.228), the simulation study suggested the desired gains index can help achieve simultaneous genetic improvements in both traits.

Shaping of microbial phenotypes by trade-offs

Article Open access 18 May 2024

Complexity of avian evolution revealed by family-level genomes

Article Open access 01 April 2024

The killifish germline regulates longevity and somatic repair in a sex-specific manner

Article 15 May 2024

Introduction

Selective breeding is potentially able to boost aquaculture efficiency, now the fastest-growing food production industry¹. In particular, pedigree-based breeding methods have contributed to aquaculture development by improving economically important traits, as seen in the salmonids and tilapias^2,3,4. However, pedigree-based methods have innate drawbacks where it is assumed estimated breeding values (EBVs) of target traits for candidate individuals are the average breeding values of parents, ignoring Mendelian segregation within families⁵. Thus, pedigree-based methods can not differentiate EBVs among full sibs and large-scale pedigrees were needed to evaluate breeding values. On the other hand, molecular markers can be effective in handling Mendelian sampling by capturing genetic variance at DNA levels, i.e., full sibs could have different EBVs. By harnessing whole-genome high-density markers and advanced regression methods, Meuwissen et al.⁶ proposed genomic selection (GS) to estimate the genomic estimated breeding values (GEBVs) of selection candidates. Thanks to the recent advances in genotyping by sequencing technologies, it is now affordable to genotype genome-wide single nucleotide polymorphisms (SNPs) for GS aquaculture breeding programs⁷. As expected, the greater performance of GS over the pedigree-based method in prediction and inbreeding control has been demonstrated by empirical studies using cultured fish populations^8,9.

The tiger pufferfish Takifugu rubripes is a delicacy in Japan and is one of the most valuable marine fish species in Japanese aquaculture, ranking fourth in production value among cultured finfish¹⁰. To fulfill the growing demand for this species, selective breeding will be a practical approach to boost farming efficiency; however, tiger pufferfish aquaculture has not yet fully applied this technology^11,12. Apart from growth-related traits, disease resistance should be highly valued in the breeding program, as disease outbreaks easily hamper the aquaculture industry. For instance, heterobothriosis, the gill disease caused by a monogenean parasite Heterobothrium okamotoi, severely threatens tiger pufferfish productivity and welfare¹³. The most severe infectious occurs at early phases of production, just after transfer from land-based hatcheries to sea cages^14,15. These naïve juveniles are afflicted by the parasite, persistently present at oceanic aquaculture sites, resulting in retarded growth and high mortality rate¹⁶. While the mechanisms of host immune system response to H. okamotoi are still unclear^17,18, host resistance to heterobothriosis is considered polygenic¹⁹ and it is difficult to apply marker-assisted selection, which has worked well in infectious pancreatic necrosis resistant Atlantic salmon²⁰. Recently, the potential of GS for disease resistance has been demonstrated in farmed populations of Atlantic salmon (Salmo salar)^21,22, rainbow trout (Oncorhynchus mykiss)⁹, European sea bass (Dicentrarchus labrax)²³, and gilthead seabream (Sparus aurata)²⁴. As most of the disease resistance traits have moderate or high heritability in fish species^{21,22,23,24,25}, GS can also be applied to facilitate heterobothriosis resistance in the tiger pufferfish.

Selecting one quantitative trait may improve or diminish others due to the genetic pleiotropy and/or linkage disequilibrium²⁶. For example, the breeding program which improves the resistance to sea lice possibly diminishes growth-related traits in farmed Atlantic salmon²⁷. Likewise, improving resistance to H. okamotoi may negatively affect growth-related traits in the tiger pufferfish¹¹. Thus, simultaneous genetic improvement of resistance to heterobothriosis and body size is most desirable for aquaculture of the tiger pufferfish, although complicated by traits with antagonistic genetic correlations. One of the conventional methods for multiple-trait improvement is the linear selection index (LSI) method developed by Smith and Hazel^28,29. Net genetic merit (i.e., LSI) of each animal is calculated from each target trait and used for ranking breeding candidates. To maximize the selection response, a general LSI is computed by a linear combination of phenotypes or EBVs and the corresponding coefficients. Extensive LSI methods have been proposed³⁰, as determined by the method of coefficient calculation. For instance, the desired gain selection index allows breeders to restrict traits according to the expected change of genetic gain values of traits³¹. In the era of GS, those LSI methods can be directly applied to compute the linear genomic selection index (LGSI), which showed higher efficiency in both simulation and real data, compared to pedigree-based LSI³². Although LGSI showed great advantages, successful applications of LGSI still largely depend on the accurate estimation of GEBVs and genetic parameters³³, which are sensitive to many factors, including the genetic architecture of target traits, population structure, genotyping technologies, etc.^34,35,36. Consequently, an LGSI method might have different performances in different cases. Therefore, it is essential to find the optimal strategy incorporating LGSI and examine its performance in each breeding program. The GS breeding simulator will be a practical tool that approximates the real genetic progress by sophisticated modeling of the meiosis and GS procedure at the DNA level³⁷. Further, as regards selection targeting disease resistance traits in aquaculture, a recent simulation study of acute hepatopancreatic necrosis disease (AHPND) in shrimp (Litopenaeus vannamei) showed that GS was superior to pedigree-based methods³⁸. Therefore, with the assistance of simulation, the breeding strategies incorporating LGSI are expected to greatly accelerate the simultaneous genetic improvement of disease resistance and growth-related traits.

In this study, we tested the possibility of GS to improve heterobothriosis resistance of the tiger pufferfish and designed a GS breeding strategy that could improve the resistance trait concurrent with growth-related traits. We initiated artificial infection on cultured tiger pufferfish obtained from wild parents, applied genome-wide association studies (GWAS), and genetic parameter estimation to survey the genetic architecture of target traits. We then examined the possibility of genomic prediction (GP) for both traits by applying 12 different prediction models. Finally, we investigated the optimal breeding strategy incorporating LGSI using a simulation study by comparing six breeding scenarios.

Results

Phenotypes

We produced test fishes by artificially crossing 11 wild males and 10 wild females, and subjected 240 4-month-old individuals to an artificial infection for 37 days. Heterobothriosis resistance was evaluated by counting the number of parasites attached to the branchial cavity walls (HC), and the standard length (cm) was measured on each fish (SL). The phenotypic mean was 15.85 (± 9.15 S.D.) for HC and 9.83 (± 0.78 S.D.) for SL (Fig. 1 and Supplementary Table S1). As the plot shows, the distribution of HC was non-normal (Shapiro–Wilk test: p = 3.79 × 10^–6, alpha level = 0.05) while SL approximated a normal distribution (Shapiro–Wilk test: p = 0.406, alpha level = 0.05). Therefore, we applied a square-root transformation on (HC + 1), approximating a normal distribution (Shapiro–Wilk test: p = 0.235, alpha level = 0.05). Transformed HC was used in the following genetic analysis. Weak but significant phenotypic correlation was observed between HC and SL (Pearson’s r analysis: r = 0.157, p = 0.015; 95% confidence interval: 0.031 ≤ r ≤ 0.278).

Genotyping

We genotyped genome-wide SNPs of each individual using AmpliSeq³⁹. The MiSeq sequencing generated an average of 174,870 (± 83,576 S.D.) raw reads per fish. After the quality-trimming step, the mean number of reads for each fish was 161,426 (± 83,576 S.D.) with the mean read length of 124 bp. The survived reads were mapped onto a reference fugu genome (FUGU5/fr3) for SNP calling. Following the quality filtration of SNPs, 6718 putative SNPs were yielded. Missing SNPs were imputed using LinkImputeR⁴⁰. At this imputation step, 11 SNPs were discarded and 6707 imputed SNPs were called for each individual with the imputation accuracy of 0.888.

Population structure

Population structure, which can bias the genetic parameter estimation, was examined by t-SNE analysis⁴¹ based on SNP data (Fig. 2). As seen in the plot, we did not observe clear clusters or strong stratification within the tested samples.

Heritability and genetic correlation

To investigate the extent of genetic effects on the phenotypic variation, heritability was estimated by a multivariate linear mixed model. Moderate heritability was observed for each trait (transformed HC: h² = 0.308 ± 0.123 S.E.; SL: h² = 0.405 ± 0.131). With the same model, the strength of the genetic correlation between the transformed HC and SL was also estimated. We detected a moderate antagonistic genetic correlation (r_g = 0.228), where large individuals were suffering from higher parasitic loads. This genetic correlation could be due to the phenotypic correlation, although phenotypic correlation between HC and SL was weak as described above. Therefore, we tested correlation between GEBV for each trait using a univariate linear model (i.e. GBLUP); to predict GEBV for HC, SL was included as the covariate to minimize non-genetic effects from SL. If genetic correlation exists between the two phenotypes, the GEBVs would also show a correlation. We found positive correlation (Pearson’s r = 0.252, p = 7.67 × 10^–5). This supports genetic correlation between the two traits.

Genome-wide association study (GWAS)

GWAS was applied to detect loci highly associated with transformed HC and SL (Fig. 3). Although none of these loci exceeded the significance threshold of 5.13 (= – log₁₀ (0.05/6707)), SNPs with relatively high association were found in the chromosome 1, 6 and 9 for HC (– log₁₀(p) = 3.48, 3.46 and 3.95, respectively) and in the chromosome 8 and 12 for SL (– log₁₀(p) = 3.58 and 3.51, respectively).

Model comparison of genomic prediction

To examine the availability of GS for HC and SL, we applied 12 regression models: i.e., GBLUP, Bayes A, Bayes B, Bayes C, Bayes Ridge, Bayes LASSO, Bayesian reproducing kernel Hilbert space (Bayesian RKHS), support vector machine regression with a linear kernel (SVR-linear), SVR with a poly kernel (SVR-poly), SVR with a radial basis function kernel (SVR-rbf), feedforward neural networks (FNN), and multi-task feedforward neural networks (multi-task FNN). We compared predictive ability defined as Pearson’s r between the GEBVs and observed phenotypes by means of a tenfold cross-validation scheme. Predictive abilities for transformed HC ranged from 0.248 to 0.344 under 12 models (Table 1). Among these models, SVR-poly and SVR-rbf models were inferior, while two deep learning models were slightly better. On the contrary, the two SVR based models ranked at the top for prediction of SL, and deep learning models were inferior. Bayes RKHS and GBLUP models showed good performance in both traits.

Table 1 Predictive ability (mean ± standard error) on Heterobothrium okamotoi count (HC) and standard length (SL) under 12 models: GBLUP, Bayes A, Bayes B, Bayes C, Bayes LASSO, Bayes reproducing kernel Hilbert space (Bayes RKHS), support vector machine with a linear kernel (SVR-linear), SVR with a poly kernel (SVR-poly), SVR with a radial basis function kernel (SVR-rbf), feedforward neural networks (FNN), and multi-task feedforward neural networks (multi-task FNN).

Full size table

Simulation

The selection of one trait can have a negative impact on others when an unfavorable antagonistic correlation exists between traits. In this study, we tested the availability of LGSI methods for simultaneous improvements of HC and SL, assuming a genetic correlation estimated above (r_g = 0.228), using simulation studies. We simulated six scenarios each different in selection schemes, i.e. random mating (RAND), GS on HC only (GS_HC), GS on SL only (GS_SL), selection applying Smith-Hazel index^28,29 (S1_SHI and S2_SHI, different in economic weights) and desired gains index³¹ (S_DGI), for 10 generations with 50 replications (Fig. 4). In short, RAND was based on random mating while GS_HC and GS_SL were based on GS on either of the traits. GEBV was estimated by GBLUP. In S1_SHI, selection candidates were ranked based on the Smith-Hazel index. Since economic importance for each trait has not been evaluated in the tiger pufferfish aquaculture industry, we assume both traits have equal economic weights, which is w = [− 1, 1] for HC and SL for S1_SHI (HC is expected to decrease by selection). For S_DGI, d was set as [− 3, 0.3] for HC and SL, so that SL can be improved preferentially while HC can be reduced by 30% after 10 generations (− 3 * 10/100 = − 30%). To compare the two selection index methods, we also ran an additional scenario (S2_SHI) based on Smith-Hazel index, where the economic weight for each trait was set the same as the designed weights of S_DGI (w = [− 3, 0.3]). As expected, only S_DGI could improve the two traits simultaneously, where true breeding values (TBVs) of parasite load (HC) decreased while SL increased in each generation (Fig. 5).

Discussion

In this study, we tested the possibility of GS for genetic improvements in heterobothriosis resistance of the tiger pufferfish from empirical data and conducted a simulation study to design a GS breeding strategy that could improve the resistance trait concurrent with growth-related traits. Overall, our results suggest GS for the parasite resistance trait is feasible (predictive ability = 0.248‒0.344) and breeding strategy incorporating the DGI method can simultaneously improve both HC and SL, even though an unfavorable antagonistic genetic correlation was suggested (r_g = 0.228).

With 6707 SNP makers, moderate estimated heritability of transformed HC (h₂ = 0.308, SE = 0.123) and SL (h₂ = 0.405, SE = 0.131) were obtained, indicating selective breeding for those traits is feasible. The estimated heritability was comparable to those estimated for resistance against sea lice in Atlantic salmon (h₂ = 0.22 to 0.33 with 35 k SNPs)²² and bacterial cold water disease resistance (survival days) in farmed rainbow trout (h₂ = 0.33 with 35 k SNPs)⁹. This suggested our small SNP panel could successfully capture the genetic variance for HC in the tiger pufferfish. In this study, we could not detect significant SNPs from GWAS. Even with the small SNP panel and small sample size, strong effect SNPs (the sex-determining SNP) could be detected in a cultured population of the tiger pufferfish³⁹. Therefore, our GWAS result suggests the parasitic resistance is controlled by a large number of quantitative trait locus (QTL) with small or moderate effects, and marker-assisted selection is not feasible. This result is consistent with the previous QTL analysis using the interspecies hybrid system of pufferfishes¹⁹. Although the effect was not significant, genes neighboring the SNP with highest – log₁₀(p) values on chromosome 9 (12,024,615 bp) deserve further investigation, because the genomic region including this site was reported to have a small QTL effect on host specificity of H. okamotoi¹⁹.

The predictive abilities for HC estimated under 12 models were moderate (0.248‒0.344), and within the range observed for other disease resistance traits examined in other fish species^21,23,24,42. The predictive abilities of Bayesian hierarchical linear models (i.e. Bayes A, B, C, LASSO, and Ridge) were similar (0.303‒0.312) and scarcely higher than the GBLUP model (0.307 ± 0.018) for HC. This suggests that these linear models did not greatly differ regarding the predictive ability and the assumptions of the prior distribution of genetic effects have a limited impact on this trait. Bayes RKHS showed slightly better performance in HC compared to these linear models. For SVR-poly and SVR-rbf models, relatively low abilities for HC were observed, however, high abilities were found for SL. Since the default hyperparameters were used in the SVR models, hyperparameter tuning may aid achievement of better performance for HC as in the case of the previous study⁴³. The architectures of FNN and multi-task FNN were tuned to achieve high predictive ability of GS for HC, however, the same architecture was applied to calculate the predictive ability of GS for SL. As expected, these models resulted in high predictive ability for HC but low for SL. This indicates that a deep learning model is task-specific and high accuracy can be obtained with careful optimization as described previously⁴⁴. However, a great improvement in predictive ability was not achieved by FNN methods compared to GBLUP and Bayesian models even with the model complexity.

Our simulation study showed the availability of DGI for simultaneous genetic improvement in HC and SL even when the unfavorable antagonistic genetic correlation was assumed. The two scenarios incorporating the Smith-Hazel index showed the undesired consequences, where the average TBV for both SL and HC increased (smaller HC is favored). This happened because the breeding scheme only selected the individuals with the top LGSI values, but the high LGSI calculated by the Smith-Hazel index method does not guarantee the selected individuals are superior in both of the traits⁴⁵, especially when target traits show a negative correlation. On the other hand, DGI, a variation of the selection index methods, allows selection with restrictions on multiple traits via the desired gains vector (d). In this study, the d vector was set with intending to reduce HC by 30% during 10 generations while maximizing SL. The desired gains vector (d) can be further optimized by comparing simulation scenarios with various d to achieve the self-defined breeding goal. Unfavorable genetic correlation between body size and disease resistance is commonly observed in aquaculture species, e.g. vibriosis in Atlantic cod⁴⁶, bacterial cold water disease in rainbow trout⁴⁷, and piscirickettsiosis in coho salmon⁴⁸. Therefore, it is expected that DGI or the similar LGSI method can be widely applied for the simultaneous improvement of disease resistance trait and growth-related traits, which are the primary targets of most breeding programs.

In summary, the availability of GS for HC and SL was confirmed in this study. Moderate heritability for both traits suggests the genetic return from GS is high. GBLUP and Bayesian linear regression models showed similar predictive abilities for these traits. Although an unfavorable antagonistic genetic correlation was suggested between the two traits, the GS breeding strategy incorporating DGI can be a solution for the simultaneous genetic improvement.

Methods

Sample fish

The empirical experiments were performed in the Fisheries Laboratory, University of Tokyo (Hamamatsu, Shizuoka Prefecture, Japan). All samples (n = 240) were generated by a full-factorial mating among 10 wild males and 11 wild females, which were commercially caught from Wakasa Bay (Fukui Prefecture, Japan). For the mating, artificial fertilization was applied following the previous study¹² with minor modification. In brief, females were anesthetized with 200 mg/l of 2-phenoxyethanol and then ripened by injection of 150 µg/kg of luteinizing hormone-releasing hormone (LHRH, Sigma-Aldrich, St. Louis, MP, USA). Gametes were stripped from each individual and fertilized per male–female pair (110 pairs in total). Fertilized eggs of each maternal half-sib family were mixed and kept in a hatching jar. After hatching, each maternal half-sib was kept in a holding tank for 1 month and then all families were mixed and cultured in a three-ton communal tank. Rearing and feeding conditions were set as previously described¹⁰. At 4 months age, 240 fish were randomly collected and subjected to the artificial challenge test.

Artificial infection and phenotyping

Artificial infection was done following previous studies^12,49. A day before the infection, fish were equally distributed into three identical one-ton experimental tanks (80 individuals/tank) supplied with H. okamotoi-free fresh seawater (UV treated and filtered). Meanwhile, eggs of H. okamotoi were collected from tanks containing infected fish and kept in a glass jar containing fresh seawater until infection. Hatching was induced by physical stimulation (shaking at 140 rpm for 10 min) and the density of oncomiracidia, the free-living larvae of H. okamotoi, in the suspensions was determined under the microscope just before the infection. At infection, the water depth of experimental tanks was adjusted to 15 cm, and approximately 4000 oncomiracidia was introduced into each tank. At 3 h post-exposure, fish were transferred into three, newly-setup one-ton holding tanks and reared for 32 days, when H. okamotoi reaches maturation and moves to the branchial cavity walls (BCW)¹³. At the 32-day mark, fish were euthanized, measured for SL and the BCWs dissected from both sides. For each fish, the caudal fin was clipped and kept in 600 µl TNE8U buffer⁵⁰ (10 mM Tris–HCl (pH 7.5), 125 mM NaCl, 10 mM EDTA, 1% SDS, 8 M urea) at room temperature to extract genomic DNA for genotyping. Collected BCWs tissues were kept in 10% formalin until counting the number of parasites. The parasites attached to the whole BCWs were counted under a stereo microscope. The host resistance against H. okamotoi is assessed by parasite count on the entire BCWs (HC).