Genetic differences in host infectivity affect disease spread and survival in epidemics

Survival during an epidemic is partly determined by host genetics. While quantitative genetic studies typically consider survival as an indicator for disease resistance (an individual’s propensity to avoid becoming infected or diseased), mortality rates of populations undergoing an epidemic are also affected by endurance (the propensity of diseased individual to survive the infection) and infectivity (i.e. the propensity of an infected individual to transmit disease). Few studies have demonstrated genetic variation in disease endurance, and no study has demonstrated genetic variation in host infectivity, despite strong evidence for considerable phenotypic variation in this trait. Here we propose an experimental design and statistical models for estimating genetic diversity in all three host traits. Using an infection model in fish we provide, for the first time, direct evidence for genetic variation in host infectivity, in addition to variation in resistance and endurance. We also demonstrate how genetic differences in these three traits contribute to survival. Our results imply that animals can evolve different disease response types affecting epidemic survival rates, with important implications for understanding and controlling epidemics.

Infectious disease presents an enormous threat to animal and human populations, with epidemic outbreaks often causing high mortality due to insufficient disease control strategies. Over the last decades genetic and genomic studies have produced compelling evidence of substantial genetic variability in host response to infectious agents, potentially affecting epidemic risks and survival [1][2][3][4][5][6][7][8] . Nevertheless, remarkably little is known about how the genetics of individuals affects survival and disease spread at a population level. Quantifying the host genetic contribution to epidemic risk and severity remains a long-standing challenge in infectious disease research [9][10][11] .
Quantitative genetic studies tend to consider disease resistance as the primary and often sole host target trait for genetic disease control. The definition of disease resistance varies across studies depending on the disease in question and the knowledge of the host response mechanism exhibiting genetic variation 1 . However, due to large sample size requirements in quantitative genetic studies, disease resistance is often defined by individual mortality, as it is easy to observe whether and when an individual exposed to infection dies. But survival is multifaceted, and may not only depend on the individual's resistance to becoming infected or diseased when exposed to infectious material, but also on its endurance, i.e. its ability to survive after becoming infected or diseased [12][13][14] . Endurance encompasses tolerance, which refers to the ability of an infected host to reduce the fitness consequences of infection 15,16 . Moreover, recently emerging evidence for the superspreader phenomenon, where a small proportion of highly infectious individuals are responsible for the majority of transmission, has raised awareness that individual variation in infectivity can also hugely influence population mortality rates [17][18][19][20][21][22] . Unsurprisingly, interest in the genetic regulation of infectivity, which can be defined as the host ability to infect an average susceptible individual upon unit contact, is increasing rapidly. Understanding the genetic regulation of infectivity is particularly pertinent if there are unfavourable genetic correlations between this trait and resistance or endurance components 20,23 . Such unfavourable genetic correlations could for example arise if individuals with greater genetic endurance cannot only cope better with infection but also tend to harbour and thus shed more infectious pathogens. It could also result from a trade-offs between resource demanding immune processes, such 1 the Roslin institute and Royal (Dick) School of Veterinary Studies, University of edinburgh, edinburgh, UK. 2 centro tecnológico del cluster de la Acuicultura (cetGA), A coruña, Spain. 3 Departamento de Mejora Genética Animal, iniA, Madrid, Spain. 4 institute of Mathematical and computer Sciences, University of São Paulo, São carlos, Brazil. correspondence and requests for materials should be addressed to O.A. (email: osvaldo.anacleto@gmail.com) as e.g. those associated with inhibiting infection and those reducing within-host pathogen replication and thus shedding of infectious material 24 .
Resistance, endurance and infectivity may be regulated by different sets of genes with varying contributions, both in direction and in degree, to overall survival 12,25 . However, no study to date has simultaneously investigated these three traits. Whereas a plethora of quantitative genetic studies has demonstrated genetic variation in resistance 1,2,[26][27][28][29] , relatively few studies have demonstrated genetic variation in disease endurance [12][13][14] . No study to date has demonstrated genetic variation in host infectivity. Since genetic control strategies are environmentally friendly, non-intrusive and long-term preventive 30 , a more precise understanding of the genetic mechanisms underlying disease outbreaks and resulting mortality rates is urgently needed.
In this study, which considers infectious diseases with potentially fatal outcomes, we define disease resistance as an individual's propensity for developing disease following contact with an infectious individual or material of average infectivity, and endurance as the propensity of a diseased individual to survive. In line with these definitions, measurements of time of onset of disease and time from onset of disease to death in disease challenge experiments can disentangle resistance from endurance 12 . Endurance is closely related to tolerance, which refers to the propensity of a host to reduce fitness loss under infection 15,16,31 , and which is more commonly considered as an alternative host defence mechanism to resistance 32,33 . However, operationally, tolerance is typically defined as the slope of a regression of host fitness against pathogen burden, and thus requires measures of pathogen burden 34-36 which were not available in this study.
In the context of natural disease transmission, time of onset of disease and time from onset of disease to death are no longer just affected by the genetics of the individuals in consideration. Instead they result from interactions between infected and non-infected individuals. Hence, time of onset of disease and time of death due to infection may no longer only depend on an individual's own resistance and endurance genes, but also on the infectivity genes of infected individuals in the same contact group. This complex genetic interaction calls for particular experimental designs and statistical models that can reliably disentangle and estimate genetic effects for all three epidemiological host traits.
Here we show how transmission experiments can be designed and analysed to identify genetic variation and disentangle genetic effects in resistance, endurance and infectivity. We used infection by Philasterides dicentrarchi in turbot fish (Scophthalmus maximus) as a model system. P. dicentrarchi is a ciliate parasite which is the causative agent of scuticociliatosis, a common cause of mortality in turbot farming. Until recently, little was known about host genetic control of P. dicentrarchi infections. A QTL mapping study carried out using a controlled intracelomical challenge infection of turbot with a fixed high dose of virulent P. dicentrarchi strain, identified several genomic regions controlling dichotomous survival at the end of the experiment (28dpi) and survival time of infected fish, suggesting that survival is under genetic control 37 . A companion paper analysing the same dataset as used in this study further identified genomic regions associated with endurance 12 . Our resulting data not only allow quantitative genetic studies of infectious disease traits, but also enable evaluation of the relative contribution of all three traits resistance, endurance and infectivity to survival. By analysing family differences in these traits, we demonstrate, for the first time, that there is genetic variance in infectivity, in addition to resistance and endurance, and how genetic differences in host infectivity affect disease spread and survival.

Results
Designing disease transmission experiments to simultaneously identify genetic variation in host resistance, infectivity and endurance. Transmission experiments such as that depicted in Fig. 1 provide data to identify genetic regulation of resistance, infectivity and endurance to disease. Our experimental design meets the contrasting requirements for studying genetic regulation of each trait in isolation. In outbred populations, detection of genetic variation in host resistance and endurance with respect to a particular pathogen can be deduced from differences in response to infection between genetically different individuals, thus requiring individuals with different degree of genetic relatedness, ideally exposed to or infected with the same pathogen strain or dose. Genetic variation in these traits can then be detected through differences in time to disease and time from disease to death, respectively 12 . On the other hand, infectivity of a host impacts the disease status of its group mates rather than its own disease status. Because of this, and due to the difficulty in tracking who acquired infection from whom during an epidemic, host variation in this trait can be inferred by observing the speed at which the disease is naturally transmitted to susceptible individuals 38 . This can only be achieved by allocating genetically related infected individuals (shedders) across many contact groups containing susceptible individuals (recipients), such that recipient time to onset of disease can be observed (Fig. 1). The groups should be closed such that transmission only occurs between shedders and recipients within the groups, therefore restricting the sources of infection and thus reducing uncertainty in infectivity estimates. Furthermore, in contrast to resistance and endurance experiments, smaller groups are preferable to larger groups to minimise the confounding of the expression of infectivity from shedders and recipients 39,40 .
Genetic differences in host infectivity based on shedder family information can be detected by analysing recipient time to onset of the disease. However, the risk of transmission and, consequently, time to onset of the disease depend not only on host infectivity but also on the resistance of recipient individuals 41 . Moreover, infected recipients may also become sources of transmission in closed groups. Therefore, to detect genetic differences in shedder infectivity, recipients exposed to different shedder families must have, on average, similar levels of resistance and infectivity. Then, a comparison of the speed of the transmission within recipients across shedders from different families allows detection of genetic variation in host infectivity. In particular, infection is expected to spread faster in groups of recipients exposed to individuals from a more infectious shedder family.
To obtain empirical data to study genetic variation in resistance, endurance and infectivity, we performed a Philasterides dicentrarchi infection experiment in two large cohorts of turbot fish (total n = 1,800) from 44 full-sib families (see Fig. 1). The visually conspicuous infection signs caused by this parasite enable non-invasive tracking of the epidemics in real-time [42][43][44] (see Methods), which is usually difficult to observe in transmission trials. We distributed shedder fish artificially inoculated with the parasite across 72 isolated tanks, such that these fish transmitted the disease to previously non-infected recipient fish (see Methods).
Each tank comprised five shedder fish from the same family and 20 recipient fish from four different families, forming a recipient family composition (Fig. 1). This group size and composition allowed to meet requirements for reliably detecting genetic differences in all the three disease traits considered in our study. Particularly, in line with results from previous theoretical studies, the small number of fish in tanks with families distributed across them allows reliable detection of differences in infectivity, which is an indirect genetic effect 39,45 . Also, the large sample size comprising multiple offspring from many families favours the detection of the direct genetic effects in recipient resistance and endurance 46,47 (see Methods). As environmental conditions were controlled (see Methods), the only sources of variation in tanks were recipient family composition and shedder family. Family composition in tanks was designed to detect genetic variance in host infectivity while having a suitable experimental setup to verify genetic differences in resistance and endurance. To achieve this, we exposed each of four shedder fish families to the same nine recipient family compositions, in each of two trials of the experiment (see Methods). With this, infection was expected to spread consistently faster in recipient groups exposed to the most infectious shedder family, thus avoiding confounding between shedder family infectivity and genetic resistance and infectivity of recipient fish. Also, we included each recipient family in two recipient family compositions ( Fig. 1, see Methods). This way, each recipient family was exposed to all shedder families, which avoided confounding between recipient family resistance and shedder family infectivity. The recipient families were also exposed to sufficiently large number of fish from different families, ensuring reliable detection of differences in resistance and endurance between recipient families. Therefore, the resulting design allowed not only detection of genetic variation in infectivity through differences in disease onset profiles of recipient fish pooled by shedder families, but it also enabled detection of genetic variation in resistance and endurance by comparing different profiles for onset of disease and infection-induced death, respectively, pooled by recipient family.
Disease and survival data from the experiment were collected by inspecting all fish twice a day over the duration of the experiment for visual signs of infection (see methods) and for mortality. Fish typically displayed colour change and/or exophthalmos as the first visual sign, and within the same day several signs 42 . For each fish displaying visual infection signs, the date of first detection of any one or combination of signs was recorded, denoted hereafter as 'time to signs' . Similarly, for fish that had succumbed to the infection, the date and time of detected mortality was recorded. Whilst measurements of time to signs allow genetic analysis of resistance of recipient fish and infectivity of shedder fish, measurements of the time from signs to death provide information on genetic variation in endurance. (b) Transmission experimental design in one of the trials. Each grey circle corresponds to one tank and a unique symbol-colour combination is assigned to each recipient (R i ) or shedder (S i ) family in 36 tanks in one of the two trials of our experiment. The 4 shedder families (S 1 to S 4 ) were distributed across nine tanks each. Recipient families were housed in tanks such that nine recipient family combinations were created, which in turn were housed with each of four shedder families. Each recipient family was allocated in two recipient family compositions.

Turbot fish infected by Philasterides dicentrarchi can be accurately identified through visual signs.
Shedder fish successfully transmitted the parasite to recipients. In our experiment, 766 out of 1417 recipient fish showed signs of infection and Philasterides dicentrarchi was detected by post-mortem inspection in 94% of these fish ( Supplementary Fig. 1). Although the parasite was detected post-mortem in 34% of recipients with no visual signs of infection (see Discussion), high true positive proportions (sensitivity) and true negative proportions (specificity) when considering these signs as a diagnostic test suggest that they are reliable indicators of presence of Philasterides dicentrarchi in fish (Table 1, also see Methods). Infection by Philasterides dicentrarchi was the only cause of natural death during the trials, and 945 out of 1417 recipient fish died during the experiment.
Analysis of time-to-event measurements associated with fish onset of disease and subsequent survival. It took longer for the recipients in the experiment to develop disease signs than to die following onset of signs, with large variation in time to signs and subsequently also in time to death across trials (median time to signs was 43 and 110 days for trials 1 and 2 respectively, see also Fig. 2). This large variation between trial 1 and 2 might be caused by lower virulence of the pathogen strain in trial 2, as different isolates of P. dicentrarchi can significantly affect pathology and mortality rates 48 . This is also supported by the observation that the parasite was not detected in the post-mortem examination of some of the shedder fish in trial 2 (see Discussion). Correspondingly, trials 1 and 2 lasted respectively 104 and 160 days (see Methods). There was however similar within-trial variation in time to signs and time from signs to death, suggesting that the same traits were measured despite the different pathogen virulence in both trials. (Fig. 2). Fish from both trials died quickly and with low variation following onset of signs (Fig. 2). Median times from signs to death were 8 and 7 days for recipient fish from trials 1 and 2 respectively. Also, the large recipient variation in time to signs that we could not observe in time from signs to death can be explained by factors associated with tank composition as described below.
Long recipient survival post exposure was strongly associated with long time to signs (Kendall correlation coefficients 0.70 and 0.74 for trials 1 and 2 respectively, P < 0.001, see Fig. 3), but weakly correlated with time from signs to death (Kendall correlation coefficients are 0.20 (P < 0.001) and −0.01 (P = 0.50) for trials 1 and 2, respectively). We also found a weak positive relationship between recipient time to signs and time from these signs to death (Kendall correlation coefficients −0.19 (P = 0.003), and −0.34 (P < 0.001) for trials 1 and 2, respectively, see also Fig. 3). These results suggest that factors controlling the onset of disease have a stronger influence on survival than factors driving within host disease progression.  www.nature.com/scientificreports www.nature.com/scientificreports/ Fish family variation in disease resistance and endurance. We found significant differences across the 36 recipient fish families in time to signs (log-rank test: P < 0.001 for both trials) but small recipient family variation in time from signs to death (log-rank test: P = 0.053 and P = 0.084 for trial 1 and 2, respectively, also see Fig. 4). These results may indicate large genetic variation in resistance but small variation in endurance to the disease. Large and low variation for, respectively, time to signs and time from signs to death were also found across tanks (Supplementary Fig. 1).

Effect of shedder family infectivity on onset of disease and subsequent survival. Shedder fish
family had a strong effect on recipient time to signs but little influence on time from these signs to death ( Fig. 2 and Table 2). To quantify the effect of shedder family on recipient infection and survival, we fit generalized linear mixed models (GLMMs) to daily counts of recipient fish with visual signs and recipient fish that died due to infection (see Methods). When applied to discrete time-to-event data and assuming a baseline hazard for each day, these models can estimate hazard ratios between different shedder families while controlling for other factors of the transmission experiment 49,50 . Recipients exposed to the most infective shedder families (C and F for trials 1 and 2 respectively, see Fig. 5a,b) got diseased at approximately twice the rate of recipients exposed to the least infective shedder families (B and G for trials 1 and 2 respectively, see Fig. 5a,b; hazard ratio estimates: trial 1 posterior mean: 1.80, 95% CI: 1.37, 2.23; trial 2 posterior mean: 2.10, 95% CI: 1.60, 2.75). As a further illustration of the effects of genetic differences in shedder infectivity, we found that the relative hazard of a recipient showing disease signs when exposed to the most vs the least infective shedder family was greater than 1.5, with Bayesian posterior probabilities 0.85 and 0.98 for trials 1 and 2, respectively. In contrast, shedder family infectivity did not significantly affect recipient time of death following disease (hazard ratio estimates: trial 1 posterior mean: 1.27, 95% CI: 0.88, 1.84; trial 2 posterior mean: 0.98, 95% CI: 0.76, 1.21; Bayesian posterior probabilities that recipient hazard ratio of death following disease between most and least infective shedder fish family is at least 1.5: <0.005 for both trials).

Effect of recipient family composition on onset of disease and subsequent survival.
To evaluate the effect of the genetic composition of the tank members (accounting for combined differences in recipient resistance, infectivity and endurance) on disease and survival times, random effects representing recipient family composition were considered in all models (see Methods). Variance component estimates for recipient family composition are much larger in the models for time to onset of visual signs than in models for time between www.nature.com/scientificreports www.nature.com/scientificreports/ infection signs and death ( Table 3), suggesting that the genetic composition of tank members played a strong role in disease spread but had little influence on survival post infection.

Discussion
Host genetics has yet not been fully exploited to reduce the impact of infectious disease in animal populations. It is therefore crucial to understand and disentangle the effects of genetic diversity in different host defence mechanisms to disease. Here we presented a novel transmission experimental design which allows simultaneous genetic analyses of host resistance, endurance and infectivity. Temporal epidemic data generated by applying our experimental design in an infectious disease model using family-structured fish allowed to dissect the different sources of genetic variation on disease prevalence and subsequent mortality. This contrasts with most genetic studies of infectious disease which focus on disease resistance alone, and often use binary mortality data at the end of the epidemic, thus not fully capturing the dynamic nature of infectious disease [51][52][53] .
Our study provided the first direct evidence that there are genetic differences in host infectivity and that the genetic make-up of a host can largely affect the survival of its group mates by affecting their risk of becoming diseased. The novel genetic differences found in host infectivity significantly explain variation in time to the appearance of visual infection signs, which in turn was found to be strongly related to survival. Host ability to transmit infections may also depend on the rate of contact between hosts or their excreted infectious material with susceptible individuals and on duration of the infection period 41,54 . The mode of the transmission of Philasterides  www.nature.com/scientificreports www.nature.com/scientificreports/ dicentrarchi through contamination of water suggests constant rate of contact between susceptible fish and the parasites shed by hosts (see Methods). Moreover, we could not find significant family differences in shedder time to death in both trials ( Supplementary Fig. 2). This suggests that differences in infectivity are not simply explained by the fact that more infectious individuals shed longer nor that they are more tolerant to the disease. In addition, although higher infectivity can be manifested by shedding higher quantity or more virulent infectious material which may correspond to a higher infection dose for recipient fish 55 , there is no evidence for this in our study, as shedder family did not have a significant effect on recipient time from signs to death. Therefore, apart from the detected genetic variation in infectivity, we could not identify other significant epidemic factors that may explain variation in the ability to transmit the disease.
Our findings suggest that genetic variation in host resistance is larger than genetic variation in endurance, and are in line with previous genetic studies of these traits 12,13 . However, the results may also partly depend on the trait measures used in our study. Since time to infection can usually not be observed under natural transmission, we analysed resistance to onset of the disease, which may be different from resistance to becoming infected. Not developing disease after infection may be considered as an aspect of endurance. Indeed, despite high sensitivity and specificity for onset of visual signs as a diagnostic test for infection by P. dicentrarchi (see Table 1 and Methods), we found, through post mortem detection, parasites in 222 recipient fish (out of 1420 recipients that were post mortem examined -see Methods) which did not show signs of infection during the experiment. We surprisingly also detected neither parasites nor signs of infection in 54 out of the 180 artificially infected fish used in trial 2. This may be due to the lower virulence of the parasite used in trial 2, which may have prevented the parasite from invading the tissues inspected for the post-mortem examination, or enabled the infected shedder fish to clear the parasite before the end of the trial. These findings indicate that fish can endure infection with parasites to some degree without developing infection signs or apparent impact of the parasite on their health. This aspect of endurance could however not be separated from resistance in our study. It should be noted that these asymptomatic shedder fish are not expected to compromise the results of this study, as it is reasonable to assume that these   Table 3. Estimates of variance components representing recipient family composition effects, included in the generalised linear mixed models for time to signs and time from signs to death.
www.nature.com/scientificreports www.nature.com/scientificreports/ fish were infected and thus also likely infectious for some time, and also because they were approximately equally distributed across the shedder families.
The family structure of the fish considered in our transmission experiment is expected to provide genetic variance component estimates and estimation of individual genetic risks for resistance, endurance and infectivity 56 . Such estimates would facilitate selective breeding for reduced disease transmission. Indeed, a recent companion study using the same dataset estimated heritabilities for resistance and endurance of 0.26 and 0.12, respectively, and found no significant genetic correlation between both traits 12 . However, the genetic models used to infer these estimates ignored genetic variation in infectivity as the statistical methods for estimating genetic effects for all three traits don't yet exist. Under the assumption of infectivity being a heritable trait, it can be defined as an indirect genetic effect (IGE), also known as an associative or a social genetic effect 40,[57][58][59] . Statistical models incorporating IGEs require populations structured into many isolated groups with related individuals in each of these groups 60,61 , which was the case in our experiment. We have recently extended IGE models to incorporate the dynamics of infection processes, and this extension can accurately estimate genetic risks as well as heritabilities and environmental effects for both infectivity and resistance 39 . However, further work is needed to adapt these models for infections that cause recovery or death (such as scuticociliatosis) and to incorporate genetic variation in endurance to disease.
These IGE models have the advantage that they incorporate transmission between recipients into estimation of genetic risks for resistance, infectivity and endurance, which was not explicitly considered in our study. In particular, these IGE models would provide infectivity genetic risk estimates for all individuals in a population, from which individuals with extreme risks can be identified as genetic superspreaders 39 . In the context of animal breeding, these genetic superspreaders have higher probability of generating offspring that would transmit most of the infections in a disease outbreak, and therefore preventing their occurrence would be an effective means to reduce disease prevalence in subsequent generations 62 .
Approaches to extend genetic studies of infectious disease beyond disease resistance have only emerged relatively recently. Several studies have explored tolerance as an alternative host genetic trait to disease resistance affecting health and survival 32,34,[63][64][65] . Disentangling resistance from tolerance requires however individual measures of pathogen load 32,36,66 . Similarly, one may expect that estimating infectivity requires measures of individual shedding rates. Obtaining either of these measures on a sufficiently large number of individuals typically required for genetic analyses is costly and may not be feasible in field situations. In this paper, we demonstrate an alternative approach for decomposing genetic effects underlying disease transmission that does not require costly measures of pathogen load but relies on an appropriate experimental design and appropriate statistical models. The benefit of using endurance over tolerance is that it does not require measures of pathogen load and that statistical constraints of reaction norms don't apply 14,34 . Also, infectivity and resistance estimates can be derived from the same disease phenotypes, i.e. observations of individuals' infection status over time 39,45 . Given the novelty of this approach, we chose the infection model in turbot due to the availability of non-invasive indicators of individuals' infection status over time and because of the relative ease of creating many large families and generating large sample sizes in aquaculture. Conducting similar transmission experiments in fish or other model species may provide useful genetic information for studying complex disease traits such as resistance, endurance and infectivity in the context of infectious diseases in other livestock or human populations, where similar experiments are too costly or non-feasible 67 . However, knowledge about data requirements emerging from simulation studies and transmission experiments such as the one described may help to eventually apply similar concepts to disease field data in various livestock species. Digital dermatitis or bovine Tuberculosis in cattle provide some recent examples where infectivity has been considered as an additional trait to resistance for genetic disease control in domestic livestock 68,69 . Recent trends in automated data recording and cost-effective genotyping, together with global investments in infra-structure for storing and analysing big data in agriculture hold great promise for obtaining deeper insight into the different host traits underlying disease transmission and impact and thus for achieving breakthroughs in genetic disease control in all species.
In conclusion, our results imply that animals can potentially evolve three conceptually different types of response affecting their own and their group members' chances of surviving infections. In particular, we demonstrate for the first time that there is significant genetic variation in in host infectivity. Our findings reveal new opportunities for devising more effective genetic disease control strategies by simultaneously exploiting genetic variation underlying host resistance, infectivity and endurance to disease, rather than focusing only on disease resistance as measured by survival. As illustrated with a biological infection model in fish, genetic effects in resistance, endurance and infectivity can be disentangled by a single experimental design. Broadening the scope of disease traits in genetic studies may open new avenues for identifying novel genes affecting disease transmission and survival as well as further understanding of mechanisms underlying infectious disease dynamics and evolution.

Methods
The scuticociliatosis transmission experiment in turbot. Philasterides dicentrarchi (P. dicentrarchi) is a histophagous ciliate causing scuticociliatosis in many fish species, in particular olive flounder and turbot. Scuticociliatosis is one of the most important parasitological problems in marine aquaculture worldwide 70 . In recent years, severe scuticociliatosis outbreaks with high mortality rates and devastating economic effects have been reported in East Asia (Korea, Japan, China), Europe (Spain, Portugal) and other regions of the world 44,71 . The parasite generates a severe systemic infection invading internal organs such as brain, gills, liver and intestine that generally results in the death of the host 72 .
Experimental facilities and turbot family structure. The farmed turbot used in the experiment were bred and reared in the experimental facilities of CETGA, Spain. The experiment complies with ethical regulations and was approved by the Regional Government of Xunta de Galicia (registered under the code (2019) 9:4924 | https://doi.org/10.1038/s41598-019-40567-w www.nature.com/scientificreports www.nature.com/scientificreports/ ES150730055401/16/PROD.VET.047ROD.01). Due to space restrictions, the transmission experiments were carried out in two consecutive trials. Each trial comprised 900 fish (sex ratio 1:1) from 22 and 23 full-sibling families generated from 29 sires and 25 dams, for trials 1 and 2 respectively. The resulting full-sib families included eight and five paternal half-sibling families for respectively, trials 1 and 2 and seven maternal half-sib families in each of these trials. Parental fish were mostly unrelated, except for two paternal and three maternal half-sibs. Average body weights at the start of trials 1 and 2 were 32.1 g (std. dev. 9.2 g) and 91.6 g (std. dev. 38.4 g), respectively. Since all the turbot families were simultaneously cultured, fish from the trial 2 were heavier as they had more time to grow. Fish were tagged according to their full-sib family using elastomers, and fin clipping was used for individual identification.
Distinction between shedder and recipient fish and their allocation across isolated experimental units. We assigned families in each trial into either shedder families or recipient families. Shedder families were inoculated with a virulent strain of P. dicentrarchi and then introduced into isolated tanks comprising non-infected recipient fish to seed the infection in the transmission experiment. Eventually, both infected shedders and recipients express infectivity, but artificial infection of shedders implies that shedder fish provide more accurate information about infectivity than recipient fish. To maximise genetic diversity, the eight shedder families were unrelated, i.e. shared no common sire or dam.
Each trial consisted of 36 independent transmission experiments carried out in 36 isolated 50-litre closed-circuit aerated tanks with constant water temperature of 17-18 °C and 0.36‰ salinity. To maximise statistical power for detecting and disentangle genetic variation in resistance and infectivity, we strategically distributed the shedder and recipient fish into the different tanks as follows (see Fig. 1): each tank comprised 25 fish, with five fish from the same shedder family seeding the infection to 20 fish from 4 different recipient families, such that fish from each of the families were selected at random. Also, each shedder family seeded the infection in 9 different tanks. To minimise confounding between shedder fish infectivity and genetic resistance and infectivity of recipient fish and therefore minimise the noise for detecting shedder family differences in infectivity, the same recipient family compositions were used across the 4 shedder families. This resulted in 9 different family compositions per trial, with each recipient family represented in 8 tanks and in 2 different family compositions. The sample size of 1800 fish and the family composition in tanks are in line with experimental design recommendations supported by theoretical results regarding studies of social genetic effects 46 as well as simulation studies for detecting ideal experimental conditions for estimating infectivity genetic effects 39 .

Infection of donors and transmission to recipients. Inoculation of shedder fish was carried out by
intraperitoneal injection 72 into the abdominal cavity of 200 µl of sterile physiological saline containing (50,000 and 56,000 ciliates were used in each of shedder fish from trials 1 and 2, respectively). After inoculation, infected shedder fish were immediately introduced into the tanks comprising initially only non-infected recipient fish and the epidemics were allowed to develop naturally. The trials were terminated when mortality had plateaued to zero for 3 consecutive days, which was at 104 and 160 days in trials 1 and 2, respectively. Fish still alive at the end of this period were killed with an overdose of anaesthetic. One of the tanks in trial 1 was discarded from the analysis due to a problem with oxygen supply. Also, due to technical difficulties in daily tracking of the disease status of all fish in the experiment, two fish from trial 1 and one fish from trial 2 had missing disease information and was also discarded from the analysis.
Visual signs and histology. Visual infection signs included exophthalmia, colour change or depigmentation, visible lesions or abnormal swimming behaviour (no blinding method was used). Upon detection, dead fish were removed and both their brain and ascetic fluid collected from the body cavity were analysed to detect the presence of the scuticociliade. At the end of the experiment, all remaining fish were euthanized and screened for parasites.

Statistical analyses.
To compare infection and survival profiles associated with trials, tanks and families, we used Kaplan-Meier estimators for survival probability of developing disease and dying from disease using time to signs (with censoring given by surviving fish that did not show signs by the end of the experiment) and time to death, respectively. For fish that displayed visual infection signs, Kaplan-Meier estimators were also generated for time from signs to death. Fish that had signs but survived until the end of the experiment were censored in this case. The right-censoring mechanism considered was independent of the time to signs and time to death observed in recipient fish, a crucial assumption for calculation of Kaplan-Meier curves and log-rank tests 73 .
Daily counts of recipient fish with visual infection signs in each tank were used to estimate the effect of shedder fish family on time to infection. For a tank i and day t j of the experiment, the number of fish with visual signs, represented by C i (t j ) was assumed to follow a binomial distribution with parameters given by the number of susceptible individuals at day t j in tank i, represented by S i (t j ), and p j , which is the probability of a fish showing visual signs at t j given it was susceptible prior to that day. This conditional probability is, by definition, the disease hazard at day t j 49,50 and can be modelled using a generalised linear mixed model with a complementary log-log link function given by where α j is the day-specific intercept representing a baseline hazard, SF i is the shedder family at tank i with effect b i and I ij is an offset representing the proportion of infected fish at tank i and day t j . Individual family effects were not incorporated in this tank-day level model as each tank comprised four recipient families. Therefore, RFC i is a random effect representing recipient family composition at tank i such that its covariance matrix incorporates