Introduction

Ecological and microevolutionary processes impacting parasite populations are increasingly being studied using population genetic analyses (reviewed by Criscione et al., 2005; de Meeûs et al., 2007; Criscione, 2008; Archie et al., 2009; for this study the term ‘parasite’ is restricted to macro-parasites; sensu Lafferty and Kuris, 2002). For example, numerous aspects of a parasite's biology, including mating patterns and transmission dynamics, may be expected to lead to deviations of genotypic frequencies from Hardy–Weinberg equilibrium (HWE) expectations (Criscione et al., 2005; Criscione, 2008). Interestingly, deviations of genotypic frequencies due to heterozygote deficits (HDs) are not uncommon in parasite populations (for example, Plantard and Porte, 2004; Guzinski et al., 2009; Dharmarajan et al., 2010; but also see Caillaud et al., 2006; Keeney et al., 2007; Thiele et al., 2008).

When observed, HDs in parasite populations have historically been attributed to two main causes: the first being technical issues (for example, null alleles; de Meeûs et al., 2004; Roed et al., 2006; Wielgoss et al., 2008), and the second being related to sampling issues associated with parasite social systems, movement behavior and/or mating tactics leading to a Wahlund effect (for example, Luo et al., 2003; McCoy et al., 2003; Chevillon et al., 2007; Criscione et al, 2007; Ravel et al., 2007) or inbreeding (for example, Picard et al., 2004; Plantard et al., 2008).

Ticks are major vectors of diseases affecting humans and animals (Jongejan and Uilenberg, 2004) and population genetic tools have the potential to further our understanding of their biology (McCoy, 2008). Although numerous studies have explored the factors affecting genotypic frequencies in parasite populations (see references above), few systematic evaluations of alternative causal hypotheses for deviations from HWE in ticks exist (for example, Chevillon et al., 2007). Our study organism, the raccoon tick (Ixodes texanus), shows a three-host life cycle with all stages (larvae, nymph and adults) predominantly parasitizing the raccoon, Procyon lotor (Darsie and Anastos, 1957). Being a nidicolous tick, I. texanus lives in shelters used by their vertebrate hosts, and in contrast to non-nidicolous ticks, mating in this species usually takes place on the ground rather than on the host (Sonenshine, 1993). In I. texanus deviations from HWE may be caused by three mechanistic factors: technical issues, hierarchical population structure and cryptic structure.

Technical issues are generally associated with problems experienced during polymerase chain reaction amplification of genomic DNA (that is, genotyping error due to stutter, large allele dropout and/or null alleles), and can lead to a deficit of heterozygotes compared with HWE expectations. Alternatively, the presence of undetected hierarchical population structure within a sample of ticks can also lead to deviations from HWE because of the inadvertent pooling of individuals from different populations (that is, a Wahlund effect). Such inadvertent pooling of populations is quite likely, because ticks are biologically structured at numerous orthogonal and/or nested levels. For example, ticks are structured with respect to life-history stage (adults and nymphs, in our case), which are further subdivided into infrapopulations (IPs; that is, ticks of a particular life-history stage on a particular host individual). In addition, the hosts themselves are heterogeneous with respect to age and sex. The presence of genetic differentiation among any of these nested and/or orthogonal levels could lead to a Wahlund effect (and thus HD) when ticks are pooled at the scale of the component population (CP; that is, ticks of a particular life-history stage infecting a spatially distinct group of hosts).

Finally, the presence of cryptic population structure could also lead to a Wahlund effect when ignored. Numerous cryptic factors could lead to population genetic structure in I. texanus. For example, because I. texanus transmission takes place only at the den site, raccoons that use dens in habitat patches other than the one in which they are trapped may have a genetically differentiated cohort of ticks compared with resident raccoons. Although there is no clear a priori method of differentiating between these two groups of raccoons, pooling ticks from these microgeographic groups at the CP scale would be expected to lead to a Wahlund effect. Alternatively, the presence of kin structure at the CP scale (for example, the presence of groups of related individuals because of variance in reproductive success of individual ticks) may lead to a ‘family’ Wahlund effect (Pudovkin et al., 1996).

In this study we genotyped 718 I. texanus adult and nymphal ticks collected from raccoons (P. lotor) at 11 microsatellite loci with the objective of identifying the major mechanism(s) contributing to the HD in this parasite species. To achieve our objective we tested three inter-related hypotheses that could lead to deviations from HWE: the presence of technical issues, hierarchical structure and cryptic population structure.

Materials and methods

Sample collection and microsatellite data

I. texanus ticks (n=736) were collected in the spring of 2006 from 91 raccoons trapped in five habitat patches (Pop_01, Pop_03, Pop_14, Pop_16 and Pop_17) located in the Upper Wabash River Basin, Indiana (Supplementary Figure S1; see Table 1 for sampling details; patch numbers follow Dharmarajan et al., 2009a). The patches were separated by distances ranging from 5.98 to 28.06 km, and raccoons inhabiting them were genetically differentiated (0.02FST0.06; P<0.001 for all pairwise comparisons; Dharmarajan et al., 2009a). Raccoon trapping, immobilization and post-capture processing have been described earlier (Beasley et al., 2007; Beasley and Rhodes, 2008). After raccoons were immobilized, all visible adult/nymphal ticks were collected and stored in 95% ethanol before DNA extraction. The period of raccoon immobilization was short and thus larval ticks could not be collected, the time required for collecting them being prohibitive.

Table 1 Sampling and genetic information for Ixodes texanus ticks collected from five raccoon subpopulations located in Northern Indiana

Ticks were genotyped at 11 microsatellite loci following the protocols reported in Dharmarajan et al. (2009b). We used quality control measures that were standard in our lab (for example, the inclusion of pre-amplified allelic standards in each electrophoretic run and the rescoring of a random subset of samples by an independent scorer; see Drauch et al., 2008) to ensure that the proportions of genotyping error and missing data were <0.01 across all loci.

Allelic richness, observed heterozygosity and Nei (1987) unbiased expected heterozygosity were calculated for all ticks collected within each habitat patch using the program GENETIX (Belkhir et al., 2004). As our tick sample consisted of nymphs and adults, all subsequent analyses were carried out separately in these life-history stages. We use the term IP to represent ticks of a particular life-history stage infesting a host individual, and the term CP to represent ticks of a particular life-history stage infecting a spatially distinct group of hosts. Only nine of the possible ten CPs (five patches × two life-history stages) were used for all subsequent analyses; one CP (adult ticks from Pop_14) was excluded because of low sample size (five individuals; Table 1).

Levels of linkage disequilibrium (LD) for all pairwise locus combinations were tested using the genotypic randomization test (10 000 randomizations) implemented in the program GDA (Lewis and Zaykin, 2001). Genotypes rather than alleles were randomized to remove the effect of within-locus disequilibrium (that is, deviation from Hardy–Weinberg equilibrium) on pairwise LD levels. We carried out a total of 495 pairwise comparisons (55 locus pairs evaluated in each of 9 CPs). The existence of significant levels of LD across all combinations was tested by calculating the exact binomial probability (Pbinomial; Rosner, 2006) of obtaining the observed number of statistically significant tests (nsig) given the number of tests carried out (ntests) and the Type I error rate (α=0.05).

We tested for HD at the scales of the IP (within each CP), CP, and globally using Weir and Cockerham (1984) estimator of FIS (f). The U-test was used to test for significant levels of HD using the Markov chain algorithm implemented in GENEPOP007 (dememorization 1 × 104, batches 500, iterations per batch 1 × 106; Rousset, 2008).

To determine whether there was a direct effect of host age and sex on f at the IP scale, we used a generalized linear model with normally distributed errors and an identity link (SPSS version 16; SPSS, Chicago, IL, USA). The initial model was of the form f Site+Host sex+Host age+Host sex:Host age+Constant (where site was a nominal factor with d.f.=4, and ‘:’ implies an interaction effect). Model parsimony was assessed using Akaike's Information criterion corrected for sample size (AICC; Burnham and Anderson, 2002).

Technical issues

We tested for the presence of three technical issues associated with polymerase chain reaction amplification: genotyping error caused by stuttering, large allele dropout and null alleles. We tested for genotyping error caused by stuttering and large allele dropout at the scale of each CP using the randomization routines (5000 randomizations) implemented in the program MICROCHECKER (Van Oosterhout et al., 2004). In addition, we tested for a signature of large allele dropout at the scales of both the IP and CP by regressing allele-specific FIS values against allele size (as per de Meeûs et al., 2004; for detailed methodology, see Supplementary Methods).

We initially inferred the presence of null alleles in all locus/IP combinations that revealed significant HD as indicated by the U-test (see above). As null homozygotes are expected to appear as missing genotypes (that is, blanks) we also tested for a significant positive relationship between FIS at the jth locus in the ith IP (fij) and the observed number of blanks using a generalized linear model. The generalized linear model analysis was carried out using a binomial error distribution and a logit link function (SPSS version 16), and the dependent variable was modeled as a binomial count variable (the number of ‘successes’ (that is, blanks) given the number of trials (that is, individuals genotyped)). We carried out the generalized linear model analysis separately at each locus and across all loci. We also repeated the above analyses to infer the presence of null alleles at the CP scale.

Hierarchical structure

Heterozygote deficits may occur if non-interbreeding populations are sampled as a single entity, even if each population is independently in HWE (Wahlund, 1928). We tested for the presence of hierarchical genetic structure in each CP at two nested scales (IP and individual). As genetic structuring of parasite populations could be affected by demographic factors related to the host (age/sex), we controlled for these factors using the method described by de Meeûs and Goudet (2007). We divided ticks within each CP into two groups based on the age of the host (adult vs yearling) and hierarchal analyses were undertaken separately for each group (within each CP). We used the program HIERFSTAT (Goudet, 2005) to estimate genetic variance partitioned among host sexes within host age (FSex−Age), among IPs within host sex (FIP−Sex), among individuals within IPs (FI−IP), among individuals within host sex (FI−Sex) and among individuals within host age (FI−Age). We generated 95% confidence intervals for all the F-statistics by bootstrapping (5000 times) across loci (as implemented in HIERFSTAT).

In the case of FSex−Age and FIP−Sex, we also tested for significance using 5000 randomizations of IPs among host sex and individuals among IPs (within host sex), respectively. Independent P-values were combined across CPs using the exact binomial test (Rosner, 2006) in the case of FI−IP, FI−Sex and FI−Age (exact P values being unavailable) and Fisher's method (Sokal and Rohlf, 1981) in the case of FSex−Age and FIP−Sex.

In the absence of host age/sex effects, we estimated the genetic variance partitioned among all IPs within each CP (FIP−CP) and among CPs (FCP−Total), and evaluated the statistical significance by permuting (5000 times) individuals among IPs within CPs (in case of FIP−CP) and IPs among CPs (in case of FCP−Total).

Cryptic structure

As mentioned earlier, cryptic genetic structure could be caused by the presence of microgeographic and family (kin) structure. We tested for the presence of cryptic microgeographic structure using the program BAPS version 5.3 (Corander et al., 2008), which identifies cryptic sub-structure by minimizing HWE and LD within each of k clusters. We used BAPS to estimate k within each CP using 15 runs wherein maximum k was constrained at the number of individuals genotyped. The k having the highest posterior probability (across the 15 runs) was considered the most likely number of clusters identified by BAPS. As BAPS may overestimate k due to the inference of a few small spurious clusters (Latch et al., 2006), we verified the results of the BAPS analyses using the program STRUCTURE version 2.2 (Pritchard et al., 2000; see Supplementary Methods for details of STRUCTURE analyses).

To test for the presence of kin structure at the CP scale, we partitioned individuals into putative kin groups using the program PEDIGREE (Herbinger, 2005). PEDIGREE uses an MCMC algorithm to sample the space of possible partitions of individuals, and calculates for each partition a score based on the pairwise score method described by Smith et al. (2001) and Butler et al. (2004). In order to identify the best partition of individuals into putative kin groups, we performed five PEDIGREE runs for each CP (500 000 iterations per run) with the following parameters: ‘full-sib constraint’=0; ‘temperature’=10; and ‘weight’=1 (the functions of each of these three parameters are detailed in Supplementary Table S1). The statistical strength of the best partition was tested by performing hundred PEDIGREE runs (using parameters above) after randomizing alleles among individuals (as per Herbinger, 2005), and statistical significance was calculated as the proportion of randomizations producing a partition score the observed score.

Although other clustering software (for example, BAPS) have been used to identify the presence of kin structure in ticks (see Chevillon et al., 2007), our empirical evaluation of three algorithms indicated that both BAPS and STRUCTURE have higher α error rates (≈0.20 and ≈0.50, respectively) compared with PEDIGREE (≈0.03) when identifying half-sib groups (Supplementary Figure S2; See Supplementary Methods for the detailed methodology used to calculate error rates). However, PEDIGREE has low power (0.50); thus, estimates of kin structure by PEDIGREE are likely to be conservative (Supplementary Figure S2).

Results

We successfully genotyped 718 of 736 (98%) I. texanus ticks at 11 microsatellite loci (Table 1). Ticks sampled from each of the five habitat patches showed high levels of polymorphism (range: 12.9–14.6 alleles/locus) and unbiased expected heterozygosities (range: 0.78–8.0; Table 1). Statistically significant levels of LD were detected in about 5% (25 of 495) of the locus pairs tested across the nine CPs, which was within binomial expectations given the Type 1 error rate (ntests=495; nsig=25; Pbinomial=0.508). A large number of locus/CP combinations (28%) showed significant deviations from HWE (ntests=99; nsig=28; Pbinomial<0.001; Table 2). Significant levels of HD were detected in 8 of 9 CPs (across all loci; Table 2) and 6 of 11 loci (across all CPs; Table 2), and the global FIS across all loci and CPs was 0.060 (P<0.001). The estimates of FIS calculated at the scale of the CP and IP were highly variable across loci (Supplementary Figure S3). We found no evidence that this variation was due to the effects of host sex or age; none of the variables were significant in the linear regression, and the intercept-only model was the most parsimonious.

Table 2 Weir and Cockerham's (1984) estimates of FIS (f) across nine Ixodes texanus component populations (CP) at 11 loci

Technical issues

We found statistical evidence for genotyping error due to stutter in two locus/CP combinations (see Table 2 and Supplementary Figure S4), which was within binomial expectations given the Type I error rate (ntests=99; nsig=2; Pbinomial=0.961). There was no evidence of large allele dropout at any locus, at the scale of the IP or CP (see Supplementary Table S2). We found that 28 of 99 locus/CP combinations examined showed significant levels of null alleles as detected by the U-test (Table 2). However, there was no evidence of a significant positive relationship between number of blanks and FIS at any locus or across all loci at the scale of the IP or CP (Supplementary Table S3).

Hierarchical structure

We found no evidence for genetic differentiation among ticks infecting male vs female raccoons or among IPs within each host sex category (for details, see Supplementary Table S4). We also found no significant differences between FSex−Age or FIP−Sex values between yearling and adult raccoons, as indicated by a complete overlap in bootstrap CIs (see Supplementary Table S4). As raccoon age and sex did significantly affect the genetic structuring of I. texanus, we pooled IPs across host sex and age categories before global analyses. The results of the global analysis revealed that there was no genetic differentiation between IPs within the CPs (adult ticks: FIP−CP≈0, P=0.526; nymphs: FIP−CP≈0, P=0.413). However, we found low but statistically significant levels of genetic differentiation among the CPs (adult ticks: FCP−Total=0.002, P<0.001; nymphs: FCP−Total=0.005, P<0.001). The hierarchical analyses revealed that the maximum genetic variance was usually partitioned at the lowest scale at which we sampled (that is, within individuals at the IP scale; see Supplementary Table S4).

Cryptic structure

The clustering algorithm used by BAPS identified a variable number of clusters (mean: 7.2; range: 3–11) in every CP examined; however, in 8 of 9 CPs, more than 80% of individuals were grouped into a single large cluster (see Supplementary Table S5). In contrast to the results of BAPS, analyses with STRUCTURE revealed no evidence of cryptic genetic structure in 8 of 9 CPs (see Supplementary Table S5). The exception to this pattern was nymphs in Pop_03, wherein STRUCTURE identified the presence of two groups of ticks; however, the partition showed little biological relevance because individuals were not strongly assigned to either of the clusters (see Supplementary Table S5).

Our use of the program PEDIGREE revealed significant levels of kin structure in 5 of 9 CPs examined (χ2[18]=52.69; combined P-value <0.001; Table 3); however, the kin groups were usually small (≈2.1 individuals) and regularly pooled individual ticks from different IPs within each CP (<5% individuals within kin groups being from a single host). Interestingly, the probability of an IP showing significant HD was higher than binomial expectations in the majority of CPs showing significant levels of kin structure (that is, 4 of 5 CPs; Table 3), but not in those with non-significant levels of kin structure (that is, 1 of 4; Table 3). Thus, deviations from HWE expectations at the IP scale were more likely in CPs that exhibited significant vs non-significant levels of kin structure.

Table 3 Results of kin group clustering and Monte–Carlo simulation analyses in nine Ixodes texanus component populations (CP)

Post hoc analyses

Our analyses indicated that the signature of HD originated at the lowest scale at which analyses were performed: the IP. Although the presence of null alleles might be considered to be a simple explanation for the observed HD, this hypothesis received statistical support only with the U-test, and thus may be confounded by factors related to parasite biology, such as kin structure (see Discussion). We evaluated whether biological factors (that is, the levels of kin structure and life-history characteristics of I. texanus) could adequately explain deviations from HWE at the IP scale using a MC simulation approach. The MC simulation was principally based on the subdivided breeding group model proposed by Criscione and Blouin (2005), modified to take into account the levels of kin structure observed in I. texanus.

In each CP we carried out the MC simulation in three stages. First, we generated half-sib groups whose sizes reflected the sizes of kin groups identified by PEDIGREE (in CPs with non-significant levels of kin structure we considered all the ticks to be unrelated; that is, the number of kin groups was equal to the number of ticks sampled). The half-sib groups were generated from the allele frequency distribution of an IP selected at random (with replacement). In the second stage we generated a random set of hosts (the number of hosts being same as that in the observed CP) and redistributed the kin groups among these hosts to reflect the distribution of kin groups among IPs in the observed data. Finally, we calculated the FIS at each locus and across all loci for each randomly generated IP. We repeated the simulation procedure 1000 times to obtain a distribution of FIS values (at each locus and across all loci) for each observed IP. For each IP (at each locus and across all loci) we tested whether the MC simulation provided adequate support for the observed data by calculating the probability of obtaining a simulated value observed. We evaluated the overall fit of the model expectations to observed values (at each locus and across all loci) using the exact binomial test described earlier. To visualize the fit of the simulated data to the observed data, we also generated 95% CIs for each IP (at each locus and across all loci). Although our MC simulation analyses were based on some simplifying assumptions regarding I. texanus biology, these assumptions were unlikely to seriously affect our results (see Testing MC simulation assumptions in Supplementary Methods; see also Supplementary Figure S5 and S6).

Across all IPs evaluated we found that the MC simulation adequately explained the observed levels of HD both across all loci (Figure 1; see also Supplementary Table S6) and at each locus (no individual locus deviating from MC expectation across all IPs; Supplementary Table S6 and Figure S7). In addition, at the scale of the CP we found that the number of IPs showing significant levels of HD across all loci (compared with MC simulation expectation) did not deviate from the binomial expectation in any of the nine CPs examined (Table 3).

Figure 1
figure 1

Distribution of observed FIS values calculated across 11 microsatellites at the scale of the infrapopulation (IP) in the case of adult (closed circles) and nymphal (open circles) Ixodes texanus ticks. The IPs are grouped by the raccoon subpopulations, and discrete subpopulations delineated by dashed lines; the order of subpopulations represented is Pop_01, Pop_03, Pop_14, Pop_16 and Pop_17, respectively. The 95% confidence intervals based on the kin-structured Monte–Carlo simulation (black lines) are also represented.

Discussion

This study evaluated three interrelated hypotheses that may cause genotypic frequencies in I. texanus CPs to deviate from HWE expectations: the existence of technical issues (genotyping error due to stutter, large allele dropout and null alleles), hierarchical structure and cryptic population structure.

The only technical issue for which we obtained statistical support was the presence of null alleles and this hypothesis was only supported by the U-test (Rousset, 2008). However, the U-test could lead to the spurious identification of null alleles in the presence of any demographic process (for example, kin structure) that causes genotypic frequencies to deviate from HWE (Dakin and Avise, 2004). We thus return to the null allele hypothesis later (see below).

The second hypothesis that we evaluated as a cause of HD in I. texanus CPs was the evidence of underlying hierarchical genetic structure caused by the interaction between host demography (that is, host age and sex) and the spatial subdivision of ticks into IPs. Given the life cycle of the parasite, we expected that HD (at the CP scale) could have been caused by the limited movement of ticks among raccoon den sites. Surprisingly, in both adult and nymphal ticks we found very low levels of genetic structure both at the scale of the CP (beacuse of host demography or spatial subdivision) and at the scale of the entire population (that is, among CPs).

The third hypothesis we evaluated was the presence of cryptic microgeographic and/or kin structure. Although our analyses provided no support for the presence of cryptic microgeographic structure, they provided strong statistical support for the presence of kin structure in a large proportion of CPs examined. Empirical research has shown that ignoring social structure can lead to serious biases in inferences drawn from molecular data with regard to the biology of the species being studied (for example, Boone and Rhodes, 1996; Latch and Rhodes, 2006), and this is because cryptic social structure in a population may lead to deviations from HWE (see Sugg et al., 1996). Although our analyses provided strong statistical support for the existence of kin structure in the I. texanus CPs, we note that the microsatellites used have low power in terms of differentiating half-sibs from unrelated individuals (Type II error≈50%), making our estimates of kin structure conservative. Although the existence of temporally stable social structure is unlikely in the case of parasites, even transient kin structure because of the presence of admixed kin groups at the CP scale could lead to a HD through a ‘family’ Wahlund effect (Pudovkin et al., 1996). The presence of admixed sib groups could be a direct result of high variance in female reproductive success (Criscione and Blouin, 2005; Chevillon et al., 2007). Indeed, Chevillon et al. (2007) proposed that the admixture of ticks from distinct sib-groups could have led to significant levels of HD found at the CP scale in the cattle tick (Rhipicephalus microplus).

In I. texanus the presence of a large number of kin groups in a majority of CPs, as well as the high variance in kin group size, lends support to the hypothesis of variance in the reproductive success of individual ticks. However, less than 5% of the identified kin groups consisted of ticks sampled from a single host, revealing that though there is kin structure within CPs, there are also high levels of gene flow among raccoon dens (a fact supported by the low levels of FST among IPs).

Of the hypotheses explored, only two received statistical support: null alleles and kin structure. Clearly, the high variance in FIS across loci at the scale of the CP and IP is suggestive of the presence of null alleles. On the other hand, three lines of evidence suggest that null alleles were unlikely to be the main cause of deviation from HWE: (1) levels of missing data did not support the null allele hypothesis at the IP or CP scales; (2) our analyses provided strong statistical support for the presence of kin structure in a majority of CPs examined, and analysis of genetic patterns in randomly generated kin-structured populations revealed that pooling kin groups could lead to a pattern usually considered to be a signature of null alleles: high levels of FIS as well as high variance in FIS among loci (compare Supplementary Figures S2A and S3); and (3) finally, null alleles are expected to lead to an underestimation of relatedness between individuals (Dakin and Avise, 2004; Wagner et al., 2006); thus, we would expect low levels of kin structure when null allele frequency is high. However, our data show the opposite pattern with evidence that HD at the IP scale was significantly higher in CPs having significant levels of kin structure, compared with those with non-significant levels of kin structure. Given the three reasons mentioned above, it seems unlikely that HD in I. texanus is caused by the presence of null alleles.

Our post-hoc analyses revealed interesting insights into the biology of I. texanus in particular and potentially nidicolous parasites in general. We found that kin structure, when combined with subdivided breeding groups (that is, den sites), has the potential to produce deviation from HWE at the IP scale along with high variance in FIS at individual loci. There is no doubt that the model makes certain simplifying assumptions regarding the life-history characteristics of I. texanus, one important assumption being that allele frequencies observed at the IP scale adequately reflected the allele frequencies of parents forming the subsequent generation of ticks. Unfortunately, empirical support for this assumption requires sampling of the dens, which our sampling scheme did not incorporate. However, relatedness and FIS values within the MC-simulated kin groups emulated closely those of the observed kin groups, indicating that our assumptions are unlikely to seriously affect the simulation results. It is clear that future studies will need to concentrate on attempting to integrate population genetic patterns at the scale of the dens with those at the scale of the IP.

To summarize, our analyses revealed that in parasite systems showing heterozygote deficiency at the IP scale, the existence of subdivided breeding groups (Criscione and Blouin, 2005) and the high variance in individual reproductive success can lead to deviations from HWE. Importantly, our analyses have revealed that in such systems there is likely to be high variance in FIS among loci due to random factors affecting the genetic composition of parents contributing progeny as well as a Wahlund effect when these progeny are pooled at the IP scale. Although we do not imply that null alleles could not be a factor leading to HD in either I. texanus or parasites in general, we contend that biological factors can also lead to patterns that have usually been interpreted as being caused by technical issues (for example, null alleles). Our results indicate that it is important to take such biological factors into consideration when addressing HD in natural systems, particularly because loci that deviate from HWE are likely to reflect the effects of real biological processes, and eliminating such loci may be equivalent to ‘throwing the baby out with the bath water’.