Introduction

Pedigree relationship (e.g. uncle–nephew) of diploid organisms determines the expected proportion of the genome that relatives share identical by descent (ibd). At any locus or site in the genome, there is a given probability that two relatives have a copy of the same parental locus. If all sites segregated independently, the variance in the actual proportion shared would be negligible. However, linkage produces a positive correlation of identity among sites and thereby reduces the effective number of segregating sites on each chromosome. As a result, individual pairs having the same pedigree relationship can vary substantially in their actual or realised relationship.

The magnitude of the variation depends on the closeness of the relationship, and more generally on the pedigree, differing for example between that for offspring–grandparent and half-sib pairs, though the expected proportion of ibd sharing is 0.25 for both. As already noted, the variance also depends on the degree of linkage and therefore on the number and map lengths of chromosomes. While the variance of relationship becomes smaller the more distant the relationship, its coefficient of variation increases.

The variance in realised relationship is an important parameter which has to be taken into account when constructing pedigrees in natural or domesticated populations using data on genetic markers (Wang 2016) and in inferences about, for example, risks of inbreeding. It is also the basis of a method for estimating genetic variance free from confounding environmental effects (Visscher 2009; Odegaard and Meuwissen 2012).

Formulae and evaluations of the variance in actual or realised relationship have been developed over a number of years (e.g. Franklin 1977; Stam and Zeven 1981; Hill 1993; Guo 1995), and a comprehensive analysis for a broad range of relationships was given by Hill and Weir (2011). Visscher (2009) provides some approximate summary statistics for the human genome and describes how these can be utilised to estimate genetic variation in quantitative traits in non-pedigreed populations. Variation in realised inbreeding of the offspring of related parents (Hill and Weir 2011) and variation in relationship among partially inbred individuals (Hill and Weir 2012) can be computed similarly.

In the previous work cited above, the assumption has been that the recombination rate is uniform along the genome. However, average recombination rates (expressed as centimorgans per megabase, cM/Mb) usually differ both among and within chromosomes. A typical mammalian value is 1 cM/Mb, depending inter alia on positions of centromeres and repetitive regions. The range of sex-averaged values among human chromosomes is from 0.96 to 2.11 cM/Mb (Kong et al. 2002), and greater variation is seen at finer scales of observation. For the longer human metacentric chromosomes, the relationship between linkage map and physical map is not far from linear, although it is somewhat sigmoidal. For the shortest acrocentric chromosome, over 25% of the centromeric end shows no recombination (Matise et al. 2007). Birds typically have many small chromosomes with high and diverse recombination rates (Groenen et al. 2009). For example, the chicken has a very high cM/Mb ratio compared with mammals, and recombination rates on the microchromosomes are particularly high (Stapley et al. 2017). Generally, there can be both broad-scale and small-scale differences within chromosomes, notably at recombination hotspots (McVean et al. 2004; Stapley et al. 2017).

Observations of genomic identity between chromosomes at the molecular level are initially likely to be in terms of physical length rather than map length. Here, we examine the effect on the variance of ibd sharing of measuring lengths of shared and unshared segments on the physical scale (Mb) rather than on the map scale (cM), extending some results of Hill and Weir (2011).

Materials and methods

First, we re-express some methods and results of Hill and Weir (2011) in a simpler form as a basis for further analysis. The pedigree relationships mentioned are described in Table 1.

Table 1 Summary of relationships

Simplification and generalisation of basic formulae

We assume that the genome consists of a number of independently segregating chromosomes, and focus on just one of these.

We begin by considering a unilineal relationship, i.e. a pair of individuals who share exactly one pair of alleles ibd at any given locus with probability k, and otherwise share zero pairs of alleles, with probability 1–k. Sharing derives from either the maternal or paternal lineage, but not both. The probability k corresponds to k1 in Hill and Weir (2011). Wright’s (1922) coefficient of relationship is then k/2, and k/4 is the coancestry coefficient.

For a range of different relationships, Hill and Weir (2011) calculate the probability of ibd sharing at two linked sites with recombination fraction c as a polynomial in 1–c. Here, we follow the same approach, but express the probability as a polynomial in the linkage parameter 1–2c, which leads to equivalent but simpler coefficients. Following Hill and Weir (2011), we assume that Haldane’s (1919) mapping function applies, so that the linkage parameter is exp(–2d), where d > 0 is the map distance between sites. Let I and J be 0/1 indicator variables for sharing at the two sites. Then E(I) and E(J) are both equal to k, and the probability of ibd sharing at both loci is

$${\mathrm{E}}\left(IJ\right) = k^2\left[ {{a}_0 + {a}_1{\mathrm{exp}}\left( { - 2d} \right) + a_2{\mathrm{exp}}\left( { - 4{d}} \right) + \ldots } \right],$$
(1)

where a0, a1, etc. are the polynomial coefficients, all of which are positive or zero. Values of the coefficients for a range of relationships are given in Table 2. The limit d → , when the two sites segregate independently, shows that a0 = 1. The case d = 0 (when the two sites segregate as one) shows that the coefficients sum to 1/k.

Table 2 Coefficients of the variance of identity required for Eq. (1), for a range of relationships

The covariance of identity for two sites d Morgans apart E(IJ)–E(I)E(J) is obtained by subtracting k2 from expression (1):

$$k^2\left[ {{a}_1{\mathrm{exp}}\left( {-2{{d}}} \right) + {{a}}_2{\mathrm{exp}}\left( {-4{{d}}} \right) + \ldots } \right]$$
(2)

For example, for great-uncle and great-nephew, the covariance is

$$\left( {1/16} \right)\,\left[ {{\mathrm{exp}}\left( {-2{d}} \right) + 0.5{\mathrm{exp}}\left( {-4{d}} \right) + {\mathrm{exp}}\left( {-6{d}} \right) + 0.5{\mathrm{exp}}\left( {-8{d}} \right)} \right].$$

Expression (2) shows how, for a given relationship, covariance is attenuated by recombination as the map distance between sites increases. The size and number of coefficients are related to the number and complexity of coancestry paths linking the two relatives (Wright 1922).

Variation as a function of physical length

In order to compute the variance in relationship as a function of physical length, we weight each DNA base equally. Hence, let X and Y denote physical positions on the chromosome, measured as proportions of its total physical length, and let ψ(X) and ψ(Y) denote the corresponding map positions, expressed as proportions of the total map length ℓ. We refer to ψ as the ‘Marey’ map, relating map length to physical length (Chakravarti 1991). The derivative of ψ is the local recombination rate, i.e. the rate at which the proportion of map length increases with increasing proportion of physical length. Scaling by the ratio of map length of the chromosome in cM to physical length of the chromosome in Mb produces the conventional units cM/Mb. Following Hill and Weir (2011), we obtain the variance of ibd for a chromosome by averaging over all pairs of loci. Specifically, in the sth term of expression (2), we replace d by (ψ(X)−ψ(Y)) and integrate exp[–2s(ψ(X)–ψ(Y))] over the triangular region 0 < Y < X < 1 to obtain the average

$$\varphi \left( s\ell \, \right) = 2\int\!\!\!\int{\mathrm{exp}}\left[ {-2{s}\ell \left( {{\mathrm{\psi }}\left( {{X}} \right)-{\mathrm{\psi }}\left( {{Y}} \right)} \right)} \right]\,{dY}\,{dX},$$
(3)

where we have assumed that the number of genomic sites is large enough that the sum of terms can be approximated by Eq. (3). Then, from Eq. (2), variance of ibd sharing on the chromosome is

$$v\left( \ell \right) = k^2\left[ {a_1\varphi \left( \ell \right) + a_2\varphi \left( {2\ell } \right) + \ldots } \right]$$
(4)

For the grandparent–grand-offspring relationship (GPO), the variance is k2φ(), and Eq. (4) extends the result to other relationships, taking coefficients a1, a2, etc. from Table 2. For half-sibs (HS), for example, the variance is k2φ(2), so that the variance of sharing is the same for a HS relationship with map length and a GPO relationship with map length 2.

For the special case of uniform recombination rate (indicated by suffix *),

$$\varphi_\ast \left( {{s}\ell } \right) = \left( {2s\ell -1 + {\mathrm{exp}}\left( {-2s\ell } \right)} \right)/\left( {2s^2\ell ^2} \right)$$
(5)

(Hill and Weir 2011). The corresponding variance of relationship is

$$v_\ast \left( \ell \right) = k^2\left[ a_1\varphi _ \ast \left( \ell \right) + a_2\varphi _ \ast \left( {2\ell } \right) + \ldots \right]$$

and the ratio

$$\frac{v\left( \ell \right)}{v_\ast \left( \ell \right)} = \frac{ a_1\varphi \left( \ell \right) + a_2\varphi \left( {2\ell } \right) + \ldots }{ a_1\varphi_\ast \left( \ell \right) + a_2\varphi_\ast \left( 2\ell \right) + \ldots }$$
(6)

is a measure of the effect of non-linearity in the Marey map, which we refer to subsequently as the discrepancy ratio, with values far from unity indicating a strong effect.

For bilineal relationships (e.g. full-sibs, double first cousins), ibd sharing at a given site is possible through both paternal and maternal lineages, and the overall proportion of ibd sharing is the average of two values, each of which derives from a unilineal relationship (maternal or paternal). Because the lineages segregate independently, the covariance of identity is the sum of two terms like Eq. (2), one for each lineage. In the simplest cases, e.g. FS and DFC, when the chromosomal relationship is the same in both lineages, the bilineal variance is simply twice that for the corresponding unilineal relationship. Variation in ‘double’ sharing (simultaneous sharing in both lineages) can be calculated in a similar way to variance in single sharing, based on products of terms like Eq. (1) for each lineage (Table 2).

Broken-stick models

In practice, physical and map positions will be measured at a discrete set of markers. We assume that these are closely spaced, and that recombination rate is uniform between markers. The Marey map then takes the form of a continuous, piecewise linear function, resembling a broken stick. The variation in physical distance shared can then be calculated by summation, rather than numerical integration.

Therefore, we consider a single chromosome divided into segments, within each of which the recombination rate is constant. Denote the map length of the ith segment by i and its physical length, expressed as a proportion of the total physical length of the chromosome, by pi, with Σpi =  1. First, we consider the case of grandparent and grand-offspring, for which the only non-zero coefficient in Eq. (4) is a1 = 1 (Table 2).

Let Si be the proportion of physical (and map) length shared in the ith segment. The calculation of var(Si) is the same as that for a whole chromosome with constant recombination rate: from Eq. (5)

$${\mathrm{var}}\left( {{{S}}_{{i}}} \right) = {{k}}^2\left( {2\ell _{{i}}-1 + {\mathrm{exp}}\left( {-2\ell _{{i}}} \right)} \right)/\left( {2\ell _{{i}}^2} \right),$$
(7)

and the covariance of sharing between two segments is

$${\mathrm{cov}}\left( {{{S}}_{\mathrm{i}},\,{{S}}_{\mathrm{j}}} \right) = {{k}}^2{{c}}_{{{\mathrm{i}}}}{{c}}_{{{\mathrm{j}}}}\,{\mathrm{exp}}\left( {-2\ell _{{{{\mathrm{ij}}}}}} \right)/\left( {\ell _{{{\mathrm{i}}}}\ell _{{{\mathrm{j}}}}} \right),$$
(8)

where ci = (1–exp(–2i))/2 is the recombination probability for the ith segment, and ℓij is the total map length of segments lying between the ith and jth segments, with ij = 0 when they are adjacent.

For the whole chromosome, the proportion of physical length shared ΣpiSi has variance

$$\Sigma \,{\mathrm{p}}^2_{\mathrm{i}}{\mathrm{var}}\left( {{{S}}_{\mathrm{i}}} \right) + 2\,\Sigma _{{\mathrm{i} < \mathrm{j}}}{\mathrm{p}}_{\mathrm{i}}{\mathrm{p}}_{\mathrm{j}}{\mathrm{cov}}\left( {{{S}}_{\mathrm{i}}{{,S}}_{\mathrm{j}}} \right){\mathrm{.}}$$
(9)

For other relationships in Table 2, the variance is calculated from Eq. (9), with Eq. (7) modified to

$${\mathrm{var}}\left( {{{S}}_{\mathrm{i}}} \right) = {{k}}^2\Sigma _{\mathrm{s}}\,{a}_{s}\left( {2s\ell _{\mathrm{i}}-1 + {\mathrm{exp}}\left( {-2s\ell _{\mathrm{i}}} \right)} \right)/\left( {2s^2\ell _{\mathrm{i}}^2} \right),$$

and with a similar modification to Eq. (8).

Application

We investigated variance of identity for two very different genomes, human and chicken. For human data, we used web table E from Kong et al. (2002), which provided physical and map distances (separately for males and females) for segments between 5136 markers on the 22 autosomes. For the chicken genome, we used physical and map distances for segments between the 9268 markers in the published linkage map for chromosomes 1–28 (Groenen et al. 2009, Supplemental Table 1). We omitted chromosomes 16, 22 and 25, for which we did not have sufficient data to produce reliable estimates. For each data set, theoretical variances for a GPO relationship were calculated on map and physical scales, using a broken-stick model for the latter, and the results combined to produce the variance for the genome-wide proportion of ibd sharing on each scale. This was calculated as Σp2ivi /(Σpi)2, where pi was the length and vi the variance of sharing for the ith chromosome, with vi and pi calculated on the same scale (map or physical).

The effect of interference

Our results are based on Haldane’s mapping function, which assumes no interference in the crossover process. We briefly consider the effect of replacing the Haldane mapping function by Kosambi’s (1944) mapping function, which allows for interference.

Results

An illustrative example

For illustration, we consider the three Marey maps shown in Fig. 1. In one (afferent, ‘bringing inwards’), recombination is concentrated at the centre of the chromosome, and in another (efferent, ‘conveying outwards’), there are high recombination rates near the chromosome ends. Midway between these two extremes, the Marey map is linear, corresponding to a constant recombination rate.

Fig. 1
figure 1

Three examples of Marey maps. Recombination is distributed away from the chromosome centre for the ‘efferent’ map, and concentrated at the centre for the ‘afferent’ map. The horizontal dashed lines indicate the two segments mentioned in the text

The chromosome of Fig. 1 comprises two segments of equal physical length. Let S1 and S2 be the proportions shared in each segment, so that the proportion of the whole chromosome shared is the average of S1 and S2, with variance V(1 + ρ)/2, where V = var(S1) = var(S2) is the variance within each segment, and ρ is the correlation between S1 and S2. For the GPO relationship, the effect of changing the Marey map from efferent (recombination mainly at ends) to afferent (recombination mainly at the centre) is that the segment variance changes very little, while the correlation steadily diminishes (Table 3).

Table 3 Variance (V) of physical length shared within two half-chromosomes, and the correlation (ρ) between sharing in one and in the other, for a range of Marey maps (the three examples of Fig. 1 and two intermediate cases)

In this example, variance of sharing for the whole chromosome is largely determined by the correlation between the amounts of sharing in the two segments, which in turn is dependent on the degree to which recombination is concentrated at the junction between the segments. A low recombination rate at the centre creates high correlation, whereas a high recombination rate there reduces the correlation.

Recombination hotspots

Consider a chromosome with a uniform recombination rate, except for one or more idealised recombination hotspots, each with finite map length but assumed zero physical length. The hotspots divide the chromosome into segments, within each of which the recombination rate is assumed to be constant. The standard broken-stick calculation applies, except that in Eq. (8), the exponential terms for intervening segments include any hotspots therein.

When considering the effect of a hotspot, the baseline for comparison can be either the result of removing the hotspot, leaving just the uniform background, or the result of merging the hotspot map length uniformly with the background. With the first approach, the effect of a single hotspot is always a reduction in variance, with maximum effect when centrally situated. With the second approach, which we consider more appropriate and therefore use below, the effect can be either an increase or decrease in variance, and the maximum effect could be either at a central position or at a chromosome end.

Figure 2 shows the effect of two hotspots at different positions on the chromosome. The upper curve has an alternative interpretation as the effect of a single hotspot. In each case, the GPO relationship is assumed. The variance obtained with a single central hotspot is generally less than if the recombination rate is constant along the chromosome. Thus, a single hotspot reduces variance when it is centrally placed, and increases variance when it is near a chromosome end. In the central position, the hotspot breaks up the positive correlation between adjacent segments in a similar way to the afferent map of Fig. 1. The effect (increase or decrease) is generally small, however. The maximum effect of two hotspots occurs when both hotspots are at a chromosome end (either with one at each end or with both at the same end).

Fig. 2
figure 2

Effect on the discrepancy ratio (Eq. 6) of different positions of two hotspots on the chromosome. A background of uniform recombination rate is assumed. The relationship is GPO, chromosome map length is 1 M and the map length of each hotspot is 0.1 M. Upper curve: one hotspot is placed at a chromosome end. Lower curve: hotspots are symmetrically placed about the chromosome centre. The baseline for comparison is the uniform case (i.e. hotspot map lengths merged uniformly with the background). The upper curve can also be viewed as the effect of a single 0.1 M hotspot on a 0.8 M uniform background

Further analysis shows that similar results to those illustrated in Fig. 2 are obtained with a wide range of relationships, map lengths and hotspot intensities. Generally, the effect on ibd variance is small, unless hotspot map lengths are large compared with the background and the hotspots are positioned near chromosome ends.

We now consider briefly the more realistic situation, where numerous hotspots are located throughout the chromosome, with varying intensities (crossover probabilities). In this case, the Marey map consists of a mixture of straight line segments (between hotspots) and vertical jumps (at the hotspots). Fixing the total map length of the chromosome constrains the individual jumps to be small. As a result, the Marey map, viewed on a broad scale, is reasonably smooth and only departs from a straight line if there is a systematic trend in either hotspot or background intensity. In the absence of such broad-scale changes, the Marey map will be nearly linear and the discrepancy ratio will be close to 1. Thus, the effect of multiple hotspots changes from fine scale to broad scale as the number of hotspots increases.

Broad-scale variation in recombination rate

A broken-stick model with three segments of equal physical length, but different map lengths, was found sufficiently flexible to describe many types of broad-scale variation in recombination rate.

For the GPO relationship, and a chromosome with map length 2 M, Fig. 3 shows the effect on the discrepancy ratio of different ways of dividing the total map length between the outside segments and the middle segment, (i) when the outside segments have equal map length, and (ii) when one of the two outside segments has zero map length. The maximum disparity occurs when all recombination is concentrated in one end segment. This is in agreement with our results for hotspots, but now the magnitude of the effect is much greater.

Fig. 3
figure 3

With 1, 2, and 3, the proportional map lengths of the left, middle and right segments of a three-segment broken stick (1 + 2 + 3 = 1), the figure shows the effect on the discrepancy ratio (Eq. 6) of changing 1 + 3, and of different ways of partitioning 1 + 3 between the outside segments. GPO relationship, map length 2 M. Lower curve: the effect of changing 1 + 3 while 1 = 3; upper curve, the same, but with no recombination in one of the two outside segments (say 3 = 0). At point A, all recombination is confined to the middle segment, with end segments either completely sharing or completely non-sharing. At point B, there is no recombination in the middle segment, with map length equally divided between the end segments. At point C, all recombination is confined to one of the end segments

The results for other relationships and map lengths are expressed in terms of the three extreme points labelled A, B and C in Fig. 3. For different map lengths and relationships, A, B, and C change position, but the area bounded by the three extreme points retains the same basic shape (Fig. 4). The largest discrepancy ratio (at C) increases as the relationship becomes more remote. Changes in A, B and particularly C become more pronounced as map length is increased. Overall, the potential for an increase in variance is greatest with more remote relationships and longer chromosome lengths. Similar trends with map length and bilineal relationship are obtained in the variance of double sharing (Fig. 5).

Fig. 4
figure 4

Discrepancy ratios (Eq. 6) for Marey maps represented by points A, B and C of Fig. 3, for map lengths 0.5, 1 and 2 M, and a subset of unilineal relationships from Table 1

Fig. 5
figure 5

Discrepancy ratios (Eq. 6) for double sharing for Marey maps represented by points A, B, and C of Fig. 3, using map lengths 0.5, 1 and 2 M, and the bilineal relationships of Table 1

Contrasting results for human and chicken genomes

Figure 6 shows discrepancy ratios for the GPO relationship, separately for chickens, human males and human females. With the human data, both male and female, an ‘efferent’ pattern for every chromosome consistently produced discrepancy ratios >1. In females, the discrepancy ratio was almost constant, never rising above 1.1, but there was a clear positive correlation with chromosome length. In males, variances were greater than those for females on both physical and map scales. Discrepancy ratios were also higher, but with no discernible relationship with chromosome length, apart from relatively small values for the two shortest chromosomes. For the chicken genome, there was a strong positive correlation between the discrepancy ratio and chromosome length, with values exceeding 1.0 for the longer chromosomes, and <1.0 for some of the shorter chromosomes. Similar results were obtained with more remote relationships.

Fig. 6
figure 6

Discrepancy ratios (Eq. 6) for human chromosomes 1–22 and chicken chromosomes 1–15, 17–21, 23, 24 and 26–28. Assumed relationship GPO

For the 25 chicken chromosomes, although the discrepancy ratio for individual chromosomes never exceeded a 15% increase (Fig. 6), the variance of the genome-wide proportion of physical length shared was 41% greater than that on the map scale. The corresponding genome-wide values for the 22 human autosomes were 16% (females) and 33% (males).

Allowing for interference

The linkage parameter associated with a segment of map length is exp(–2) according to Haldane’s function and 1–tanh(2) according to Kosambi’s function. Plotting these curves shows that the linkage parameter for the Kosambi map diminishes to zero more rapidly than that for the Haldane map, such that the Kosambi linkage parameter at map length is about the same as the Haldane linkage parameter evaluated at 1.3. Using this approximation, we estimate that the effect on standard deviation of ibd sharing of changing from the Haldane to the Kosambi map is a reduction of 4%, 7%, 9% and 10% for map lengths of 0.5, 1, 2 and 3 M (based on a constant recombination rate). These values are in close agreement with those found by Hill and Weir (2012). For the human genome, Caballero et al. (2019) estimated an average 11% reduction in SD of sharing due to interference over a range of relationships from full-sibs to second cousins. For the case of non-uniform recombination rate, inspection of Figs. 4 and 5 shows that increasing map length by 30% would cause a small increase in the discrepancy ratio under scenarios A and B, and a large increase with remote relationships under scenario C.

Discussion

Variation in recombination rate within and between chromosomes is a well-recognised feature of the genome. Stapley et al. (2017) reviewed contributing factors, and Ritz et al. (2017) discussed the extent to which variation in recombination rate is itself adaptive. Since the PRDM9 gene was identified, analysis and discussion have been published on its evolution, its role in determining sequence-wide hotspots and (possibly) speciation events (Schwartz et al. 2014). Factors that determine observed chromosome numbers, average recombination rates and distribution over chromosomes have been discussed in the evolutionary literature. It is likely that there are many factors involved, including, for example, epistasis and reproductive isolation. Charlesworth and Charlesworth (2010, pp. 546–561) provide an extensive discussion.

The chicken genome provides an example of substantial heterogeneity in recombination rate both between and within chromosomes (Groenen et al. 2009). The longest chromosomes have a nearly constant rate, except for elevated values at the chromosome ends, where the influence of the higher recombination rate is likely to be small, because a small amount of genome is involved. The shorter chromosomes tend to have higher mean recombination rates than do the long chromosomes and contribute rather little to the overall variance in relationship.

More generally, we have shown that, when recombination rate varies along the chromosome, variation in relationship as measured in terms of the physical length of the genome can differ quite substantially from that measured in map units. The effect is usually an increase in variance, although a reduction is also possible. Among the main influences on the difference are recombination hotspots and cases where much of the recombination generated by meiosis is found towards one chromosome end. The discrepancy ratio is similar over a wide range of relationships, but tends to increase as relationships become more distant (and variances decrease). The effect of the degree of relationship on the discrepancy ratio is the same order of magnitude as the effect of map length or non-linearity in the Marey map. For example, the difference in discrepancy ratio between GPO and 2C1R is similar to the difference between map lengths of 0.5 and 2 M, or the difference between having recombination concentrated at the chromosome centre (scenario A in Fig. 4) or at one chromosome end (scenario C).

We find that the pattern of large-scale variation in recombination rate is more important than small-scale variation in its effect on variance of sharing. For this reason, the broken-stick calculation, which works well with high-density linkage map data, should also give reasonable results even with markers which are more sparsely distributed. However, it is not appropriate for the type of cytological data that has been used to define crossover distributions (e.g. based on MLH1 foci).

Our observations raise the question of which is the more appropriate quantity to analyse, physical or map distance. Map distance does not necessarily reflect parts of the genome with high gene density but low recombination rate, for example. In some applications, map distance may be the appropriate quantity, because it is potentially the best indicator of local linkage disequilibrium. This is relevant when using genomic data to predict genetic merit in livestock and crops (Meuwissen et al. 2001) or complex human diseases (Maier et al. 2018). Here, the issue is how much information about the genotype at trait loci comes from neighbouring markers, and marker linkage is likely to be the most suitable quantity with which to weight observations.

Although in theory the differences between physical and map distances can have a substantial impact on variance of sharing, in practice, the impact seems to be limited. Compared with the vast range of recombination rates among different orders of animals and even species, the effects on variation in relationship are relatively small. Only in rather specific circumstances (e.g. distant relationship combined with a long chromosome in which most of the recombination occurs near one end) is the discrepancy ratio likely to be large enough to be of practical importance.