Introduction

The origins of dogs in the Arctic are complex. Although the first dogs arrived the Americas ca.12 000 years ago (Leonard et al, 2002), people and their dogs did not settle in the Arctic until ca.4500 years ago (Paleoeskimo) and then again 1000 years ago (Thule) (Morey & Aaris-Sorensen 2002; Morey 2010; Gilbert et al, 2008; Raghavan et al, 2014). It is unclear whether the dogs associated with these two Arctic cultures arrived on the continent in two separate waves of colonization along with their human counter parts (see Raghavan et al, 2014), or if the original Paleoeskimo dogs served as founding stock for the later Thule dogs. It is also possible that Paleoeskimo dogs themselves arose from earlier Native American dogs initially brought to the continent at the end of the last ice age (Leonard et al, 2002). Thus, one of our most basic questions pertained to the roots of indigenous North American Arctic dogs.

Second, it is unclear to what extent today’s Arctic dogs reflect that indigenous ancestry. Throughout most of the Americas, post-colonial dogs largely erased the genetic signatures of pre-historical dogs (Leonard et al, 2002). Today’s Inuit dogs of Canada and Greenland seem to be an exception, most likely the descendants of pre-colonial indigenous dogs (Brown et al, 2013; van Asch et al, 2013). Since European colonization, however, inhabitants of the Arctic have adopted as sled dogs several additional breeds or types, including the Alaskan Husky (created in post-colonial North America), Alaskan Malamute (hereafter, Malamute) and Siberian Husky. Although these types are of similar form and function to the Inuit dog, their genealogical relationships are unclear. The Alaskan Husky of today, which is defined on the basis of (and are bred for) function rather than ancestral ‘purity’, reflects a century or more of admixture (for example, Pointers, Shepherds, Salukis) and selection for performance (Coppinger and Coppinger, 2001; Huson et al, 2010). Malamutes were thought to be created by the Malemiut Inupiaq people of Alaska’s Norton Sound region, and therefore presumably share a common ancestry with Inuit dogs (Handford 1998). The Malamute was used primarily for sledding but is heavier bodied than Inuit dogs, with different markings (Handford 1998). It is unknown to what extent native ancestry has been retained through breed development. The Siberian Husky was originally developed by the Chukchi people of the Chukotka Peninsula in eastern Siberia (Fiszdon & Czarkowska 2008) and was brought to Nome, Alaska, in 1908 for sled-dog sport racing.

Existing genetic work on these dogs provides an incomplete picture. Genomic studies have invariably excluded Inuit dogs (as well as other putative indigenous dogs), but suggest that other arctic spitz-type dogs (Malamutes, Siberian Huskies) form one of the most ancestral groups among the modern-defined breeds (Parker et al, 2004; vonHoldt et al, 2010; but see Larsen et al, 2012). Mitochondrial DNA studies have included a wider range of arctic dog breeds and types, including Inuit dogs, along with village dogs indigenous to a variety of non-Western regions (for example, Pang et al, 2009; Brown et al, 2011, 2013; Sacks et al, 2013); these studies generally support the indigenous ancestry of Inuit dogs but are equivocal with respect to Malamute ancestry. In particular, genetic analysis of 582 base-pairs (bp) of mitochondrial DNA in modern (Savolainen et al, 2002; Pang et al, 2009; Klütsch et al, 2011: van Asch et al, 2013) and ancient (Brown et al, 2013) dogs demonstrated genetic continuity between Thule (pre-colonial) and modern Canadian and Greenland dogs by the continued presence and high prevalence of the North American haplotype A31, not found in modern Siberian dogs. The Malamute has been associated primarily with haplotype A29 (Klütsch et al, 2011; Brown et al, 2013; van Asch et al, 2013), which has been identified in prehistoric dog remains from sites in the Americas, in both Alaskan and Siberian huskies, and in Southeast Asian village dogs and Australian dingoes (Leonard et al, 2002; Savolainen et al, 2004; Brown et al, 2013; Sacks et al, 2013). The occurrence of this haplotype in indigenous populations on both continents renders it equivocal with respect to evidence for proximate origins in North America, but nevertheless supports the ancient origins of the spitz-type group in general.

The only study involving arctic dog Y-chromosomes was restricted to low-resolution sequence variants and included two Inuit dogs, along with two Malamutes, a Siberian Husky, and a Chukotka dog from Siberia (Ding et al, 2011). Both Malamutes and an Inuit dog shared a haplotype otherwise not found in any other of the hundreds of dogs genotyped throughout Eurasia (Ding et al, 2011; Sacks et al, 2013), suggesting that use of a higher-resolution Y-chromosome marker and a larger sample of arctic dogs could help further elucidate their ancestry.

Here, we apply maternal, paternal and biparentally inherited markers to assess the genetic history of today’s North American arctic sled dogs. Our objectives were to (1) test the hypothesis that today’s Inuit dogs have retained indigenous North American ancestry, (2) characterize their relationship to one another and to the more recently imported or constructed arctic dog breeds and (3) estimate the age of North American indigenous matrilines and patrilines. For this last objective, we used a 6024-bp portion of the mitochondrial genome and Y-chromosome microsatellite haplotypes to test coalescent predictions corresponding to three alternative hypotheses about the timing of arrival from Asia: (a) ca.12000 years ago, corresponding to the initial introduction of dogs to the continent (Leonard et al, 2002, b) ca. 4500 years ago, consistent with a Paleoeskimo dog origin, or (c) ca. 1000 years ago, consistent with origin from a subsequent Thule migration from eastern Siberia.

Materials and methods

Samples

We collected buccal swabs from arctic-breed owners and breeders between 1998 and 2010 (sampled by Veterinary Genetics Lab Dog Research Group) and 2012–2013 (sampled by SKB). Altogether, we collected a total of 178 individual buccal swabs from Inuit dogs (Canada and Greenland), Alaskan Husky, Malamute and Siberian Huskies (Table 1). Our sample sizes varied among the types of marker used owing to insufficient DNA quantities for some samples, processing of different batches among markers in variable order, restriction of Y-chromosome analyses to males (obviously) and removal of closely related individuals (based on autosomal microsatellites) for expanded sequencing (Appendix 1 online).

Table 1 Sample sizesa used for data analyses involving hypervariable region I (HVI) mtDNA, expanded mtDNA sequences, Y-chromosome microsatellites and autosomal microsatellites

Laboratory procedures

We extracted DNA from buccal swabs using a sodium hydroxide protocol (Brown et al, 2011) or Qiagen DNeasy kit, according to manufacturer’s instructions. We genotyped and analyzed a total of 24 ISAG (International Society for Animal Genetics) canine autosomal microsatellite loci (AHT121, AHT137, AHTH130, AHTh171, AHTh260, AHTk211, AHTk253, C22.279, FH2001, FH2054, FH2328, FH2848, INRA21, INU005, INU030, INU055, LEI004, REN105L03, REN162C04, REN169D01, REN169O18, REN247M23, REN54P11, REN64E19) as previously described (Pedersen et al, 2013). For comparison in the population tree analysis, we included 12 wolves from Yukon Territory, Canada (Appendix 1 online).

We first sequenced the 582-bp portion of the mitochondrial DNA (mtDNA) hypervariable region I (HVI) using primers H15422 and L16106 (Savolainen et al, 2002). Among individual Inuit dogs found to possess the A31 haplotype, we selected for expanded sequencing a subset (n=10; seven from Canada, three from Greenland) that were maximally unrelated (r<0.22) based on microsatellite analysis (see below) to ensure that we represented the greatest genealogical diversity in the sample set. We targeted 6024 bp of mitochondrial DNA, including 5582 bp identified previously as the most highly polymorphic regions of the dog mtDNA genome (Imes et al, 2012), using primers listed in Supplementary Table S1. The resulting sequence was annotated using the online web server MITOS (Bernt et al, 2013).

For all mitochondrial amplicons, we performed polymerase chain reactions in an ABI 2720 thermal cycler in 23-μl volumes consisting of 3 μl DNA, 1 × PCR buffer, 3 mM MgCl2, 0.2 mM dNTPs, 0.5 μM of each primer and 1 U of Amplitaq DNA polymerase (Applied Biosystems, Grand Island, NY, USA). Conditions were set at 94–96 °C for 10 min, followed by 40 cycles of 94 °C for 15–45 s, 50–56 °C for 30–45 s and 72 °C for 45–60 s, followed by 10 min of extension at 72 °C. Amplicons were purified with EXO-SAP-it (Affymetrix, Santa Clara, CA, USA) as per manufacturer’s protocol and were then sequenced with Big Dye v 3.1 chemistry on an ABI 3730 capillary sequencer. Sequences were aligned to a reference mitogenome (Kim, 1998). Naming convention for HVI haplotypes followed Savolainen et al. (2002).

We genotyped male dogs using five dinucleotide-repeat microsatellites from the non-recombining region of the Y chromosome, including 650–79.2, 650–79.3, 990–35 (Bannasch et al, 2005), MS34A and MS41B (Sundqvist et al, 2001), in two separate multiplex reactions as previously described (Brown et al, 2011). We also genotyped these dogs at 29 Y-chromosome single-nucleotide polymorphisms reported by Ding et al. (2011) using a Sequenom platform with primers and conditions described by Sacks et al. (2013).

Data analyses

We estimated autosomal microsatellite diversity statistics (HO, HE) and allelic richness using Microsatellite Toolkit (Park 2001). We used GENEPOP v. 4.2 (Raymond & Rousset 1995) to test for departures from Hardy–Weinberg equilibrium and linkage disequilibrium and to estimate FST. In tests for Hardy–Weinberg equilibrium and linkage disequilibrium, we adjusted P-values for multiple comparisons using the sequential Bonferroni correction (Rice 1989). We estimated the inbreeding coefficient (FIS) in Arlequin v. 3.5.1.3 (Excoffier et al, 2005), and estimated relatedness among individuals using a maximum likelihood approach implemented in MLrelate (Kalinowski et al, 2006).

To visualize patterns of genomic heterogeneity among individual Arctic dogs irrespective of prior categorization into a particular breed or type, we used a Bayesian model-based clustering approach implemented in program STRUCTURE v. 2.3.3 (Pritchard et al, 2000). We performed 100 independent runs using the correlated allele frequencies model (Falush et al, 2003), including 10 runs each for K=1–10 genetic clusters. Each run was composed of 500 000 Markov Chain Monte Carlo (MCMC) repetitions from which the first 250 000 were discarded as burn-in. The final K (number of groups) was chosen that best fit the data (denoted by X), by plotting the negative in Pr (X|K) versus K (K ranging from 1 to 10). Finally, the K that was supported was then assessed to assure that this value did not sacrifice substantial explanatory power (Pritchard and Wen 2002; Waples and Gaggiotti 2006). To complement this analysis with one based on pre-categorized groups (that is, breeds/types), we also used program Populations (Langella 1999; http:/bioinformatics.org/~tryphon/populations/) to estimate Nei’s DA (Takezaki & Nei 1996) and produce a neighbor-joining tree among populations with node support calculated bootstrapping on loci.

We estimated mitochondrial HVI haplotype diversity and mitochondrial pairwise FST in Arlequin. For expanded mitochondrial haplotypes, ideally, we would have measured the average (and s.d.) numbers of substitutions accumulating between individuals’ haplotypes and the ancestral haplotype (Forster et al, 1996; Saillard et al, 2000). However, for recently derived lineages (for example, <1200 years), it was likely that we would find no mutations in the entire 6024 bp, which prevented use of this approach. Therefore, we turned this question around: instead of asking ‘what is the coalescence time suggested by these data?’ we asked ‘what is the probability that these data (or more extreme values) would be observed if their true coalescence were <1000 years ago?, <5000 years ago?, <12 000 years ago?’ Specifically, we treated the problem as one of decay, whereby the probability (or expected portion) of haplotypes having no mutations decreased with increasing coalescence time. We assumed a substitution rate of 2.5% per million years (Nabholtz et al, 2008), and determined the probability (P) associated with the observed frequency of derived haplotypes (for example, 0) versus ancestral haplotypes (conservatively presumed to be the haplotype at highest frequency) corresponding to a range of the divergence times (1000–12 000 year), and used these values to identify the divergence time corresponding to the critical value at α=0.05. Specifically, we used the binomial formula,

where k was the total number of dogs sequenced (that is, 10), x was the number of these dogs exhibiting the most frequent haplotype (for example, 10), and p varied according to the hypothetical divergence time (Figure 1).

Figure 1
figure 1

Relationship between the expected proportion of mitochondrial haplotypes identical in state to the ancestral haplotype (p) (as opposed to all mutant forms (1−p)) as a function of time since divergence from a common ancestor for the expanded sequence (6024 bp). Filled circles and numeric values on the expanded-sequence curve correspond to two hypothetical time point, 1000 years, approximately when Thule culture and dogs spread east across North America, and 4400 years, corresponding to the Paleoeskimo migration from Siberia. The expected proportion of ancestral haplotypes (p) corresponding to 12 000 years ago (not shown) was 16% (P=0.16). Curves for the hypervariable region I (HVI) fragment (582 bp) and the entire mitochondrial genome (16 000+bp) are shown for reference.

We arranged Y-chromosome microsatellite alleles into five-locus haplotypes, which were nested within haplogroups defined according to single-nucleotide polymorphism haplotypes (hereafter, ‘haplogroups’ to distinguish from the more highly resolved, nested microsatellite haplotypes) and compared with published haplogroups and haplotypes (Brown et al, 2011; Ding et al, 2011; Sacks et al, 2013). We first constructed median-joining haplotype networks and then applied a reduced-median analysis to create the final networks, using program Networks (4.6.1.1.; Bandelt et al, 1995).

Results

Autosomal microsatellites

We observed an average of 4.91 alleles per locus among the 24 autosomal microsatellite loci, with breed-specific averages ranging from 3.38 to 6.58 (Table 2). Otherwise, all 24 loci were polymorphic in all breeds. Inuit dogs (both Canadian and Greenland) exhibited lower heterozygosity than the other breeds (Table 2). Tests of linkage disequilibrium revealed no locus-pairs in any of the breeds to be out of equilibrium. No loci showed significant deviations from Hardy–Weinberg equilibrium in Inuit dogs (both Canadian and Greenland), Alaskan Huskies, or Siberian Huskies. However, CC22.279 showed significant deviations from Hardy–Weinberg equilibrium in Malamutes. Four breeds showed heterozygote deficiency in FIS, when calculated across loci, consistent with substructure within these breeds/types.

Table 2 Autosomal microsatellite statistics, including number of dogs genotyped (n), expected (assuming HWE) heterozygosity (HE), expected heterozygosity s.d., observed (proportion of loci counted in heterozygous state) heterozygosity (HO), observed heterozygosity s.d., number (No.) of alleles, inbreeding coefficient (FIS), FIS P-value

Our admixture analysis indicated the greatest support for K=4 clusters, corresponding to each of the presumptive breeds (Figure 2). The structure appeared hierarchical. At K=3, the Alaskan and Siberian Huskies clustered together. At K=2 all Inuit dogs grouped in one cluster versus Alaskan Huskies, Malamutes and Siberian Huskies in another.

Figure 2
figure 2

Admixture analysis based on 24 autosomal microsatellites of four Arctic dog breed/types, including (a) the average (±s.d.) estimated log probability of the data (Pr(X|K)) versus the specified number of genetic clusters (K) calculated from 10 replicate runs for each K value and (b) bar plot illustrating for each dog the ancestry (q) apportioned to each of the K=2–4 genetic clusters, arranged in breed groups. The most parsimonious number of clusters supported by the increase in Pr(X|K) per increase K(A) was K=2 and the maximal partition supported by these criteria was K=4.

Similarly to the admixture analysis, a population tree based on Nei’s DA indicated clustering of all Inuit dogs from Canada and Greenland with 100% bootstrap support and clustering the Malamute with 84% bootstrap support with this group, and the Yukon wolves and Siberian Husky with 64% bootstrap support (Supplementary Fig. S1). The pairwise FST estimates additionally indicated that Alaskan Huskies were more closely related to both Siberian Huskies and Malamutes than either of the latter breeds were to one another, consistent with high contributions from Siberian Huskies and Malamutes to the Alaskan Husky gene pool (Table 3).

Table 3 Microsatellite pairwise FST (above diagonal) estimates calculated from 24 autosomal microsatellites among four Arctic breeds. Below the diagonal are the mitochondrial HVI DNA FST estimates among four Arctic breeds. Sample sizes (in italics) for microsatellite FST are on the right side of the ‘\’ and the mitochondrial FST on the left hand side

Mitochondrial DNA

We obtained 88 HVI sequences (~582 bp) that represented 12 haplotypes from four arctic breeds (Table 4). Mitochondrial haplotype diversity was relatively high in Alaskan Huskies, Malamutes and Siberian Huskies, and lower in Inuit dogs (Table 5). The Inuit dogs were the only group to possess haplotype A31. Inuit dogs also possessed another private type, A181 (n=2). Haplotype A29 occurred in all breeds. As with the microsatellite analyses, pairwise mitochondrial FST comparisons indicated considerable differentiation (FST>0.42) between haplotype frequencies of Inuit dogs versus Alaskan Huskies, Malamutes and Siberian Huskies (Table 3). Inuit dogs had similarly low haplotype diversity (Table 5) relative to the higher diversity observed in the other three breeds.

Table 4 Numbers of individuals of four modern Arctic sled dog breeds/types exhibiting 13 mitochondrial HVI haplotypes (582 bp), with reference to previous identification in Arctic or Spitz type breeds
Table 5 Mitochondrial HVI DNA diversity statistics, including number of individuals sequenced (n), number (No.) of haplotypes, haplotype diversity and nucleotide diversity

Among individual Inuit dogs with A31 HVI haplotypes, we selected the 10 (seven from Canada, three from Greenland) least closely related individuals in our data set for expanded mtDNA sequencing (Supplementary Table S3). We detected no polymorphisms among these individuals in the 6024-bp fragment, which we deposited in Genbank (Accession No. KM821048). A Basic Local Alignment Search Tool (BLAST) search revealed no other matching haplotype and differed by the nearest completely overlapping haplotype by 11 substitutions (0.18%; Thalmann et al, 2013; Genbank No. KF661086). This haplotype was a 1000-year-old dog from Florida with the corresponding HVI haplotype matching A3, which is also the nearest known HVI haplotype basal to A31 (Pang et al, 2009). The divergence time corresponding to the observed frequency of haplotypes (that is, 10 ancestral, 0 mutant) at P=0.05 was estimated at 1725 years, indicating that this was the oldest statistically supported time since divergence from a common ancestor. In particular, the P-values corresponding to 1000 years, 4400 years and 12 000 years were P=0.21, P=0.001 and P=9 × 10−9, respectively, providing support for coalescence on the Thule timeframe and evidence to reject coalescence on both the Paleoeskimo and late Pleistocene timeframes. Because these P-values assume a mutation rate of 2.5%, we also explored sensitivity to this assumption. We determined that the 4400-year coalescence timeframe would not have been statistically plausible (that is, P > 0.05) for any mutation rate greater than 1% per site per million years, and the 1000-year timeframe would remain within statistical plausibility for mutation rates up to 4.8% per site per million years, indicating that our conclusions about the coalescence timeframe apply as long as the true substitution rate falls within a window of 1–4.8%.

Y-chromosome DNA

We genotyped the 5 Y-chromosome microsatellites and 29 single-nucleotide polymorphisms of 71 male dogs and observed 14 Y-chromosome microsatellite haplotypes, a similar number to the number of mtDNA haplotypes. In contrast to mtDNA haplotypes, however, Y-chromosome haplotypes tended to be specific to a particular breed, with fewer haplotypes shared among breeds (Table 6). Sharing of Y haplotypes among breeds/types also differed from the mtDNA (Figure 3). For example, Inuit dogs from Canada and Greenland did not share Y-chromosome haplotypes, whereas Inuit dogs from Canada and Malamutes shared their most prevalent haplotype (H22-n33). Alaskan Huskies and Malamutes shared a single haplotype as did Malamutes and Siberian Huskies.

Table 6 Y-chromosome types, composed of haplogroups (defined by SNP haplotypes, beginning ‘H’) and microsatellite haplotypes (numerals and lower-case letters), in four Arctic breeds, and comments on whether microsatellite haplotypes were previously reported in Western breed dogs (WBD), Southeast Asia village dogs (SEAVD), other breeds, or were previously unreported (Novel)
Figure 3
figure 3

Y-chromosome haplotypes of 67 Arctic from four breeds/types in the present study organized in networks with previously published (Gray and white haplotypes-Brown et al. 2011; Sacks et al. 2013) haplotypes of East Asian village dogs (n=144), New Guinea Singing Dogs (n=18), Australian Dingoes (n=94), Iranian village dogs (n=135), African village dogs (n=3) and western breed dogs (n=79), illustrating (a) Arctic breed dogs that group into haplogroups H1-H22 (Southeast Asian haplogroup) and (b) Arctic breed dogs (Greenland dogs) that group into haplogroup H23.

The 14 Y-chromosome microsatellite haplotypes were distributed among seven haplogroups, which facilitated comparison with previous studies, some of which only examined haplogroups (Ding et al, 2011). Greenland dogs exhibited haplotypes in the H23 and H23* haplogroup, which was identified in dogs throughout the Middle East and East Asia, but corresponded to particular microsatellite haplotypes that had not been previously observed (Figure 3). Haplogroup H20, which we observed in Alaskan Huskies, Malamutes and Siberian Huskies, was previously reported only in a single Chukotka dog from Siberia (Ding et al, 2011) and a mixed-breed dog in the USA (Sacks et al, 2013). Haplogroup H22, which we observed in Inuit dogs and Malamutes, was previously reported also only in North American Arctic dogs and in these two breeds/types (Ding et al, 2011). The Alaskan Huskies, which exhibited the highest Y-haplotype diversity, carried haplotypes in the H1/H2 haplogroup, which is closely associated with European breeds (Brown et al, 2011; Ding et al, 2011; Sacks et al, 2013). Four Siberian Huskies also possessed haplotypes in the H1/H2 haplogroup. Haplotype n65, found in an Inuit dog, differed from the basal n33 haplotype by a single mutation, suggesting an erroneous placement of this haplotype in the single-nucleotide polymorphism-structured network.

Discussion

We undertook the most comprehensive study to date of the genomic relationships among dogs of the North American Arctic. Ours was the first to assemble multilocus autosomal data on a substantial number of Inuit dogs, and consequently to enable the assessment of their contemporary relationships with one another and to more recently imported or created breeds/types. Moreover, our use of mtDNA and Y-chromosome markers provided an opportunity to assess ages of founder lineages on the basis of mutational accumulation.

Because previous ancient mtDNA analysis demonstrated genetic continuity between pre-colonial (~750 BP) Alaskan and Greenland dogs and modern Inuit dogs from Canada and Greenland, we knew at the outset of this study that the dominant maternal lineage composing these two arctic breeds was indigenous (Brown et al, 2013). However, the use of strictly maternally inherited DNA was insufficient to draw conclusions about the overall genomic constitution of these dogs and provided no information on paternal contributions, for example, from post-colonization European/Siberian dogs in these putative indigenous breeds. Ideally, we could have compared nuclear markers directly between ancient (pre-colonial) and modern samples. However, recovery of sufficient amounts of nuclear DNA from ancient samples was not feasible. Therefore, we instead relied on nuclear DNA solely from modern Inuit dogs, in reference to that of the known post-colonial North American arctic dogs. Under the hypothesis that Inuit dogs maintained their original pre-colonial indigenous ancestry, we predicted that they would exhibit greater genetic distance from the post-colonial breeds/types than from one another, that the post-colonial breeds would cluster together relative to the putative indigenous breeds, that the putative indigenous breeds would exhibit distinct paternal (and maternal) ancestry and that the mutational accumulation in apparently North American-endemic maternal and paternal lineages would reflect a timeframe consistent with pre-colonial origins.

All markers supported these predictions, consistent with the indigenous ancestry of Inuit dogs. Both mtDNA and autosomal microsatellites indicated no detectable genomic differentiation between Inuit dogs from Canada and Greenland, and that they were distinct from Siberian Huskies, Alaskan Huskies and Malamutes, all of which clustered together.

The Y-chromosome findings were less straightforward, but nevertheless were consistent with indigenous patrilines in the two putative indigenous breeds in that the microsatellite haplotypes clustered together but reflected mutational accumulation consistent with pre-colonial origins (see below). Haplogroups and/or haplotypes associated with these breeds also were distinctive and not previously reported in any other dog breed or type (except for Malamutes, but see below; Brown et al, 2011; Ding et al, 2011; Sacks et al, 2013).

In contrast to the other markers, however, Inuit dogs from Canada versus Greenland carried different Y chromosome in distinct haplogroups indicating that at least the modern representatives reflected distinct paternal lines. This observation could reflect recent breeding practices. Although ultimately derived from Canadian or Greenlandic Inuit-owned dogs, all of our samples were supplied by individuals who breed these animals with the intent of preserving their ancestry but who control mate-pairings in a way that is not necessarily reflective of their historical development in the context of Inuit (and Thule) culture. In Asian randomly breeding village dog populations, as with wild canid populations, paternal and maternal lines tend to be similarly heterogeneous (Brown et al, 2011; Sacks et al, 2008, 2013). However, post-Victorian western breeding practices tend to use many fewer males than females in the creation of breeds, presumably because males can sire many more litters than females can produce in a lifetime (Sundqvist et al, 2006). This practice has resulted in a pattern whereby mtDNA haplotypes tend to be distributed widely across breeds but Y-chromosome haplotypes tend to be much more breed-specific (Bannasch et al, 2005; Brown et al, 2011; Pedersen et al, 2013). In addition, it is possible that the difference between a Canadian versus Greenland Inuit dog is defined solely on the basis of the sire, which would explain the contrasting patterns observed in the Y-chromosome and mtDNA between these breeds. In fact, the inability to differentiate them with the 24 autosomal microsatellites implies that they have not been systematically bred separately (at least not for many generations) despite the correspondence between nominal country of origin and paternal haplotype. Thus, we conclude that the Inuit dog and Greenland dog are synonymous and simply contain two paternal haplogroups.

Our findings were also informative about the post-colonial North American breeds. Microsatellite, mitochondrial and Y-chromosome markers indicated a close relationship among Alaskan Huskies, Malamutes and Siberian Huskies. As expected on the basis of their documented breed history during and since the Alaskan Gold Rush (for example, Coppinger and Coppinger, 2001; Huson et al, 2010), Alaskan Huskies appeared genetically intermediate between Malamutes and Siberian Huskies, most likely because of these breeds contributing to their founding, and also exhibited numerous mitochondrial and Y-chromosome haplotypes previously identified in Siberian Huskies and among a variety of European breed dogs (for example, Pang et al, 2009; Klütsch et al, 2011; Brown et al, 2011, 2013; van Asch et al, 2013).

In contrast, Malamutes did not share any of their Y-chromosome haplotypes with western breeds, and, although generally less breed-specific, their mitochondrial haplotypes were all previously reported in other Arctic or Spitz type breeds, suggesting that they retained ancient Arctic dog ancestry. The Y-chromosome haplotype that was common to the Malamute, Alaskan Husky and Siberian Husky (7e) was subsequently found also in two Chukotka sled dogs from Siberia (unpublished data), confirming very recent (that is, post-colonial) shared ancestry of these breeds with dogs indigenous to Asia. The close clustering of Malamutes and Siberian Huskies (and Alaskan Huskies) based on microsatellites and mtDNA relative to Inuit dogs also was consistent with a primarily post-colonial derivation.

Lastly, the finding that both uniparentally inherited marker types from Inuit dogs were indigenous allowed us to use these markers to investigate the timing of their arrival from Siberia. On the basis of the expanded mitochondrial sequences corresponding to the HVI haplotype, A31, our statistical tests over a reasonable range of potential mutation rates enabled us to strongly reject the hypotheses that these maternal lines traced their coalescence back to either the late Pleistocene or Paleoeskimo timeframes and invariably supported a timeframe corresponding to the arrival and expansion of Thule culture across the North American Arctic. An important caveat of this analysis is that coalescence need not necessarily imply origin. It is conceivable that a major bottleneck would have resulted in loss of several ancestral A31 matrilines leaving only a single one corresponding to the contemporary population. However, we think this unlikely for a couple reasons. First, although a historical culling of Canadian Inuit dogs substantially reduced their numbers across Canada in the late 1900s (Montcombroux, 2002), this was geographically localized and did not affect Greenland. That we found the same haplotype in the 10 most unrelated individuals across in our broad sample suggests it more likely that the lack of ancient coalescence reflects the bottleneck associated with the initial founding from dogs brought over a millennium ago by the Thule. Nevertheless, mitogenomic studies done directly on pre-colonial material are needed to confirm this conclusion.

We have demonstrated the relationship of contemporary Arctic breeds through the use of maternal, paternal and autosomal markers. Inuit dogs from Canada and Greenland appear to compose a single population despite reflecting different paternal lineages, and are indigenous to pre-colonial North America. Our findings further provide preliminary evidence that these indigenous Inuit dogs derived from ancestors brought to North America by Thule migrants from Siberia and not from the earlier dogs occasionally found with Paleoeskimo cultures or those initially brought to North America initial movements in the late Pleistocene.

Data Archiving

Data available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.46170.