Introduction

Grey wolf (Canis lupus) was once abundant over much of Eurasia and North America. However, some regional populations have suffered a severe decline in numbers because of habitat destruction and human interference1,2,3, poaching4,5, and human-wolf conflicts6,7. Although the global population is stable8, the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES) list grey wolf in Appendix I (populations of Bhutan, India, Nepal and Pakistan) and Appendix II (other populations)9. The objective of these listings is to restrict international trade in wolves and wolf body parts. The species is also protected by national laws of most range countries.

Culturally, wolf bodies and body parts are often believed to have magical powers10 and medicinal functions11,12,13,14,15. Such unfounded superstitions as well as human-wolf conflicts are factors that drive poaching and trafficking of wolves and body parts in Eurasian countries. For example, media reports from China include at least 35 trafficking cases uncovered by China customs from 2010 to 2018. Seizures included hundreds of skins, carcasses, mounted specimens, and more than a thousand carpal bones and canines being trafficked from Mongolia, Russia, Kazakhstan, and Kyrgyzstan. In attempts to avoid penalties, traders often claimed that the bones and canines were those of large dogs and were sold as counterfeits of wolf body parts. Prosecution of these cases is dependent on definitive subspecies identification.

There are many morphological differences between wolves and dogs. The most apparent differentiation is based on cranial morphology16,17. However, studies on morphological diagnostic characters of other body parts may not result in accurate species identification. DNA-based identification offers a potentially reliable solution to this problem. Current genetic evidence indicates that dog and wolf share a common ancestry as recently as 27,000 years ago18. Domestication led to dogs differentiating gradually from wolves to a level of subspecies, and so named C. l. familiaris19, but their gene pools exhibit great homogeneity and contain detectable historical and recent genetic admixtures20,21,22,23,24. For this reason, coding regions of the mitochondrial genome have either no resolution between wolf and dog (12S and 16S ribosomal RNA genes)25,26, or detectable but poor resolution (Cyt b and COI genes)27. The control region (CR) holds a large number of polymorphic sites in the neutrally evolving hypervariable regions (HV)28,29, where some separation of wolf versus dog have been successful30,31. However, identification based on CR sequence may not be reliable unless additional nuclear markers are combined32,33 especially when the reference sample size is not sufficient to represent total genetic variation34.

In contrast to these sequence-based mitochondrial markers, allelic frequency-based microsatellite markers have been tested and proved effective in this regard. For instance, six autosomal microsatellites were adequate to distinguish Scandinavian wolves from dogs35, and 12 microsatellites achieved a very high power of identification33. To date, numerous microsatellites have been isolated from the canine genome and subsequently characterized and multiplexed36,37,38,39. The results have become a rich and convenient resource for forensic identification of wolf and dog.

SNPs are alternative nuclear markers to microsatellites for population genetic analysis, but often show higher tolerance to DNA degradation and lower analysis cost40. Based on next generation sequencing technology, massive genotyping of SNPs is now possible41,42, allowing deep detection of genetic relatedness between populations. For example, Pilot et al. found widespread and long‐term admixture between grey wolves and domestic dogs across Eurasia by analyzing 61 k SNPs21. Ancestry-informative SNP markers are SNPs that exhibit substantially different frequencies between various populations. A panel of ancestry-informative SNP markers can be used to estimate the proportion of ancestry of an individual wolf or dog derived from each population43,44. These studies highlight the potential of ancestry-informative SNP markers to separate closely related subspecies such as wolf and dog.

In order to provide an effective method for forensic identification of wolves and dogs by using advantages of SNPs assay, we randomly selected 13 dog-wolf ancestry-informative SNP markers identified in vonHoldt et al.43 and Zhang et al.44 and characterized them in samples collected from the Mongolian wolf and domestic dog using a SNaPshot multiplex assay. This system was then validated for forensic identification of the two subspecies following the Society for Wildlife Forensic Science (SWFS) Standards and Guidelines for Wildlife Forensic Analysis—Version 345.

Results

Repeatability of genotyping

All SNP loci in the panel generated clear allele signals. Although the intensity of the signals varied from 1,000 to 40,000 pixels (Fig. 1), all alleles were called without ambiguity. However, the success of one-time genotyping of rs22103787 was 0.926 for wolf but as low as 0.444 for dog (Table 1). As a result, this locus was discarded from the panel. For the remaining 12 SNP loci, the average success of one-time genotyping was 0.990 for wolf (0.938–1.000) and 0.993 for dog (0.972–1.000). The overall average success of one-time genotyping was 0.991 for the two subspecies. The average cost to genotype one sample was around US$ 8.60.

Figure 1
figure 1

Electropherogram of a wolf sample genotyped using the multiplex SNaPshot of 13 loci. Color letters are alleles called for the signals. Letters Y, K, R and M in the name of extension primers are degenerate codes for nucleotide options at SNP sites: Y = C/T, K = G/T, R = A/G, M = A/C.

Table 1 One-time genotyping success of the panel of 13 SNPs in wolf and dog samples.

The success of genotyping was also tested on samples of different quality. The proportion of one-time successfully genotyped loci varied from: 0.83 (10/12, n = 2) to 0.92 (11/12, n = 3) for skins in high states of decay (n = 3) and viscera (n = 2); from 0.92 (11/12) to 1.00 (12/12) for moderately decayed skins (n = 3) and muscles (n = 2); and 1.00 (12/12) for samples in early stages of decay. One-time genotyping results between visceral samples (highly decayed) and skin samples (moderately decayed) were the same (0.92).

Characterization of SNP markers

Only two alleles were observed for each SNP marker for both wolf and dog (Table 2). The difference in frequencies between the two alleles ranged from 0.848 to 1.000 across all loci for wolf, with an average of 0.939 ± 0.061. The difference ranged from 0.281 to 0.856 and averaged 0.656 ± 0.197 for dog. For either allele on a locus, the difference of frequency between wolf and dog populations ranged from 0.640 to 0.889 with a mean of 0.797 ± 0.080.

Table 2 Characteristics of 12 SNP markers in wolf and dog populations.

Assignment accuracy

Euclidean pairwise genetic distance among wolves ranged from 0 to 3.464 with a mean of 1.197 ± 1.066. This range was slightly smaller than that among dogs, which ranged from 0 to 4.899 and averaged 2.980 ± 0.776. The overall intra-subspecific distance was 2.194 ± 1.274. Inter-subspecific distances ranged from 3.464 to 4.959 and averaged 4.686 ± 0.265, significantly greater than intra-subspecific distances (t = 162.552, p = 0.000). Histograms of the frequency distributions of intra-subspecific distances overlapped slightly with those of inter-subspecific distances (Fig. 2). The difference between the upper limit of intra-d and the lower limit of inter-d at 95% confidence was 0.953.

Figure 2
figure 2

Frequency distribution of intra- and inter-subspecific pairwise Euclidean genetic distance (Intra-d and inter-d) for wolves and dogs inferred from the panel of 12 SNPs.

Assignment accuracy was estimated via Monte-Carlo cross-validation with three proportion levels of training individuals (50%, 70% and 90% of individuals from each population) crossed by four proportion levels of training loci (the highest 10%, 25% and 50% Fst loci and all loci). Results showed that this panel of SNPs had great accuracy for wolves when only the 25% highest Fst loci were used for the training that included 50% of individuals. However, similar accuracy could not be achieved unless all 12 loci were applied for 50% and greater proportions of individuals included in training for domestic dogs (Fig. 3). When the whole panel of SNP markers was used, the probabilities of wolf and dog being assigned correctly to their original groups reached 0.99 ± 0.04 and 0.97 ± 0.04, respectively.

Figure 3
figure 3

Assignment accuracies estimated via Monte-Carlo cross-validation. Proportions of individuals used for training were 50%, 70% and 90% from each population (x-axis); Four proportions of loci used for training were the highest 10%, 25% and 50% of Fst loci and all loci (color-coded boxes).

Furthermore, K-fold cross-validation showed that when all 12 SNP loci were used, the membership probability of all wolves and dogs reached nearly 100% for K = 3, 4 and 5 (Fig. S1). All SNP markers were verified 48 times, and the membership probability of domestic dogs being correctly assigned to their subspecies was 0.97 ± 0.05. The membership probability of Mongolian wolves being correctly assigned to their subspecies was as high as 1.00 ± 0.03, providing good supports the use of the assay in forensic practice.

Blind test with known samples

Blind tests were performed using 17 wolves and 18 dogs. All animals were assigned to their original species (17/17 and 18/18). The membership probability of tested dogs being identified as dogs was 1.000 and that for wolf incorrectly identified as dog was 4.978 × 10–28 with standard deviation (SD) 2.112 × 10–27. The membership probability of tested wolves belonging to wolf was 0.9999997 (± 1.023 × 10–6), and that of wolf being incorrectly identified as dog was 2.833 × 10–7 (± 1.023 × 10–6). These data suggest that the probabilities of correct assignment for wolves and dogs were about 107–1028 greater than those of incorrect assignment.

Reproducibility

Reproducibility of the SNaPshot method was tested using 2 wolves and 1 dog using the same protocol repeated 10 times on three different PCR machines. Results showed that all samples were successfully genotyped and the genotypes of each sample at 12 SNP loci were in complete accordance between each repeat and on different PCR machines.

Species-specificity

Genotyping of the SNP panel yielded negative results in 9 of 12 species including human, brown bear, African lion, tiger, sika deer, red deer, moose, Siberian roe deer, and domestic cattle. For the two canid species most closely related to dogs and wolves (Arctic fox and red fox) positive allele signals were detected at 8 loci (66.7% of all loci). Wild boar was the only prey species that showed positive allele signals. Specifically, five loci showed clear positive allele signals, two loci showed negative results, and five loci showed ambiguous signals that differed between two repeated efforts at genotyping (Table 3).

Table 3 Genotyping results of 12 species across the panel of SNPs*.

Sensitivity

Sensitivity test showed that the whole panel of 12 SNP loci was successfully genotyped for all three samples when the final concentration of total DNA in the reaction system reached 6.25 pg/μl. One dog locus failed to genotype when DNA concentration was reduced to 3.125 pg/μl. This did not occur for wolf until DNA concentration further was reduced to 1.560 pg/μl (Fig. 4). This suggested the minimum concentration of total DNA input to guarantee genotyping in this system was 6.25 pg/μl.

Figure 4
figure 4

Percent of successfully genotyped loci in the SNPs panel at serial concentration input of wolf and dog total DNA.

Discussion

Speciation of dog from wolf under natural and artificial selection has led to specific genetic combinations from the ancestral gene pool for favored phenotypes46,47,48. This process produced numerous ancestral informative SNP markers that are highly informative to differentiate wolves from dogs43,44. However, they cannot be incorporated into for forensic practice without systematic validation being performed; this complies with the SWFS Standards and Guidelines for Wildlife Forensic Analysis45. For this purpose, we set up and validated a panel of 12 such SNPs, which generated reproducible and reliable SNP data from a range of sample types, such as highly decayed skins, suitable for incorporation in to forensic practices.

The reproducibility of the genetic markers is essential in forensic practice. Because all SNP loci in this panel were diallelic and showed significantly biased allelic frequency to either wolf or dog (Table 4), each provided differential information and combination of the 12 loci reached a high accumulative discriminant power. Our analysis demonstrated a gap spanning 0.953 between the upper limit of intra-d and the lower limit of inter-d at 95% confidence, indicating a low probability of false positive subspecies assignment. Cross validation showed the assignment accuracy of the whole panel reached 0.99 ± 0.04 for wolf and 0.97 ± 0.04 for dog (Fig. 3), and membership probability nearly reached 100% for all wolves and dogs (Fig. S1). Empirical (blind) tests further showed 100% apparent assignment correctness and the membership probabilities of correct assignment were 107–1028 times greater than those of incorrect assignment. These figures suggest that this panel of SNPs has robust discriminative power between wolf and dog. An individual with the correct assignment membership probabilities fall between 107–1028 could be expected as dog-wolf hybrid.

Table 4 Primers for multiplex amplification of 13 SNP markers.

The second but important concern in forensic practice is the robustness of operation. SNaPshot technology is a single-base extension assay with capillary electrophoresis as its detection system. It offers the advantage of easy integration into operational forensic laboratories without the requirement for any additional equipment51. We chose to use SNaPshot technology for the 12 loci with the intention to allow easy utilization of this panel. Allele signals of all loci were sufficiently clear to designate alleles without ambiguity. Empirical tests on various platforms obtained 100% reproducibility and repeatability. The overall average success of one-time genotyping reached 0.991 with an average cost US$ 8.6 per sample. These results demonstrate that experimental analysis of this panel is robust, reliable and cost effectively.

The ability to generate genetic data from highly compromised samples, as often encountered in wildlife forensic science, is essential such that the resulting data are not prone to false assignment52,53,54,55,56. Ideal amplified fragment size proposed for SNaPshot assay ranges from 60 to 100 bp57. The amplified fragment size we designed for this panel varied between 92 and 323 bp with an average 160.3 bp. This range is not much different from other PCR-based forensic genotyping systems58,59,60, though not reduced to below 100 bp. Our tests on highly decayed samples showed the proportion of one-time successfully genotyped loci greater than 0.83. The sensitivity of normal quality sample was as low as 6.25 pg/μl. These data indicate that this SNP panel is trustable to genotype most evidence if it is not extremely degraded.

Specificity of markers is a double-edged issue for wildlife forensic science. Low specificity is conducive to cross-species utilization61,62, but on the other hand, is prone to genotyping errors raised by DNA contamination63. Therefore, the specificity test here was performed with a view on contamination issues. We observed that this panel did not yield positive genotyping results in human and species of Ursidae, Felidae, Cervidae, and Bovidae, but did generate a genotype from two Canidae species, Arctic fox and red fox at 8 loci. Five loci unexpectedly yielded allele signals from a sample taken from a wild boar of Suidae (Table 5). These results suggest DNA contamination from Canidae and Suidae species may raise a risk of incorrect exclusion. Even though these species are distantly related to wolf and dog in comparison to the closeness between wolf and dog, and the membership probabilities of these contaminants might be extremely low, our data could not determine the risk based on the present sample size. Therefore, caution should be exercised when this method is applied to scenarios such as bite identification of Canidae and Suidae victims. Fortunately, discriminating wolves from dogs is often required after evidence is assigned to wolf-dog group by routine species identification, for example, using cytochrome b27.

Table 5 Extension primers for genotyping of 13 SNP markers.

We note that the reference wolves in this study represented populations in north and northeast China, Mongolia, and the Russian Far East, while reference dogs were all sampled from China. The assignment accuracy should be tested for wolves and dogs outside these regions before this method is more widely applied.

In conclusion, the panel of the 12 SNPs are highly effective for forensic discriminating wolves from dogs with high assignment accuracy. The SNaPshot multiplex system had high genotyping success, reproducibility, sensitivity and tolerance to poor DNA quality. Species-specificity is adequate to avoid DNA contamination from taxa except for Canidae, thus this method should be used only if the evidence is identified as from a wolf-dog group. The established gene frequency database in this study are ready to support forensic identification of wolves and dogs by law enforcement agencies.

Materials and methods

Samples collection

Naturally dried skin and fresh blood samples of 97 Mongolian wolves (Canis lupus chanco) were collected from museums, wildlife safari parks, and zoos with their kind permission. Ninety-six of these wolves were born in the wild of north and northeast China, Mongolia and the Russian Far East. The remaining wolf was born in captivity to parents captured in northeast China. The taxonomy of these wolves was verified according to morphology and breeding records prior to sample collection. We also sampled a total of 108 large dogs, including German shepherd, border collie, mongrel, golden retriever, American Eskimo, and mongrels. For each dog, a stain of K2-EDTA-anticoagulated blood on sterilized gauze was obtained from pet hospitals after receipt of consent from dog owners, dried naturally and preserved at 4 °C for 1 to 20 days until DNA extraction. Eighty wolves and 90 dogs were randomly chosen to set reference groups, and 17 wolves and 18 dogs were used for blind testing.

A total of 15 dog and Mongolian wolf samples from different tissues with different quality were collected for the purpose of a repeatability test of sample in different quality. This comprised: 8 skin samples (slightly decayed, n = 5; moderately decayed, n = 3), 5 muscle samples (moderately decayed, n = 2; highly decayed, n = 3) and 2 viscera samples (highly decayed). All these samples were collected from museums where they were preserved at − 20 °C in refrigerators for years.

Samples were also collected for species specificity testing, including human (Homo sapiens, hairs, n = 4), canid species, viz. Arctic fox (Vulpes lagopus, frozen semen, n = 8) and red fox (Vulpes vulpes, frozen semen, n = 2), brown bear (Ursus arctos, muscle, n = 1), African lion (Panthera leo, muscle, n = 2), tiger (Panthera tigris, muscle, n = 2), and prey species including wild boar (Sus scrofa, muscle, n = 1), sika deer (Cervus nippon, muscle, n = 2), red deer (Cervus elaphus, muscle, n = 2), moose (Alces alces, muscle, n = 1), Siberian roe deer (Capreolus pygargus, muscle, n = 1), and cattle (Bos taurus, muscle, n = 1). Human hair samples were donated by volunteers in our team with the informed consent of all the participants. All other animal samples were provided by the State Forestry and Grassland Administration Detecting Center of Wildlife. All sample collection and experimentation were approved by Northeast Forestry University Institutional Review Board of Ethics and Administration of Experimental Animals (Approval No. 2018001).

SNP genotyping

Total DNA was isolated from each sample using AxyPrep Genomic DNA Isolation Kit (Kang Ning Life Science Ltd., China). DNA was quantified using NanoPhotometer (Implen, Germany) and diluted to 10 ng/μl. Thirteen dog-wolf ancestry-informative SNP markers were selected from Zhang et al. (2014), viz. rs9159232, rs22835438, rs23249721, rs24863098, rs23608542, rs23882488, rs9089629, rs24198287, rs24189603, rs24163825, rs24383001, rs24355642 and rs2210378744,62. Sequence information for these SNP markers is listed in Supplement Information Table S1. All 13 of these ancestry-informative SNP markers revealed great divergence of allelic frequency between wolf and dog in previous tests43,44. Primer pairs were designed based on these sequences (Table S1) for multiplex amplification of fragments, each containing an SNP site (Table 4). The size of amplified fragments ranged from 92 to 323 bp.

All SNP loci were genotyped using the SNaPshot Multiplex System (Applied Biosystems, USA). Multiplex PCR of all 13 markers was set up in a 20 μl system containing 1 × HotStarTaq buffer, 3.0 mM Mg2+, 0.3 mM dNTP, 1 U HotStarTaq polymerase (Qiagen Inc., Germany) and 1 μl total DNA. Final concentration of each pair of primers was 0.05 µM except for rs24163825F/R and rs24355642F/R of which final concentrations were 0.025 µM. Cycling program included 1 cycle at 95 °C for 2 min; 11 cycles at 94 °C for 20 s, 65 °C for 40 s, and 72 °C for 1.5 min; 24 cycles at 94 °C for 20 s, 59 °C for 30 s and 72 °C for 1.5 min; followed by 72 °C for 2 min and 4 °C forever.

Five unit of SAP enzyme (Progema) and 2U Exonuclease I (Epicentre) were added to 10 μl of PCR products. The mixture was incubated at 37 °C for 30 min and then inactivated at 75 °C for 15 min. Amplicons were purified using High Pure PCR Product Purification Kit (Roche Diagnostics Ltd., China). An SNP extension reaction was set up in a 10 μl system containing 5 μl SNaPshot Multiplex Ready Reaction Mix (Applied Biosystems), 2 μl purified multiplex PCR amplicons, 1 μl extension primers mix (0.8 μM of each primer, Table 5) and 2 μl deionized water. Extension reaction was performed using the program: 1 cycle at 96 °C for 1 min; 28 cycles at 96 °C for 10 s, 55 °C for 5 s, and 60 °C for 30 s; followed by 4 °C forever. After cycling was finished, 1 U SAP was added to the reaction system and incubated at 37 °C for 1 h followed by inactivation at 75 °C for 15 min. Subsequently, 0.5 μl of extension product was transferred to a new tube, mixed with 0.5 μl Liz120 size standard and 9 μl Hi-Di (Applied Biosystems), denatured at 95 °C for 5 min, and separated on ABI3730XL DNA analyzer (Applied Biosystems). GeneMapper 4.1 Software (Applied Biosystems) was used to read genotypes of all SNPs. Individuals with failure at one or two loci were re-genotyped by adjusting Tm and template concentration until the genotype was complete.

Separation of pairwise genetic distance

Pairwise Euclidean distances within and between subspecies were calculated based on SNP genotype data of the reference populations using PopGen in R62. The distribution of frequency at each distance interval was visualized using a histogram that was built using SPSS 19.0 (IBM SPSS Statistics, Chicago, USA). 95% confidence intervals (CIs) were computed for within- and between-subspecific distances. The safety of subspecies assignment was then estimated using the difference between the lower bound of 95% CIs for inter-subspecific distances (inter-d) and the upper bound of 95% CIs for intra-subspecific distances (intra-d).

Resampling cross-validation

Assignment accuracy was tested using two resampling cross-validations performed using the assignPOP package in R63. Two hypothetical populations were defined using reference samples of wolf (n = 80) and dog (n = 90). Fst value of each locus was calculated by the assignPOP package in R. Monte-Carlo cross-validation was performed with 50%, 70%, and 90% of randomly selected individuals from each population crossed by the top 10%, 25%, and 50% of high Fst loci, and all loci as training data. Linear discriminant analysis was performed to build predictive models. Each combination of training data was resampled 30 times, for a total of 360 assignment tests (30 resamples × 12 sets of data). The membership probability was tested using K-fold cross-validation. Individuals from each population were divided into K groups. One of the K groups was tested by the predictive model that was built based on the remaining K-1 groups until every group was tested. In this study K was set as 3, 4, and 5. In each group, the top 10%, 25%, and 50% of high Fst loci as well as all loci were sampled as training data. In total, 48 assignment tests were performed for membership probability.

Blind tests of assignment correctness

Seventeen wolves and 18 dogs were used for blind tests based on the predictive function assign.X in R that was built previously using genotype data of reference wolves and dogs63. The naive Bayes model was selected for the assignments. The membership probabilities of blind test samples were obtained and exhibited in a column diagram.

Repeatability and reproducibility test

Based on our preliminary experiments, we used sufficient template input (~ 10 ng) to genotype all wolves and dogs on the same set of instruments. The success of one-time genotyping and the cumulative success of repeat-once genotyping were recorded as indicators of repeatability. Also, one-time genotyping of both dog and wolf samples from different tissues with different quality were calculated for repeatability test. Results of all 3 groups from slightly decayed, moderately decayed to highly decayed samples were included. Two wolves and 1 dog were genotyped 10 times each using the same SNaPshot protocol on three different PCR machines, 3 times on an Eppendorf Mastercycler (Eppendorf, Germany), 3 times on a SimpliAmp cycler (Applied Biosystems) and 4 times on a LifePro cycler (Bioer, China). The identity of the genotype of each sample at 12 SNP loci was recorded.

Specificity test

Specificity tests were performed using the same SNaPshot genotyping protocol for DNA samples of 12 species including human, other carnivore species, and prey species. Each sample was genotyped twice using the same protocol and allele calling criteria as for wolf and dog. Allele signals and genotypes were recorded for each SNP site and sample.

Sensitivity test

DNA samples of two Mongolian wolves (Wolf_#53, Wolf_#58) and a dog (Dog_#4) were arbitrarily chosen and diluted serially to reach final concentration of input DNA 400.0 ng/μl, 200.0 ng/μl, 100.0 ng/μl, 50.0 ng/μl, 25.0 ng/μl, 12.5 ng/μl, 6.3 ng/μl, 3.1 ng/μl, 1.6 ng/μl, and 0.8 ng/μl in the PCR system, and were genotyped for all loci using the same SNaPshot protocol. Positive and negative allele signals were recorded for all dilution gradients and the correctness of genotyping was controlled using the original genotype obtained previously for the three individuals.

Ethics statement

All tissue samples including skins, muscles and semen were provided by the State Forestry and Grassland Administration Detecting Center of Wildlife of China. Blood samples of live wolves and dogs were all deposited remains of serum testing samples for health examination or medical treatments in a wolf safari park and pet hospitals. All collections were approved by animal owners with signed informed consent. The human hair samples were obtained with the informed consent of all the participants. All the experiments were carried out in accordance with the relevant guidelines and regulations of Northeast Forestry University, and the protocols were approved by Ethical Review Board of Northeast Forestry University (Approval Number: 2018001).