Genomic regions under selection in the feralization of the dingoes

Zhang, Shao-jie; Wang, Guo-Dong; Ma, Pengcheng; Zhang, Liang-liang; Yin, Ting-Ting; Liu, Yan-hu; Otecko, Newton O.; Wang, Meng; Ma, Ya-ping; Wang, Lu; Mao, Bingyu; Savolainen, Peter; Zhang, Ya-ping

doi:10.1038/s41467-020-14515-6

Download PDF

Article
Open access
Published: 03 February 2020

Genomic regions under selection in the feralization of the dingoes

Shao-jie Zhang ORCID: orcid.org/0000-0003-1079-9680^1,2^na1,
Guo-Dong Wang^1,3^na1,
Pengcheng Ma¹^na1,
Liang-liang Zhang⁴,
Ting-Ting Yin¹,
Yan-hu Liu¹,
Newton O. Otecko¹,
Meng Wang²,
Ya-ping Ma²,
Lu Wang²,
Bingyu Mao^1,3,
Peter Savolainen⁴ &
…
Ya-ping Zhang ORCID: orcid.org/0000-0002-5401-1114^1,2

Nature Communications volume 11, Article number: 671 (2020) Cite this article

11k Accesses
44 Citations
101 Altmetric
Metrics details

Subjects

Abstract

Dingoes are wild canids living in Australia, originating from domestic dogs. They have lived isolated from both the wild and the domestic ancestor, making them a unique model for studying feralization. Here, we sequence the genomes of 10 dingoes and 2 New Guinea Singing Dogs. Phylogenetic and demographic analyses show that dingoes originate from dogs in southern East Asia, which migrated via Island Southeast Asia to reach Australia around 8300 years ago, and subsequently diverged into a genetically distinct population. Selection analysis identifies 50 positively selected genes enriched in digestion and metabolism, indicating a diet change during feralization of dingoes. Thirteen of these genes have shifted allele frequencies compared to dogs but not compared to wolves. Functional assays show that an A-to-G mutation in ARHGEF7 decreases the endogenous expression, suggesting behavioral adaptations related to the transitions in environment. Our results indicate that the feralization of the dingo induced positive selection on genomic regions correlated to neurodevelopment, metabolism and reproduction, in adaptation to a wild environment.

Comparative genomics provides new insights into the remarkable adaptations of the African wild dog (Lycaon pictus)

Article Open access 06 June 2019

Daniel E. Chavez, Ilan Gronau, … Robert K. Wayne

Genomic signatures of convergent shifts to plunge-diving behavior in birds

Article Open access 24 October 2023

Chad M. Eliason, Lauren E. Mellenthin, … Shannon J. Hackett

Analysis of dog breed diversity using a composite selection index

Article Open access 30 January 2023

Wei-Tse Hsu, Peter Williamson & Mehar Singh Khatkar

Introduction

Domestication is the process when a wild species is bred in captivity and modified by artificial selection, becoming phenotypically and genetically distinct from the wild ancestor^{1,2,3,4,5,6,7}. Feralization is, in a sense, the reverse process, when a domestic species escapes human control, adapts to the wild through natural selection, and diverges from the domestic ancestor into a genetically distinct population⁸. In the shift from artificial to natural selection, feralization is accompanied by phenotypic changes resulting in a phenotype closer to that of the original wild ancestor than to the domestic type. For instance, feralized rodents tend to look more like wild than domestic rodents⁹, feral chicken on the island Kauai have increased brooding like wild Red Junglefowl¹⁰, and the dingo’s hunting and social behavior is more similar to that of the wolf than of the dog^11,12,13. In plants, weedy rice (a feralized rice population) has a closer semblance to wild than to domestic rice for several growth characters^14,15,16.

Although the feralization process has aroused considerable research interest, only limited research about the genomic mechanisms involved in this phenomenon has so far been presented. A major obstacle for such studies is that, in most cases, the feral populations are not isolated from the wild and/or domestic ancestors, implying a problem to distinguish genetic change caused by feralization from change caused by crossbreeding with the ancestral populations. So far, only two comprehensive studies of genomic changes under feralization have been performed, on feral chicken on Kauai, and on Chinese weedy rice^10,15. The research on feral chicken shows adaptation of genes associated with sexual selection and reproduction but suggested that feralization and domestication mostly target different genomic regions¹⁰. Similar conclusions are reached concerning the feralization of Chinese weedy rice¹⁵, suggesting convergent evolution of different weedy types but little overlap of genes under selection in the domestication and feralization processes. However, both these studies have problems to distinguish genetic change caused by feralization from change caused by crossbreeding with the ancestral populations.

The dingo (Canis dingo) is a wild canid native to Australia, and its apex predator¹⁷. It originates from domestic dogs but has, since it arrived at least 3500 years ago, developed into a phenotypically and genetically distinct population of feral dogs. Its appearance is similar to the domesticated dog but there are big differences in its behavior^18,19. Like the wolf, the dingo is a predominantly meat-eating omnivorous animal, and lacks the expansions of the alpha-amylase locus giving improved starch digestion in dog lineages that are associated with agrarian societies^12,20,21. Although not a variant associated with domestication²², it differentiates dingoes and wolves from most domestic dogs. Dingoes hunt in the wild, can catch and kill large prey such as kangaroos, cattle, water buffalos, and wild horses, and use the same tactics as their wild ancestor, wolves, to hunt the large prey¹¹. While young dingoes are often solitary, adults often form a settled group, and the dingo’s social behavior is as flexible as that of a coyote or gray wolf^13,23. The dingo is an ideal and unique model for studying the evolutionary and genomic mechanisms of feralization, because of two features. Firstly, the dingo population has a longer history of feralization than any other animal, since their arrival in Australia at least 3500 years ago^24,25. Secondly, the dingoes have been isolated from both their domestic and their wild ancestor during this feralization except the last 200 years, because of Australia’s position outside the natural range of wolves and its isolation until the arrival of Europeans. Today, dingoes and European breeds hybridize, especially in the Southeast, but in most other regions hybridization is limited^26,27,28,29. Therefore, unlike the feral chickens of Kauai island and weedy rice, the dingoes have not experienced hybridization with ancestral populations which may complicate the deciphering of the genomic mechanisms of feralization.

In the present study, we sequence the nuclear genomes of 10 dingoes from across Australia and 2 New Guinea Singing Dogs (NGSDs; wild canids living in highland New Guinea), and retrieve the genomes of a worldwide representation of 78 dogs and 21 wolves from literature. Based on this, we analyze population structure and phylogenetic structure to assess the detailed demographic history and migration route of dingoes. The results show dingoes originated from domestic dogs that migrated to Australia approximately 8300 years ago. We perform selection scans to decipher the genomic mechanisms of natural selection under feralization and to reveal the correlation between selection in domestication and feralization.

Results

Sample collection and whole-genome sequencing

Ten dingoes and two NGSDs were sequenced for the current study. The samples of dingoes have a wide distribution across Australia (Fig. 1a), and the two NGSDs are from the NGSD Conservation Society stud book. After DNA extraction, individual genomes were sequenced to an average of 14.7×. We also retrieved 97 canine whole-genome sequences from published articles^{3,21,30,31,32,33}, which involved 1 dingo, 1 Taiwan village dog, 43 indigenous dogs from China and Vietnam, 19 individuals from various breeds, 4 village dogs from Africa, 6 Indian village dogs, 3 village dogs from Indonesia, 3 village dogs from Papua New Guinea, and 21 wolves from across Eurasia (Supplementary Data 1). Downloaded data have a high quality and an average sequencing depth of 14.6×. Overall, the dataset covers all major dog and wolf groups³⁴ that are putative ancestors of dingoes. Raw sequence reads were mapped to the dog reference genome (Canfam3) using the Burrows-Wheeler Aligner (BWA)³⁵. DNA sequence analysis was done using the Genome Analysis Toolkit³⁶. After strict filtering, we identified ~24.7 million autosomal SNPs for further analysis (see details in the Methods).

**Fig. 1: Population structure and genetic diversity of 109 canids.**

Population structure and phylogenetic analysis

Principal component analysis (PCA) of the 109 individuals was performed to explore the relationships among dingoes, NGSDs, and other canids. In a two-dimensional plot of the genotypes, there is a clear separation in three groups: wolves, dogs and dingoes/NGSDs. Dogs can be divided into two basic groups: dogs from Europe and indigenous dogs from Asia. All dingoes and NGSDs cluster together tightly, on a relatively large distance from the dogs. Thus, the dingo and NGSD populations are genetically clearly distinct from domestic dogs. Among the dogs, Indonesian village dogs cluster closest to the dingoes/NGSDs, followed by indigenous dogs from southern East Asia (South China) (Fig. 1b). We then analysed the 10 dingoes and 2 NGSDs separately, to explore their detailed structure (Fig. 1c). The two-dimensional plot separates NGSDs from the dingoes. The dingoes cluster together, but are distributed in three sub-clusters in accordance with geographical origin: Southeast, West/central and Northeast Australia. This suggests that there are subpopulations within the dingo population.

To explore the genetic relationships among the 109 individuals, we also performed a structure analysis using the expectation maximization (EM) algorithm in ADMIXTURE to cluster the individuals into different numbers of groupings³⁷. Partitioning the individuals into four groups gave least cross-validation error (0.31132), and separated the samples into: (i) wolves, (ii) dingoes and NGSDs, (iii) indigenous dogs from southern East Asia and Indonesia, and (iv) breeds and village dogs from other regions (Fig. 1d, Supplementary Figs. 1, 2). We find an admixture of these four components with varying proportions among indigenous dogs from northern China, some dog breeds and the village dogs from India and New Guinea, consistent with the results of the PCA. Notably, two dingoes, D05 and D06, show a mixture indicating hybridization with European breed dogs, and D05 originate from the region in Australia with highest incident of dingo-dog hybridization (the Southeast)^26,27,28,29. Therefore, we performed D-statistics analysis using qpDstat in the Admixtools software package³⁸ to test events of gene flow between the dingoes and European breeds in the form of D (Dhole, European breed; Pop1, Pop2), where Pop1 was all dog groups tested in turn and Pop2 was each individual dingo. Interestingly, the results varied depending on the tested Pop 1 (Supplementary Fig. 3). When Pop1 was NGSDs, there were seven dingoes showing significantly positive D (Z>3): (D00 (Z = 8.457), D01 (Z = 9.394), D03 (Z = 5.338), D05 (Z = 14.551), D06 (Z = 10.221), D07 (Z = 8.888), and D08 (Z = 5.423)). However, when Pop1 was any of the other populations (Indonesian village dogs, indigenous dogs from southern China, Taiwan indigenous dog, indigenous dogs from north China, Indian village dogs, African village dogs, respectively), there was significantly negative D (Z < −3) in all cases (Supplementary Fig. 3). We then used NGSDs as Pop2, testing D (Dhole, European breed; Pop1, NGSDs), where Pop1 was all dog groups and the dingoes tested in turn. This gave significantly negative D in all cases (Indonesian village dogs: −23.2, indigenous dogs from southern China: −27.8, Taiwan indigenous dog: −34.1, indigenous dogs from north China: −42.0, Indian village dogs: −34.7, African village dogs: −28.0 and dingoes: −8.8), but it is notable that dingoes had considerably less negative value than all other populations. This indicates that all dog populations show higher affinity to European breed dogs than the dingoes, except the NGSDs which have the lowest affinity to European breeds. Therefore, we made another D-statistics analysis, where Pop1 was all dingo individuals tested in turn and Pop2 was all other dingo individuals tested in turn. This showed that when three of the dingoes (D01, D05, and D06) were Pop2, they had significantly positive D (Z > 3) compared to most other dingoes and no significantly negative D (Z < −3) in any comparison (Supplementary Data 2), suggesting that these three dingoes could have a gene flow with European breeds. We also performed an additional D-statistics using the red fox³³ as outgroup and obtained very similar results (Supplementary Data 2, Supplementary Fig. 4).

We further performed phylogenetic analysis by the Neighbor-Joining (NJ) approach (Fig. 1e, Supplementary Fig. 5). The result matches the observations from the PCA and structure analysis. First, dogs and dingoes/NGSDs separate from the wolves, and then they further split into two clades, one including dingoes and NGSDs together with indigenous dogs from Indonesia, southern East Asia and Taiwan, and the other including village dogs and breeds from all other regions. Indonesian village dogs are closest to the dingoes and NGSDs, and dogs from southern East Asia are basal to the clade. This suggests that indigenous dogs from southern East Asia may be the ancestors of dingoes. Notably, India has been suggested as a possible origin for the dingo but, similarly to the PCA, the dogs from India cluster in the second clade, far from the dingoes^39,40. We also performed phylogenetic analysis by the Maximum-Likelihood approach (Supplementary Fig. 6), obtaining consistent results. We further used the qp3pop program³⁸ to perform outgroup f3-statistics analysis in the form of f3(Dingoes, Pop2; Dhole)^38,41, to assess the relative genetic similarity of the dingo population to the other populations (Supplementary Table 1). The highest value of the f3-statistics (indicating highest degree of shared genetic history) was obtained for the NGSD population, followed by Indonesian dogs, and indigenous dogs from southern China. We also performed TreeMix analysis (Supplementary Figs 7, 8). The topology is consistent with the aforesaid phylogeny constructed by the NJ and ML approaches, and indicates a single admixture event: from the dingo/NGSD clade to the Papua New Guinea village dog lineage. All these results agree in suggesting that indigenous dogs from southern East Asia were the ancestors of dingoes.

Notably, the branch to dingoes and NGSDs is relatively long. We therefore estimated nuclear diversity using the genetic diversity θ_π, grouping individuals into six dog populations. The result shows that dingoes had the lowest diversity of the six dog populations (Supplementary Fig. 9). This suggests a severe bottleneck event in the evolutionary history of dingoes, or long periods of isolation with low effective population size, which may explain the long phylogenetic branch^39,42.

Demographic and migration histories

Based on the results from the TreeMix, phylogeny, and Outgroup f3 analyses, the indigenous dogs from southern East Asia are plausible ancestors of dingoes and NGSDs, with the Indonesian village dogs as the most closely related population. To study the migration and demographic history of the dingoes, we performed a demographic analysis using G-PhoCS⁴³. We repeated the computations three times by randomly picking 1000 neutral loci and randomly selecting three samples among the dingoes (Supplementary Table 2) and set the gene flow between southern East Asia dogs and Indonesian dogs. Based on a mutation rate of 1.3 × 10⁻⁹ per site per year⁴⁴ and a generation time of 3 years^40,41,42, our analysis indicates that the split between dingoes and Indonesian village dogs occurred around 8300 (CI: 5400–11,200) years ago and that, before that, Indonesian village dogs diverged from the indigenous dogs from southern East Asia around 9900 (CI: 6500–12,700) years ago (Fig. 2a, Supplementary Data 3). Furthermore, we performed a second round of G-PhoCS analysis replacing the two dingoes indicated to be admixed with dogs (D05 and D06) with D01 and D08, respectively (Supplementary Table 3), giving results consistent with the first analysis (Supplementary Data 4). The G-PhoCS analysis also shows that the dingo population has a very small effective population size compared to the dog populations. We also used smc++ employing unphased whole genomes to infer population history⁴⁵. This analysis approximated the split between Indonesian village dogs and dingoes at around 9100 years ago, in consistence with the result of G-PhoCS. We also used smc++ to estimate dates for the population history of dingoes/NGSDs and dogs. The result shows that the dog population experienced a slight growth after the population split, while the dingo and NGSD populations suffered a decrease (Fig. 2b), followed by an increase possibly reflecting the expansion into the new ecological niches in Australia and New Guinea. Notably, the NGSDs show a severe decrease in more recent times followed by a sharp increase. This is consistent with the history of the western population of NGSDs (bred outside New Guinea the last 60 years), which originates from very few individuals.

**Fig. 2: Demographic history of dingoes and dogs from southern East Asia.**

Mitochondrial genome analysis

We also performed phylogenetic analysis based on mitochondrial genomes, analyzing totally 35 dingoes and 3 NGSDs, the 10 dingoes and 2 NGSDs sequenced in this study and 25 dingoes and 1 NGSD from Cairns et al.⁴⁶, in the context of 169 dogs and 8 wolves from across the Old World from Pang et al.⁴⁷. We constructed a phylogenetic tree showing all dingoes and NGSDs to group into a single branch, separated from all domestic dogs except one, a dog (A103 10002) originating from Hunan in South China (Fig. 3a). The dingo/NGSD branch is part of the major domestic dog haplogroup A, to which approximately 75% of domestic dogs worldwide belong. Haplogroup A has six sub-haplogroups, and the dingo/NGSD branch is part of sub-haplogroup a2, which is frequent in dogs originating from across East Asia but absent in western Eurasia⁴⁷. Notably, of the eight dogs clustering closest to the dingo/NGSD branch, seven were from Mainland or Island Southeast Asia and one from East Siberia. These results indicate that dingoes and NGSDs originate from domestic dogs in Southeast Asia, via Island Southeast Asia, and that dingoes and NGSDs are closely related, as earlier suggested^24,42,46,48.

**Fig. 3: Phylogenetic and demographic history analysis of mtDNA.**

To study the detailed phylogeny among dingoes and NGSDs we created a sub-dataset including all individuals in the dingo/NGSD branch and the three most closely related dogs (yellow box in Fig. 3a), and constructed new phylogenetic trees (Fig. 3c, Supplementary Figs. 10, 11). These trees show a division of dingoes into two main branches, following a geographical distribution earlier reported by Cairns et al.⁴⁶; all dingoes from Southeast and East Australia (we denote this region S/E), except one, group in one branch, and all dingoes from all other parts of Australia group in the other (Fig. 3b, c). The S/E-related branch also includes two of three NGSD samples, while the third NGSD and the domestic dog from South China (A103 10002) have an intermediate position. Notably, outside the S/E region there is only limited geographical structure among the dingoes. Thus, there is a genetic subdivision of dingoes between the southeastern/eastern part of Australia and the rest of the continent.

Molecular clock analysis (based on a mutation rate of 7.7 × 10⁻⁸ per site per year)⁴⁹ suggests a most recent common ancestor (MRCA) for all dingoes and NGSDs (the division into the two main branches) ~8300 years ago, in agreement with the nuclear genome estimate for the split between dingoes and Indonesian dogs. Notably, the two main branches both have MRCAs ~4600 years ago, indicating population expansions.

Natural selection in feralization

Our analyses of population structure and demography confirms that the dingoes originated by feralization of domestic dogs around 8300 years ago and have remained virtually isolated from both the wild and the domestic ancestor until recent time. This affirms that the dingo is an excellent model for studies of the genomic effects of feralization. We used analysis of population branch statistics (PBS)⁵⁰ and iHS⁵¹ to identify positive selection in the dingoes. Firstly, the PBS was calculated by the formula (Eq. 1, see Methods for details), with non-overlapping 20 kb genomic windows. By comparing the three pairwise Fst val ${\mathrm{PBS}}1 = \frac{{{{T}}_{{\rm{ds}}} \,+ \,{{T}}_{{\rm{db}}} \, - \, {{T}}_{{\rm{bs}}}}}{2}$ ues, we can estimate the frequency change that occurred in the dingoes⁵⁰. We retrieved genomic regions with the top 5% PBS1 windows by the value of PBS1 >0.14476. Furthermore, we performed a windowed iHS test⁵², dividing the genome into the same non-overlapping 20 kb windows, and identified candidate regions for selection as those in which more than 30% of the sites had an iHS absolute value above the threshold (2.4217, top1% of iHS). In summary, the overlap of the two approaches indicated 87 candidate windows under positive selection, containing 50 genes (Supplementary Data 5) considered as candidates associated with feralization of dingoes.

We performed the GO enrichment evaluation using the parent-child model⁵³ in the topGO R package⁵⁴. To control for biasing factors, such as gene size and clustering of related gene families, we used the same approach as Pendleton et al.²². We calculated permutation-based p values (p_perm) for each GO term, and the parent-child significance scores observed for each GO term were compared with the distribution of identified gene sets by applying the parent-child test by 1000 randomly permuted genome intervals. Hereby, we identified 67 GO terms that were significantly overrepresented (p_perm < 0.05) and represented by more than one gene (Supplementary Data 6). Notably, there were four GO terms related to metabolism: fatty acid derivative biosynthetic process (GO: 1901570, p_perm = 0.001), fatty acid derivative metabolic process (GO: 1901568, p_perm = 0.001), regulation of carbohydrate metabolic process (GO: 0006109, p_perm = 0.001), and regulation of carbohydrate catabolic process (GO: 0043470, p_perm = 0.01). These functions may be related to diet change of dingoes. The candidate genes include also genes related to, e.g., reproduction and neuronal function which may have played roles in the feralization adaptation of dingoes: Prss37 (Protease, Serine 37), shown to be required for male fertility in mice⁵⁵, ARHGEF7 (Rho Guanine Nucleotide Exchange Factor 7) which promotes the formation of neural spine and synapses in hippocampal neurons⁵⁶, and TAS2R5 (Taste 2 Receptor Member 5) which plays a role in the perception of bitterness⁵⁷. Interestingly, four of the genes have previously been indicated to be related to the domestication of dogs: SLC5A1 and ZNF516 were identified by Axelsson et al.²⁰, TAS2R5 and ZNF516 by Cagan et al.⁵⁸, and a novel gene (ENSCAFG00000023577) was found by Pendleton et al.²².

Change in the candidate regions in two processes

To compare the candidate regions in the domestication and feralization steps we also performed the PBS analysis using the formula ${\mathrm{PBS}}2 = \frac{{{{T}}_{{\rm{ds}}} \, + \, {{T}}_{{\rm{ws}}} - {{T}}_{{\rm{dw}}}}}{2}$ (Eq. 2, see Methods for details), comparing dingoes, dogs from Southern East Asia and wolves, to identify genomic regions in dingoes that were more similar to wolf than to dog. High PBS2 values (the first percentile rank was used as threshold, 0.0766) indicate large difference between dog and dingo and between dog and wolf, but low difference between wolf and dingo. This identifies regions with large difference between dingo/wolf and dog, while regions with smaller difference may be ignored. Based on this, we identified 1100 windows with high PBS2 values, and compared these windows with the 87 windows identified as candidates associated with feralization by PBS1 and iHS. This identified 17 overlapping windows, containing 13 genes, with high values for PBS1 and iHS as well as for PBS2 (Table 1, Fig. 4). This suggests that these 13 genes were under positive selection in dingoes and also more similar to gray wolves than to domestic dogs. Functional annotation showed that four of these 13 genes are associated with neurodevelopment, metabolism and reproduction (Table 1, Supplementary Table 4). Specifically, ARHGEF7 (Rho Guanine Nucleotide Exchange Factor 7) may promote the formation of neural spine and synapses in hippocampal neurons⁵⁶, SLC5A1 plays an important role in the absorption of glucose and sodium⁵⁹, TAS2R5 (Taste 2 Receptor Member 5) may play a role in the perception of bitterness⁵⁷ and Prss37 (Protease, Serine 37) is related to reproduction⁵⁵. We visualized the genotypes in 70 canine samples for these four genes. This showed that the genotypes for dingo and NGSD were almost identical for all four genes, and that dingo/NGSD had low diversity with most positions being homozygote for the non-reference variants (Supplementary Fig. 12). It also showed that dingo/NGSD were more similar to the wolves than to dogs, dingo/NGSD and wolves sharing the non-reference homozygote type in many positions. The dogs were more heterogenous, with a large proportion of sites that were heterozygous, or homozygous for the reference variants. However, there was a large difference between the dogs from Southern East Asia and Europe. For all four genes, diversity was largest for the dogs from Southern East Asia, which had all three genotype variants in most positions. In contrast, for three of the genes, SLC5A1, TAS2R5, and Prss37, the European breed dogs were homozygous for the reference variant across almost the whole region. This suggests that selection has occurred in European breeds but not in the dogs from Southern East Asia, which would imply that these genes were not under selection during the domestication of the dog but during the later development of the European breeds. It also suggests that selection in these genes had not occurred for the ancestors of dingoes but occurred during the feralization of dingoes. Given the strong bottleneck in dingoes, we also performed simulations based on the inferred demographic history as null expectation for selection, with consistent results (Supplementary Figs. 13–15).

Table 1 The 13 genes under selection in both domestication and feralization.

Full size table

**Fig. 4: The result of iHS and PBS2 for three candidate regions.**

Functional assay revealed that a mutation in dingo gives decreased enhancer activity for ARHGEF7

ARHGEF7 is related to neural function⁵⁶ and may therefore be involved in behavior changes in the development from dog to dingo. We found an A-to-G mutation (chr 22: 59234593) within the ARHGEF7 gene, which had a very high allele frequency in dingoes (100%) and wolves (93.3%) compared to indigenous dogs from southern East Asia (32.5%). Furthermore, detailed bioinformatics analysis showed that the A-to-G mutation may influence the expression of ARHGEF7 since it is located in a transcription factor-binding site⁶⁰. To test whether the SNP variants can actually affect expression of ARHGEF7, we performed dual-luciferase reporter experiments (enhancer assay using pGL3-promoter vectors) on two human cell lines (Daoy, human medullablastoma and HEK293, human embryonic kidney) and one canine cell line (MDCK, Madin-Darby Canine kidney). The analyses showed that all three cell lines displayed significantly lower enhancer activities for SNP-G than for SNP-A (Fig. 5), suggesting that SNP-G may confer decreased endogenous ARHGEF7 production.

**Fig. 5: Functional assay of ARHGEF7.**

Discussion

In this study, we have investigated the process of feralization on the genomic level, using the dingo as a model. The analyses of population structure and demography reinforces that the dingo is an excellent model for this, because its feralization started 8000 years ago and because it has then remained isolated from its domestic and wild ancestors until the last 200 years. This makes the dingo a unique tool for identifying genomic regions under positive selection in the feralization process without confusing the impact of feralization with hybridization to ancestral populations.

Our study has presented important new findings about the origins and history of the dingo. In the past decades, numerous population genetic studies of the dingo have been performed based on mitochondrial and Y-chromosomal DNA^24,39,42,48, indicating an origin from East Asian domestic dogs but lacking in precision about timing, routes of arrival to Australia and demographics. Our studies of whole genomes in dingoes and related canids clarify several of these details. Our analyses of phylogeny, population structure, and demography as well as selection analysis show that the dingo is a genetically distinct population clearly differentiated from the domestic dog. The selection analyses indicate that 8000 years of feralization has affected numerous genes linked to, e.g., neurodevelopment, metabolism and reproduction.

TreeMix, phylogenetic analyses, and outgroup f3 analyses all identified indigenous dogs from southern China and Indonesian village dogs as the dog populations which are genetically most closely related to dingoes. The genomic data provides strong evidence that the dingo originates from domesticated dogs in southern East Asia, which migrated via Island Southeast Asia 9900 years ago, to eventually reach Australia 8300 years ago, and the mitochondrial data supports this picture. With this data, we can reject two previous hypotheses about the origin and migration routes of dingoes. Based on similarity in skeletal anatomy to Indian pariah dogs and wolves, and gene flow from ancient Indian populations to indigenous Australians dated at BP4200⁶¹, it has been suggested that the dingo ancestors came from India^25,40. An alternative theory has been that dingoes originate from dogs introduced with the Austronesian expansion into Island Southeast Asia, which arrived in New Guinea about 3600 years ago²⁴. However, the genomic analyses, as well as previous mtDNA data, clearly indicate an origin from dogs in southern East Asia, which arrived to Australia via mainland Southeast Asia, and our demographic analysis indicates an arrival in Australia 8300 years ago, well before the Austronesian expansion^62,63,64. Thus, the genetic data clearly suggest that the dingoes originate from domestic dogs in southern East Asia that migrated via mainland and Island Southeast Asia to reach Australia 8300 years ago, but the human population that was involved in this migration remains unknown.

The results showed that dingoes and NGSDs are genetically very closely related, indicating a common origin from dogs in Island Southeast Asia around 8300 years ago. We also note that there is a phylogeographic structure in the dingo population recorded by nuclear as well as mitochondrial data. This was possibly caused by drift in the earliest formation of the dingo population, but may also relate to an origin from more than one introduction to Australia, but if so from a very homogenous source population and at similar points of time.

Our analyses identify 50 candidate genes in genomic regions under selection in dingo, and find an overrepresentation of genes correlated in particular to digestion, metabolism and reproduction. This indicates an adaptation to a new environment, in the form of a change of diet and changed sexual and reproductive mechanisms. This agrees with the two previous studies of genomic change under feralization, on feral chicken and rice. In the feral rice, genomic regions containing numerous genes correlated with adaptation to the new environment were identified, linked to, e.g., flowering time, reproduction and stress response¹⁵. In the feral chickens, especially genes correlated with sexual selection and reproduction were identified, e.g., genes correlated with fecundity traits, which may be targets of selection that facilitated the feralization¹⁰. It is notable that genes correlated with sexual selection and reproduction were identified in feral chicken and rice as well as in the dingo, indicating that change in reproduction mechanisms is an important effect of feralization in both animals and plants.

There is considerable difference in diet between domestic dogs and the two related wild canids, wolves and dingoes. The wild canids have a diet consisting predominantly of meat, while domestic dogs normally eat considerable amounts of vegetable food, provided by humans^{21,33,44,65,66}. This diet change has been shown to be reflected by strong selection for improved digestion of starch in domestic dogs⁴⁴. This is manifested most prominently by expansion of copy numbers of the gene for pancreatic amylase (AMY2B) in most dogs, but dingoes have the non-expanded wild type found in wolves^67,68. In our selection analyses we now also found feralization genes related to digestion and absorption. This indicates that diet change has implied a major environmental influence on the dingo, resulting in genomic change.

We demonstrate that the feralization of the dingo induced positive selection on genomic regions correlated to neurodevelopment, metabolism and reproduction. We also compared the genomic regions under selection in the feralization step to those selected in the domestication step, by comparing PBS2 among dingoes, dogs from Southern East Asia and wolves, where a high PBS2 indicate genomic regions in dingoes that were more similar to wolf than to dog. This analysis identified 13 genes in regions under selection in dingoes which were also more similar to gray wolves than to domestic dogs. This may indicate that selection on the 13 genes occurred in the dog lineage after the split from the dingo ancestors. However, inspection of the genotypes for four of these genes suggests that selection did not occur during the domestication of the dog, but during the later development of the European breeds and during the feralization of dingoes.

Importantly, two of the 13 genes are related to neurodevelopment (ARHGEF7 and PANX2), and therefore possibly involved in behavior change necessary for feralization. We performed a functional analysis on one of these genes, ARHGEF7 which promotes the formation of spines and synapses in hippocampal neurons⁵⁶. This test showed that a SNP found in dingo, located in a transcription factor-binding site, gives significantly lower enhancer activities. Hippocampus plays important roles in response inhibition, memory, and spatial cognition^69,70, and some studies suggest that hippocampus relates to purposive behaviorism⁷¹. Therefore, changes in expression of this gene may be related to behavior changes in the dingo, linked to the adaptations to a wild environment.

In this study, we show that the feralization of the dingo induced positive selection on genomic regions correlated to neurodevelopment, metabolism and reproduction. We demonstrate that an SNP variant for one of these genes found in dingo gives significantly decreased enhancer activities. We also establish that the dingo originated around 8300 years ago from domestic dogs in southern East Asia. The dingo has thereafter remained isolated, and under 8300 years of adaptation to a life in the wild it has developed into a genetically distinct population clearly differentiated from its domestic ancestors.

Methods

Samples and sequences

We examined whole-genome sequences from the largest and most diverse group of dingo studied to date, amassing a dataset of 109 canines around the world. The map was drawn by the R Packages (maps: https://CRAN.R-project.org/package=maps). We sequenced genomes of 10 dingoes and 2 New Guinea Singing Dogs in the study. Total genomic DNA was extracted from blood or tissue samples using the phenol/chloroform method. For each individual, 1–3 μg of DNA was sheared into fragments of 200–800 bp with the Covaris system. DNA fragments were then processed and sequenced using the Illumina HiSeq 2000 platform. Raw sequence reads were mapped to the dog reference genome (Canfam3)⁷² using the bwa mem –M (version 0.7.10-r789)³⁵. We used PICARD (version 1.87) to remove duplicated reads and merged BAM files for multiple lanes. Sequences were then locally realigned and base-recalibrated using the Genome Analysis Tool Kit (GATK, version 2.5-2-gf57256b)³⁶. Base quality was recalibrated using GATK BQSR to produce final BAM files. Sequence data were next subjected to a strategic procedure for variant calling using the UnifiedGenotypeCaller in Genome Analysis Tool Kit (GATK). Raw variants were then recalibrated using the Variant Quality Score Recalibration (VQSR). During base and variant recalibration, a list of known SNPs downloaded from the Ensembl database (ftp://ftp.ensembl.org/pub/release-73/variation/vcf/canis_familiaris/) was used as the training set.

Genetic diversity and population structure

Genetic diversity was calculated using VCFtools⁷³. Principal component analysis was made using the smartPCA⁷⁴. After thinning to a single SNP per 50 kb window, population structure analysis was performed using the block relaxation algorithm implemented in the ADMIXTURE software³⁷ The NJ phylogenetic tree was built by MEGA7⁷⁵. The ML phylogenetic tree was built by RAxML-8.0.12⁷⁶. We also used TreeMix⁷⁷ to investigate the genetic relationship and population level admixture.

D-statistics and outgroup-f3 analysis

We performed D-statistics analysis using qpDstat in the Admixtools³⁸ software package to test events of gene flow between the dingoes and European breeds in the form of D (Dhole, European breed; Pop1, Pop2). We performed three tests, where (i) Pop1 was all dog groups tested in turn and Pop2 was each individual dingo, (ii) Pop1 was all dog groups and the dingoes tested in turn and Pop2 was the NGSDs, and (iii) Pop1 was all dingo individuals tested in turn and Pop2 was all other dingo individuals tested in turn. A significantly positive D value (>3) suggests that Pop2 shows higher affinity to European breed than do Pop1, and that there may be admixture between European breed and Pop2. If instead D is significantly negative (<−3), there may be admixture between European breed and Pop1. The qp3pop program³⁸ in the Admixtools³⁸ software package was used to perform outgroup f3-statistics analysis in the form of f3(Dingoes, Pop2; Dhole)^38,41, to assess the relative genetic similarity of the dingo population to the other dogs, where high f3 values indicate a high degree of shared genetic history between the populations³⁷.

Population history

We inferred a complete demographic model for dingo and other dogs, including population divergence times and population size using the Generalized Phylogenetic Coalescent Sampler (G-PhoCS)⁴³. We dated two important internal nodes in the history of dingo: the divergence time between indigenous dogs from southern East Asia and Indonesian village dogs (Tau1), and the divergence time between Indonesian village dogs and dingo (Tau2). The phylogeny input was (indigenous dogs from southern East Asia, (Indonesian village dogs, dingoes)). We made two rounds of G-PhoCS analysis (Supplementary Table 2, Supplementary Data 3), and in each analysis we repeated the computations three times by randomly picking 1000 neutral loci (Supplementary Table 3, Supplementary Data 4), and took the average as the result. For the first round, we randomly selected 3 samples from all dingoes and for the second round, we replaced the D05 to D08 and D06 to D01 respectively, since the random procedure did not involve D05 and D06. The complete demographic history was inferred for dingoes, including population divergence times, ancestral population size, and migration rates based on the 1000 neutral loci. The parameters were inferred in a Bayesian manner using a Markov Chain Monte Carlo (MCMC) to jointly sample model parameters and genealogies of the input loci. Burn-in and convergence of each run were determined with TRACER 1.5⁷⁸. For the control file of G-PhoCS, divergence times in units of years, effective population sizes, and migration rates were calibrated by the estimates of generation time and neutral mutation rate from previous studies. A generation time of 3 years, a neutral mutation rate of 1.3e−09 per site per year were used to convert the population sizes and scaled time into real sizes and time. The mutation rate of 1.3 × 10⁻⁹ is calibrated by ancient DNA and used in many previous studies^22,40,41, and the result when we use this as the mutation rate is in agreement with the mitochondrial genome estimate.

Mitochondrial genome analysis

The NJ phylogenetic tree was built using MEGA⁷⁵. Bayesian analysis was made using Beast⁷⁹, assuming a mutation rate of 7.7 × 10⁻⁸ per site per year with SD 5.48 × 10 according to Thalmann et al.⁴⁹ Burn-in and convergence of each run were determined with TRACER 1.5⁷⁸.

Selection analysis

We performed the PBS statistics using the following formula:

$${\mathrm{PBS}}1 = \frac{{{{T}}_{{\rm{ds}}} + {{T}}_{{\rm{db}}} - {{T}}_{{\rm{bs}}}}}{2}$$

(1)

and

$${\mathrm{PBS}}2 = \frac{{{{T}}_{{\mathrm{ds}}} + {{T}}_{{\mathrm{ws}}} - {{T}}_{{\mathrm{dw}}}}}{2}$$

(2)

where T is computed by

$${{T}} = - {\mathrm{log}}(1-{\mathrm{Fst}})$$

(3)

and T_ds computed from Fst between dingoes and dogs from SE Asia/South China, T_db computed from Fst between dingoes and European breeds, T_bs computed from Fst between European breeds and dogs from SE Asia/South China, T_dw computed from Fst between Gray wolves and dingoes, and T_ws computed from Fst between Gray wolves and dogs from SE Asia/South China.

We phased our data using the software SHAPEIT⁸⁰ based on the genetic recombination map from Auton et al.³³. We calculated iHS in dingo using the software of selscan⁸¹, and normalized the scores by norm(in the software of selscan) with a 20 kb sliding window across the autosomes. We identified windows as candidate regions for selection if 30% of sites within them had an iHS absolute value above the threshold (top1% of iHS). Simulations were performed using the ms program⁸². We simulated five groups of genome sequences (wolves, indigenous dogs from southern China, Indonesian indigenous dogs, and dingoes for PBS2 and iHS; indigenous dogs from southern China, European breeds, Indonesian indigenous dogs, and dingoes for PBS1) under a neutral evolutionary model considering the inferred demographic history. A mutation rate of 2.2 × 10⁻⁹ per site per year with a generation time of 3 years was assumed. Since our demographic analysis did not include wolves and European breeds, the effective population sizes and split time for wolves and European breeds were taken from Wang et al.³ and Liu et al.⁸³.

Scripts used for simulation were as follows:

For PBS1:

ms 90 110000 -t 52.5 -r 80 20000 -I 4 42 20 6 22 -n 1 0.17917 -n 2 0.069 -n 3 0.19305 -n 4 0.01256 -ej 0.00409 4 3 -en 0.00409 3 0.09217 -ej 0.00488 3 1 -ej 0.01383 2 1 -em 0.00409 3 1 5.236

For PBS2:

ms 112 110000 -t 52.5 -r 80 20000 -I 4 42 42 6 22 -n 1 0.87120 -n 2 0.17917 -n 3 0.19305 -n 4 0.01256 -ej 0.00409 4 3 -en 0.00409 3 0.09217 -ej 0.00488 3 2 -en 0.00488 2 0.56680 -ej 0.0275 2 1 -em 0.00409 3 2 5.236

For iHS:

ms 112 11000 -t 52.5 -r 80 20000 -I 4 42 42 6 22 -n 1 0.87120 -n 2 0.17917 -n 3 0.19305 -n 4 0.01256 -ej 0.00409 4 3 -en 0.00409 3 0.09217 -ej 0.00488 3 2 -en 0.00488 2 0.56680 -ej 0.0275 2 1 -em 0.00409 3 2 5.236

Gene enrichment analysis

We randomly select 87 windows with 20Kb in the whole genome defined as the permuted gene sets. GO analyses were performed on both observed and permuted gene sets using the parent-child model⁵³ in the topGO R package⁵⁴. Permutation-based p values (pperm)²² were produced for all GO terms by comparing the observed parent-child test score with the results of the 1000 permutations using the formula pperm = (Xperm + 1)/(N + 1), where Xperm is the number of instances where a permutation obtained a parent-child p value less than or equal to the observed p value, and N is 1000. GO terms with pperm values less than 0.05. And we download go annotation sets from NCBI (https://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2go.gz).

Functional test using dual-luciferase reporter assay

To construct ARHGEF7 enhancer SNP reporters, we inserted five repeats of the 50 bp fragments arounding the indicated SNP site into the pGL3-Promoter vector (Promega) within the MluI and XhoI sites. We verified all recombinant clones by sequencing. Daoy (human medullablastoma), HEK293 (human embryonic kidney) and MDCK (Madin-Darby Canine Kidney) cells were cultured in high-glucose Dulbecco’s modified Eagle’s mediun (DMEM) (Corning) with 10% fetal bovine serum (FBS) (Gibco) at 37 °C in 5% CO₂ condition. For luciferase reporter assays, Daoy, HEK293 and MDCK cells were transfected with the indicated reporter plasmids together with the same TK-Renilla internal control reporter vectors by using the lipofectamine 2000 transfection reagent (Invitrogen) and changed with the fresh medium at 6 h after transfection. According to the manufacture’s instruction, luciferase activity was measured at 36 h after transfection by using the Dual-Luciferase Reporter Assay System (Promega). All assays were performed in at least three independent experiments with a minimum of three replicates.

Ethics statement

We have complied with all relevant ethical regulations for animal testing and research. Australian Government Export permit number N39585 and University of New South Wales Ethic’s Approval 16/77B to Professor Bill Ballard.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw sequence data from this study have been submitted to the GSA (http://gsa.big.ac.cn/) under accession CRA000200 for raw data of genomes. This project has also been deposited at the National Center for Biotechnology Information (NCBI) Sequence Read Archive database with the accession code PRJNA593363 (SRP234866). We have used downloaded data from published articles: SRA307300³, SRP044399³², SRP035294³¹, SRP062184²¹, SRP062060³³ and SRP058219³⁰. The dog reference genome is Canfam3⁷². The source data underlying Figs. 1b, c, 4a–c, 5 and Supplementary Fig. 9 and Supplementary Table 4 are provided as a Source Data file.

References

Darwin, C. The Variation of Animals and Plants under Domestication (J. Murray, 1868).
Van Gelder, R. G. Biology of mammals. Nature 181, (1969).
Wang, G. D. et al. Out of southern East Asia: the natural history of domestic dogs across the world. Cell Res. 26, 21–33 (2016).
Article PubMed Google Scholar
Leonard, J. A. et al. Ancient DNA evidence for old world origin of new world dogs. Science 298, 1613–1616 (2002).
Article ADS CAS PubMed Google Scholar
Vilà, C. et al. Multiple and ancient origins of the domestic dog. Science 276, 1687–1689 (1997).
Article PubMed Google Scholar
Ostrander, E. A., Wayne, R. K., Freedman, A. H. & Davis, B. W. Demographic history, selection and functional diversity of the canine genome. Nature Rev. Genet. 18, 705–720 (2017).
Wang, G. D., Larson, G. & Kidd, J. M. Dog10K: the International Consortium of Canine Genome Sequencing. Natl. Sci. Rev. 6, 611–613.
Price, E. O. Behavioral aspects of animal domestication. Q. Rev. Biol. 59, 1–32 (1984).
Article Google Scholar
Robert, B. Captivity and feralization. Psychological Bull. 89, 407–421 (1981).
Article ADS Google Scholar
Johnsson, M. et al. Feralisation targets different genomic loci to domestication in the chicken. Nat. Commun. 7, 12950 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Fleming, P., Corbett, L., Harden, B. & Thomson, P. Managing the Impact of Dingoes and Other Wild Dogs (Bureau of Rural Sciences, Canberra, 2001).
Corbett, L. K. & Knight, F. The dingo: in Australia and Asia. J. Wildl. Manag. 60, 462 (1995).
Google Scholar
David, B. & Wilson, D. E. Animal: The Definitive Visual Guide to the World’s Wildlife (Smithsonian Institution, 2001)
Delouche, J. C., Burgos, N. R., Labrada, R., & Gealy, D. R. Weedy rices: origin, biology, ecology and control. 188 (Food & Agriculture Org., 2007).
Qiu, J. et al. Genomic variation associated with local adaptation of weedy rice during de-domestication. Nat. Commun. 8, 15323 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Ellstrand, N. C. et al. Crops gone wild: evolution of weeds and invasives from domesticated ancestors. Evol. Appl. 3, 494–504 (2010).
Article PubMed PubMed Central Google Scholar
Cairns, K. M., Brown, S. K., Sacks, B. N. & Ballard, J. Conservation implications for dingoes from the maternal and paternal genome: multiple populations, dog introgression, and demography. Ecol. Evol. 7, 9787–9807 (2017).
Article PubMed PubMed Central Google Scholar
Johnston, A. M., Holden, P. C. & Santos, L. R. Exploring the evolutionary origins of overimitation: a comparison across domesticated and non-domesticated canids. Dev. Sci. 20 (2017).
Smith, B. (ed). The Dingo Debate: Origins, Behaviour and Conservation (Wiley, 2015).
Axelsson, E. et al. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature 495, 360–364 (2013).
Article ADS CAS PubMed Google Scholar
Freedman, A. H. et al. Genome sequencing highlights the dynamic early history of dogs. Plos Genet. 10, e1004016 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pendleton, A. L. et al. Comparison of village dog and wolf genomes highlights the role of the neural crest in dog domestication. BMC Biol. 16, 64 (2018).
Article PubMed PubMed Central CAS Google Scholar
Macdonald, D. W. ed. “Other Dogs”. The Princeton Encyclopedia of Mammals. Princeton, p. 619 (NJ: Princeton University Press, 2006).
Savolainen, P., Leitner, T., Wilton, A. N., Matisoo-Smith, E. & Lundeberg, J. A detailed picture of the origin of the Australian dingo, obtained from the study of mitochondrial DNA. Proc. Natl Acad. Sci. USA 101, 12387–12390 (2004).
Article ADS CAS PubMed PubMed Central Google Scholar
Hickling, G. J. The dingo in Australia and Asia. N.Z. J. Ecol. 19, 227–227 (1995).
Google Scholar
Cairns, K. M., Shannon, L. M., Koler-Matznick, J., Ballard, J. W. O. & Boyko, A. R. Elucidating biogeographical patterns in Australian native canids using genome wide SNPs. Plos ONE 13, e0198754 (2018).
Article PubMed PubMed Central CAS Google Scholar
Stephens, D., Wilton, A. N., Fleming, P. J. & Berry, O. Death by sex in an Australian icon: a continent-wide survey reveals extensive hybridisation between dingoes and domestic dogs. Mol. Ecol. 24, 5643–5656 (2016).
Article CAS Google Scholar
Stephens, D. The molecular ecology of Australian wild dogs: hybridsation, gene flow and genetic structure at multiple geographic scales. PhD Thesis, The Unviersity of Wesern Australia (2011).
Parr, W. C. H. et al. Cranial shape and the modularity of hybridization in dingoes and dogs; hybridization does not spell the end for native morphology. Evolut. Biol. 43, 171–187 (2016).
Article Google Scholar
Zhang, W. et al. Hypoxia adaptations in the grey wolf (Canis lupus chanco) from Qinghai-Tibet Plateau. PloS Genet. 10, e1004466 (2014).
Article PubMed PubMed Central CAS Google Scholar
Gou, X. et al. Whole-genome sequencing of six dog breeds from continuous altitudes reveals adaptation to high-altitude hypoxia. Genome Res. 24, 1308–1315 (2014).
Article CAS PubMed PubMed Central Google Scholar
Fan, Z. et al. Worldwide patterns of genomic variation and admixture in gray wolves. Genome Res. 26, 163–173 (2016).
Article CAS PubMed PubMed Central Google Scholar
Auton, A. et al. Genetic recombination is targeted towards gene promoter regions in dogs. PLoS Genet 9, e1003984 (2013).
Article PubMed PubMed Central CAS Google Scholar
Parker, H. G. et al. Genomic analyses reveal the influence of geographic origin, migration, and hybridization on modern dog breed development. Cell Rep. 19, 697 (2017).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25,1754–1760 (2009).
Depristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed PubMed Central Google Scholar
Alexander, D. H. & Novembre, J. K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Article CAS PubMed PubMed Central Google Scholar
Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065–1093 (2012).
Article PubMed PubMed Central Google Scholar
Sacks, B. N. et al. Y chromosome analysis of dingoes and southeast asian village dogs suggests a neolithic continental expansion from Southeast Asia followed by multiple Austronesian dispersals. Mol. Biol. Evol. 30, 1103–1118 (2013).
Article CAS PubMed Google Scholar
Misra, V. N., Bellwood, P. S. (eds). Recent Advances in Indo-Pacific Prehistory (Oxford and IBH Publishing Co., New Delhi, 1985).
Maanasa, R. et al. Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans. Nature 505, 87 (2014).
Article CAS Google Scholar
Ardalan, A. et al. Narrow genetic basis for the Australian dingo confirmed through analysis of paternal ancestry. Genetica 140, 65–73 (2012).
Article PubMed PubMed Central Google Scholar
Gronau, I., Hubisz, M. J., Gulko, B., Danko, C. G. & Siepel, A. Bayesian inference of ancient human demography from individual genome sequences. Nat. Genet. 43, 1031–1034 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wang, G. D. et al. The genomics of selection in dogs and the parallel evolution between dogs and humans. Nat. Commun. 4, 1860 (2013).
Article ADS PubMed CAS Google Scholar
Terhorst, J., Kamm, J. A. & Song, Y. S. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genet. 49, 303–309 (2016).
Cairns, K. M. & Wilton, A. N. New insights on the history of canids in Oceania based on mitochondrial and nuclear data. Genetica 144, 553–565 (2016).
Article PubMed Google Scholar
Pang, J. F. et al. mtDNA data indicate a single origin for dogs south of Yangtze River, less than 16,300 years ago, from numerous wolves. Mol. Biol. Evol. 26, 2849–2864 (2009).
Article CAS PubMed PubMed Central Google Scholar
Oskarsson, M. C. et al. data indicate an introduction through Mainland Southeast Asia for Australian dingoes and Polynesian domestic dogs. Proc. Biol. Sci. 279, 967–974 (2012).
Article PubMed Google Scholar
Thalmann, O. et al. Complete mitochondrial genomes of ancient canids suggest a European origin of domestic dogs. Science 342, 871–874 (2013).
Article ADS CAS PubMed Google Scholar
Yi, X. et al. Sequencing of 50 human exomes reveals adaptation to high altitude. Science 329, 75–78 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Voight, B. F., Kudaravalli, S., Wen, X. & Pritchard, J. K. A map of recent positive selection in the human genome. Plos Biol. 4, e72 (2006).
Article PubMed PubMed Central Google Scholar
Sabeti, P. C. et al. Genome-wide detection and characterization of positive selection in human populations. Nature 449, 913–918 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Grossmann, S., Bauer, S., Robinson, P. N. & Vingron, M. Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis. Bioinformatics 23, 3024–3031 (2007).
Article CAS PubMed Google Scholar
Alexa, A. & Rahnenfuhrer, J. Enrichment analysis for Gene Ontology. R Package Version 2006.
Shen, C. et al. Prss37 is required for male fertility in the mouse. Biol. Reprod. 88, 123 (2013).
Article PubMed Google Scholar
Saneyoshi, T. et al. Activity-dependent synaptogenesis: regulation by a CaM-kinase kinase/CaM-kinase I/βPIX signaling complex. Neuron 57, 94–107 (2008).
Article CAS PubMed PubMed Central Google Scholar
Shang, S. et al. The repertoire of bitter taste receptor genes in canids. Amino acids 49, 1159–1167 (2017).
Article CAS PubMed Google Scholar
Cagan, A. & Blass, T. Identification of genomic variants putatively targeted by selection during dog domestication. Bmc Evolut. Biol. 16, 10 (2016).
Article CAS Google Scholar
Hamilton, K. L. & Butt, A. G. Glucose transport into everted sacs of the small intestine of mice. Adv. Physiol. Educ. 37, 415–426 (2013).
Article PubMed Google Scholar
Zheng, M. H. et al. The transcription factor RBP-J is essential for retinal cell differentiation and lamination. Mol. Brain 2, 38 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Pugach, I., Delfin, F., Gunnarsdottir, E., Kayser, M. & Stoneking, M. Genome-wide data substantiate Holocene gene flow from India to Australia. Proc. Natl Acad. Sci. USA 110, 1803–1808 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Bellwood, P. S. Prehistory of the Indo-Malaysian Archipelago (University of Hawai’i Press, 1997).
Chang, C. S. et al. A holistic picture of Austronesian migrations revealed by phylogeography of Pacific paper mulberry. Proc. Natl Acad. Sci. USA 112, 13537–13542 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Xu, S. et al. Genetic dating indicates that the Asian–Papuan admixture through Eastern Indonesia corresponds to the Austronesian expansion. Proc. Natl Acad. Sci. USA 109, 4574 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, G.-D. et al. Structural variation during dog domestication: insights from gray wolf and dhole genomes. Natl. Sci. Rev. 6, 110–122 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tang, B. et al. iDog: an integrated resource for domestic dogs and wild canids. Nucleic Acids Res. 47, D793–D800 2018.
Article PubMed Central CAS Google Scholar
Botigue, L. R. et al. Ancient European dog genomes reveal continuity since the Early Neolithic. Nat. Commun. 8, 16082 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Arendt, M., Cairns, K. M., Ballard, J. W., Savolainen, P. & Axelsson, E. Diet adaptation in dog reflects spread of prehistoric agriculture. Heredity 117, 301–306 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cohen, N. J. & Eichenbaum, H. Memory, Amnesia, and the Hippocampal System Xiii (Illinois Experts, 1993).
Matsumura, N. et al. Spatial- and task-dependent neuronal responses during real and virtual translocation in the monkey hippocampal formation. J. Neurosci. 19, 2381 (1999).
Article CAS PubMed PubMed Central Google Scholar
Eichenbaum, H. The hippocampus and declarative memory: cognitive mechanisms and neural codes. Behav. Brain Res. 127, 199–207 (2001).
Article CAS PubMed Google Scholar
Lindbladtoh, K. et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438, 803 (2005).
Article ADS CAS Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. Plos Genet. 2, e190 (2006).
Article PubMed PubMed Central CAS Google Scholar
Kumar, S., Tamura, K., Jakobsen, I. B. & Nei, M. MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol. Biol. Evol. 24, 1596 (2007).
Article PubMed CAS Google Scholar
Alexandros, S. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688 (2006).
Article CAS Google Scholar
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. Plos Genet. 8, e1002967 (2012).
Article CAS PubMed PubMed Central Google Scholar
Rambaut A, Drummond A. Tracer v1.5: MCMC trace analyses tool. Available: http://beast.community/tracer (2007). Accessed 11 Jan 2020.
Drummond, A. J. & Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 7, 214 (2007).
Article PubMed PubMed Central CAS Google Scholar
Delaneau, O., Marchini, J. & Zagury, J. F. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2011).
Article PubMed CAS Google Scholar
Szpiech, Z. A. & Hernandez, R. D. selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Mol. Biol. Evolution 31, 2824–2827 (2014).
Article CAS Google Scholar
Hudson, R. R. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics 18, 337–338 (2002).
Article CAS PubMed Google Scholar
Liu, Y. H. et al. Whole-genome sequencing of african dogs provides insights into adaptations against tropical parasites. Mol. Biol. Evol. 35, 287–298 (2018).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank J. William O. Ballard of the University of New South Wales for providing dingo samples and Janice Koler-Matznick for providing NGSD samples. This work was supported by grants from the NSFC (91531303, 31571353 and 91731304), the Breakthrough Project of Strategic Priority Program of the Chinese Academy of Sciences (CAS) (XDB13000000), Carl Tryggerʼs Stiftelse and Agria and SKK Forskningsfond. G.D.W. is supported by the Youth Innovation Promotion Association of CAS and the 13th Five-year Informatization Plan of CAS (Grant No. XXH13503-05). L.L.Z. was sponsored by the China Scholarship Council (CSC#201700260248).

Author information

These authors contributed equally: Shao-jie Zhang, Guo-Dong Wang, Pengcheng Ma

Authors and Affiliations

State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
Shao-jie Zhang, Guo-Dong Wang, Pengcheng Ma, Ting-Ting Yin, Yan-hu Liu, Newton O. Otecko, Bingyu Mao & Ya-ping Zhang
State Key Laboratory for Conservation and Utilization of Bio-resource in Yunnan, Yunnan University, Kunming, 650091, China
Shao-jie Zhang, Meng Wang, Ya-ping Ma, Lu Wang & Ya-ping Zhang
Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China
Guo-Dong Wang & Bingyu Mao
KTH Royal Institute of Technology, School of Engineering Sciences in Chemistry, Biotechnology and Health, Department of Gene Technology, Science for Life Laboratory, SE-171 65, Solna, Sweden
Liang-liang Zhang & Peter Savolainen

Authors

Shao-jie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Guo-Dong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Pengcheng Ma
View author publications
You can also search for this author in PubMed Google Scholar
Liang-liang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ting-Ting Yin
View author publications
You can also search for this author in PubMed Google Scholar
Yan-hu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Newton O. Otecko
View author publications
You can also search for this author in PubMed Google Scholar
Meng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ya-ping Ma
View author publications
You can also search for this author in PubMed Google Scholar
Lu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bingyu Mao
View author publications
You can also search for this author in PubMed Google Scholar
Peter Savolainen
View author publications
You can also search for this author in PubMed Google Scholar
Ya-ping Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.P.Z., P.S. and B.Y.M. supervised the research. S.J.Z. and G.D.W. designed the research. S.J.Z. and G.D.W. performed the research and analyzed data. P.M. performed Functional assay. L.L.Z performed mtDNA analyses. S.J.Z., T.T.Y. and L.W. carried out data submission. Y.H.L. and Y.W. helped perform the analysis with constructive discussions. M.W. and Y.P.M. performed experiments. S.J.Z. and G.D.W. wrote the manuscript. N.O. revised the manuscript. Y.P.Z., P.S. and B.Y.M. approved the final version.

Corresponding authors

Correspondence to Bingyu Mao, Peter Savolainen or Ya-ping Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, Sj., Wang, GD., Ma, P. et al. Genomic regions under selection in the feralization of the dingoes. Nat Commun 11, 671 (2020). https://doi.org/10.1038/s41467-020-14515-6

Download citation

Received: 17 November 2018
Accepted: 16 December 2019
Published: 03 February 2020
DOI: https://doi.org/10.1038/s41467-020-14515-6

This article is cited by

Reprogramming of the gut microbiota following feralization in Sus scrofa
- Simona Petrelli
- Maria Buglione
- Domenico Fulgione
Animal Microbiome (2023)
Chromosome-level genome assembly and population genomic analyses provide insights into adaptive evolution of the red turpentine beetle, Dendroctonus valens
- Zhudong Liu
- Longsheng Xing
- Jianghua Sun
BMC Biology (2022)
Whole-genome sequencing reveals the genetic mechanisms of domestication in classical inbred mice
- Ming Liu
- Caixia Yu
- Zhibin Zhang
Genome Biology (2022)
An isolated population reveals greater genetic structuring of the Australian dingo
- Danielle Stephens
- Peter J. S. Fleming
- Tim P. Mayr
Scientific Reports (2022)
Odorant-binding proteins in canine anal sac glands indicate an evolutionarily conserved role in mammalian chemical communication
- Sunita Janssenswillen
- Kim Roelants
- Franky Bossuyt
BMC Ecology and Evolution (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Sample collection and whole-genome sequencing

Population structure and phylogenetic analysis

Demographic and migration histories

Mitochondrial genome analysis

Natural selection in feralization

Change in the candidate regions in two processes

Functional assay revealed that a mutation in dingo gives decreased enhancer activity for ARHGEF7

Discussion

Methods

Samples and sequences

Genetic diversity and population structure

D-statistics and outgroup-f3 analysis

Population history

Mitochondrial genome analysis

Selection analysis

Gene enrichment analysis

Functional test using dual-luciferase reporter assay

Ethics statement

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links