Arctic introgression and chromatin regulation facilitated rapid Qinghai-Tibet Plateau colonization by an avian predator

Hu, Li; Long, Juan; Lin, Yi; Gu, Zhongru; Su, Han; Dong, Xuemin; Lin, Zhenzhen; Xiao, Qian; Batbayar, Nyambayar; Bold, Batbayar; Deutschová, Lucia; Ganusevich, Sergey; Sokolov, Vasiliy; Sokolov, Aleksandr; Patel, Hardip R.; Waters, Paul D.; Graves, Jennifer Ann Marshall; Dixon, Andrew; Pan, Shengkai; Zhan, Xiangjiang

doi:10.1038/s41467-022-34138-3

Download PDF

Article
Open access
Published: 27 October 2022

Arctic introgression and chromatin regulation facilitated rapid Qinghai-Tibet Plateau colonization by an avian predator

Li Hu^1,2,3^na1,
Juan Long^1,2,3^na1,
Yi Lin^1,2,3^na1,
Zhongru Gu^1,2,
Han Su^1,2,3,
Xuemin Dong^1,3,
Zhenzhen Lin^1,2,
Qian Xiao^1,3,4,
Nyambayar Batbayar ORCID: orcid.org/0000-0002-9138-9626⁵,
Batbayar Bold ORCID: orcid.org/0000-0002-3227-3762^1,3,5,
Lucia Deutschová⁶,
Sergey Ganusevich⁷,
Vasiliy Sokolov ORCID: orcid.org/0000-0002-0115-3151⁸,
Aleksandr Sokolov⁹,
Hardip R. Patel¹⁰,
Paul D. Waters ORCID: orcid.org/0000-0002-4689-8747¹¹,
Jennifer Ann Marshall Graves ORCID: orcid.org/0000-0001-6480-7856¹²,
Andrew Dixon^13,14,
Shengkai Pan ORCID: orcid.org/0000-0002-1333-4902^1,2 &
…
Xiangjiang Zhan ORCID: orcid.org/0000-0002-4517-1626^1,2,3,15

Nature Communications volume 13, Article number: 6413 (2022) Cite this article

7298 Accesses
8 Citations
21 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 01 December 2022

This article has been updated

Abstract

The Qinghai-Tibet Plateau (QTP), possesses a climate as cold as that of the Arctic, and also presents uniquely low oxygen concentrations and intense ultraviolet (UV) radiation. QTP animals have adapted to these extreme conditions, but whether they obtained genetic variations from the Arctic during cold adaptation, and how genomic mutations in non-coding regions regulate gene expression under hypoxia and intense UV environment, remain largely unknown. Here, we assemble a high-quality saker falcon genome and resequence populations across Eurasia. We identify female-biased hybridization with Arctic gyrfalcons in the last glacial maximum, that endowed eastern sakers with alleles conveying larger body size and changes in fat metabolism, predisposing their QTP cold adaptation. We discover that QTP hypoxia and UV adaptations mainly involve independent changes in non-coding genomic variants. Our study highlights key roles of gene flow from Arctic relatives during QTP hypothermia adaptation, and cis-regulatory elements during hypoxic response and UV protection.

Genomic basis of insularity and ecological divergence in barn owls (Tyto alba) of the Canary Islands

Article Open access 29 September 2022

Extant and extinct bilby genomes combined with Indigenous knowledge improve conservation of a unique Australian marsupial

Article Open access 01 July 2024

Historical DNA reveals climate adaptation in an endangered songbird

Article 19 June 2023

Introduction

The Qinghai-Tibet Plateau, also known as the Third Pole, has a climate as cold as the Arctic. To cope with cold extremes, animals living in these two poles have evolved similar morphologies such as large body size, and long, thick wintering fur¹. Previous fossil analyses even suggested that cold adaptive traits of some Arctic mammals (e.g. Arctic fox and woolly rhinoceros) may have occurred already in their QTP ancestors^1,2. Recent phylogenetic studies have found that QTP and Arctic animals are closely related^3,4, but to date it is unclear whether there was gene flow between the two poles that facilitated cold adaptation of QTP animals, or whether these adaptations were gained independently.

Different from Arctic relatives, QTP animals have also evolved unusual morphological and physiological traits (e.g. increased hemoglobin-oxygen affinity, protective pigmentation) to cope with stresses of low oxygen and UV radiation experienced at high altitude^5,6. Accumulating evidence has demonstrated a key role of natural selection in the evolution of these unique QTP traits. Previous studies have attributed a few coding variations to these QTP phenotypes⁷, but it was recently found that the majority of selective loci occurred in non-coding regions⁸, implying that gene regulation elements may have an important role in QTP hypoxic adaptation and response. Among them, cis-regulatory elements are a group of well recognized non-coding DNAs, and variations on them, especially promoter modifications, were found to alter gene transcription^9,10. However, promoter variations generally account only for a small proportion (about 1%)⁸ of non-coding selective loci, how the selected variants on non-coding regions regulate gene expression related to QTP adaptation or response remains largely unknown.

Saker falcons (Falco cherrug), provide an ideal model to address these issues because sakers have a broad breeding distribution across Eurasia¹¹ and a recent colonization to the QTP¹². The origin and evolution of saker populations have been explored using limited sequence information. Comparison of the 460 bp mitochondrial control region sequences suggested a very recent and central European origin of this species^13,14, but revealed no genetic partitions across Eurasian populations. Our pilot cDNA-based study covering about 4% of genomic regions indicated that sakers may have initially inhabited central Europe, dispersed to central Asia, and finally colonized to the QTP¹². However, our current understanding of saker colonization processes may be compromised by the non-neutral nature of cDNA variants¹⁵. Moreover, studies of mitochondrial DNA have produced some evidence of gene flow between sakers and gyrfalcons (Falco rusticolus), the largest falcon species, which is distributed mainly in the Arctic and subarctic tundra¹⁶. The evidence for this is debated, however, as the mixed mitochondrial ancestry^13,14 may also result from a recent divergence¹⁷.

Here, we de novo assemble a chromosome-level saker genome as reference and perform whole genome resequencing of 30 sakers across their main breeding distribution:¹¹ western Eurasia (West) comprising Moldova (MD), Slovakia (SK) and Crimea (CE), eastern Eurasia (East) including Mongolia (MN) and QTP (Fig. 1a). We have also sequenced 10 gyrfalcons sampled across northern Russia (Eurasian Arctic, Fig. 1a). With these population genomic data, we verify the occurrence of introgression from gyrfalcons to sakers during the last glacial maximum (LGM), and elucidate its indispensable role in colonization of the low temperature QTP by sakers. Combined with the comparative 3D genome and functional genomics analysis, we highlight a unique contribution of mutations on cis-regulatory elements to plateau hypoxic response (enhanced chromatin interaction) and UV protection (elevated melanin synthesis).

**Fig. 1: Stepwise colonization of saker falcons onto the Qinghai-Tibet Plateau.**

Results

Stepwise colonization of sakers onto the QTP

Although sakers have a wide distribution across Eurasia, and their prominent roles in grassland ecosystem balance and human culture have been well recognized for a long time^14,18, we still know little about their detailed evolutionary history. To clarify the history of saker colonization, we assembled a complete saker genome and resequenced the whole genomes of sakers from different geographic populations, as well as gyrfalcons from the Eurasian Arctic (Fig. 1a). We sequenced a female saker and assembled a 1.23 Gb genome by integrating PacBio, HiSeq and Bionano data (scaffold N50, 36.05 Mb), and anchored 1.20 Gb sequences to 38 super-scaffolds using Hi-C (Supplementary Figs. 1, 2; Supplementary Tables 1–5). Finally, we identified 24 autosomes (10 macro- and 14 micro- chromosomes) and ZW chromosomes by aligning the super-scaffolds against four bird genomes¹⁹ (“Methods”; Supplementary Fig. 3). We then generated an average of 26.38 Gb sequences (21×; Supplementary Data 1) for each of the 40 studied falcons using Illumina short read sequencing technology.

Both population genetic structure (K = 2; Fig. 1b; Supplementary Fig. 4) and principal component analysis (PCA; Supplementary Fig. 4) separated the studied falcon individuals into two clusters, gyrfalcons and sakers, with the latter further splitting into West and East populations (K = 3; Fig. 1b). We then used SMC++²⁰ to reconstruct the demographic histories, and inferred that sakers and gyrfalcons had begun to diverge about 300 thousand years ago (ka) due to the observation of differentiation in effective population size (Ne) (Fig. 1c), and supporting evidence comes from the fossil record²¹ and previous mitochondrial variations²².

Previous reports suggested a hybridization event between sakers and gyrfalcons after their divergence^13,14,22, but could not pinpoint how, when, and where hybridization occurred. Our analysis of saker and gyrfalcon population genomic data showed that East sakers share more than 21.3% autosomal and 25.4% Z chromosomal alleles with gyrfalcons whereas West sakers share less than 0.2% and 0.0001% (K = 2; Fig. 1b; Supplementary Fig. 5), respectively, suggesting an asymmetric gene flow from gyrfalcons to East sakers. We also confirmed this using an f₃-statistic method²³and a significantly negative f₃ (East saker; gyrfalcon, West saker) with a mean Z-score of -23.96 (Supplementary Table 6), strongly supported admixture between gyrfalcons and East sakers.

We next wanted to know when and where the hybridization had occurred. Using Ancestry_HMM²⁴ to trace the ancestry of discrete genomic segments, we estimated the introgression time to be 17.5 ka (confidence interval, 16–19 ka) (Fig. 1d; “Methods”), falling into the LGM period (16–27 ka)²⁵. Using Ecological Niche Modeling (ENM)²⁶, we reconstructed potential breeding areas that had suitable climate for sakers and gyrfalcons in Eurasia during the last interglacial (LIG) and LGM, respectively (Fig. 1e). During the LGM, we found the breeding areas of gyrfalcons had shifted to southern Siberia, which overlapped with 370,488 km² of those predicted for sakers. We therefore propose that the introgression occurred in southern Siberia during the LGM. Supporting evidence comes from the discovery in the Altai Mountains of falcon fossils dated 25-45 ka, with the mixed characteristics of both species²⁷. Our MSMC²⁸ simulation implied that gene flow between the two species ceased ca. 10 ka (Supplementary Fig. 6). This may have been due to the northwards retreat of glaciers and geographical isolation between the two species as a result of the formation of the Siberian boreal forest (ca. 12 ka)²⁹.

We then investigated whether hybridization between the two species was sexually biased. We estimated the contribution of female gyrfalcons to the gene pool in ancient East sakers by analyzing maternally inherited SNPs from falcons’ W chromosomes. We found that West sakers and gyrfalcons were clearly separated into two clades (Fig. 1f), again confirming distinct genetic differentiation between West sakers and gyrfalcons. In contrast, the East sakers occurred in both clades with 58.8% (10/17) of the examined female sakers in the gyrfalcon clade, and 41.2% (7/17) in the West saker clade (Fig. 1f). This result indicated that more female sakers in East populations possessed genetic backgrounds from gyrfalcons. Because W chromosomes do not recombine (except for pseudoautosomal regions (PARs)), this proportion (58.8%) could proxy the contribution of female gyrfalcons to the ancient gene pool of female East sakers until hybridization ceased. Assuming that the ancestral saker population had the same sex ratio (1: 1) as present³⁰, we estimated that female gyrfalcons contributed about 29.4% of the gene pool of the East saker population.

To estimate the contribution of male gyrfalcons to the East saker gene pool, we assumed the loss of ancient introgressed alleles at a constant rate and designed a method based on the observed introgression rates on autosomes and Z chromosome (“Methods”). We estimated that male gyrfalcons contributed only 13.6%, half of that from female gyrfalcons. This biased hybridization between female gyrfalcons and male sakers might reflect a size advantage for gyrfalcons in female competition for access to males and/or breeding territories³¹ (Fig. 1g).

The identification of the hybridization event between gyrfalcons and East sakers enabled us to reconstruct a map of saker colonization onto the QTP. Based on the detected population structure, population histories, and the identified introgression time with gyrfalcons, we used fastsimcoal2³² (Supplementary Table 7) to simulate population divergence and reconstructed a stepwise colonization route of QTP sakers: (1) ca. 41 ka (38-78 ka), sakers gradually dispersed from central Europe to East Asia (e.g. MN); (2) during the LGM, gyrfalcons shifted to southern Siberia from the Arctic and gene flow occurred between gyrfalcons and eastern Eurasia sakers at ca. 21 ka (16–25 ka); (3) ca. 10 ka (8–12 ka), sakers colonized the QTP, probably from the MN population (Fig. 1a; Supplementary Fig. 7). The elucidation of these stepwise processes provides an opportunity to systematically study unique genetic underpinnings that helped sakers colonize the QTP.

Hybridization with Arctic gyrfalcons facilitated sakers’ adaptation to cold extremes

Paleoclimatic data³³ analysis demonstrated that from ca. 50 ka to the present, East Asia (e.g. MN) had a lower annual mean temperature than central Europe. The lowest temperature in the coldest month (−30 °C) occurred during the LGM (Supplementary Fig. 8), coinciding with the time when gyrfalcons hybridized with ancestral East sakers. To live in cold Arctic conditions, gyrfalcons evolved adaptive traits such as the largest body size in extant Falconidae¹⁶. In line with recent work suggesting that genetic introgression has played an important role during the colonization of recipient species to new environments³⁴, we hypothesize that gene flow from gyrfalcons to East sakers during the LGM promotes low temperature adaptation in East sakers.

To test this hypothesis, we compared the body size (wing length as the indicator³⁵) among East sakers (MN and QTP), West sakers and gyrfalcons. We found that sex-matched adult East sakers have a larger body size than West sakers (Fig. 2a; Supplementary Fig. 9). Our results, thus, concord with the Bergmann’s rule, that within a species, individuals living in a colder environment generally have a larger body size³⁶.

**Fig. 2: Larger body size in East sakers due to hybridization with Arctic gyrfalcons.**

We then wanted to know whether introgression from Arctic-adapted gyrfalcons contributed to the size differentiation between the two main saker populations. To do this, we applied an ABBA-BABA model³⁷, and found five outstanding genomic islands (> 200 Kb) on the five chromosomes (Chr 1, 3, 5, 8 and 16) exhibiting adaptive introgression signatures (top 1% D = 0.73, top 1% f_d = 0.70) (Fig. 2b; Supplementary Fig. 10) which were much longer than the expected length of fragments (26.0 Kb) from incomplete lineage sorting (ILS; “Methods”). The strongest introgression signature came from the sex comb on midleg homolog 1 (SCMH1) gene (D = 0.97, f_d = 0.89; Fig. 2b; Supplementary Figs. 10, 11) with all the variants located in non-coding regions (Supplementary Fig. 12). This gene appears to be associated with skeletal growth. SCMH1^–/– mutated mice were reported to have skeletal abnormalities³⁸, and mutations on SCMH1 were correlated with adult heights in European humans³⁹ and body sizes in horses⁴⁰.

We therefore investigated the roles of introgressed SCMH1 variants in the development of body size in sakers. We detected transposase-accessible chromatin around the SCMH1 gene using the ATAC-seq data from forelimb, keel and flight muscle of a saker embryo sample (Supplementary Table 8). We found a peak that spanned 3.3 Kb in the Intron 5 and covered 10 introgressed SNPs, which provided experimental support for its existence as a cis-regulatory element (CRE) (Fig. 2c; Supplementary Figs. 12c, 13). With the Hi-C data generated for sakers (Supplementary Table 2), we further found this element is co-located in the same topologically associating domain (TAD) with the SCMH1 promoter (Supplementary Fig. 12), a fundamental chromatin topology. Contacts between CREs and promoters are mainly constrained within TADs⁴¹, so our findings imply that the active element could regulate the expression of SCMH1.

We phased this fragment using BEAGLE⁴² (Fig. 2d) and performed a functional study of the SCMH1 cis-regulatory element by comparing activities of dominant wild and dominant introgressed haplotypes (Fig. 2e) with a luciferase reporter assay expressed in duck embryonic fibroblast cells (CCL-141, ATCC). Our experiments showed that both haplotypes had suppressing functions, but the introgressed one had a stronger effect (P = 6.3E-03; Fig. 2f; Supplementary Fig. 14). Since SCMH1 acts as an E3 ubiquitin ligase to suppress the expression of growth-promoting HOX genes in mammals⁴³, we suggest that the greater repressive effect of the cis-regulatory element on SCMH1 in East sakers may relieve the suppression of HOX expression, which could lead to larger body size in East sakers (Fig. 2g). Across the genome, we also identified six other adaptively introgressed genes previously reported to be involved in animal body size development (Supplementary Table 9; Supplementary Data 2). These genes, together with SCMH1, comprised three gene blocks (SCMH1/FOXO6, HMGA2/MSRB3/LEMD3, and FBXL15/NFKB2), and our Hi-C analysis showed that each block was located in a different TAD (Supplementary Figs. 12, 15, 16). Collectively, our results suggest that hybridization with the Arctic gyrfalcons, the largest falcon species, provides new gene variants that promote larger body size and relieve hypothermia stress in the East saker population.

Polar animals such as polar bears and penguins feature higher body mass and fat storage relative to temperate animals as classical adaptations to meet high energy demands under the extremely cold polar environments^44,45. To test whether this is the case for East sakers, we compared the body mass of East (MN and QTP) with West sakers, and found that the former was significantly heavier in both sexes (Fig. 3a; Supplementary Fig. 9), in line with observations that birds living in the cold regions are heavier⁴⁶. However, a larger body size may also cause a higher body mass. To control this effect, we checked the body mass index (BMI, body mass/wing length⁴⁷) of Mongolian sakers⁴⁸ and obtained a BMI coefficient for adult males and females, respectively (Supplementary Fig. 17a). Based on this coefficient, we used the observed wing length data of adult West and QTP sakers to predict the expected distribution of body mass. We then compared the expected body masses with those observed in the field, and found the observed ones were significantly higher than expected in QTP sakers, but lower than expected in West sakers (Supplementary Fig. 17b). Our results thus suggest a higher BMI in East sakers (QTP + MN). Since BMI is positively correlated with fat content in birds⁴⁹, a high body mass in East sakers may result from either lower physical activities, increased intestinal absorption, higher intake of food^50,51,52 or intake of their higher fat-containing mammalian prey (e.g. Brandt’s vole Lasiopodomys brandtii in MN and plateau pika Ochotona curzoniae in QTP; Supplementary Table 10).

**Fig. 3: Higher body mass in East sakers due to introgressed SR-B1^121Leu from gyrfalcons.**

Dietary animal fat (e.g. visceral fat) usually has a high cholesterol level, co-transported with triglyceride by blood lipoproteins⁵³, which could affect the blood cholesterol level⁵⁴. Usually, cholesterol from normal diet is sufficient for utilization (e.g. component of cell membrane, precursor of steroid hormones) since about 65% cholesterol is endogenously synthesized^55,56,57. In contrast, excessive exogenous cholesterol from a high fat diet will cause an elevated blood cholesterol level⁵⁴ after triglyceride release and absorption by other tissues. We therefore compared the total cholesterol in blood of East and West sakers and found a relatively higher average total cholesterol level in East sakers (Supplementary Fig. 18b). This physiological phenomenon, interestingly, is similar to that reported in polar bears (the ref. 58 and Supplementary Figs. 18a, b).

We further measured the high-density lipoprotein cholesterol (HDLC) and low-density lipoprotein cholesterol (LDLC) concentration in the studied sakers since cholesterols were dominantly bound to high-density lipoprotein and low-density lipoprotein to form HDLC and LDLC in blood. Our results showed that HDLC generally accounted for most bound cholesterols (60% on average; Supplementary Fig. 18c), consistent with observations in other bird species⁵⁹. Interestingly, we found that East sakers had a higher level of HDLC, but comparable LDLC relative to West sakers (Fig. 3b; Supplementary Fig. 18d). The elevated level of cholesterol in blood of East sakers is expected to pose stresses such as atherosclerosis⁶⁰, and it might be expected that East sakers have evolved a strategy to combat this negative stressor.

Indeed, the analysis of adaptive introgressed genes related to lipid metabolism (Supplementary Table 11) appears to reflect adaptation to a colder environment in East sakers. The top one of these enriched genes, scavenger receptor class B member 1 (SCARB1), exhibited the third strongest signal of adaptive introgression across the whole genome (D = 0.96, f_d = 0.82; Fig. 2b; Supplementary Figs. 10, 11). This gene encodes scavenger receptor class B type I (SR-B1) protein which is a surface receptor of hepatocytes and mediates selective uptake of HDLC from blood, contributing to the removal of excessive cholesterol⁶¹. Further analysis narrowed an introgressed SNP into the 121^st amino acid residue of this protein, which is leucine (Leu) in East sakers (SCARB1^362CTT) as in gyrfalcons, but proline (Pro) in West sakers (SCARB1^362CCT). Alignment of the SR-B1 orthologs among 172 avian species (Fig. 3c; Supplementary Fig. 19) showed that SR-B1^121Leu occurred only in gyrfalcons (introgressed to East sakers), suggesting this genetic innovation may have benefits for adaptation to the extreme cold of the Arctic.

To assess the effect of this substitution in East sakers, we performed crystal structure simulations⁶², which suggested that the SR-B1 protein formed a large hydrophobic tunnel transporting lipophilic molecules (e.g. HDLC) into cells, with its N- and C-terminus forming transmembrane domains anchoring on the membrane. The 121^st amino acid is located on a helix, forming a ferrule-like structure that fasten the tunnel (Supplementary Fig. 20), so the substitution of Pro (cyclic structure) with Leu (branched-chain) may loosen the ferrule-like structure and enlarge the tunnel. We therefore hypothesize that the introgressed SCARB1 mutation leads to a more efficient absorption of HDLC into the liver in East sakers (Fig. 3d).

To test this hypothesis, we compared the extracellular HDLC uptake efficiency into cells expressing, respectively, the West saker wild type (SR-B1^121Pro) and East saker substituted type (SR-B1^121Leu) in vitro (Supplementary Fig. 21; “Methods”). Our high-performance liquid chromatography (HPLC) analysis observed a significantly higher cellular cholesterol content in the HeLa cells transfected with SR-B1^121Leu plasmids (P = 0.017; Fig. 3e), suggesting a higher uptake of HDLC. Our functional results are therefore consistent with our hypothesis that the introgressed SR-B1^121Leu enhances the efficiency of blood HDLC removal (Fig. 3d), imparting a lower risk of blood vessels blockage during fat accumulation in East sakers.

Moreover, within East sakers, we found QTP sakers were significantly heavier (also higher BMI) than MN sakers (Supplementary Figs. 9, 17b), coinciding with colder annual mean temperature on the plateau (Supplementary Fig. 8). Notably, both total cholesterol and HDLC levels were also significantly higher in QTP sakers, and no differences in triglyceride (Fig. 3b; Supplementary Figs. 18a, b), suggesting an even higher pressure related to accumulated cholesterol in blood. We found that the introgressed allele T on the SCARB1³⁶² (i.e. SR-B1^121Leu) was subject to positive selection in plateau sakers (Frequency = 0.9 in QTP vs. 0.55 in MN) (hapFLK^63,64 test; P = 0.03; Supplementary Fig. 22), suggesting that HDLC uptake in QTP sakers was further enhanced by selection of this variation. We propose that this combination of genotype-phenotype has promoted the ability of sakers to cope with QTP’s cold extremes, facilitating their plateau colonization.

Local adaptation and response to a hypoxic environment

Our analyses revealed a rapid population expansion for QTP sakers, with a 1.4-fold Ne increase after their arrival on the plateau ca. 10 ka (Supplementary Fig. 7). This could be attributed to the loss of ice sheet and the expansion of main food resource (plateau pikas) of QTP sakers⁶⁵ after the LGM. However, such rapid colonization also required surmounting physiological limitations set by low oxygen, and strong UV environment, as well as low temperature, at high elevations.

We wished to understand how QTP sakers adapted to their extreme environment in such a short period of time (< 10 ka). We therefore conducted a positive selection analysis between the QTP and MN saker populations across the whole genome using XP-EHH⁶⁶ (top 1% value = 2.13; Fig. 4a; Supplementary Data 3) and F_ST (top 1% value = 0.17; Supplementary Fig. 22). We identified eight selective sweeps containing 27 genes that were significantly enriched for functions in oxygen transport (GO: 0005344, 0019825, 00015671) (Supplementary Table 12), as expected if hypoxia is the most significant stressor for QTP sakers¹².

**Fig. 4: Local adaptation and response of QTP sakers to hypoxia.**

The strongest selective sweep covers a ~500 Kb region of Chr 4 (Fig. 4a; Supplementary Data 4). It is outside of the introgressed regions (Supplementary Fig. 10b) and there are no selection signatures in the West saker population (Supplementary Fig. 23), implying that the selection event happened after sakers’ QTP colonization. Of the 293 selective SNPs identified on the sweep, 282 were located in non-coding regions and 11 were coding variations (Fig. 4a; Supplementary Fig. 24; Supplementary Table 13). We thus sought to explore how these non-coding variants could regulate gene expression in QTP sakers. Furthermore, since gene expression regulation, at the genome level, is realized through the contacts between cis-regulatory elements that are constrained by the chromatin topology⁴¹, we examined both cis-regulatory elements and chromatin topologies in the focal sweep.

On the sweep, we identified a total of 22 CREs (Fig. 4a; Supplementary Fig. 24) based on our ATAC-seq data from three blood samples of QTP saker (Supplementary Table 8; Supplementary Figs. 13, 25), among which 14 CREs had selected SNPs (N = 26). Significantly, most of the 14 CREs were enriched near the cluster of three hemoglobin genes HBZ, HBAD, HBA1, together forming a loop that is a small domain in the same TAD⁶⁷. Furthermore, our Hi-C (Supplementary Table 2) analysis showed that all the detected CREs and promoters of other genes embedded in this sweep were co-located within a TAD (bin size = 20 Kb) without changing the TAD boundaries either in QTP or MN sakers (Fig. 4b; Supplementary Fig. 24), so this long selected region shared conserved chromatin structure in the two populations. However, when compared the contact frequency for each CRE and each gene promoter between the two populations, we found that QTP sakers always had a significantly higher contact frequency (P = 4.0E–05, Wilcox test; Figs. 4c, d; Supplementary Fig. 26). The contact frequency was found to be positively correlated with the strength of selection pressure (R² = 0.21, P = 1.0E-04; Fig. 4e). Given that the intra-TAD contact frequency correlates with chromatin accessibility⁶⁸, our result indicates that the focal genomic fragment is more accessible in QTP sakers (Fig. 4f).

A change in chromatin openness is expected to alter expression levels of these embedded genes⁶⁹. Since avian blood contains a certain proportion (~10%) of immature erythrocytes (the ref. 70 and Supplementary Fig. 27) that transcribe genes⁷¹ and the expression pattern in circulating blood is correlated with bone marrow (e.g. chicken, Supplementary Fig. 28), we compared the expression profiles of the embedded genes between plateau and lowland saker blood samples. For the 49 full-length (Iso-Seq) transcripts of 11 genes (Supplementary Table 14) on the sweep, we have identified 14 transcripts from six genes (HBA1, HBAD, NPRL3, MRPL28, LUC7L, POLR3K) that were differentially expressed between the QTP and MN populations (Supplementary Fig. 29). Of these, the most highly expressed transcript was HBA1.2, accounting for more than 92% of the total HBA1 expression, and 30% (average) of the whole transcriptome (Supplementary Fig. 30a). This gene was marginally up-regulated in the QTP compared with MN sakers (TPM (Transcripts Per Million): (3.59 $\pm$ 0.36) E+05 vs. (3.06 $\pm $ 0.26) E+05, q = 0.01; Supplementary Fig. 30b). We also measured physiological attributes of saker blood samples by principles of colorimetic and electrical impedance⁷², respectively, finding significantly higher hemoglobin concentration (194.83 $\pm $ 18.56 vs. 160.50 $\pm $ 25.17 g/L, P = 4.5E-03; Supplementary Fig. 31) and comparable hematocrit (HCT, P = 0.09, Supplementary Fig. 31) in the QTP relative to MN sakers, consistent with elevated hemoglobin concentration in many plateau birds⁷⁰. We therefore propose that the selected non-coding SNPs of QTP sakers have impacted on the regulation of hypoxia relevant genes in response to plateau hypoxic stress.

Local adaptation and response to strong UV radiation

At high elevations, UV radiation is stronger and can induce DNA damage in the skin⁷³. It has been reported that avian feathers provide the first defense line for a bird against UV, because pigments (e.g. melanin and carotenoid) in feathers are able to absorb UV⁷⁴. However, there are few investigations of whether and how feathers protect highland birds from intense UV radiation. To answer this question, we evaluated plumage differences (dorsal, wing and tail feathers) between QTP and MN saker populations using a HR2000CG-UV-NIR spectrometer (Fig. 5a; Supplementary Fig. 32). Our results showed that the lightness (L*) values were significantly lower in QTP chicks (Fig. 5a), suggesting a darker plumage.

**Fig. 5: Local adaptation and response of QTP sakers to intense UV radiation.**

To investigate the potential molecular basis of this plumage difference, we scanned positively selected genes in QTP sakers. From the total of 15 genes that have been reported to be related to avian melanin synthesis⁷⁵ (no carotenoids-pigmented feathers in hierofalcon⁷⁶), we have found only one gene under selection, agouti signaling protein (ASIP) (Fig. 4a; Supplementary Data 3), which suppresses eumelanogenesis by binding to melanocortin-1 receptor (MC1R)⁷⁷. We then detected cis-regulatory elements around the ASIP gene using ATAC-seq data (Supplementary Table 8; Supplementary Figs. 13, 25) from three QTP saker dorsal skin samples. We found that the chromatin of an intronic fragment with dense variations (ten selected SNPs in 2.1Kb) of ASIP gene was accessible in sakers (Supplementary Fig. 33c). Using H3K27ac ChIP-seq data generated from embryonic chicken leg scale skin, we found the orthologous region in chicken is located on the start of a long enhancer-like element (Supplementary Figs. 33b, c). The fragment should be a weak regulatory element as reflected by its regional specific lower intensity either in sakers or chickens (Supplementary Figs. 33b, c). It is also noted that relatively low signal intensity associated with ATAC-seq peaks in saker skin samples is probably because they were collected from dead animals in the field. This cis-regulatory element is co-located with the ASIP promoter in the same TAD (Supplementary Fig. 33), so it could affect the gene activity. The phasing of this element showed that one haplotype has high frequency (90%) in QTP sakers, in contrast with a medium frequency (40%) in MN sakers (Supplementary Fig. 34).

We compared the activity of this QTP-dominant haplotype with that of the MN-main haplotype by designing a luciferase experiment in CCL-141 cells. Our experiments showed that both haplotypes have enhancing functions (Fig. 5b; Supplementary Fig. 35), however, the QTP haplotype has a lower enhancing effect (P = 4.0E–05; Fig. 5b; Supplementary Fig. 35). Thus our results suggested that the QTP dominant haplotype of cis-regulatory element causes relatively lower expression of ASIP, leading to more eumelanin synthesis in melanocytes. This may comprise a molecular basis for the darker plumage observed in QTP sakers (Fig. 5c) that will limit UV penetration through the feathers and thus minimize UV-induced damage.

Discussion

Colonization of the QTP by humans and other animals have been explored genetically⁷⁸, but most studies of high-altitude adaptation of QTP animals were conducted simply by comparing highland populations with their lowland counterparts. This produced incomplete or even contradictory evolutionary inferences for many species including humans^79,80. In this study, we figured out a stepwise colonization of wild sakers onto the highest plateau in the world (Fig. 1a), and untangled the roles of multiple evolutionary processes during this process. We demonstrated that the rapid QTP colonization and adaptation was not realized by simple dispersal from lowland, but took place with different processes (introgression from sister species, natural selection, etc.) that played varied spatial-temporal roles in response to different environmental stressors.

The QTP and Arctic share similar extreme weather conditions, and similar morphological traits have been observed in animals from both poles¹. Combining paleo-climatological, ecological and genetic evidence, we found that a secondary contact occurred between Arctic-adapted gyrfalcons and Asian sakers in the LGM, and this hybridization allowed ancestral sakers to develop larger body size and helped overcome negative effects of higher fat storage by more effectively removing excessive cholesterols from blood. This may facilitate sakers to cope with cold stress in eastern Eurasia and also predispose their survival in even colder QTP environments.

Recently, numerous studies have suggested important roles for key genes in high-altitude adaptation or response, but few have addressed the roles of cis-regulatory elements and their interaction with target genes⁸¹. Our comparative 3D genome analyses identified a hard sweep within which expression of multiple genes was regulated through altering chromatin interactions of cis-regulatory elements (e.g. enhancers, suppressors) and gene promoters within a TAD. Notably, this may represent a general paradigm of gene regulation for non-coding variants because we also observed similar patterns in three other selective sweeps associated with hypoxia and UV responses (Supplementary Fig. 36). Natural selection, may favor a conserved mode that not only clusters genes with similar functions, but also constrains genes and regulatory elements within a higher-order genome architecture such as TAD.

Qinghai-Tibet Plateau is commonly conceived as a ‘natural laboratory’ for organism adaptation to extreme environments, but there are still few systematic investigations of the three main stresses (hypothermia, hypoxia and strong UV radiation). Our work shows that adaptation to environmental extremes on the ‘third pole’ has resulted from adaptive introgression from hypothermia-adapted Arctic relatives (bigger body size and resistance to high fat loads), as well as local adaptation or response to hypoxia mainly through changes on higher-order genome architecture and UV protection high likely through melanin production. Identification of essential genes relevant to stress resistance (e.g. SCARB1 target lipid-lowering drug) opens new avenues to investigate the genomics of adaptation, and may also have potential medical applications in future. Since saker falcons prefer cooler habitats in east Eurasia, our study will also be relevant to the conservation of QTP sakers (supporting the largest wintering population³⁰), especially in the context of ongoing global warming⁸².

Methods

Ethics oversight

All lab experiment procedures were under the guidance of the Ethics Committee of the Institute of Zoology, Chinese Academy of Sciences. The collection and processing of falcon tissues in this study were conducted in accordance with the guidelines of Institutional Animal Care and Use Committee of the Institute of Zoology, Chinese Academy of Sciences.

De novo assembly of the saker falcon genome

We assembled a chromosome-level reference genome of an adult female saker falcon (a QTP saker rescued by Xining Wildlife Park) (2n = 52) using a multi-platform sequencing strategy (PacBio, Illumina, Bionano).

First, genomic DNA was extracted from a blood sample following the protocol of Blood & Cell Culture DNA Midi Kit (QIAGEN) and quantified by Qubit 2.0 Fluorometer (Thermo Fisher Scientific). A long-read library was constructed through 26G Needle fragmentation, fragments selection (20 Kb), adaptor linkage and subjected to sequencing on a PacBio Sequel System (Pacific Biosciences)⁸³. The output raw subreads were filtered using SMRTlink (version 5.0, https://www.pacb.com/support/software-downloads/) with parameters “-minLength 50 -minReadScore 0.8”. The filtered subreads were used for the contig assembly by wtdbg2⁸⁴ (version 1.2.8) with parameters “–tidy-reads 5000 –edge-min 4 –rescue-low-cov-edges”, followed by two rounds of polishing using wtdgb-cns. The assembled contigs were corrected using all the subreads by Pbalingn with parameters “–minAccuracy 70.0 –minLength 100 –hitPolicy randombest” and the consensus sequences were obtained by variantCaller with parameters “–minConfidence 40 –minCoverage 5 –algorithm arrow”. We further re-corrected the consensus sequences with 89.58 Gb short reads generated by a HiSeq X Ten sequencer (Illumina) using BWA⁸⁵ (version 0.7.12) and pilon⁸⁶ (version 1.22) to obtain a primary genome assembly.

Second, we used an optical mapping method to order and orient the primarily assembled contigs to scaffolds and validate the assembly⁸⁷. Briefly, the isolated megabase genomic DNA was labeled following the Nick-Label-Repair-Stain protocol with Nt.BspQI enzyme to construct libraries, which were subjected to sequencing on a Bionano Saphyr System (BioNano Genomics). The raw sequenced molecules were filtered with: (1) length < 150 Kb; (2) molecule SNR (ratio of brightness of DNA intercalator to background noise for the dsDNA molecule (Signal to Noise)) < 2.75 and label SNR (corresponding ratio of label brightness) < 2.75; (3) label intensity > 0.8. The clean data were aligned with the assembled contigs using Solve (version 3.1) with parameters “pipelineCL.py -i 1 -minlen 150 -minsites 11 -MapRate 0.45”. The mapped molecules were used to correct and link these contigs into scaffolds.

Third, we performed Hi-C⁸⁸, an all-vs-all chromosome conformation capture technique, to assign the scaffolds to groups (super-scaffolds). The Hi-C libraries were constructed sequentially through formaldehyde crosslinking, MboI enzyme digestion, biotin marking, ligation and purification⁸⁹. The qualified libraries (200–600 bp insert size) were then subjected to sequencing on a HiSeq X Ten platform. The raw sequencing data were filtered by removing reads with: (1) low quality data (more than 50% bases with PHRED values < 19); (2) ‘N’ rate higher than 5%; (3) adaptor sequences. The clean data were aligned with the assembled scaffolds using HiC-Pro⁹⁰ (version 2.7.8) to retrieve the uniquely mapped paired-end reads to identify chromatin interactions. LACHESIS⁹¹, a method based on the agglomerative hierarchical clustering algorithm, was used to cluster, order and orient scaffolds to super-scaffolds according to the interaction information.

Finally, we identified the macro-chromosomes (length > 50 Mb) and micro-chromosomes (length < 50 Mb) from super-scaffolds by aligning the saker assembly against with those of Aquila chrysaetos (golden eagle)⁹², Falco peregrinus (peregrine falcon)^18,93, Gallus gallus (chicken; GRCg7b) and Deomaius novaehollandiae (emu)⁹⁴ using the method we have developed¹⁹. To further confirm W chromosomal sequences, we compared the sequencing depth of each assembled chromosome for each resequenced individual. The assembled sequences with depth less than one in male falcons but half of the mean whole genome sequencing depth in females were considered as W chromosome sequences.

Gene annotation

Gene prediction was performed using our previous pipeline¹⁸. Briefly, for the homolog predictions, we first masked the transposable elements of the assembled genome using RepeatMasker (version 4.0.8, http://www.repeatmasker.org/). Then, we mapped the protein sequences of Gallus gallus, Taeniopygia guttata, Falco peregrinus against the assembled genome using TBLASTN⁹⁵ (version 2.2.23) with an E-value threshold of 1E-05 and determined gene models using GENEWISE⁹⁶ (v2.2.0). For the transcriptome-based prediction, we aligned the blood transcriptome data with the assembled genome using Tophat⁹⁷ (version 2.1.2) and identified transcripts using Cufflinks⁹⁸^. For the de novo predictions, we trained the parameters using the well annotated genes from homolog and RNA-seq evidence. The trained parameters were used to predict candidate genes using AUGUSTUS⁹⁹ (version 2.5.5) and GENESCAN¹⁰⁰ (version 1.0). Function annotations were conducted by aligning each protein sequence to SwissProt and TrEMBL databases¹⁰¹ using BlastP. Domains of genes were searched using InterProScan¹⁰² (version 4.7).

Sampling and genomic DNA extraction for genome resequencing

A total of 30 saker samples (27 blood and three plucked chick feathers) were collected in the wild. These included two MD, three CE, five SK, 10 from MN and 10 QTP individuals. Approximately 0.2 mL fresh blood was collected from each bird and immediately put into the vacutainer containing 7.2 mg of K2 EDTA (BD). Plucked chick feathers were collected and stored in 75% ethanol. In addition, blood samples of 10 wild gyrfalcons were collected from Arctic Russia with three individuals from Kola, three from Yamal and four from Chukotka. These samples were collected during our field work from 2007 to 2017.

DNA was extracted using DNeasy Blood & Tissue kit (QIAGEN) and quantified by Qubit 2.0 Fluorometer. One library with 350 bp insert size was constructed for each sample following the manufacturer’s protocol and subjected for sequencing on a HiSeq X Ten platform. The raw sequencing data were filtered by removing (1) low quality reads (more than 50% bases with PHRED <7); (2) ‘N’ rate higher than 5%; (3) reads with adaptors.

SNP calling

The clean reads of each individual were aligned against the assembled saker genome using BWA. The reads with low mapping quality (PHRED < 20) were excluded using Samtools¹⁰³ (version 1.9) and duplicates were removed using Picard (https://www.broadinstitute.github.io/picard/) (version 1.95). The variants were identified using the Genome Analysis Toolkit¹⁰⁴ (GATK, version 3.3.0) (parameter “-stand_call_conf 30”) and filtered with parameters “MQ0 > 1 or MQ < 30 or BaseQRankSum < −8 or ReadPosRankSum < −8 or FS > 40”. Alleles that were not covered in all samples and minor allele frequency (MAF) less than 0.05 were removed.

Demographic history reconstruction and relative cross coalescent rate estimation

We used SMC++²⁰ (version 1.9.2) to reconstruct the demographic histories of falcons. The autosomal unphased bi-allelic SNPs of each falcon were used to infer the demographic history using SMC++ with 20 EM iterations and 1E-4 for the threshold of terminating the EM algorithm. 100 bootstraps were conducted.

For the estimation of relative cross coalescent rate using MSMC (version 2.1.2)²⁸, we first phased the genotypes using BEAGLE⁴² (version 4.1), which applies a Hidden Markov model (HMM) to locally cluster the haplotypes. We then randomly selected eight phased haplotypes to infer relative cross coalescent rates between each two genetically separated populations.

Population structure detection

We used the autosomal bi-allelic SNPs to detect the potential population genetic structure using the PCA method and a maximum likelihood approach Frappe¹⁰⁵ (version 1.1). To reduce the impacts of linkage disequilibrium (LD) on the analysis, only intergenic sites for which the distance of any two neighboring sites was at least 10 Kb were considered. For PCA, we converted eigenvectors from the covariance matrix (calculated from the SNP matrix) by R function EIGEN and examined significance by Tracy-Widom test implemented in EIGENSOFT¹⁰⁶ (version 3.0). For Frappe, genetic cluster K was predefined from 2 to 6 without assuming any prior information and the maximum iteration of expectation-maximization was set as 10,000.

Phylogeny of the W chromosome

The variants on the W chromosome were identified using the BWA/GATK pipeline above mentioned. Considering the W chromosome is haploid except for PARs (regions homologous to Z chromosome of saker genome), we masked the SNPs in PARs and gene regions¹⁰⁷, and retained neutral haploid SNPs that existed in each individual by filtering the loci potentially affected by purifying selection (MAF < 0.05)^108,109. A phylogenetic tree was reconstructed using a neighbor-joining method with p-distance by fneighbor implemented in EMBOSS package¹¹⁰ with a female peregrine as the outgroup¹¹¹ and the tree was plotted by Figtree (version 1.4.3, http://www.tree.bio.ed.ac.uk/software/figxtree/).

Contributions of female and male gyrfalcons to the gene pool of ancient East saker population

Since the W chromosome lacks recombination (except for PARs), which would not cause the loss of ancient introgressed alleles, the contribution of female gyrfalcons to the ancient gene pool of female East sakers could be estimated by the observed proportion of female East sakers that were clustered with female gyrfalcons in the W chromosomal phylogeny. We assumed that the effects of genetic drift were negligible since the effective population size of East sakers was large and increasing after the LGM (Fig. 1c). In addition, we removed the variants with MAF < 0.05 to exclude the potential effects of purifying selection, the main selection model in avian W chromosome^108,109. After assuming that the adults in ancient East saker populations had the same sex ratio (1: 1) as present³⁰, the contribution of female gyrfalcons to the gene pool of ancient East saker populations was half of the proportion.

Also, we assumed recombination will lead to loss of ancient neutral introgressed alleles at a constant rate r per generation, the proportion of the ancient East saker gene pool contributed by male gyrfalcons could be estimated following the two formulas:

$$\frac{1}{2}(x+y){(1-r)}^{N}=A$$

(1)

$$\frac{1}{3}y+\frac{2}{3}x{(1-r)}^{N}=Z$$

(2)

where x and y represent the contributions of male and female gyrfalcons to the gene pool in the ancient East saker population at the time when hybridization ceased, r the rate at which ancient introgressed alleles are lost per generation, N the number of generation estimated by (the time when hybridization ceased to now) / (generation time of the saker falcon), A and Z the proportions of introgressed alleles observed in autosome and Z chromosome at present. In the formula (1), the coefficient 1/2 means half of the autosomal genetic material comes from males and the other half from females. In the formula (2), the coefficients 1/3 and 2/3 mean 1/3 of the genetic material on Z chromosome comes from females and 2/3 from males.

Admixture estimation by f ₃-statistic

The f₃-statistic implemented in Admixtools (version 5.1)²³ emerges from a test of three populations (A; B, C) that explicitly asks whether A, is the result of admixture between B and C. It measures the covariance of the differences in allele frequencies of A–C and B–C population pairs across all genomic loci. If f₃(A; B, C) is significantly negative, then B and C contribute to the admixed population A. Here, we used West saker to proxy the ancestral East saker to test f₃ (East saker; gyrfalcon, ancestral East saker).

Admixture time estimation

To estimate the admixture time, we used a local ancestry inference method Ancestry_HMM²⁴ (version 0.94) to trace the ancestry of discrete genomic segments. We fitted a single pulse admixture model to genome-wide variation data^112,113 and gave the ancestry types in the introgressed populations (East saker) with the one (saker type) with the proportion of 0.7 and the other (gyrfalcon type) with the proportion of 0.3. We quantified uncertainties by 500 bootstraps.

Simulation of potential breeding areas for gyrfalcons and sakers

To predict potential breeding areas for gyrfalcons and sakers, we performed an Ecological Niche Modeling (ENM) analysis using MaxEnt²⁶ (version 3.3.3k) in the R dismo package. We downloaded the occurrence data of sakers and gyrfalcons from GBIF (https://www.gbif.org/). The breeding records were limited to those from June to August to avoid bias due to the presence of potential migrants. For the GBIF data, we firstly removed the occurrence points located in ocean or having low accuracy. Then, we removed the points if there is only one individual recorded. Finally, we used a spatial filter distance of 40 km between the points to minimize the effects of over-sampling. Due to the limited data for gyrfalcons in GBIF, we additionally used 79 randomly selected occurrence sites in breeding areas across Eurasian Arctic reported in two previous studies^17,114.

For climate variables, we downloaded the bioclimate variables (0.5° resolution) from a previous study¹¹⁵ and cropped the spatial extent of the ENMs to include all known occurrence sites, covering an area ranging from 10° S to 90° N and 20° W to 180° E. To train the model, we selected 80% of occurrence sites to fit the MaxEnt species distribution model, and kept the remaining 20% sites for model testing. We used all layers to predict the current distribution and then selected the variables which contributed greater than 10% to the predicted distribution. Finally, we projected the ENMs built under current climate to paleoclimates, including the LIG and LGM. To account for the uncertainty of paleoclimates on single snapshot, we selected paleoclimates of multiple snapshots at the LGM (20 to 27 ka) and LIG (110 to 114 ka) to project the ENMs and then calculated the maximum suitability scores by using corresponding snapshots. The present breeding areas of saker and gyrfalcon were obtained from two previous studies^11,18. To calculate the overlapping breeding areas between sakers and gyrfalcons during the LGM, we cropped the spatial extent to an area ranging from 46° N to 60° N and 68° E to 98° E, and calculated the overlapped area in QGIS (http://www.qgis.osgeo.org/).

Simulation of colonization scenarios

We used the software fastsimcoal2³² (version 2.6) to simulate colonization scenarios of saker populations, taking account of the inferred hybridization event. We used easySFS (https://www.github.com/isaacovercast/easySFS) package to convert the unlinked autosomal intergenic SNPs (one SNP per 10 Kb) to the site frequency spectrum and fed it into fastsimcoal2. The parameters (Supplementary Table 7) were set based on the detected population structure (K = 4; Fig. 1b; Supplementary Fig. 4), population histories (Fig. 1c; Supplementary Fig. 6) and the identified introgression time with gyrfalcons. The software was run with 1,000,000 coalescent situations and 40 ECM cycles. The model with the largest maximum likelihood was selected.

Biometric data of falcons

We measured the wing lengths of 19 adult QTP saker specimens (11 females and eight males): 11 from the National Zoological Museum of China, three from the Northwest Institute of Plateau Biology, Chinese Academy of Sciences, one from the Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, one from the Tibet Museum of Natural Science and three from our field surveys. We measured the body masses of eight adult females (one specimen and seven wild falcons) and ten males (one adult specimen and nine fledging birds) in the QTP population. The adult specimens were collected from the Qinghai-Tibet Plateau during multiple surveys from 1935 to 2007. Our field surveys on the QTP were conducted from 2018 to 2020. The wing length and body mass of Mongolian adult specimens were obtained from the field surveys in 2013-2014⁴⁸. The wing length and body mass of West sakers and gyrfalcons were derived from previous references^{116,117,118,119}. The correlation between biometrics and temperature was modeled using a linear regression model. The correlation between body mass and wing length in MN sakers was also obtained using a linear regression model, and the expected and observed body masses were compared using t tests.

Estimation of adaptively introgressed signals and discrimination from ILS

We used the ABBA-BABA model³⁷ to test the Patterson’s D statistic and f_d values in 100 Kb sliding window size with 50 Kb step size along the autosomes. The (((P1, P2), P3), O) topology was set as (((West saker, East saker), gyrfalcon), peregrine). We considered fragments in the top 1% D and f_d values as candidate regions adaptively introgressed from gyrfalcons to East sakers. Introgressed genomic islands were determined as a cluster of at least three consecutive adaptively introgressed regions. To confirm the detected introgression, we also calculated the fixation index (Weir and Cockerham’s F_ST) using VCFtools (version 0.1.13, https://www.vcftools.github.io/), and the genetic divergence d_XY¹²⁰ between gyrfalcons and East sakers, gyrfalcons and West sakers, West and East sakers, respectively, and nucleotide diversity θ_π for each of the three clusters with a sliding window size of 20 Kb by using VCFtools. We considered true signals as those regions with significantly different F_ST/d_XY in the gyrfalcon/West saker and West/East saker comparisons but not in the gyrfalcon/East saker comparison.

We used a strategy described below to determine whether the signatures identified above resulted from introgression or ILS. Because ILS blocks were randomly distributed in the genome and fragmented following recombination, their lengths should be shorter than those fragments caused by introgression¹²¹. Given an observed length of a fragment, we calculated its probability as an ILS block using the formula P = exp (-k/L)¹²², where L is the expected length of a shared sequence between East sakers and gyrfalcons, which equals [(1 - m) r (t - 1)]⁻¹ (t is the admixture time in generations, m is the admixture fraction and r is the recombination rate per base pair per generation). In our case, we set the parameters according to the fastsimcoal2 result: t = 3,181 generations (21 ka), m = 0.213 (the minimum introgression rate in Frappe result), r = 2E-08 (assuming that each of the 26 chromosomes experiences on average one crossover per generation¹²³). When the P was larger than 0.05, the fragments shorter than 26.0 Kb were considered as those that may be influenced by ILS. Accordingly, fragments longer than 26.0 Kb were considered from introgression.

Luciferase reporter assay

We used the luciferase reporter assay to validate the activities of the target REs of SCMH1 and ASIP genes. The primers were designed using PRIMER3¹²⁴. For the SCMH1 gene, the primers were: forward, 5ʹ- CGACGCGTGGTGATGATGGTTCATGGGTG −3ʹ and reverse, 5ʹ- GAAGATCTGCTAAACGTGCACCTTCCTTT −3ʹ. For the ASIP gene, the primers were: forward, 5ʹ- CGACGCGTACAGAGGTAAGTGCACCAG −3ʹ and reverse, 5ʹ- GAAGATCTTTATTTTCTTCCTTTTCAACCC −3ʹ. The primers were used to amplify the target sequences from genomic DNA extracts (dominant introgressed and dominant wild haplotypes of SCMH1 gene were amplified from MD1 and QTP7, respectively; main MN- and dominant QTP- haplotypes of ASIP gene were amplified from MN6 and QTP7, respectively). The amplified DNA was cloned into the pGL3-Promoter vector (Promega) digested with MluI and BglII. After confirmed by Sanger sequencing, the successfully constructed plasmids were isolated using Endo-free Plasmids Maxi Kit (OMEGA). The pGL3-Basic and pGL3-Promoter plasmids were used as controls. When the CCL-141 cells grew up to 70% confluent in the 24-well plate (Falcon), the constructed and control plasmids were respectively co-transfected into the cells together with pRL-TK (Promega) using Lipofectamine 2000 (Invitrogen). Cell lysis was collected using the Dual-Glo Luciferase Assay Kit (Promega) for the following assessments after 24 hours. The normalized luciferase activities were measured using the Dual-Luciferase Report Assay System (Promega) and GloMax® Explorer Multimode Microplate Reader (Promega) according to the manufacturer’s instructions. Experiments were performed in hexaplicates and independently repeated three times.

Measurements of triglyceride and cholesterol content in plasma

We took blood samples from 17 saker chicks at 4–6 weeks old (including five SK, six MN and six QTP sakers) for the measurements of lipid components including total triglyceride, total cholesterol, HDLC and LDLC. Blood was centrifuged at 500 × g for 10 min at 4 °C. The contents of each lipid components were measured using assay kits¹²⁵ (BioSino Bio-technology and Science Inc.).

In brief, for the triglyceride assay (GPO-PAP method), Reagent1 and Reagent2 were mixed and reacted with plasma to hydrolyze triglyceride. The generated glycerin was then reacted with enzymes to produce quinone imine followed by measurements of the absorbance values at 505 nm using a SpectraMax i3 microplate reader (Molecular Devices). Triglyceride standard and water were used as control and blank respectively. The triglyceride content was determined by the differences in absorbance as:

$$\frac{{Sample}-{blank}}{{Control}-{blank}}\,\times \,{TG}\,{standard}\,{concentration}$$

(3)

For the total cholesterol assay (CHOD-PAP method), Reagent1’ and Reagent2’ were mixed and reacted with plasma to degrade cholesterol to quinone imine followed by measurements of the absorbance values at 505 nm. Cholesterol standard and water were used as control and blank respectively. The cholesterol content was determined using a formula below:

$$\frac{{Sample}-{blank}}{{Control}-{blank}}\,\times \,{TC}\,{standard}\,{concentration}$$

(4)

For the HDLC assay, Reagent1* (R1) was reacted with plasma to remove chylomicron, low-density lipoprotein cholesterol and very-low-density lipoprotein cholesterol, followed by the absorbance value measurement at 600 nm. Reagent2* (R2) was then mixed with the above residuals to release the HDLC, again followed by the absorbance value measurement at 600 nm. The HDLC concentration was determined by the differences in absorbance as:

$$\frac{\left(R2-R2\,{blank}\right)-(R1-R1\,{blank})}{\left(R2\,{control}-R2\,{blank}\right)-(R1\,{control}-R1\,{blank})}\\ \qquad \times \,{HDLC}\,{standard}\,{concentration}$$

(5)

For the LDLC assay (surfactant assay), Reagent1^# (R1) was reacted with low-density lipoprotein and protected the loaded cholesterol that was not degraded by enzymes, followed by measurements of the absorbance values at 600 nm. Reagent2^# (R2) was then mixed with the above residuals to release the LDLC, and was measured absorbance at 600 nm. The LDLC concentration was determined using a formula below:

$$\frac{\left(R2-R2\,{blank}\right)-(R1-R1\,{blank})}{\left(R2\,{control}-R2\,{blank}\right)-(R1\,{control}-R1\,{blank})}\\ \qquad \times \,{LDLC}\,{standard}\,{concentration}$$

(6)

Each sample was repeatedly measured three times for each experiment.

Protein structure prediction of SR-B1

We annotated the protein sequences of SCARB1 gene in a total of 319 avian species from previous references^126,127 and this study. A multiple sequence alignment on these SR-B1 protein sequences was conducted using MAFFT¹²⁸ (version 7.407).

We predicted the SR-B1 protein (belongs to the CD36 family) structure of falcons using SWISS-MODEL¹²⁹ with the human homologous protein LIMP-2 (code: 4F7B)¹³⁰ modeling as templates. The transmembrane domains of N- and C- terminals in the falcon’s SR-B1 proteins were predicted using TMHMM (version 2.0, http://www.cbs.dtu.dk/services/TMHMM/).

Evaluation of the cholesterol uptake efficiency of SR-B1 proteins

To evaluate the cholesterol uptake of the wild type SR-B1^121Pro and mutated type SR-B1^121Leu, we performed an over expression experiment in vitro using the method modified from Zanoni’s⁶¹. The full-length cDNA sequences of SCARB1^362CCT and SCARB1^362CTT were synthesized (without the stop codon) and cloned into pEGFP-N1 vectors (Clonetech) separately, with GFP expressed at the C-terminus as the index of protein expression. The successfully constructed plasmids were verified by Sanger sequencing. The human HeLa cells (CCL-2, ATCC) were cultured in Dulbecco’s modified Eagle’s medium (Hyclone) supplemented with 10% fetal bovine serum (FBS; Gibco; HDLC-enriched) and 1% Penicillin-Streptomycin (P/S) (Gibco) (complete medium) at 37 °C in a humidified 5% CO₂ incubator and passaged using trypsin. Cells were then plated at a density of 9.5 × 10⁵ cells/cm² in 6-well plates (each well 9.6 cm²) and prepared for transfection when they grew up to 90% confluency. The control (empty vector), SCARB1^362CCT and SCARB1^362CTT plasmids were transfected respectively using Lipofectamine 2000 (Invitrogen) following the manufacturer’s instructions. The transfected cells were incubated for 24 hours and assayed through GFP expression under an Eclipse Ti-s fluorescence optical microscope (Nikon). The successful expression of transfected SR-B1-GFP fusion proteins was further confirmed by Western blotting using GFP antibody (1: 10000, Abcam, ab183734) with β-Actin (1: 5000, Gene-Protein Link, P01L03) as the internal control. The secondary antibodies goat anti-rabbit IgG-HRP (1: 10000, Gene-Protein Link, P03S02S) and horse anti-mouse IgG-HRP (1: 3000, Cell Signaling Technology, 7076) were used for GFP and β-Actin respectively. The media was then removed, and the cells were washed three times with PBS, and harvested in 800 μL 0.9% NaCl. After lysing the cells and centrifuging (4 °C, 6210 × g, 10 min), the supernatant was obtained for measuring the protein content by a bicinchoninic acid (BCA) assay (Beyotime) and cholesterol content by an HPLC method, respectively.

Here, for the HPLC assay, we first extracted 400 μL cell suspension, added an equal volume of 15% KOH ethanol solution, vortexed for 5 min, and incubated in a 60 °C water bath for an hour. Next, we added 100 μL trichloroacetic acid and vortexed for 5 min, and added 400 μL of hexane-isopropanol mixture (3: 2, vol/vol). After centrifugation with 14,850 × g for 20 min at 4 °C, the solvent fraction was collected. The collected sample was dried under a stream of nitrogen and resuspended in 400 μL acetonitrile-isopropanol mixture (1: 1, vol/vol), centrifuged at 14,850 × g for 20 min at 4 °C. 20 µL suspension were extracted and injected into a LC-30A UHPLC system (Shimadzu) with Eclipse XDB-C18 column (4.6 mm × 250 mm, 5.0 µm) to examine the cellular cholesterol content. The samples were eluted with acetonitrile-isopropanol mixture (1: 1, vol/vol) for 10 min and the peak of cholesterol was detected at 206 nm. During the measurement, the column temperature was set at 40 °C and the flow rate was 1 ml/min.

The peak area of each sample was used to evaluate the abundance of cholesterol according to a standard calibration curve, which was obtained based on six standard concentrations (2, 5, 10, 60, 80 and 100 ng/μL) of cholesterol (Sigma-Aldrich).

Each experiment was repeated independently three times.

Detection of positively selective signatures

We used a cross-population extended haplotype homozygosity (XP-EHH) method implemented in the Selscan⁶⁶ (version 1.2.0) to identify the recent positively selected signals in a sliding window of 20 Kb between MN and QTP saker populations. The windows with top 1% XP-EHH values were considered as positively selected regions in QTP sakers. Hard sweeps were determined as a cluster of more than five consecutive windows with top 1% XP-EHH values. F_ST and hapFLK (version 1.4) methods were further used to confirm the selection signals. GO category enrichment of positively selected genes in QTP sakers was conducted using a Chi-square test and adjusted by False Discovery Rate (FDR) method (q < 0.05)¹³¹. The GO terms were excluded if the enriched gene number was less than three.

Identification of immature erythrocytes in avian circulating blood and correlation analysis of gene expression between chicken blood and bone marrow

We examined the proportion of immature erythrocytes in avian blood by the Giemsa stain method⁷⁰. About 50 µL blood was extracted from each of three saker falcons (one 6 months-old and two 4.5 years-old) and three budgerigars (Melopsittacus undulatus) (aged 1.5, 3 and 6 months-old), respectively. Five blood smears were produced from each individual, stained by Giemsa (Yeason) and scanned at × 40 magnification using Aperio VESA8 system (Leica). For each smear, more than 700 cells were randomly selected for counting and identifying the immature erythrocytes^70,132.

We downloaded the chicken transcriptomes of blood (14-days-old and 35-weeks-old) and marrow tissues (19-days-old, 4-weeks-old and 8-months-old) from NCBI and compared these data to further check whether gene expression in circulating blood could proxy that in bone marrow. The RNA-seq data used in this analysis are available in the NCBI database under accession codes PRJNA542984, PRJEB44038, PRJNA323973, PRJNA279487, PRJNA412404, respectively. The sequences were aligned with the chicken genome (galGal4) using Bowtie2¹³³ (version 2.3.4.3). The expression of each transcript was quantified as transcripts per million (TPM) using RSEM¹³⁴ (version 1.3.1). After filtering the lowly expressed transcripts (TPM <1), the expression correlation between transcriptome of blood and bone marrow samples was calculated using ggcorrplot package in R.

Detection of genome-wide chromatin accessibilities

In order to study the genome-wide chromatin accessibilities of sakers, we performed ATAC-seq for different tissues. We collected three blood samples from QTP saker chicks (5–6 weeks old). The blood was subjected to centrifugation at 4 °C, 800 × g for 10 min. After removing plasma, the remaining cells were washed by PBS. Each of 1 µL blood cells were mixed with 1 mL freeze medium (90% FBS and 10% DMSO). We dissected an embryo from an un-hatched egg (about 28 days incubated) of QTP saker and sequenced tissues of forelimb, keel and flight muscle. We got the dorsal skin from three QTP saker juveniles which naturally died in the field in 2019 and 2022.

The ATAC-seq libraries for these samples were constructed following the Omni-ATAC protocol¹³⁵. Lysate buffer was added to obtain the nuclei and the libraries were conducted by Tn5 enzyme transposition mix buffer reaction. The qualified libraries were subjected to pair-end sequencing on a Novaseq 6000 platform (Illumina). The raw sequences were filtered by Trimmomatic¹³⁶ with default parameters.

To detect the chromatin accessibility, clean reads were aligned with the saker genome using BWA. The reads with low mapping quality (PHRED < 20) were filtered and duplicates were removed. The candidate accessible open chromatin regions (peaks) were identified using MACS2-callpeak¹³⁷ with parameters “-nomodel -f BAMPE -p 0.05”. For the blood samples, the reproducible peaks were identified using an irreproducibility discovery rate (IDR ≤ 0.05) method¹³⁸ and the signals were kept when it occurred in at least two samples. For the dorsal skin samples from dead sakers, the reproducible peaks were identified using BEDTools¹³⁹ (version 2.25.0) with overlapping rate of peaks larger than 50% in at least two samples.

ATAC-seq data of chicken bone and muscle tissues were downloaded from NCBI (data are available in the NCBI database under accession code PRJNA433154) to find the cis-regulatory elements around SCMH1 gene. H3K27ac ChIP-seq data of chicken leg scale skin samples were downloaded from NCBI (data are available in the NCBI database under accession code PRJNA561632) to identify the cis-regulatory elements around ASIP gene. The downloaded data were aligned with chicken genome (Galgal4) using BWA and peaks were identified using MACS2-callpeak. The homologous cis-regulatory elements in chickens were identified by aligning the assembled saker genome sequences against the chicken genome (Galgal4) using LASTZ¹⁴⁰ (version 1.04.00).

We have normalized all of the ATAC-seq using reads per genome coverage (RPGC) calculated from bamCoverage (deepTools¹⁴¹, v3.5.0), and ChIP-seq data using read count ratio (log₂ scale) between H3K27ac and input data calculated from bamCompare (deepTools, v3.5.0) respectively to show the tracks.

Detection of chromatin architectures using the Hi-C technique

To compare the chromatin architectures of the focal sweep between MN and QTP sakers, we performed Hi-C sequencing on blood samples of QTP (N = 2) and MN chicks (N = 2) (aged 5–6 weeks old). The sequenced reads were aligned with the assembled saker genome using HiC-Pro. HiCExplorer3¹⁴² was used to generate the contact matrix, identified TADs and computed contact ratio. The loops were identified using HICCUPS¹⁴³ (v1.0.0). The TADs of SCMH1 and ASIP genes were identified following the HiC-Pro/HiCExplorer3 pipeline. The correlation between contact ratio (QTP/MN) and difference of XP-EHH values (XP-EHH value in QTP population minus that in MN population) for each bin (20 Kb size) was simulated using a linear regression model. We also compared the contact ratio of QTP/MN sakers (log₂ scale) between the TAD region and flanking 500 Kb regions, between the TAD region and the whole Chr 4, and between the TAD region and the whole genome, respectively, for testing the enhanced chromatin interactions of the 500 Kb focal sweep.

Identification of full-length transcripts and differentially expressed transcripts

We performed an Iso-Seq using a PacBio platform from a QTP saker chick (aged 5–6 weeks old) to obtain full-length transcripts in blood. The sequenced reads were aligned with the assembled saker genome assembly using Minimap2¹⁴⁴ (version 2.13). After filtering the low quality mapping (PHRED < 10) reads and removing duplicates, we identified the gene isoforms using cDNA_Cupcake (version 5.8, https://www.github.com/Magdoll/cDNA_Cupcake). To identify the differentially expressed transcripts (DETs) in the focal sweep between MN and QTP saker populations, we calculated the expression of each transcript from our published blood RNA-seq data (data are available in the CNCB database under accession code PRJCA008052¹²) (the MN population transcriptomic data were generated from samples collected in central Asia including Kazakhstan and Mongolia). The RNA sequencing reads were aligned with all of the transcripts from whole genome annotation and Iso-Seq using Bowtie2¹³³ (version 2.3.4.3). The expression of each transcript was quantified as TPM using RSEM¹³⁴ (version 1.3.1). The DETs were detected using edgeR¹⁴⁵ (version 3.32.0) based on an exact test and P-values were adjusted by the FDR method. A transcript was identified as a DET when the fold change was larger than 1 (or less than 1) and q-value was required less than 0.05.

Hemoglobin concentration measurements

Blood samples were extracted from eight MN and six QTP chicks aged 4–6 weeks old during our field surveys in Mongolia and Qinghai-Tibet Plateau in 2017 and 2022. The hemoglobin concentration was measured using an automated Auto Hematology Analyzer BC-2600Vet⁷² (Mindray). Each sample was measured for three repeats.

Plumage color measurements

We used an HR2000CG-UV-NIR spectrometer¹⁴⁶ (Ocean Optics) with an HL 10000-Mini halogen lamp (Oceanhood) and a QR400-7-VIS-NIR fiber probe (Ocean Optics) to measure the plumage coloration. The spectra acquisition software package OceanView (version 1.6.7) was applied with the parameters set as: (1) integration time 100 ms; (2) the average number of spectra 5; (3) the electric dark correction on. The color module was selected and the CIELAB color space (L*a*b*) values were used to describe the plumage color. L* represents the lightness from black (0) to white (100), a* represents color from green (−128) to red (+127), and b* represents color from blue (−128) to yellow (+127).

For the comparison between MN and QTP saker populations, we randomly scanned the coloration of dorsal, wing and tail feathers (excluding spots/bands) of 11 chicks at 5–7 weeks old from each population during our field surveys in Mongolia and Qinghai-Tibet Plateau in 2019. Each feather was measured five times. The plumage color of each individual was assessed by the values of L*, a* and b*. The averaged L*, a* and b* values for each individual were plotted by scatterplot3d package and the PCA was conducted by FactoMineR and factoextra packages in R (version 4.0.3).

Statistical analysis

All P values were calculated from Student’s t tests (two-sided) unless specified. For the t test, Cohen’s d is determined by calculating the mean difference between two groups, and dividing the result by the pooled standard deviation.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The saker genome assembly sequences have been deposited in the CNCB database under accession code GWHBOUP00000000. The PacBio, Bionano, HiSeq, and Hi-C data for genome assembly; whole genome resequencing data of 30 saker falcons and 10 gyrfalcons; and functional genomics data of ATAC-seq, Hi-C, Iso-Seq have been deposited in the CNCB database under accession code PRJCA010321. The four bird genomes for chromosomal alignments used in this study are available in the NCBI database under accession codes GCA_900496995.4, GCA_001887755.1, GCA_016699485.1, GCA_016128335.1, respectively. The RNA-seq data of saker blood samples used in this study are available in the CNCB database under accession code PRJCA008052. The RNA-seq data of chicken blood and marrow samples used in this study are available in the NCBI database under accession codes PRJNA542984, PRJEB44038, PRJNA323973, PRJNA279487, PRJNA412404, respectively. The ATAC-seq data of chicken bone and muscle samples used in this study are available in the NCBI database under accession code PRJNA433154. The H3K27ac ChIP-seq data of chicken leg scale skin samples used in this study are available in the NCBI database under accession code PRJNA561632. The chicken genome for reference used in this study are available in the NCBI database under accession code GCA_000002315.2 [https://www.ncbi.nlm.nih.gov/assembly/GCF_000002315.3]. Source data are provided with this paper.

Change history

01 December 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41467-022-35205-5

References

Deng, T. et al. Out of Tibet: Pliocene woolly rhino suggests high-plateau origin of Ice Age megaherbivores. Science 333, 1285–1288 (2011).
Article CAS PubMed Google Scholar
Wang, X., Tseng, Z. J., Li, Q., Takeuchi, G. T. & Xie, G. From ‘third pole’ to north pole: a Himalayan origin for the arctic fox. Proc. R. Soc. B 281, 20140893 (2014).
Article PubMed PubMed Central Google Scholar
Lan, T. et al. Evolutionary history of enigmatic bears in the Tibet Plateau-Himalaya region and the identity of the yeti. Proc. R. Soc. B 284, 20171804 (2017).
Article PubMed PubMed Central Google Scholar
Fuentes-González, J. A. & Muñoz-Durán, J. Phylogeny of the extant canids (Carnivora: Canidae) by means of character congruence under parsimony. Actual. Biol. 34, 85–102 (2012).
Google Scholar
Zhu, X. et al. Divergent and parallel routes of biochemical adaptation in high-altitude passerine birds from the Qinghai-Tibet Plateau. Proc. Natl. Acad. Sci. USA 115, 1865–1870 (2018).
Article CAS PubMed PubMed Central Google Scholar
Liu, R. et al. Detection of genetic diversity and selection at the coding region of the melanocortin receptor 1 (MC1R) gene in Tibetan pigs and Landrace pigs. Gene 575, 537–542 (2016).
Article CAS PubMed Google Scholar
Semenza, G. L. The genomics and genetics of oxygen homeostasis. Annu. Rev. Genomics Hum. Genet. 21, 183–204 (2020).
Article CAS PubMed Google Scholar
Wu, D. et al. Convergent genomic signatures of high-altitude adaptation among domestic mammals. Natl Sci. Rev. 7, 952–963 (2020).
Article PubMed Google Scholar
Julian, C. G. Epigenomics and human adaptation to high altitude. J. Appl. Physiol. (1985) 123, 1362–1370 (2017).
Article CAS PubMed Google Scholar
Xiong, X. et al. Yak response to high-altitude hypoxic stress by altering mRNA expression and DNA methylation of hypoxia-inducible factors. Anim. Biotechnol. 26, 222–229 (2015).
Article CAS PubMed Google Scholar
Zhan, X. et al. Exonic versus intornic SNPs: contrasting roles in revealing the population genetic differentiation of a widespread bird species. Heredity 114, 1–9 (2015).
Article CAS PubMed Google Scholar
Pan, S. et al. Population transcriptomes reveal synergistic responses of DNA polymorphism and RNA expression to extreme environments on the Qinghai-Tibetan Plateau in a predatory bird. Mol. Ecol. 26, 2993–3010 (2017).
Article CAS PubMed Google Scholar
Nittinger, F., Haring, E., Pinsker, W., Wink, M. & Gamauf, A. Out of Africa? Phylogenetic relationships between Falco biarmicus and the other hierofalcons (Aves: Falconidae). J. Zool. Syst. Evol. Res. 43, 321–331 (2005).
Article Google Scholar
Nittinger, F., Gamauf, A., Pinsker, W., Wink, M. & Haring, E. Phylogeography and population structure of the saker falcon (Falco cherrug) and the influence of hybridization: mitochondrial and microsatellite data. Mol. Ecol. 16, 1497–1517 (2007).
Article CAS PubMed Google Scholar
Gutenkunst, R., Hernandez, R. D., Williamson, S. & Bustamante, C. Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet. 5, e1000695 (2009).
Article PubMed PubMed Central Google Scholar
Cade, T. J. Biological traits of the Gyrfalcon (Falco rusticolus) in relation to climate change. In Watson, R. T. et al. (Eds.). Gyrfalcons and Ptarmigan in a Changing World. The Peregrine Fund, Idaho (2011).
Potapov, E. & Sale, R.The Gyrfalcon. T. & A. D. Poyser and New Haven, London and Yale University Press, Connecticut (2005).
Zhan, X. et al. Peregrine and saker falcon genome sequences provide insights into evolution of a predatory lifestyle. Nat. Genet. 45, 563–566 (2013).
Article CAS PubMed Google Scholar
Waters, P. D. et al. Microchromosomes are building blocks of bird, reptile, and mammal chromosomes. Proc. Natl Acad. Sci. USA 118, e2112494118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Terhorst, J., Kamm, J. A. & Song, Y. S. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genet. 49, 303–309 (2016).
Article PubMed PubMed Central Google Scholar
Mlíkovsky, J. Cenozoic Birds of the World, Part 1: Europe. Ninox Press, Prague (2002).
Wink, M., Sauer-Giirth, H., Ellis, D. & Kenward, R. Phylogenetic Relationships in the Hierofalco Complex (Saker-, Gyr-, Lanner-, Laggar Falcon). In: Chancellor R. D., Meyburg, B. U. (eds.). Raptors worldwide. World Working Group on Birds of Prey and Owls, Berlin and MME/BirdLife Hungary, Budapest (2004).
Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065–1093 (2012).
Article PubMed PubMed Central Google Scholar
Medina, P., Thornlow, B., Nielsen, R. & Corbett-Detig, R. Estimating the timing of multiple admixture pulses during local ancestry inference. Genetics 210, 1089–1107 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cohen, K. M. & Gibbard, P. L. Global chronostratigraphical correlation table for the last 2.7 million years, version 2019 QI-500. Quat. Int. 500, 20–31 (2019).
Article Google Scholar
Elith, J. et al. A statistical explanation of MaxEnt for ecologists. Divers. Distrib. 17, 43–57 (2011).
Article Google Scholar
Burchak-Abramovich, N. I. & Burchak, D. H. The birds of the Late Quaternary of the Altai Mountains. Acta Zool. Cracov. 41, 51–60 (1998).
Google Scholar
Schiffels, S. & Durbin, R. Inferring human population size and separation history from multiple genome sequences. Nat. Genet. 46, 919–925 (2014).
Article CAS PubMed PubMed Central Google Scholar
Markova, A. K. et al. Late Pleistocene distribution and diversity of mammals in northern Eurasia. Paleont. Evo. 28-29, 5–143 (1995).
Google Scholar
Dixon, A., Ma, M. & Batbayar, N. Importance of the Qinghai-Tibet Plateau for the endangered Saker Falcon Falco cherrug. Forktail 31, 37–42 (2015).
Google Scholar
McDonald, P. G., Olsen, P. D. & Cockburn, A. Selection on body size in a raptor with pronounced reversed sexual size dimorphism: are bigger females better? Behav. Ecol. 16, 48–56 (2005).
Article Google Scholar
Excoffier, L., Dupanloup, I., Huerta-Sánchez, E., Sousa, V. C. & Foll, M. Robust demographic inference from genomic and SNP data. PLoS Genet. 9, e1003905 (2013).
Article PubMed PubMed Central Google Scholar
Beyer, R. M., Krapp, M. & Manica, A. High-resolution terrestrial climate, bioclimate and vegetation for the last 120,000 years. Sci. Data 7, 236 (2020).
Article PubMed PubMed Central Google Scholar
Hedrick, P. W. Adaptive introgression in animals: examples and comparison to new mutation and standing variation as sources of adaptive variation. Mol. Ecol. 22, 4606–4618 (2013).
Article PubMed Google Scholar
Eastham, C. P., Nicholls, M. K. & Fox, N. C. Morphological variation of the saker (Falco cherrug) and the implications for conservation. Biodivers. Conserv. 11, 305–325 (2002).
Article Google Scholar
Meiri, S. & Dayan, T. On the validity of Bergmann’s rule. J. Biogeogr. 30, 331–351 (2003).
Article Google Scholar
Martin, S. H., Davey, J. W. & Jiggins, C. D. Evaluating the use of ABBA-BABA statistics to locate introgressed loci. Mol. Biol. Evol. 32, 244–257 (2015).
Article CAS PubMed Google Scholar
Takada, Y. et al. Mammalian polycomb Scmh1 mediates exclusion of polycomb complexes from the XY body in the pachytene spermatocytes. Development 134, 579–590 (2007).
Article CAS PubMed Google Scholar
Weedon, M. N. et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat. Genet. 40, 575–583 (2008).
Article CAS PubMed PubMed Central Google Scholar
Petersen, J. L. et al. Genome-wide analysis reveals selection for important traits in domestic horse breeds. PLoS Genet 9, e1003211 (2013).
Article CAS PubMed PubMed Central Google Scholar
Szabo, Q., Bantignies, F. & Cavalli, G. Principles of genome folding into topologically associating domains. Sci. Adv. 5, eaaw1668 (2019).
Article CAS PubMed PubMed Central Google Scholar
Browning, S. R. & Browning, B. L. Rapid and accurate haplotype phasing and missing data inference for whole genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084–1097 (2007).
Article CAS PubMed PubMed Central Google Scholar
Pitulescu, M., Kessel, M. & Luo, L. The regulation of embryonic patterning and DNA replication by geminin. Cell Mol. Life Sci. 62, 1425–1433 (2005).
Article CAS PubMed Google Scholar
Liu, S. et al. Population genomics reveal recent speciation and rapid evolutionary adaptation in polar bears. Cell 157, 785–794 (2014).
Article CAS PubMed PubMed Central Google Scholar
Li, C. et al. Two Antarctic penguin genomes reveal insights into their evolutionary history and molecular changes related to the Antarctic environment. GigaScience 3, 27 (2014).
Article PubMed PubMed Central Google Scholar
Benítez-López, A. et al. The island rule explains consistent patterns of body size evolution in terrestrial vertebrates. Nat. Ecol. Evol. 5, 768–786 (2021).
Article PubMed Google Scholar
Peig, J. & Green, A. J. New perspectives for estimating body condition from mass/length data: the scaled mass index as an alternative method. Oikos 118, 1883–1891 (2009).
Article Google Scholar
Dixon, A. et al. Variation in electrocution rate and demographic composition of Saker Falcons electrocuted at power lines in Mongolia. J. Raptor Res. 54, 136–146 (2020).
Article Google Scholar
Kraft, F., Driscoll, S. C., Buchanan, K. L. & Crino, O. L. Developmental stress reduces body condition across avian life-history stages: A comparison of quantitative magnetic resonance data and condition indices. Gen. Comp. Endocrinol. 272, 33–41 (2019).
Article CAS PubMed Google Scholar
Nie, Y. et al. Exceptionally low daily energy expenditure in the bamboo-eating giant panda. Science 349, 171–174 (2015).
Article CAS PubMed Google Scholar
Bäckhed, F. et al. The gut microbiota as an environmental factor that regulates fat storage. Proc. Natl. Acad. Sci. USA 101, 15718–15723 (2004).
Article PubMed PubMed Central Google Scholar
Wang, G. et al. Transcriptomic analysis between normal and high-intake feeding geese provides insight into adipose deposition and susceptibility to fatty liver in migratory birds. BMC Genomics 20, 1–12 (2019).
Google Scholar
Kwan, B. C., Kronenberg, F., Beddhu, S. & Cheung, A. K. Lipoprotein metabolism and lipid management in chronic kidney disease. J. Am. Soc. Nephrol. 18, 1246–1261 (2007).
Article CAS PubMed Google Scholar
Schwingshackl, L. & Hoffmann, G. Comparison of effects of long-term low-fat vs high-fat diets on blood lipid levels in overweight or obese patients: a systematic review and meta-analysis. J. Acad. Nutr. Diet. 113, 1640–1661 (2013).
Article PubMed Google Scholar
Kapourchali, F. R., Surendiran, G., Goulet, A. & Moghadasian, M. H. The role of dietary cholesterol in lipoprotein metabolism and related metabolic abnormalities: a mini-review. Crit. Rev. Food Sci. Nutr. 56, 2408–2415 (2016).
Article CAS PubMed Google Scholar
Jansen, G. R., Zanetti, M. E. & Hutchison, C. F. Studies on lipogenesis in vivo. Biochem. J. 99, 333 (1966).
Article CAS PubMed PubMed Central Google Scholar
Teekell, R. A., Breidenstein, C. P. & Watts, A. B. Cholesterol metabolism in the chicken. Poult. Sci. 54, 1036–1042 (1975).
Article CAS PubMed Google Scholar
Ormbostad, I. Relationships between persistent organic pollutants (POPs) and plasma clinical-chemical parameters in polar bears (Ursus maritimus) from Svalbard, Norway. Master Thesis, Norwegian University of Science and Technology (2012).
Peebles, F. D., Cheaney, J. D., Brake, J. D., Boyle, C. R. & Latour, M. A. Effects of added dietary lard on body weight and serum glucose and low density lipoprotein cholesterol in randombred broiler chickens. Poult. Sci. 76, 29–36 (1997).
Article CAS PubMed Google Scholar
Bérard, A. et al. High plasma HDL concentrations associated with enhanced atherosclerosis in transgenic mice overexpressing lecithinchoesteryl acyltransferase. Nat. Med. 3, 744–749 (1997).
Article PubMed Google Scholar
Zanoni, P. et al. Rare variant in scavenger receptor BI raises HDL cholesterol and increases risk of coronary heart disease. Science 351, 6278 (2016).
Article Google Scholar
Toomey, M. B. et al. High-density lipoprotein receptor SCARB1 is required for carotenoid coloration in birds. Proc. Natl Acad. Sci. USA 114, 5219–5224 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bonhomme, M. et al. Detecting selection in population trees: the Lewontin and Krakauer test extended. Genetics 186, 241–262 (2010).
Article PubMed PubMed Central Google Scholar
Fariello, M. I., Boitard, S., Naya, H., SanCristobal, M. & Servin, B. Detecting signatures of selection through haplotype differentiation among hierarchically structured populations. Genetics 193, 929–941 (2013).
Article PubMed PubMed Central Google Scholar
He, Y. et al. The past population dynamics of Ochotona curzoniae and the response to the climate change. North-West. J. Zool. 14, 220–225 (2018).
Google Scholar
Szpiech, Z. A. & Hernandez, R. D. Selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Mol. Biol. Evol. 31, 2824–2827 (2014).
Article CAS PubMed PubMed Central Google Scholar
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Article CAS PubMed PubMed Central Google Scholar
Krijger, P. H. L. & Laat, W. D. Regulation of disease-associated gene expression in the 3D genome. Nat. Rev. Mol. Cell Biol. 17, 771–782 (2016).
Article CAS PubMed Google Scholar
Luo, X. et al. 3D Genome of macaque fetal brain reveals evolutionary innovations during primate corticogenesis. Cell 184, e21 (2021).
Article Google Scholar
Glomski, C. A. & Pica, A. The avian erythrocyte: its phylogenetic odyssey. CRC Press, Boca Raton (2011).
Williams, A. F. DNA synthesis in purified populations of avian erythroid cells. J. Cell. Sci. 10, 27–46 (1972).
Article CAS PubMed Google Scholar
Liu, B. et al. Grape seed procyanidin extract ameliorates lead-induced liver injury via miRNA153 and AKT/GSK-3β/Fyn-mediated Nrf2 activation. J. Nutr. Biochem. 52, 115–123 (2018).
Article CAS PubMed Google Scholar
Sinha, R. P. & Häder, D. P. UV-induced DNA damage and repair: a review. Photochem. Photobiol. Sci. 1, 225–236 (2002).
Article CAS PubMed Google Scholar
Nicolaï, M. P. J., Shawkey, M. D., Porchetta, S., Claus, R. & D'Alba, L. Exposure to UV radiance predicts repeated evolution of concealed black skin in birds. Nat. Commun. 11, 2414 (2020).
Article PubMed PubMed Central Google Scholar
Galván, I. & Solano, F. Bird integumentary melanins: biosynthesis, forms, function and evolution. Int. J. Mol. Sci. 17, 520 (2016).
Article PubMed PubMed Central Google Scholar
Thomas, D. B. et al. Ancient origins and multiple appearances of carotenoid-pigmented feathers in birds. Proc. Biol. Sci. 281, 20140806 (2014).
PubMed PubMed Central Google Scholar
Toews, D. P. L. et al. Plumage genes and little else distinguish the genomes of hybridizing warblers. Curr. Biol. 26, 2313–2318 (2016).
Article CAS PubMed Google Scholar
Witt, K. E. & Huerta-Sánchez, E. Convergent evolution in human and domesticate adaptation to high-altitude environments. Philos. Trans. R. Soc. B 374, 20180235 (2019).
Article CAS Google Scholar
Yi, X. et al. Sequencing of 50 human exomes reveals adaptation to high altitude. Science 329, 75–78 (2010).
Article CAS PubMed PubMed Central Google Scholar
Huerta-Sánchez, E. et al. Altitude adaptation in Tibetans caused by introgression of Denisovan-like DNA. Nature 512, 194–197 (2014).
Article PubMed PubMed Central Google Scholar
Xin, J. et al. Chromatin accessibility landscape and regulatory network of high-altitude hypoxia adaptation. Nat. Commun. 11, 4928 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, H. et al. The impacts of climate change and human activities on biogeochemical cycles on the Qinghai-Tibetan Plateau. Glob. Chang. Biol. 19, 2940–2955 (2013).
Article PubMed Google Scholar
Rhoads, A. & Au, K. F. PacBio sequencing and its applications. Genom. Proteom. Bioinf. 13, 278–289 (2015).
Article Google Scholar
Ruan, J. & Li, H. Fast and accurate long-read assembly with wtdbg2. Nat. Methods 17, 155–158 (2020).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9, e112963 (2014).
Article PubMed PubMed Central Google Scholar
Lam, E. T. et al. Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly. Nat. Biotechnol. 30, 771–776 (2012).
Article CAS PubMed Google Scholar
Yaffe, E. & Tanay, A. Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture. Nat. Genet. 43, 1059–1065 (2011).
Article CAS PubMed Google Scholar
van BerkumL, N. L. et al. Hi-C: a method to study the three-dimensional architecture of genomes. J. Vis. Exp. 39, e1869 (2010).
Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Article PubMed PubMed Central Google Scholar
Burton, J. N. et al. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat. Biotechnol. 31, 1119–1125 (2013).
Article CAS PubMed PubMed Central Google Scholar
Mead, D. et al. The genome sequence of the European golden eagle, Aquila chrysaetos chrysaetos Linnaeus 1758. Wellcome Open Res. 6, 112 (2021).
Article PubMed PubMed Central Google Scholar
Damas, J. et al. Upgrading short-read animal genome assemblies to chromosome level using comparative genomics and a universal probe set. Genome Res. 27, 875–884 (2017).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. et al. A new emu genome illuminates the evolution of genome configuration and nuclear architecture of avian chromosomes. Genome Res. 31, 497–511 (2021).
Article PubMed PubMed Central Google Scholar
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Article CAS PubMed PubMed Central Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and genomewise. Genome Res. 14, 988–995 (2004).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-seq. Bioinformatics 25, 1105–1111 (2009).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28, 511–515 (2010).
Article CAS PubMed PubMed Central Google Scholar
Mario, S. & Burkhard, M. AUGUSTUS: a web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Res. 33, 465–467 (2005).
Article Google Scholar
Burge, C. & Karlin, S. Prediction of complete gene structure in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997).
Article CAS PubMed Google Scholar
Bairoch, A. & Apweiler, R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28, 45–48 (2000).
Article CAS PubMed PubMed Central Google Scholar
Zdobnov, E. M. & Apweiler, R. InterProScan–an integration platform for the signature- recognition methods in InterPro. Bioinformatics 17, 847–848 (2001).
Article CAS PubMed Google Scholar
Li, H. et al. The sequence alignment/map (SAM) format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tang, H., Peng, P. & Wang, N. J. R. Estimation of individual admixture: analytical and study design considerations. Genet. Epidemiol. 28, 289–301 (2005).
Article PubMed Google Scholar
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Article PubMed PubMed Central Google Scholar
Smeds, L. et al. Evolutionary analysis of the female-specific avian W chromosome. Nat. Commun. 6, 7330 (2015).
Article CAS PubMed Google Scholar
Wright, A. E., Harrison, P. W., Montgomery, S. H., Pointer, M. A. & Mank, J. E. Independent stratum formation on the avian sex chromosomes reveals inter-chromosomal gene conversion and predominance of purifying selection on the W chromosome. Evolution 68, 3281–3295 (2014).
Article PubMed PubMed Central Google Scholar
Radke, D. W. et al. Purifying selection on noncoding deletions of human regulatory loci detected using their cellular pleiotropy. Genome Res. 31, 935–946 (2021).
Article PubMed PubMed Central Google Scholar
Rice, P., Longden, L. & Bleasby, A. EMBOSS: The European molecular biology open software suite. Trends Genet. 16, 276–277 (2000).
Article CAS PubMed Google Scholar
Gu, Z. et al. Climate-driven flyway changes and memory-based long-distance migration. Nature 591, 259–264 (2021).
Article CAS PubMed Google Scholar
Gravel, S. Population genetics models of local ancestry. Genetics 191, 607–619 (2012).
Article PubMed PubMed Central Google Scholar
Liang, M. & Nielsen, R. The lengths of admixture tracts. Genetics 197, 953–967 (2014).
Article PubMed PubMed Central Google Scholar
Johnson, J. A., Burnham, K. K., Burnham, W. A. & Mindell, D. P. Genetic structure among continental and island populations of gyrfalcons. Mol. Ecol. 16, 3145–3160 (2007).
Article CAS PubMed Google Scholar
Beyer, R. M., Krapp, M. & Manica, A. High-resolution terrestrial climate, bioclimate and vegetation for the last 120,000 years. Sci. Data 7, 1–9 (2020).
Article Google Scholar
Glutz von Blotzheim, U. N., Bauer, K. M. & Bezzel, E. Handbuch der Vfgel Mitteleuropas, Band 4. Akademische Verlagsgesellschaft, Frankfurt am Main (1971).
Gamauf, A. & Dosedel, R. Satellite telemetry of saker falcons (Falco cherrug) in Austria: juvenile dispersal at the westernmost distribution limit of the species. Aquila 119, 65–78 (2012).
Google Scholar
Kenward, E. R., Pfeffer, R. H., Al-Bowardi, M. A. & Fox, N. Settting harness sizes and other marking techniques for a falcon with strong sexual dimorphism. J. Field Ornithol. 72, 244–257 (2001).
Article Google Scholar
Dementiev, G. P., Gladkov, N. A., Ptushenko, E. S., Spangenberg, E. P. & Sudilovskaya, A. M. Birds of the Soviet Union 1. Soviet Science, Moscow (1951).
Han, F. et al. Gene flow, ancient polymorphism, and ecological adaptation shape the genomic landscape of divergence among Darwin’s finches. Genome Res. 27, 1004–1015 (2017).
Article CAS PubMed PubMed Central Google Scholar
Prüfer, K. et al. The bonobo genome compared with the chimpanzee and human genomes. Nature 486, 527–531 (2012).
Article PubMed PubMed Central Google Scholar
Racimo, F., Sankararaman, S., Nielsen, R. & Huerta-Sánchez, E. Evidence for archaic adaptive introgression in humans. Nat. Rev. Genet. 16, 359–371 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fan, R. et al. Genomic analysis of the domestication and post-Spanish conquest evolution of the llama and alpaca. Genome Biol. 21, 159 (2020).
Article CAS PubMed PubMed Central Google Scholar
Untergasser, A. et al. Primer3-new capabilities and interfaces. Nucleic Acids Res. 40, e115 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zhang, H., Tao, Y., Guo, J., Hu, Y. & Su, Z. Hypolipidemic effects of chitosan nanoparticles in hyperlipidemia rats induced by high fat diet. Int. Immunopharmacol. 11, 457–461 (2011).
Article CAS PubMed Google Scholar
Feng, S. et al. Dense sampling of bird diversity increases power of comparative genomics. Nature 587, 252–257 (2020).
Article CAS PubMed PubMed Central Google Scholar
Doyle, J. M. et al. New insights into the phylogenetics and population structure of the prairie falcon (Falco mexicanus). BMC Genomics 19, 233 (2018).
Article PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Guex, N. & Peitsch, M. C. SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis 18, 2714–2723 (1997).
Article CAS PubMed Google Scholar
Neculai, D. et al. Structure of LIMP-2 provides functional insights with implications for SR-B1 and CD36. Nature 504, 172–CD176 (2013).
Article CAS PubMed Google Scholar
Cho, Y. S. et al. The tiger genome and comparative analysis with lion and snow leopard genomes. Nat. Commun. 4, 2433 (2013).
Article PubMed Google Scholar
Gallo, S. S., Ederli, N. B., Bôa-Morte, M. O. & Oliveira, F. C. Hematological, morphological and morphometric characteristics of blood cells from rhea, Rhea Americana (Struthioniformes: Rheidae): a standard for Brazilian birds. Braz. J. Boil. 75, 953–962 (2015).
Article CAS Google Scholar
Langmead, B. & Salzberg, S. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Article CAS PubMed PubMed Central Google Scholar
Corces, M. R. et al. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods 14, 959–962 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. Model-based Analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Article PubMed PubMed Central Google Scholar
Li, Q., Brown, J. B., Huang, H. & Bickel, P. J. Measuring reproducibility of high-throughput experiments. Ann. Appl. Stat. 5, 1752–1779 (2011).
Article MathSciNet MATH Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Harris, R. S. Improved pairwise alignment of genomic DNA. PhD. Thesis, The Pennsylvania State University (2007).
Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, 160–165 (2016).
Article Google Scholar
Wolff, J. et al. Galaxy HiCExplorer 3: a web server for reproducible Hi-C, capture Hi-C and single-cell Hi-C data analysis, quality control and visualization. Nucleic Acids Res. 48, 177–184 (2020).
Article Google Scholar
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhou, X., Lindsay, H. & Robinson, M. D. Robustly detecting differential expression in RNA sequencing data using observation weights. Nucleic Acids Res. 42, e91 (2014).
Article CAS PubMed PubMed Central Google Scholar
Vaquero-Alba, I. et al. A quantitative analysis of objective feather color assessment: measurements in the laboratory do not reflect true plumage color. Auk 133, 325–337 (2016).
Article Google Scholar

Download references

Acknowledgements

This study was supported by National Natural Science Foundation of China grants 31930013 (to X.Z.), 32125005 (to X.Z.), 32270455 (to Z.G.), 32222012 and 32070416 (to S.P.), Strategic Priority Program of Chinese Academy of Sciences grant XDB31000000 (to X.Z.), Youth Innovation Promotion Association of Chinese Academy of Sciences grant 2020086 (to S.P.), Third Xinjiang Scientific Expedition and Research Program grant 2021XJKK0600 (to Z.G.), Second Tibet Plateau Scientific Expedition and Research Program (STEP) grant 2019QZKK0501 (to X.Z.), the China National Postdoctoral Program for Innovative Talents BX20220295 (to Z.G.) and the ‘Bingzhi’ Postdoctoral Program of the Institute of Zoology (to Z.G.). Partial sampling and genomic resequencing was funded by the Environment Agency-Abu Dhabi (to N.C.F.). We thank J. Sun, Z. Shao, W. Wu, F. Li, C. Xing, Y. Liu, D. Tang, X. Yue, C. Luo, Z. Yuan, J. Chavko, V. Vetrov and D. Ragyov for assistance during fieldwork and sample collection; Three-River-Source National Park Administration for help in the field; M. Bruford, X. Hou, J. Qu for help in sampling; J. Qu, X. Luo, X. Chen, M. Ma, Y. Mei, L. Yang for assistance in biometric measurements; C. Shang, F. Li, D. Tang, J. Liu, X. Zhang, X. Shi for their help on figure drawing; and G. Liu for her assistance on HPLC measurement.

Author information

These authors contributed equally: Li Hu, Juan Long, Yi Lin.

Authors and Affiliations

Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, 100101, Beijing, China
Li Hu, Juan Long, Yi Lin, Zhongru Gu, Han Su, Xuemin Dong, Zhenzhen Lin, Qian Xiao, Batbayar Bold, Shengkai Pan & Xiangjiang Zhan
Cardiff University - Institute of Zoology Joint Laboratory for Biocomplexity Research, Chinese Academy of Sciences, 100101, Beijing, China
Li Hu, Juan Long, Yi Lin, Zhongru Gu, Han Su, Zhenzhen Lin, Shengkai Pan & Xiangjiang Zhan
University of the Chinese Academy of Sciences, 100049, Beijing, China
Li Hu, Juan Long, Yi Lin, Han Su, Xuemin Dong, Qian Xiao, Batbayar Bold & Xiangjiang Zhan
Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, 100875, Beijing, China
Qian Xiao
Wildlife Science and Conservation Center, Union Building B-802, Ulaanbaatar, 14210, Mongolia
Nyambayar Batbayar & Batbayar Bold
Raptor Protection of Slovakia, Trhová 54, SK-841 01, Bratislava, Slovakia
Lucia Deutschová
Wild Animal Rescue Centre, Krasnostudencheskiy pr., 21-45, Moscow, 125422, Russia
Sergey Ganusevich
Institute of Plant and Animal Ecology, Ural Division Russian Academy of Sciences, 202-8 Marta Street, Ekaterinburg, 620144, Russia
Vasiliy Sokolov
Arctic Research Station of the Institute of Plant and Animal Ecology, Ural Division Russian Academy of Sciences, 21 Zelenaya Gorka, Labytnangi, Yamalo-Nenetski District, 629400, Russia
Aleksandr Sokolov
The John Curtin School of Medical Research, Australian National University, Canberra, ACT, 2601, Australia
Hardip R. Patel
School of Biotechnology and Biomolecular Science, Faculty of Science, UNSW Sydney, Sydney, NSW, 2052, Australia
Paul D. Waters
School of Life Sciences, La Trobe University, Melbourne, Australia
Jennifer Ann Marshall Graves
Emirates Falconers’ Club, Al Mamoura Building (A), P.O. Box 47716, Muroor Road, Abu Dhabi, UAE
Andrew Dixon
International Wildlife Consultants, P.O. Box 19, Carmarthen, SA33 5YL, UK
Andrew Dixon
Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China
Xiangjiang Zhan

Authors

Li Hu
View author publications
You can also search for this author in PubMed Google Scholar
Juan Long
View author publications
You can also search for this author in PubMed Google Scholar
Yi Lin
View author publications
You can also search for this author in PubMed Google Scholar
Zhongru Gu
View author publications
You can also search for this author in PubMed Google Scholar
Han Su
View author publications
You can also search for this author in PubMed Google Scholar
Xuemin Dong
View author publications
You can also search for this author in PubMed Google Scholar
Zhenzhen Lin
View author publications
You can also search for this author in PubMed Google Scholar
Qian Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Nyambayar Batbayar
View author publications
You can also search for this author in PubMed Google Scholar
Batbayar Bold
View author publications
You can also search for this author in PubMed Google Scholar
Lucia Deutschová
View author publications
You can also search for this author in PubMed Google Scholar
Sergey Ganusevich
View author publications
You can also search for this author in PubMed Google Scholar
Vasiliy Sokolov
View author publications
You can also search for this author in PubMed Google Scholar
Aleksandr Sokolov
View author publications
You can also search for this author in PubMed Google Scholar
Hardip R. Patel
View author publications
You can also search for this author in PubMed Google Scholar
Paul D. Waters
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Ann Marshall Graves
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Dixon
View author publications
You can also search for this author in PubMed Google Scholar
Shengkai Pan
View author publications
You can also search for this author in PubMed Google Scholar
Xiangjiang Zhan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.Z. conceived and designed the study. L.H., G.Z., H.S., Q.X., N.B., B.B., L.D., A.S., V.S., S.G., A.D., and S.P. conducted the fieldwork and sample collection. X.Z. and S.P. supervised the genome and population genomic research. L.H., Z.G., X.D., H.R.P., P.D.W., and S.P. performed the data analyses. J.L., Y.L., H.S., Z.L., and Q.X. conducted the molecular experiments. X.Z., S.P., and L.H. wrote the manuscript, with contributions from J.A.M.G. and A.D.

Corresponding authors

Correspondence to Shengkai Pan or Xiangjiang Zhan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Martien A.M. Groenen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hu, L., Long, J., Lin, Y. et al. Arctic introgression and chromatin regulation facilitated rapid Qinghai-Tibet Plateau colonization by an avian predator. Nat Commun 13, 6413 (2022). https://doi.org/10.1038/s41467-022-34138-3

Download citation

Received: 22 December 2021
Accepted: 14 October 2022
Published: 27 October 2022
DOI: https://doi.org/10.1038/s41467-022-34138-3

This article is cited by

Genomic structural variation is associated with hypoxia adaptation in high-altitude zokors
- Xuan An
- Leyan Mao
- Kexin Li
Nature Ecology & Evolution (2024)
Phylogenomic insights into the polyphyletic nature of Altai falcons within eastern sakers (Falco cherrug) and the origins of gyrfalcons (Falco rusticolus)
- Liudmila Zinevich
- Mátyás Prommer
- Gábor Sramkó
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.