The tiger genome and comparative analysis with lion and snow leopard genomes

Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-uk; Luo, Shu-Jin; Johnson, Warren E.; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A.; Marker, Laurie; Harper, Cindy; Miller, Susan M.; Jacobs, Wilhelm; Bertola, Laura D.; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O’Brien, Stephen J.; Wang, Jun; Bhak, Jong

doi:10.1038/ncomms3433

Download PDF

Article
Open access
Published: 17 September 2013

The tiger genome and comparative analysis with lion and snow leopard genomes

Yun Sung Cho¹,
Li Hu²,
Haolong Hou²,
Hang Lee³,
Jiaohui Xu²,
Soowhan Kwon⁴,
Sukhun Oh⁴,
Hak-Min Kim¹,
Sungwoong Jho¹,
Sangsoo Kim⁵,
Young-Ah Shin¹,
Byung Chul Kim^1,6,
Hyunmin Kim⁶,
Chang-uk Kim¹,
Shu-Jin Luo⁷,
Warren E. Johnson⁸,
Klaus-Peter Koepfli⁹,
Anne Schmidt-Küntzel¹⁰,
Jason A. Turner¹¹,
Laurie Marker¹²,
Cindy Harper¹³,
Susan M. Miller^13,14,
Wilhelm Jacobs¹⁵,
Laura D. Bertola¹⁶,
Tae Hyung Kim⁶,
Sunghoon Lee^1,6,
Qian Zhou²,
Hyun-Ju Jung⁶,
Xiao Xu⁷,
Priyvrat Gadhvi¹,
Pengwei Xu²,
Yingqi Xiong²,
Yadan Luo²,
Shengkai Pan²,
Caiyun Gou²,
Xiuhui Chu²,
Jilin Zhang²,
Sanyang Liu²,
Jing He²,
Ying Chen²,
Linfeng Yang²,
Yulan Yang²,
Jiaju He²,
Sha Liu²,
Junyi Wang²,
Chul Hong Kim⁶,
Hwanjong Kwak⁶,
Jong-Soo Kim¹,
Seungwoo Hwang¹⁷,
Junsu Ko⁶,
Chang-Bae Kim¹⁸,
Sangtae Kim¹⁹,
Damdin Bayarlkhagva²⁰,
Woon Kee Paek²¹,
Seong-Jin Kim^6,22,
Stephen J. O’Brien^9,23,
Jun Wang^2,24,25 &
…
Jong Bhak^1,6,26,27

Nature Communications volume 4, Article number: 2433 (2013) Cite this article

70k Accesses
166 Citations
332 Altmetric
Metrics details

Subjects

Abstract

Tigers and their close relatives (Panthera) are some of the world’s most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats’ hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species.

Conservation genomic analysis reveals ancient introgression and declining levels of genetic diversity in Madagascar’s hibernating dwarf lemurs

Article Open access 21 August 2019

Rachel C. Williams, Marina B. Blanco, … Anne D. Yoder

A high-quality genome assembly highlights the evolutionary history of the great bustard (Otis tarda, Otidiformes)

Article Open access 18 July 2023

Haoran Luo, Xinrui Jiang, … Luohao Xu

Chromosome-scale genome assembly of the brown anole (Anolis sagrei), an emerging model species

Article Open access 25 October 2022

Anthony J. Geneva, Sungdae Park, … Jonathan B. Losos

Introduction

The tiger (Panthera tigris), the largest felid species on Earth and a widely recognized symbol for wildlife conservation¹, is one of the world’s most endangered species. Tigers are a keystone species and natural indicators of the health of the ecological communities in which they are found². The current estimates of wild tigers range from just 3,050 to 3,950 individuals. It is postulated that without conservation measures tigers will soon become extinct in the wild, thus turning the preservation of existing wild tiger populations into a major goal of conservation efforts^3,4. Tigers comprise of nine genetically validated subspecies^1,5,6. Four of these went extinct in the wild during the last century (Javan, Balinese, South China and Caspian tigers), leaving five extant subspecies (Amur, Bengal, Indochinese, Malayan and Sumatran tigers)⁵. The Amur tiger (Panthera tigris altaica) is the largest in overall size and the only subspecies inhabiting snow-covered regions.

Previous genetic studies using mitochondrial and nuclear loci have helped to elucidate the phylogeography and population genetics of tigers^1,5,6,7, and the low coverage genome (1.8 × ) of the domestic cat (Felis catus) has provided insights into felid evolution^8,9. However, no whole-genome reference sequence has been reported for the tiger, or any of the Panthera species, thus limiting current understanding of genetic diversity and demography^10,11,12.

We report the first tiger genome sequence assembly and annotation as well as a comparative analysis of the lion (Panthera leo) and snow leopard (Panthera uncia) genomes. We describe genotypic variation and genotype association analyses with species-specific phenotypes and adaptation. Panthera whole-genome sequences provide valuable information on genome organization, evolutionary divergence and overall endemic diversity.

Results

The Amur tiger genome

The DNA of a 9-year-old male Amur tiger from Everland Zoo in Korea was sequenced by Illumina HiSeq2000 (Supplementary Fig. S1, Supplementary Tables S1–S3). Sequence reads were assembled using SOAPdenovo¹³ into scaffolds (2.4 Gb in length) having an N50 length of 8.84 Mb (contig N50 length of 29.8 kb; Table 1, Supplementary Figs S2–S4, Supplementary Tables S4 and S5, Methods). Assembly quality was assessed by aligning the assembled tiger blood transcripts and cat EST sequences onto the tiger scaffolds (>96% coverage and 98.9% mapping rate, respectively), and heterozygous single nucleotide variants (SNVs) were validated by the Sanger method (Supplementary Tables S6–S9, Supplementary Methods). Additionally, analysis of the tiger draft genome assembly for core eukaryotic genes¹⁴ revealed homologues for >93.4% of conserved genes in the assembly (Supplementary Table S10). The tiger genome sequence shows 95.6% similarity to the domestic cat (Supplementary Table S11) from which it diverged approximately 10.8 million years ago (MYA)¹⁵; human and gorilla have 94.8% similarity and diverged around 8.8 MYA (from TimeTree). This high similarity allowed us to improve the assembly of the tiger genome by using the recently completed high coverage (12 × coverage) domestic cat genome (Supplementary Fig. S5, Supplementary Tables S12–S17, Supplementary Methods). For comparative genomic analysis of big cats, we also sequenced four other Panthera genomes (Table 1, Supplementary Tables S1 and S18): a white Bengal tiger (Panthera tigris tigris), an African lion, a white African lion and a snow leopard; their genome sequences were aligned with the domestic cat and tiger genomes.

Table 1 Global statistics of the Panthera genomes.

Full size table

Adaptation of the big cats

The assembled Amur tiger genome was predicted to contain 20,226 protein-coding genes (Supplementary Tables S19–S23, Supplementary Methods) and 2,935 non-coding RNAs (Supplementary Table S24, Supplementary Methods). To create a detailed annotation of the tiger proteome, gene clusters were constructed using seven mammalian genomes (tiger, cat, human, dog, mouse, giant panda and opossum). The tiger proteome contained 14,954 orthologous gene families. Among these, 14,425 orthologous gene families were shared by all seven genomes, whereas 103 orthologous gene families were exclusively shared by the tiger and cat (Fig. 1a, Supplementary Fig. S6, Supplementary Table S25). The Felidae-specific gene families contained 287 InterPro domains (Supplementary Tables S26–S29). Based on the comparison of orthologous gene families among seven mammalian species, the Amur tiger genome displays 381 expanded and 1,790 contracted gene families compared with the feline common ancestor (Fig. 1b, Supplementary Figs S7 and S8). The tiger genome is particularly enriched in olfactory receptor activity (GO:0004984, P=5.75 × 10⁻¹⁸⁵, ChiSquare test followed by a Fisher's exact test, 289 genes), G-protein coupled receptor signalling pathway (GO:0007186, P=2.98 × 10⁻¹⁰⁶, 302 genes), signal transducer activity (GO:0004871, P=2.25 × 10⁻⁷⁴, 295 genes), amino-acid transport (GO:0006865, P=3.09 × 10⁻¹⁰, 16 genes) and protein metabolic process (GO:0019538, P=5.72 × 10⁻¹⁰, 220 genes) (Supplementary Fig. S9, Supplementary Table S30). In most cats, smell has an important role in social behaviour such as territory ownership and mating, while vision and hearing are important for hunting¹⁶.

**Figure 1: Relationship of the tiger to other mammalian species.**

Additionally, we investigated Panthera lineage-specific amino-acid changes by comparison with the known genes from the human, dog and mouse. A total of 3,646 genes had amino-acid changes specific to big cats (Amur tiger, white tiger, snow leopard, African lion and white lion; Supplementary Tables S31 and S32), and 5,882 genes had amino-acid changes unique to the felid lineage (big cats plus domestic cat). Among these, 1,376 genes had big cat-specific amino-acid changes that were found to be protein functional changes according to computational predictions (PolyPhen2¹⁷). Metabolism pathways associated with protein and fatty acid, which are important sources of energy, were enriched with genes having Panthera-specific functional changes; histidine metabolism (P=0.00024, Fisher's exact test, six genes), beta-alanine metabolism (P=0.00078, six genes), phenylalanine metabolism (P=0.014, three genes), valine, leucine and isoleucine degradation (P=0.035, six genes), cysteine and methionine metabolism (P=0.037, four genes), fatty acid metabolism (P=0.00038, eight genes) and fat digestion and absorption (P=0.025, five genes) (Supplementary Tables S33 and S34). These signals of amino-acid metabolism have been associated with an obligatory carnivorous diet¹⁸.

In order to detect tiger genes evolving under positive selection, we used the conserved genome synteny methodology¹⁹ (between human and other mammalian species (cat, dog, mouse and panda)) and a branch-site likelihood ratio test (Methods). A total of 178 positively selected genes out of 7,415 high-quality ortholog genes were identified in the Amur tiger (Supplementary Table S35, Supplementary Data 1). Functional categories for positively selected genes were over-represented in muscle filament sliding (GO:0030049, P=0.0049, Fisher's exact test, MYH7, TPM4 and TNNC2), filamentous actin (GO:0031941, P=0.0062, TPM4 and MYO1A) and stress fibre (GO:0001725, P=0.0039, MYH7, TPM4 and ACTN4) (Supplementary Table S36, Supplementary Data 2). We also identified GO categories, which are evolving under significantly high constraints¹⁹ (Supplementary Tables S37–S39, Supplementary Methods). Significantly altered Ka/Ks ratios of non-synonymous to synonymous substitutions for GO categories (P<0.01, Binomial test) revealed evidence of rapid evolution in the tiger for muscle strength (muscle contraction and actin cytoskeleton), energy metabolism (GTPase activity, ATP binding and energy reserve metabolic process), and sensory nerves (G-protein coupled receptor activity, olfactory receptor activity, visual perception and nervous system development) (Supplementary Tables S40–S43).

Genetic landscape of the snow leopard and white lion

In addition to the Amur tiger data, we used sequence data from the four other big cats to investigate the genetic basis of several unique physiological or phenotypic traits. Snow leopards generally live in alpine areas, 3,350–6,700 m above sea level, in Central Asia²⁰. Recent genome-wide association studies implicated two human loci EGLN1 (Egl nine homologue 1) and EPAS1 (endothelial PAS domain-containing protein 1) as mediating high-altitude adaptation²¹. We examined mutational substitutions in mammalian EGLN1 and EPAS1 genes and found that the snow leopard had unique amino-acid changes in both genes that were not found in other mammalian species. Although EGLN1 is highly conserved in mammals, Met39 (non-polar), instead of Lys39 (positively charged), was found in the snow leopard (Fig. 2a, Supplementary Fig. S10, Supplementary Table S44), indicating a significant change in charge that might alter protein function. This Met39 residue was replicated as a genetically fixed substitution in 14 additional snow leopards, whereas the ancestral Lys39 was monomorphic in a sampling of 28 individuals of Panthera and Neofelis (clouded leopard) (Supplementary Table S45, Supplementary Methods). Naked mole rats have also adapted to hypoxia²² in a different manner by having unique amino-acid changes in different positions of EGLN1 (Pro15, Arg17 and Arg36). Ile663 and Arg794 in EPAS1 are two additional snow leopard-specific changes (Supplementary Fig. S11), and Arg794 was predicted to confer a functional change on the protein. Taken together, these EGLN1 and possibly EPAS1 variants are provocative candidates that may have contributed to the snow leopard’s acquisition of an alpine, high altitude ecological niche.

**Figure 2: *EGLN1* and *TYR* mutations related to hypoxia in snow leopard and white fur in white lion.**

Tyrosinase (TYR) mutational variants cause white coat colour in the domestic cat²³, and TYR mutations are related to human oculocutaneous albinism 1 (refs 24, 25). The genetic basis of the white tiger with white fur and dark stripes is revealed as an amino-acid change (A477V) in the transporter protein SLC45A2 (ref. 26). Therefore, we examined the pigment-associated gene mutation in the white lion and found a unique nucleotide (TYR260G>A) and corresponding amino-acid change in the white lion (Fig. 2b, Supplementary Fig. S12), causing the positively charged Arg87, located in the central domain of TYR, to be changed to uncharged Gln87. We verified the proposed candidate mutation (TYR260G>A) underlying the amino-acid change (R87Q) in the TYR gene sequence of 47 lions: 17 of white phenotype and 30 of wild phenotype, of which 11 were known to be carriers (heterozygotes) based on their pedigree, 14 of unknown genetic makeup, and 5 of wild phenotype from control populations where no white lions were found (Supplementary Tables S46–S48, Supplementary Methods). The concordance between the expected and observed genotype was 100% for the candidate mutation TYR260G>A. A second non-synonymous mutation (TYR176C>T), observed in a subset of animals, did not correlate with the expected genotype, and animals with this variant did not appear to be phenotypically different.

Genomic comparison between the tiger and other mammals

Although repeat characteristics can vary considerably among closely related species²⁷, the tiger and cat genomes showed very similar repeat compositions (39.3% versus 39.2%, respectively), as well as ratios of repeat components, including tandem repeats and transposable elements (Supplementary Fig. S13), suggesting a similar genome architecture between domestic cats and tigers. By contrast, for the great apes, the ratio of repeat components was considerably different between species, especially between human and orangutan, which diverged about 12 MYA²⁸. Additionally, we estimated the evolutionarily conserved sequences (77 Mb, 3.2%), segmental duplication (11.2 Mb, 0.47%) and lineage-specific insertions and deletions in the tiger genome (Supplementary Tables S49–S52, Supplementary Methods).

To detect genome-wide structural differences, we aligned the tiger scaffolds to the cat genome using dog genome as a reference after masking repeats. A total of 571 of the 674 tiger scaffolds (length >20 kb, 99.6% of the total scaffold length) were aligned with the cat genome sequence, and as high as 98.8% of gene-coding regions and 98.3% (2.38 Gb) of conserved synteny blocks were shared by the tiger and cat genomes. We detected a rather high level of genomic synteny, containing six breaks with large-size chromosomal segmental rearrangement between the tiger and cat genomes (Fig. 3, Supplementary Fig. S14, Supplementary Tables S53–S56, Methods). These consisted of two inter- and four intra-chromosomal rearrangements. Divergence in genomic structure among closely related species is considered as a major factor underlying species diversification, as gene flow requires recombination in collinear chromosomes, and the reduction in recombination associated with chromosomal rearrangements results in a partial reproductive barrier²⁹. These structural variations may be one of the important factors underlying species diversification among the felines.

**Figure 3: Synteny blocks between tiger and cat genomes.**

The level of within-species genetic diversity, as measured by the rate of heterozygous SNVs, of the tiger (0.00049–0.00073) and lion (0.00048–0.00058) genomes was found to be similar to that of human (0.00066) (Supplementary Table S57, Supplementary Methods). Interestingly, diversity of the snow leopard genome was nearly half that of the other Panthera species and slightly lower than that of the Tasmanian devil³⁰, which is purported to display a low level of genetic diversity (Fig. 4a). We also estimated the occurrence of a marked bottleneck around the last glacial maximum 20 kyr ago (7–70 kyr) using a pairwise sequentially Markovian coalescent (PSMC)³¹ model inference of tiger demographic history based on SNV distribution (Fig. 4b, Supplementary Figs S15–S18, Supplementary Tables S58 and S59, Methods). A similar bottleneck was estimated a bit earlier (72–108 kyr) based on mitochondrial DNA coalescence¹. White lion (0.00048) and domestic cat (0.00012) have both undergone multiple rounds of close inbreeding during breed development and therefore would display lower SNV diversity bias as a consequence. Therefore, we investigated the genetic diversity of Panthera using the rate of heterozygous SNVs and confirmed that the genetic diversity within a single individual coincided with that deduced from mitochondrial sequences of several individuals³⁰.

**Figure 4: Genetic diversity and population size history in *Panthera* species.**

Discussion

The Amur tiger genome is the first reference genome sequenced from the Panthera lineage and the second from the Felidae species. For comparative genomic analyses of big cats, we additionally sequenced four other Panthera genomes and tried to predict possible big cats’ molecular adaptations consistent with the obligatory meat eating and muscle strength of the predatory Panthera lineage. The tiger and cat genomes showed unexpectedly similar repeat compositions and high genomic synteny, and these indicated strong genomic conservation in Felidae. These results could be supported by the recency of the 37 species-Felidae radiation (<11 MYA)¹⁵ and well-known hybridizations in captivity among subspecies in Felidae lineage such as liger and tigon. By contrast, the ratio of repeat components for the great apes was considerably different among species, especially between human and orang-utan²⁸, which diverged about the same time as felines. The breaks in synteny that we observed are likely occasional rare sporadic exchanges that accumulated over this short period (<11 MYA) of evolutionary time. The paucity of exchanges across the mammalian radiations (by contrast to more reshuffled species such as Canidae, Gibbons, Ursidae and New World monkeys) is a hallmark of evolutionary constraints.

Many whole-reference genome studies used few close species genomes that can be directly compared with the reference genome constructed. Although we did not have the resources to construct lion and leopard reference genomes, and hence were not able to show all the structural variations on the genomes, our ‘close species comparative genomics’ approach, utilizing at least one reference species, heralds a new level of genome studies. It is because those very close Panthera species have distinct species-specific and readily identifiable phenotypes that can be associated quickly to mutations by comparing the homologous genes of interest as shown in the fur colouring (white lion) and high-altitude adaptation (snow leopard). If sufficiently distinct phenotypes are biologically curated, genetic mutations causing species specificity can be systematically detected using next generation sequencing. Once such candidate genetic mutations are confirmed in the set of species genomes, experimental validations can be carried out, as in the additional 47 lion samples here, for targeted genes. This genetic variation comparison using whole genomes among species and subspecies can thus provide valuable insight and information for the whole family’s conservation. Our data from tigers, lions and snow leopard can provide a rich and diverse genome resource that could be used in future studies of conservation and population genomics so that the genetic underpinnings of local adaptation and potential inbreeding and/or outbreeding¹⁰ in wild and captive populations can be illuminated and thereby help ensure the future survival of these majestic species.

Methods

Genome sequence assembly and annotation

The blood samples used for genome sequencing were acquired from the Everland Zoo of Korea (Amur tiger, white Bengal tiger, African lion and white African lion) following the Everland Zoo (Korea) ethical guidelines and procedures, and a muscle sample was obtained from a Mongolian snow leopard carcass preserved in the Conservation Genome Resource Bank for Korean Wildlife, Seoul National University. No animals were killed or captured as a result of this study. Libraries for the Amur tiger genome were constructed at BGI, Shenzhen, and the insert sizes of the libraries were 170 bp, 500 bp, 800 bp, 2 kb, 5 kb, 10 kb and 20 kb. The libraries were sequenced using HiSeq2000. Other big cat genomes were sequenced at Theragen BiO Institute (TBI), Korea, using HiSeq2000 with read and insert lengths of ~90 bp and ~400 bp, respectively.

The corrected reads were used to complete the genome assembly using SOAPdenovo¹³. First, the short insert size library (170 bp, 500 bp and 800 bp) data were used to construct a de Bruijn graph. Second, all reads were realigned with the contig sequences. The amount of shared paired-end relationships between pairs of contigs were calculated and weighted with the rate of consistent and conflicting paired ends, before constructing the scaffolds step by step from the short insert size paired ends to the long distant paired ends. Third, the gaps between the constructed scaffolds were closed using the paired-end information to retrieve read pairs where one end mapped to a unique contig while the other was located in the gap region.

The Amur tiger genes were predicted using three approaches. First, de novo prediction was performed using the repeat-masked genome using AUGUSTUS (version 2.5.5)³² and GENSCAN (version 1.0)³³. Second, homologous proteins in other species were mapped to the genome using tBLASTn (Blast 2.2.23)³⁴ with an E-value cutoff of 1E-5. The aligned sequence and its query protein were then filtered and passed to GeneWise (version 2.2.0)³⁵ to search for accurately spliced alignments. Third, cat EST and full-length cDNA sequences (from UCSC) were aligned to the genome using BLAT³⁶ to generate spliced alignments. For EST results, spliced alignments were linked according to overlap using PASA³⁷. Source evidence generated from the three approaches was integrated with GLEAN³⁸ to produce a consensus gene set. Then, the Amur tiger genome sequence was aligned to two well-assembled and annotated genomes (human and domestic cat) using LASTZ (version 1.02). Finally, mapped results yielding information on homologous proteins were filtered by syntenic blocks of genome sequences. We also predicted the domestic cat (Felis_catus-6.2) gene set, because the gene set of the cat genome is preliminary.

Orthologous gene families

A comparative analysis was used to examine the rate of protein evolution and the conservation of gene repertoires among orthologs in the genomes of the Amur tiger, dog, human, mouse, giant panda, domestic cat (Felis_catus-6.2) and opossum. We used the TreeFam methodology³⁹ to define a gene family as a group of genes that descended from a single gene in the last common ancestor of a considered species. We assigned a connection (edge) between two nodes (genes) if more than 1/3 of the region was aligned to both genes. An H-score (minimum edge weight) that ranged from 0 to 100 was used to weigh the similarity (edge). For two genes, G1 and G2, the H-score was defined as score (G1G2)/max (score (G1G1), score (G2G2)), where the score shown is the BLAST raw score. Gene families were extracted by clustering using Hcluster_sg. We used the average distance for the hierarchical clustering algorithm, requiring the H-score to be larger than five, and the minimum edge density (total number of edges/theoretical number of edges) to be larger than 1/3. The clustering for a gene family would also stop if it already had one or more of the out-group genes.

We determined the expansion and contraction of the orthologous protein families among seven mammalian species (tiger, cat (Felis_catus-6.2), dog, human, mouse, giant panda and opossum) using CAFÉ 2.2 (ref. 40) with 0.001080 of lambda option. GO of all tiger genes was annotated by InterPro. A χ² test followed by a Fisher’s exact test (P≤0.01) were used to test for over-represented functional categories among expanded genes and ‘genome background’ genes; Fisher’s exact test was used when any expected value of count was below 5, which would have make the χ² test inaccurate⁴¹.

Gene evolution

We investigated Panthera lineage-specific amino-acid changes by comparison with the known genes from the human, dog and mouse (from the Ensembl 69 release). We used lion and snow leopard gene sets by mapping reads to the tiger scaffolds and substituting SNVs. Artifacts from the multiple sequence alignment (ClustalW2⁴²) limitations were removed by filtering option with ≥1/2 of coverage and ≥of well-matched amino acids (consensus string is ‘*’, ‘:’ or ‘.’).

To detect tiger genes evolving under positive selection, we used conserved genome synteny methodology¹⁹ to establish a high-confidence orthologous gene set. Briefly, whole-genome multiple alignments were performed between human (hg19) and other species (cat (Felis_catus-6.2), dog (CanFam2.0), mouse (mm9) and panda (ailMel1) genomes) by the LASTZ alignment pipeline. We collected all the human protein-coding genes from RefSeq⁴³, KnownGene⁴⁴ and VEGA⁴⁵, and mapped them to the other species via the syntenic regions. We then filtered the resulting blocks with rigorous conditions to get large-scale synteny of high-alignment quality, and a conservation of exon–intron structure. Finally, we found 7,415 1:1 high-quality ortholog genes to analyse, most of which also correspond to genes in the panda, dog and mouse genomes. Then, we aligned ortholog genes by PRANK⁴⁶ and used the optimized branch-site model of PAML (version 4.5) and likelihood ratio tests (LRTs) (P≤0.05). A GO annotation download from Ensembl was used to assign GO categories to 7,415 orthologs. A χ² test followed by a Fisher’s exact test (P≤0.01) were used to test for over-represented functional categories among positively selected genes; a Fisher’s exact test was used when any expected value of count was below 5, which would have made the χ² test inaccurate⁴¹.

We also used an approach based on Ka/Ks^47,48 to identify GO categories significantly above or below average in the tiger genome. The Ka and Ks rates are estimated by PAML from all aligned bases with a quality score >20 in orthologs, using the F3 × 4 codon frequency model and the REV substitution matrix. To determine whether the GO categories are evolving under significantly high constraints, we repeated this procedure 10,000 times on the same data set after randomly permuting the GO annotations. Then, we acquired the GO categories if the P-value was less than 0.05.

Chromosomal rearrangement

Among the alignment data generated from SyMAP⁴⁹, when one scaffold happened to be mapped to several physically distant cat (Felis_catus-6.2) chromosomal locations, they were considered to be inter- or intra-chromosomal rearrangement events of the Amur tiger genome relative to the cat genome. The species (tiger and domestic cat)-specific genomic rearrangements were also analysed. We performed the dog versus tiger and cat versus tiger whole-genome pair-wise alignments using LASTZ software on the repeat-masked genomes. Using these methods, we identified clusters of unique alignments with well-defined order and orientation. There was a total of 18 chromosomal rearrangement (12 inter- and 6 intra-chromosomal rearrangements) overlaps when the results from SyMAP and LASTZ were integrated by comparing syntenic break positions. As the tiger assembly was generally fragmented, we carefully validated the 18 syntenic breaks to examine the assembly integrity by aligning long insert mate-pair libraries (2 kb, 5 kb, 10 kb and 20 kb) to the tiger scaffolds. Finally, we reported six putative chromosomal rearrangements (two inter- and four intra-chromosomal rearrangements) between the tiger and cat. All six rearrangements were validated by long-range PCR experiments followed by the Sanger sequencing method.

Demographic history

The history of population size helps to develop insights into evolution. Based on the pairwise sequentially Markovian coalescent model (PSMC)³¹, we inferred detailed population size histories of Amur tiger (TG), African lion (LN), snow leopard (SL), white tiger (WTG) and white lion (WLN). Using SNV data sets scanned with all the big cat sequencing reads mapped to Felis_catus-6.2, the consensus sequences of each big cat were constructed and then divided into non-overlapping 100-bp bins marked as homozygous or heterozygous. The resultant bin sequences for their sex chromosomal parts were removed, and then they were taken as the input of the PSMC estimation. To test the estimation accuracy, bootstrapping was performed by randomly resampling 100 sequences from the original sequences. Using the neutral mutation rates, the raw PSMC outputs were scaled to time and population sizes. We obtained atmospheric surface air temperature and global relative sea level data of the past 3 million years⁵⁰.

Additional information

Accession codes: The Amur tiger whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number ATCQ00000000. The version described in this paper is the first version, ATCQ01000000. Raw DNA and RNA sequencing reads have been submitted to the NCBI Sequence Read Archive database (SRA074975, SRA091968).

How to cite this article: Cho, Y. S. et al. The tiger genome and comparative analysis with lion and snow leopard genomes. Nat. Commun. 4:2433 doi: 10.1038/ncomms3433 (2013).

Accession codes

Accessions

Sequence Read Archive

References

Luo, S. J. et al. Phylogeography and genetic ancestry of tigers (Panthera tigris). PLoS Biol. 2, e442 (2004).
Article Google Scholar
Smith, J. L. D., Ahern, S. C. & McDougal, C. Landscape analysis of tiger distribution and habitat quality in Nepal. Conserv. Biol. 12, 1338–1346 (1998).
Article Google Scholar
Chundawat, R. S. et al. Panthera tigris. In: IUCN 2012. IUCN red list of threatened species. version 2012.1 www.iucnredlist.org (2011).
Walston, J. et al. Bringing the tiger back from the brink-the six percent solution. PLoS Biol. 8, e1000485 (2010).
Article Google Scholar
Luo, S. J., Johnson, W. E. & O'Brien, S. J. Applying molecular genetic tools to tiger conservation. Integr. Zool. 5, 351–362 (2010).
Article Google Scholar
Driscoll, C. A. et al. Mitochondrial phylogeography illuminates the origin of the extinct Caspian tiger and its relationship to the Amur tiger. PLoS One 4, e4125 (2009).
Article ADS Google Scholar
Kitpipit, T. & Linacre, A. The complete mitochondrial genome analysis of the tiger (Panthera tigris). Mol. Biol. Rep. 39, 5745–5754 (2012).
Article CAS Google Scholar
Pontius, J. U. et al. Initial sequence and comparative analysis of the cat genome. Genome Res. 17, 1675–1689 (2007).
Article CAS Google Scholar
Mullikin, J. C. et al. Light whole genome sequence for SNP discovery across domestic cat breeds. BMC Genomics 11, 406 (2010).
Article Google Scholar
Allendorf, F. W., Hohenlohe, P. A. & Luikart, G. Genomics and the future of conservation genetics. Nat. Rev. Genet. 11, 697–709 (2010).
Article CAS Google Scholar
Murchison, E. P. et al. Genome sequencing and analysis of the Tasmanian devil and its transmissible cancer. Cell 148, 780–791 (2012).
Article CAS Google Scholar
Zhao, S. et al. Whole-genome sequencing of giant pandas provides insights into demographic history and local adaptation. Nat. Genet. 45, 67–71 (2012).
Article CAS Google Scholar
Li, R. et al. SOAP2: an improved ultrafast tool from short read alignment. Bioinformatics 25, 1966–1967 (2009).
Article CAS Google Scholar
Parra, G., Bradnam, K. & Korf, I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23, 1061–1067 (2007).
Article CAS Google Scholar
Johnson, W. E. et al. The late Miocene radiation of modern Felidae: a genetic assessment. Science 311, 73–77 (2006).
Article CAS ADS Google Scholar
Corbett, J. The Temple Tiger and More Man-eaters of Kumaon Oxford Univ. Press: London, (1954).
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010).
Article CAS Google Scholar
Irizarry, K. J. et al. Sequencing and comparative genomic analysis of 1227 Felis catus cDNA sequences enriched for developmental, clinical and nutritional phenotypes. BMC Genomics 13, 31 (2012).
Article CAS Google Scholar
Qiu, Q. et al. The yak genome and adaptation to life at high altitude. Nat. Genet. 44, 946–949 (2012).
Article CAS Google Scholar
Charudutt, M. et al. The role of incentive programs in conserving the snow leopard. Conserv. Biol. 17, 1512–1520 (2003).
Article Google Scholar
Xu, S. et al. A genome-wide search for signals of high-altitude adaptation in Tibetans. Mol. Biol. Evol. 28, 1003–1011 (2011).
Article Google Scholar
Kim, E. B. et al. Genome sequencing reveals insights into physiology and longevity of the naked mole rat. Nature 479, 223–227 (2011).
Article CAS ADS Google Scholar
Schmidt-Küntzel, A., Eizirik, E., O'Brien, S. J. & Menotti-Raymond, M. Tyrosinase and tyrosinase related protein 1 alleles specify domestic cat coat colour phenotypes of the albino and brown loci. J. Hered. 96, 289–301 (2005).
Article Google Scholar
King, R. A. et al. Tyrosinase gene mutations in oculocutaneous albinism 1 (OCA1): definition of the phenotype. Hum. Genet. 113, 502–513 (2003).
Article CAS Google Scholar
Opitz, S., Käsmann-Kellner, B., Kaufmann, M., Schwinger, E. & Zühlke, C. Detection of 53 novel DNA variations within the tyrosinase gene and accumulation of mutations in 17 patients with albinism. Hum. Mutat. 23, 630–631 (2004).
Article Google Scholar
Xu, X. et al. The genetic basis of white tigers. Curr. Biol. 23, 1031–1035 (2013).
Article CAS Google Scholar
Mayer, C., Leese, F. & Tollrian, R. Genome-wide analysis of tandem repeats in Daphnia pulex-a comparative approach. BMC Genomics 11, 277 (2010).
Article Google Scholar
Locke, D. P. et al. Comparative and demographic analysis of orang-utan genomes. Nature 469, 529–533 (2011).
Article CAS ADS Google Scholar
Zhang, J., Wang, X. & Podlaha, O. Testing the chromosomal speciation hypothesis for humans and chimpanzees. Genome Res. 14, 845–851 (2004).
Article CAS Google Scholar
Miller, W. et al. Genetic diversity and population structure of the endangered marsupial Sarcophilus harrisii (Tasmanian devil). Proc. Natl Acad. Sci. USA 108, 12348–12353 (2011).
Article CAS ADS Google Scholar
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Article CAS Google Scholar
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–W439 (2006).
Article CAS Google Scholar
Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997).
Article CAS Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and Genomewise. Genome Res. 14, 988–995 (2004).
Article CAS Google Scholar
Kent, W. J. BLAT-the BLAST-like alignment tool. Genome Res. 12, 656–664 (2002).
Article CAS Google Scholar
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).
Article CAS Google Scholar
Elsik, C. G. et al. Creating a honey bee consensus gene set. Genome Biol. 8, R13 (2007).
Article Google Scholar
Li, H. et al. TreeFam: a curated database of phylogenetic trees of animal gene families. Nucleic Acids Res. 34, D572–D580 (2006).
Article CAS Google Scholar
Hahn, M. W., Demuth, J. P. & Han, S. G. Accelerated rate of gene gain and loss in primates. Genetics 177, 1941–1949 (2007).
Article Google Scholar
Chen, S. et al. De novo analysis of transcriptome dynamics in the migratory locust during the development of phase traits. PLoS One 5, e15633 (2010).
Article CAS ADS Google Scholar
Chenna, R. et al. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 31, 3497–3500 (2003).
Article CAS Google Scholar
Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 33, D501–D504 (2005).
Article CAS Google Scholar
Hsu, F. et al. The UCSC Known Genes. Bioinformatics. 22, 1036–1046 (2006).
Article CAS Google Scholar
Wilming, L. G. et al. The vertebrate genome annotation (Vega) database. Nucleic Acids Res. 36, D753–D760 (2008).
Article CAS Google Scholar
Löytynoja, A. & Goldman, N. An algorithm for progressive multiple alignment of sequences with insertions. Proc. Natl Acad. Sci. USA 102, 10557–10562 (2005).
Article ADS Google Scholar
Rhesus Macaque Genome Sequencing and Analysis Consortium. Evolutionary and biomedical insights from the rhesus macaque genome. Science 316, 222–234 (2007).
Chimpanzee Sequencing and Analysis Consortium. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437, 69–87 (2005).
Soderlund, C., Bomhoff, M. & Nelson, W. M. SyMAP v3.4: a turnkey synteny system with application to plant genome. Nucleic Acids Res. 39, e68 (2011).
Article CAS Google Scholar
Bintanja, R. & van de Wal, R. S. North American ice-sheet dynamics and the onset of 100,000-year glacial cycles. Nature 454, 869–872 (2008).
Article CAS ADS Google Scholar

Download references

Acknowledgements

This work was supported by the Industrial Strategic Technology Development Program, 10040231, ‘Bioinformatics platform development for next generation bioinformation analysis’ funded by the Ministry of Knowledge Economy (MKE, Korea). This work was partly supported by the National Research Foundation of Korea (NRF-2011-0019745) and grants from the Ministry of Science, ICT & Future Planning (MSIP), Republic of Korea (2012R1A1A2043851 and NRF-2008-2004707). This research was also supported in part by Russian Ministry of Science Mega-grant no.11.G34.31.0068; SJ O'Brien Principal Investigator. Scaffolds (sequence data) and gene sets are available at http://tigergenome.org. We thank Dr Rui-Qiang Li for contributing to the manuscript. We thank Maryana Bhak for editing. Authors thank many people not listed as authors who gave us feedback, samples and encouragement, especially Won-shik Min. This project was initiated with the generous support of TheragenEtex, Seong-Jin Kim and Jin-Up Goh. We thank the following people and organization for providing valuable samples, data and time for this project: Johannesburg Zoo, in particular Director Dominic Moss for pedigree information and permission to use samples, Janine Fearon and Lucia Muuhlu for laboratory work done at the Life Technologies Conservation Genetics Laboratory of the Cheetah Conservation Fund, Ukutula Lodge & Lion Centre, Global White Lion Protection Trust and Tsau Conservancy, Bhagavan “Doc” Antle from Tiger's Preserve, Myrtle Beach, and Ouwehands Dierenpark for contributing to this project with their lion population, Dr Carlos Driscoll for making one control sample (PLE171) available to this project, and Life Technologies and Biomatters for generous support with software (Life Technologies and Geneious) and equipment (Life Technologies) of the Genetics Laboratory at the Cheetah Conservation Fund. Finally, we thank the many conservationists who have helped protect endangered big cats throughout the world.

Author information

Authors and Affiliations

Personal Genomics Institute, Genome Research Foundation, Suwon, 443-270, Republic of Korea
Yun Sung Cho, Hak-Min Kim, Sungwoong Jho, Young-Ah Shin, Byung Chul Kim, Chang-uk Kim, Sunghoon Lee, Priyvrat Gadhvi, Jong-Soo Kim & Jong Bhak
BGI-Shenzhen, Shenzhen, 518083, China
Li Hu, Haolong Hou, Jiaohui Xu, Qian Zhou, Pengwei Xu, Yingqi Xiong, Yadan Luo, Shengkai Pan, Caiyun Gou, Xiuhui Chu, Jilin Zhang, Sanyang Liu, Jing He, Ying Chen, Linfeng Yang, Yulan Yang, Jiaju He, Sha Liu, Junyi Wang & Jun Wang
Tiger and Leopard Conservation Fund in Korea and College of Veterinary Medicine, Seoul National University, Seoul, 151-742, Republic of Korea
Hang Lee
Samsung Everland Zoo, Yongin, 449-715, Republic of Korea
Soowhan Kwon & Sukhun Oh
School of Systems Biomedical Science, Soongsil University, Seoul, 156-743, Korea
Sangsoo Kim
Theragen BiO Institute, TheragenEtex, Suwon, 443-270, Republic of Korea
Byung Chul Kim, Hyunmin Kim, Tae Hyung Kim, Sunghoon Lee, Hyun-Ju Jung, Chul Hong Kim, Hwanjong Kwak, Junsu Ko, Seong-Jin Kim & Jong Bhak
Peking-Tsinghua Center for Life Sciences, School of Life Sciences, Peking University, Beijing, 100871, China
Shu-Jin Luo & Xiao Xu
Smithsonian Conservation Biology Institute, National Zoological Park 1500 Remount Road, Front Royal, 22630, Virgina, USA
Warren E. Johnson
Theodosius Dobzhansky Center for Genome Bioinformatics, St Petersburg State University, St Petersburg, 199034, Russia
Klaus-Peter Koepfli & Stephen J. O’Brien
Life Technologies Conservation Genetics Laboratory, Cheetah Conservation Fund, Otjiwarongo, 9000, Namibia
Anne Schmidt-Küntzel
Global White Lion Protection Trust, Tsau Conservancy, Greater Timbavati Region, PO Box 858, Limpopo, 1380, South Africa
Jason A. Turner
Cheetah Conservation Fund, Otjiwarongo, 9000, Namibia
Laurie Marker
Veterinary Genetics Laboratory, Faculty of Veterinary Science, University of Pretoria, Onderstepoort, 0110, South Africa
Cindy Harper & Susan M. Miller
Department of Nature Conservation, Tshwane University of Technology, Private Bag X680, Pretoria, 0001, South Africa
Susan M. Miller
Ukutula Lodge & Lion Centre, NorthWest Province Brits, 0250, South Africa
Wilhelm Jacobs
Institute of Environmental Sciences (CML), Leiden University, PO Box 9518, Leiden, 2300, RA, The Netherlands
Laura D. Bertola
Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, 305-806, Republic of Korea
Seungwoo Hwang
Department of Life Science, Sangmyung University, Seoul, 110-743, Republic of Korea
Chang-Bae Kim
Department of Biology, Sungshin Women’s University, Seoul, 142-732, Republic of Korea
Sangtae Kim
Department of Molecular Biology and Genetics, School of Biology and Biotechnology, National University of Mongolia, Ulaanbaatar, 210646A, Mongolia
Damdin Bayarlkhagva
National Science Museum, Daejeon, 305-705, Republic of Korea
Woon Kee Paek
CHA Cancer Institute, CHA University, Seoul, 135-081, Republic of Korea
Seong-Jin Kim
Oceanographic Center, Nova Southeastern University, Ft Lauderdale, Florida, 33004, USA
Stephen J. O’Brien
Department of Biology, University of Copenhagen, Copenhagen, DK-2200, Denmark
Jun Wang
King Abdulaziz University, Jeddah, 21589, Saudi Arabia
Jun Wang
Advanced Institutes of Convergence Technology Nano Science and Technology, Suwon, 443-270, Republic of Korea
Jong Bhak
Department of Transdisciplinary Studies, Program in Nano Science and Technology, Seoul National University, Suwon, 443-270, Republic of Korea
Jong Bhak

Authors

Yun Sung Cho
View author publications
You can also search for this author in PubMed Google Scholar
Li Hu
View author publications
You can also search for this author in PubMed Google Scholar
Haolong Hou
View author publications
You can also search for this author in PubMed Google Scholar
Hang Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jiaohui Xu
View author publications
You can also search for this author in PubMed Google Scholar
Soowhan Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Sukhun Oh
View author publications
You can also search for this author in PubMed Google Scholar
Hak-Min Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sungwoong Jho
View author publications
You can also search for this author in PubMed Google Scholar
Sangsoo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Young-Ah Shin
View author publications
You can also search for this author in PubMed Google Scholar
Byung Chul Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hyunmin Kim
View author publications
You can also search for this author in PubMed Google Scholar
Chang-uk Kim
View author publications
You can also search for this author in PubMed Google Scholar
Shu-Jin Luo
View author publications
You can also search for this author in PubMed Google Scholar
Warren E. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Klaus-Peter Koepfli
View author publications
You can also search for this author in PubMed Google Scholar
Anne Schmidt-Küntzel
View author publications
You can also search for this author in PubMed Google Scholar
Jason A. Turner
View author publications
You can also search for this author in PubMed Google Scholar
Laurie Marker
View author publications
You can also search for this author in PubMed Google Scholar
Cindy Harper
View author publications
You can also search for this author in PubMed Google Scholar
Susan M. Miller
View author publications
You can also search for this author in PubMed Google Scholar
Wilhelm Jacobs
View author publications
You can also search for this author in PubMed Google Scholar
Laura D. Bertola
View author publications
You can also search for this author in PubMed Google Scholar
Tae Hyung Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sunghoon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Qian Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-Ju Jung
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Xu
View author publications
You can also search for this author in PubMed Google Scholar
Priyvrat Gadhvi
View author publications
You can also search for this author in PubMed Google Scholar
Pengwei Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yingqi Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Yadan Luo
View author publications
You can also search for this author in PubMed Google Scholar
Shengkai Pan
View author publications
You can also search for this author in PubMed Google Scholar
Caiyun Gou
View author publications
You can also search for this author in PubMed Google Scholar
Xiuhui Chu
View author publications
You can also search for this author in PubMed Google Scholar
Jilin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Sanyang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jing He
View author publications
You can also search for this author in PubMed Google Scholar
Ying Chen
View author publications
You can also search for this author in PubMed Google Scholar
Linfeng Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yulan Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jiaju He
View author publications
You can also search for this author in PubMed Google Scholar
Sha Liu
View author publications
You can also search for this author in PubMed Google Scholar
Junyi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chul Hong Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hwanjong Kwak
View author publications
You can also search for this author in PubMed Google Scholar
Jong-Soo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Seungwoo Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Junsu Ko
View author publications
You can also search for this author in PubMed Google Scholar
Chang-Bae Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sangtae Kim
View author publications
You can also search for this author in PubMed Google Scholar
Damdin Bayarlkhagva
View author publications
You can also search for this author in PubMed Google Scholar
Woon Kee Paek
View author publications
You can also search for this author in PubMed Google Scholar
Seong-Jin Kim
View author publications
You can also search for this author in PubMed Google Scholar
Stephen J. O’Brien
View author publications
You can also search for this author in PubMed Google Scholar
Jun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jong Bhak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.S.C., L.H., H.H., H.L. and J.X. contributed equally to this work. The tiger genome project was initiated by J.B., B.C.K., H.L., T.H.K., S. Lee., Sangtae K., C.-B.K., S.-J.K., W.K.P. and Jun W. Library construction, sequencing, bioinformatics data processing and analysing genetic variation data were carried out by L.H., J.X., H.K., S.J., Y.-A.S., Q.Z., H.K., C.-U.K., Y.X., Y.L., S.P., C.G., X.C., J.Z., Sanyang L., Jing H., Y.C., L.Y., Y.Y., Jiaju H., S.-J.L., Junyi .W., J.-S.K., H.-M.K., Y.S.C., T.H.K., Sangsoo K., J.B. and Jun W. Several big cat genome re-sequencing were performed by H.-J.J. and C.H.K. PCR validations were performed by H.-J.J. and Hwanjong K., S. Kwon., S.O., W.K.P., H.L. and D.B. provided samples, advice and associated information. Y.S.C., L.H., H.H., S.-J.L, W.J., K.-P.K. X.X., P.G., S.H., J.K., C.-B.K., H.L., Sangtae K., Sangsoo K., S.J.O., Jun W., and J.B. wrote, edited and revised the manuscript. A.S.-K. conducted overall project coordination, data analysis and presentation of lion DNA analysis and overseeing/planning of laboratory work done at CCF (Cheetah Conservation Fund). J.A.T. carried out the lion DNA project initiation for samples of the Tsau Conservancy and the Johannesburg Zoo. L.M. did support of the laboratory work done at CCF. C.H. performed communication and project initiation at VGL (Veterinary Genetics Laboratory) and oversaw the laboratory work done at VGL, sampling and funding. S.M.M. at VGL did planning and execution of the laboratory work, pedigree verificationand reference samples of other populations. Wilhelm J. did pedigree information processing and sampling at the Ukutula Lodge. L.B. did the sample processing and laboratory work of Ouwehands Dierenpark and PLE171.

Corresponding authors

Correspondence to Stephen J. O’Brien, Jun Wang or Jong Bhak.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Tables, Figures, and Methods

Supplementary Figures S1-S18, Supplementary Tables S1-S59, Supplementary Methods and Supplementary References (PDF 3081 kb)

Supplementary Data 1

Positively selected genes identified in the tiger genome (XLS 44 kb)

Supplementary Data 2

GO categories over-represented among genes predicted to be under positive selection in tiger (XLS 28 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Cho, Y., Hu, L., Hou, H. et al. The tiger genome and comparative analysis with lion and snow leopard genomes. Nat Commun 4, 2433 (2013). https://doi.org/10.1038/ncomms3433

Download citation

Received: 02 May 2013
Accepted: 13 August 2013
Published: 17 September 2013
DOI: https://doi.org/10.1038/ncomms3433

This article is cited by

Maternal dominance contributes to subgenome differentiation in allopolyploid fishes
- Min-Rui-Xuan Xu
- Zhen-Yang Liao
- Hua-Hao Zhang
Nature Communications (2023)
Ancient DNA reveals genetic admixture in China during tiger evolution
- Xin Sun
- Yue-Chen Liu
- Shu-Jin Luo
Nature Ecology & Evolution (2023)
Population genomics reveals moderate genetic differentiation between populations of endangered Forest Musk Deer located in Shaanxi and Sichuan
- Gang Liu
- Bao-Feng Zhang
- De-Fu Hu
BMC Genomics (2022)
Whole genome sequencing and the application of a SNP panel reveal primary evolutionary lineages and genomic variation in the lion (Panthera leo)
- L. D. Bertola
- M. Vermaat
- K. Vrieling
BMC Genomics (2022)
Exploiting genomic synteny in Felidae: cross-species genome alignments and SNV discovery can aid conservation management
- Georgina Samaha
- Claire M. Wade
- Bianca Haase
BMC Genomics (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.