The complete mitogenome of Himalayan black bear (Ursus thibetanus laniger) from Indian Himalayan region was assembled following the modified approach of mitochondrial baiting and mapping using the next-generation sequencing reads. The complete mitogenome was of 16,556 bp long, consisted of 37 genes that contained 13 protein-coding genes, 22 tRNAs, 2 rRNAs and 1 control region. The complete base composition was 31.33% A, 15.24% G, 25.45%C, and 27.98%T and gene arrangement was similar to the other sub-species of Asiatic black bear. The relative synonymous codon usage analysis revealed the maximum abundance of Isoleucine, Tyrosine, Leucine and Threonine. The assembled mitogenome of U. t. laniger exhibited 99% similarity with the mitogenomes of Himalayan black bear available from Nepal and Tibetan Plateau-Himalaya region. The findings of the present study has proven low depth sequencing data, adequate and highly efficient in rapid recovering the mitochondrial genome by overcoming the conventional strategies of obtaining long-range PCR and subsequently drawing phylogenetic inferences.
The Asiatic black bear (Ursus thibetanus) with wide range distribution, consists of seven well recognized subspecies, i.e. Japanese black bear (U. t. japonicus) in Japan, Ussuri black bear (U. t. ussuricus) in far-east Russia, northeast China, and Korea, Formosan black bear (U. t. formosanus) in Taiwan, Indochinese/Sichuan black bear (U. t. mupinensis) in Southwest China, Baluchistan black bear (U. t. gedrosianus) in South Pakistan and Iran, Tibetan black bear (U. t. thibetanus) in the eastern Himalayas and southeast Asia, and Himalayan black bear (U. t. laniger) in the western Himalayas1. Among the seven sub-species of Asiatic black bear, the Himalayan black bear (henceforth, HBB) is distributed in between 1200 and 3300 m asl all along the forested habitats of the Himalayas and hills of northeastern states of India covering an area of about 270,000 km2 with an estimated population of 5400 to 6700 divides2,3. A small population of HBB is patchily distributed across Pakistan, northwest India, and likely northeast India and Nepal4. In India, HBB has experienced several challenges including habitat loss, population decline due to hunting/poaching for pelts, paws and gall bladders and retaliatory killing in the response to Human–Bear Conflicts3,5,6. Considering the increased threats and species vulnerability in wild, HBB is listed as Vulnerable in the Red list of IUCN1 and categorized under the Schedule-I of Indian Wildlife (Protection) Act 1972. Complete mitochondrial genomes of six sub-species of Asiatic black bear except the U. t. gedrosianus have been sequenced using long range PCR strategy7,8,9,10. However, no study has provided the detailed genome organization and comparative assessment for gene arrangements and structural consistency in the t-RNA model, important in variety of cellular processes controlling species life history traits.
Further, Next Generation Sequencing (NGS), which rapidly captures the broad spectrum of mutations and has dramatically revolutionized DNA sequencing11, and has been popularized to address questions in the field of molecular ecology12, phylogeographic13, population genetics14 and phylogenetic studies15. Most studies in bears have made use of the conventional strategy of combining long-range PCR with subsequent primer walking for sequencing the complete mitogenomes7,8,9,10,16. However, conventional sequencing is tedious and challenging in particular for optimizing long range PCRs. In contrast, revolution in NGS technology has made considerable decrease in cost and increase in throughput (millions of short sequencing reads) and accuracy17,18. Several studies have demonstrated the application of NGS in drawing the phylogenetic inferences, genome organization and comparative assessment among the sympatric species by mapping and assembly the complete mitogenomes from low depth sequencing reads19,20,21.
Therefore, to overcome the unwieldy process of conventional sanger sequencing, we assembled the complete mitogenome directly from low depth NGS reads following a modified approach of mitochondrial baiting and mapping reported earlier by Hahn et al.22. We demonstrated organization of complete mitogenome of HBB for the first time and presented its structure consistency of tRNA model with the other sub-species of Asiatic black bear. We also testified the assembled mitogenome of U. t. laniger in re-construction of bear phylogeny among the other black bear subspecies.
Results and discussion
A total of 3.73 GB data of ~ 1.6 × coverage was obtained from Illumina HiSeq 2500 platform which yielded 12,418,314 reads. With reference-based assembly, we obtained the longest contig of 16, 556 bp length that represented the complete mitochondrial genome of U. t. laniger and submitted to GenBank (accession no. MN935768). The observed total AT and GC contenst were 59.3% and 40.7% (Fig. 1a), and mitogenome showed positive AT skewness (+ 0.057), indicating that adenine bases occurred more frequently than the thymine, whereas GC-skew was negative, − 0.25. The assembled mitogenome encoded 37 genes, of which, 13 were PCGs, 22 tRNAs, 2rRNAs and one control region. The arrangement and distribution of genes were similar to the other mammalian species23. The overall nucleotide composition was: 31.33% -A, 25.45% -C, 15.24% -G, 27.98%- T. In exception to ND6 and eight tRNA genes (trnQ, trnA, trnN, trnC, trnY, trnS2, trnE and trnP), most genes were encoded on the heavy strand (Fig. 2). The five pairs of overlapping regions in mitogenome were observed among trnV/rrnL, trnI/trnQ, atp8/atp6, nd4l/nd4, and trnT/trnP. The overlapping regions ranged from − 1 to − 34 bp. The smallest overlapping region was located in between trnV/trnL and trnT/trnP (1 bp) whereas the longest overlapping was in between ATP8/ATP6 (40 bp). Besides, 20 intergenic spacers were observed between the mitochondrial regions ranging from 1 to 33 bp length; the longest space was found between trnN/trnC (Table 1).
PCGs and rRNAs
All 13 PCGs had ATG start codon except nd2 and nd6 which encoded by ATA and TCC, respectively. The total length of PCGs was 11,316 bp which shared 68.3% of complete mitogenome (Table 2). The average base composition in PCGs were 30.1%- A, 28.6%- T, 14.2%- G and 27.2%- C. The abundance of AT (%) was higher than GC (%).Comparative analysis of U. t. laniger with the other subspecies of Asiatic black bear and Ursus americanus exhibited relatively high adenine and cytosine contents than thymine and guanine. All the PCGs showed positive AT skew except for the genes cox1, cox3, nd3 and nd4l whereas GC skew showed negative skewness for all the genes (Fig. 1b). The PCGs region consisted of twelve heavy strands and one light strand as commonly found in other vertebrate species24,25,26. The PCGs region consisted of seven NADH dehydrogenases, three cytochrome c oxidases, two ATPases and one cytochrome b genes.
The mtDNA ribosomal region is known to be highly conserved and widely used for phylogenies of higher and middle category level, such as phyla, family and genera24,26. The length of 12S rRNA and 16S rRNA genes was 966 bp and 12,582 bp, respectively. The 12S rRNA gene was positioned between the tRNA-Phe and tRNA-Val and 16S rRNA gene was positioned between tRNA-Val and tRNA-Leu2. Similar to PCGs, the AT skewness was positive (0.208) and the GC skewness was negative − 0.086) (Table 2) and the total AT content of rRNA was 59.4% which was in correspondence with other sub-species of Asiatic black bear (Table 2).
Transfer RNAs and control region
The length of the tRNA was 1508 bp, overall AT and GC content was 64.1% and 35.9% respectively. The average AT and GC skewness values for tRNAs were 0.026 and 0.083, respectively (Table 3). The results exhibited 21 tRNAs can fold into cloverleaf structure except for tRNAser which lacks the dihydrouridine arm (Fig. 3). The tRNA genes length varied from 59 to 75 bp and out of 22 tRNAs, fourteen were located on heavy strand and eight were on the light strand (Table 1).
The control region was located between trnP and trnF and the length was 1109 bp in size and contributed to 6.7% of the whole mitogenome with containing a microsatellite repeat, (AT)4 and seven 10 bp tandem repeats (Table S1). The A + T composition was 58.7%, higher than that of G + C content. The AT and GC skewness values were negative, − 0.069 and − 0.142, respectively (Table 2).
RSCU and reconstruction of bear phylogeny
The relative synonymous codon usage showed the highest utilization of codons of UAC, UUG, AUC and ACC among all the PCGs (Fig. 4). The RSCU analysis revealed the most occurred amino acids in protein-coding genes of U. t. laniger mitochondrial genome were Ile, Tyr, Leu, and Thr with 449, 482, 439 and 419 codon frequencies, respectively (Table S2, Fig. 5), whereas Met, Cys and Asp were less abundant. We did not find any difference in the RSCU of the U. t. laniger when compared with the other subspecies of Asiatic black bear. The phylogenetic analysis showed that the two mitogenomes i.e. MG066704.2 and MH281753.1 shared 99% similarity with the assembled mitogenome of U.t. laniger. These two mitogenomes were sequenced from Nepal10 and Tibetan Plateau-Himalaya region27 which are the known distribution ranges of the U.t. laniger, exhibiting an obvious trend of clustering in phylogeny with strong bootstrap support (Fig. 6). All mitogenomes of different sub-species of Asiatic black bear i.e. U. t. japonicus, U. t. formosanus, U. t. ussuricus, U. t. thibetanus and U. t. mupinensis, formed sister branches to the U.t.laniger. In an earlier study conducted on the sequencing of complete mitogenome of Japanese black bear, U. t. laniger was not included in the phylogeny due to non-availability of the sequences28.
Further, tRNAs secondary structure of U .t. laniger were compared with the other taxon of Ursids whose complete mitogenomes were available. The comparison showed more than 90% structure similarity with MH281753.1 (99%), MG066704.2 (99%), EF076773.1 (95%), EF19666.1 (93%) with z-score value of more than 10.0 and lesser similarity with EF196665.1 (39%) and EF212882.1 (46%) (Table S3) having low structure stability which was also evident from the phylogenetic analysis. We found no functional change in wobble position of anticodon (UAA) except in Ailuropoda melanoleuca (AAG). The pairwise genetic distances matrix, calculated based on Kimura 2-parameter model indicated that assembled mitogenome of U. t. laniger showed highest genetic differentiation with U. t. mupinensis (0.019) and lowest with the subspecies of Asiatic black bear sequenced from Nepal and Tibetan Plateau-Himalaya region (0.001), expectedly the HBB, U. t. laniger within the species of U. thibetanus (Table S4).
Mitogenome analysis is imperative to make inferences on species phylogenies and resolving species divergence at different taxonomic levels29,30. In the present study, size of the complete mitogenome of HBB (U.t. laniger) was found to be 16,556 bp in length and showed similar gene order as found in other sub-species of Asiatic Black bears. Further, the tRNA secondary structure comparative analysis revealed no functional change in wobble position of anticodon except in Ailuropoda melanoleuca. Similar to the tRNA secondary structure, PCGs and RSCU structural comparative analysis, we did not find any change among the different subspecies of Asiatic black bear.No structural differences in tRNA, PCGs and RSCU among different subspecies of the Asiatic Black bear indicated evolutionary conserved nature of the mitochondrial genes. Bayesian tree showed distinct clusters, species wise paraphyletic clades formed, where all the six subspecies of Asiatic black bears formed sister branches and the phylogenetic relationships were congruent with the tRNA structure similarity with the other available ursids species. The findings of the present study demonstrate the detailed workflow in rapid recovery and assembly the complete mitogenome of HBB (U.t. laniger) from the low depth sequencing data. A close similarity (99%) of HBB from Indian Himalayan region with U.t. laniger from Nepal and Tibetan Plateau-Himalaya region suggested its distribution in large area and open the scope for transboundary research among range countries for population level information. The study also generates opportunities to overcome the conventional strategies of obtaining long-range PCR and subsequently drawing phylogenetic inferences. The complete mitogenome reported in the present study is expected to allow for further genomics studies of the ursidae species and would be useful for conservation genetics.
Sample collection, DNA extraction and library preparation
We collected a small tissue portion, approx. 500 mg from a HBB carcass during the field surveys in the district of Uttarkashi of the State Uttarakhand, India. Total genomic DNA was isolated using Qiagen DNeasy Blood and Tissue Kit (Qiagen, Germany) according to the manufacturer’s instructions. The NGS was outsourced to the Xcelris Labs Pvt. Ltd. Ahmedabad, India. Approximately, 200 ng genomic DNA was sheared using Covaris S2 sonicator (Covaris, Woburn, Massachusetts, USA) to generate fragment of read length 2 × 150 bp PE. The TruSeq DNA Library Preparation kit (https://support.illumina.com/downloads/truseq) was used for the construction of the paired-end library (8 lanes) with standard protocols. The resultant library was sequenced using Illumina HiSeq 2500 (2 × 150 base paired-end reads) (Illumina, USA) platform which yielded ~ 12 million reads.
Quality check and reference-based assembly
Quality screening of raw reads was done using FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and reads with low quality (Q < 20) and shorters (< 50 bp) were filtered out using NGS QC toolkit (https://www.nipgr.res.in/ngsqctoolkit.html). Usable reads were mapped against reference genome (MH281753) using bwa-aln (0.7.17) and then the fishing reads were grouped into extended reads (blocks) and the resultant contig was re-mapped with the filtered reads in order to increase the correctness of assembly using CLC genomics workbench version 12.0.3 (https://www.qiagenbioinformatics.com/products/clc-genomics-workbench/) with default parameters One of the longest contigs that represented the assembled complete mitochondrial genome of HBB, was thus generated (Fig. 7).
Genome characterization and comparative analysis
The circular representation of the generated mitogenome was viewed using CGView Server (https://stothard.afns.ualberta.ca/cgview_server/)31. The 22 tRNA genes were verified using tRNAscan-SE software32 using the mammalian mitochondrial genetic code under the default mode. The gene arrangement and their order were verified using MITOS online server (https://mitos.bioinf.uni-leipzig.de). The overlapping regions and intergenic spacers were counted manually using Microsoft Excel (2007). The start and stop codon of Protein Coding Genes (PCGs) were checked through Open Reading Frame Finder (https://www.ncbi.nlm.nih.gov/orffinder/) web tool. The base skewness was calculated using AT skew = [A − T]/[A + T], GC skew = [G − C]/[G + C]33 and Short Sequence repeats (SSRs) were screened using MISA-web software34 and tandem repeats were identified using Tandem Repeat Finder (https://tandem.bu.edu/trf/trf.html) web tool35. AT-GC content and its skewness of mitogenome U. t. laniger sequences were compared with other sub-species of black bear. The comparative analysis of Relative Synonymous Codon Usage (RSCU) and codon distribution of U. t. laniger sequences and other sub-species of black bear were calculated using MEGA X36. Comparative analysis in the consistency of the secondary structure of tRNA of U. thibetanus with other available complete mitogenome of Ursidae family was done using web-beagle (https://beagle.bio.uniroma2.it/) aligner37.
Genetic distance and phylogenetic re-construction
We also downloaded 17 mitogenomes, i.e. one each from the six subspecies of Asiatic black bear -U.t.ussuricus, U.t. mupinensis, U.t. thibetanus, U.t. formosanus, U.t.laniger, U.t.japonicus, two mitogenomes of American black bear (Ursus americanus), one each from brown bear (Ursus arctos), polar bear (Ursus maritimus), sun bear (Helarctos malayanus), Sloth bear (Melursus ursinus), Spectable bear (Tremarctos ornatus), Giant panda (Ailuropoda melanoleuca), and three mitogenomes of unassigned subspecies of Asiatic black bear (Table 3). The pair-wise genetic distances were estimated using MEGA X36 among the different subspecies of Asiatic black bear and the other taxon of Ursids. Bayesian-based phylogeny among the mentioned taxon of Ursids was reconstructed using BEAUti v 1.6.1 and BEAST v.1.10.438 considering Ailuropoda melanoleuca as an out-group. We applied the best fit model HKY selected by Model test 3.639 with BIC criteria. For the molecular clock rate, we used a normal prior divergence as used by Wayne et al.40 by placing a standard deviation on the rate equals to 10% of the mean to account for variation and uncertainty in the rate. Dating analyses were performed for 20 million generations while sampling every 1000th tree, and the first 10% of trees sampled were treated as burn-in, and visualized in Figtree v 1.4.441.
Garshelis, D. & Steinmetz, R. Ursus thibetanus. The IUCN Red List of Threatened Species 2016. https://dx.doi.org/Downloaded on 17 January 2020 (2016).
Parter, S. H. The Book of Indian Animal (Bombay Natural History Society and Oxford University Press, India, 1980).
Sathyakumar, S. & Choudhury, A. Distribution and status of asiatic black bear in India. J. Bomb. Nat. Hist. Soci. 104, 316–323 (2007).
Pocock, R. I. The Fauna of British India, Including Ceylon and Burma, Mammalia Vol. 2 (Taylor and Francis, London, 1941).
Sathyakumar, S., Kaul, R., Ashraf, N. V. K., Mookerjee, A. & Menon, V. National Bear Conservation and Welfare Action Plan (Ministry of Environment and Forest, Wildlife Institute of India and Wildlife Trust of India, 2012).
Charoo, S.A., Sharma, L.K. & Sathyakumar, S. Asiatic Black Bear—Human Conflicts around Dachigam National Park, Kashmir. Technical Report. Wildlife Institute of India, Dehradun. 29 (2009).
Hou, W. R. et al. A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis). Int. J. Biol. Sci. 3, 85–90 (2007).
Yu, L. et al. Analysis of complete mitochondrial genome sequences increases phylogenetic resolution of bears (Ursidae), a mammalian family that experienced rapid speciation. BMC Evol. Biol. 7, 198 (2007).
Hwang, D. S. et al. A comprehensive analysis of three Asiatic black bear mitochondrial genomes (subspecies ussuricus, formosanus and mupinensis), with emphasis on the complete mtDNA sequence of Ursus thibetanus ussuricus (Ursidae). Mitochondrial. DNA 19, 418–429 (2008).
Kadariya, R. et al. High genetic diversity and distinct ancient lineage of Asiatic black bears revealed by non-invasive surveys in the Annapurna Conservation Area Nepal. PLoS ONE 13, 0207622 (2018).
Shendure, J. & Ji, H. Next-generation DNA sequencing. Nat. Biotechnol. 26, 1135–1145 (2008).
Ekblom, R. & Galindo, J. Applications of next generation sequencing in molecular ecology of non-model organisms. Heredity (Edinb) 107, 1–15 (2011).
Edwards, S. V., Shultz, A. J. & Campbell-Staton, S. C. Next-generation sequencing and the expanding domain of phylogeography. Folia Zool. 64, 187–206 (2015).
Natesh, M. et al. Conservation priorities for endangered Indian tigers through a genomic lens. Sci. Rep. 7, 9614 (2017).
Song, N., Cai, W. & Li, H. Deep-level phylogeny of Cicadomorpha inferred from mitochondrial genomes sequenced by NGS. Sci. Rep. 7, 10429 (2017).
Delisle, I. & Strobeck, C. Conserved primers for rapid sequencing of the complete mitochondrial genome from carnivores, applied to three species of bears. Mol. Biol. Evol. 19, 357–361 (2002).
Van Dijk, E., Auger, H., Jaszczyszyn, Y. & Thermes, C. T. years of next-generation sequencing technology. Trends Genet. 30(9), 418–426 (2014).
Heather, J. & Chain, B. The sequence of sequencers: the history of sequencing DNA. Genomics 107(1), 1–8 (2016).
Wang, S., Wang, B., Wang, F. & Wu, Z. Complete mitochondrial genome of Gallus domesticus (Galliformes: Phasianidae). Mitochondrial. DNA A DNA Mapp. Seq. Anal. 27(2), 978–979 (2016).
Zhou, M., Yu, J., Li, J., Ouyang, B. & Yang, J. The complete mitochondrial genome of Budorcas taxicolor tibetana (Artiodactyla: Bovidae) and comparison with other Caprinae species: Insight into the phylogeny of the genus Budorcas Int. J. Biol. Macromol. 121, 223–232 (2019).
Kamalakkannan, R. et al. The complete mitochondrial genome of Indian gaur, Bos gaurus and its phylogenetic implications. Sci. Rep. 10, 11936. https://doi.org/10.1038/s41598-020-68724-6 (2020).
Hahn, C., Bachmann, L. & Chevreux, B. Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads – a baiting and iterative mapping approach. Nucleic Acids Res. 41, 129 (2013).
Boore, J. L. Animal mitochondrial genomes. Nucleic Acids Res. 27, 1767–1780 (1999).
Feng, H., Feng, C., Wang, L. & Huang, Y. Complete mitochondrial genome of the golden takin (Budorcas taxicolor bedfordi). Mitochondrial. DNA 1, 186–188 (2016).
Kumar, A. et al. Sequencing and characterization of the complete mitochondrial genome of Mishmi takin (Budorcas taxi color taxicolor) and comparison with the other Caprinae species. Int. J. Biol. Macromol. 137, 87–94 (2019).
Sarvani, R. K. et al. Characterization of the complete mitogenome of Indian Mouse Deer, Moschiola indica (Artiodactyla: Tragulidae) and its evolutionary significance. Sci. Rep. 8, 2697 (2018).
Lan, T. et al. Evolutionary history of enigmatic bears in the Tibetan Plateau-Himalaya region and the identity of the yeti. Proc. R. Soc. B 284, 20171804 (2017).
Jiagi, W. et al. Phylogeographic and demographic analysis of the Asian Black Bear (Ursus thibetanus) based on Mitochondrial DNA. PLoS ONE 10, e0136398 (2015).
Timmermans, M. J. et al. Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics. Nucleic Acids Res. 38, e197 (2010).
Cabrera-Brandt, M. A. & Gaitan-Espitia, J. D. Phylogenetic analysis of the complete mitogenome sequence of the raspberry weevil Aegorhinus superciliosus (Coleoptera: Curculionidae), supports monophyly of the tribe Aterpini. Gene 571, 205–211 (2015).
Grant, J. R. & Stothard, P. The CGView Server: a comparative genomics tool for circular 423 genomes. Nucleic Acids Res. 36, 181–184 (2008).
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Perna, N. T. & Kocher, T. D. Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J. Mol. Evol. 41, 353–358 (1995).
Beier, S., Thiel, T., Munch, T., Scholz, U. & Mascher, M. MISA-web: a web server for microsatellite prediction. Bioinformatics 33, 2583–2585 (2017).
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 36, 573–580 (1999).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. Mega X: molecular evolutionary genetics analysis across computing platform. Mol. Biol. Evol. 35, 1547–1549 (2018).
Mattei, E., Pietrosanto, M., Ferree, F. & Citterich, M. H. Web-Beagle: a web server for the alignment of RNA secondary structures. Nuc Acids Res. 43, 493–497 (2015).
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, 1–5 (2018).
Nylander, J.A.A. MrModelTest Version 2. Programme Distributed by the Author. Evolutionary Biology Centre, Uppsala University (2004).
Wayne, R. K., Van, V. B. & O’Brien, S. J. Molecular distance and divergence time in carnivores and primates. Mol. Biol. Evol. 8, 297–319 (1991).
Rambaut, A. FigTree, Version1.4.4. Available at: https://tree.bio.ed.ac.uk/software/figtree/. (2018).
Authors acknowledge forest officials—Uttarakhand Forest Department for providing necessary permission and logistic support to undertake field surveys. The study was funded under the financial support received from National Mission on Himalayan Studies (NMHS; Grant No. NMHS/2017-18/LG09/02), Ministry of Environment of Forest and Climate Change (MoEF&CC), Govt. of India. The third author is supported by the Science and Engineering Research Board, DST by national post doctoral scheme (Grant No. PDF_2017_002910_2018-19) New Delhi.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Bit, A., Thakur, M., Singh, S.K. et al. Assembling mitogenome of Himalayan Black Bear (U. t. laniger) from low depth reads and its application in drawing phylogenetic inferences. Sci Rep 11, 730 (2021). https://doi.org/10.1038/s41598-020-76872-y