Introduction

Genetic analysis of human mitochondrial DNA (mtDNA) is an indispensable part of population genetic studies all over the world. MtDNA hypervariable segments are a research hotspot because of their molecular properties, inheritance pattern, population-specific polymorphisms, rapid evolutionary rate and low recombination rate1,2,3,4,5,6. Meanwhile, the frequencies of certain mtDNA sequence variations in a given population play an important role in forensic genetics. Biomaterials from crime scenes are affected by various factors, such as human activities, the environment and long intervals after the crime. For highly degraded biomaterials, including teeth and dated bloodstains that contain far less nuclear DNA than usual, the analysis of mtDNA provides a particularly helpful approach to specimen source identification7,8. Hence, the genetic analysis of mtDNA is widely used for forensic purposes, especially for maternal lineage studies and parentage identification. In addition, mtDNA sequence types serve as mitochondrial markers that are strongly correlated with geographic origin and benefit anthropological research9,10.

The Xibe are an ethnic minority group officially recognized by China, with a population of 190,481 according to the 2010 China population census. The Xibe people mostly live in Jilin (bordering North Korea), Xinjiang and Liaoning. The Xibe people in northeast China (Jilin and Liaoning provinces) and northwest China (Xinjiang Uygur Autonomous Region) show distinct characteristics owing to their different geographical locations. Chinese is the first language of the northeastern Xibe people, while the Xinjiang Xibe group speaks a southern language that is a mixture of Chinese and languages used in Xinjiang, including Kazakh, Russian and Uygur11. In recent years, many studies focusing on different autosomal short tandem repeat (STR) loci in the Xibe group have been reported12,13,14. In this study, we collected data on 54 mtDNA variants in the Xibe ethnic group in order to study their genetic polymorphisms for forensic science and to infer the genetic relationships between the Xibe and other groups.

Materials and Methods

Samples

In total, 137 blood samples were collected from unrelated healthy donors of the Xibe ethnic group in Ili, Xinjiang Uygur Autonomous Region. The volunteers were randomly selected from the Xibe group, and their families were required to have lived there for at least three generations. Every participant provided written informed consent, and blood samples were obtained according to standard procedures. The study was conducted in accordance with the human and ethical research principles of Xi’an Jiaotong University Health Science Center, China and approved by the ethics committee of Xi’an Jiaotong University Health Science Center.

DNA extraction, amplification and genotyping

DNA was extracted using the Chelex-100 method as described in a previous protocol15. The 60 variants were co-amplified in 5 fluorescence-based multiplex PCR reactions using the Expressmarker mtDNA-SNP 60 kit (nt10398, 9 bp, 10873, 3010, 709, 7196, 12705, 3970, 13104, 10310, 5178, 13928, 6446, 8414, 8793, 8794, 15043, 16311, 16126, 16129, 8701, 8697, 4883, 10400, CA, 1719, 14668, 12811, 9824, 9123, 7028, 11719, 8584, 11251, 8020, 5460, 2706, 11215, 4216, 12372, 16362, 9698, 1541, 8684, 9477, 4491, 1811, 16316, 16319, 9545, 152, 14569, 8964, 10397, 3348, 4833, 7600, 5417, 5442, 15784; AGCU ScienTech Incorporation, Jiangsu, Wuxi). Briefly, each PCR amplification system (25 μl) contained 1 μl genomic DNA, 10 μl reaction mix, 5 μl primers, 1 μl Taq DNA polymerase (5 U) and 8 μl sdH2O. The cycling parameters were set according to the manufacturer’s protocol. A 1 μl aliquot of the PCR product was mixed with 0.5 μl AGCUMarkerSIZ-500 and 12 μl Hi-Di formamide. Electrophoresis was performed on the ABI Prism 3130XL Genetic Analyzer and fragment sizes were analyzed with GeneMapper ID v3.2.1 software (Applied Biosystems, USA). Control DNA 9947A was used as the positive control in our experiment.

Quality control

The multiplex allele-specific system was based on allele-specific primers, which can type single nucleotide polymorphisms (SNPs) effectively. The 3′ ends of the specific primers, which differed in length, were matched to the corresponding alleles. To avoid non-specific extension, a mismatched base was introduced at the third or another position from the 3′ end of each specific primer. Each common primer was labeled with one fluorescent dye (FAM, HEX or TAMRA in our study) for detection. An allelic ladder (known human alleles) and a positive control were used for comparison and typing. The PCR product was mixed with a size marker of known fragment lengths to determine DNA size (the fluorescently labeled primers, allelic ladder, positive control 9947A and size marker were provided in the kit). Furthermore, negative control and reagent blank extraction experiments were also carried out. All experimental procedures strictly followed the manufacturer’s instructions and internal laboratory standards to minimize errors.

Statistical analysis

The detection rate of this kit is affected by many factors, including the experience of the experimenters, the detection technology, the quality of the samples and the sensitivity of the primers. Owing to technical problems and genetic variation at these markers in the Xibe population, 4 loci did not generate clear genotypic results in all Xibe individuals. Moreover, the (CA)n and 9 bp variants have not been widely studied in different groups. Therefore, we used the data of the remaining 54 variants for further analysis (nt152, 709, 1541, 1719, 1811, 2706, 3010, 3348, 3970, 4216, 4491, 4833, 4883, 5178, 5417, 5442, 5460, 6446, 7028, 7196, 7600, 8020, 8414, 8584, 8684, 8964, 9123, 9477, 9545, 9698, 9824, 10310, 10397, 10398, 10400, 10873, 11215, 11251, 11719, 12372, 12705, 12811, 13104, 13928, 14569, 14668, 15043, 15784, 16126, 16129, 16311, 16316, 16319, 16362). The sequence data for the 54 mtDNA variants of the Xibe group were aligned with the revised Cambridge Reference Sequence (rCRS) of human mitochondrial DNA for statistical analysis16,17. The software DNAsp 5.0 and Mega 4.0 were employed to estimate the gene diversities and other genetic parameters in the Chinese Xibe group. Haplotype diversity18 is a measure of the uniqueness of a particular haplotype in a certain population. The haplotype diversity (H) is computed as:

$$H = \frac{n}{n-1}\left(1 - \sum_{i} x_i^{2}\right)$$

where $x_i$ is the (relative) frequency of the i-th haplotype in the sample and $n$ is the sample size. Nucleotide diversity (π)19 is the average number of nucleotide differences per site between two DNA sequences selected randomly from a given population:

$$\pi = \frac{n}{n-1}\sum_{i}\sum_{j} x_i x_j \pi_{ij}$$

where $x_i$ and $x_j$ are the frequencies of the i-th and the j-th sequence, respectively, $\pi_{ij}$ is the number of nucleotide differences per nucleotide site between the i-th and the j-th sequences, and $n$ is the number of samples.
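As a minimal sketch of the two diversity measures described above, the following Python functions assume Nei's estimators: H = n/(n − 1) × (1 − Σx_i²), and π taken as the mean per-site difference over all pairs of sampled sequences, which is equivalent to n/(n − 1) × ΣΣ x_i x_j π_ij. The function names and the toy sequences are hypothetical; the study itself used DNAsp 5.0, not this code.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(haplotypes):
    """Haplotype diversity, assuming Nei's estimator n/(n-1) * (1 - sum(x_i^2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sequences are required")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def nucleotide_diversity(haplotypes):
    """Mean number of differences per site over all pairs of sampled sequences."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sequences are required")
    length = len(haplotypes[0])
    if length == 0 or any(len(h) != length for h in haplotypes):
        raise ValueError("sequences must be non-empty and of equal length")
    total_diff = sum(
        sum(a != b for a, b in zip(h1, h2))
        for h1, h2 in combinations(haplotypes, 2)
    )
    n_pairs = n * (n - 1) / 2
    return total_diff / (n_pairs * length)


# Toy input (hypothetical, not study data): one string per individual,
# one character per typed variant position.
toy_sample = ["AAAA", "AAAA", "AAAT", "AATT"]
h_value = haplotype_diversity(toy_sample)
pi_value = nucleotide_diversity(toy_sample)
```

Each individual is represented by one string of typed alleles, so identical strings count as the same haplotype.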

The probability that two random individuals from a population share the same mtDNA type was calculated as $P = \sum_{i} x_i^{2}$, and the discrimination power as $DP = 1 - \sum_{i} x_i^{2}$, where $x_i$ is the frequency of the i-th mtDNA haplotype. The discrimination power evaluates the probability that two unrelated random samples from a given population have different mtDNA types.
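The two quantities defined above follow directly from haplotype counts. The sketch below is illustrative only; the function names and the toy haplotype labels are hypothetical.

```python
from collections import Counter


def match_probability(haplotypes):
    """P = sum of squared haplotype frequencies (random match probability)."""
    n = len(haplotypes)
    if n == 0:
        raise ValueError("empty sample")
    return sum((count / n) ** 2 for count in Counter(haplotypes).values())


def discrimination_power(haplotypes):
    """DP = 1 - P, the chance that two random individuals differ."""
    return 1.0 - match_probability(haplotypes)


# Toy labels (hypothetical): one haplotype label per individual.
toy_labels = ["h1", "h1", "h2", "h3"]
p_match = match_probability(toy_labels)
dp = discrimination_power(toy_labels)
```

Because DP = 1 − P, a discrimination power of 0.9699 corresponds to a random match probability of about 0.0301.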

In addition, we used Arlequin 3.0 to calculate pairwise Fst values between the Xibe ethnic group and other groups using the 54 shared variants (nt152, nt16126, nt16129, nt16311, nt16316, nt16319 and nt16362 were in the control region and the remaining variants were in the coding region). On this basis, principal component analysis (PCA) and phylogenetic tree reconstruction were performed with SPSS 19.0 and Mega 4.0, respectively, to study the Xibe genetic background. P values below 0.05 were considered statistically significant.
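To give a feel for what a pairwise Fst expresses, the sketch below uses a simple frequency-based form, Fst = (H_T − H_S)/H_T, with the two populations weighted equally. This is an assumption made for illustration: it is not the estimator implemented in Arlequin 3.0, it ignores sample-size corrections and it provides no significance test. The function names and toy frequencies are hypothetical.

```python
def gene_diversity(freqs):
    """Expected diversity 1 - sum(x_i^2) for a dict of haplotype frequencies."""
    return 1.0 - sum(f * f for f in freqs.values())


def pairwise_fst(freqs_a, freqs_b):
    """Illustrative Fst = (H_T - H_S) / H_T for two equally weighted populations."""
    labels = set(freqs_a) | set(freqs_b)
    # Pooled frequencies: plain average of the two populations (assumption).
    pooled = {
        h: (freqs_a.get(h, 0.0) + freqs_b.get(h, 0.0)) / 2.0 for h in labels
    }
    h_total = gene_diversity(pooled)
    h_within = (gene_diversity(freqs_a) + gene_diversity(freqs_b)) / 2.0
    if h_total == 0.0:
        return 0.0  # both populations fixed for the same haplotype
    return (h_total - h_within) / h_total


# Toy frequencies (hypothetical, not study data).
pop_1 = {"h1": 0.5, "h2": 0.5}
pop_2 = {"h1": 0.5, "h2": 0.5}
fst_same = pairwise_fst(pop_1, pop_2)
fst_fixed = pairwise_fst({"h1": 1.0}, {"h2": 1.0})
```

Values near 0 indicate populations with similar haplotype frequencies, and values near 1 indicate populations that share almost no haplotypes.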

Results

Genetic parameters analysis

The 54 selected mtDNA variants were successfully detected in all samples of the Chinese Xibe ethnic minority group, and the allele frequencies are shown in Table 1. The other 6 variants (nt8697, 8701, 8793, 8794, (CA)n and 9 bp) were excluded because of ambiguous genotyping results (nt8697, 8701, 8793 and 8794) or limited application in other groups ((CA)n and 9 bp). The mutation rates in the Xibe ethnic group and other groups20,21,22,23,24,25,26 are listed in Supplemental Table 1 (S1). In the studied group, single nucleotide transitions were the most common polymorphism (83.93%). At nt5178, nt7196 and nt13928, the polymorphisms were caused by transversions. At nt9824, a transition and a transversion coexisted (A/T/C), and no polymorphisms were observed at nt3348, nt4491, nt6446, nt8684 and nt13104.
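The distinction between transitions and transversions used above can be expressed in a few lines: a transition exchanges two purines (A, G) or two pyrimidines (C, T), and a transversion exchanges a purine for a pyrimidine. The function name below is hypothetical.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(base_1, base_2):
    """Return 'transition' or 'transversion' for a pair of differing bases."""
    base_1, base_2 = base_1.upper(), base_2.upper()
    valid = PURINES | PYRIMIDINES
    if base_1 not in valid or base_2 not in valid:
        raise ValueError("bases must be one of A, C, G, T")
    if base_1 == base_2:
        raise ValueError("identical bases are not a substitution")
    pair = {base_1, base_2}
    # Same chemical class on both sides means a transition.
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


# The three alleles reported at nt9824 (A/T/C), compared pairwise.
nt9824_pairs = {
    ("C", "T"): classify_substitution("C", "T"),
    ("A", "T"): classify_substitution("A", "T"),
    ("A", "C"): classify_substitution("A", "C"),
}
```

Among the three alleles at nt9824, the C/T exchange is a transition, whereas A/T and A/C are transversions, which is why both types are present at that position.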

Table 1 The allele frequencies of 54 mtDNA SNP loci in the Chinese Xibe ethnic minority group.

Generally, mtDNA variants with a relative frequency between 0.20 and 0.80 are considered polymorphic SNPs27. Moreover, as shown in S1, the variants 709 and 14569 were specific to the Xibe group when compared with the Guangdong Han20, Liaoning Han24, Qingdao Han24, Xinjiang Han24, Wuhan Han24 and Yunnan Han24 groups. Similarly, mutations at variants 14668 and 15043 were observed in the Xibe group but not in the Daur21, Mongolian21, Korean21 and Ewenki21 groups. Therefore, these loci could be highly polymorphic and helpful for forensic applications in the Xibe ethnic group.
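The frequency criterion above amounts to a simple filter. The sketch below assumes the bounds are inclusive, which the text does not specify; the function name, site labels and frequencies are hypothetical and are not values from Table 1.

```python
def polymorphic_sites(variant_freqs, low=0.20, high=0.80):
    """Return the sites whose relative variant frequency lies in [low, high]."""
    return sorted(
        site for site, freq in variant_freqs.items() if low <= freq <= high
    )


# Hypothetical frequencies for illustration only.
toy_freqs = {"site_a": 0.50, "site_b": 0.05, "site_c": 0.20, "site_d": 0.93}
selected = polymorphic_sites(toy_freqs)
```

With these toy values only `site_a` and `site_c` pass the filter.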

As mentioned above, there were 51 variable positions (transitions and transversions) in the studied Xibe group, and these variations defined 64 different mtDNA haplotypes (excluding (CA)n and the 9 bp deletion), which are listed in Table 2. In addition, the two variable sites ((CA)n and the 9 bp deletion) defined two additional haplotypes, as shown in Table 2. That is, 61-a and 61-b are the same haplotype if (CA)n and the 9 bp deletion are not taken into account. They differ in the number of (CA) repeats, which is 5 in 61-a and 4 in 61-b.

Table 2 Haplotypes and variant positions in the Chinese Xibe ethnic group according to the revised Cambridge Reference Sequence (rCRS).

As shown in Table 2, the haplotype comprising the variants 2706G, 3010A, 4883T, 5178A, 7028T, 8414T, 10398G, 10400T, 10873C, 11719A, 12705T, 14668T, 15043A and 16362C (No. 61, 15/137) was the most common among all 64 haplotypes. Likewise, haplotype No. 61-a (variants: 2706G, 3010A, 4883T, 5178A, 7028T, 8414T, 10398G, 10400T, 10873C, 11719A, 12705T, 14668T, 15043A and 16362C; the number of (CA) repeats was 5 and the 9 bp was NORM) was the most frequent (12/137) in our study when (CA)n and the 9 bp deletion were taken into consideration. Unique haplotypes, each observed in only a single person, accounted for 27% (37/137) of the total subjects. In addition, the shared haplotypes are listed in Supplemental Table 2 (S2), which includes the haplotypes found in both Xibe and other groups or the haplotypes whose frequency in the Xibe group was higher than 0.02. All the listed haplotypes were specific to the Xibe group except for No. 14, No. 17, No. 30, No. 31, No. 32 and No. 61 (the numbering is consistent with Table 2). No. 61 was the most common haplotype in the Xibe ethnic group, whereas it was observed at a lower frequency or not at all in the other groups. Table 3 shows the genetic parameters of the studied mtDNA variants in the Chinese Xibe group. The haplotype diversity and the nucleotide diversity in the studied Xibe population were 0.9800 ± 0.004 and 0.1875, respectively. The DP value in the Chinese Xibe samples was 0.9699.

Table 3 Genetic parameters in the Chinese Xibe group (excluding the (CA)n and 9bp).

Principal component analysis

Fst and p values between Xibe and 18 other groups20,21,22,23,24,25,26 are shown in Table 4; the values below the diagonal are Fst values and those above the diagonal are p values. The PCA plot based on Fst values is shown in Fig. 1 and Supplemental Figs 1–3. All groups differed significantly from Hispanics and African Americans. Among the six Chinese Han groups (Qingdao Han, Wuhan Han, Guangdong Han, Liaoning Han, Yunnan Han and Xinjiang Han groups), no significant differences were observed except for the comparisons between Wuhan Han and Guangdong Han, Xinjiang Han and Yunnan Han groups. The results also demonstrated the relationships between Xibe and other groups. The Xibe group was closest to the Xinjiang Han group (Fst = 0.00276, p > 0.05) and farthest from Italians (Fst = 0.14807, p < 0.05). Meanwhile, the Xibe group also had a relatively close genetic distance to the Guangdong Han, Yanbian Korean, Liaoning Han and Daur groups. The PCA plot showed the same pattern: the Xibe, Guangdong Han, Liaoning Han and Yanbian Korean groups were in one cluster, whereas African Americans, Italians, Estonians and Caucasians were dispersed in the plot.

Table 4 Fst and P values between Xibe and 18 other groups.
Figure 1 The distribution of different groups analyzed by principal component analysis based on the Fst values between Xibe and 18 other groups.

Phylogenetic analysis

In the phylogenetic tree based on Xibe and 18 other groups (Fig. 2), Italians, Estonians and Caucasians formed one cluster and the remaining 16 groups formed another. As mentioned above, Xibe, Yanbian Korean and Xinjiang Han were closer to each other and were located on the same branch. Moreover, among the five groups (Xibe, Yanbian Korean, Xinjiang Han, Daur and Ewenki) located in the same sub-branch, Daur and Ewenki were on a branch separate from the other three groups. This indicates that the Xibe group was also close to the Daur and Ewenki groups. As shown in Fig. 2, the Liaoning Han, Yanbian Han, Qingdao Han and Mongolian groups formed a sub-branch, which suggests that these populations have a closer genetic relationship with one another.

Figure 2 The phylogenetic tree reconstructed by the neighbor-joining method.

Discussion

Mitochondrial DNA analysis is the only effective method that can be utilized for biomaterials containing infinitesimal nuclear DNA7,8. The analysis of mtDNA has irreplaceable value in forensic applications because of unique characteristics such as high sensitivity and maternal inheritance. Hence, the analysis of mtDNA is suitable for specimen source identification as well as for excluding the innocent.

The analysis of the relative frequencies revealed some characteristic features of the Chinese Xibe group, with some variants being highly polymorphic and others showing no polymorphism. Because of the different specific primers, experimental principles, operation methods and reaction conditions, we could not distinguish the complicated polymorphisms caused by substitution, insertion and deletion. Nevertheless, the analysis of these polymorphic variants in the Xibe group can still help to identify a perpetrator among suspects and improve the discrimination power in forensic cases. Meanwhile, haplotypes of mtDNA variants can be used as a new genetic marker for personal identification in the Chinese Xibe group. In addition, (CA)n is a short tandem repeat (STR) marker with a dinucleotide repeat (C and A) in the mitochondrial D-loop region (from nt00514 to nt00524 in the rCRS). The repeat number is polymorphic, is normally lower than 10 and has been related to tumorigenesis in a previous report28. The 9 bp sequence deletion is also a genetic marker in mtDNA, and its geographical distribution is correlated with the traces of human migration. Therefore, the analysis of (CA)n and the 9 bp deletion can improve the forensic power of SNPs and haplotypes.

The results of Fst, the PCA plot and the phylogenetic tree consistently indicated that the studied Xibe group had closer genetic relationships with the Xinjiang Han, Yanbian Korean and Daur groups. The branch composed of Xibe, Xinjiang Han and Yanbian Korean supports the view that the Xibe people dispersed across the country during the Jin Dynasty. Furthermore, the closer relationship between the Xibe, Daur and Ewenki groups (from Inner Mongolia21), which was confirmed by a previous report, reflects the relocation of Xibe people in 1700 A.D.14. In addition, the Xibe group was incorporated with the Daur after the Khorchin dedicated them to the Kangxi Emperor in exchange for silver in 1692. Intermarriage between different ethnic groups also inevitably occurred during the above-mentioned historical events. Hence, we inferred that the Xibe ethnic group probably had common ancestors with the Xinjiang Han and Yanbian Korean groups. In addition, the Daur and Ewenki groups are also likely to share an ancestor with the Xibe group or to have greatly influenced the Xibe during the development of these groups.

Concluding remarks

The selected 54 mtDNA variants were used to analyze the genetic polymorphism of the Xibe ethnic minority group from China. Among these variants, a total of 14 mtDNA SNP loci (nt152, nt709, nt3010, nt4883, nt5178, nt8414, nt10398, nt10400, nt10873, nt12705, nt14668, nt15043, nt16129 and nt16362) were found to be genetically polymorphic and can be used as valid genetic markers for forensic and population genetic applications. Meanwhile, 51 variable positions (transitions and transversions) defined 64 different mtDNA haplotypes (excluding (CA)n and the 9 bp deletion). The haplotype diversity was 0.9800 ± 0.004. Finally, we compared the Xibe group with other populations to infer their genetic structure and relationships. The results of Fst, PCA and phylogenetic analysis showed that the Xibe group had closer genetic relationships with Xinjiang Han and Yanbian Korean. However, more data should be collected to reconstruct the phylogenetic tree and to study the migration and evolution of human populations.

Additional Information

How to cite this article: Shen, C.-M. et al. Genetic polymorphisms of 54 mitochondrial DNA SNP loci in Chinese Xibe ethnic minority group. Sci. Rep. 7, 44407; doi: 10.1038/srep44407 (2017).
