Allelic frequencies of 22 short tandem repeat loci and tri-allelic patterns of Penta D and TPOX identified in a Gabonese population

Short tandem repeats (STRs) are repeating DNA sequences used in forensic human identity testing and the diagnosis of aneuploidies. Many STRs, such as Penta D and TPOX, are used routinely for paternity tests, but these tests are not widely used in sub-Saharan Africa. In this study we recruited individuals from Gabonese families seeking a paternity test. After DNA extraction from buccal swabs, we genotyped samples using a panel of 22 STRs. A total of 115 unrelated subjects from 39 families were included. Allele frequencies of the 22 STR loci were determined in unrelated Gabonese subjects. The most polymorphic loci were D21S11 (16 alleles) and FGA (17 alleles), while the D3S1358 and TH01 loci were less polymorphic, with five alleles each. Deviation from Hardy–Weinberg equilibrium was observed for the TPOX, D3S1358, CSFPO and D7S820 loci. We report tri-allelic patterns that indicate aneuploidies at a combined frequency of 4% (4/115), with 3% for Penta D (1/35) and 3% for TPOX (3/102). Furthermore, we identified a new tri-allelic genotype, 5-8-16, for the Penta D locus located on chromosome 21 in a healthy subject. In addition, we observed three tri-allelic variants of TPOX, located on chromosome 2, in healthy subjects, namely 8-10-11, 8-9-10, and 8-8-10. Our study revealed unsuspected polymorphic variations in Penta D and TPOX for the first time in Gabon, raising several questions about chromosomal disorders. Further population genetics studies are needed in Gabon to better characterize these variations, both qualitatively and quantitatively.

TPOX is a simple tetranucleotide repeat (AATG) located in the tenth intron of the human thyroid peroxidase gene on chromosome 2 (2p25.3). TPOX alleles range in size from 4 to 18 repeats, and 22 tri-allelic patterns are reported in STRbase 4. The tri-allelic pattern 8-10-11 is the most common 4.
In 2004, Clayton et al. distinguished two types of tri-allelic patterns: tri-allelic Type 1 occurs when two peaks have a different height from the third, and tri-allelic Type 2 occurs when all three peaks have a similar height 5. More recently, Picanço et al. (2015) identified three tri-allelic Type 2 subcategories: Type 2-A (three peaks of the same height), Type 2-B (one peak with a 2:3 height ratio and one with a 1:3 height ratio) and Type 2-C (one peak with a 3:3 height ratio) 6.
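As a rough illustration of the Type 1 versus Type 2 distinction described above, the following Python sketch classifies a three-peak pattern from its peak heights. The function name, the input format and the 20% tolerance used to call peaks "similar" are assumptions made for illustration only; they are not taken from Clayton et al. or Picanço et al., and the sketch does not attempt the Type 2 subcategories.

```python
def classify_triallelic(peak_heights, tolerance=0.20):
    """Classify a three-peak pattern as 'Type 1' or 'Type 2'.

    peak_heights: three positive peak heights (for example, in RFU).
    tolerance: assumed maximum relative spread between the highest and
               lowest peak for the three peaks to count as similar.
    """
    if len(peak_heights) != 3:
        raise ValueError("expected exactly three peaks")
    if min(peak_heights) <= 0:
        raise ValueError("peak heights must be positive")

    highest, lowest = max(peak_heights), min(peak_heights)

    # Type 2: all three peaks have a similar height.
    if (highest - lowest) / highest <= tolerance:
        return "Type 2"

    # Otherwise the peaks are unbalanced, as described for Type 1.
    return "Type 1"


if __name__ == "__main__":
    print(classify_triallelic([1000, 980, 1020]))
    print(classify_triallelic([500, 520, 1000]))
```

In practice the threshold would have to be validated against the genotyping platform in use; the value here is only a placeholder.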
In this study, we determined the allele frequencies of 22 STR loci in unrelated Gabonese subjects, and we report, for the first time in the Gabonese population, a novel tri-allelic pattern at the Penta D locus and tri-allelic patterns at the TPOX locus.

Allele frequencies
We screened 115 unrelated subjects from 39 families for paternity testing at DNA-LAB Gabon, located in Libreville. The cohort included 42 children, 39 presumed fathers and 34 mothers.
Of the 22 STRs tested, eleven were highly polymorphic, while the other loci had fewer than ten alleles (Table 1). The most polymorphic loci were D21S11 (16 alleles) and FGA (17 alleles), while the D3S1358 and TH01 loci were less polymorphic, with only five alleles each. Deviations from Hardy-Weinberg equilibrium were observed at the TPOX, D3S1358, CSFPO and D7S820 loci.
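To make the idea of a deviation from Hardy-Weinberg equilibrium concrete, the sketch below computes a chi-square goodness-of-fit statistic comparing observed genotype counts with those expected from the allele frequencies. This is a simplified illustration, not the test used in this study: the function name and input format are assumptions, only two-allele genotypes are handled (tri-allelic genotypes would have to be excluded first), and for highly polymorphic loci with small samples an exact test is generally preferable.

```python
from collections import Counter
from itertools import combinations_with_replacement


def hwe_chi_square(genotypes):
    """Return (chi-square statistic, degrees of freedom) for one locus.

    genotypes: list of two-allele genotypes, e.g. [(8, 11), (8, 8)].
    """
    n = len(genotypes)

    # Allele frequencies from allele counts (2n alleles in total).
    allele_counts = Counter(a for g in genotypes for a in g)
    freqs = {a: c / (2 * n) for a, c in allele_counts.items()}

    # Observed genotype counts, ignoring allele order.
    observed = Counter(tuple(sorted(g)) for g in genotypes)

    statistic = 0.0
    for a, b in combinations_with_replacement(sorted(freqs), 2):
        # Expected proportion: p^2 for homozygotes, 2pq for heterozygotes.
        prob = freqs[a] ** 2 if a == b else 2 * freqs[a] * freqs[b]
        expected = n * prob
        statistic += (observed[(a, b)] - expected) ** 2 / expected

    k = len(freqs)
    return statistic, k * (k - 1) // 2


if __name__ == "__main__":
    sample = [(8, 11), (8, 11), (8, 11), (8, 11)]  # hypothetical data
    print(hwe_chi_square(sample))
```

A large statistic relative to the chi-square distribution with the returned degrees of freedom points to a departure from equilibrium, for example through an excess of homozygotes.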
Comparing 10 STRs, the allelic frequencies observed in our study were not statistically different from those of the Gabonese samples in a previous study published in 2002 (Table 2) 11. Furthermore, comparison of the frequencies of 13 STRs between African-Americans from California (USA) 15 and the Gabonese in the present study also revealed no significant differences (Table 3).
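The comparisons in Tables 2 and 3 use the Wilcoxon signed rank test with continuity correction. The sketch below shows how such a paired test can be computed with a normal approximation, using only the Python standard library. It is an illustrative implementation under stated assumptions: the function name is hypothetical, zero differences are dropped, ties receive average ranks, and the software actually used in the study is not specified in the text.

```python
import math


def wilcoxon_signed_rank(x, y):
    """Two-sided paired Wilcoxon signed rank test (normal approximation
    with continuity correction). Returns (V, p_value), where V is the
    sum of ranks of the positive differences."""
    diffs = [a - b for a, b in zip(x, y) if a != b]  # drop zero differences
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank absolute differences; tied values share their average rank.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1

    v = sum(r for r, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48  # tie-corrected

    # Continuity correction: shrink the deviation from the mean by 0.5.
    deviation = v - mean
    correction = 0.5 if deviation > 0 else (-0.5 if deviation < 0 else 0.0)
    z = (deviation - correction) / math.sqrt(var)

    p_value = min(1.0, math.erfc(abs(z) / math.sqrt(2)))
    return v, p_value


if __name__ == "__main__":
    # Hypothetical paired allele frequencies from two studies.
    ours = [0.10, 0.25, 0.30, 0.20, 0.15]
    theirs = [0.12, 0.22, 0.31, 0.21, 0.14]
    print(wilcoxon_signed_rank(ours, theirs))
```

Here each pair would be the frequency of the same allele in the two populations being compared; a p-value above the chosen significance level corresponds to the "no significant difference" outcome reported above.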

Tri-allelic patterns
We found tri-allelic patterns in 8% of recruited families (3/39 families) and 4% of recruited subjects (4/115 subjects). In all tri-allelic cases, presumed fathers were the biological fathers of the tested children, and we did not observe any physical abnormalities that could suggest a genetic disorder in any member of the recruited families.

Penta D family case
A phenotypically normal family was screened for parentage. Penta D genotypes were 2.2-10 for the presumed father, 5-8-16 for the mother and 5-10 for the child. This new tri-allelic Penta D genotype, 5-8-16, was observed at a frequency of 3% in tested subjects (1/35 subjects) (Fig. 1). This genotype has never been reported in Sub-Saharan Africa.

TPOX family cases
Tri-allelic TPOX genotypes were observed at a frequency of 3% (3/102 subjects tested). In the second family, which comprised a man with genotype 9-11, a woman with genotype 8-8-10, and their male child with genotype 8-10-11, we found two different types of tri-allelic TPOX genotypes. In this family, the tri-allelic Type 2-B genotype, 8-8-10, was observed in the healthy mother (Fig. 2). Type 2-A was found in the healthy child of this family (8-10-11, Fig. 2) and in the healthy child of the third family (8-9-10; figure not shown). The third family comprised a father with genotype 8-8, a mother with genotype 9-9, and their female child with genotype 8-9-10 (figure not shown).
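The family genotypes listed above can be screened for alleles that neither parent carries, which is one simple way to flag a candidate de novo allele. The following Python sketch is a minimal illustration; the function name is hypothetical, and the check only compares allele labels, without considering copy number, peak heights or which parent transmitted which allele.

```python
def unexplained_child_alleles(father, mother, child):
    """Return the child's alleles that are absent from both parents."""
    parental = set(father) | set(mother)
    return sorted(a for a in set(child) if a not in parental)


if __name__ == "__main__":
    # TPOX genotypes as reported for the second and third families.
    print(unexplained_child_alleles((9, 11), (8, 8, 10), (8, 10, 11)))
    print(unexplained_child_alleles((8, 8), (9, 9), (8, 9, 10)))
```

With the genotypes reported here, the second family returns an empty list (every allele of the child is present in a parent), whereas the third family returns allele 10, which is carried by neither parent.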

Discussion
We investigated more loci than a previous study of the Gabonese population conducted twenty years ago 11. Allele frequencies were similar between the two studies, but we found specific alleles that did not appear in the previous survey. These were allele 14 of D2S1338, alleles 15  11 . The polymorphic loci FGA, D18S51, and D21S11 showed marked differences in which alleles were present. Locus D8S1179 was the most polymorphic in our study, with three new alleles compared to the older study, one of which (allele 10) is involved in tri-allelic inheritance in India 12.
Power of discrimination analysis suggests that all 22 STRs may be promising markers for paternity testing (PD between 0.973 and 0.993). One of the four probability tests showing significant departures from HWE (CSFPO locus) was also significant in Rwandans (Hutu) and Angolans 13,14. The frequency of homozygotes at the CSFPO locus (9/33 = 27%) indicates that this deviation is due to homozygote excess, as in Rwandans 13. The allele frequencies are also similar to those for descendants of African people in California 15. These comparisons suggest that these loci are stable and are good markers for genetic identification.
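For readers unfamiliar with the power of discrimination (PD), one common definition is one minus the sum of squared genotype frequencies at a locus. The sketch below applies that definition; the text does not state which formula or software was used to obtain the reported PD values, so this is an assumption, and the function name and input format are hypothetical.

```python
from collections import Counter


def power_of_discrimination(genotypes):
    """PD = 1 - sum of squared genotype frequencies at one locus.

    genotypes: list of genotypes, e.g. [(8, 11), (8, 8)]; allele order
    within a genotype is ignored.
    """
    n = len(genotypes)
    counts = Counter(tuple(sorted(g)) for g in genotypes)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


if __name__ == "__main__":
    sample = [(8, 8), (8, 11), (11, 8), (11, 11)]  # hypothetical data
    print(power_of_discrimination(sample))
```

The closer PD is to 1, the less likely it is that two randomly chosen individuals share the same genotype at that locus.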
We report a tri-allelic Penta D pattern (5-8-16) that has not been described previously. Penta D allele 5, the only allele transmitted from the mother to the child in family #1, occurs at a frequency of 11% in our study and 4% in Africa 16. Of the other Penta D alleles in this family, allele 8 is less frequent in our study population (6%) than in Africa more generally (15.5%) 16. Penta D allele 16, found at a frequency of 2% in our study, has not yet been detected elsewhere in Africa but exists in other populations, such as those of the Middle East, at a frequency of 2.5% 16. Further population studies of the Penta D locus should be conducted in Sub-Saharan Africa to determine the types of variation and their frequencies. In our study, the phenotypically normal mother has a tri-allelic pattern of Penta D.
The tri-allelic TPOX genotypes we observed were of Type 2, which is due to a constitutional chromosomal rearrangement, whereas tri-allelic Type 1 is probably due to a mutation in an early somatic cell 5. The tri-allelic Type 2 pattern of TPOX is present at a very low frequency in various human populations, ranging from 0.003 to 0.2% 18. The highest frequency of tri-allelic TPOX genotypes observed (2.4% or 165/6827 people) was in indigenous black populations from South Africa 19. Although our sample size was small, the observed frequency (3% or 3/102 people investigated) is consistent with observations in African populations 19. Moreover, the presence of tri-allelic genotypes of TPOX in Gabon (Central Africa) supports the hypothesis that the TPOX variants may have existed before the expansion of Bantus from Central Africa 19,20.
In the second family, a mother with three TPOX alleles transmitted two of them to her son, supporting the hypothesis that the extra allele comes from the X chromosome, as proposed by two studies 6,19, or from chromosome 2 with a potential impact of chromosomal rearrangement on the activity of Y-sperm 21.
In the third family, the pattern is entirely different. Each parent transmitted an allele to their daughter (allele 8 from the mother and allele 9 from the father), but the daughter shows an extra de novo allele, allele 10. Some authors have suggested that the additional allele in TPOX is allele 10 6,19. However, other authors have shown that it is allele 11 in Chinese and Korean populations 21. Allele 11 results from a strand slippage mutation of an extra allele 10 of TPOX originating from Bantu groups in Africa 21. Our results show that allele 10 is an additional allele, as observed at the TPOX locus of the daughter in the third family. Furthermore, this allele was found in other tri-allelic TPOX genotypes in our study: a healthy mother and her healthy child (from the second family).

Limitations of the study and future directions
Due to a lack of funding, we could not sequence the extra allele 10 of TPOX found in Family #3. This was the main limitation of this work. Future studies could sequence the de novo allele in this girl or extend the population genetic study to her entire family.

Conclusion
The allele frequencies of the 22 STRs we observed were similar to those in other Black populations. These findings suggest that these STRs are good identification markers that can also reveal aneuploidies in subjects without symptoms. The presence of a tri-allelic genotype with a new allele at the Penta D locus on chromosome 21 suggests that more in-depth studies of this locus are needed in sub-Saharan Africa. The presence of three subtypes (8-8-10, 8-10-11, and 8-9-10) of the tri-allelic variants of TPOX in our small sample suggests that an extended study of genetic polymorphism is needed in Central Africa, where the Bantu peoples originate.

Methods
Data were collected during paternity tests on indigenous Gabonese people. As such analyses are not yet routine in Gabon, we collaborated with a partner laboratory (Labor Für DNA Analytik/Germany) for the complete analysis after DNA extraction. DNA was extracted from buccal swabs and prepared with the NucleoSpin Tissue kit following the manufacturer's protocol (Macherey Nagel, Freiburg, Germany). This form clearly states that the signatory parties agreed to the use of the results for research and publications.
To assess allelic frequencies in this study, we only considered unrelated subjects, who were defined in two ways. For two-parent families, all children were excluded from the group of unrelated subjects. For single-parent families, we included in this group children without proven parentage with the presumed father.
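Once the group of unrelated subjects is defined, allele frequencies at a locus follow from simple allele counting. The sketch below illustrates the calculation; the function name and input format are assumptions, the input is assumed to contain only two-allele genotypes of unrelated subjects (homozygotes list the same allele twice), and how tri-allelic genotypes were counted in this study is not stated in the text.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Return {allele: frequency} for one locus by allele counting.

    genotypes: list of two-allele genotypes of unrelated subjects,
    e.g. [(8, 8), (8, 11)].
    """
    counts = Counter(a for g in genotypes for a in g)
    total = sum(counts.values())
    return {a: counts[a] / total for a in sorted(counts)}


if __name__ == "__main__":
    sample = [(8, 8), (8, 11), (9, 11)]  # hypothetical data
    print(allele_frequencies(sample))
```

The number of distinct keys in the returned dictionary is the allele count used above to describe how polymorphic a locus is.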
The other STR on chromosome 21 in the panel of STR loci used for paternity testing, D21S11, showed no allelic abnormality. The tested offspring of the mother with the tri-allelic Penta D pattern inherited only two alleles, one from each parent. A tri-allelic pattern of Penta D indicates a genetic abnormality on chromosome 21. Trisomy 21 (also known as Down syndrome) is the most common chromosomal anomaly and corresponds to the presence of an extra chromosome 21, in whole or in part. It can be due to various chromosomal aberrations: free trisomy, translocations, mosaicism, critical region duplication, and other structural rearrangements of chromosome 21 17. Mosaicism and partial trisomy 21 are more challenging to diagnose because the karyotype is often normal 17, which shows the importance of studies of the Penta D STR.

Table 2.
Twenty-year comparison of allele frequencies of STR loci in Gabonese subjects using the Wilcoxon signed rank test with continuity correction.

Table 3.
Comparison of allele frequencies of STR loci between the Gabonese subjects of our study and African-American subjects from California using the Wilcoxon signed rank test with continuity correction.