Introduction

Microsatellite markers, the short tandem repeats (STRs) evenly distributed in whole genome, are exploited widely for forensic medicine and used to search for identifying genes associated with diseases. The dinucleotide-repeat microsatellites are more useful than other STR markers because of their high heterozygosities and even distribution in the genome. Such markers can be analyzed by using a very small amount of DNA through amplification with polymerase chain reaction and subsequent electrophoresis. In fact, the technology of analyzing microsatellite polymorphism is adapted to automation and high throughput. Due to their highly polymorphism traits, microsatellite markers have been heavily utilized in areas such as mapping disease loci using linkage analyses, loss of heterozygosity analyses, and association studies.

Genotypic information, including allelic frequency and heterozygosity, is available through various databases. However, most of these data were derived from analyses using Caucasian samples. The data obtained from Asian population currently available are: 358 dinucleotide-repeat marker loci validated in 32 normal Japanese individuals (Yamane-Tanaka et al. 1998), 406 microsatellite marker loci validated in 64 unrelated Japanese subjects (Ikari et al. 2002), and 285 autosomal microsatellite marker loci validated in Chinese Singaporeans (Tan et al. 2002). The significant differences of allelic distributions of certain markers have been found between Caucasians and Japanese (Yamane et al. 1997). Since allelic information often differs between ethnic groups and could drastically influence the statistical analysis, it is not certain that currently available information on Caucasians, Japanese, and Singaporeans could be applied to study the disease genes in other ethnic groups.

Theoretically, multiallelic markers always have more power to detect linkage disequilibrium than biallelic markers due to their multiple alleles and greater heterozygosity (Ott and Rabinowitz 1997; Chapman and Wijsman 1998). Existing sets of microsatellite data, if sufficiently dense, may be useful for estimating the density of additional markers needed for screening a region for disease alleles in an association analysis (Schulze et al. 2002). Therefore, gathering information on additional markers in the Asian population is important for mapping disease genes in Asia by linkage analyses or association studies.

The STR genotyping laboratory was established in March 2002 by Vita Genomics, Inc. To date, we have completed several whole-genome screening projects and have built a comprehensive database for microsatellite markers. Here we report allelic distribution, allelic frequency, and heterozygosities of 811 dinucleotide-repeat markers among 190 Taiwanese subjects. The data indicate a high degree of genetic similarity between Taiwanese and Japanese. Our results represent an essential database for mapping genes associated with diseases not only in Taiwanese but also in Japanese populations.

Materials and methods

DNA preparation

Genomic DNAs were extracted from 190 unrelated Han-Chinese children living in Taiwan. We collected written informed consent from all subjects recruited. The informed-consent documents for all underage participants were signed by their guardians. DNA was isolated from blood samples using QIAamp DNA blood kit (QIAGEN, Valencia, CA, USA) according to the manufacturer’s instructions. The isolated genomic DNA was quality checked by agarose gel electrophoresis analysis, quantity determined spectrophotometrically, and stored at −80°C.

Microsatellite genotyping

Genotyping was performed using the ABI PRISM Linkage Mapping Sets HD-5 (811 markers, 5 cM average resolution, version 2.5, Applied Biosystems, Foster City, CA, USA). Each marker set included a fluorescence-labeled forward primer and a tailing reverse primer. PCR amplifications were carried out according to the manufacturer’s instructions, and the PCR products were separated on ABI 3700 DNA analyzers (Applied Biosystems). The use of GeneScan 500 LIZ (Applied Biosystems) as the internal-size standard assists polymorphic fragment length calling and allows more accurate allele calling and unambiguous comparison of data across experimental conditions. Genotypes were initially scored using GeneScan and Genotyper (Applied Biosystems) software and were then verified independently by three individuals.

Genotypic data derived from Caucasians for each marker were obtained from the Foundation Jean DAUSSET-CEPH (http://www.cephb.fr/). A control individual was derived from CEPH 1347-02. There are differences in the allele size of the STR markers between the CEPH database and our genotyping result because the pair primers of each locus had been redesigned by ABI and the mobility of PCR products were different among the variant sequencers (i.e., ABI373, 377, 3100, 3700, etc.). We corrected the differences comparing control data obtained from CEPH database and our genotyping data. Possible error in genotyping or allele calling was checked by comparing the expected number against the observed number of heterozygosities by the Hardy–Weinberg (HW) equilibrium test utilizing the Popgene (version 1.31) program (ftp://ftp.microsoft.com/Softlib/MSLFILES/HPGL.EXE).

Results and discussion

Genotyping for all 811 markers was performed on 190 unrelated Han-Chinese children living in Taiwan. We experienced no ambiguous genotypes or unsuccessful amplifications. Based on the results of the chi-square test and likelihood ratio test in the Popgene program of the HW test, most loci we tested met the criteria of HW equilibrium. Nevertheless, there were 38 of 811 loci, or less than 5% of total markers tested, which did not meet HW equilibrium criteria. This could be due to the combination factors of random sampling, original genetic variations of the subjects, and the presence of low-frequency alleles. Heterozygosities were calculated at each of the autosomes and X-chromosome marker loci using 190 individuals and 64 females, respectively. Table 1 summarize heterozygosity of the 811 marker loci for the Taiwanese and Caucasian populations. In general, overall heterozygosities of the markers from Taiwanese contained higher similarity to those from Japanese than to those from Caucasians. Among these 811 markers, there were 400 markers identical to the ones used by Ikari et al. (2002). Between Taiwanese and Japanese, there were only two out of 400 loci with differences in heterozygosities greater than or equal to 20%. On the contrary, the differences between Taiwanese and Caucasians were much higher at the rate of 81 out of 811 markers when compared to the Caucasian data from the manufacture. The significant differences between Japanese and Caucasians have also been observed at the rate of 33 out of 400 markers. The level of heterozygosity in the Taiwanese population ranged from 0.13 (DXS8088) to 0.94 (D6S291). As shown in Table 2, 50 marker loci revealed heterozygosity lower than 50% compared to only five marker loci in Caucasians. The X chromosome has lower average heterozygosity in the Taiwanese population, as shown in Table 3. Although heterozygosity was higher overall in Caucasians, a few markers showed higher heterozygosity in Taiwanese. For example, at D6S291, we observed 94% heterozygosity in Taiwanese as compared to 73% reported in Caucasians.

Table 1 Heterozygosities of the 811 tested short tandem repeat (STR) markers (part 1). Chr chromosome, Het heterozygosity, Twn Taiwanese, Cau Caucasian
Table 2 Comparison of heterozygosities at 811 test loci between Taiwanese and Caucasians. ABI Applied Biosystems, Inc., Foster City, CA, USA
Table 3 Comparison of averaged ethnic heterozygosities between autosome and sex chromosome. VITA Vita Genomics, Inc., Wugu Shiang, Taiwan, ABI Applied Biosystems, Inc., Foster City, CA, USA

In conclusion, we constructed a comprehensive database of allelic frequencies and heterozygosities of all 811 microsatellite markers evaluated in this study. We believe that our data provide a useful tool for mapping genes associated with diseases in the Taiwanese population. Moreover, data similarity of the 400 markers between Japanese and Taiwanese suggests a potentially powerful application of this database in the Japanese population. Data are freely available to all researchers and can be accessed at the Web site of Vita Genomics, Inc. (http://www.vitagenomics.com/str.html).