Introduction

Effective use of genetic resources is important for improving the quality and yield of rice, an important staple crop for the world. Common wild rice (CWR, Oryza rufipogon Griff.), as the ancestor of cultivated rice (Oka, 1974), contains a high level of genetic diversity (GD) and represents a rich genetic resource. However, a great part of the genetic information in it has not been uncovered. Only 60–70% of its genetic variation has been found in cultivated rice (Sun et al., 2001). Its abundant genetic variation in factors such as disease resistance, yield and quality (Yuan et al., 1989; Xiao et al., 1996) should be of major relevance to breeders.

Genetic structure is the basis of management, research and utilization of germplasm resources (Waples, 1995) and is critical to ecological conservation and studies of evolutionary relationships among populations (Frankham, 2003). China is one of the largest centers of GD for CWR in the world (Wang et al., 2004). The genetic structure and diversity of Chinese CWR is of interest to the scientists and has been investigated using different markers and populations (Gao et al., 2000a, 2000b, 2002a, 2002b; Zhou et al., 2003; Song et al., 2003a). All of these studies indicated that Chinese CWR is rich in GD, with obvious and complicated population subdivisions. Although these studies contributed much to our understanding of its GD and structure, the samples were inadequate to measure fine-scale GD and genetic structure over the entire population. CWR grows in 113 counties spread across eight provinces in China, but even the most representative of these studies sampled only 21 natural populations from 21 counties across provinces (Gao et al., 2000a). As they used different samples, these studies also reached different conclusions about the primary factors affecting the genetic structure of CWR in China (Gao et al., 2000b, 2002a; Zhou et al., 2003; Cai et al., 2004).

In China, CWR grows in eight provinces: Guangdong, Guangxi, Hainan, Jiangxi, Hunan, Fujian, Yunnan and Taiwan. Between 1988 and 1993, 5571 accessions in 113 counties across eight provinces were collected (Sheng and Huang, 1991; Huang and Sheng, 1996). Most of these are now planted in the germplasm gardens of wild rice in Nanning, Guangxi province and Guangzhou, Guangdong province. To examine the fine-scale GD and genetic structure of such a large CWR population, it is essential to obtain a representative sample. The concept of core collection (Frankel, 1984), which aims to settle the conflict between genetic representation and population scale, could be used to choose the smallest representative population with the largest amount of diversity. In our study, the research accessions obtained from the primary core collection represent 90.5% of the diversity of the entire Chinese CWR (Yu et al., 2003).

Simple sequence repeat (SSR) markers have proved to be effective and appropriate tools for studying GD (Davierwala et al., 2000) and detecting population structure (Garris et al., 2005). The objectives of our study were: (1) to identify the genetic structure of CWR in China (2) to find out how the genetic structure of CWR was formed and (3) to determine the genetic relationships among CWR populations.

Materials and methods

Plant material

According to Yu et al. (2003), the primary core collection of CWR in China was based on phenotypic data. It contained 920 accessions; of these, 860 were collected as a logarithmic proportion from the provinces where they grew and by the use of a clustering method based on a simple matching coefficient. The remaining 60 accessions were special phenotypes collected subjectively. The primary core collection represented 90.5% of the diversity of 5571 CWR. Among the 889 accessions used in this study (Supplementary Table S1), 864 were taken from the primary core collection at the gardens of wild rice in Nanning and Guangzhou. (The remaining 56 accessions out of the 920 that make up the core collection do not exist at these gardens.) These 864 accessions represented 98 counties across six provinces: Hainan (88), Guangdong (268), Guangxi (256), Fujian (59), Hunan (107) and Jiangxi 110 (86). A further 25 accessions were collected from Jinghong (2) and Yuanjiang (23) in Yunnan province, making the total of 889. The accessions could be grouped as prostrate (445), slant (255), semierect (126) and erect (63) according to the growth habits. All the accessions were perennial types. To investigate whether Chinese CWR diverged into indica-like and japonica-like types, we sampled 410 accessions of cultivated rice (Oryza sativa L.) (Supplementary Table S2), including 260 Indica accessions and 150 Japonica accessions from the same seven provinces where CWR was sampled.

Genome DNA extraction and analysis with microsatellite markers

DNA from every accession was extracted from silica gel-dried leaf tissues using the CTAB (cationic detergent hexadecyltrimethylammonium bromide) method (Scott and Bendich, 1988). The 36 SSR loci (Supplementary Table S3) were randomly distributed over the 12 rice chromosomes. Amplification reactions were performed in a final volume of 15 ul comprising 0.9 unit Taq enzyme, 1.5 ul 10 × buffer, 22.2 mM MgCl2, 46 ng SSR primers, 1.8 mM dNTP and 10 ng total DNA. The PCR amplification program was: (1) pre-denature for 5 min at 95 °C, (2) denature for 0.5 min at 95 °C, (3) annealing for 1 min at 55 °C (annealing temperature varied with actual primers), (4) extending for 1.5 min at 72 °C, (5) repeating cycles 2–4 30 times and (6) final extension for 10 min at 72 °C. The amplified products were denatured at 95 °C for 5 min, then cooled on ice, and subsequently run on 8% denatured polyacrylamide gel at 70 W. One check was randomly designated from accessions with certain alleles. For the same marker, all runs after the first run included not only the samples but also the checks and a standard molecular weight marker—PUC19 DNA digested by MspI. When all the samples were completely run, all the checks were run with another standard molecular weight marker—10 bp DNA ladder from Invitrogen (Catalog No. 10821-015), and the molecular weight for each allele was estimated. All the gels were stained using the silver method (Bassam et al., 1991). In the case of null alleles in these species, PCR amplifications were repeated to exclude failed PCR reactions. In some accessions, there were more than two alleles per locus. In these cases, we amplified and ran them again, and then selected the stable alleles—those that occurred in both replications. If there were more than two stable alleles, we randomly selected two alleles to form the genotype.

Statistical analyses

The structural analysis was carried out using STRUCTURE version 2 (Pritchard et al., 2000; Falush et al., 2003; http://pritch.bsd.uchicago.edu), which implements a clustering method for inferring population structure using genotype data. We ran the simulation 10 times independent for each k (the number of partitions) value in the range 1–15, using the method allowing for the admixture, correlated allele frequencies and no prior population information, with burn-in length 10 000 and run length 10 000. The graphical display of the STRUCTURE results was generated using Distruct software (Rosenberg, 2002; http://www.cmb.usc.edu/noahr/distruct.html). Evanno et al. (2005) reported that in most cases, the estimated ‘log probability of data’ did not provide a correct estimation of cluster number (k-value), and argued that an ad hoc statistic ΔK based on the rate of change in the log probability of data between successive K-values could accurately detect true K. The suggested Δk=m(∣L(k+1)−2 L(k)+L(k−1)∣)/s[L(k)], where L(k) represents the kth LnP(D), m is to the mean of 10 runs and s their standard deviation. We used the method of Evanno et al. (2005) to estimate the number of populations. Regression analyses between genetic distance and geographic distance of CWR in different latitude–longitude sections were made using the inline applets of regression in Microsoft Excel. To investigate whether water systems have an effect on genetic relations among CWR populations, accessions from 35 counties with more than five accessions were collected in Guangdong and Guangxi. The UPGMA tree of 35 counties was constructed using PowerMarker version 3.25 (Liu and Muse, 2004; http://www.powermarker.net) based on Nei's genetic distances (Nei et al., 1983). Allele number, observed heterozygosity, biased GD, genotype number, polymorphism information content index and Nei's genetic distance (Nei et al., 1983) were all calculated in PowerMarker version 3.25. Using program HP-rare 1.0 (Kalinowski, 2005), the allelic richness (an estimator independent of the sample size; Hurlbert, 1971) of inferred populations was investigated by rarefaction methods. To investigate the directional differentiation of allele size among the inferred populations, we calculated the average standardized allele size of each CWR population using 17 SSR loci with stepwise mutation (Supplementary Table S4). A stepwise mutation index was calculated in PowerMarker version 3.25, and we treated those SSR loci with a stepwise mutation index higher than 0.9 as having stepwise mutation, according to Vigouroux et al. (2003). Their significances were assessed by t-test. Differences in allele frequency between indica and japonica were investigated using X2-test and G2-test. We used the terms, indica-specific alleles and japonica-specific alleles, to describe alleles with a significantly (P<0.0001) higher frequency in indica and those with a significantly (P<0.0001) higher frequency in japonica. Frequency distributions of specific alleles in inferred CWR populations, longitudinal regions (with intervals of 5°) and latitudinal regions (with intervals of 1°) were investigated using the t-test, and regression analyses were used to determine the frequency distribution along longitude and latitude. The index of genetic differentiation between indica and japonica for each SSR locus was calculated using the formula: D=(GDt−(GDi+GDj)/2)/GDt, where GDt represents biased GD of total cultivitated rice, Gdi and GDj represent those of indica and japonica, respectively (Supplementary Table S5). Using four SSR loci with D higher than 0.3, the haplotype frequencies in different populations were calculated. Differences in the significance of haplotype frequencies between indica and japonica were examined using the X2-test. We used the term indica-specific haplotypes to describe haplotyes with a significantly (P<0.05) higher frequency in indica, and japonica-specific haplotypes to describe those in reverse.

Results

Genetic structure of CWR in China

The STRUCTURE simulation demonstrated that the LnP(D) value showed no clear peak in K between 1 and 15 (Supplementary Figure S1). Thus, it was difficult to determine the true K (number of populations). The magnitude change of LnP(D) relative to the standard deviation, called ΔK by Evanno et al. (2005), showed the highest peak at two; and there were two smaller peaks at seven and nine (Figure 1). We also checked the structure patterns in 10 repeats of each k from two to nine (Supplementary Figure S2). A clear and stable population structure could be found at two, three and seven, where a major structure pattern could be found in at least half of the simulations. Figure 2 shows the membership of accessions to the populations identified by STRUCTURE, the growth habits as classified by Pang et al. (1995) and Wang et al. (2004) and the geographic origins at k=2 and k=7. The membership indicated that the inferred genetic structure of Chinese CWR was accorded with the geographic origins (Figures 2b and d), but not with the growth habits (Figures 2a and c). CWR could be grouped into two model-based populations (MB-populations) and seven model-based subpopulations (MB-subpopulations) at k=2 and k=7, respectively. Two MB-populations at k=2 were isolated by the Nanling mountains, which separate the south subtropical climatic area from the middle subtropical area. One of these MB-populations grew in the south subtropical area (with the exception of the subpopulations from Hainan Island, which is tropical) and could be called a south subtropical population (SSP); the other grew in the middle subtropical area and thus could be called a middle subtropical population (MSP). The SSP included five MB-subpopulations and covered five provinces: Hainan, Guangdong, Guangxi, Fujian and Yunnan (Figure 3). Three of the five MB-subpopulations contained most of the accessions from Hainan, Fujian and Yunnan and were temporarily denoted as HN, FJ and YN, respectively. Most of the accessions from Guangdong and Guangxi were clustered into one MB-subpopulation, GD-GX1. The exception to this was the accessions from Fusui County in Guangxi, which formed another MB-subpopulation, GX2. MSP was divided into two MB-subpopulations and covered Hunan and Jiangxi provinces (Figure 3). One of these, from the Jiangxi and Chaling counties of Hunan, was called JX-HuN1, and the other, which included most of the accessions from the Jiangyong county of Hunan, was denoted as HuN2. Most of the MB-subpopulations were isolated by natural barriers or wide spaces (Figure 3). HN was isolated by the Qiongzhou channel; FJ was separated from GD-GX1 by the Wushan mountains and from JX-HuN1 and HuN2 by the Wuyi mountains. There were wide spaces from YN to GD-GX1 and GX2, and from JX-HuN1 to HuN2. Regression between genetic distance and geographical distance indicated that geographical distance was significantly correlated with genetic distance (Supplementary Figure S3).

Figure 1
figure 1

Magnitude of ΔK for each K value.

Figure 2
figure 2

Membership of CWR individuals in the model-based populations when k=2 (a and b) and k=7 (c and d), and in predefined groups according to growth habits (a and c) and original provinces (b and d). Each individual was represented by a thin vertical line, the predefined groups were separated by black lines. At k=2 (a and b), the lake blue corresponding to south subtropical population and the orange corresponding to middle subtropical population. Blue—south subtropical population, orange—middle subtropical population. At k=7 (c and d), different colors represented the following model-based subpopulations, respectively: Pink—HN, yellow—GD-GX1, red—GX2, green—FJ, black—YN, blue—HuN2, Orange—JX-HuN1.

Figure 3
figure 3

Geographical distributions of the inferred populations. The colored dots represented inferred subpopulations of CWR in China (same to the color in Figure 2). The colored background represented different water systems of seven provinces. Grass green—Yangtze river valley, plum—Zhujiang river valley, lake blue—Southeast water system of China, light pink—Yuanjiang and Honghe river valley, wheat—water system in south Guangxi, Leizhou Byland and Hainan, gold—Lancangjiang and Mekong river valley.

In addition to the apparent isolations among MB-subpopulations, we found seven populations distributed in five water systems: GD-GX1 and GX2 in the Zhujiang river system; HN in the river system of south Guangxi, Leizhou Byland and Hainan Island; YN in the river system of Yuanjiang-Lancangjiang (with two accessions in the Lancangjiang and Mekong river valley); FJ in the river system of coastal southeast China; JX-HuN1 in the Yangtze river system; and HuN2 between the Zhujiang river system and the Yangtze river system. GD-GX1 grew in a wide region within 20°N–25°N and 106°E–116°E, where the Zhujiang, a big river with several branches, runs (Figure 3 and Supplementary Figure S4). A phylogenetic tree was constructed based on different counties where more than five accessions were collected (Figure 4). The results indicated that populations from the same branch tended to be clustered together (Supplementary Figure S5). The differentiation between HN, YN, FJ, HuN2 and GD-GX1 could be explained by geographical isolation and water system, but there were exceptions. GX2, whose habitat was surrounded by that of GD-GX1 with no significant isolation clustered into a different population. We suspect that the factors besides geographical isolation and water system have contributed to its differentiation.

Figure 4
figure 4

Distributions for indica-specific and japonica-specific haplotypes in two model-based populations (a) and seven model-based subpopulations (b). Haplotype-I—indica haplotype; Haplotype-J—japonica haplotype.

Genetic diversity and relationships of inferred populations

A total of 805 alleles were detected at 36 SSR loci in the 889 accessions. The number of alleles at each locus ranged between 8 and 53, with an average of 22.36. Gene diversity ranged 0.2817–0.9648, with an average of 0.7946 (detailed diversity information for SSR loci is listed in Supplementary Table S3). Table 1 shows that SSP has a higher level of GD than MSP. Among MB-subpopulations, GD-GX1 contained the largest number of alleles and the most genotypes among the inferred populations, followed by HN. However, allele and genotype numbers in the other five MB-subpopulations were obviously lower than those in GD-GX1 and HN. Similarly, the majority (90.8%) of population-specific alleles were in HN and GD-GX1. Gene diversity and heterozygosity decreased from HN to JX-HuN1 with the increase in latitude, with the exception of YN, which had the lowest level of gene diversity and heterozygosity. Allelic richness followed a similar distribution.

Table 1 Summary statistics for model-based CWR populations and subpopulations

Pair-wise genetic distance (Nei et al, 1983) of seven MB-subpopulations indicated that GD-GX1showed smallest genetic difference with all of others and YN showed the largest genetic difference with all of the others (Table 2) . Other research on the directional evolution of microsatellites (Rubinsztein et al., 1995; Vigouroux et al., 2003) has indicated that the size of SSR alleles in modern types is increased relative to ancestral types. We also investigated the distribution of allele size in the CWR populations for 17 stepwise SSR loci (Supplementary Table S4). The average standardized allele molecular weight of different MB-populations was estimated as follows: YN >> FJ > GD-GX1 > HuN2 > JX-HuN1 >> GX2> HN. The average allele size of YN was significantly larger than that of the other inferred populations (Table 3).

Table 2 Pair wise Nei et al.'s (1983) genetic distances among seven model-based CWR subpopulations
Table 3 T-test of differences in average standardized allelic size for pairs of model-based CWR subpopulations

Possibility of indica-like or japonica-like differentiation within Oryza rufipogon Griff.

It would be interesting to know which CWR have diverged into indica-like and which into japonica-like types, and how those two types are distributed in China. To answer this question, we detected 72 japonica-specific alleles and 61 indica-specific alleles from cultivated rice using the X2-test and the G2-test. Distribution of those two types of alleles within each MB-subpopulation indicated that there were more japonica-specific than indica-specific alleles in JX-HuN1 and HuN2, but no significant difference in other populations (Table 4). Regression of specific-allele frequency within geographic regions (longitude and latitude) indicated that the frequency of japonica–specific alleles significantly increased with increased latitude (P=0.009; Supplementary Figure S6); the frequency of indica-specific alleles, however, showed no significant correlation with latitude. The frequencies of both japonica-specific and indica-specific alleles showed no significant correlation with longitude. However, significant differences between the frequencies of those two types of specific alleles were found in the longitude region of 116–117°E and the latitude regions of 26–27°N and 28–29°N (Table 4), where the accessions were mostly came from JX-HuN1. In summary, japonica property increased from south to north. MB-subpopulations in SSP showed no significant divergence between indica and japonica, demonstrating both indica and japonica properties; MB-subpopulations in MSP were evidently japonica-like.

Table 4 Average frequency of specific alleles and their t-test in model-based subpopulations, longitude regions and latitude regions

We also investigated the distribution of haplotypes composed of four loci, rm296, rm71 rm267 and rm25, which showed a higher indica-japonica differentiation index (>0.3) than the other 32 (<0.2; Supplementary Table S5). There was a total of 1287 haplotypes in cultivated rice and CWR, of which 74 were in both cultivated rice and CWR, 47 were only in the cultivated rice and 1166 were only in CWR. The X2-test of 74 haplotypes in cultivated rice revealed 38 indica-specific haplotypes (haplotype-I) and 11 japonica-specific haplotypes (haplotype-J). Both haplotype-I and haplotype-J were much more distributed in HN, GD-GX1, GX2 and FJ, which were MB-subpopulations in SSP, than in JX-HuN1, which was an MB-subpopulation in MSP. Neither type was found in YN and HuN2. The MB-subpopulations in SSP had a higher frequency of haplotype-I than haplotype-J; however, the reverse was the case for the MB-subpopulaton in MSP (Figure 4).

Table 5 indicates that GD-GX1 is genetically closest to both indica and japonica. Comparison of population genetic difference between CWR inferred populations and cultivated rice from different provinces (Table 6) indicates that GD-GX1 has the closest genetic relationship with all rice cultivars. So if cultivated rice was partially domesticated in China, GD-GX1 might be the ancestry population.

Table 5 Genetic distance (Nei et al., 1983) between inferred CWR populations and indica, japonica
Table 6 Population genetic distance (Nei et al., 1983) between inferred CWR populations and cultivated rice collected from different provinces in China

Discussion

What contributed to the population subdivision within Oryza rufipogon Griff. in China?

Oryza rufipogon Griff. is the direct ancestor of cultivated rice (Oka, 1974). To understand the genetic differentiation, evolution, conservation and use of wild rice, its genetic structure was studied by many researchers, especially in China (Gao et al., 2000a, 2000b, 2002a, 2002b; Zhou et al., 2003; Song et al., 2003a). Despite this, the genetic structure of the whole germplasm resource in China has not been identified. Using the STRUCTURE program to infer population structure and Evanno's method (Evanno et al., 2005) to estimate the number of clusters, we found a two-level hierarchical structure composed of two populations—a SSP and a MSP—further divided into seven subpopulations. What contributed to these hierarchical divisions? In theory, mutation, isolation, selection and genetic drift can all influence genetic divergence (Thingsgaard, 2001; England et al., 2002; Primmer et al., 2006). For the CWR population, several possible factors have been put forward, including spatial or physical isolation and local adaptation (Gao et al., 2000b, 2002a; Zhou et al., 2003; Cai et al., 2004). However, we are far from reaching a consensus about which factors took effect and their relative significances. The hierarchical structure might indicate that there were hierarchical factors in the formation of the population subdivision. The highest ΔK and the most stable structure pattern at k=2 suggest that the differentiation that took place at the first level was the essential one within Chinese CWR. Most of the SSP grew in the south subtropical climate, whereas the MSP grew in the middle subtropical climate. This suggests that the first and most important factor was climate; that is, that natural adaptation caused the first level of differentiation within the Chinese CWR population. The role of climate is also supported by the japonic-like character of the MSP and the indica-like character of the SSP (see below). On this point, therefore, we differ from Cai et al. (2004), who argued that genetic differences among natural populations were the result of spatial isolation and not local adaptation. The most important factor at the second level of the hierarchy is undoubtedly isolation by space or physical barrier, which could explain all the divergences among each pair of seven subpopulations, with the exception of that between GX2 and GD-GX1. This observation is supported by previous studies, as summarized in the introduction section. In Guangdong and Guangxi, CWR grew widely and two populations in Jiangxi and Hunan, despite being located far away, formed the populations GD-GX1 and JX-HuN1, respectively. Thus, there must have been a way of achieving efficient gene flow. Given that CWR grows mainly beside or near rivers, seed and rhizome would have been easily dispersed by the water flow (Pang and Chen, 2002). The water system could compensate for the effect of spatial isolation by increasing the gene flow among individuals and populations living on the same water system. The relationship between population structure and water system has proved this. Furthermore, it has been observed that CWR in the same branches of the Zhujiang river tend to be clustered together. Although the bootstrap values are not very high, it is reasonable, given that they all belong to the same water system, to surmize that gene flow had occurred easily among the accessions in different branches. Underestimating the influence of the water system may be one of the causes of disagreement on the role of isolation in population subdivision, for example, the positive opinion of Gao et al. (2000b, 2002a) and the negative view taken by Zhou et al. (2003).

GX2 was an interesting population. It was not isolated from GD-GX1 by any evident natural barrier and both were within the same water system. Possible explanations include local adaptation and genetic drift. We prefer local adaptation as most of the accessions grew in the pond where there was no standing water after the rainy season (Zhengbin Chen, Rice 400 Research Institute, Guangdong Academy of Agricultural Sciences; private communication). The divergence of GX2 and GD-GX1 may have been a result of the particular landscape in Fusui county. The influence of local ecological adaptation on genetic structure has been demonstrated by others (Semon et al., 2005; Coulon et al., 2006).

The genetic variation of CWR populations could be summarized as follows: the smallest allelic size and highest GD are seen in HN, indicating that it might be the most ancestral population in China. GD-GX1, the largest population, had the most alleles, high GD and the closest genetic relationship with all other populations. The other populations, with the exception of YN, had lower genetic diversities and fewer alleles, and their allelic sizes were similar to that of GD-GX1. These results could be interpreted in two ways. In the first interpretation, the genetic variation represents an evolutionary relationship among seven CWR populations. Thus, CWR in China might have originated in Hainan and been dispersed to Guangdong and Guangxi and hence to other populations (with the exception of YN). The CWR from Yunnan, with an extremely distant genetic relationship and no common haplotype with other wild-rice populations in China, might be a south Asian population, as proposed by Sun et al. (2002). The HN population grew mainly in Hainan. As an island, Hainan has not been considered a likely candidate for the role of original center. However, it was joined with the mainland one million years ago; Oryza rufipogon Griff. appeared about seven million years ago (Second, 1985b). Thus, genes could have been exchanged between populations in Hainan and those in mainland of China one million years ago. If CWR had been introduced from south Asia, its first station would have been Hainan. This possibility needs further study with more CWR, including CWR from outside China. In the second interpretation, changes in the distribution of the CWR population were related to changes in the environment and other factors, such as human population. According to both the fossil rice phytoliths (Zhao and Piperno, 2000; Lu et al., 2002) and ancient Chinese documents (Huang et al., 1998), CWR may have been distributed further north than it is today. Cooling during the Younger Dryas epoch (∼13 000–10 000 BP) and the pressure of population expansion during past 2000 years are other possible reasons for the loss of diversity in CWR at high latitude.

Although both interpretations could explain the decay in GD from the south to the north of China, only the first could explain the decrease in allele size from south to north, so we would argue that the first interpretation is the correct one.

Indica-japonica divergence of Oryza rufipogon Griff. in China

The question of whether differentiation between indica and japonica occurred in Chinese CWR has been widely discussed by researchers. Second (1982, 1985a) reported that it was japonica-like. Sun et al. (2002) compared germplasm collected from south Asia, southeast Asia and China and concluded that China had both indica-like and japonica-like accessions. Cai et al. (1996) also found that some indica-like accessions existed in southern China, and that those in the north were japonica-like. In our results, both japonica- or indica-like alleles and haplotypes indicated that there were both japonica-like and indica-like variants in Chinese CWR populations. These variants were more japonica-like in the MSPs and more indica-like in the SSPs. It was known that more japonica than indica rice was planted from north to south in China, and that gene flow may occur, especially from cultivated rice to CWR when the cultivated rice was planted near the CWR population (Messeguer et al., 2001; Song et al., 2003b). Was the indica-japonica divergence within CWR a result of gene flow from cultivated rice to CWR? No, in our opinion. First, it is well known that CWR in China grew in a region lower than 600 m, and the wild rice of JX-HuN1 and HuN2 in a region lower than 250 m. No japonica was planted near populations of wild rice in Jiangxi and Hunan. Thus, the possibility that japonica rice transferred genes to CWR is very slim. In addition, most of the research reported that the pollen of cultivated rice could be transferred only 30–40 m (Messeguer et al., 2001; Song et al., 2003b), and most of the individuals were sampled from a point at least 50 m away from the population edge. So, we deduced that indica-like and japonica-like divergence was caused mainly by natural adaptation to different climatic regions rather than the influence of gene flow from cultivated rice. There are two major hypotheses about the domestication of the two subspecies of Asian cultivated rice, indica and japonica. Recent research favors the multiple origin model (Second, 1982; Londo et al., 2006) over the single origin model (Ting, 1957; Oka, 1974; Oka and Morishima, 1982). The multiple origin model suggests that japonica was domesticated in China and indica in south Asia. If so, it is difficult to understand some of our results. We found that both indica and japonica, especially indica, have the least genetic distance with GD-GX1; all indica and japonica haplotypes could be found and the frequency of indica haplotypes was higher than that of japonica haplotypes in GD-GX1. Further research is needed to consider the possibility that Chinese indica was domesticated independently in China. If that proves to be the case then GD-GX1, with the closest genetic relationship with cultivated rice, might be the closest to its ancestral population, and japonica may have been domesticated, in turn, from the domesticated indica or from wild rice at high latitude.