Introduction

Asian cultivated rice (Oryza sativa L.) is one of the world's most important crops, meeting the stable food demand of more than half of the global population. The development of hybrid rice is a major approach for increasing the yield potential of rice, since hybrid rice varieties have about 20% or more yield advantage over improved inbred varieties1. Hybrid rice technologies are mainly based on the three-line and two-line systems. The three-line system uses cytoplasmic male sterility lines, maintainer lines and restorer lines2,3. The two-line hybrid rice is based on the discovery and application of environmentally sensitive genic male sterile (EGMS) lines, which serve as both the male sterile lines and maintainer lines under different environmental conditions4. Thus, the two-line hybrid rice system is an important innovation for the better exploitation of hybrid vigor (heterosis). Compared with the three-line system, the advantages of the two-line system include a wider range of germplasm resources used as breeding parents, better grain quality and higher yields, and benefiting from simpler procedures for breeding and hybrid seed production5. In recent years, two-line hybrid rice has been applied for large-scale grain production in China6.

Cell development in plants is controlled by cross-talk between certain genetic networks and environmental conditions such as photoperiod and temperature. Photoperiod-sensitive genic male sterile (PGMS) lines and thermo-sensitive genic male sterile (TGMS) lines are two major types of EGMS germplasm resources, and have been widely used for the breeding of two-line hybrid rice. Nongken 58S (NK58S) is the first spontaneous PGMS mutant found in 1973 from the japonica (O. sativa ssp. japonica) cultivar Nongken 58 (NK58)7,8. Therefore, the discovery of NK58S has opened the door to the development and large-scale application of two-line hybrid rice in agriculture. NK58S retains complete male sterility when the day length (photoperiod) is longer than 13.75 h during the anther development; it converts to partial or complete male fertility when the day length is shorter than 13.5 h. A high temperature (∼29 °C) slightly promotes complete male sterility under long-day conditions9.

NK58S has been used as a donor parent for breeding of a large number of rice EGMS lines by introgressive backcrossing10. Peiai 64S (PA64S) is one such NK58S-derived line with indica (O. sativa ssp. indica) genetic background. With wide compatibility and good agronomic traits, PA64S has become the most widely used female parent for two-line hybrid rice breeding, including being used as the parent of the super-rice variety Liangyoupei96,11,12. However, the male sterility-fertility conversion of PA64S and other indica lines derived from NK58S is controlled mainly by temperature rather than by day length: PA64S exhibits male sterility at temperatures higher than 23.5 °C during the anther development, but it converts to male fertility when the temperature is ∼21-23 °C13,14. Long-day (14 h) conditions can suppress the degree of sterility-to-fertility conversion under low temperatures (21-23 Â°C), but short-day (12 h) conditions cannot restore male fertility under high temperatures14.

To date, a number of loci that control PGMS or TGMS, including reverse PGMS or reverse TGMS, in different rice lines have been mapped to different chromosomes: photoperiod-sensitive genic male sterile genes pms115,16, pms215, pms317,18,19,20; reverse photoperiod-sensitive genic male sterile genes rpms121, rpms221; thermo-sensitive genic male-sterile genes tms122, tms223,24, tms325,26, tms427, tms55,28,29,30,31, tms632, tms6(t)33; photoperiod-thermo-sensitive genic male sterile genes ptgms2-134, pms1(t)35; and reverse thermo-sensitive genic male-sterile genes rtms136. Despite the wide application of PGMS and TGMS lines in hybrid rice breeding, the genes responsible for PGMS and TGMS in these lines have not been cloned, and the molecular mechanisms underlying these traits remain unclear.

MicroRNAs (miRNAs) and small interfering RNAs (siRNAs) are two major types of endogenous small, 20-30 nucleotide (nt) RNAs that are widely distributed in eukaryotes. They regulate a number of biological processes, including cell development, metabolic control and adaptive responses to biotic and abiotic stresses, by suppressing gene expression via acting either on RNA to guide cleavage or translational repression or on DNA to mediate chromatin remodeling37,38,39,40. miRNAs are generated from single-stranded precursor transcripts that self-fold into stem-loop structures. Mature miRNAs form complexes mainly with the Argonaute protein; they can then bind to target mRNAs, after which they cleave and degrade the mRNA strands41,42,43. In addition, miRNAs are involved in translational suppression44,45. siRNAs are derived from long double-stranded RNAs by an RNase III named DCL (Dicer-like) in plants46,47,48, where several types of siRNAs have been discovered: trans-acting siRNAs (ta-siRNAs), natural antisense transcript-derived siRNAs (nat-siRNAs) and repeat-associated siRNAs (ra-siRNAs)49. ta-siRNAs arise from defined genetic loci (TAS loci) through an miRNA-dependent biogenesis pathway, and negatively regulate gene expression by guiding the cleavage of target mRNAs. nat-siRNAs originate from pairs of natural cis-antisense transcripts and guide the cleavage of the constitutive transcripts. ra-siRNAs originate from genomic repetitive sequences and are associated with DNA remodeling.

In this study, we report the identification and cloning of the major locus, p/tms12-1, that confers TGMS in PA64S and PGMS in NK58S. We found that the locus encodes a noncoding RNA precursor that produces a 21-nt small RNA. A point mutation is present in the small RNA of p/tms12-1. Transformation of NK58S and PA64S plants with the allele P/TMS12-1 produced the wild-type small RNA and suppressed both PGMS and TGMS under long-day and high-temperature conditions. Our results reveal that this point mutation in p/tms12-1 confers the PGMS and TGMS traits. We discuss possible mechanisms by which this mutation in the small RNA might cause the PGMS and TGMS traits in the japonica and indica lines, respectively.

Results

Identification of a major locus for TGMS in PA64S

We observed that PA64S was completely male sterile when grown under sterility-inducing high temperatures (25-30 Â°C), but became male fertile when it was subjected to lower temperatures (21-23 Â°C) during the microspore mother cell (MMC) to the meiosis stages50, corresponding to the anther developmental stages 6-8a defined recently by Zhang et al.51 (Figure 2A-2D). This feature of sterility-fertility conversion is similar to previous observations of this line13,14,52. To estimate the number of loci that control the TGMS trait in PA64S, an F2 population was produced from a cross between PA64S and Peiai 64 (PA64) (the recurrent backcrossing parent used for breeding of PA64S). Genetic analysis of this F2 population grown under the sterility-inducing temperatures (higher than 25 Â°C) showed that its segregation into 880 fertile plants and 105 sterile plants deviated from the ratio of 3:1 expected for a single locus. This suggested that TGMS in PA64S is determined by one major recessive locus and might be affected by other minor gene(s).

Figure 2
figure 2

Pollen fertility of transgenic NK58S and PA64S plants is restored by TMS12-1(t). (A, B) Fertile pollen of NK58S and PA64S plants grown under short-day (12 h) and low-temperature (22-23 Â°C) conditions during the MMC-meiosis stages. (C, D) Sterile pollen of NK58S and PA64S plants grown under sterility-inducing conditions (14-h day length, 25-30 Â°C). (E, F) Fertility-restored pollen of PA-10.4- and PA-9-transgenic T1 NK58S plants, respectively. (G, H) Fertility-restored pollen of PA-2.4-transgenic T1 plants of NK58S and PA64S, respectively. (I, J) Fertility-restored pollen of transgenic T1 NK58S (I) and T1 PA64S (J) plants overexpressing a 375-bp sequence (PA-0.37). All transgenic plants were grown under sterility-inducing conditions (14-h day length, 25-30 Â°C). Scale bars: 50 Î¼m.

We then performed linkage analysis with these 105 sterile plants, using 256 simple sequence repeat markers that were distributed across the 12 chromosomes. One marker (RM2992) on chromosome 12 was found to be linked to the TGMS phenotype. Use of additional markers around RM2992 for primary mapping enabled us to define a 937-kilobase (kb) region on chromosome 12, which was flanked by two markers, PA107 and PA243 (Figure 1A). We temperately designated the recessive locus tms12-1(t) (thermo-sensitive genic male sterility locus on chromosome 12) in PA64S.

Figure 1
figure 1

Molecular mapping of tms12-1(t). (A) Fine mapping of tms12-1(t) to a 5.8-kb region on chromosome 12. (B) Predicted gene/ORF composition of the mapped region in a BAC sequence, in which the only mutation within the mapped regions is indicated. (C) Fragments used for genetic complementation tests, of which PA-0.37 was linked to the promoter of the maize ubiquitin gene (Pubi).

Fine mapping of the TGMS locus and detection of a point mutation within the allele

To fine-map tms12-1(t), 690 completely sterile plants were selected from the F2 population and more molecular markers (Supplementary information, Table S1) were developed from the initially mapped region. Finally, tms12-1(t) was mapped within a region of 5.8 kb flanked by the markers PA301 and PAIDL2. This region is carried in the BAC clone AL731757 (Figure 1A and 1B). A single nucleotide polymorphism (SNP) marker, PASNP3, cosegregated with tms12-1(t) (Figure 1A). This 5.8-kb sequence contained partial sequences of a gene Os12g0545900 (AK111270) of unknown function and of a predicted open reading frame (ORF2) (Figure 1B).

We performed sequence analysis of this mapped region in four sterile lines, NK58S, 7001S (a NK58S-derived japonica PGMS line), PA64S and GD-8S (another NK58S-derived indica TGMS line), as well as three normal fertile lines, PA64, NK58 and Zhonghua11 (a japonica fertile line). Only one SNP of the marker PASNP3 was identified, with a C residue in all the fertile lines and a G residue in all the PGMS and TGMS lines. The genomes of an indica cultivar, 93-11 (fertile line) and an African cultivated rice (O. glaberrima Steud), searched on the GenBank, also possess the C residue in this position. The G in the PGMS and TGMS lines was therefore derived from a point mutation in the recessive allele. A BLAST similarity search showed that the mapped sequence is unique in the rice genome.

A 2.4-kb sequence of the wild-type allele restores male fertility to NK58S and PA64S

As the sequence of tms12-1 in PA64S was identical to that of NK58S, we speculated that TGMS in PA64S and PGMS in NK58S are caused by this SNP. To test this hypothesis, we carried out genetic complementation tests. A 10.4-kb fragment (designated PA-10.4) and a 9-kb fragment (PA-9), which contained the mapped region and overlapped each other by 2.4-kb including the SNP site (Figure 1C), were amplified from PA64. The fragments were cloned into a plant transformation binary vector, pCAMBIA1380, and the constructs were used to transform NK58S. Overall, 52 and 38 independent T0 transgenic plants were obtained with PA-10.4 and PA-9, respectively. We observed that the pollen fertility of the transgenic plants (T0, T1 and T2) was restored under long-day (14 h) conditions by both PA-10.4 and PA-9 (Figure 2E and 2F), suggesting that the locus is located within this 2.4-kb region encompassing the SNP. We then prepared a construct that contained this 2.4-kb fragment (PA-2.4) and transformed it into PA64S and NK58S. Overall, 10 and 29 independent T0 transgenic plants were obtained from PA64S and NK58S, respectively. NK58S- and PA64S-transgenic plants (T0 and T1) carrying this fragment became male fertile under long-day (14 h) and high-temperature (25-30 Â°C) conditions (Figure 2G and 2H). The results demonstrate that this 2.4-kb sequence from the wild-type allele TMS12-1(t) is able to suppress PGMS and TGMS, while the C-to-G mutation in this locus confers PGMS in the japonica line NK58S and TGMS in the indica line PA64S. Therefore, we renamed this locus p/tms12-1 (photo- or thermo-sensitive genic male sterility locus on chromosome 12) and P/TMS12-1 for the mutant and wild-type alleles, respectively.

Identification of a small RNA in P/TMS12-1 and p/tms12-1

Since this 2.4-kb region excluded Os12g0545900 and ORF2 as candidate genes, we speculated that this locus might encode small RNA(s). By whole-genome small RNA sequencing, we detected a small RNA of 21 nt (CAUUGUUUGUCUACCAUCCAU; 21 reads in a total of 27 274 860 reads) from P/TMS12-1 of PA64 and one (CAUUGUUUGUGUACCAUCCAU; 10 reads in a total of 27 957 497 reads) from p/tms12-1 of PA64S. The C-to-G SNP between P/TMS12-1 and p/tms12-1 (underlined above) is present at position 11 of these small RNAs. The presence of the same small RNAs in NK58 and NK58S was confirmed by a specially designed 3′ and 5′ RACE assays (Figure 3A). Later we searched the small RNA database CSRDB (http://sundarlab.ucdavis.edu/smrnas) and also found that this small RNA (osa-smRNA615912) expressed in wild-type rice.

Figure 3
figure 3

Identification of the small RNA and its precursor. (A) Confirmation of the small RNAs in NK58, NK58S, and overexpressing (P/TMS12-1-OE) plants by 3′ RACE and 5′ RACE assays and sequencing, by which the 5′ and 3′ termini of the small RNAs (shown by rectangles) were determined. The RNA samples were treated with poly(A) polymerase to polyadenylate the 3′ end of small RNAs for reverse transcription with a poly dT-adapter primer. The 5′ part (5′-ACTCACT-3′) of the specific primer is an adapted arbitrary sequence. (B) Identification of the 5′ and 3′ termini of the precursor RNA in an overexpressing plant (P-2 shown in Figure 5C) by 5′ and 3′ RACE assays and sequencing. (C) The 375-bp sequence from P/TMS12-1 used for overexpression in NK58S and PA64S, in which the transcribed sequence generating the processed intermediate RNA precursor (positions 106-241) is shown in italic and that generating the small RNA is in capital letters. Specific primers used for semi-nested PCR of the 5′ or 3′ RACE product are shown.

Overexpression of P/TMS12-1 generates the same small RNA and restores male fertility

To test if the overexpression of a short sequence from P/TMS12-1 could generate the small RNA and whether it was functional in suppressing the PGMS and TGMS phenotypes, we cloned a 375-bp fragment of P/TMS12-1 (Figure 3C) containing the SNP site into a binary vector, in which the fragment was located downstream of the maize ubiquitin promoter (Figure 1C). This construct was transformed into NK58S and PA64S. In total, 78 and 31 NK58S- and PA64S-transgenic plants were obtained, respectively. Similar to the results of complementation tests with the native sequence constructs, the pollen fertility of NK58S- and PA64S-transgenic plants harboring this recombinant construct was also restored under long-day and high-temperature conditions (Figure 2I and 2J). Furthermore, 3′ and 5′ RACE assays confirmed that the same wild-type small RNA was produced (Figure 3A). Since the transgenic wild-type small RNA was expressed at much higher levels than those of endogenous one in the lines (Figure 5C), all the obtained overexpressing transgenic plants were restored to fully male fertile.

Figure 5
figure 5

Expression patterns of osa-smR5864w and osa-smR5864m. (A) Expression patterns for osa-smR5864w in different organs of PA64. (B) Expression levels of osa-smR5864w (NK58 and PA64) and osa-smR5864m (NK58S and PA64S) in young panicles under sterility-inducing conditions. t-tests were performed between NK58 and NK58S and between PA64 and PA64S, with significant level of P < 0.01 (**). (C) High levels of osa-smR5864w were detected in overexpressing transgenic plants (three independent plants were assayed for each of NK58S-OE and PA64S-OE). (D) Expression levels of osa-smR5864w and osa-smR5864m in young panicles of NK58 and NK58S under long-day (14 h, LD) and short-day (12 h, SD) conditions, respectively; and of PA64 and PA64S under high- temperature (27-30 Â°C, HT) and low-temperature (22-23 Â°C, LT) conditions (* and ** indicate P < 0.05 and P < 0.01, respectively). All qRT-PCR assays used the 5S RNA as an internal control for normalization, and the samples marked with S were used as standards (relative value=1) against which expression in the other samples was compared.

Determination of the precursor sequence of the small RNA

To determine the precursor RNA sequence that produced the small RNA in the overexpressing transgenic plants, we performed 5′ and 3′ RACE assays. Sequencing the 5′ and 3′ RACE products (Figure 3B) identified a 136-nt RNA containing the small RNA sequence (Figure 3C). The 3′ RACE product could be specifically amplified from the RNA sample polyadenylated by poly(A) polymerase, but not from that without the in vitro 3′ polyadenylation reaction (Supplementary information, Figure S1), suggesting that this 136-nt RNA was an in vivo-processed intermediate precursor that lacked the 3′ poly(A) end. These results indicate that this locus is a noncoding RNA gene that yields the small RNA. We named the wild-type and mutant small RNAs osa-smR5864w and osa-smR5864m, respectively.

To assess whether the small RNAs might have been generated from miRNA precursor structures, we performed a prediction of the secondary structures of the RNA precursors of P/TMS12-1 and p/tms12-1 using the CONTRA fold tool (http://contra.stanford.edu/contrafold/server.html). The sequences were predicted to adopt conformations (Figure 4A and 4B) that are markedly different from those of typical pre-miRNAs53.

Figure 4
figure 4

Prediction of the secondary structures of the small RNA precursors. (A) The secondary structure of the precursor from P/TMS12-1. (B) The secondary structure of the precursor from p/tms12-1. The small RNAs are located at positions 34-54 indicated by arrows and the SNP sites are shown by arrowheads. In secondary RNA structures, U residue can pair with G residue (shown with dots).

The small RNA is expressed preferentially in young panicles with little effect of day length and temperature

The expression patterns of the small RNAs were analyzed using a small RNA qRT-PCR (quantitative real-time PCR) detection kit (GeneCopoeia). Two osa-smR5864w- and osa-smR5864m-specific primers (Supplementary information, Table S1) were used for qRT-PCR. The expression level of endogenous osa-smR5864w in young panicles (at the MMC-meiosis stages) was at least 10-fold higher than those in other vegetative tissues under 27-30 Â°C/14-h day length conditions (Figure 5A). Differential expression between osa-smR5864w and osa-smR5864m in young panicles was observed under 27-30 Â°C/14-h day length conditions: the level in PA64 was 3.1-fold higher than that in PA64S, while the level in NK58 was about 4.8-fold higher than that in NK58S (Figure 5B). High-level production of osa-smR5864w in the overexpressing transgenic plants was detected (Figure 5C): of the analyzed samples, the expression levels were 366- to 4 086-fold higher than those of osa-smR5864m in NK58S and PA64S.

To understand whether the expression of the small RNAs is regulated by photoperiod and/or temperature, we then examined the expression levels in young panicles of NK58, NK58S, PA64 and PA64S under different day length and temperature conditions. The results showed that, although the expression levels of the small RNAs in the same lines were different between long-day (14 h) and short-day (12 h) conditions or between high (27-30 Â°C) and low (21-23 Â°C) temperatures (Figure 5D), the differences were only 1.3- to 1.43-fold. The results suggest that expression of the small RNAs in the plants is not greatly affected by photoperiod or temperature conditions.

Candidate targets of osa-smR5864w

To identify candidate gene(s) targeted by the small RNA, we used CSRDB to search the rice genome and found 10 candidates target genes of osa-smR5864w (Table 1). The functions of these candidate genes are unknown. All of them possess imperfectly paired annealing sites for the small RNA. All but one (LOC_Os08g05710.1) have the nucleotide G complementary to the SNP nucleotide C of osa-smR5864w. Therefore, the mutation in osa-smR5864m reduces the degree of its matching to these nine candidate targets.

Table 1 Target gene candidates and the targeting sites of osa-smR5864w (the SNP site is underlined)

A number of anther development-related genes have been cloned in rice, including Gibberellins MYB gene (GAMYB)54,55, Undeveloped Tapetum1 (UDT1)56, Tapetum Degeneration Retardation gene (TDR)57, CYP704B258, OsC659, MADS360, Carbon Starved Anther (CSA)61, APOPTOSIS (API5)62 and MEIOSIS ARRESTED AT LEPTOTENE1 (MEL1)63. These genes were not recognized as the potential targets of osa-smR5864w as searched by the CSRDB tool. To understand whether these genes are regulated indirectly by P/TMS12-1, we carried out qRT-PCR analysis using specific primers (Supplementary information, Table S1) to investigate their expression profiles. The result showed that the expression levels of these genes were similar between NK58S and NK58 and between PA64S and PA64, although four genes, GAMYB, TDR, MADS3 and API5, exhibited 1.3- to 1.5-fold difference between some pairs of the lines (Supplementary information, Figure S2). The result suggests that the regulatory pathway of P/TMS12-1 does not involve these genes.

Discussion

PGMS and TGMS are very useful germplasm resources for hybrid breeding in crops. In this study, we identified a major locus that controls TGMS in the indica rice line PA64S and mapped the locus to a 5.8-kb region on chromosome 12. Sequence analysis revealed a C-to-G mutation in this region of PA64S and its donor japonica parent line NK58S. Genetic complementation tests showed that a 2.4-kb sequence of the wild-type allele, P/TMS12-1, could restore male fertility of transgenic NK58S and PA64S plants. As the locus pms3 conferring PGMS in NK58S has been mapped to the region of chromosome 1220 in which p/tms12-1 identified in this study is located, we speculate that p/tms12-1 and pms3 may represent the same locus.

We identified a novel 21-nt small RNA from P/TMS12-1 and p/tms12-1, and the only polymorphic site between P/TMS12-1 and p/tms12-1 is located in the center of this small RNA sequence. Furthermore, overexpression of a 375-bp sequence of P/TMS12-1 in transgenic PA64S and NK58S produced the same small RNA and restored pollen fertility under sterility-inducing conditions. Since the sequence of P/TMS12-1 is unique in the rice genome and no gene displays appreciable similarity to the precursor RNA sequence except for those that match partially with osa-smR5864w (Table 1), it is unlikely that the precursor RNA itself acts directly on target gene(s). Therefore, the small RNA osa-smR5864w encoded by P/TMS12-1 is the most likely player for suppression of the PGMS and TGMS traits; the point mutation in p/tms12-1 may lead to a loss-of-function of osa-smR5864m, and thereby confer PGMS and TGMS in the japonica and indica genetic backgrounds, respectively. Our findings suggest that this small RNA gene is an important regulator of genetic networks controlling the development of the male reproductive organ under certain environmental conditions.

Endogenous small RNAs can be generated by the miRNA and siRNA pathways. Most endogenous siRNA families are generated from double-stranded RNAs derived from repeated sequences or transposable elements. As the P/TMS12-1 sequence is unique in the rice genome, osa-smR5864w (or osa-smR5864m) does not belong to the repeated sequence-based siRNA family. On the other hand, miRNAs are generated from RNA precursors that form fold-back stem-loop structures. Our analysis showed that the RNA precursors of P/TMS12-1 and p/tms12-1 are predicted to form secondary structures (Figure 4A and 4B). However, their conformations are obviously distinct from the pre-miRNA structures of conventional miRNA precursors53. Thus, the small RNAs are unlikely to be generated by the miRNA pathway. However, the high-level production of osa-smR5864w in the overexpressing transgenic plants indicates that the small RNA was generated from overexpressed precursor RNA. This result suggests that the small RNAs are not likely to be produced from cis sense-antisense genes of the P/TMS12-1 and p/tms12-1 alleles by the nat-siRNA pathway53. It is possible that the precursor RNAs of P/TMS12-1 and p/tms12-1 are converted into double-stranded RNA structures by the RNA-dependent RNA polymerase pathway64, after which they are processed into mature ta-siRNAs by 21-nt phased small RNA biogenesis pathway as defined in rice46.

The major function of small RNAs is to repress the expression of target genes53. We searched the CSRDB database and found a number of candidates target genes for osa-smR5864w (Table 1), whose functions are unknown. However, all these candidate target genes have imperfectly paired sites to anneal with osa-smR5864w, with a maximum match of 19/21 nt; the point mutation in osa-smR5864m further reduces the degree of matching. Therefore, it is possible that the decreased match of osa-smR5864m to its target(s), together with its reduced expression level (Figure 5B), results in the loss of its ability for repressing the expression of the target gene(s) and thereby inducing PGMS and TGMS.

Our results showed that P/TMS12-1 (osa-smR5864w) was expressed preferentially in young panicles, which is consistent with its proposed function in the control of male fertility. In addition, the levels of osa-smR5864w and osa-smR5864m were not substantially affected by different day lengths in NK58S or temperatures in PA64S (Figure 5D). Our findings in this study therefore raise some intriguing questions, such as how the same mutant allele p/tms12-1 causes PGMS in the japonica line (NK58S) but TGMS in the derived indica lines represented by PA64S, and how different day lengths or temperatures control the sterility-fertility conversion in these lines of different subspecies. A plausible explanation is that the expression of the same or different target gene(s) of the small RNA may be upregulated by long day in japonica lines but by high temperature in indica lines, probably due to their different genetic backgrounds. The increased expression of the target gene(s) may impair the anther development. In wild-type rice plants, osa-smR5864w may downregulate the expression of the target gene(s) in the japonica and indica lines under long-day or high-temperature conditions to suppress PGMS and TGMS, whereas the mutant osa-smR5864m in the PGMS and TGMS lines fails to suppress the expression of the target gene(s), thus PGMS or TGMS occurs. Further study of the biogenesis of the small RNA and the functions of its target gene(s) will elucidate the mechanistic connection between the actions of the P/TMS12-1 and p/tms12-1 alleles and the male sterility-fertility conversion in the PGMS and TGMS lines, and will thus promote a more efficient exploitation of heterosis in rice.

Materials and Methods

Plant materials

The rice lines used in this study were PA64, NK58, NK58S, 7001S (a NK58S-derived japonica PGMS line), PA64S and GD-8S (NK58S-derived indica TGMS lines) and Zhonghua11 (japonica line). An F2 population was generated by a cross between PA64S and PA64. The F2 plants were grown in summer of 2006 in Guangzhou for genetic analysis and screening of TMGS individuals. Parents and transgenic plants requiring artificial conditions were reared in phytotrons. Pollen fertility was assayed by staining with potassium iodide (1% I2-KI).

Molecular cloning of p/tms12-1

Primary and fine mapping of p/tms12-1 were carried out using TGMS plants obtained from the F2 population. New polymorphic markers (Supplementary information, Table S1) for fine mapping were developed based on sequence differences between indica variety 93-11 and japonica variety Nipponbare available in NCBI (http://www.ncbi.nlm.nih.gov) and by sequence analysis among the materials described above.

Preparation of binary constructs and rice transformation

To prepare the constructs for complementation tests, a 10.4-kb (PA-10.4) and a 9-kb (PA-9) sequences of P/TMS12-1 were amplified using primers PA-10.4 F/R and PA-9 F/R (Supplementary information, Table S1) and cloned into the EcoR I/Spe I sites (for PA-10.4) or EcoR I/Pst I sites (for PA-9) of the pCAMBIA1380 binary vector. To yield the construct PA-2.4, a 2.5-kb sequence of P/TGMS12-1 was amplified using primers PA-2.4 F/R (Supplementary information, Table S1) and cloned into the Spe I/Bam H I sites of pCAMBIA1380. To prepare the overexpression construct of P/TMS12-1, a 375-bp fragment (PA-0.37) of P/TMS12-1 was amplified and inserted into the Hind III/Spe I sites downstream of the promoter of maize ubiquitin (Pubi) in a pCAMBIA-based vector. These constructs were transformed into the Agrobacterium tumefaciens strain EHA105 and were used for rice transformation by a previously described method65.

RT-PCR, RACE and qRT-PCR

Total RNAs from roots, stems, leaves and young panicles (at MMC-meiosis stages) were prepared using TRIzol (Invitrogen, Carlsbad, CA, USA) in accordance with the manufacturer's recommendations. cDNA was synthesized from 1 Î¼g of DNaseI-treated total RNAs with a Moloney Murine Leukemia Virus Reverse Transcriptase (M-MLV-RT) kit (Takara Biotechnology, Dalian, China). An aliquot (1 Î¼l) of the cDNA was subjected to PCR amplification using gene-specific primers (Supplementary information, Table S1). For 5′ and 3′ RACE reactions to determine the termini of the precursor RNA, the cDNAs were synthesized using the All-in-One miRNA qRT-PCR Detection Kit (GeneCopoeia, Rockville, MD, USA), with which 3′ polyadenylation and reverse transcription (using a polydT-adaptor primer) proceed in the same reaction. For the control 3′ RACE reaction without in vitro 3′ polyadenylation, the poly(A) polymerase was not added. For 5′ RACE, the first strand of the cDNA was tailed at the 3′ end by terminal deoxynucleotidyl transferase (Takara Biotechnology) and dGTP. Expression levels of the small RNAs were analyzed by qRT-PCR using the All-in-One miRNA qRT-PCR Detection Kit. The standard curve was prepared from series dilutions of template cDNA. Expression levels of the small RNAs were calculated after normalization to the expression levels of 5S ribosomal RNA as an internal control.

Small RNA sequencing

Whole-genome small RNA sequencing was performed by Beijing Genomics Institute using high-throughput pyrosequencing technology developed by Illumina. RNA samples were quantified and equalized so that equivalent amounts of RNA from mutant and wild-type plants were analyzed. Small RNAs were first separated from total RNAs from young panicles of PA64 and PA64S by size fractionation and recovery from a 15% acrylamide/urea gel. The isolated small RNAs were then ligated to 5′ and 3′ adaptors using T4 RNA ligase (Promega, Madison, WI, USA). The adaptor-ligated small RNAs were subsequently transcribed into cDNAs by SuperScript II Reverse Transcriptase (Invitrogen). The cDNA samples were loaded onto an Illumina 1G Genome Analyzer for sequencing. The sequencing data was analyzed by Science Corporation of Gene. The target candidates of the small RNA were search using the CSRDB webpage (http://sundarlab.ucdavis.edu/smrnas).