Structural basis for the modular recognition of single-stranded RNA by PPR proteins

Yin, Ping; Li, Quanxiu; Yan, Chuangye; Liu, Ying; Liu, Junjie; Yu, Feng; Wang, Zheng; Long, Jiafu; He, Jianhua; Wang, Hong-Wei; Wang, Jiawei; Zhu, Jian-Kang; Shi, Yigong; Yan, Nieng

doi:10.1038/nature12651

Letter
Published: 27 October 2013

Structural basis for the modular recognition of single-stranded RNA by PPR proteins

Ping Yin^1,2^na1,
Quanxiu Li^1,2^na1,
Chuangye Yan^2,3,
Ying Liu^1,2,
Junjie Liu^2,3,
Feng Yu⁴,
Zheng Wang⁵,
Jiafu Long⁵,
Jianhua He⁴,
Hong-Wei Wang^2,3,
Jiawei Wang^1,2,
Jian-Kang Zhu^6,7,
Yigong Shi^2,3 &
…
Nieng Yan^1,2

Nature volume 504, pages 168–171 (2013)Cite this article

27k Accesses
248 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Pentatricopeptide repeat (PPR) proteins represent a large family of sequence-specific RNA-binding proteins that are involved in multiple aspects of RNA metabolism. PPR proteins, which are found in exceptionally large numbers in the mitochondria and chloroplasts of terrestrial plants^1,2,3,4,5, recognize single-stranded RNA (ssRNA) in a modular fashion^6,7,8. The maize chloroplast protein PPR10 binds to two similar RNA sequences from the ATPI–ATPH and PSAJ–RPL33 intergenic regions, referred to as ATPH and PSAJ, respectively^9,10. By protecting the target RNA elements from 5′ or 3′ exonucleases, PPR10 defines the corresponding 5′ and 3′ messenger RNA termini^9,10,11. Despite rigorous functional characterizations, the structural basis of sequence-specific ssRNA recognition by PPR proteins remains to be elucidated. Here we report the crystal structures of PPR10 in RNA-free and RNA-bound states at resolutions of 2.85 and 2.45 Å, respectively. In the absence of RNA binding, the nineteen repeats of PPR10 are assembled into a right-handed superhelical spiral. PPR10 forms an antiparallel, intertwined homodimer and exhibits considerable conformational changes upon binding to its target ssRNA, an 18-nucleotide PSAJ element. Six nucleotides of PSAJ are specifically recognized by six corresponding PPR10 repeats following the predicted code. The molecular basis for the specific and modular recognition of RNA bases A, G and U is revealed. The structural elucidation of RNA recognition by PPR proteins provides an important framework for potential biotechnological applications of PPR proteins in RNA-related research areas.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Crystal structure of RNA-free PPR10.**

**Figure 2: Structure of PPR10 bound to an 18-nt *PSAJ* RNA element.**

**Figure 3: Base-specific recognition of ssRNA by PPR10 repeats.**

**Figure 4: Coordination of the 3′-end segment of the *PSAJ* RNA by PPR10.**

A synthetic RNA editing factor edits its target site in chloroplasts and bacteria

Article Open access 10 May 2021

Structure of the OMEGA nickase IsrB in complex with ωRNA and target DNA

Article Open access 12 October 2022

DYW domain structures imply an unusual regulation principle in plant organellar RNA editing catalysis

Article Open access 21 June 2021

Accession codes

Primary accessions

Protein Data Bank

4M57
4M59

Data deposits

The atomic coordinates and structure factors of RNA-free and RNA-bound PPR10 have been deposited in the Protein Data Bank (PDB) with the accession codes 4M57 and 4M59, respectively.

References

Small, I. D. & Peeters, N. The PPR motif — a TPR-related motif prevalent in plant organellar proteins. Trends Biochem. Sci. 25, 45–47 (2000)
Article Google Scholar
Nakamura, T., Yagi, Y. & Kobayashi, K. Mechanistic insight into pentatricopeptide repeat proteins as sequence-specific RNA-binding proteins for organellar RNAs in plants. Plant Cell Physiol. 53, 1171–1179 (2012)
Article CAS Google Scholar
Schmitz-Linneweber, C. & Small, I. Pentatricopeptide repeat proteins: a socket set for organelle gene expression. Trends Plant Sci. 13, 663–670 (2008)
Article CAS Google Scholar
Fujii, S. & Small, I. The evolution of RNA editing and pentatricopeptide repeat genes. New Phytol. 191, 37–47 (2011)
Article CAS Google Scholar
Kotera, E., Tasaka, M. & Shikanai, T. A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts. Nature 433, 326–330 (2005)
Article CAS ADS Google Scholar
Barkan, A. et al. A combinatorial amino acid code for RNA recognition by pentatricopeptide repeat proteins. PLoS Genet. 8, e1002910 (2012)
Article CAS Google Scholar
Yagi, Y., Hayashi, S., Kobayashi, K., Hirayama, T. & Nakamura, T. Elucidation of the RNA recognition code for pentatricopeptide repeat proteins involved in organelle RNA editing in plants. PLoS ONE 8, e57286 (2013)
Article CAS ADS Google Scholar
Yagi, Y. et al. Pentatricopeptide repeat proteins involved in plant organellar RNA editing. RNA Biol. 10, 1236–1242 (2013)
Article Google Scholar
Pfalz, J., Bayraktar, O. A., Prikryl, J. & Barkan, A. Site-specific binding of a PPR protein defines and stabilizes 5′ and 3′ mRNA termini in chloroplasts. EMBO J. 28, 2042–2052 (2009)
Article CAS Google Scholar
Prikryl, J., Rojas, M., Schuster, G. & Barkan, A. Mechanism of RNA stabilization and translational activation by a pentatricopeptide repeat protein. Proc. Natl Acad. Sci. USA 108, 415–420 (2011)
Article CAS ADS Google Scholar
Zhelyazkova, P. et al. Protein-mediated protection as the predominant mechanism for defining processed mRNA termini in land plant chloroplasts. Nucleic Acids Res. 40, 3092–3105 (2012)
Article CAS Google Scholar
Lurin, C. et al. Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis. Plant Cell 16, 2089–2103 (2004)
Article CAS Google Scholar
Cushing, D. A., Forsthoefel, N. R., Gestaut, D. R. & Vernon, D. M. Arabidopsis emb175 and other ppr knockout mutants reveal essential roles for pentatricopeptide repeat (PPR) proteins in plant embryogenesis. Planta 221, 424–436 (2005)
Article CAS Google Scholar
Khrouchtchova, A., Monde, R. A. & Barkan, A. A short PPR protein required for the splicing of specific group II introns in angiosperm chloroplasts. RNA 18, 1197–1209 (2012)
Article CAS Google Scholar
Bentolila, S., Alfonso, A. A. & Hanson, M. R. A pentatricopeptide repeat-containing gene restores fertility to cytoplasmic male-sterile plants. Proc. Natl Acad. Sci. USA 99, 10887–10892 (2002)
Article CAS ADS Google Scholar
Desloire, S. et al. Identification of the fertility restoration locus, Rfo, in radish, as a member of the pentatricopeptide-repeat protein family. EMBO Rep. 4, 588–594 (2003)
Article CAS Google Scholar
Wang, Z. et al. Cytoplasmic male sterility of rice with boro II cytoplasm is caused by a cytotoxic peptide and is restored by two related PPR motif genes via distinct modes of mRNA silencing. Plant Cell 18, 676–687 (2006)
Article CAS Google Scholar
Chase, C. D. Cytoplasmic male sterility: a window to the world of plant mitochondrial–nuclear interactions. Trends Genet. 23, 81–90 (2007)
Article CAS Google Scholar
Hu, J. et al. The rice pentatricopeptide repeat protein RF5 restores fertility in Hong-Lian cytoplasmic male-sterile lines via a complex with the glycine-rich protein GRP162. Plant Cell 24, 109–122 (2012)
Article CAS Google Scholar
Mootha, V. K. et al. Identification of a gene causing human cytochrome c oxidase deficiency by integrative genomics. Proc. Natl Acad. Sci. USA 100, 605–610 (2003)
Article CAS ADS Google Scholar
Ruzzenente, B. et al. LRPPRC is necessary for polyadenylation and coordination of translation of mitochondrial mRNAs. EMBO J. 31, 443–456 (2012)
Article CAS Google Scholar
Howard, M. J., Lim, W. H., Fierke, C. A. & Koutmos, M. Mitochondrial ribonuclease P structure provides insight into the evolution of catalytic strategies for precursor-tRNA 5′ processing. Proc. Natl Acad. Sci. USA 109, 16149–16154 (2012)
Article CAS ADS Google Scholar
Ringel, R. et al. Structure of human mitochondrial RNA polymerase. Nature 478, 269–273 (2011)
Article CAS ADS Google Scholar
Wang, X., McLachlan, J., Zamore, P. D. & Hall, T. M. Modular recognition of RNA by a human pumilio-homology domain. Cell 110, 501–512 (2002)
Article CAS Google Scholar
Grove, T. Z., Cortajarena, A. L. & Regan, L. Ligand binding by repeat proteins: natural and designed. Curr. Opin. Struct. Biol. 18, 507–515 (2008)
Article CAS Google Scholar
Deng, D. et al. Structural basis for sequence-specific recognition of DNA by TAL effectors. Science 335, 720–723 (2012)
Article CAS ADS Google Scholar
Gao, H., Wu, X., Chai, J. & Han, Z. Crystal structure of a TALE protein reveals an extended N-terminal DNA binding region. Cell Res. 22, 1716–1720 (2012)
Article CAS Google Scholar
Kobayashi, K. et al. Identification and characterization of the RNA binding surface of the pentatricopeptide repeat protein. Nucleic Acids Res. 40, 2712–2723 (2012)
Article CAS Google Scholar
Filipovska, A. & Rackham, O. Modular recognition of nucleic acids by PUF, TALE and PPR proteins. Mol. Biosyst. 8, 699–708 (2012)
Article CAS Google Scholar
DeLano, W. L. The PyMOL Molecular Graphics System. http://www.pymol.org (2002)
Otwinowski, Z. & Minor, W. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 276, 307–326 (1997)
Article CAS Google Scholar
Collaborative Computational Project, 4. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. D 50, 760–763 (1994)
Article Google Scholar
Schneider, T. R. & Sheldrick, G. M. Substructure solution with SHELXD. Acta Crystallogr. D 58, 1772–1779 (2002)
Article Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004)
Article Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007)
Article CAS Google Scholar
Adams, P. D. et al. PHENIX: building new software for automated crystallographic structure determination. Acta Crystallogr. D 58, 1948–1954 (2002)
Article Google Scholar
Schuck, P. On the analysis of protein self-association by sedimentation velocity analytical ultracentrifugation. Anal. Biochem. 320, 104–124 (2003)
Article CAS Google Scholar

Download references

Acknowledgements

We thank X. Yu and Y. Chen at the Institute of Biophysics, Chinese Academy of Sciences, for technical support. We thank K. Hasegawa and T. Kumasaka at the SPring-8 beamline BL41XU for on-site assistance. This work was supported by funds from the Ministry of Science and Technology (grant number 2011CB910501 for N.Y.), and Projects 91017011 (N.Y.), 31070644 (N.Y.), 31021002 (Y.S., N.Y., J.W.) and 31200567 (P.Y.) of the National Natural Science Foundation of China. The research of N.Y. was supported in part by an International Early Career Scientist grant from the Howard Hughes Medical Institute.

Author information

Ping Yin and Quanxiu Li: These authors contributed equally to this work.

Authors and Affiliations

State Key Laboratory of Bio-membrane and Membrane Biotechnology, Tsinghua University, Beijing, 100084, China
Ping Yin, Quanxiu Li, Ying Liu, Jiawei Wang & Nieng Yan
Center for Structural Biology, School of Life Sciences and School of Medicine, Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing, 100084, China
Ping Yin, Quanxiu Li, Chuangye Yan, Ying Liu, Junjie Liu, Hong-Wei Wang, Jiawei Wang, Yigong Shi & Nieng Yan
Ministry of Education Key Laboratory of Protein Science, Tsinghua University, Beijing, 100084, China
Chuangye Yan, Junjie Liu, Hong-Wei Wang & Yigong Shi
Shanghai Institute of Applied Physics, Chinese Academy of Sciences, 239 Zhangheng Road, Shanghai 201204, China,
Feng Yu & Jianhua He
State Key Laboratory of Medicinal Chemical Biology and College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin 300071, China,
Zheng Wang & Jiafu Long
Shanghai Center for Plant Stress Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200032, China
Jian-Kang Zhu
Department of Horticulture and Landscape Architecture, Purdue University, West Lafayette, 47907, Indiana, USA
Jian-Kang Zhu

Authors

Ping Yin
View author publications
You can also search for this author in PubMed Google Scholar
Quanxiu Li
View author publications
You can also search for this author in PubMed Google Scholar
Chuangye Yan
View author publications
You can also search for this author in PubMed Google Scholar
Ying Liu
View author publications
You can also search for this author in PubMed Google Scholar
Junjie Liu
View author publications
You can also search for this author in PubMed Google Scholar
Feng Yu
View author publications
You can also search for this author in PubMed Google Scholar
Zheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jiafu Long
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua He
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Wei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jiawei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jian-Kang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yigong Shi
View author publications
You can also search for this author in PubMed Google Scholar
Nieng Yan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.Y., Q.L., J.-K.Z., Y.S. and N.Y. designed all experiments. P.Y., Q.L., C.Y., Y.L., J.L., F.Y., Z.W., J.L., J.H., H.-W.W., J.W. and N.Y. performed the experiments. All authors analysed the data and contributed to manuscript preparation. N.Y. wrote the manuscript.

Corresponding author

Correspondence to Nieng Yan.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Extended data figures and tables

Extended Data Figure 1 Sequence alignment of the 19 repeats of PPR10.

a, PPR10 from maize specifically recognizes two RNA elements. The cartoon above illustrates the predicted domain organization of PPR10. 1 and 19 refer to the repeat numbers. CTP, chloroplast transit peptide. The blue brick with ‘?’ represents a fragment of approximately 30 amino acids whose function remains to be characterized. The minimal RNA elements of PSAJ and ATPH that are targeted by PPR10 are shown below the cartoon. b, Sequence alignment of 19 repeats in PPR10. The secondary structural elements of a typical PPR motif are shown above. The residues at the 2nd, 5th and 35th positions which were predicted to be the molecular determinants for RNA-binding specificity are highlighted in magenta. The RNA sequences that can be recognized by PPR10 are listed on the right, 5′ to 3′ from top to bottom. The nucleotides which are recognized by PPR10 in a modular fashion in the PSAJ–PPR10 structure are shaded grey. c, The three numbering systems for a PPR motif. 1 is being used by Lurin et al.¹², Barkan et al.⁶ and others; 2 is adopted by the Pfam database and being used by Kobayashi et al.²⁸, Yagi et al.⁷ and others; and 3 is our proposed, structure-based numbering system. The residues that are predicted to specifically recognize RNA are coloured magenta.

Extended Data Figure 2 AUC-SE of PPR10 (residues 37–786, C256S/C279S/C430S/C449S) in the absence or presence of the target RNA elements.

The molar concentrations of PPR10 are indicated above each panel. PPR10 and the RNA oligonucleotides were mixed at a stoichiometric ratio of approximately 1:1.5. Details of the experiments are described in Methods.

Extended Data Figure 3 The two protomers of the RNA-bound PPR10 dimer exhibit similar conformations.

a, The two protomers can be superimposed with a root-mean-squared deviation of 1.31 Å over 629 Cα atoms. b, c, The two ssRNA segments are coordinated by the PPR10 dimer similarly. The 5′ and 3′ segments of the bound PSAJ RNA are separately coordinated by the N-terminal repeats of one protomer (b), and the C-terminal repeats of the other protomer (c). Stereo-views are shown for all panels.

Extended Data Figure 4 Electron density maps for a bound ssRNA segment.

a, The 2F_o– F_c electron density for one segment of the bound PSAJ RNA. The electron density, contoured at 1σ and coloured blue, is displayed in stereo. b, c, The anomalous signals for bromine in the structures where the highlighted nucleotides were substituted with 5-bromouracil (5-BrU). The anomalous signals, shown in magenta mesh, are contoured at 5σ.

Extended Data Figure 5 Coordination of the bound ssRNA by PPR10.

a, The 5′ and 3′ portions of the PSAJ RNA element are separately bound by the N-terminal and C-terminal repeats of the two PPR10 protomers. Shown here is a close-up view of the binding of PSAJ by one end of the PPR10 dimer. b, The nucleotides U5–A10, which form a U-turn in the ssRNA, are uncoordinated in the cavity of the PPR10 dimer. The two protomers of PPR10 are shown in semi-transparent surface contour. c, The RNA backbone is coordinated by polar or charged residues through hydrogen bonds. The hydrogen bonds are represented by red dotted lines. The two protomers of PPR10 are coloured light purple and grey.

Extended Data Figure 6 Mutational analysis of PPR10 residues that may be important for PSAJ.

a, EMSA analysis of the interaction between PPR10 (residues 37–786, C256S/C279S/C430S/C449S) and PSAJ (5′-GUAUUCUUUAAUUAUUUC-3′). PPR10 was added with increasing concentrations of 0, 2, 4, 8, 16, 31, 63, 125, 250, 500, 1,000 nM in lanes 1–11 with approximately 40 pM ³²P-labelled PSAJ in each lane. b, Mutational analysis of the 5th residues of the indicated PPR motifs. The indicated point mutations were introduced to PPR10 (residues 37–786, C256S/C279S/C430S/C449S). c, Examination of the 2nd residues in repeats 3 and 5. d, Examination of the 35th residue of repeat 6. Note that the side group of Asp 314 is hydrogen bonded to the side chain of Asn 284, the 5th residue of repeat 6. The same structural feature is also seen in repeat 4 (Fig. 3b).

Extended Data Figure 7 The predicted coordination of base C by an Asn at the 5th position of a PPR motif.

Left, the coordination of base U by Asn observed in the structure. Right, the coordination of base C by Asn at the 5th position of a PPR motif modelled on the basis of the structure shown on the left.

Extended Data Figure 8 Comparison of ssRNA coordination by PUF and PPR proteins.

a, The structure of the human PUF protein PUM1 (also known as HSPUM) bound to the RNA element NRE1-19 (PDB accession code, 1M8W)²⁴. The PUF repeats constitute an arc with 8-nt ssRNA bound to the concave side. Notably, the orientations of the bound RNA and the protein are antiparallel, namely the 5′ end is close to the C terminus of PUF. b, The structure of a PUF repeat. One canonical PUF repeat contains three helices, of which a short helix precedes a helical hairpin. c, Representative recognition of the RNA bases G, U, A by PUF repeats as seen in the structure of PUM1 bound to NRE1-19. The amino acids are labelled by the repeat number (r5, r6, r7, r8) followed by its one-letter code and position on the 2nd helix within a PUF repeat (S3, N4, E8, and so on). The same scheme applies to d. d, The coordination of RNA bases G, U, A by PPR10. It is noteworthy that PUF and PPR proteins share several common features for RNA binding: (1) the ssRNA elements are coordinated by the helices on the inner layer; and (2) the base is sandwiched mostly by hydrophobic residues or Arg. Yet the differences are evident between the two families of repeat proteins. As seen in c, the RNA base is usually coordinated by two residues that are located at the 4th and the 7th positions on helix 2 within a PUF repeat. By contrast, the base is mainly coordinated by the 5th residue of a PPR motif. The 35th residue, the last residue of a PPR motif that is located at a loop region preceding the next PPR motif, also contributes to base recognition.

Extended Data Table 1 Statistics of data collection and refinement

Full size table

Extended Data Table 2 Statistics of data collection

Full size table

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yin, P., Li, Q., Yan, C. et al. Structural basis for the modular recognition of single-stranded RNA by PPR proteins. Nature 504, 168–171 (2013). https://doi.org/10.1038/nature12651

Download citation

Received: 28 May 2013
Accepted: 12 September 2013
Published: 27 October 2013
Issue Date: 05 December 2013
DOI: https://doi.org/10.1038/nature12651

This article is cited by

Differential adaptive RNA editing signals between insects and plants revealed by a new measurement termed haplotype diversity
- Yuange Duan
- Ye Xu
- Hu Li
Biology Direct (2023)
A mitochondrial pentatricopeptide repeat protein enhances cold tolerance by modulating mitochondrial superoxide in rice
- Xiaofeng Zu
- Lilan Luo
- Xiaofeng Cao
Nature Communications (2023)
The PPR-Domain Protein SOAR1 Regulates Salt Tolerance in Rice
- Kai Lu
- Cheng Li
- Ya-Dong Zhang
Rice (2022)
U-to-C RNA editing by synthetic PPR-DYW proteins in bacteria and human culture cells
- Mizuho Ichinose
- Masuyo Kawabata
- Bernard Gutmann
Communications Biology (2022)
DYW domain structures imply an unusual regulation principle in plant organellar RNA editing catalysis
- Mizuki Takenaka
- Sachi Takenaka
- Gert Weber
Nature Catalysis (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.