Hyperphosphorylation of the C-terminal domain (CTD) of the RPB1 subunit of human RNA polymerase (Pol) II is essential for transcriptional elongation and mRNA processing1,2,3. The CTD contains 52 heptapeptide repeats of the consensus sequence YSPTSPS. The highly repetitive nature and abundant possible phosphorylation sites of the CTD exert special constraints on the kinases that catalyse its hyperphosphorylation. Positive transcription elongation factor b (P-TEFb)—which consists of CDK9 and cyclin T1—is known to hyperphosphorylate the CTD and negative elongation factors to stimulate Pol II elongation1,4,5. The sequence determinant on P-TEFb that facilitates this action is currently unknown. Here we identify a histidine-rich domain in cyclin T1 that promotes the hyperphosphorylation of the CTD and stimulation of transcription by CDK9. The histidine-rich domain markedly enhances the binding of P-TEFb to the CTD and functional engagement with target genes in cells. In addition to cyclin T1, at least one other kinase—DYRK1A6—also uses a histidine-rich domain to target and hyperphosphorylate the CTD. As a low-complexity domain, the histidine-rich domain also promotes the formation of phase-separated liquid droplets in vitro, and the localization of P-TEFb to nuclear speckles that display dynamic liquid properties and are sensitive to the disruption of weak hydrophobic interactions. The CTD—which in isolation does not phase separate, despite being a low-complexity domain—is trapped within the cyclin T1 droplets, and this process is enhanced upon pre-phosphorylation by CDK7 of transcription initiation factor TFIIH1,2,3. By using multivalent interactions to create a phase-separated functional compartment, the histidine-rich domain in kinases targets the CTD into this environment to ensure hyperphosphorylation and efficient elongation of Pol II.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
We thank S. McKnight, M. Geyer, J. Hurley and their colleagues for providing the various expression plasmids, and U. Schulze-Gahmen for technical help. This work was supported by the National Institutes of Health grant R01AI041757 to Q.Z. and the California Institute of Regenerative Medicine grant LA1-08013 to X.D.Reviewer information
Nature thanks J. Lis, D. Taatjes and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Extended data figures and tables
Extended Data Fig. 1 The CYCT1 HRD-dependent hyperphosphorylation of Pol II CTD52 by affinity-purified CDK9–CYCT1–Flag dimer can also be detected with anti-phospho-Ser2 antibody.
a, Examination of the affinity-purified CDK9–CYCT1–Flag dimer containing wild-type CYCT1–Flag by SDS–PAGE and silver staining. The anti-Flag affinity-purification was performed under high salt plus detergent (1 M KCl + 1% NP-40) conditions to strip away all the P-TEFb-associated factors but keep the CDK9–CYCT1 interaction intact. b, Affinity-purified CDK9–CYCT1–Flag heterodimers containing the indicated CYCT1–Flag proteins were tested in kinase reactions containing a mixture of GST-fused CTD52 and CTD9 as the substrates. The phosphorylated p-CTD52 and p-CTD9 were detected by western blotting with the anti-phospho-Ser2 antibody 3E10. Although a very similar pattern of CTD phosphorylation was detected with both anti-phospho-Ser2 and anti-phospho-Ser5 antibodies, the unphosphorylated CTD present in the ATP(–) lanes was only detected by the former antibody, making the phospho-Ser5 antibody a preferred choice for detecting CTD phosphorylation in these kinase reactions.
Extended Data Fig. 2 The CYCT1 HRD is required for efficient transcription of human immediate early genes as well as the HSP70-1 gene under both basal and heat-shock conditions.
a, Confirmation by western blotting of the knockdown of endogenous CYCT1 expression in HEK293T cells expressing the CYCT1 (also known as CCNT1)-specific shRNA (shCYCT1). b, Anti-Flag western blotting analysis of the expression of either wild-type CYCT1–Flag or CYCT1ΔHRD–Flag from the shCYCT1-resistant plasmid introduced into the knockdown cells. The α-tubulin and CDK9 levels were used as controls. c, The HRD-dependent transcription of four cellular immediate early genes. The mRNA levels of the indicated immediate early genes in the knockdown cells expressing wild-type CYCT1 or CYCT1ΔHRD–Flag (analysed in b) were examined by qRT–PCR, normalized to that of GAPDH and shown. The activity in the first column of each group was set to 1. Data are mean ± s.d., n = 3, and P values from two-tailed Student’s t-test. d, The CYCT1 HRD is required for optimal HSP70-1 transcription under both basal and heat-shock conditions. The HSP70 mRNA levels in the knockdown cells expressing wild-type CYCT1 or CYCT1ΔHRD were examined under heat-shock or non-heat-shock conditions, as in c. e, HeLa cells were transfected in twofold increments with the plasmid expressing either wild-type CYCT1–Flag or CYCT1ΔHRD–Flag. Anti-Flag immunoprecipitates (IP) from nuclear extracts were examined by western blotting for the proteins labelled on the left. The levels of the co-precipitated MEPCE, AFF4 and AFF1 were first quantified and then normalized to that of their corresponding CYCT1–Flag bait. The levels of MEPCE, AFF4 and AFF1 bound to low, middle and high levels of CYCT1ΔHRD–Flag were then divided by those of MEPCE, AFF4 and AFF1 bound to the corresponding levels of wild-type CYCT1–Flag and shown in lanes 5–7, with the numbers in lanes 2–4 set to 1 as a reference.
Extended Data Fig. 3 The HRD promotes the binding of CYCT1 to activated gene expression array, endogenous genes and the RNA Pol II CTD.
a, Estimates of parameters extracted from fitting of FRAP data (top) and parameters calculated from the fitted FRAP data (bottom). b, Model fit overlaid on raw FRAP data. A full description of how the reaction–diffusion FRAP model was fitted to the data is provided in Supplementary Information. c, Anti-CYCT1 western blotting analysis of the levels of endogenous CYCT1, and the stably expressed Halo–CYCT1 and Halo–CYCT1ΔHRD, in lysates of the engineered U2OS cell lines. The α-tubulin levels provided an internal control. d, Histograms of displacements at the indicated Δτ with three-state model fit overlaid. The three-state model is described in Supplementary Information. e, Table of best-fit parameters from fitting Spot-On to the raw displacements for three independent replicates (about 5–10 cells per replicate). Values in the table are mean ± s.d., n = 3.
Extended Data Fig. 4 Examination of recombinant P-TEFb and CAK for their phosphorylation of the Pol II CTD in kinase reactions and contribution of the DYRK1A HRD to DYRK1A–Pol II interaction.
a, b, Compared to CDK9 in recombinant P-TEFb, CDK7 in recombinant CAK (CDK7–CYCH–MAT1) shows decreased ability to hyperphosphorylate CTD52 in a time- and dosage-dependent manner. The indicated amounts of baculovirus-produced recombinant P-TEFb or CAK (Millipore) were added to in vitro kinase reactions that also contained GST–CTD52 as the substrate. The reactions were performed for the indicated periods of time. The products were analysed by western blotting with the phospho-Ser5 antibody. CDK7 in recombinant CAK and CDK9 in recombinant P-TEFb were also examined by western blotting. c, The deletion of the HRD causes DYRK1A to decrease interaction with RNA Pol II but not DCAF7. Nuclear extract (NE) of HeLa cells expressing the indicated proteins and anti-Flag immunoprecipitates derived from NE were analysed by western blotting with the various antibodies as labelled.
Extended Data Fig. 5 GFP–T1-IDR purified from recombinant E. coli forms phase-separated liquid droplets that are HRD-dependent and highly sensitive to elevated salt concentration and exposure to 1,6-hexanediol.
a, Alignment of CYCT1 amino acid sequences in the HRD regions among the indicated vertebrate species, with the central histidine cluster shaded in red. b, Intrinsic disorder tendency was predicted by IUPred across the entire length of CYCT1. The scores are assigned between 0 and 1, and a score above 0.5 indicates disorder. The HRD region is shaded in blue. The longest stretch of IDR is labelled as T1-IDR, and its boundaries are marked. c, d, The C-terminally Strep-tagged GFP–T1-IDR fusions (c) and N-terminally His-tagged mCherry–CTD fusion (d) were purified from recombinant E. coli BL21 cells (Supplementary Information). Ten micrograms of each of the fusion proteins was examined by SDS–PAGE followed by Coomassie blue staining. e, Protein solutions containing either wild-type GFP–T1-IDR or GFP–T1-IDRΔHRD at 6 mg ml−1 were adjusted to the indicated salt concentrations and their appearances in Eppendorf tubes are shown. f, NaCl concentrations in the wild-type GFP–T1-IDR solution were changed in the indicated sequential order and then examined under a fluorescence microscope. g, Protein solutions containing wild-type GFP–T1-IDR at 6 mg ml−1 were adjusted to 37.5 mM NaCl with or without 10% 1,6-hexanediol, and then examined under a fluorescence microscope. h, Live-cell images of a HeLa cell expressing either wild-type eGFP–CYCT1 or eGFP–CYCT1ΔHRD at levels similar to that of endogenous CYCT1.
Extended Data Fig. 6 The longest stretch of IDR in DYRK1A promotes formation of phase-separated droplets in an HRD-dependent manner.
a, Intrinsic disorder tendency was predicted by IUPred across the entire length of DYRK1A. The scores are assigned between 0 and 1, and a score above 0.5 indicates disorder. The HRD region is shaded in blue. The longest stretch of IDR is labelled as DYRK1A-IDR and its boundaries are marked. b, The indicated C-terminally Strep-tagged GFP fusions were purified from recombinant E. coli BL21 cells. Two micrograms each of the fusion proteins was examined by SDS–PAGE followed by Coomassie blue staining. c, Solutions containing 5 mg ml−1 of the indicated fusion proteins, 37.5 mM NaCl and 10% PEG8000 were trapped between coverslips and examined with a microscope under either fluorescent (488 nm) or normal white light (brightfield).
Extended Data Fig. 7 The central histidine cluster within the CYCT1 HRD is essential for promotion of phase separation by CYCT1 IDR in vitro and in cells, and for P-TEFb to hyperphosphorylate the Pol II CTD and activate HIV transcription.
a, The nine histidines in the central histidine cluster within the CYCT1 HRD are highlighted in red and changed to alanines in the 9A mutant. b, The C-terminally Strep-tagged GFP–T1-IDR fusions containing either wild-type GFP–T1-IDR or the GFP–T1-IDR(9A) mutant sequence were purified from E. coli BL21 cells and examined by SDS–PAGE followed by Coomassie blue staining. c, CDK9–CYCT1–Flag heterodimers containing the indicated CYCT1–Flag proteins were affinity-purified from HeLa cells and tested in kinase reactions, with the reaction products analysed as in Fig. 1a. d, Plasmids expressing the Gal4 DNA binding domain fused to the indicated CYCT1 proteins were co-transfected into HeLa cell with a HIV-1 LTR–luciferase reporter construct containing the UAS for Gal4. Luciferase activities in cell extracts were measured and analysed as in Fig. 1b. e, Fixed and permeabilized HeLa cells expressing wild-type CYCT1–Flag or CYCT1(9A)–Flag were examined by indirect immunofluorescence with the mouse anti-Flag monoclonal antibody and Alexa Fluor 488-conjugated goat anti-mouse secondary antibody. DNA was counterstained using DAPI.
Extended Data Fig. 8 CYCT1 binds directly to the Pol II CTD in an HRD-dependent manner and the binding is enhanced after the CTD is phosphorylated by CAK (CDK7–CYCH–MAT1).
a, Immobilized GST–CTD was incubated with recombinant GFP–T1-IDR or GFP–T1-IDRΔHRD. The input (2.5%) and the bound proteins were analysed by western blotting. b, Binding reactions containing the purified recombinant fusion proteins indicated on the right were analysed in a 7.5 to 20% glycerol gradient containing 500 mM NaCl plus 0.5% NP-40, which was centrifuged at 55,000 r.p.m. and 4 °C for 13 h. The indicated fractions were analysed by western blotting to detect the distributions of proteins marked on the left. The entire length of the CTD could be bound by varying numbers of IDRs, resulting in the formation of a series of complexes with broad distributions in the gradient. c, mCherry–CTD was incubated with or without immobilized CAK for 6 h in kinase reactions and then analysed by SDS–PAGE and Coomassie blue staining. d, Immobilized GST–CTD was incubated with (phos.) or without (unphos.) CAK for 6 h in kinase reactions. After washing, the GST–CTD beads were incubated with GFP–T1-IDR. The indicated proteins were eluted off the beads and analysed by western blotting.
Extended Data Fig. 9 A model depicting how P-TEFb uses the CYCT1 HRD to target and recruit the Pol II CTD into a phase-separated compartment—which is formed by weak, multivalent homotypic interactions among the HRDs—to enable highly efficient phosphorylation of CTD by P-TEFb in the presence of ATP.
At 2.5% 1,6-hexanediol, the HRD-mediated phase separation but not the direct HRD–CTD binding is disrupted, making it impossible for P-TEFb to hyperphosphorylate the CTD.
This file contains Supplementary Figure 1, the uncropped gel source data with size marker indications, Supplementary Methods and Supplementary References.
Representative video for spaSPT data.
Representative video for slowSPT data.