The general transcription factor IID (TFIID) is the first component of the preinitiation complex (PIC) to bind the core promoter of RNA polymerase II transcribed genes. Despite its critical role in protein-encoded gene expression, how TFIID engages promoter DNA remains elusive. We have previously revealed a winged-helix DNA-binding domain in the N-terminal region of the largest TFIID subunit, TAF1. Here, we report the identification of a second DNA-binding module in the C-terminal half of human TAF1, which is encoded by a previously uncharacterized conserved zinc knuckle domain. We show that the TAF1 zinc knuckle aids in the recruit of TFIID to endogenous promoters vital for cellular proliferation. Mutation of the TAF1 zinc knuckle with defects in DNA binding compromises promoter occupancy of TFIID, which leads to a decrease in transcription and cell viability. Together, our studies provide a foundation to understand how TAF1 plays a central role in TFIID promoter binding and regulation of transcription initiation.
The formation of the preinitiation complex (PIC) is a step-wise process required for eukaryotic transcription of protein-encoded genes1,2. The highly-orchestrated assembly of the general transcription factors (GTFs) TFIIA, B, D, E, F, and H is responsible for properly positioning RNA polymerase II (RNAPII) at core promoters and initiating transcription. TFIID nucleates PIC formation by directly binding to promoter sequences. TFIIA further stabilizes this interaction, while TFIIB association is firmly established by TFIIF/Pol II recruitment, and TFIIE/TFIIH are essential for promoter clearance3,4.
TFIID is a multi-subunit complex comprised of the TATA binding protein (TBP) and 14 TBP associated factors (TAFs) in humans5. A small subset of these factors recognizes specific core promoter elements (CPEs) to orient the PIC at the transcription start site6,7,8. TFIID’s ability to selectivity recognize promoter sequences was initially thought to be largely dictated by TBP binding to the TATA box9. The primary evidence for this model centers on TBP’s ability to support a low level of transcription in the absence of TAFs, whose roles include responding to transcriptional activators10,11. Growing evidence supports the idea that TBP alone does not identify gene promoters, especially in higher organisms5,11,12. The majority of human protein-coding genes lack a TATA box sequence, yet TFIID is still able to engage the initiator and direct transcription from TATA-less promoters13. Therefore, other PIC components must be involved in dictating DNA specificity. A recent study demonstrated the TAF components of TFIID are vital for sequence selectivity while TBP alone showed no sequence preference14. This expands upon previous DNA footprinting analyses that illustrated an extended region of protection by TFIID compared to TBP, which was interpreted as TAFs binding near the transcriptional start site and downstream promoter sequences6,15.
The specific combination of CPEs varies from gene to gene, and their unique arrangement may help to ensure genes are expressed at specific times in response to cellular needs. Most of these elements are recognized by TAFs to enhance the interaction between TFIID and promoter DNA2. The Initiator (Inr) is positioned at the transcription start site of over 40% of human protein-encoding genes and is enriched at TATA-less promoters16,17. Motif ten element (MTE), downstream promoter element (DPE), and downstream core element (DCE) are located downstream of the transcription start site and thought to contribute to promoter recognition in the absence of a TATA box2,18,19. TAF1, the largest subunit of TFIID, has been shown to associate with the DCE, a discontinuous sequence found in viral promoters as well as the human beta globin gene19. TAF1 also interacts with the Inr when in complex with TAF26. While there is a wealth of knowledge about promoter sequences, little is known about TAF/DNA binding interfaces and their mode of interaction. Characterizing these protein-DNA interactions will advance our understanding of how the entire TFIID complex mediates transcriptional initiation.
Our previous work describing a winged helix (WH) located in the central DUF3591 domain provided the first look at TAF1’s promoter recognition capabilities20. When our structure of TAF1 DUF domain bound to TAF7 was modeled into the cryo-EM structure of promoter bound TFIID, it confirmed the WH can contact downstream promoter sequences21. The cryo-EM structure further revealed a small secondary density of protein contacting the Inr, which the authors attribute to TAF1. The advancements in cryo-EM have given us a general idea of the overall shape of TFIID and insights into the topological rearrangements that ensue upon promoter binding21,22. Yet, further refinements are necessary to definitively resolve all the crucial interfaces between the TAF subunits of TFIID and core promoter sequences during the dynamic structural reorganization evoked by promoter engagement.
The idea that TAF1 contains multiple DNA binding domains is supported by several pieces of evidence. In addition to structural data, TAF1 has been shown to bind two entirely distinct and separate promoter elements (i.e. Inr and DCE) implying multiple binding modules6,19. The DCE is a cluster of three non-contiguous sequences. Each was shown to be important for transcription and can crosslink to TAF1; the current model suggests TAF1 directly binds the entire DCE, a region too large to be bound by the WH alone. Moreover, yeast TAF1 has been shown to contain a promoter-binding region near its C-terminus outside the WH23. This region corresponds to sequence adjacent to the double bromodomain in the human homolog, and yeast TAF1 lacking this region was unable to bind promoter DNA. Multiple DNA binding domains in TAF1 may convey plasticity to allow recognition of diverse promoter sequences and permit structural rearrangements during promoter engagement.
Hidden between the double bromodomain and the DUF domain of TAF1, we discovered a strictly conserved CCHC zinc knuckle (ZnK) that plays a vital role in transcriptional regulation. By examining human TAF1 ZnK function under both native and in vitro conditions, we identified a second previously uncharacterized DNA binding module in the largest subunit of TFIID. Our study shows that TAF1 ZnK is important for cellular viability and helps to direct TFIID to endogenous core promoters through its DNA binding abilities. This work establishes a basis for dissecting the complexity of TFIID promoter recognition.
TAF1 contains an evolutionarily conserved zinc knuckle motif
TAF1 is an essential protein found in all eukaryotic organisms. The preservation of TAF1 across these kingdoms signifies its biological importance. By using sequence information and mapping regions of high conservation, we identified portions of the protein that were selectively maintained throughout evolution. A sequence alignment performed on full-length TAF1 using eight species that span the eukaryotic kingdom revealed two domains characterized by strictly conserved residues, which are absent in other regions of the protein (Fig. 1A). The DUF3591 domain is the largest region of conservation. Notably, in the ts13 mutant cell line, this domain harbors a temperature-sensitive point mutation (G716D) that causes cells to cell cycle arrest in late G1 and eventually undergo apoptosis24,25,26. We previously characterized the structural features of this domain with its binding partner TAF7 and discovered a winged-helix (WH)20. The second region of strict conservation sits between DUF3591 and the double bromodomain (2 × Bromo). Zooming in on the amino acid sequence within this region reveals an invariable Cx2Cx4Hx6C motif (Fig. 1B). Bioinformatics predicts this motif as a putative zinc knuckle (ZnK), a common bimolecular interacting domain.
The CCHC ZnK is a widely occurring domain commonly found in nucleic acid binding proteins27,28,29. A large fraction of CCHC-domain containing proteins, which can be aligned with TAF1 ZnK, are involved in DNA binding, hinting that TAF1 ZnK also may interact with DNA (Fig. 1C). In addition to the zinc coordinating residues, several other amino acids are common to the majority of CCHC motifs: glycine following the second cysteine, glycine preceding the histidine, and proline after the third cysteine. The consistent location of these residues suggests that they may be important for proper domain folding. The spacing of the first two cysteines and histidine is strictly conserved; however, the spacing between the histidine and third cysteine can vary. Interestingly, TAF1 shares the same spacing of two proteins: I-factor from Drosophila and FAM90a in humans30,31,32. I factor is a LINE-like transposable element and contains a ZnK in ORF1, which has been shown to bind DNA31. FAM90 is a protein family found in primates, thought to have arisen during multiple duplication and rearrangement events32. The ZnK in FAM90 has been proposed to function as a DNA binding domain. The commonality of DNA binding to this similarly spaced group of zinc knuckle proteins further suggests that TAF1 ZnK may interact with DNA.
A structural model of TAF1 ZnK was generated using I-TASSER33 and shows a compact fold with the C-C-H-C side chains pointing towards the center, allowing for coordination of a zinc ion (Fig. 1D). Our analysis estimated the probability of ligand binding for the C-C-H-C to be above 90% based on the COACH binding prediction. A structural homology search identified HIV-1 nucleocapsid protein and lin-28 as top hits. Both proteins are known to interact with nucleic acid, and solution structures of these proteins bound to RNA have been solved28,34,35. An alignment between these structures reveals substantial similarities in the zinc coordinating residues as well as overlapping charged residues (Fig. 1D). Charged surface analysis on ZnK of TAF1 identified one side of the ZnK fold that is enriched with positively charged residues with positive electrostatic potential (Fig. 1E). Positive electrostatic patches are a highly predictive hallmark of DNA binding proteins. Furthermore, this charged surface displays considerable similarity to the HIV-1 nucleic acid binding surface and contains two strictly conserved positively charged amino acids that may be critical for electrostatic interactions (Fig. 1E). These outward facing positively charged residues produce an interface to presumably interact with negatively charged nucleic acids. Taken together, these features signify TAF1 ZnK has the potential to function as a DNA binding domain and thus contribute to TFIID promoter recognition.
TAF1 zinc knuckle motif is essential for cell viability
Given the strict conservation of the TAF1 ZnK domain, we asked whether it plays a physiological function in cells. We used the ts13 cell viability assay to determine if the ZnK domain is required to complement the ts13 temperature-sensitive G1 growth defect. ts13 cells contain a conditional mutation in TAF1 and proliferate at the permissive temperature of 33.5 °C24,25. When shifted to 39.5 °C, the cells arrest in late G1 and undergo apoptosis unless a functional copy of TAF1 is exogenously expressed25,36,37. Cells transfected with wild-type TAF1 continue through the cell cycle and proliferate at the elevated temperature. Conversely, non-functional TAF1 results in a decrease in the number of viable cells. We chose to mutate the first two cysteines in the ZnK domain of TAF1 to ensure the motif was fully compromised. Changing the cysteines to alanines resulted in a ~60% reduction in cell viability, indicating a structurally intact ZnK is vital for full physiological function (Fig. 2A,B). The level of cell survival was comparable to point mutations in the previously characterized TAF1 winged helix (Fig. 2B). Moreover, combining the ZnK and WH mutations (3AZnM) tended to further reduce cell viability compared to the single mutants but the decrease was not statistically significant. This finding can be taken to suggest that both domains must be fully functional for proper TAF1 activity. Importantly, all TAF1 constructs were expressed at equivalent levels according to western blot analysis, indicating that the mutations do not cause protein instability (Fig. 2C).
Zinc knuckle and winged helix of TAF1 are necessary for cyclin gene transcription
TAF1 is known to be important for transcription of the cell cycle genes cyclin A, D1 and E36,37,38. The failure of ZnM and WH3A to complement the growth defect in ts13 cells signifies that these mutations possibly compromise a function of TAF1 important for cyclin gene expression. We determined that this functional defect is not due to an inability to incorporate into TFIID, but rather a deficiency in promoter recognition. We verified the TAF1 mutants can successfully integrate into the TFIID complex by expressing each TAF1 variant as an N-terminal HA-fusion protein (HA-TAF1), using an antibody against the TAF4 subunit to immunoprecipitate the entire TFIID complex, and immunoblotting for exogenously expressed TAF1 variants using an anti-HA antibody. The presence of other TFIID subunits was examined by western blotting (Fig. 3A) and silver staining (Supplemental Figure S5). We observed that all TAF1 variants incorporated into TFIID at relatively equal levels compared to the wild-type protein (Fig. 3A). By contrast, the TAF1 mutants did not associate with core promoters of cell cycle genes in chromatin immunoprecipitation experiments (Fig. 3B). For these studies, different HA-TAF1 variants were expressed in HEK293 cells and TAF1-bound promoter fragments were recovered and quantified by qPCR. Cyclin D1 and cyclin A2 were chosen as target promoters because the proper expression of these genes is required for cell cycle progression; their promoter function also has been shown to be dependent on TAF1 activity37,38,39. The specific promoter regions for qPCR amplification were selected based on ENCODE ChIP data for TAF1. Additionally, TAF1 promoter binding is considered synonymous with canonical TFIID binding, because TAF1 is unique to TFIID whereas other TAFs and TBP are found in additional auxiliary transcription regulatory complexes2,40. We detected that wild type TAF1 effectively bound to cyclin D1 and A2 promoter regions, while neither the ZnM nor WH3A mutant selectively enriched for these promoter fragments (Fig. 3B). Promoter association of the double mutant (3AZnM) was not significantly lower than either single domain mutant. The overall conclusion is that the WH3A and ZnM mutants consistently bind less efficiently to cyclin genes and that both domains can contribute equally to TFIID promoter recognition.
Next, we determined the impact loss of promoter recognition had on transcription levels from the cyclin D1 and A2 promoters. By leveraging a luciferase reporter assay, a conventional method for studying gene expression, we asked whether exogenously expressed TAF1 variants affected cyclin promoter function. The luciferase gene was placed under the control of the human cyclin D1 or cyclin A2 promoter. TAF1 variants were co-transfected with each luciferase reporter construct, and the amount of luciferase activity, a measure of promoter activity, was assessed approximately 36 hours post-transfection. Reporter activity in the absence of exogenously expressed TAF1 was quantified to assess the endogenous transcriptional activity of the cyclin promoters. This signal was used to normalize across experiments and served as a control ensuring transcription was not negatively impacted by expressing a single TFIID subunit. A nominal increase in TAF1 should enhance TFIID formation because under normal conditions TAF1 is in limited supply compared to other TFIID components41. Indeed, the expression of wild type TAF1 significantly increased the level of luciferase activity from both promoters compared to basal levels (Fig. 3C). In contrast, TAF1 with mutations to either the ZnK or WH failed to significantly stimulate the activity of the cyclin promoters above background levels. Similarly, the double mutant had no effect on cyclin promoter activity (Fig. 3C). Hence, there was no statistical difference between all TAF1 mutants and background levels. We also examined the effects of WT and mutant TAF1 expression on transcription from the endogenous cyclin D1 promoter in ts13 hamster cells, which were used for the cell complementation assays. We observed reduced stimulation of cyclin D1 transcription by the TAF1 domain mutants compared to WT TAF1 (Supplemental Figure S1). Collectively, these data suggest ZnK and WH are critical to recruit TFIID to core promoters and transcriptional initiation.
TAF1 Zinc Knuckle Binds DNA
Structure homology modeling predicts that TAF1 ZnK potentially interacts with nucleic acids. We performed electrophoretic mobility shift assays (EMSA) to determine if the ZnK domain binds DNA. The protein fragment used in this assay contained the annotated ZnK domain and flanking regions (ZnA, aa 1234–1375) so as to not exclude any potential interacting residues. Increasing concentrations of ZnA protein were incubated with three different 32P-labeled double-stranded DNA fragments then subjected to native polyacrylamide gel electrophoresis. The fragments represent an optimized promoter containing an Inr, MTE and DPE (IMD); an endogenous promoter, cyclin D1 (CD1P); and a random DNA sequence (Random). We observed that TAF1 ZnA bound to all three DNA fragments (Fig. 4A–C). We followed up this analysis by using an orthogonal technique, bio-layer interferometry (BLI), for a more quantitative measurement of binding affinity. Biotinylated double-stranded IMD, CD1P and Random oligonucleotides were loaded on streptavidin probes and incubated with different concentrations of ZnA protein. Association and dissociation kinetics were monitored in real-time over two consecutive 5-minute periods, respectively (Fig. 4D–F). Dissociation of the protein from the probe indicated that the proteins were not irreversibly aggregating on DNA. Because of the slow off rate observed, binding affinities were calculated based on steady state levels for each protein concentration and involved plotting steady state binding levels against protein concentration for each DNA. ZnA binds significantly better to the optimized promoter (IMD) and CD1P DNA (288 nM ± 46 nM, and 411 nM ± 101 nM, respectively) over random DNA (1011 nM ± 228 nM). This data correlates with the promoter strength of each fragment in luciferase reporter assays, where IMD exhibits the highest activity followed by the CD1 promoter (Supplemental Figure S2). Interestingly, the null construct lacking a promoter sequence is still able to support low levels of transcription indicating general transcription factors are able to bind and initiate transcription from random sequences. The lack of strong specificity may in fact be a necessary feature of TAF1 given the sequence diversity within Pol II promoters. Overall, these studies provide a possible mechanism for how compromising the TAF1 ZnK leads to diminished transcription.
Zinc Knuckle is critical for DNA binding
To delineate the minimal ZnK DNA binding domain, TAF1 ZnA protein was incubated with IMD DNA fragment, and the mixture subsequently exposed to increasing concentrations of subtilisin protease. Digestion products were separated by SDS-PAGE, and the pattern of protein fragments compared to the apo-ZnA digestion pattern, generated in the absence of IMD incubation. Protein regions bound to DNA were protected from proteolytic cleavage and led to three stabilized species (Fig. 5A). As anticipated, at the higher concentrations of protease, ZnA is fully degraded as DNA-binding is a dynamic process and the DNA is unable to permanently protect the protein from enzymatic degradation. Quantification of the relative intensity of the different proteolytic products (Supplemental Figure S3) revealed that the full-length construct (ZnA) was considerably stabilized along with two smaller fragments (ZnC and ZnD), which were subsequently N-terminally sequenced. Surprisingly, fragment ZnC began at the same amino acid as the intact ZnA protein (aa 1234). The conversion of ZnA into ZnC, therefore, must be due to cleavage of the C-terminus. The N-terminus of the ZnD fragment was mapped to amino acid 1256, indicating that N-terminal cleavage of ZnC results in ZnD production. Interestingly, ZnC was the least stabilized by the IMD promoter, suggesting that DNA binding more effectively prevents cleavage of the C-terminus than the N-terminus of the ZnA fragment.
To further define the TAF1 ZnK domain, we analyzed ZnA sequence with EVfold, a program that mines evolutionary information to detect connections between residues in a protein and predicts a three-dimensional shape based on the co-conservation of amino acids42. Strong connections are given high coupling scores and indicate the residues have a high probability of existing in the same three-dimensional space. Plotting the connections illustrates the distance between co-evolved residues and identifies modular domains within proteins. EVfold analysis revealed the ZnA construct contains two smaller subdomains: the N-terminus from aa 1234–1313, which contains the ZnK, and a C-terminal domain spanning aa 1313–1375 (Fig. 5B). It is very striking that the defining boundary between these two subdomains corresponds to the end of the yeast TAF1 homolog. An alignment of fungal TAF1 proteins reveals that sequence conservation ends three amino acids after the last zinc knuckle cysteine residue. Taken together, we mapped the potential minimal ZnK binding domain to be aa 1256–1303 (ZnD) (Fig. 5C). We then generated two constructs, ZnC (aa 1234–1303) and ZnD, and purified these protein fragments, which were further analyzed by BLI using IMD loaded probes (Fig. 5D,G). The Kd values for ZnC (343 nM ± 42 nM) and ZnD (503 nM ± 67 nm) were similar to that obtained for ZnA, indicating that ZnD represents the core domain of DNA binding. As expected, mutations to the strictly conserved cysteines completely abolished binding (Fig. 5E,H). Due to the lack of appreciable signal above background, this binding data could only be used for qualitative assessment. Next, we sought to determine the residues critical for DNA binding. Guided by our predicted structural model, we mutated two positively charged residues between the histidine and third cysteine, which are located on the ZnK predicted DNA binding surface and are conserved across species (see Fig. 1B,E). Consistent with a critical role in binding DNA, mutation of these two positively charged residues either substantially compromised or completely abrogated the DNA binding activity of ZnC and ZnD, respectively (Fig. 5F,I). The residual DNA binding activity of the ZnC_RK mutant is most likely attributable to several additional positively charged residues at the N-terminal region of the ZnC fragment. Together, these experiments demonstrate the capacity of TAF1 ZnK to bind to DNA and map two charged residues essential for this activity. The identification of an additional DNA binding module in human TAF1 implies TFIID employs a multi-level approach to engage DNA, and with this knowledge we will continue to progress our understanding of promoter recognition and transcriptional initiation.
CCHC zinc knuckles are critical physiological domains and are found across the biological spectrum. The majority of ZnKs are involved in nucleotide processing including chaperoning, splicing, transcriptional activation and termination27,28,29. The ZnK found in TAF1 has a unique spacing shared by only a small number of annotated ZnK proteins, I factor and FAM90a, both of which have been annotated as interacting with DNA31,32. Given the propensity for zinc knuckles to bind DNA and TFIID’s role in transcription, we set out to determine if TAF1 ZnK interacts with promoter DNA. We found that TAF1 ZnK can bind to DNA with an affinity similar to other zinc knuckles43,44 and identified the residues important for this function. The sequence specificity of the ZnK, while significant, is not absolute. This mirrors ChIP-seq studies that fail to identify any sequences enriched by endogenous TAF1. This data strengthens our findings that show a marginal yet significant preference for promoter DNA over a random sequence. TAF1 ZnK is still able to bind random DNA with considerable affinity, 1 μM, suggesting ZnK may bind with high affinity at some promoters and low affinity at others. This also could be argued as a caveat of using an in vitro system to understand the complexities of physiological conditions. Follow-up experiments such as ChIP seq with mutant forms of TAF1 could help resolve this proviso and identify any sequences that preferentially bind an intact ZnK domain over WHD. Likewise, structural studies would definitely establish the mechanism of binding to DNA and can be used to clearly visualize if the protein/DNA contacts could allow for flexibility in sequence recognition.
We further show that TAF1 ZnK is important for cellular viability. Mutations in the ZnK lead to a reduction in TFIID association with cyclin D1 and A2 promoter sequences, as shown by a decrease in TAF1 ZnM promoter binding by ChIP and a lack of transcriptional stimulation in luciferase reporter activity assays when compared to wild type TAF1. The discovery of a second DNA binding module in human TAF1 is a notable step forward for our understanding of transcriptional initiation. TAF1 DNA binding activity is a vital yet under studied aspect of transcription. Further elucidation of TAF1’s role in core promoter recognition will have a profound impact on our perception of transcriptional regulation.
TFIID promoter recognition is an essential step in transcriptional regulation, yet we are only beginning to untangle the complexities of its mechanism. Our data identified a new conserved element in TAF1 that can contribute to the stabilization of TFIID at RNAPII promoters during PIC formation. There are several possible explanations as to why TAF1 has evolved multiple DNA binding domains (DBDs). Firstly, several points of contact could aid in forming a secure connection between TFIID and a corresponding promoter long enough to recruit the other essential components of the transcriptional machinery. Additionally, multiple DBDs could confer TFIID the necessary flexibility to associate with a variety of core promoter sequences throughout the human genome, such that recognition of different promoters might involve different DNA binding domains. Last but not least, multiple DBDs might be important for the dynamic rearrangement of TFIID, which transitions from the conical form to the promoter bound form, as demonstrated by Cryo-EM studies21,22. Different DNA binding modules may be necessary to support this movement, properly orient TFIID, and load GTFs onto the core promoter. With these possibilities in mind, we raise two models for how TAF1 contributes to TFIID promoter recognition (Fig. 6). Either simultaneous binding where the WH and ZnK equally contribute to DNA binding or sequential binding where one DNA binding domain engages during the initial stages followed by a second DNA binding event in the rearranged conformation. Future studies using advanced high-resolution techniques will be needed to reveal the details of the elusive aspects of PIC assembly, as the current views may underestimate the complexity of TFIID promoter recognition.
Cyclin proteins are master regulators of the cell cycle. Their phasic expression mediates enzymatic activities required for growth and proliferation45. The impact of the TAF1 ZnK mutations on cyclin transcription is not due to an inability to incorporate into TFIID, implying this domain is not involved in TAF interactions or the core structure of TAF1, both important for TFIID assembly. While, this study focuses on the ability of ZnK to bind nucleic acids, our findings do not exclude the possibility the ZnK may interact with other regulatory proteins. The close proximity of the ZnK to the double bromodomain suggests the ZnK could play a role in facilitating the recognition of epigenetic markings, which could influence promoter association. There is growing evidence that zinc fingers can play a dual role: supporting DNA binding and facilitating protein interactions with histones or transcriptional activators, such as those found in CBP and ZYMND1146,47,48,49. However, our experimental approach cannot rule out this possibility considering our transcription assays were conducted with luciferase reporter plasmids not known to incorporate histones; despite this caveat, we conclude that the impact on cyclin transcription results from a defect in active PIC formation, attributed to loss of promoter binding by the ZnK mutant containing TFIID complex. The inability to stimulate transcription from the cyclin promoters would explain why the ZnK mutant cannot rescue the ts13 growth defect in the cell complementation assay.
The mechanisms proposed in this work may also provide some insight into the pathologies of human disease states associated with TAF1 mutations by GWAS. The cBioPortal and catalog of somatic mutations in cancer (COSMIC) databases denote several mutations in the conserved CCHC residues of TAF1 including: C1285R, C1288R, H1293N, C1300W/R. While it is unknown whether these mutations are causative, they reinforce the notion that these residues are crucial for proper TAF1 function in cells. TAF1 is essential for cell viability with deletion of the gene lethal to cells. For this reason, it is doubtful that disease-associated mutations completely obliterate TAF1 function and cause global transcriptional dysfunction. More likely, TAF1 function is altered in some subtle manner that results in aberrant transcription from select genes. The role of TFIID in proliferation and cell survival makes it of particular interest to cancer biology50,51.
In summary, we uncovered a second critical DNA binding domain in TAF1, the largest subunit of the TFIID general transcription factor. The conserved zinc knuckle is essential for the full in vivo function of TAF1. Mutations within this region do not disrupt the stability of TAF1 or its incorporation into TFIID, but instead interfere with its ability to associate with core promoters. The loss of promoter association would result in inefficient transcription initiation. We speculate that two motifs in TAF1 provide greater flexibility within TFIID for promoter DNA sequence recognition.
Materials and Methods
ts13 Complementation Assay
Mammalian TAF1 expression plasmids contain N-terminal HA-tagged human TAF1 coding sequence inserted downstream of the CMV promoter in CS2 + vector52. Point mutations in TAF1 were introduced by site-directed mutagenesis and confirmed by DNA sequencing (Supplemental Table S1,20). ts13 cells were grown at 33.5 °C in Dulbecco’s modified Eagles medium (Gibco) supplemented with 10% fetal bovine serum, 2 mM L-glutamine, and penicillin/streptomycin. For complementation assays, cells were seeded into 6-well plates, grown overnight to 70–80% confluency and transfected with 2 μg of CS2 + or TAF1 expression plasmids using polyethylenimine (PEI) transfection reagent (ratio 2.5:1 of PEI:DNA) according to previously described protocol53. Transfected cells were maintained for an additional 18–24 h at 33.5 °C, then either harvested for Western Blot analysis or shifted to nonpermissive temperature of 39.5 °C. Number of DAPI positive viable cells was determined after 36–48 h at 39.5 °C, and the percentage relative to WT-TAF1 transfected cells (given the value of 100%) was calculated.
Luciferase Reporter Assay
Human cyclin D1 luciferase construct was constructed by subcloning the EcoRI to PvuII fragment of pD1-G065 (kindly provided by Yue Xiong) into the SmaI site of pGL2-basic as previously described54. The human cyclin A2 luciferase construct has been described55. The IMD luciferase construct is previously described in Juven-Gereshon et al.56 and a gift from J. Kadonaga. HEK293 cells were grown at 37 °C in Dulbecco’s modified Eagles medium (Gibco) supplemented with 10% fetal bovine serum, 2 mM L-glutamine, and penicillin/streptomycin. For luciferase assays, cells were seeded into 24-well plates, grown overnight to 60%-70% confluency and cotransfected with 100 ng of CS2 + or TAF1 expression constructs and 50 ng luciferase reporter construct using FuGene transfection reagent (ratio 2.5:1 of FuGene:DNA) according to the manufacturer’s protocol (Roche). Transfected cells were maintained an additional 36 h at 37 °C, after which cells were harvested and lysed in 100 μL of 1× Passive lysis buffer (Promega). Luciferase activity was measured according to manufacturer’s protocol (Promega) on Lumat LB 9507 (Berthold Technologies) and normalized for total protein. Transcriptional activity was expressed relative to CS2+/reporter construct transfected cells (set to 1).
Cyclin D1 RT-qPCR
ts13 cells seeded into 35-mm dishes were grown at 33.5 °C to 60–70% confluency and transfected with 500 ng of CS2 + or HA-TAF1 expression plasmids using PEI (2.5:1 PEI:DNA ratio). Total RNA was isolated approximately 24–36 hours post-transfection and 1 μg RNA reverse transcribed in cDNA using iScript cDNA synthesis kit (BioRad). Cyclin D1 transcript levels were measuring by qPCR using SsoFast EvaGreen Supermix (BioRad) on the Applied Biosystems 7500Fast Real-Time PCR system. Relative transcript levels were determined by calculating 2−ΔCt.
HEK 293 cells were seeded into 10-cm dishes and transfected with CS2 + or TAF1 expression constructs and incubated for 24–36 h at 37 °C. Before harvesting, cells were cross-linked with 400 μl of 37% formaldehyde and analyzed as previously described57. In brief, cells were lysed in 0.5 mL immunoprecipitation (IP) buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 5 mM EDTA, 1% Triton X-100, and 0.5% Nonidet P-40) containing protease inhibitors and sonicated 8 times for 20 s at 40% amplitude (Branson Digital Sonifier). Nuclear extract was incubated overnight at 4 °C with either anti-HA (clone 3F10, Sigma Aldrich) or mouse IgG (Abcam). Immunoprecipitated complexes were washed and eluted. Cross-links were reversed and DNA was isolated. Input DNA was purified from 100 μl extract. Fifty nanograms of input DNA and 2 μL of purified TAF1 bound DNA was amplified with SsoFast EvaGreen Supermix (Bio-Rad) using primers spanning the promoters of cyclin D1 and cyclin A2 (Supplemental Table S1). Quantitative PCR was performed on the Applied Biosystems 7500Fast Real-Time PCR system and the data expressed as percent input.
TFIID and TAF1 Immunoprecipitation
For preparation of nuclear extracts, HEK293 cells were resuspended in buffer A (10 mM HEPES, pH 7.9, 1.5 mM MgCl2, 10 mM KCl, 0.5 mM DTT) and incubated on ice for 15 min. Cells were lysed by pushing through a 25-gauge needle 5 times. The crude nuclear pellet was isolated by centrifugation for 20 s at 12,000 g, resuspended in buffer C (20 mM HEPES, pH 7.9, 25% glycerol, 420 mM NaCl, 1.5 mM MgCl2, 0.2 mM EDTA, 0.5 mM phenylmethylsulfonyl fluoride, 0.5 mM DTT), and incubated for 30 min at 4 °C. After centrifugation for 5 min at 12,000 g, the supernatant/nuclear extract was incubated with anti-TAF4 (mAb 3A6, ref.58) or anti-HA antibody overnight at 4 °C. Ten microliters of protein A–Sepharose CL-4B (GE Healthcare) was added, and the samples were nutated for 2 h at 4 °C. Isolated proteins were washed 5 times with Buffer C and analyzed by silver stain or immunoblotting using antibodies against HA (clone 3F10, Sigma Aldrich), TAF1 bromodomain (Ab1230, ref.59), TBP (gift from R. Tjian, ref.5), TAF4 (mAb 3A6), and TAF5 (mAb 6C1, unpublished data).
For expression of His-tagged TAF1 proteins, cDNA of TAF1 (ZnA-1234–1371, ZnC-1234–1303, ZnD-1256–1303) was PCR amplified from CS2 + containing full length TAF1 and cloned into pAL-GB1 using ligase independent cloning; site-directed mutagenesis was used to introduce point mutations into ZnC and ZnD constructs (Supplemental Table S1). TAF1 ZnK expression constructs were transformed into BL21* cells for protein expression. Starter cultures of 10 mL were diluted into 1 L LB containing 30 μg/ml chloramphenicol, and cells were grown to an optical density of 0.8–1.0 at 600 nm and chilled on ice for 1 h. TAF1 expression was induced by addition of 0.2 mM IPTG (isopropyl–D-thiogalactopyranoside) for 18 h at 16 °C. Cells were harvested by centrifugation and lysed by pulse-sonication in 50 mM Tris-HCl, pH 8.0, 300 mM NaCl, 5 mM imidazole, 5% glycerol, 0.05% Triton-X, 1 mM DTT supplemented with protease inhibitors. His-GB1-TAF1 was purified using Ni-nitrilotriacetic acid (NTA)- agarose (Qiagen), washed with Buffer A (50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 10 mM imidazole, 5% glycerol, 1 mM DTT) followed by a high salt wash (50 mM Tris-HCl, pH 8.0, 1 M NaCl, 5% glycerol, 1 mM DTT) and a final wash with Buffer A. Protein was eluted with 50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 200 mM imidazole, 5% glycerol, 1 mM DTT. For proteolytic digestion and electrophoretic mobility shift assays, ZnA was further purified by cation exchange (GE Healthcare) after off-column cleavage by the tobacco etch virus (TEV) protease. Proteins for biolayer interferometry were further purified by size-exclusion chromatography (GE Healthcare) using 10 mM HEPES, pH 7.9, 150 mM NaCl, 1 mM DTT.
Electrophoretic Mobility Shift Assay
Purified TAF1 (ZnA-1234–1371) (7.6–61.3 pmoles) purified from E. coli was incubated with 4 ng of 32P 5′-end labeled DNA in 10 mM HEPES pH 7.9, 5 mM MgCl2, 100 mM NaCl, 10% glycerol, 20 mM tetrasodium pyrophosphate, 0.2 mM dI:dC for 1 h at 25 °C. For gel electrophoresis, 6× loading buffer (20% Ficoll, 0.025% bromophenol blue) was added and binding reactions were loaded onto nondenaturing 5% polyacrylamide (37.5:1 acrylamide:bis) pre-run in 0.5× TBE at 100 V for 30 min. Samples were resolved for 1.5 h at 100 V and shifted complexes detected by autoradiography. The sequences of DNAs used for EMSA are provided in Supplemental Table S1.
5′-biotinylated oligonucleotides were annealed to their complement (IDT, USA) in 1× binding buffer (10 mM HEPES pH 7.9, 100 mM NaCl, 10 mM MgCl2, 1 mM DTT) by heating at 72 °C for 5 minutes and slow cooling to 25 °C. Double-stranded oligos (Supplemental Table S1) were purified by extraction from agarose after gel electrophoresis then ethanol precipitated. Purified biotinylated double-stranded oligos were loaded onto streptavidin-coated (SA) sensors and the interactions between the DNA coated probes and TAF1 proteins were measured using ForteBio Octet RED96 system (Pall Life Sciences). For loading, SA sensors were prewetted in assay buffer (10 mM HEPES pH 7.9, 5 mM MgCl2, 100 mM NaCl, 10% glycerol, 0.1% ovalbumin, 0.2 mM dI:dC) for 30 min then dipped into individual wells on 96-well plates (200 μL/well) containing biotinylated DNA (100 nM) in assay buffer. Unbound DNA was removed by washing and exposed streptavidin blocked with biocytin. Protein was diluted in assay buffer to 3 μM, 1 μM, 333 nm 111 nm, 37 nm, and 12 nm. Association of the protein to the probe was observed for 300 s followed by dissociation in assay buffer for 300 s. Throughout loading, association and dissociation sensors were shaking (1,000 rpm) at a constant 30 °C to promote specific interaction and to reduce non-specific binding. A reference sensor (no biotinylated DNA loaded) was included and subjected to the same procedure to control for non-specific interactions. Sensors were regenerated (10 mM HEPES pH 7.9, 1 M NaCl) between experiments and a new baseline established. Reference-subtracted data was used to calculate the equilibrium dissociation constant (Kd) by steady-state response analysis assuming 1:1 kinetics.
DNA Protection Assay
TAF1 (aa 1234–1371) was exposed to limited proteolytic digestion in the presence and absence of DNA (IMD) to determine DNA-bound stabilized regions. TAF1 (625 pmol) was incubated with or without DNA (700 pmol) in 10 mM HEPES pH 7.9, 5 mM MgCl2, 100 mM NaCl, 10% glycerol, 20 mM tetrasodium pyrophosphate, 0.2 mM dI:dC) for 1 h at 25 °C. Subtilisin (0.002, 0.006, 0.02, 0.06, 0.2, and 0.6 μg/ml) was added to each sample and incubated overnight on ice at 4 °C. Samples were divided in two and analyzed as follows: SDS-PAGE, coomassie stained, and scanned for analysis; SDS-PAGE, transferred to PVDF membrane, coomassie stained and stabilized fragments excised and N-terminally sequenced.
All data generated or analyzed during this study are included in this published article (and its Supplementary Information files).
Buratowski, S., Hahn, S., Guarente, L. & Sharp, P. A. Five intermediate complexes in transcription initiation by RNA polymerase II. Cell 56, 549–561 (1989).
Thomas, M. C. & Chiang, C. M. The general transcription machinery and general cofactors. Crit Rev Biochem Mol Biol. 41, 105–178 (2006).
Van Dyke, M. W., Roeder, R. G. & Sawadogo, M. Physical analysis of transcription preinitiation complex assembly on a class II gene promoter. Science 241, 1335–1338 (1988).
Orphanides, G., Lagrange, T. & Reinberg, D. The general transcription factors of RNA polymerase II. Genes Dev. 10, 2657–2683 (1996).
Pugh, B. F. & Tjian, R. Transcription from a TATA-less promoter requires a multisubunit TFIID complex. Genes Dev. 5, 1935–1945 (1991).
Chalkley, G. E. & Verrijzer, C. P. DNA binding site selection by RNA polymerase II TAFs: a TAF(II)250-TAF(II)150 complex recognizes the initiator. EMBO J. 18, 4835–4845 (1999).
Shao, H. et al. Core promoter binding by histone-like TAF complexes. Mol Cell Biol. 25, 206–219 (2005).
Kim, Y., Geiger, J. H., Hahn, S. & Sigler, P. B. Crystal structure of a yeast TBP/TATA-box complex. Nature 365, 512–520 (1993).
Peterson, M. G., Tanese, N., Pugh, B. F., & Tjian, R. Functional domains and upstream activation properties of cloned human TATA binding protein. Science 248, 1625–1630.
Hoey, T., Dynlacht, B. D., Peterson, M. G., Pugh, B. F. & Tjian, R. Isolation and characterization of the Drosophila gene encoding the TATA box binding protein, TFIID. Cell 61, 1179–1186 (1990).
Weis, L. & Reinberg, D. Accurate positioning of RNA polymerase II on a natural TATA-Less promoter Is independent of TATA-binding-protein-associated factors and initiator-binding proteins. Mol Cell Biol. 17, 2973–2984 (1997).
Martinez, E. et al. Core promoter-specific function of a mutant transcription factor TFIID defective in TATA-box binding. Proc Natl Acad Sci USA 92, 11864–11868 (1995).
Martinez, E., Chiang, C. M., Ge, H. & Roeder, R. G. TATA-binding protein-associated factor(s) in TFIID function through the initiator to direct basal transcription from a TATA-less class II promoter. EMBO J. 13, 3115–3126 (1994).
Zhang, Z. et al. Rapid dynamics of general transcription factor TFIIB binding during preinitiation complex assembly revealed by single-molecule analysis. Genes Dev. 30, 2106–2118 (2016).
Matangkasombut, O., Auty, R. & Buratowski, S. Structure and function of the TFIID complex. Adv in Protein Chem. 67, 67–92 (2004).
Yang, C., Bolotin, E., Jiang, T., Sladek, F. M. & Martinez, E. Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters. Gene 389, 52–65 (2007).
Vo Ngoc, L., Cassidy, C. J., Huang, C. Y., Duttke, S. H. & Kadonaga, J. T. The human initiator is a distinct and abundant element that is precisely positioned in focused core promoters. Genes Dev. 31, 6–11 (2017).
Lewis, B. A., Sims, R. J. 3rd, Lane, W. S. & Reinberg, D. Functional characterization of core promoter elements: DPE-specific transcription requires the protein kinase CK2 and the PC4 coactivator. Mol Cell 18, 471–481 (2005).
Lee, D. H. et al. Functional characterization of core promoter elements: the downstream core element is recognized by TAF1. Mol Cell Biol. 25, 9674–9686 (2005).
Wang, H., Curran, E. C., Hinds, T. R., Wang, E. H. & Zheng, N. Crystal structure of a TAF1-TAF7 complex in human transcription factor IID reveals a promoter binding module. Cell Res. 24, 1433–1444 (2014).
Louder, R. K. et al. Structure of promoter-bound TFIID and insight into human PIC assembly. Nature 531, 604–609 (2016).
Cianfrocco, M. A. et al. Human TFIID binds to core promoter DNA in a reorganized structural state. Cell 152, 120–131 (2013).
Mencia, M. & Struhl, K. Region of yeast TAF 130 required for TFIID to associate with promoters. Mol Cell Biol. 21, 1145–1154 (2001).
Hayashida, T. et al. The CCG1/TAFII250 gene is mutated in thermosensitive G1 mutants of the BHK21 cell line derived from golden hamster. Gene 141, 267–270 (1994).
Talavera, A. & Basilico, C. Temperature sensitive mutants of BHK cells affected in cell cycle progression. J Cell Physiol. 92, 425–436 (1977).
Sekiguchi, T. et al. Apoptosis is induced in BHK cells by the tsBN462/ts13 mutation in the CCG1/TAFII250 subunit of the TFIID basal transcription factor. Exp Cell Res. 218, 490–498 (1994).
Darlix, J., Lastra, M., Mely, Y. & Roques, B. Nucleocapsid protein chaperoning of nucleic acids at the heart of HIV structure, assembly and cDNA synthesis. HIV Sequence Compendium 69–88 (2002).
Loughlin, F. E. et al. Structural basis of pre-let-7 miRNA recognition by the zinc knuckles of pluripotency factor Lin28. Nat Struct Mol Biol. 19, 84–89 (2011).
Armas, P. & Calcaterra, N. Retroviral zinc knuckles in eukaryotic cellular proteins. Nova Science Publishers 51–80 (2013).
Abad, P. et al. A long interspersed repetitive element—the I factor of Drosophila teissieri—is able to transpose in different Drosphila species. Proc Natl Acad Sci USA 86, 8887–8891 (1989).
Dawson, A., Hartwood, E., Paterson, T. & Finnegan, D. A LINE-like Transposable element in Drosphila, the I factor, encodes a protein with properties similar to those of retroviral nucleocapsids. EMBO J. 16, 4448–4455 (1997).
Bosch, N. et al. Characterization and evolution of the novel gene family FAM90A in primates originated by multiple duplication and rearrangement events. Hum Mol Genet. 16, 2572–2582 (2007).
Yang, J. et al. The I-TASSER Suite: Protein structure and function prediction. Nature Methods 12, 7–8 (2015).
De Guzman, R. et al. Structure of the HIV-1 nucleocapsid protein bound to the SL3 psi-RNA recognition element. Science 279, 384–388 (1998).
Zeng, Y. et al. Lin28A binds active promoters and recruits Tet1 to regulate gene expression. Mol Cell 61, 153–160 (2016).
Wang, E. & Tjian, R. Promoter-selective transcriptional defect in cell cycle mutant ts13 rescued by hTAFII250. Science 263, 811–814 (1994).
Suzuki-Yagawa, Y., Guermah, M. & Roeder, R. The ts13 mutation in the TAF(II)250 subunit (CCG1) of TFIID directly affects transcription of D-type cyclin genes in cells arrested in G1 at the nonpermissive temperature. Mol Cell Biol. 17, 3284–3294 (1997).
Sekihuchi, T. et al. D-type cyclin expression is decreased and p21 and p27 CDK inhibitor expression is increased when tsBN462 CCG1/TAFII250 mutant cells arrest in G1 at the restrictive temperature. Genes Cells 1, 687–705 (1996).
Wang, E. H., Zou, S. & Tjian, R. TAFII250-dependent transcription of cyclin A is directed by ATF activator proteins. Genes Dev. 11, 2658–2669 (1997).
Pennington, K. L., Marr, S. K., Chirn, G. W. & Marr, M. T. 2nd Holo-TFIID controls the magnitude of a transcription burst and fine-tuning of transcription. Proc Natl Acad Sci USA 110, 7678–7683 (2013).
Wang, M. et al. PaxDb, a database of protein abundance averages across all three domains of life. Mol Cell Proteomics 11, 492–500 (2012).
Marks, D., Hopf, T. & Sander, C. Protein structure prediction from sequence variation. Nature Biotechnology 30, 1072–1080 (2012).
Dey, A., York, D., Smalls-Mantey, A. & Summers, M. Composition and Sequence-Dependent Binding of RNA to the Nucleocapsid Protein of Moloney Murine Leukemia Virus. Biochemistry 44, 3735–3744 (2005).
Vandevenne, M. et al. New insights into DNA recognition by zinc fingers revealed by structural analysis of the oncoprotein ZNF217. J Biol Chem. 288, 10616–10627 (2013).
Schafer, K. A. The Cell Cycle: A Review. Vet Pathol. 35, 461–478 (1998).
Song, C., Keller, K., Chen, Y., Murata, K. & Stamatoyannopoulos, G. Transcription coactivator CBP has direct DNA binding activity and stimulates transcription factor DNA binding through small domains. Biochem Biophys Res Commun. 296, 118–124 (2002).
Wang, J. et al. Crystal structure of human BS69 Bromo-ZnF-PWWP reveals its role in H3K36me3 nucleosome binding. Cell Res. 24, 890–3 (2014).
Wen, H. et al. ZMYND11 links histone H3.3 K36 trimethylation to transcription elongation and tumor suppression. Nature 508, 263–268 (2014).
Li, Y. Y., Wen, H., Shi, X. B. & Li, H. T. Determination of protein-DNA (ZMYND11-DNA) interaction by a label-free biolayer interferometry assay. bio-protocol 5, e1402 (2015).
Ribeiro, J. R., Lovasco, L. A., Vanderhyden, B. C. & Freiman, R. N. Targeting TBP-associated factors in ovarian cancer. Frontiers in Oncology 4, 45 (2014).
Zhao, S. et al. Landscape of somatic single-nucleotide and copy-number mutations in uterine serous carcinoma. Proc Natl Acad Sci USA 110, 2916–2921 (2013).
Ruppert, S., Wang, E. H. & Tjian, R. Cloning and expression of human TAFII250: a TBP-associated factor implicated in cell-cycle regulation. Nature 362, 175–179 (1993).
Longo, P. A., Kavran, J., Kim, M. S. & Leahy, D. J. Transient mammalian cell transfection with polyethylenimine (PEI). Methods Enzymol. 529, 227–240 (2013).
Hilton, T. L. & Wang, E. H. TFIID recruitment and Sp1 activation: dual function of TAF1 in cyclin D1 transcription. J. Biol.Chem. 15, 12992–13002 (2003).
Yamamoto, M. et al. Effect of tumor suppressors on cell cycle-regulatory genes: RB suppresses p34cdc2 expression and normal p53 suppresses cyclin A expression. Exp Cell Res 210, 94–101 (1994).
Juven-Gershon, T., Cheng, S. & Kadonaga, J. T. Rational design of a super core promoter that enhances gene expression. Nat. Methods 3, 917–922 (2006).
Kloet, S. L., Whiting, J. L., Gafken, P., Ranish, J. & Wang, E. H. Phosphorylation-Dependent Regulation of Cyclin D1 and Cyclin A Gene Transcription by TFIID Subunits TAF1 and TAF7. Mol Cell Biol. 32, 3358–3369 (2012).
Näär, A. M. et al. Chromatin, TAFs, and a novel multiprotein coactivator are required for synergistic activation by Sp1 and SREBP-1a in vitro. Genes & Dev. 12, 3020–3031 (1998).
Li, A. G. et al. An Acetylation Switch in p53 Mediates Holo-TFIID Recruitment. Mol Cell 28, 498–421 (2007).
We thank the Hague and Scott laboratories in the Department of Pharmacology at the University of Washington for the use of reagents and instruments, L Cheng for assistance with the in vivo complementation experiments, and members of the Zheng and Wang laboratories for discussion. This study was supported by PHS NRSA T32GM007270 from NIGMS (to E.C.C.), Howard Hughes Medical Institute (to N.Z.) and University of Washington Royalty Research Fund (to E.H.W.).
The authors declare no competing interests.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
About this article
Cite this article
Curran, E.C., Wang, H., Hinds, T.R. et al. Zinc knuckle of TAF1 is a DNA binding module critical for TFIID promoter occupancy. Sci Rep 8, 4630 (2018). https://doi.org/10.1038/s41598-018-22879-5
Nature Communications (2019)
Identification of the gene encoding the TATA box-binding protein-associated factor 1 (TAF1) and its putative role in the heat shock response in the protozoan parasite Entamoeba histolytica
Parasitology Research (2019)
Involvement of non-coding RNAs and transcription factors in the induction of Transglutaminase isoforms by ATRA
Amino Acids (2019)