Binding of the HSF-1 DNA-binding domain to multimeric C. elegans consensus HSEs is guided by cooperative interactions

The protein HSF-1 is the controlling transcription factor of the heat-shock response (HSR). Its binding to the heat-shock elements (HSEs) induces the strong upregulation of conserved heat-shock proteins, including Hsp70s, Hsp40s and small HSPs. Next to these commonly known HSPs, more than 4000 other HSEs are found in the promoter regions of C. elegans genes. In microarray experiments, few of the HSE-containing genes are specifically upregulated during the heat-shock response. Most of the 4000 HSE-containing genes instead are unaffected by elevated temperatures and coexpress with genes unrelated to the HSR. This is also the case for several genes related to the HSP chaperone system, like dnj-12, dnj-13, and hsp-1. Interestingly, several promoters of the dedicated HSR-genes, like F44E5.4p, hsp-16.48p or hsp-16.2p, contain extended HSEs in their promoter region, composed of four or five HSE-elements instead of the common trimeric HSEs. We here aim at understanding how HSF-1 interacts with the different promoter regions. To this end we purify the nematode HSF-1 DBD and investigate the interaction with DNA sequences containing these regions. EMSA assays suggest that the HSF-1 DBD interacts with most of these HSE-containing dsDNAs, but with different characteristics. We employ sedimentation analytical ultracentrifugation (SV-AUC) to determine stoichiometry, affinity, and cooperativity of HSF-1 DBD binding to these HSEs. Interestingly, most HSEs show cooperative binding of the HSF-1 DBD with up to five DBDs being bound. In most cases binding to the HSEs of inducible promoters is stronger, even though the consensus scores are not always higher. The observed high affinity of HSF-1 DBD to the non-inducible HSEs of dnj-12, suggests that constitutive expression may be supported from some promoter regions, a fact that is evident for this transcription factor, that is essential also under non-stress conditions.


Material and methods
Analysis of microarray data. Initially three microarray data sets investigating the heat-shocked (GSM62937, GSM62941, GSM62945) versus non-shocked condition (GSM62936, GSM62940, GSM62944), which can be obtained from the GEO microarray depository under the GSE2862 tag 31 , were used to identify genes with strong overexpression. Here, L2 stage C. elegans larvae were heat-shocked for 20 min at 33 °C, followed by a recovery at 20 °C for 40 min. As expected, and previously published 15 , the strongest upregulated genes were hsp- 16.1, hsp-16.48 and F44E5. 4 and their duplicated loci. Individual genes that show elevated expression under heat-shock conditions were determined. To obtain information on whether these genes commonly express together, genome-wide clique set analysis was performed as described before 29,30 . The heat-shock data sets were used together with the publicly available coexpression cliques. Altogether 307 cliques had been obtained before, with the largest clique containing 1200 genes and the smallest clique containing 6 genes and the publicly available information was used (www. richt erlab. de/ DataS ets/ and https:// github. com/ klari chter/ clust erEX_ cliqu es_ Celeg ans) 29,30 . We then used each of the three microarray replicates to assign their values to the genes in the coexpression cliques and analyzed those in respect to significant induction or repression as previously described for yeast and nematode expression studies 29,30 . As these heat-shock data and the clique set were both based on the GPL200 platform (Affymetrix C. elegans genome st-1.0), each Probe Set ID was represented by exactly one value in the described clique set. Analyses were performed for each replicate and average values for each clique were calculated to rank the cliques according to their average induction and p-values for induction significance as described 29,30 .
Given the complexity of the heat-shock response, we compared these data to other genome-wide expression data sets. As such a heat-shock time course defined by microarray data 32 was investigated as well as heat-shock experiments based on RNA sequencing 15,17 . The Subio64 software package 1.24.5853 (Subio Inc. Kagoshima, Japan) was used to derive annotated, normalized expression data from the publicly available SRA-files in cases where the annotated data were not available from GEO repository. HSE-detection in the promoter regions. HSE-detection in the promoter region 1000 bp upstream of the ATG was performed based on the PWM-models published for the human Hsf1's DNA binding sequence 33 , which is represented by the following PWM pattern: The 1000 bp promoter regions were obtained from Wormbase (www. wormb ase. org) 34 and searched with this HSF-1 consensus description. As recommended, a threshold level of 9 was used as lower limit for detection 35 . In several cases HSEs were detected in the same promoter and located only 5 bp from each other, which implies that the investigated HSE actually is a tetrameric HSE. If this pattern is observed a second time, a pentameric HSE-element was detected or in rare cases even larger arrays of HSEs were detected.

HSF-1 fragmentation and purification.
Fragmentation was performed based on hydropathy plots and expression tests, which indicated that fragments, which contained additional domains outside the DBD showed either very weak expression or insoluble expression and that full-length HSF-1 could also not be obtained in soluble amounts sufficient for biochemical analysis. Due to our plan to investigate the direct interaction with the DNA, we chose the isolated DBD as model protein for interaction analysis. Therefore, the N-and C-terminus www.nature.com/scientificreports/ of C. elegans HSF-1 were determined by comparing both hydropathy plots and sequence alignments of different Hsf proteins from diverse species. This yielded the fragment AA82-AA198 which was subcloned into the pGATE vector (HSF-1 DBD) and thereby fused with a GST-tag. A GST-trap column was used for purification and the GST-tag was cleaved off by TEV-protease before the HSF-1 DBD was further purified via ion exchange chromatography and size-exclusion chromatography (all columns from GE Healthcare, Chicago, USA). Purity was determined by SDS-PAGE and peptide fingerprinting using mass spectrometry on a Bruker ultra-flex III MALDI-TOF/TOF instrument (Bruker, Billerica, USA) was employed to confirm the identity of the protein.
Circular dichroism spectroscopy. CD-spectroscopy on a Jasco J-715 was performed to obtain information on the structure and stability of the HSF-1 fragment. The folding state and the thermal stability of the expressed HSF-1 fragment was assessed at a concentration of 0.2 mg/mL in storage buffer (40 mM K 2 HPO 4 , 150 mM KCl). CD-spectra were recorded in the Far-UV region between 215 and 260 nm. To analyze the thermal stability of the fragment an unfolding transition was recorded at 220 nm in a temperature range between 25-95 °C.
Thermal shift assays. The stability of the folded structure was analyzed with thermal shift assays in a Mx3005P qPCR cycler (Stratagene, La Jolla, USA). Thermal shift assays were performed at a protein concentration of 0.2 mg/mL after addition of SYPRO orange (Invitrogen, Waltham, USA) at a dilution of 1:1000. The total volume was adjusted to 20 µL with storage buffer.  36 . All experiments were analyzed with the 2DSA-IT model employing the same settings (s-value range from 0 to 10 and f/f0 range from 1 to 4). This way two species distributions were obtained for each experiment, one for the data at 280 nm and one for 260 nm. The complexity of these distributions did not allow a unanimous assignment of solutes to species, which suggests that for a unifying solution a further reduction in search space has to be enforced. A reduced model therefore contained only the most abundant species of the binding reaction (HSF-1 DBD, ssDNA, dsDNA, dsDNA + 1 HSF-1, dsDNA + 2 HSF-1, dsDNA + 3 HSF-1, dsDNA + 4 HSF-1 and dsDNA + 5 HSF-1) at defined s 20,w values. These values were known for HSF-1 DBD, dsDNA and ssDNA from control experiments, while the other species were estimated from a stepwise optimization of these values. Given that all DNA strands were of the same size, a unique value for the sedimentation coefficient (s 20,w ) of each assembly intermediate was assigned independent of the dsDNA used. A custom grid model containing the species at the respective s 20,w values was developed in UltraScan III and used to fit all data sets again. RMSD values of the unconstrained fit and the custom grid constrained fit were compared to verify that the fit quality despite the constraints is acceptable and the species s 20,w values are sufficiently good estimates. To estimate the specific volume of each species and to confirm the MW of each obtained species the following equation was used: Value pairs for D 20,w and s 20,w were estimated and the extinction coefficients, specific volumes and molecular weights were calculated for each species in the custom grid model.

Estimation of interaction parameters for dsDNA-DBD interaction.
Data analysis was finally performed using the species concentrations determined from UltraScan III in the first unconstrained 2DSA-IT analysis and data fitting was based on previously developed models. The fitting function was modified from an Origin DLL-file developed originally for the interaction of two proteins (PPH-5 and HSP-90) 37 to now describe the five-step binding process. Fitting was performed in analogy to the Nelder-Mead implementation for C# accessible at https:// docs. micro soft. com/ de-de/ archi ve/ msdn-magaz ine/ 2013/ june/ test-run-amoeba-methodoptim izati on-using-csharp. Employing this function, K D -values for each step could be estimated. To this end, www.nature.com/scientificreports/ detected species absorbance was converted to species concentrations by employing the estimated extinction coefficients and fitting was performed globally for all species at both wavelengths. In few cases, especially where binding was very weak, RMSD values were almost exclusively influenced by the free ligand concentration. Under these conditions a weighing factor of 0.3 or 0.1 was applied to the free HSF-1 concentration to give more relevance to the other species M, ML1, ML2, ML3, ML4 and ML5. Cooperativity was observed, if K D -values for later assembly steps showed higher affinity than K D -values for early binding steps. Despite the constrained model the obtained K D -values contain large error intervals and are therefore considered as estimations due to the complexity of the binding events and the differences within the individual binding sites on the DNA.
ChIPseq-data analysis. ChIPseq data as available from the GEO repository were obtained as bedgraphfiles 17 . Bedgraph files were searched to retrieve the values for specified regions and the reads identified in HSF-1 IP under various conditions were summarized in Excel to display the regions relevant for the genes of interest.

Results
The heat-shock response is represented by a small set of genes in C. elegans. We initially aimed at identifying those HSE-containing promoters that are most strongly upregulated under heat-stress conditions. Given that the HSR is complex in nematodes we used data from several heat-shock studies based on microarray and RNAseq analysis. Besides defining the individual genes, which are induced upon heat-shock in the different experiments, we also tested, whether the differently regulated genes are enriched in one or more of the 307 C. elegans coexpression cliques. These groups of genes (or gene sets) were obtained based on coexpression analysis of more than 2000 microarray experiments recently 30 and found to contain many coexpressing tissue specific, phenotype specific and GO-term specific gene sets.
To initially define heat shock inducible and non-inducible HSE-sites in the promoter regions from the experiment series GSE2862 31 , we defined the gene sets (or cliques) that represent the heat-shock response under these conditions (L2 larvae, 20 min 33 °C followed by recovery period of 40 min). To this end, we used the method previously described 30 to determine significant coregulation units responding to heat shock. The procedure searches the 307 predefined coexpression cliques and identifies those with significant expression changes. In all three replicates of GSE2862 31 in particular one gene set out of the 307 cliques was highly induced (log 2 > 2), the clique termed hsp-16.2-F44E5.4_19238, which contains the well described heat-shock genes hsp- 16 Fig. 1a, Summary of the three replicates in Table 1, whole genome clustered in Supplemental Fig. 1). While unc-23 and lact-4 were not significantly upregulated in the three microarray experiments, the other genes of this coregulation clique are highly induced so that the hsp-16.2-F44E5.4_19238 gene set stands out with a 4.2-fold average induction (Table 2). Several canonical chaperones, like dnj-12 (two probes in cliques cdc-42_17192-rab-5_18073 and srj-42-srw-113), dnj-13 (two probes in cliques unc-116_2109-zfp-1_3976 and tars-1-AFFX-r2-3026-5_at) and the constitutively expressed Hsc70-homolog hsp-1 (clique dld-1-skn-1_16701) are not part of the HSRclique hsp-16.2-F44E5.4_19238 and we individually tested their induction to confirm that they are indeed not coregulated with the induced heat-shock proteins ( Table 2). As we find them them not upregulated in either of the replicates, the assignment to other coexpression cliques seems justified..

Nematode HSEs vary widely in size and co-expression clique affiliation.
We aimed at understanding, whether different affinities of the heat-shock transcription factor HSF-1 for the promoter sequences can be observed. Previous reports had highlighted that large number of HSEs can be found in the nematode genome 24,38 . Most of these genes are not induced in the heat-shock experiment investigated here. To obtain the HSEs of the genes of interest we searched the 1000 bp promoter regions of all genes of C. elegans. We identified 4120 HSE in genes, which contain a consensus sequence for HSF-1 in their promoter region. Despite not being induced upon heat-shock, several genes related to the chaperone system were found to contain HSE-like sequences in their promoter region, like dnj-12, dnj-13, and hsp-1. We then compared the sequence and structure of the HSEs in the promoter region of the chaperone proteins. Here, several promoters in the HSR-cluster contain more HSEs than the usually expected trimeric DNA-binding sequence, like hsp-16.2a and F44E5.4, which contain four or five HSF-1 binding sites in close vicinity (Fig. 1b).
Heat-shock inducibility varies with the employed stress conditions. We used data from other heat-shock experiments-performed with RNAseq-to see, whether these chaperones and heat-shock proteins are induced with the same pattern. In these RNAseq experiments analysis had been performed in young adults and L2 larvae with and without a heat-shock exposure. In the experiment performed by Brunquell et al. 15 , a very similar set of genes was induced upon heat-shock and likewise only one coexpression clique out of the 307 was found to be significantly upregulated, the clique hsp-16.2-F44E5.4_19238. Concomitantly the chaperone genes also represent the strongest upregulated genes on the single-gene basis ( Table 3, Supplemental Fig. 2a). This also was observed in the second RNAseq experiment performed by Li et al. 17 in L2 and young adult larvae (Table 4, Supplemental Fig. 2b).
We inspected one other experiment 32 , which had determined a time course of the heat-shock response, to investigate whether further genes get differentially expressed after prolonged incubation at the heat-shock temperature. At the shortest incubation time, hsp-16.2-F44E5.4_19238 was the dominant differentially expressed gene set and the chaperones in this clique were the genes with the strongest expression changes ( Table 5, Supplemental Fig. 2c, Clique set in Supplemental Fig. 3a). This changes with longer exposure times and after 720 min of heat-shock several cliques are differentially expressed representing gene groups from very different processes and tissue specific expression (     www.nature.com/scientificreports/ identified differ in their kinetics to heat-stress, in that most are not substantially affected at the shortest heatexposure (30 min), but get affected starting from 60 min incubation time (Supplemental Fig. 4). Of the genes expressed under the harshest conditions, only few contain HSEs in their promoter region and even under those conditions dnj-12, dnj-13 and hsp-1 are only weakly changing their expression levels, while the heat-shock genes grouped in hsp-16.2-F44E5.4_19238 are highly elevated at all time points. Therefore, we consider these HSE-regulated genes to be "heat-inducible" while dnj-12, dnj-13 and hsp-1 represent genes that change their expression more weakly under heat-shock, despite HSE-sequences in the promoter region. unc-23, despite having been assigned to the HSR coexpression clique hsp-16.2-F44E5.4_19238 by the global coregulation analysis, also is upregulated weaker compared to the small heat-shock proteins and the Hsp70s.

The isolated DBD of HSF-1 shows affinity to the F44E5.4 inducible promoter.
To test, to what extent binding differences correlate with expression differences and structural differences of the HSE we set out to determine in vitro, how the interaction of HSF-1 DBD is at these differently structured HSEs. To this end the isolated DNA binding domain of nematode HSF-1 was purified, containing the DBD and omitting the nema- Table 5. Significantly enriched genes in the heat-shock experiments of Jovic et al. 32 . Individual induced genes are shown for 30 min, and 720 min of heat-shock compared to their non-stressed state.  www.nature.com/scientificreports/ tode-specific sequences at the N-terminus and the further regulatory domains at the C-terminus. The structure of the purified DNA-binding domain was investigated by far-UV CD-spectroscopy. The spectra revealed a mostly α-helical structure (Fig. 2a). To confirm the stability of the domain, we performed a thermal transition in the Far-UV CD-range and obtained a temperature midpoint of the unfolding transition of 55 °C (Fig. 2b). We also performed a stability investigation employing the TSA assay, where no obvious differences were observed regarding the melting point (Fig. 2c). Thus, all spectroscopic methods imply that the isolated DNA-binding domain of C. elegans HSF-1 is a stable and structured protein.
dsDNA probes were then generated by us from the heat-shock responsive cluster, in order to gain a better insight into the differential expression form the chaperone-gene derived HSEs. F44E5.4 features a high consensus score pentameric site, both hsp-70 and unc-23 consist of only one trimeric site, while hsp-16.2 has a high consensus score tetrameric site plus an additional trimeric site. Probes of equal length were also made for hsp-1, dnj12 (trimeric HSE-site) and dnj-13 (tetrameric site) representing the non-induced heat-shock related proteins. Since both sequence and position in the promoter region of the following genes are identical the probe for F44E5.4 also represents F44E5.5, while hsp-16.2 represents hsp- 16.11, hsp-16.41, hsp-16.48. The sequences of the probes were obtained from the respective promoter regions. Here only HSEs were considered that locate within 1000 bps upstream of the starting point of transcription (Table 6). F44E5.4p contains more HSEs in its sequence than synthesized in this study (comparison of the promoter regions), but here likewise the probes with the highest consensus score were synthesized.
EMSA-assays imply differences between the chaperone-gene derived HSEs. Electrophoreticmobility shift assays (EMSA) were performed to test the interaction between purified HSF-1 DBD and dsDNAs (Fig. 3a). We set out to perform an initial binding analysis HSF-1 DBD to the promotor of F44E5.4, which also  (Fig. 3b), which showed depending on the probe used, a highly variable reduction in migration speed. While probes derived from the promoter of dnj-13, unc-23 and hsp-1 hardly showed any interaction with the DBD of HSF-1, F44E5.4, hsp-70, hsp16.2 and dnj-12 derived probes appeared to interact strongly, thereby forming intense bands with HSF-1 DBD, representing the dsDNA-protein complex. These results indicate that the HSF-1 DBD alone can interact with the different promoter-derived HSEs to a different extent.

Analytical ultracentrifugation confirms the binding differences at the various HSE-sites.
To unravel the interaction patterns, we performed SV-AUC under the condition employed for the gel-based assay.
To this end, a titration with the DNA probe representing F44E5.4p was performed. Addition of HSF-1 DBD resulted in an increase in the sedimentation coefficient, indicating the binding of HSF-1 DBD to dsDNA (Fig. 4).
In the titration, the progressive binding of HSF-1 DBD molecules increases the s 20,w of the main species and indicates further complex formation at higher protein:DNA ratios. The complex with F44E5.4p appears to reach a saturated level when a tenfold excess of HSF-1 DBD is added. At this point, the presence of remaining unbound HSF-1 DBD becomes visible, which is in agreement to the EMSA binding assay.
Having investigated the promoter region with 5 potential binding sites, we tested, whether the promoter regions with less binding sites, show a similar response. Thus, the same approach was chosen for a DNA with only 3 binding sites derived from the promoter of hsp-70. Here the saturation point of the binding reaction was shifted to lower s 20,w values in both wavelength detection modes, suggesting that in this case less HSF-1 DBD molecules bind to the promoter (Fig. 5a). This behavior therefore appears to be a sequence-specific property. Further analog experiments were performed with all the other dsDNA strands and initially the highest s 20,w values were noted (Fig. 5b-g).
SV-AUC fitting to defined species reveals potential differences in occupation of complex binding sites. The very weak interaction at several consistent-at least on a monomeric level-HSE sites, ques- Table 6. HSE-containing probes designed from the promoter sequences of chaperone genes and used in the binding studies. The designed probes originate from the 1000 bp promoter sequence and are positioned as indicated. The strand and anti-sense strand were synthesized and combined to give the promoter sequence able to bind HSEs. www.nature.com/scientificreports/ tions the independent interaction of monomeric units at these sites. UltraScan III was employed to analyze the data from these experiments and to obtain information on the binding equilibrium in solution. To this end we compared the general ability to fit the data with a very flexible model (2DSA-IT) and with a very constrained model, where a custom grid was designed containing one s 20,w value (Table 7) for each species to be considered (2DSA-CG-IT). This method reveals available free protein concentration dependent changes in complex species distributions and offers the opportunity to fit distributions of DNA/protein complex obtained directly from raw data to hypothetical species, thus, to obtain the concentration of each potential complex species and to describe the composition of the complex mixture in each sample. The comparison of RMSD values from the 2DSA-CG-IT fit of each complex species formed with different DNAs is shown in Fig. 6. It is very clear from these data that different assembly mechanisms are happening in different probes and different stoichiometries must be assumed. In the UltraScan III analysis, the higher order complexes are only populated when using larger HSEs and in all cases the buildup of the free HSF-1 DBD can be observed at the higher concentrations employed in each titration. Furthermore, almost no binding was observed for the constructs of hsp-1, dnj-13, and unc-23. (Fig. 6e, f and g). www.nature.com/scientificreports/ Global fitting of stepwise binding models implies favorable cooperative action at second and third binding steps. We then set out to globally fit one titration to a predefined set of species, which is kept invariant throughout all the DNA probes analyzed. This is possible, as the dsDNA strands are of equal length and the binding sites are engineered to be in the middle of each dsDNA scaffold. Indeed, for each of the stronger binding species, the second binding step is exposing a lower dissociation constant compared to the first binding steps and similar relationships occur at the later binding steps at probes that harbor more than three binding sites. In fact, the four strongly interacting systems (hsp-70, hsp- 16.2a, hsp-16.2b and F44E5.4) show a second binding step with submicromolar affinity, while the first binding step is weaker (Table 8). Thus, it is indeed to be expected that cooperative actions increase the binding affinity and interactions between the occupied binding sites modulate and potentially coordinate the binding of HSF-1 at these HSEs.

Discussion
In the nematode genome there are 4120 HSEs, which contain HSF-1 binding consensus regions in the 500 bp upstream of their start codon. It is very surprising that despite these many HSF-1 regulated genes the canonical heat-shock response only represents a clique of 8 genes, 7 of which are regulated by HSF-1 binding promoter regions. Thus, the extent of regulation resulting from HSF-1's actions is well beyond the induction of stress genes under stress conditions and reaches far into the normal growth cycle of the nematode under non-stressed conditions. The ability to resolve the clique membership based on coexpression analysis shows that also in larger organisms this approach may be successful and able to connect different cliques to different tissues and developmental states.
Binding affinity, cooperativity and stoichiometry on complex promoter sequences. We here tested the binding of the HSF-1 DBD to some of the likely interacting promoter regions. From these studies we can find that the HSF-1 DBD alone can bind the HSE-regions originated from the genome with certain selectivity based on its affinity. Despite this, the affinities correlate to some extent with the calculated consensus score and with the inducibility of the respective gene. It is interesting to note, that despite the proposed trimeric binding mode, tetrameric and pentameric HSEs exist and that binding to those sites is driven by additional cooperativity. Among the probes we investigate in this study, the tetrameric and pentameric sites represent those, which are inducible upon heat-shock. In general, the developed AUC assay to test the binding of several proteins to one DNA strand is very valuable in quantifying the binding events and may represent an opportunity to study the many interactions occurring on dsDNA with different binding sites for individual transcription factors. While the sedimentation coefficients for the custom grid are an assumption, they provide a rational to obtain stepwise binding information from the SV-AUC titration data. The absolute values of the obtained stepwise dissociation constants are to be used with care, but trends can be derived from these values with good confidence. The ability to resolve different intermediate assembly steps may be further increased by using direct interaction models for the fitting, but the stepwise procedure shown here already represents the chance to quantify these events. Nevertheless, the grouping of the    www.nature.com/scientificreports/ genes into coexpression cliques, the identification of common transcription factors for these cliques and the analysis of binding events to the predicted transcription factor binding sites opens possibilities to gain further insight into the complex relationships leading to the spatio-temporal expression of genes during development and aging of C. elegans, or complex multi-step binding reactions in general.

Correlation between binding and inducibility.
Comparing the binding ability of HSF-1 to the promoter regions and the observed response to heat-stress may be far fetching, given that only the DBD of HSF-1 was studied and further regulation will surely come from the other regions of this complex protein. Nevertheless, for the strongest inducible genes, also the highest affinities are observed (hsp-16.2, F44E4.5, hsp-70), which are also in accordance with previous studies 15,17 .
One exception among the probes studied here is dnj-12, which is only weakly inducible but well capable of binding to the HSF-1 DBD. Interestingly dnj-12 is already at non-stressed conditions highly expressed, similarly to hsp-1. This can be derived from the relatively high number of RNAseq reads originating from these ORFs. Given the ubiquitous expression of this protein it might be envisioned that its binding to HSF-1 is constitutive, and the induced expression therefore is not increased upon heat-shock. Looking into publicly available ChIPseq data 17 for the locations described here, some of these speculations can be tested. Indeed, for the genes coexpressed upon heat-shock, hsp-16, F44E5.4 and hsp-70 this can be confirmed (Fig. 7) and the inducibility from the promoters F44E5.4/5, hsp-16.2 and hsp-70 correlates well with increased occupancy of HSF-1 on the HSE-sites. Even for unc-23 a slight increase in occupancy can be observed. This change at the promoter regions cannot be observed for the non-inducible probes. Here (dnj-12, dnj-13 and hsp-1), HSF-1 sites are occupied in a similar or even reduced manner with and without heat-shock implying a constitutive expression and possible constitutive function of HSF-1 responsible for the high expression levels observed for these genes under stressed and non-stressed conditions. This logic may be relevant for several of the 4120 HSE-binding sites found in promoter regions. Despite the correlations observed, it is important to note, that the approach employed in this study solely considers the DBD of HSF-1 and that HSF-1 HSE binding in the cell is further regulated by other regulatory domains, oligomerization domains and posttranslational modifications, like phosphorylation 39,40 and deacetylation 41 . Due to these limitations further studies with longer fragments or full-length protein will need to be performed to unravel the full relationship between promoter sequences and HSF-1 binding. Therefore, the here applied approach shows the direct affinity of the unmodified DBD to the DNA, but will require adaptations, when used for the dsDNA binding analysis of full-length HSF-1 in the future.  Promoter regions of C. elegans were investigated to identify the occupancy as determined by HSF-1 ChIPseq data of Li et al. 17 . Four experiments were compared based on the available data: Young adult with and without heat-shock and L2 larvae with and without heat-shock for all the promoter regions investigated here.