Predicting the impact of promoter variability on regulatory outputs

Kreamer, Naomi N.; Phillips, Rob; Newman, Dianne K.; Boedicker, James Q.

doi:10.1038/srep18238

Download PDF

Article
Open access
Published: 17 December 2015

Predicting the impact of promoter variability on regulatory outputs

Naomi N. Kreamer^1,2,
Rob Phillips^1,4,
Dianne K. Newman^1,3 &
…
James Q. Boedicker^5,6

Scientific Reports volume 5, Article number: 18238 (2016) Cite this article

2382 Accesses
7 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The increased availability of whole genome sequences calls for quantitative models of global gene expression, yet predicting gene expression patterns directly from genome sequence remains a challenge. We examine the contributions of an individual regulator, the ferrous iron-responsive regulatory element, BqsR, on global patterns of gene expression in Pseudomonas aeruginosa. The position weight matrix (PWM) derived for BqsR uncovered hundreds of likely binding sites throughout the genome. Only a subset of these potential binding sites had a regulatory consequence, suggesting that BqsR/DNA interactions were not captured within the PWM or that the broader regulatory context at each promoter played a greater role in setting promoter outputs. The architecture of the BqsR operator was systematically varied to understand how binding site parameters influence expression. We found that BqsR operator affinity was predicted by the PWM well. At many promoters the surrounding regulatory context, including overlapping operators of BqsR or the presence of RhlR binding sites, were influential in setting promoter outputs. These results indicate more comprehensive models that include local regulatory contexts are needed to develop a predictive understanding of global regulatory outputs.

Automated model-predictive design of synthetic promoters to control transcriptional profiles in bacteria

Article Open access 02 September 2022

Travis L. LaFleur, Ayaan Hossain & Howard M. Salis

The Escherichia coli transcriptome mostly consists of independently regulated modules

Article Open access 04 December 2019

Anand V. Sastry, Ye Gao, … Bernhard O. Palsson

Machine learning uncovers independently regulated modules in the Bacillus subtilis transcriptome

Article Open access 11 December 2020

Kevin Rychel, Anand V. Sastry & Bernhard O. Palsson

Introduction

It is well appreciated that the rate of generation of new genome sequencing data is far outpacing our ability to make sense of it. For example, although considerable progress has been made in recent years to understand the roles of noncoding genomic regions^1,2, our ability to apply this understanding in a predictive fashion is still quite limited. More fundamentally, it is not clear to what extent we can accurately predict genome-wide regulatory outputs from DNA nucleotide sequence, though many complementary approaches are being applied to decipher the regulatory logic in the genomes of both eukaryotes and bacteria^3,4. Here, we put our understanding of how an individual bacterial transcription factor influences global gene expression to the test to explore the extent to which the binding sequence and the surrounding regulatory context tune promoter outputs.

Making precise predictions of genome-wide expression is hampered by an incomplete understanding of how regulatory information is encoded at different promoter regions^3,5. We cannot accurately predict a priori what influence on gene expression a particular transcription factor will have at a given promoter. For the majority of transcription factors we only know where these factors are likely to bind and whether the transcription factor acts as an activator or repressor. Current models typically cannot quantitatively determine the magnitude of expression, nor predict how expression is modulated by the number of regulatory proteins per cell or the promoter architecture (defined as the sequence, orientation, location and number of transcription factor binding sites and their proximity to the RNA polymerase binding site). Even for well-characterized regulators, we are often surprised by experimental results contradicting expected trends^6,7,8. A predictive understanding of how regulatory information is encoded in the genome would lead to more meaningful comparisons between genomes of related organisms, enhance our understanding of regulatory-genome changes associated with niche differentiation and improve our ability to design synthetic regulatory networks.

We chose the ferrous iron [Fe(II)] responsive two-component system, BqsRS, from Pseudomonas aeruginosa⁹ to test our ability to quantitatively predict the regulatory output for an individual transcription factor. Most organisms require iron for essential cellular processes, including electron transfer steps in metabolism, but iron is a limiting nutrient in many environments¹⁰. Because iron uptake and localization are critical for growth and function, the cellular response to iron is complex and tightly regulated. In the body, our immune system controls pathogen proliferation in part by sequestering iron through high-affinity ferric iron [Fe(III)]-binding molecules such as lactoferrin¹¹. In many environments, including the cystic fibrosis (CF) lung and soil, iron is present in both the Fe(II) and Fe(III) forms^12,13. Furthermore, elevated Fe(II) in CF sputum correlates with severe disease states¹². BqsRS consists of a sensor histidine kinase (BqsS) and a response regulator (BqsR) that responds specifically to Fe(II) at low micromolar concentrations in the periplasmic space⁹. BqsRS is known to be involved in rhamnolipid production and biofilm dispersal¹⁴ and it mediates cellular defenses against cationic stressors, including aminoglycoside and polymyxin antibiotics¹⁵. The consensus BqsR DNA binding motif¹⁵ is found in promoter regions of genes that are upregulated by BqsR, but it remains unclear whether BqsR binding alone is sufficient to elicit a regulatory response, or if the impact of BqsR depends on the context at each promoter.

One common model used to predict the influence of transcription factors on global gene expression is based on position weight matrices (PWM)^{16,17,18,19,20,21,22,23,24,25}. Here, we used the previously derived PWM for the Fe(II) responsive transcription factor BqsR¹⁵ to search for potential binding sites throughout the genome. Hundreds of potential binding sites were found; RNA-Seq expression measurements revealed that the majority of genes containing potential BqsR operators were not strongly regulated by BqsR. This discrepancy suggests that either the PWM of BqsR does not accurately describe the interaction between BqsR and the genome, or that the surrounding operator context has a greater influence on promoter outputs than the local BqsR binding sequence. Through systematic measurements of how the architecture of BqsR containing promoters influences gene expression, we address how BqsR regulatory information is encoded in the genome and to what extent understanding the regulatory context is critical to develop a predictive understanding of the global effect of an individual transcription factor.

Results

Operator sequence diversity throughout the genome

To construct a quantitative model capable of predicting the magnitude of gene expression directly for any given promoter region, we first dissected how variability in the BqsR operator modulates gene expression. BqsR is an activator that binds upstream of the gene transcription start site (Fig. 1A). Earlier work established the BqsR binding sequence contained a pair of highly conserved pentamers (Fig. 1A)¹⁵. There were 432 operators with 2 or fewer mutations in the pair of consensus repeated pentamers (TTAAG(N)₆TTAAG) and over 5000 potential operators containing 3 mutations. This frequency of binding sites is not expected to occur by chance in the genome (Supplemental Fig. S1). Many potential BqsR operators were found within promoter regions throughout the genome, but it is not obvious at which of these operators BqsR binds and has a regulatory consequence.

The magnitude of BqsR-mediated gene expression for each of these potential operators can be predicted using a PWM for the operator binding site. A PWM uses a set of operator sequences to generate a DNA sequence motif reflecting the nucleotide frequency for each position in the sequence^16,17. The PWM can then be used to rank order the regulatory strength of each potential operator. The operator strength is calculated using,

in which S is the operator score calculated using the PWM (see Position weight matrix calculations in Supplemental Material for further details) and a and b are parameters relating the PWM score to the operator affinity¹⁷. Operator strength is assumed to be proportional to the affinity of the operator for BqsR^26,27,28.

Figure 1B compares the measured fold change in gene expression to the BqsR operator strength calculated using Equation 1, revealing a poor correlation between the PWM predictions and experimental expression. Scores were calculated for operators located in a promoter region, defined here as the region within 600 bp upstream of the protein coding sequence, using the PWM derived in¹⁵. The data in Fig. 1B was used to obtain values for parameters a and b, 1.6 and 1.9 respectively, which fall in the range of typical values²⁹. These values appear in Equation 1 to relate the PWM score to the operator strength. The fold change in expression was calculated from RNA-seq data in WT and ΔbqsR strains grown anaerobically and shocked with 200 μm Fe(II), as reported in¹⁵.

Typically it is assumed that the rate of gene expression at a given promoter is proportional to the affinity for the transcription factor to the operator, known as the occupancy hypothesis⁶. The fact that several binding sites with high PWM scores were not induced by an Fe(II) shock raises several questions: how is regulator-binding specificity achieved; how does operator sequence modulate promoter outputs; and to what extent does the surrounding promoter context influence BqsR-mediated regulatory responses?

Influence of the pentamer sequence on BqsR-mediated regulation

One potential cause of disagreement between model predictions and experimental measurements was that PWM did not accurately capture the relationship between operator sequence and binding affinity. To determine whether the PWM was missing key regulatory information, we experimentally dissected how the structure of the operator (i.e., the sequence, length, orientation and position) affected the level of gene expression. Although we analyzed operator diversity throughout the genome for clues as to which operator variations might impact BqsR-mediated expression, direct experimental measurements were used to construct our gene regulatory model. FIMO (Find Individual Motif Occurrences), part of the MEME Suite³⁰, was used to identify potential BqsR binding sites in the genome. These potential operator sequences were further characterized by comparing the sequence, location and orientation of the binding sites. The operator was split into three regions: the upstream and downstream pentamers and the spacer region (Fig. 1A). A small library of synthetic promoters was fused to the lacZ reporter gene and inserted into the genome at the glmS locus to quantify the influence of specific changes in promoter architecture on expression output. The synthetic promoter library was based on the BqsR binding sequence in the promoter for gene PA14_04180, the gene most highly upregulated by BqsR.

Previously, two repeated pentamers were found to be highly conserved and sufficient for BqsR binding¹⁵. The PWM score indicates the frequency of a given bp at each position in the sequence compared to the background distribution of bp in the genome [Supplemental Equations S1 and S2 and^16,17]. Bases with a high score indicate a particular base being favored at a given position and bases with negative scores indicate bases that are rare at a given position. The PWM of the pentamer regions was calculated using the 236 operator sequences from Fig. 1B containing up to 2 mutations in the pentamer regions (Fig. 2B). There is a non-uniform distribution of the scores, implying that some bases contribute more than others to the binding energy of BqsR to the operator.

A library of synthetic promoters based on a modified PA14_04180 promoter, shown in Fig. 2A, was constructed to test the influence of the pentamer sequence on gene expression. Only the upstream pentamer was mutated because the downstream pentamer overlaps the −35 region of the RNA polymerase (RNAP) binding site¹⁵. Additionally, the symmetry in the PWM scores in Fig. 2B suggests the two pentamer sequences may interact similarly with BqsR. The library contains all possible single point mutations for the upstream pentamer sequence, 15 constructs in total. In Fig. 2C, we report the gene expression level after Fe(II) shock for each mutant relative to the expression level of the wild-type promoter containing the binding site shown in Fig. 2A. Expression analysis of the promoter library revealed heterogeneity in the contribution of each position to the expression level. For example, the bases in position 5 have a weak influence on regulation, whereas mutating position 1 from T to C reduced expression nearly 10 fold. The large decrease in expression of this particular mutation may explain why a C in position 1 was rare in the potential binding sites found in the genome (Fig. 2B).

Influence of the spacer sequence on BqsR-dependent regulation

We also determined the natural variability of the spacer region between the two pentamers and its influence gene regulation. First, we examined the variability of spacer region lengths throughout the genome. All previously reported operator sequences had a spacer length of 6 bp, but it was unclear if spacer length modulated BqsR binding. Analyzing all the potential operators in the genome with spacer lengths of 5, 6 or 7 pairs shows that all the pentamer pairs with no mutations had 6-bp spacer regions and that a spacer length of 6-bp was most common for operators with a single pentamer mutation (Fig. 3A). Indeed, spacer length is a key operator parameter. Spacers of length 5 or 7 lowered gene expression to less than 10%, an expression level similar to an operator for which the upstream pentamer has been deleted (control in Fig. 3B). Together these results suggest that a 6-bp spacer region is essential for operator binding. We also examined the influence of operator orientation on expression. The reverse or reverse complement of the upstream pentamer reduced expression to background levels (Supplemental Fig. S2).

The spacer PWM scores show that some bases occur frequently at specific positions, such as a C at position 11, although the sequence logo reveals a low information content of the spacer region (Fig. 3C). Synthetic promoters were created to quantify the influence of the spacer region on gene regulation. Because the number of potential spacer sequences was large (4⁶ = 4096), targeted changes were made to the spacer region based on analysis in Fig. 3C. Figure 3D shows measurements of gene expression from these constructs, revealing the spacer region sequence has the potential to moderately influence expression levels. For the six sequences measured, gene expression levels were found to vary up to a factor of 2.

Binding site clusters

Yet another aspect of promoter architecture is the number of binding sites in the promoter region. We searched the genome for operator clusters with a 6-bp spacer region between pentamers and only considered operators with a maximum of 2 pentamer mutations. The genome contained many promoters with multiple BqsR operators, up to a maximum of 7 potential operators in the same promoter region, which cannot be accounted for by a random distribution of binding sites (Fig. 4A). Some promoters contained overlapping operators that share a common pentamer sequence, creating multiple repeats of the pentamer sequence separated by a 6-bp spacer region with up to 5 repeated pentamers, as shown in the schematic of Fig. 4B.

We next experimentally dissected the role of operator clusters in the regulatory output. The native promoter region for gene PA14_04180 contains 4 proximal pentamers as shown in Fig. 4C (the truncated PA14_04180 promoter used in Fig. 2 contained only the 2 downstream pentamers). The most downstream pentamer, labeled 4 in Fig. 4C, is unique given that it overlaps the −35, RNAP binding site in the promoter. We created several synthetic promoters that lacked one or more pentamers, all of which retained pentamer 4 given its dual role in both BqsR and RNAP binding (Fig. 4D). A pentamer was deleted by mutating the pentamer sequence from TTAAG to ACTCA. As shown in Fig. 4D, BqsR binding to pentamers 3 and 4 was critical for strong expression. The supplemental binding sites 1 and 2 led to an additive increase in promoter output, but were not essential for expression.

Deriving a binding energy matrix for BqsR

From the experimental dissection of BqsR-mediated gene expression, we constructed a BqsR activity matrix with a 6-bp spacer region to predict gene regulation for the entire genome. The activity matrix translates the operator nucleotide sequence to output gene expression. Because gene expression is assumed to be proportional to operator affinity, this matrix is referred to as a “binding energy matrix”. Such binding energy matrices have been developed previously for other transcription factors^22,31 and in the case of the Lac repressor in E. coli have been shown to accurately predict a wide range of promoter outputs³². We assumed an additive contribution of operator clusters, operator occupancy was linearly proportional to the change in expression level and stronger BqsR binding increased transcription.

For the pentamer region, measurements of the fold change in expression from Fig. 2C were converted to a change in binding energy using,

where k_B is Boltzmann’s constant, T is the temperature and ΔE is the change in the binding energy of BqsR to the operator (see Supplemental section “Predicting gene expression from operator sequence” and Figs S3 and S4 for further details). For the spacer region, a “best fit” binding energy matrix was fit to the expression data for the 7 variants of the spacer sequence measured in Fig. 3D (see Supplemental Materials section “Deriving the energy matrix for the spacer region” and Fig. S5).

The final 16-mer energy matrix reported the change in binding energy for any operator sequence relative to the initial sequence (Fig. 5B). Using this matrix we predicted the expression of a given promoter relative to a promoter containing the reference operator (Fig. 5A). For comparison, Fig. 5C showed the binding energy matrix derived from the PWM, scaled using the parameters a and b as in Fig. 1B. The PWM method of constructing an energy matrix differed from ours, in that we did not assume that the consensus sequence in the genome has the strongest binding energy. Although we analyzed potential operators throughout the genome for clues as to what operator parameters might influence regulation, ultimately, our matrix was based on direct, quantitative experimental measurements of gene expression from a library of synthetic operators containing lacZ-promoter fusions.

Global prediction of BqsR-mediated gene regulation

With these operator rules and the experimentally derived energy matrix (Fig. 5B), we made new predictions for BqsR-mediated gene regulation to test if the PWM missed key regulatory information encoded in the primary DNA sequence. If sequence information was missing from the PWM approach, predictions derived from Fig. 5B will significantly improve our predictive capability. The fold change in gene expression due to each potential operator in the genome was calculated using Equation 2 (see Supplemental Figs S6 and S7 for the distribution of predicted operator affinities). For promoter regions containing multiple BqsR binding sites, we assumed each operator acted independently and the total fold change was the sum of the fold change for each individual operator (see Supplemental section “Additive approximation for multi-operator promoters” for more details). Since it is known that the position of the operator relative to the transcription start site plays a role in regulatory outputs, we only included potential operators up to 600 bp upstream from the protein coding sequence. Additionally, genes were predicted to be BqsR-regulated if they contain a maximum of 3 mutations in the 10 base pairs encoding the upstream and downstream pentamers, given that most mutations only moderately influenced expression (Fig. 2C).

These predictions were compared to RNA-Seq measurements of global BqsR-mediated gene expression reported previously¹⁵. As a conservative annotation of BqsR-regulated genes we compare transcriptional units [regions which may contain several genes that are co-transcribed as defined by Wurtzel et al.³³ whose expression was changed 2-fold or greater in WT compared to the ∆bqsR mutant in response to Fe(II) shock. Figure 6A shows comparisons of the predicted fold change in expression for 75 transcriptional units, predicted using either our binding energy matrix (Fig. 5B) or the PWM derived binding energy matrix (Fig. 5C), to the experimentally measured fold change in expression. All predictions were normalized to the expression level of the reference gene, PA14_04180. The predictions from our model and the model based on the PWM were similar with our model predicting stronger expression for weak potential binding sites (Fig. 6B). Additionally, our model predictions were accurate for promoters giving a strong response, greater than 25 fold, to ferrous iron shock in experiments (Fig. 6C). These results suggested the energy matrix accurately predicted operator occupancy. Our inability to predict the regulatory influence of BqsR at most promoters was not rooted in a misunderstanding of BqsR binding, instead, for most genes the surrounding regulatory context of each promoter was more important than operator affinity in setting expression levels.

A closer look at genes that were poorly predicted

Both energy matrices in Fig. 5 poorly predicted BqsR-mediated regulation for the majority of the transcriptional units containing predicted BqsR operators. To explore why this might be, we examined the broader regulatory context of each promoter to determine whether the model overlooked key inputs into BqsR-mediated regulatory decisions. One parameter ignored in our initial predictions was the position of the BqsR operator relative to the gene transcription start site (TSS). The results shown in Fig. 6 consider operator position relative to the protein coding sequence (CDS), only including BqsR binding sites within 600 bp upstream of the protein coding sequence. However, the position of the BqsR binding site relative to TSS as opposed to the CDS is more relevant to transcriptional regulation⁶.

To gauge the influence of the spatial relationship between the TSS and the BqsR binding site on gene expression, the distance between the TSS and the BqsR operators was examined. Figure 7A showed the ratio of the predicted expression level to the experimentally measured expression level as a function of operator position relative to the TSS. The TSS for 62 out of the 75 transcriptional units predicted in Fig. 6 could be determined from RNA-Seq data or were known³³. A ratio near 1 indicates an accurate prediction, with higher values signifying greater error. Although it was interesting that most of the potential operators were found near the TSS, Fig. 7A indicated predictability was not correlated with operator position. The transcriptional units most accurately predicted had operators located within 200 bp upstream of the TSS, as would be expected for a typical activator in bacteria³⁴.

In general we predicted a higher level of expression than experimentally observed, potentially caused by our assumption of additive contributions from multiple operators in the same promoter region. To analyze the impact of operator number on predictive ability, we examined the ratio of predicted to measured expression as a function of the number of putative operators in the promoter region (Fig. 7B). Each individual BqsR operator contained two pentamer regions separated by 6 bp with any number of nucleotides allowed between the 16-mer operators. The results reveal a trend of poor predictions for promoter regions containing multiple BqsR binding sites. For promoter regions containing up to three operators, predictive ability varied, but in general predictions were within 50 fold of measured values. For promoter regions containing 4 or more binding sites, predictions were typically higher than experimental measurements by a factor of 50 or more, with the poorest predictions occurring for the promoter containing 8 operators. These results suggest that the assumption that operators behave independently may not be valid for promoter regions containing many operators and that some operators in large clusters of binding sites may be nonfunctional. Further analysis showed the additive model used above had similar predictions to a thermodynamic model derived for a promoter containing two operators (Supplemental Figs S8 and S9). Predictive ability also did not improve when considering only the strongest or most downstream operator at each promoter (Supplemental Fig. S10), or when taking operator orientation relative to the direction of gene coding sequence into account (Supplemental Fig. S11).

Role of additional transcription factors in regulation

We next examined whether the poorly predicted transcriptional units had other known transcription factor binding sites in their promoter regions. To address this, we took a two-pronged approach. We used the PWMs reported for the 12 P. aeruginosa transcription factors annotated on the Prodoric database²⁴. However, due to our findings that PWMs vastly overpredict potential operators, we also compared our predicted regulon (defined as the genes under BqsR control) with 13 experimentally validated P. aeruginosa regulons^{35,36,37,38,39,40,41,42,43,44,45,46,47,48}.

Only two transcription factors, Anr³⁵ and PqsR^42,43, had experimentally determined regulons that had a statistically significant overlap with the transcriptional units predicted to be upregulated by BqsR but which were not upregulated in the RNA-Seq data (Fig. 7C “false positives” and Supplemental Table S1). PqsR controls the Pseudomonas quorum sensing regulon that responds to PQS; Dong et al.¹⁴ showed the concentration of PQS is reduced in the ∆bqsR mutant. Cells were grown in anaerobic conditions to ensure Fe(II) remained stable, the condition where Anr is active. Figure 7C “false negatives” also shows overlap of the RpoN⁴⁴ regulon with transcriptional units whose expression levels changed as a result of the iron shock, but were not predicted. RpoN encodes an alternate sigma factor induced in stationary phase⁴⁹ and under nitrogen limitation and has been shown to influence quorum sensing regulation⁵⁰. The cells used in the RNA-Seq experiment were harvested in late stationary phase. All of the overlapping genes predicted showed upregulation by RpoN.

The PWMs from the Prodoric database were used to predict potential transcription factor binding sites throughout the P. aeruginosa PA14 genome. In an attempt to limit the error associated with PWM-based predictions, for each transcription factor we assumed only the most probable 100 binding sites were capable of having a regulatory influence. Overall we identified potential secondary regulators in 25 of the 75 predicted transcriptional units (Fig. 7D). To gauge whether the presence of specific transcription factors led to poor predictions of gene expression, the average prediction error, the predicted fold change in expression divided by the experimental fold change in expression, was calculated for all the genes containing a secondary transcription factor (Fig. 7D). Genes containing the transcription factors Fur, PsrA and Vfr were all predicted well, despite the potential influence of these additional regulators. Expression measurements for genes containing AlgR, Anr, ExsA, RcsB and RhlR in their promoter regions deviated most from predictions. Because the presence of a potential RhlR binding site in promoter regions caused the most deviation between predictions and experimental measurements, the effect of RhlR on BqsR-mediated gene expression was examined in further detail.

Measuring gene expression from promoters coregulated by both BqsR and RhlR

Prodoric was used to search promoter regions of BqsR-regulated genes for potential RhlR binding sites. Intriguingly, the RhlR binding motif overlapped with the BqsR binding site in many of the promoters. In the bqs promoter the RhlR binding site overlaps with the upstream BqsR pentamer and the downstream BqsR pentamer overlaps with the −35 RNAP binding site. In these overlapping promoters, because RhlR and BqsR cannot bind simultaneously, high levels of RhlR should competitively exclude BqsR and thus lower expression. Ordinarily the quorum sensing gene rhlR is upregulated in response to an autoinducer indicative of high cell density during stationary phase. However, in the absence of autoinducer, RhlR can act as a repressor⁵¹. An rhlR overexpressing strain was used to express RhlR in early exponential phase. To determine the effect of RhlR on bqsR expression, qPCR analysis was used. Figure 8 shows BqsR-dependent expression of genes predicted to have an RhlR binding site. For all genes, the RhlR overexpressing strain showed a statistically significant (p-value ≤ 0.05 by unpaired two-tailed t-test) decrease in expression compared to wildtype. For oprH, a gene containing a BqsR-responsive operator but not a putative RhlR binding site, expression did not significantly change when RhlR was overexpressed. RhlR significantly changes the regulatory influence of BqsR in promoter regions containing BqsR and RhlR operators.

Discussion

Using the P. aeruginosa Fe(II) responsive regulator, BqsR, as a test case, we examined our ability to make quantitative predictions about the influence of an individual transcription factor on global gene expression levels. A PWM model did not accurately predict global gene expression patterns, leading us to hypothesize that either the PWM model did not capture how operator affinity was encoded in the operator sequence, or that the regulatory influence of BqsR was dictated by the surrounding regulatory context of each promoter. A detailed model of operator binding through a synthetic promoter library revealed that predicting transcription factor affinity alone was insufficient to predict the global expression levels. Although clusters of overlapping operators had a combined impact on regulatory outputs, promoters containing large numbers of potential operators were poorly predicted by the model. The proximity to the transcription start site also did not correlate with predictive ability, despite the most upregulated genes containing potential operators within 200 bp of the transcription start site. These finding suggest that secondary regulators were important in determining the influence of BqsR on expression levels at promoters throughout the genome, as supported by the impact of RhlR in modulating BqsR-mediated expression.

While our model of operator occupancy outperformed the original PWM-derived model, the improvement was modest. Similarity between our binding energy matrix and the PWM supports the ability of PWM to describe operator affinity, at least when enough operator sequences can be identified to calculate an accurate binding matrix. However, our results underscore that caution should be used in relating operator strength to expression levels. Many genes highly upregulated in experiments were accurately predicted (within a factor of 3), but many operators with binding strengths predicted to be similar to the most upregulated gene (PA14_04180) had weak or immeasurable expression. This type of disagreement between experimental measurements and predictions highlights how difficult it is to make reliable predictions of gene expression directly from the genome sequence and call attention to the need to more systematically study the influence of promoter diversity on expression.

That the broader regulatory context at each promoter may control the influence of individual transcription factors is not a new idea. Several studies have attempted to predictively integrate inputs from multiple transcription factors at a single promoter, however none of these studies used their findings to predict expression for additional promoter contexts within the genome^52,53,54,55. DNA structure is another aspect of the regulatory context that modulates promoter outputs. Genome shape and mechanics, such as nucleosome wrapping and DNA loop formation, mediate both transcription factor binding and interactions^3,8,56. DNA can also mediate allosteric effects between adjacent transcription factors⁵⁷. Regulatory interactions have also been explored from a systems perspective⁵⁸. One study reported that only 60% of the interactions between regulators could be accurately predicted in E. coli⁴. Despite careful studies on many aspects of the broader regulatory context in several systems, it remains unclear to what extent a general framework can predict the influences of promoter diversity on regulation.

Moving forward, we can leverage the lessons learned here by examining in more detail remaining questions. For example, several promoters have large clusters of binding sites, unlikely to be present by pure chance, however our predictive ability decreased with increasing cluster size. Perhaps under the conditions measured, weaker operators have a low probability of occupancy and therefore do not contribute to regulation⁵⁹, although how weak an operator must be before it no longer modulates expression level is unclear and may be context dependent⁶⁰. By examining a broader set of expression conditions, we may be able to develop a set of rules that predict when and how the number of operators in the clusters is significant. Additionally, we should transition from the bioinformatic analysis presented here to rigorous experimental quantification of the role multiple transcription factors play in modulating promoter outputs. Future quantitative regulatory models should incorporate feedback in the dynamics of transcription factor levels. Bacteria respond to a wide variety of external stimuli, offering useful model systems in which to understand the logic and mechanisms of signal integration at the promoter level. Such work would complement ongoing efforts in synthetic and developmental biology^55,58,61.

Methods

Growth media and culturing conditions

P. aeruginosa PA14 was grown both aerobically and anaerobically at 37 °C in MOPS minimal medium (MOMM) in acid washed glassware to ensure cells were Fe-limited. The basic MOMM is composed of 40 mM C₄H₄Na₂O₄ · 6 H₂O, 9.3 mM NH₄Cl, 2.2 mM KH₂PO₄, 25 mM KNO₃, 25 mM NaNO₃, 25 mM MOPS, 25 mM NaMOPS pH 7.2. Additionally, immediately prior to inoculation 100 μM CaCl₂, 1 μM (NH₄)₂Fe(SO₄)₂ 6 H₂O, 1 mM MgSO₄ and trace metals were added⁶². Any composition changes are noted. All PA14 cultures were prepared by inoculation of MOMM media with the desired strains for 16 hours overnight shaking aerobically then grown anaerobically in a coy chamber with an atmosphere of 80% N₂, 15% CO₂ and 5% H₂ at 37 °C.

Strain construction

The strains used in this work were constructed from the wild type strain P. aeruginosa UCBPP-PA14. To monitor gene expression in these strains, the gene construct containing the versions of the PA14_04180 promoter attached to the lacZ reporter gene were inserted into the genome. Briefly, Gibson assembly was used to insert a gene construct containing 530 bases of the wildtype PA14_04180 attached to lacZ between the transposon sites of the plasmid pUC18T-mini-Tn7t. The region between the transposon sites contained a selection marker for growth on gentamicin. This base construct was then mutated using site directed mutagenesis to create synthetic versions of the promoter. Gene constructs were transferred into the glmS locus of the genome of P. aeruginosa using triparental mating⁶³. Once inserted, constructs were verified using Sanger sequencing.

Measuring gene expression of mutant library in response to iron shock

The gene reporter lacZ quantified the change in gene expression in response to changes in ferrous iron. Anaerobic cultures grown to final OD₅₀₀ of 0.2–0.3 were uncapped inside of an anaerobic chamber and aliquoted to 1.7 ml tubes containing a final concentration of 400 μM ferrous iron (solutions of FeNH₄SO₄). Shocking with 400 μM Fe(II) more rapidly induced the maximal BqsR-mediated regulatory response observed with 200 μM Fe(II), which was used in previous RNA-Seq experiments⁹ (Supplemental Fig. S12). Fe(II) has been measured approaching 300 μM in CF sputum¹². A ferrozine assay was used to confirm the concentration of the stock ferrous iron solution. The ferrous iron treatment was performed in triplicate for each strain measured. Cells were incubated in the anaerobic chamber at room temperature for 4 hours and then transferred to a 96 well plate for measurement of gene expression. Each well received 150 μl of ferrous iron treated cells with 50 μL of a media containing 50 ng/ml of the fluorogenic LacZ indicator fluorescein di-β-D-Galactopyranoside (FDG, Marker Gene Technologies, Inc.). OD₅₅₀ and fluorescence with an excitation and emission of 490 nm and 520 nm respectively were measured in each well under anaerobic conditions every 5 minutes for 1 hour with a BioTek Synergy 4 plate reader. See Supplemental Fig. S12 for control experiments regarding this procedure.

To calculate the change in gene expression after Fe(II) shock, the background corrected fluorescence measurements were divided by the background corrected absorbance measurement to quantify the gene expression per cell. To calculate the fold change in gene expression for a given strain, the gene expression measurement was divided by the expression from a strain containing the reference BqsR operator shown in Fig. 2A, the downstream operator of the PA14_04180 gene.

RhlR and BqsR co-regulon prediction

The position weight matrix for the RhlR DNA binding site⁴⁰ was input into FIMO³⁰, a tool which searches for a consensus sequence within a database. In this case, the database supplied was the 500 bp upstream from the translation start site for genes in the BqsR regulon.

Effect of RhlR on Fe(II) shock conditions

Aerobic cultures of P. aeruginosa WT-pMQ72⁶⁴, ∆bqsR-pMQ72 and WT-pMQ72-rhlR were grown in 3 ml MOMM supplemented with 100 μg/ml gentamycin at 37 °C for 36 hours. Anaerobic cultures were grown in 20 ml MOMM supplemented with 100 μg/ml gentamycin and 1% arabinose (to induce rhlR expression) with 1% inoculum from aerobic overnight culture. When the cells reached early exponential phase (Beckman spectrophotometer 20; OD₅₀₀ = 0.2), 9 ml of culture was removed. 4.5 ml culture was added to 9 ml of RNAprotect (Qiagen) before and after a 30 minute 200 μM ferrous ammonium sulfate shock at room temperature. The cells were incubated with RNAprotect for 5 minutes and centrifuged for 10 minutes at 5000 × g. The supernatant was discarded and the pellets stored at −80 °C.

mRNA isolation and qPCR data analysis

mRNA was isolated from stored cell pellets using the RNeasy kit mini (Qiagen) with optional on-column DNA digestion according to the manufacturer’s instructions. Subsequently, the RNA was treated with TURBO DNA-free (Applied Biosystems). cDNA was generated with iScript (Bio-Rad) random-primed reverse transcriptase reaction following the manufacturer’s protocol. An mRNA genomic contamination control and cDNA was used as template for quantitative-reverse transcriptase-PCR (Real Time 7500 PCR Machine, Applied Biosystems) using iTaq Universal SYBR Green Supermix (Bio-Rad). Samples were assayed with 3–5 biological replicates. recA and clpX were used as endogenous controls⁶⁵. Fold changes were calculated using the ΔΔC_t method⁹. To ensure recA was constant in all conditions tested, the relative fold change was measured for the internal control clpX, whose expression was also expected to remain constant across all our treatments. Only those samples with a clpX fold change between 0.5–2 were used. Log₂ of the final fold change was reported. Results were compared with an unpaired 2-tailed t-test assuming unequal variances.

Ferrozine assay

This colorimetric assay measures Fe(II) concentration. The Stookey method⁶⁶ was modified for 96-well plate format. All measured Fe(II) concentrations were within 5% of reported value.

Bioinformatics

To analyze potential BqsR binding sites in the genome, occurrences of the motif (TTAAG(N)6TTAAG) were found in the genome of Pseudomonas aeruginosa PA14 using the program FIMO (Find Individual Motif Occurrences), part of the MEME Suite30. Motif occurrences were then sorted and analyzed using Matlab. The same process was used to locate binding sites of other transcription factors using binding motifs listed in the Prodoric database24. Sequence logos were calculated using WebLogo 3.4.

Comparison to the other operons

From published microarray and RNA-Seq papers^{35,36,37,38,39,40,41,42,43,44,45,46,47,48} regulons for other transcription factors were defined. The list of genes in the transcription factor regulons were converted to transcriptional units³³. Comparisons between transcriptional units (TU) in the regulon to two lists were made to discover the number of shared TUs: TUs in prediction but not observed in RNA-Seq data and TUs in RNA-Seq data but not predicted. In R, a hypergeometric test assigned a p-value to the overlapping regulons. For those regulons with significant overlap, whether the TUs were upregulated or downregulated was noted. See Supplemental Material for further details.

Additional Information

How to cite this article: Kreamer, N. et al. Predicting the impact of promoter variability on regulatory outputs. Sci. Rep. 5, 18238; doi: 10.1038/srep18238 (2015).

References

Kung, J. T., Colognori, D. & Lee, J. T. Long noncoding RNAs: past, present and future. Genetics 193, 651–69 (2013).
Article CAS Google Scholar
Meister, G. Argonaute proteins: functional insights and emerging roles. Nat Rev Genet 14, 447–59 (2013).
Article CAS Google Scholar
Levo, M. & Segal, E. In pursuit of design principles of regulatory sequences. Nat Rev Genet 15, 453–468 (2014).
Article CAS Google Scholar
Faith, J. J. et al. Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles. Plos Biology 5, 54–66 (2007).
Article CAS Google Scholar
Spitz, F. & Furlong, E. E. M. Transcription factors: from enhancer binding to developmental control. Nature Reviews Genetics 13, 613–626 (2012).
Article CAS Google Scholar
Garcia, H. G. et al. Operator Sequence Alters Gene Expression Independently of Transcription Factor Occupancy in Bacteria. Cell Reports 2, 150–161 (2012).
Article CAS Google Scholar
Mayo, A. E., Setty, Y., Shavit, S., Zaslaver, A. & Alon, U. Plasticity of the cis-regulatory input function of a gene. Plos Biology 4, 555–561 (2006).
Article CAS Google Scholar
Lam, F. H., Steger, D. J. & O’Shea, E. K. Chromatin decouples promoter threshold from dynamic range. Nature 453, 246–U16 (2008).
Article CAS ADS Google Scholar
Kreamer, N. N. K., Wilks, J. C., Marlow, J. J., Coleman, M. L. & Newman, D. K. BqsR/BqsS Constitute a Two-Component System That Senses Extracellular Fe(II) in Pseudomonas aeruginosa. Journal of Bacteriology 194, 1195–1204 (2012).
Article CAS Google Scholar
Vasil, M. L. & Ochsner, U. A. The response of Pseudomonas aeruginosa to iron: genetics, biochemistry and virulence. Molecular Microbiology 34, 399–413 (1999).
Article CAS Google Scholar
Singh, P. K., Parsek, M. R., Greenberg, E. P. & Welsh, M. J. A component of innate immunity prevents bacterial biofilm development. Nature 417, 552–555 (2002).
Article CAS ADS Google Scholar
Hunter, R. C. et al. Ferrous Iron Is a Significant Component of Bioavailable Iron in Cystic Fibrosis Airways. mBio 4 (2013).
Castro, A. P., Fernandes, G. D. R. & Franco, O. L. Insights into novel antimicrobial compounds and antibiotic resistance genes from soil metagenomes. Frontiers in Microbiology 5 (2014).
Dong, Y.-H., Zhang, X.-F., An, S.-W., Xu, J.-L. & Zhang, L.-H. A novel two-component system BqsS-BqsR modulates quorum sensing-dependent biofilm decay in Pseudomonas aeruginosa. Commun Integr Biol 1, 88–96 (2008).
Article CAS Google Scholar
Kreamer, N. N. K., Costa, F. & Newman, D. K. The ferrous iron responsive BqsRS two component system activates genes that promote cationic stress tolerance. mBIo, n press (2015).
Hertz, G. Z. & Stormo, G. D. Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics 15, 563–577 (1999).
Article CAS Google Scholar
Vilar, J. M. G. Accurate Prediction of Gene Expression by Integration of DNA Sequence Statistics with Detailed Modeling of Transcription Regulation. Biophysical Journal 99, 2408–2413 (2010).
Article CAS ADS Google Scholar
Zhu, L. J. et al. FlyFactorSurvey: a database of Drosophila transcription factor binding specificities determined using the bacterial one-hybrid system. Nucleic Acids Research 39, D111–D117 (2011).
Article CAS Google Scholar
Li, H., Rhodius, V., Gross, C. & Siggia, E. D. Identification of the Binding Sites of Regulatory Proteins in Bacterial Genomes. Proceedings of the National Academy of Sciences of the United States of America 99, 11772–11777 (2002).
Article CAS ADS Google Scholar
Chen, Z. et al. Discovery of Fur binding site clusters in Escherichia coli by information theory models. Nucleic Acids Res 35, 6762–77 (2007).
Article CAS Google Scholar
Gilbert, K. B., Kim, T. H., Gupta, R., Greenberg, E. P. & Schuster, M. Global position analysis of the Pseudomonas aeruginosa quorum-sensing transcription factor LasR. Molecular microbiology 73, 1072–1085 (2009).
Article CAS Google Scholar
Stauff, D. L. & Bassler, B. L. Quorum Sensing in Chromobacterium violaceum: DNA Recognition and Gene Regulation by the CviR Receptor. Journal of Bacteriology 193, 3871–3878 (2011).
Article CAS Google Scholar
Keseler, I. M. et al. EcoCyc: fusing model organism databases with systems biology. Nucleic Acids Research 41, D605–D612 (2013).
Article CAS Google Scholar
Munch, R. et al. PRODORIC: prokaryotic database of gene regulation. Nucleic Acids Research 31, 266–269 (2003).
Article CAS Google Scholar
Zwir, I. et al. Dissecting the PhoP regulatory network of Escherichia coli and Salmonella enterica. Proceedings of the National Academy of Sciences of the United States of America 102, 2862–2867 (2005).
Article CAS ADS Google Scholar
Kim, A.-R. et al. Rearrangements of 2.5 Kilobases of Noncoding DNA from the Drosophila even-skipped Locus Define Predictive Rules of Genomic cis-Regulatory Logic. PLoS Genetics 9, e1003243 (2013).
Article CAS Google Scholar
Leith, J. S. et al. Sequence-dependent sliding kinetics of p53. Proceedings of the National Academy of Sciences of the United States of America 109, 16552–16557 (2012).
Article CAS ADS Google Scholar
Berg, O. G. & Von Hippel, P. H. Selection of DNA binding sites by regulatory proteins. Journal of Molecular Biology 193, 723–743 (1987).
Article CAS Google Scholar
Ma, X., Ezer, D., Navarro, C. & Adryan, B. Reliable scaling of position weight matrices for binding strength comparisons between transcription factors. BMC Bioinformatics 16, 265 (2015).
Article Google Scholar
Bailey, T. L. et al. MEME Suite: tools for motif discovery and searching. Nucleic Acids Research 37, W202–W208 (2009).
Article CAS Google Scholar
Kinney, J. B., Murugan, A., Callan, C. G., Jr. & Cox, E. C. Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence. Proceedings of the National Academy of Sciences of the United States of America 107, 9158–9163 (2010).
Article CAS ADS Google Scholar
Brewster, R. C., Jones, D. L. & Phillips, R. Tuning Promoter Strength through RNA Polymerase Binding Site Design in Escherichia coli. Plos Computational Biology 8, e1002811–e1002811 (2012).
Article CAS ADS Google Scholar
Wurtzel, O. et al. The Single-Nucleotide Resolution Transcriptome of Pseudomonas aeruginosa Grown in Body Temperature. PLoS Pathog 8, e1002945 (2012).
Article Google Scholar
Lee, D. J., Minchin, S. D. & Busby, S. J. Activating transcription in bacteria. Annu Rev Microbiol 66, 125–52 (2012).
Article CAS Google Scholar
Trunk, K. et al. Anaerobic adaptation in Pseudomonas aeruginosa: definition of the Anr and Dnr regulons. Environmental Microbiology 12, 1719–1733 (2010).
Article CAS Google Scholar
Lu, C.-D., Yang, Z. & Li, W. Transcriptome Analysis of the ArgR Regulon in Pseudomonas aeruginosa. Journal of Bacteriology 186, 3855–3861 (2004).
Article CAS Google Scholar
Ochsner, U. A., Wilderman, P. J., Vasil, A. I. & Vasil, M. L. GeneChip® expression analysis of the iron starvation response in Pseudomonas aeruginosa: identification of novel pyoverdine biosynthesis genes. Molecular Microbiology 45, 1277–1287 (2002).
Article CAS Google Scholar
Palma, M., Worgall, S. & Quadri, L. N. Transcriptome analysis of the Pseudomonas aeruginosa response to iron. Archives of Microbiology 180, 374–379 (2003).
Article CAS Google Scholar
Tian, Z.-X. et al. Transcriptome profiling defines a novel regulon modulated by the LysR-type transcriptional regulator MexT in Pseudomonas aeruginosa. Nucleic Acids Research 37, 7546–7559 (2009).
Article CAS Google Scholar
Schuster, M. & Greenberg, E. Early activation of quorum sensing in Pseudomonas aeruginosa reveals the architecture of a complex regulon. BMC Genomics 8, 1–11 (2007).
Article Google Scholar
Wagner, V. E., Bushnell, D., Passador, L., Brooks, A. I. & Iglewski, B. H. Microarray Analysis of Pseudomonas aeruginosa Quorum-Sensing Regulons: Effects of Growth Phase and Environment. Journal of Bacteriology 185, 2080–2095 (2003).
Article CAS Google Scholar
Bredenbruch, F., Geffers, R., Nimtz, M., Buer, J. & Häussler, S. The Pseudomonas aeruginosa quinolone signal (PQS) has an iron-chelating activity. Environmental Microbiology 8, 1318–1329 (2006).
Article CAS Google Scholar
Déziel, E. et al. The contribution of MvfR to Pseudomonas aeruginosa pathogenesis and quorum sensing circuitry regulation: multiple quorum sensing-regulated genes are modulated without affecting lasRI, rhlRI or the production of N-acyl- l-homoserine lactones. Molecular Microbiology 55, 998–1014 (2005).
Article Google Scholar
Damron, F. H. et al. Analysis of the Pseudomonas aeruginosa Regulon Controlled by the Sensor Kinase KinB and Sigma Factor RpoN. Journal of Bacteriology 194, 1317–1330 (2012).
Article CAS Google Scholar
Schuster, M., Hawkins, A. C., Harwood, C. S. & Greenberg, E. P. The Pseudomonas aeruginosa RpoS regulon and its relationship to quorum sensing. Molecular Microbiology 51, 973–985 (2004).
Article CAS Google Scholar
Burrowes, E., Baysse, C., Adams, C. & O’Gara, F. Influence of the regulatory protein RsmA on cellular functions in Pseudomonas aeruginosa PAO1, as revealed by transcriptome analysis. Microbiology 152, 405–418 (2006).
Article CAS Google Scholar
Wolfgang, M. C., Lee, V. T., Gilmore, M. E. & Lory, S. Coordinate Regulation of Bacterial Virulence Genes by a Novel Adenylate Cyclase-Dependent Signaling Pathway. Developmental Cell 4, 253–263
Hentzer, M. et al. Attenuation of Pseudomonas aeruginosa virulence by quorum sensing inhibitors (2003).
Farrell, M. J. & Finkel, S. E. The Growth Advantage in Stationary-Phase Phenotype Conferred by rpoS Mutations Is Dependent on the pH and Nutrient Environment. Journal of Bacteriology 185, 7044–7052 (2003).
Article CAS Google Scholar
Heurlier, K., Dénervaud, V., Pessi, G., Reimmann, C. & Haas, D. Negative Control of Quorum Sensing by RpoN (σ54) in Pseudomonas aeruginosa PAO1. Journal of Bacteriology 185, 2227–2235 (2003).
Article CAS Google Scholar
Medina, G., Juarez, K., Valderrama, B. & Soberon-Chavez, G. Mechanism of Pseudomonas aeruginosa RhlR transcriptional regulation of the rhlAB promoter. J Bacteriol 185, 5976–83 (2003).
Article CAS Google Scholar
Razo-Mejia, M. et al. Comparison of the theoretical and real-world evolutionary potential of a genetic circuit. Physical Biology 11 (2014).
Kuhlman, T., Zhang, Z., Saier, M. H., Jr. & Hwa, T. Combinatorial transcriptional control of the lactose operon of Escherichia coli. Proceedings of the National Academy of Sciences of the United States of America 104, 6043–6048 (2007).
Article CAS ADS Google Scholar
Aow, J. S. Z. et al. Differential binding of the related transcription factors Pho4 and Cbf1 can tune the sensitivity of promoters to different levels of an induction signal. Nucleic Acids Research 41, 4877–4887 (2013).
Article CAS Google Scholar
Smith, R. P. et al. Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model. Nature Genetics 45, 1021–8 (2013).
Article CAS Google Scholar
Boedicker, J. Q., Garcia, H. G., Johnson, S. & Phillips, R. DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation. Physical Biology 10 (2013).
Kim, S. et al. Probing Allostery Through DNA. Science 339, 816–819 (2013).
Article CAS ADS Google Scholar
Segal, E., Raveh-Sadka, T., Schroeder, M., Unnerstall, U. & Gaul, U. Predicting expression patterns from regulatory sequence in Drosophila segmentation. Nature 451, 535–540 (2008).
Article CAS ADS Google Scholar
Brewster, R. C. et al. The Transcription Factor Titration Effect Dictates Level of Gene Expression. Cell 156, 1312–1323 (2014).
Article CAS Google Scholar
Tanay, A. Extensive low-affinity transcriptional interactions in the yeast genome. Genome Research 16, 962–972 (2006).
Article CAS Google Scholar
Kosuri, S. et al. Composability of regulatory sequences controlling transcription and translation in Escherichia coli. Proceedings of the National Academy of Sciences of the United States of America 110, 14024–14029 (2013).
Article CAS ADS Google Scholar
Kopf, S. H., Henny, C. & Newman, D. K. Ligand-Enhanced Abiotic Iron Oxidation and the Effects of Chemical versus Biological Iron Cycling in Anoxic Environments. Environmental Science & Technology 47, 2602–2611 (2013).
Article CAS ADS Google Scholar
Choi, K.-H. & Schweizer, H. P. Mini-Tn7 insertion in bacteria with single attTn7 sites: example Pseudomonas aeruginosa. Nature Protocols 1, 153–161 (2006).
Article CAS Google Scholar
Shanks, R. M. Q., Caiazza, N. C., Hinsa, S. M., Toutain, C. M. & O’Toole, G. A. Saccharomyces cerevisiae-based molecular tool kit for manipulation of genes from gram-negative bacteria. Applied and Environmental Microbiology 72, 5027–5036 (2006).
Article CAS Google Scholar
Dietrich, L. E. P., Price-Whelan, A., Petersen, A., Whiteley, M. & Newman, D. K. The phenazine pyocyanin is a terminal signalling factor in the quorum sensing network of Pseudomonas aeruginosa. Molecular Microbiology 61, 1308–1321 (2006).
Article CAS Google Scholar
Stookey, L. L. Ferrozine—a new spectrophotometric reagent for iron. Analytical Chemistry 42, 779–781 (1970).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by the National Institutes of Health, grant numbers DP1 OD000217, (Directors Pioneer Award), 5R01HL117328-03, R01 GM085286 and R01 GM085286B and 1 U54 CA143869 (Northwestern PSOC Center); La Fondation Pierre Gilles de Gennes (R.P.); the Center for Microbial Interactions at Caltech (J.B.). D.K.N. is an HHMI Investigator.

Author information

Authors and Affiliations

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, 91125, CA, USA
Naomi N. Kreamer, Rob Phillips & Dianne K. Newman
Department of Chemistry, California Institute of Technology, Pasadena, 91125, CA, USA
Naomi N. Kreamer
Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, 91125, CA, USA
Dianne K. Newman
Department of Applied Physics, California Institute of Technology, Pasadena, 91125, CA, USA
Rob Phillips
Department of Physics and Astronomy, University of Southern California, Los Angeles, 90089, CA, USA
James Q. Boedicker
Department of Biological Sciences, University of Southern California, Los Angeles, 90089, CA, USA
James Q. Boedicker

Authors

Naomi N. Kreamer
View author publications
You can also search for this author in PubMed Google Scholar
Rob Phillips
View author publications
You can also search for this author in PubMed Google Scholar
Dianne K. Newman
View author publications
You can also search for this author in PubMed Google Scholar
James Q. Boedicker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.K., D.N., R.P. and J.B. conceived and designed the study, N.K. and J.B. performed experiments and analyzed the results, N.K., D.N., R.P. and J.B. all contributed to writing the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Kreamer, N., Phillips, R., Newman, D. et al. Predicting the impact of promoter variability on regulatory outputs. Sci Rep 5, 18238 (2016). https://doi.org/10.1038/srep18238

Download citation

Received: 27 August 2015
Accepted: 16 November 2015
Published: 17 December 2015
DOI: https://doi.org/10.1038/srep18238

This article is cited by

Evolutionary potential of transcription factors for gene regulatory rewiring
- Claudia Igler
- Mato Lagator
- Călin C. Guet
Nature Ecology & Evolution (2018)
Genetic cargo and bacterial species set the rate of vesicle-mediated horizontal gene transfer
- Frances Tran
- James Q. Boedicker
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.