Introduction

In the three domains of life, DNA-binding proteins are involved in genome organization, packaging it into eukaryotic organelles or into the cell (in bacteria or archaea), while efficiently accommodating DNA-based processes such as transcription, replication and repair. Each organism has a characteristic set of DNA-binding proteins that has unique regulatory features. Advances in this field have undoubtedly benefited from high-resolution structures of DNA-protein complexes, as well as analyses of the global and local effects of these proteins on chromatin structure.

DNA-binding proteins are classified in three categories: benders, bridgers and wrappers1. Focusing on archaea, Alba and histone proteins are the most widely represented2,3,4. They belong to the groups of bridgers and wrappers, respectively. Alba is present in all archaea, except in two classes of Euryarchaea (Methanomicrobia and Haloarchaea). In archaea lacking Alba, the Methanogen Chromosomal protein 1 (MC1) is present. Alba shows a bimodal DNA-binding behavior by bridging or stiffening the DNA5. The Alba family of proteins exhibits functional diversities: genome packaging and organization, transcriptional and translational regulation (through acetylation/deacetylation mechanisms), RNA metabolism, development and differentiation processes6. Histone proteins are present in all Euryarchaea (except Thermoplasmatales) and in some Crenarchaea. The homo- and heterodimers of HMfA and HMfB proteins, from the hyperthermophile Methanothermus fervidus, are the benchmark of the wrapper histone proteins7. They oligomerize in the presence of DNA, further forming tetramers and even hexamers. Besides, in order to compact DNA, histone proteins modulate transcriptional regulation by interplaying with transcription factors8.

Several other archaeal nucleoid-associated proteins (NAPs) belonging to the benders are less widely distributed among the archean phyla. In Crenarchaea lacking histone proteins, Cren7 is always present whereas Sul7 (also known as Sso7d or Sac7d) is restricted to Sulfolobus8,9. They non-specifically bind to DNA by inserting hydrophobic residues into the minor groove, bending the DNA by up to 60° 10,11,12. HTa and HUTvo belonging to the HU structural superfamily are found in Thermoplasma species, which are the only Euryarchaea lacking histone proteins13,14. The dimeric HU-family proteins present a compact core of α-helices with two emerging flexible β-ribbon arms14. They form low-affinity HU-DNA complexes without sequence specificity and high-affinity complexes with flexible or bent DNA, such as single-strand nicks and gaps, and bulged and branched DNA structures15,16,17.

The MC1 protein studied here belongs to Methanosarcina thermophila. In laboratory growth conditions, MC1 is the most abundant architectural protein in Methanosarcina thermophila CHTI5518 and was primarily classified as histone-like protein19,20,21. Depending on the growth conditions, MC1 could compact and organize DNA in a number of different ways. It was also proposed to be involved in DNA transcription (in vitro essays)22 and cellular division (proteomics by LCMS)23,24, through as yet unknown mechanisms. This small basic monomeric protein of 93 residues (UniProt P12770) is structurally unrelated to other DNA-binding proteins25,26. Its affinity, determined by gel retardation essays under direct competitive conditions, for any double-stranded DNA is high (KD ~10−7 M) without strong sequence specificity but with a conformational preference for bent DNA, such as negatively supercoiled DNA minicircles (KD <10−8 M)20 and four-way junctions (KD ~2.5 10−8 M)27. Binding of two MC1 molecules per DNA minicircle induces two diametrically opposed kinks visible by electron microscopy21 and binding of MC1 to linear 176 base pairs (bp) DNA duplexes leads to a bend angle estimated to be 116° 19. When DNA is bound to MC1, it is protected against thermal denaturation18 and radiolysis28.

Recently, combining NMR and biochemical data through data-driven docking, we proposed an atypical compaction mechanism of DNA by MC129. We describe here the solution structure of an MC1-DNA15bp complex, uncovering the first structure of a monomeric protein bound to the concave side of a strongly bent and distorted DNA (PDB code 2NBJ). This new data allowed us to establish that MC1, as a monomer, is able to induce a V-turn on DNAs. The existence of such a conformation opens new insights into MC1-related and MC1-based biological processes, including transcription, replication and repair.

Results

In this work, a new type of a protein-DNA complex structure, formed by MC1 and a DNA15bp was determined using NMR spectroscopy data. To avoid overlaps in NMR resonance peaks, we chose the [AAAAACACACACCCA] DNA15bp sequence from the consensus [AAAAACACAC(A/C)CCC(C/A)] sequences revealed by a SELEX (systematic evolution of ligands by exponential experiment) procedure30.

We observed six NOEs between the protons of the side chain of Pro72 with the sugar protons of T16, G17 and G18 located in the minor groove. Twelve other NOEs were unambiguously identified to the side chain protons of Trp74 (HB1, HB2, HD1, HE3 and HZ2) in contact with H2 and sugar protons of A15 (H1′) and T16 (H2′, H3′, H5′) and amino protons H21/22 of G17, all located in the minor groove. Moreover, we observed eight NOEs between the side chain protons of Ile89 (HB, HD1#) and the sugar protons of A5 (H5′) and T30 (H1′, H4′, H5′and H5″) located in the minor groove. The amino side chain of Gln23 gave observable NOEs with sugar protons of T22 and amino protons of C6 located in the major groove. The use of NOE restraints is generally not sufficient to determine global structural features such as bending of nucleic acids. RDC constraints, known to improve the precision and accuracy of both the local and global structures of the double helix31,32, were measured on both the DNA (the labelled strand) and protein. Their addition in the calculation was an absolute necessity to resolve the 3D structure of the complex.

The type and number of restraints, and the structural statistics for the twelve structures deposited to the PDB are reported in Table 1. Superimposition of these structures clearly shows that they are well defined (Fig. 1).

Table 1 Experimental restraints and structure statistics for MC1-DNA15bp complex.
Figure 1
figure 1

Overlay of the twelve-deposited structures of the MC1-DNA15bp in cartoon representation. The rmsd for all heavy atoms of the complex is equal to 0.7 Å.

MC1 slightly adapts its conformation upon DNA binding

Small conformational changes of the 3D structure of MC1 occur upon DNA binding (Fig. 2a). In its free form, secondary structure elements of MC1 consist of an α-helix (Arg25-Ala32) and two β-sheets26. The first sheet is composed of two antiparallel β-strands, β1 (Arg4-Arg9) and β2 (Glu15-Gly21), while the second contains three antiparallel β-strands β3 (Asp43-Arg48), β4 (Val55-Val65), and β5 (Ile79-Glu90). Each β-sheet contains one antiparallel β-bulge composed of Leu8, His16, Gly17 and Val57, Glu87, Arg88, respectively (Fig. 2b). Upon DNA binding, the helix of MC1 appears longer and is, indeed, composed of an α-helix (Pro24-Ala31) followed by a small 310 helix (Ala32-Arg34). The helix rotates by 10° around Ala32, the β1-strand by 10° around Val7 and the β2-strand by 30° around Val18, leading the side chain of Gln23, Arg25 and Gln26 to be optimally positioned for an interaction with DNA (Fig. 2c). Due to the displacement of both β1 and β2 strands, a unique five β-stranded antiparallel β-sheet is formed, and the first β-bulge (Leu8, His16, and Gly17) disappears. The second β-bulge (Val57, Glu87 and Arg88) also disappears, due to interactions of Lys86, Arg88 and Ile89 (β5-strand) in the DNA minor groove. Furthermore, MC1 adopts an extended and less mobile conformation of the arm (Ala67-Glu77) due to its binding to DNA26.

Figure 2
figure 2

MC1 slightly adapts its conformation upon binding to the DNA15bp. (a) 3D solution structures of MC1 in its free (left) and bound (right) forms in cartoon representation. The β-strands and the α-helix are colorized in blue and red, respectively. (b) Primary and secondary structures of MC1 free (top) and bound (bottom): Blue arrows and red cylinder are for β - and α -structures, respectively, and black wickets for bulges. (c) Overlay of the cartoon representations of the free (dark grey) and bound (light grey) conformations of the MC1 protein. In order to highlight the conformational changes of MC1 upon binding, three couples of axes are represented: rotation of the α-helix by 10° around Ala32 in green, of the β1-strand by 10° around Val7 in magenta and of the β2-strand by 30° around Val18 in cyan. Close-up pictures of the side chains of Gln23, Arg25 and Gln26, in yellow (free MC1) and orange (bound MC1), show that their orientation have changed upon binding in order to be optimally positioned for an interaction with DNA. Alignment of the two MC1 structures, in their free and bound forms, give a rmsd of 4 Å (all residues) and a rmsd of 2.3 Å (without the arm).

MC1-DNA15bp interactions

MC1 binds the DNA through two main areas of contact into the minor groove of the DNA (Fig. 3a). The first one, in the A-tract, is dominated by the formation of hydrogen bonds between the side chains of Lys54, His56, Lys86 and Lys91, and DNA phosphate groups (Fig. 3b). Note that the side chain of Arg4 interacts in the major groove and Van der Waals’ interactions occurring between the non-polar side chain of Ile89 and sugar groups of A5 and T30 strengthen these interactions. The second is dominated by hydrophobic interactions through the insertion of Pro72 and the intercalation of Trp74 between the C14pA15 step. To reinforce these interactions, Arg71 and Lys81 make electrostatic contacts with the sugar-phosphate backbone of G18pG19. Between these two areas of contact, the DNA is composed of a more flexible sequence (CpA)3 in which the major groove is compressed. The presence of the positive side chain of Arg25 is essential to neutralize the repulsive negative charges of the phosphates belonging to the closely spaced C6pA7 and T20pG21 steps in the major groove (Fig. 3b). The side chains of Lys22, Gln23 and Gln26 participate in the stabilization of the complex by forming hydrogen bonds with the sugar-phosphate backbone of G21pT22pG23 steps, Gln26 in the minor groove, Gln23 in the major groove and Lys22 either in the minor or the major groove depending on the structures. Residues essential to DNA-binding and -bending were highlighted and confirmed by site-directed mutagenesis29. MC1, as many NAPs, interacts with DNA using a combination of the two main and often interrelated readout mechanisms: recognition of the DNA shape (e.g., narrow A-tract minor groove, wide CpCpC major groove, kinking, bending) and recognition of bases (e.g., hydrogen bonds and hydrophobic interactions into the major and minor grooves)33.

Figure 3
figure 3

(a) Interactions between MC1 and DNA15bp. Green and red arrows represent observed NOEs and H-bonds formed in the complex, respectively. Dashed arrows are for interactions in the major groove and standard arrows for interactions in the minor groove. Kinks are represented by black arrows. (b) Neutralization of the phosphates by side chains of Lys86 and Arg25 is essential to the compression of the minor and major grooves, respectively.

MC1 imposes a severe bend and a torsion on DNA15bp leading to a V-turn DNA structure

The MC1-DNA15bp complex reveals large conformational perturbations in the DNA structure, which exhibits a dramatic bending, unstacked base pairs and anomalous groove widths. The DNA conformation analysis with Curves+34,35 and 3DNA36 programs cannot correctly define the base-pair parameters and the geometry of the grooves because of the very unusual conformation of the DNA, which cannot be easily classified as a B- or an A-DNA conformation. However, the roll angle profile shows two sharp peaks, which reflect distortions of base stacking owing to acute DNA bending by two kinks: the first of (55 ± 2)° between A4pA5/T26pT27 (<tilt> = −13°, <roll> = −45°, <twist> = 50°), and the second of (70 ± 3)° between C10pA11/T20pG21 (<tilt> = 31°, <roll> = −71°, <twist> = 20°). Negative roll results in bending of the oligonucleotide toward the minor groove, which becomes very narrow at the kinks (Fig. 4a). These two bend angles, resulting principally from large tilt angles at the site of kinking, are not coplanar and create a dihedral angle of (106 ± 3)° (Fig. 4b). Between the two kinks, another striking feature is the C8pA9/T22pG23 junction (<tilt> = −28°, <roll> = 17°, <twist> = 50°, A9 in north pucker) which leads a smooth bend towards the major groove. From A7 to A9, the minor groove is particularly wide and the major groove abnormally narrow.

Figure 4
figure 4

(a) Top: Tilt (blue), roll (red) and twist (green) angles profiles in the MC1-DNA15bp structure. Bottom: DNA15bp sequence, colored arrows summarized the base pairs parameters differing from standard values, boxed letters indicate sugars in north pucker conformations and values of the bend angles are given. (b) Superimposition of the 12 deposited structures of the MC1-DNA15bp in cartoon representation. The two kinks are represented as well as the torsional angle.

In agreement with the ability of MC1 to restrain negative supercoils, our structure shows that MC1 introduces both unwinding and negative writhe37. Positive base pair tilt values and/or north pucker of the sugars are characteristic of an A-DNA-like structure. A1pA2/T29pT30, A5pC6/G25pT26, C10pA11/T20pG21 and C13pC14/ G18pG19 exhibit such tilt values and the sugars of A5, C10, A11, and T27 adopt a north pucker (C3′-endo, C2′-exo and C4′-exo conformations). It is known that the B to A transformation leads to a rotation of −3.3° for every transformed base-pair, resulting in an overall untwisting of DNA38. Moreover, the average twist in the MC1-DNA15bp is 35°/bp compared with ~36°/bp or ~32.7°/bp for a relaxed B-form DNA and a relaxed A-form DNA, respectively. Although the average twist is close to that of a relaxed B-form, unwinding is locally observed around A7pC8/G23pT24, C10pA11/T22pG23 and A11pC12/G19pT20 steps with twist angle values lower than 30°. As the two kink angles are not coplanar (vide supra), the orientation of the resulting torsional angle is consistent with a negative supercoiling.

A third kink evaluated around 40° occurs between the two terminal base pairs C14pA15/T16pG17 upon the insertion of Pro72 and the intercalation of Trp74 (Fig. 5a). As A15 and T16 are unpaired, the intercalation of Trp74 implies a great perturbation in the stacking with the previous base pair preventing the correct determination of the associated base pairs parameters (tilt, roll and twist). To further extend our exploration of the DNA conformations upon MC1 binding, we decided to study a complex with a longer DNA.

Figure 5
figure 5

(a) Cartoon representation of the MC1-DNA15bp complex. Side chains of key residues are represented as spheres (blue for hydrophilic and brown for hydrophobic residues, respectively). The side chains of Pro72, Trp74, Lys86 and Ile89 contact the minor groove of the DNA, whereas the side chains of Gln23, Arg25 and Gln26 contact the major groove of the DNA. (b) Sequences of the chimeric DNA23bp and the DNATFAM. The first 15 base pairs of the chimeric DNA23bp are identical to those of the DNA15bp (written in red). Both sequences have three domains, an AT-rich (blue boxed), a CpA-rich (green boxed) and a CG-rich (black boxed) domains. The eight base pairs chosen from DNATFAM to lengthen our DNA15bp into DNA23bp are red boxed. (c) Cartoon representation of the model of the MC1-DNA23bp complex. New contacts in the DNA minor groove are predicted for His16 and Gly17 side-chains (blue spheres).

A third contact observed with a longer DNA: dynamic nature of the V-turn structures

We took inspiration in the U-turn formed by the DNATFAM promoter upon interaction with TFAM, the major mitochondrial NAP comprising a tandem of HMG-box domains39. Indeed, the sequence of DNATFAM has three domains (AT-rich, CpA-rich and CG-rich domains) similar to DNA15bp and a supplementary domain of 8 bp which was chosen to lengthen our DNA15bp to DNA23bp (Fig. 5b). A structural model of the MC1-DNA23bp complex was built starting from the MC1-DNA15bp complex structure and new interactions with His16, and Gly17 were observed in the minor groove (Fig. 5c). This supplementary contact point, due to the bend and the torsion of the DNA, stabilizes another V-turn conformation of the DNA.

We validated the use of the chimeric DNA23bp using EMSA (electrophoretic mobility shift assay) experiments (Fig. 6a). A 2.3-fold increase in affinity to MC1 was observed when the DNA was 8 bp longer (KDapp = (1.2 ± 0.5) nM for MC1-*DNA23bp and KDapp = (3.1 ± 0.6) nM for MC1-*DNA15bp). This strongly suggested that additional protein residues are involved in the interaction with DNA23bp.

Figure 6
figure 6

Binding of MC1 protein to two different DNA probes of 17 bp (*DNA15bp) and 25 bp (*DNA23bp). (a) Top: Electrophoresis gel of EMSA reaction mixtures (10 µl) prepared at 4 °C by mixing DNA (0.05 nM) and increasing MC1 protein concentrations from 0.04 nM to 6.6 nM. Bottom: Binding curves obtained from quantification using ImageQuant software of the EMSA experiments. The fraction of bound DNA is plotted as a function of protein concentration [MC1]. (b) Relaxation dispersion curves, fitted using the NESSY software, for Glu15 and Gly17 of the MC1 protein in complex with the DNA23bp. Data were fitted to a two-site, fast exchange model where pa and pb are the populations of the two state models (pa + pb = 1), kex is the exchange constant and δω is the chemical shift difference between states.

A 15N-HSQC spectrum of the MC1-DNA23bp complex revealed that this longer DNA affects both the local environment and the dynamics of the protein. Chemical shift perturbations (CSPs) between the free protein and the protein bound to DNA23bp were calculated for all of the observable residues. We observed CSPs for all of the residues already highlighted in the MC1-DNA15bp complex26 but most importantly, new CSPs were observed for Val7, Arg9, Glu15, His16 and Gly17. Moreover, some peaks were broader than expected, leading us to hypothesize that the protein is in an intermediate exchange between at least two conformations at the NMR timescale40. In particular, the Gln26, Lys30, Lys54, Trp74, Met75, Lys81, Phe83 and Lys86 residues exhibited such a broadened linewidth that they were undetectable. To probe the intermediate chemical exchange regime for the bound MC1, (1H-15N) CPMG relaxation dispersion experiments were performed. Unfortunately, the resonances of some key residues, especially His16, were overlapped, preventing us from obtaining reliable measurements. However, a slow intermediate chemical exchange was observed for Glu15 and Gly17 with kex of (775 ± 73) s−1 and (814 ± 17) s−1, respectively (Fig. 6b). Both residues were in exchange between two equally populated conformations, one close to the MC1-DNA15bp complex structure (longer DNA but only two contact areas) and one corresponding to a different V-turn in which MC1 is bound to DNA23bp with a supplementary contact point.

Discussion

The 3D structure of the MC1-DNA15bp complex uncovers a very strong bend in the DNA15bp featuring an unprecedented torsional angle of 106°, and a V-turn DNA conformation. It is the first time that atomic data enable to corroborate data from electron micrographs showing the sharp bend angle of 116° in MC1-DNA176bp complexes19.

The combination of results from our previous26 and present studies agree with a three-step mechanism of complexation. Firstly, the dynamic nature of the free DNA15bp structure enables MC1 to transiently select narrow minor groove segments (A-tract), which can be further stabilized through Coulombic interactions with the electropositive binding surface of the protein (Arg4, Lys54, His56, Lys86, Lys91). The widths of the DNA minor groove varies with sequence and can be a major determinant of DNA shape recognition by proteins41. For example, the NAP Fis protein selects targets primarily through indirect recognition mechanisms involving the shape of the minor groove (intrinsically narrow minor groove) and sequence-dependent induced fits over adjacent major groove interfaces. Fis-DNA X-ray structures have revealed that narrow minor grooves containing A/T-rich sequences are compressed to a width that is about half of that observed for canonical B-form DNA42,43 and that the neutralization of the proximal phosphates is due to the side chain of Lys90. In MC1, the minor groove compression is particularly important between C6 and T29 (just after the A-tract) and is facilitated by the neutralization of their phosphates by the side chain of Lys86 (Fig. 3b). Secondly, the kinked DNA15bp V-turn conformation is stabilized by direct hydrophobic contacts via the intercalation or insertion of hydrophobic side chains (Pro72 and Trp74) into the minor groove. The resulted dramatic bend is made possible through the neutralization of the repulsive negative charges of the phosphates belonging to the narrow major groove by the Arg25 side chain with the help of Gln23 and Gln26 side chains (Fig. 3b). Thirdly, MC1 is able to stabilize a V-turn on a longer DNA because of a supplementary contact in the minor groove. The affinity of MC1 for this supplementary area is probably weak (non-specific interactions) resulting in an equilibrium between two different V-turn conformations of the MC1-DNA23bp complex.

A few other DNA binding proteins are able to impose such a drastic DNA conformational change: two eukaryotic mitochondrial proteins (human TFAM39,44 and its yeast counterpart Abf2p45) and the prokaryotic nucleoid IHF46,47 and HU48 family proteins (Table 2, Fig. 7). At the atomic level, MC1, IHF, HU and TFAM use similar mechanisms to strongly bend DNA: binding in the minor groove, neutralization of the negative charges of the DNA backbone and intercalation of hydrophobic residues. However, the HMG boxes of TFAM interact into the convex face of the DNA curvature, whereas IHF/HU/MC1 interact with the concave side. IHF and TFAM create two almost coplanar kinks on DNA upon binding, leading to the DNA being bent by 163° and 180°, respectively, thus reversing the direction of the DNA helix forming a U-turn (Table 2). On the contrary, HU and MC1 induce two non-coplanar kinks, resulting in a medium (40–73°) or strong (106°) dihedral angle on the DNA. Torsion has a major impact on the overall bend angle. In the case of MC1 and HU, the bend angle values are then consistent with a V-turn DNA conformation.

Table 2 Comparison of the kinks angles, overall bend angles and dihedral angles for DNAs in complex with the MC1 (this work), HU48, IHF46,47 and TFAM39,44 proteins.
Figure 7
figure 7

Cartoon representation of (a) MC1-DNA15bp (PDB code 2NBJ), (b) model of MC1-DNA23bp, (c) HU-DNA (PDB code 1P71), (d) IHF-DNA (PDB code IHF) and, (e) TFAM-DNA (PDB code 3TMM) complexes. Red lines show the curvature of the DNA axis.

Our 3D structure helps us to understand how a small monomeric protein, such as MC1 with a completely asymmetric conformation, can sharply bend the DNA and impose a strong torsional angle. As previously known, MC1 is a bender but might also have a secondary binding mode as a wrapper protein. Indeed, some NAPs, as HU (bender and wrapper) or as Fis (bender and bridger) exhibit dual architectural properties that are likely dependent on protein concentration or DNA binding sequence1. In the IHF/HU complexes, under-twisting of the DNA near the kinks is partially compensated by over-twisting of the DNA between the kinks, whereas, in the MC1-DNA15bp, there is an overtwisting at the first kink and near the second kink, which is compensated by undertwisting between the kinks and after the second kink. This new mode of torsion is a key point to allow a small monomeric protein to impose a V-turn conformation on the DNA. The flexible and dynamic nature of the V-turn conformation can be modulated through the intrinsic dynamics of the protein (adaptability of its core through small local rearrangement and of its arm through a more or less extended conformation) and/or additional interactions with DNAs, such as those observed with the MC1-DNA23bp complex. Indeed, MC1, contrary to IHF and like HU, binds DNA without sequence specificity and selects structural features of DNA for binding17,49.

More than just a histone-like protein involved in the maintenance of negative supercoiling and chromosomal compaction, the flexibility and dynamics of the MC1-DNA complexes observed in solution could also reveal anchor points (i.e. molecular targets) for other molecular partners involved in various DNA transactions. The formation of such higher-order protein-DNA architectures requires the conformation of the DNA template to be bent or distorted. Bending brings distant sites of the DNA into proximity, which is necessary for the site-directed recombination process, whereas negative supercoiling favors unwinding and is likely to facilitate processes in which proteins need access to the DNA bases, such as repair, replication and transcription molecular machineries.

The binding mode of MC1 results in the formation of a particularly widened minor groove in the A7pC8pA9/T22pG23pT24 region, permitting ready access to the DNA by other DNA-binding proteins. For example, architectural DNA-binding proteins can modulate the DNA glycosylase activity. DNA glycosylases, which initiate the base excision repair (BER) process, recognize and extrude the damaged base, stabilizing it into an extrahelical conformation inside the active site pocket. To achieve this base-flipping mechanism, DNA at the lesion site is bent by inserting an intercalating residue triad inside the minor groove. The damaged nucleotide is then expelled from the DNA double helix by the major groove. We previously showed that the E. coli HU could stimulate the turnover of the Formamidopyrimidin-DNA glycosylase (Fpg)50. Like HU, MC1 could play a role in DNA repair in vivo in Archaea.

The knowledge of which proteins are involved in modulating chromatin structure in archaeal organisms is incomplete, and views on the interplay between chromatin proteins, such as MC1, and transcription are still unknown. The archaeal transcription machinery is closely related to the RNA polymerase II system in terms of subunit composition, structure, use of general factors and molecular mechanisms8. There are two putative transcriptional regulatory mechanisms involving chromatin proteins in Archaea. The first involves histone proteins, which compete for DNA-binding to the promoter. Their steric and torsional effects limit the binding of basal transcription factors to the DNA8,51. In vitro, the E. coli RNA polymerase activity is stimulated by the addition of MC1 at low protein to DNA ratios, but is inhibited at higher ratios22. Such a bell-shape effect was also observed in the experiments showing the stimulation of Fpg by HU50. We propose that MC1 uses the same transcriptional regulatory mechanism of competition as histones. The second mechanism involves the post-translational modification of NAPs, especially acetylation of the Alba protein6, which modifies their DNA-binding properties. MC1-α from Methanosarcina mazei strain Gö1 was described to be specifically methylated (Lys37) by the methyltransferase Gö1-SET, in vitro52. In MC1, this Lys37 is replaced by a serine, located into the loop between the helix and the β3-strand, a region which has not been implicated in the interaction surface of the MC1-DNA15bp complex. Among the MC1 family proteins, this loop presents a great variability in the number and nature of amino acids (see alignment in ref.29). It often contains at least one lysine, which could be methylated. Since methylation does not alter the DNA-binding interface of MC1-α, the role of this modification needs to be clarified. Cren7 and Sul7 are also both known to be methylated at several lysine residues without any known impact in vitro53. Such post-translational modifications of MC1-α, Sul7, Cren7, and thus MC1, can alter their affinity for other molecular partners.

Finally, the V-shape structure of the MC1-DNA complexes provides a new perspective for the high affinity of MC1 to four-way junctions27. Indeed, cruciform structures are fundamentally important for a wide range of biological processes, including replication and recombination. MC1 has been shown to be involved in cellular division in promoting replication and high rates of growth in the psychrophilic methanogen Methanococcoides burtonii after heat-shock at 23 °C23,24. In Halobacterium salinarum, a growth-phase-dependent effect on the relative expression of MC1 and histone proteins was shown54. The Holliday junction-resolvase Hjc, conserved in archaea, specifically recognizes four-way DNA junctions, cleaving them without sequence preference to generate recombinant DNA duplex. Hjc, which is retro-inhibited by its Hollyday-junction cleavage product (no turnover in vitro) is released by the addition of the architectural double-stranded DNA-binding protein Sso7d55. MC1 could regulate the Hjc enzymatic activity in methanosarcinal, as Sso7d does in Sulfolobus55.

In summary, we have determined the 3D NMR structure of a MC1-DNA15bp complex showing the remarkable ability of a small and monomeric protein to induce DNA bending and DNA torsional constraints. Our structural work highlights a new and very efficient way, used by MC1, to produce a DNA V-turn. Indubitably, MC1 largely contributes to the organization of the genome of methanosarcinal and haloarchaeal classes of Euryarchaea. The structural determination of MC1-DNA complexes opens up new opportunities for studying and understanding the different emerging roles of MC1 to modulate DNA accessibility to transcription and replication machineries.

Materials and Methods

NMR sample preparation

The MC1-DNA15bp complex samples (20 kDa) were prepared in 10 mM phosphate buffer pH 6, 100 mM NaCl, 1 mM EDTA, 10% D2O and were previously described: 15N MC1-DNA15bp complex29 (1 mM) and 15N MC1-13C15N DNA15bp complex (1.18 mM)56. The first strand (5′- A1AAAACACACACCCA15 −3′) is uniformly 15N and 13C-labelled and the second strand (5′- T16GGGTGTGTGTTTTT30 −3′) is in natural abundance. To measure residual dipolar couplings, the filamentous bacteriophage Pf1 (50 mg/ml, Asla Biotech. Ltd., Latvia) was added in the 15N MC1-13C15N DNA15bp complex sample as the orienting medium. The corresponding final concentration was 15N MC1-13C15N DNA15bp complex (0.85 mM) with Pf1 bacteriophage (10 mg/ml).

The 15N MC1-DNA23bp complex sample (25 kDa) was prepared in the same way to obtain a final concentration of 0.67 mM.

NMR experiments and structure calculations of the MC1-DNA15bp complex

All NMR experiments were performed at 298 K on a 700 MHz Bruker Avance III HD spectrometer equipped with a cryoprobe. All data were processed using Topspin Bruker and assignments were performed with the CcpNmr suite57. Interproton distances were derived from NOESY data sets run at 120 ms mixing time. 1DNH RDCs were measured by using 2D IPAP 1H-15N and 1H-13C HSQC experiments acquired on a 600 MHz Varian UNITYINOVA spectrometer for both the isotropic and anisotropic conditions32.

Structures were calculated with NOE distance, hydrogen-bond, and RDC constraints using the HADDOCK web server based on restrained MD/SA (XPLOR/CNS)58. The H-bond restraints used for the calculation of the 3D structures (Table 1) were deduced from the cross peaks observed in the NOESY spectrum. For MC1, they were set in accordance with the observation of typical long or medium NOE cross peaks network for β-sheets and α-helices respectively – HN/HN, HN/Hα, Hα/Hα. For DNA, they were set in accordance with the observation of NOE cross peaks between the H1 imino protons of the guanines and the H4 amino protons of the cytosines and, between the H3 imino protons of the thymines and the H2 aromatic protons of the adenines. Our precedent model, obtained by data-driven docking29, allowed us to calculate the expected protein-DNA contacts which helped us to unambiguously identify 50 intermolecular distance restraints derived from NOEs. RDC restraints were incorporated at the last iteration with the proper parameters for MC1 (Da = 18.94 and R = 0.58) and DNA (Da = −13.8 and R = 0.36). RDC restraints were fitted and analyzed with the MODULE program59.

Docking was started from the whole ensemble of the 15 lowest-energy MC1 free structures (2KHL.pdb) and a B-standard DNA structure obtained with 3D-DART60. The first step of the protocol58 described for a protein-DNA complex which consisted of a rigid-body docking (1000 models) followed by a refinement stage (200 models). Both protein and DNA were successively defined to be fully flexible on all their length (first run), then fully flexible for residues 63–80 (MC1) and 12–15/16–18 (DNA15bp) and finally automatically semi-flexible throughout the different runs, 40 in total. The final step of the structure refinement was performed in explicit water (200 models). The twelve structures with the lowest interaction energies and lowest constraints violations were selected for further analysis. Protein analysis was performed with Procheck. The figures were prepared with PyMOL61 and Molmol62.

Relaxation experiments

1H-15N CPMG experiments were recorded with νCPMG = 0, 50, 100, 150, 200, 250, 300, 350, 400, 500, 600, 700, 800 and 900 Hz, and a CPMG constant time delay, Tcp of 80 ms and 40 ms for MC1 and MC1-DNA23bp, respectively63. Peak intensities were plotted as effective relaxation rates and relaxation dispersion curves were fitted using the NESSY software64.

Model construction of the MC1-DNA23bp complex

DNA23bp was constructed with 3D-Dart starting from bound DNA15bp structure and a canonical B-DNA of 8 bp was added to the terminal A15T16 base pair. By substituting DNA23bp in the MC1-DNA15bp complex structure and minimization, we obtained a model of the MC1-DNA23bp complex.

Electrophoretic mobility-shift assays

Two 32P-labeled DNA duplexes *DNA23bp and *DNA15bp, containing the NMR used sequences with an additional base pair at each extremity, were respectively prepared by annealing a 25-mer (5′-CAAAAACACACACCCAAACTAACAG) and a 17-mer (5′-CAAAAACACACACCCAG) oligonucleotide to their complementary strand, as previously described20. EMSA reaction mixtures (10 µl) were prepared at 4 °C by mixing DNA duplexes (0.05 nM) and increasing MC1 protein concentrations from 0.04 nM to 6.6 nM, in binding buffer (10 mM Tris–HCl, 400 mM NaCl, 200 µg.ml−1 BSA, and 10% (v/v) glycerol, pH 7.5), followed by incubation for 30 min at 4 °C. The different mixtures were then loaded onto a 12% polyacrylamide gel (29:1 acrylamide:bisacrylamide) in 0.5 TBE buffer (44.5 mM Tris-HCl, pH 8.3, 44.5 mM boric acid, 0.5 mM EDTA). Electrophoresis was run at 14 V/cm, for 2 hours at 10 °C. After drying, gels were scanned with a β-scanner (Typhoon Trio) and quantified using ImagQuant software. The binding curves were fitted to a single binding site model using the equation Y = [MC1]/([MC1] + KDapp) where Y is the fraction of bound DNA, [MC1] is the concentration of free protein, and KDapp is the apparent dissociation constant. Under our experimental conditions, the free protein concentration is reasonably approximated by the total protein concentration. The obtained values correspond to the mean (±SD) of three independent experiments.

Assession numbers

NMR restraints and atomic coordinates for the MC1-DNA15bp complex have been deposited with the RCSB PDB under the accession code PDB ID: 2NBJ.