A multi-enzyme machine polymerizes the Haemophilus influenzae type b capsule

Cifuente, Javier O.; Schulze, Julia; Bethe, Andrea; Di Domenico, Valerio; Litschko, Christa; Budde, Insa; Eidenberger, Lukas; Thiesler, Hauke; Ramón Roth, Isabel; Berger, Monika; Claus, Heike; D’Angelo, Cecilia; Marina, Alberto; Gerardy-Schahn, Rita; Schubert, Mario; Guerin, Marcelo E.; Fiebig, Timm

doi:10.1038/s41589-023-01324-3

Download PDF

Article
Open access
Published: 05 June 2023

A multi-enzyme machine polymerizes the Haemophilus influenzae type b capsule

Nature Chemical Biology volume 19, pages 865–877 (2023)Cite this article

6751 Accesses
7 Citations
48 Altmetric
Metrics details

Subjects

Abstract

Bacterial capsules have critical roles in host-pathogen interactions. They provide a protective envelope against host recognition, leading to immune evasion and bacterial survival. Here we define the capsule biosynthesis pathway of Haemophilus influenzae serotype b (Hib), a Gram-negative bacterium that causes severe infections in infants and children. Reconstitution of this pathway enabled the fermentation-free production of Hib vaccine antigens starting from widely available precursors and detailed characterization of the enzymatic machinery. The X-ray crystal structure of the capsule polymerase Bcs3 reveals a multi-enzyme machine adopting a basket-like shape that creates a protected environment for the synthesis of the complex Hib polymer. This architecture is commonly exploited for surface glycan synthesis by both Gram-negative and Gram-positive pathogens. Supported by biochemical studies and comprehensive 2D nuclear magnetic resonance, our data explain how the ribofuranosyltransferase CriT, the phosphatase CrpP, the ribitol-phosphate transferase CroT and a polymer-binding domain function as a unique multi-enzyme assembly.

Transition transferases prime bacterial capsule polymerization

Article Open access 01 July 2024

A bifunctional O-antigen polymerase structure reveals a new glycosyltransferase family

Article 09 March 2020

Chemical structure and genetic organization of the E. coli O6:K15 capsular polysaccharide

Article Open access 28 July 2020

Main

Haemophilus influenzae is a Gram-negative bacterium colonizing the human nasopharynx. It commonly causes upper and lower respiratory infections and occasionally more serious diseases like meningitis and septicemia with high incidence rates, especially in infants¹. Based on the chemical properties of the linear, charged capsule polymer surrounding the pathogen, H. influenzae can be grouped into six serotypes a–f, of which serotype b (Hib) is clinically most important. The capsule polymer of Hib, consisting of a ribofuranose- (Ribf) and ribitol-5-phosphate-containing repeating unit² ([→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO₃→]_n) (Fig. 1a), is one of the pathogen’s main virulence factors and enables it to evade the host’s immune system^1,3.

**Fig. 1: Bcs3 is the capsule polymerase of Hib.**

Capsule polymers can be used as an antigen in vaccine formulations⁴. In the early 1990s, the Hib conjugate vaccine was the first glycoconjugate vaccine ever licensed and was immensely successful in reducing the disease burden⁵. However, the global distribution of Hib conjugate vaccines is still a challenge, partly because the high production costs associated with purifying the capsule polymer from pathogen cultures limit the introduction of Hib vaccination worldwide, including in developing countries⁶. Promising alternative strategies are the use of either chemical and/or enzymatic synthesis for the provision of capsule polymer⁷. Notably, a chemically synthesized, cost-effective Hib vaccine was introduced in the Cuban national immunization schedule in 2004 (ref. ⁵). However, an enzyme-based synthesis approach has not yet been realized, as the enzymatic machinery responsible for capsule biosynthesis in Hib remains elusive. Its characterization would open up exciting opportunities for the fermentation-free production of a Hib vaccine from ultrapure components, both in the context of recombinant proteins and substrates.

On the DNA level, capsule expression in Hib is organized in a conserved region named capsule gene cluster, which can be subdivided into regions 1–3 (refs. ^8,9; Fig. 1b and Extended Data Fig. 1). Regions 1 and 3 are conserved between species and encode the export machinery belonging to an ABC-transporter-dependent capsule assembly system, historically referred to as group 2 biosynthesis system³. Region 2 is serotype-specific and encodes the capsule biosynthesis enzymes. In Hib, region 2 contains four open reading frames called bcs1-bcs4 (ref. ⁸; Fig. 1b and Extended Data Fig. 1a), of which only the product of bcs1 (Bcs1) was biochemically characterized¹⁰ and shown to encode a CDP-ribitol synthase, which putatively provides the nucleotide-activated precursor for the ribitol-5-phosphate moiety in the Hib polymer (Fig. 1c). So far, no information is available about the function of the gene products Bcs2–4.

At least the following two enzymatic functions are needed to assemble the Hib polymer ([→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO₃→]_n): a ribofuranosyltransferase and a ribitol-phosphate transferase. Despite the abundance of ribose in bacterial surface glycans¹¹, the enzymatic machinery for ribose incorporation into capsular polysaccharides has not been identified yet. Ribitol-phosphate transferases have been described biochemically in the context of Gram-positive wall teichoic acids (WTAs)¹². However, evidence of these enzymes in the Gram-negative background is lacking, and structural information is missing entirely. In this context, the capsule biosynthesis machinery of Hib might connect biosynthesis modules used in both Gram-negative and Gram-positive pathogens, and the structural characterization of the enzyme would greatly improve the understanding of surface glycan biosynthesis in both systems, thereby advancing the search for the discovery of new antimicrobials.

In this study, we defined the entire pathway for the fermentation-free synthesis of the Hib capsule polymer starting from purified enzymes and inexpensive, widely available substrates. We identified Bcs3 as the capsule polymerase of Hib and solved its crystal structure in complex with cytidine monophosphate (CMP), phosphate and Hib polymer. The enzymes of Bcs3 form a machine for heteropolymer synthesis of as yet undescribed complexity. We unveil the molecular basis of substrate specificity in each one of the three enzymes and suggest a mechanism by which they act in concert to sequentially assemble the Hib capsule.

Results

Bcs3 is the Hib capsule polymerase

From the noncharacterized genes of the Hib capsule gene cluster (bcs2–4), only bcs3 codes for an enzyme large enough (1,215 amino acids) to encompass at least two catalytic activities (Fig. 1b). Homology modeling performed with Bcs3 using Phyre2 (ref. ¹³) predicted (Fig. 1b) (1) an N-terminal TagF-like polymerase^14,15 (Supplementary Fig. 1), (2) a central phosphatase¹⁶ and (3) a large C-terminal region of ca. 600 amino acids for which no significant homologs could be provided by Phyre2. We hypothesized that this region might harbor the ribosyltransferase domain. Bcs3 was cloned and purified, and CDP-ribitol was considered a suitable donor substrate. Because CDP-ribitol is not commercially available and its precursor ribulose-5-phosphate is comparatively expensive, the enzymes Bcs1 (ref. ¹⁰) and RpiA¹⁷ from Hib were cloned to establish a cascade synthesis for CDP-ribitol starting from d-ribose-5-phosphate, NADPH and CTP (Fig. 1c,d and Supplementary Fig. 2). As expected, the presence of Bcs1 in a reaction containing ribulose-5-phosphate led to the consumption of NADPH, while NADP⁺ and a peak eluting similarly to CDP-glycerol were produced. The same result can be observed if ribulose-5-phosphate is produced in situ from ribose-5-phosphate by RpiA (Fig. 1d, compare reactions 1 and 2), indicating that CDP-ribitol can be generated by RpiA and Bcs1 in a one-pot reaction.

PRPP is the ribose donor for the synthesis of Hib polymer

ADP-ribose, α-d-ribose-1-phosphate (Rib-1P) and 5-phospho-α-d-ribose-1-diphosphate (PRPP; Fig. 1e) have been discussed as putative ribose donors for Hib polymer synthesis⁹ and were thus tested as substrate candidates for the putative ribofuranosyltransferase activity of Bcs3. Because CDP-ribitol degraded during attempts of isolation (Supplementary Fig. 3), it was generated in situ by Bcs1 and RpiA. Interestingly, CDP-ribitol was only used up in the presence of PRPP, with concomitant production of CMP and large polymeric products detectable by Alcian blue-stained PA gel¹⁸ (Fig. 1d,f, reaction 6). In contrast, ADP-ribose and ribose-1-phosphate did not stimulate the consumption of CDP-ribitol nor the production of CMP or polymer (Fig. 1d,f, reactions 4 and 5). Thus, we concluded that PRPP is the donor substrate for Bcs3. To confirm the identity of the produced polymer, the enzyme cascade of RpiA, Bcs1 and Bcs3 was scaled up. Polymer was purified and analyzed by immunoblotting (Fig. 1g) and nuclear magnetic resonance (NMR; Extended Data Fig. 2). ¹H and ¹³C chemical shifts agreed perfectly with spectra previously reported for natural Hib polymer (Supplementary Tables 1 and 2). Notably, 1D ³¹P NMR and ¹H-³¹P HMBC spectra showed only a single phosphodiester signal and no indication of the presence of phosphomonoesters in the polymer, suggesting that the predicted phosphatase (Fig. 1b) might be required to remove the 5-phosphate present in PRPP. Of note, the ribofuranose has α-configuration in PRPP and β-configuration in the polymer (Fig. 1a,e), indicating that Bcs3 uses an inverting mechanism.

The overall architecture of Bcs3

Crystallographic studies were performed using a C-terminally truncated version of the enzyme (Supplementary Fig. 4). The crystal structure of Bcs3 (residues 1–1,115, called Bcs3-CMP hereafter) in complex with CMP was solved by molecular replacement methods using an Alphafold¹⁹ model (see Methods for details) at a maximum resolution of 2.9 Å (Fig. 1h, Supplementary Fig. 5 and Supplementary Table 3). Bcs3-CMP crystallized in the P 2₁2₁2₁ space group, with two molecules in the asymmetric unit and residues 1–1,115 visible in the electron density map (Extended Data Figs. 3 and 4 and Supplementary Fig. 6). The following five different regions were identified from the N- to the C-terminus (Fig. 1h and Supplementary Fig. 5): (1) a TagF-like capsule polymerase^14,15 called CroT (Capsule ribitol-5-phosphate Transferase (residues 1–373), (2) a phosphatase called CrpP (Capsule ribose phosPhatase (residues 374–591), (3) the ribose-5-phosphate transferase CriT (Capsule ribose-5-phosphate Transferase (residues 592–1,036) and (4) an SH3b²⁰ domain (residues 1,037–1,115). Region 5, the amino acids C-terminally attached to the SH3b domain, was not visible in the structure (residues 1,116–1,215; Supplementary Fig. 7). However, a bundle of amphipathic α-helices (CT-HB) compatible with membrane association and/or protein–protein interactions could be predicted for this region using Alphafold¹⁹. Interestingly, these helices are not present in Bcs3 homologs identified via BLAST²¹ in Gram-positive bacteria (Supplementary Data 1), suggesting that they might be specific for the Gram-negative capsule biosynthesis systems³ shown in Extended Data Fig. 1.

The Bcs3 protomers build into a physiological and functional head-to-tail dimer (Fig. 1h and Supplementary Fig. 4f,5), which is assembled primarily by interactions between CriT from one protomer and CroT’/CrpP’ from the neighbor protomer, with an overall basket-like shape (Extended Data Fig. 5). The active sites of the three enzymes in each protomer, as well as each of the SH3b domains and the predicted amphipathic α-helices, are located at the concave surface of the dimer (Supplementary Fig. 5). The concave surface of Bcs3 contains several hydrophobic patches interspersed with clusters of positively charged residues, which could indicate potential binding sites for Ribf/ribitol and for phosphate of the repeating unit, respectively. In contrast, the opposite side of Bcs3 displays a negatively charged surface, which would generate a substantial electrostatic repulsion with the bacterial plasma membrane (Supplementary Fig. 5d).

Structural basis of CriT substrate specificity and catalysis

CriT comprises the following four domains from the N- to the C-terminus (Fig. 2a–c): (1) a first α-helical bundle (HB-1; residues 599–654 and 904–959), (2) a catalytic domain (CriT-CAT; residues 655–715 and 796–903), (3) a second α-helical bundle (HB-2; residues 716–795) and (4) a third α-helical bundle (HB-3; residues 960–1,036). The CriT-CAT domain adopts an α/β core structure composed of five parallel β-strands arranged in a twisted β-sheet, surrounded by six α-helices (Fig. 2a). A similar structural arrangement can be observed in phosphoribosyltransferases (PRTases) involved in the formation of N-glycosidic bonds during nucleotide synthesis²². PRTases contain a deep pocket for PRPP binding²³. In Bcs3-CMP, a corresponding pocket contains two phosphate groups (PO4₁ and PO4₂; Extended Data Fig. 4d and Supplementary Fig. 8), is located at one end of the β-sheet, and is surrounded by α-helices α37, α38, α45 and α46, and the loops shown in Fig. 2. Structural comparison of the CriT-CAT domain with PRTases revealed that the 5-phosphate of PRPP superimposes well with PO4₂, whereas the β-phosphate of PRPP superimposes well with PO4₁ (Supplementary Fig. 9). Thus, molecular docking guided by homology (Methods) was used to place PRPP in an energetically favored configuration into the deep pocket of the CriT-CAT (Fig. 2d–g).

PRTases use the side chains of two conserved vicinal aspartates (DD motif) to accommodate the ribose ring of PRPP in the corresponding active sites²⁴. In addition, the ribose C2 and C3 hydroxyl groups and the pyrophosphate moiety interact with Mg²⁺. The functional conservation of the DD motif in CriT-CAT is achieved by equivalent positioning of the side chains of D680 and E818, which are conserved in all homologs identified in this study (Fig. 3a, Supplementary Data 1 and Supplementary Fig. 9). The replacement of D680 and E818 by alanine led to a clear reduction of the Bcs3 enzymatic activity (Fig. 3b,c, constructs 5–7). The location of these residues in different structural elements, namely loop 3 and loop 1 (Fig. 2d), provides a more open topology of the CriT-CAT active site when compared to the PRTase homologs (Supplementary Fig. 9) and enables the Hib polymer to access the binding site. As a consequence, C1 from PRPP faces a putative entrance for the nonreducing end ribitol of the nascent polymer (Fig. 2e–g). Alongside this entrance, we hypothesize that E874 is ideally positioned to deprotonate 1-OH of ribitol for a nucleophilic attack (Supplementary Fig. 10). Based on homology with PRTases, we propose that CriT follows the mechanism proposed in Supplementary Fig. 11a.

**Fig. 3: Biochemical characterization of Bcs3.**

Structural basis of CrpP substrate specificity and catalysis

The phosphatase CrpP adopts a general topology of the haloacid dehalogenase (HAD)-like enzymes, a large superfamily of phosphohydrolases found in organisms from all three superkingdoms of life^16,25. CrpP comprises the following two domains from the N- to the C-terminus: a catalytic Rossmann-fold domain (CrpP-CAT; residues 374–392 and 491–591, gray) and a mostly α-helical mobile cap domain (CrpP-CAP; residues 393–490, yellow; Fig. 4a–c). The CrpP-CAT comprises a five-stranded parallel β-sheet flanked by six α-helices on both sides. CrpP-CAT displays the following two key structural signature motifs that are shared by HAD-like enzymes¹⁶: (1) a single helical turn (residues 387–390 in CrpP, tan and loop 1) and (2) a ‘flap’ in the form of a β-hairpin motif (β14 and β15 in CrpP) located immediately after the first β-strand of the Rossmann-fold domain (Fig. 4a). These two signatures are predicted to provide the necessary mobility between the ‘open’ and ‘closed’ conformations in HAD-like enzymes¹⁶.

Besides the structural motifs, HAD-like phosphohydrolases are defined by the presence of four sequence motifs¹⁶. Motif I comprises a signature DXD located at the end of the first β-strand, right at the beginning of loop 1. The first aspartate of this motif acts as a nucleophile that forms an aspartyl-intermediate during catalysis and only this first aspartic acid is conserved in CrpP (D386; loop 1 in Fig. 4d, Supplementary Figs. 12 and 13 and Supplementary Data 1). In HAD-like phosphohydrolases, a deep pocket contains the binding site for the terminal phosphomonoester substrate. In CrpP, such a binding site could be located at one end of the β-sheet (β13, β16 and β18) and would be flanked by α-helix α33 and loops 1–4 of CrpP-CAT, as well as by the flap (β14-β15), α23, α25, α26 and loops 5 and 6 of CrpP-CAP (Fig. 4a,b). In agreement with that, a phosphate and a Zn²⁺ found in CrpP’s deep pocket superimpose well with substrates and Mg²⁺ identified in structural homologs (Fig. 4a–c, Extended Data Fig. 4a,b and Supplementary Fig. 12). Guided by the ligands and homology, we performed molecular docking, which placed one Hib repeating unit ending with a 5-phosphorylated, nonreducing end Ribf ([OPO₃→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO₃]) in an energetically favored configuration into the CrpP-CAT deep pocket (Fig. 4d–g; Methods). Notably, an internal 5-phosphorylated Ribf could not be placed into the pocket, supporting an exo-cleavage of the 5-phosphate over an endo-cleavage, and suggesting that CrpP acts after CriT and before CroT in the reaction cycle. Notably, the architecture of the active sites of CrpP and other HAD-like phosphatases support a common catalytic mechanism (Supplementary Fig. 11b).

Structural basis of CroT substrate specificity and catalysis

CroT displays the typical GT-B fold of glycosyltransferases (Supplementary Fig. 14), consisting of an N-terminal (CroT-NT) and a C-terminal (CroT-CT) Rossmann-fold domain, with a deep fissure at the interface that includes the catalytic center²⁶ (Fig. 5a,b). The core of each domain is composed of a six-stranded parallel β-sheet flanked by α-helices on both sides (Fig. 5a). The close inspection of the electron density maps revealed a CMP molecule and glycerol located in one of the CroT protomers (Fig. 5a,b, Extended Data Fig. 3a,b and Supplementary Fig. 15). The CMP binding pocket on the C-terminal side of the interdomain crevice is defined by α-helices α8, α15 and α16, loops 1–3 of the CroT-CT domain, as well as loop 4 of the CroT-NT domain (Fig. 5a–c). A structural comparison of the CroT protomer complexed with CMP and the apo CroT protomer reveals an important interdomain movement that has been proposed for other members of the GT-B superfamily during substrate binding and catalysis (Extended Data Fig. 6 and Supplementary Fig. 16).

CroT transfers ribitol-phosphate from CDP-ribitol, generating a phosphodiester bond with the 3-OH of the nonreducing end ribose of the Hib polymer. The only other bacterial polyol-phosphate transferase whose structure was solved is the glycerol-phosphate polymerase TagF from Staphylococcus epidermidis¹⁴. TagF creates a phosphodiester bridge by transferring a glycerol-3-phosphate moiety from CDP-glycerol to the 1-OH of poly-glycerol-phosphate. It uses a simple displacement mechanism and by homology, we propose a similar mechanism for CroT (Supplementary Fig. 11c). To support this, we performed molecular docking calculations of CDP-ribitol into CroT (Methods). A structural comparison of the CroT-CMP complex with the TagF_H444N-CDP-glycerol complex (PDB code 3L7K¹⁴) revealed that the CMP moieties of both structures superimpose well (Supplementary Fig. 14). Therefore, the ribitol-phosphate moiety was placed into the CroT active site using the observed orientation of the glycerol-3-phosphate moiety in the TagF_H444N-CDP-glycerol complex¹⁴, as well as the glycerol and CMP identified in CroT as a reference. The glycerol moiety of CDP-glycerol accommodates into a small pocket flanked by P447 in TagF, whereas in CroT, the equivalent residue G99 creates extra space, presumably facilitating the ribitol-phosphate binding. In agreement with that, the proline is conserved in homologs of CroT that transfer glycerol-phosphate^15,27, whereas the glycine is conserved in homologs transferring ribitol-phosphate¹² (Supplementary Data 2, highlighted in yellow). The putative base H96, as well as residue H233, are strictly conserved among all homologs identified in this study and substitution to alanine abolished or drastically reduced activity (Fig. 3a–c, constructs 2–3, and Supplementary Data 1 and 2).

Structural basis of Bcs3 Hib capsule polymer recognition

To study the location of Hib capsule polymer in Bcs3, crystals were soaked with oligomer fractions having a narrowed degree of polymerization (DP) (Extended Data Fig. 7, Supplementary Note and Supplementary Table 2). A structure was solved by molecular replacement methods (3.6 Å maximum resolution) showing Bcs3 in complex with two complete repeating units (DP2) of the Hib capsule polymer and an additional nonreducing end phosphate coming from a third repeating unit (Fig. 6). Bcs3-DP2 crystallized in the P 1 2₁ 1 space group, with four molecules in the asymmetric unit (Supplementary Table 3). The overall architecture between Bcs3-CMP and Bcs3-DP2 is essentially preserved. However, we observed more flexibility for the CroT domain in Bcs3-DP2 than for the CroT domain in Bcs3-CMP (Supplementary Figs. 6 and 16). DP2 is clearly visible in the electron density maps and is present in all four Bcs3 molecules bound to an SH3b domain (residues 1,037–1,115), with the nonreducing end facing toward the active centers in the main groove (Fig. 6a,b, Extended Data Fig. 3c,d and Supplementary Fig. 17). SH3b domains are the prokaryotic counterparts of the well-characterized SH3 domain family^20,28. The SH3b domain in Bcs3-DP2 revealed a completely conserved fold—a compact β-barrel, formed by two anti-parallel, three-stranded β-sheets (Fig. 6e and Supplementary Fig. 14c).

**Fig. 6: The structure of Bcs3 in complex with DP2.**

Molecular basis of Bcs3 enzymology

As described above, the constructs Bcs3_H96A/Bcs3_H233A (CroT), Bcs3_D386A (CrpP) and Bcs3_D680A/Bcs3_E818A/Bcs3_D680A/E818A (CriT) were (partially) inactive (Fig. 3a–c, constructs 2–7), confirming the importance of the targeted residues for the catalytic function of the Bcs3 enzymes. Constructs 8–10, in which only one of the enzymes remained intact, could be combined in trans during polymer assembly, corroborating that the introduced substitutions did not alter the function of the remaining Bcs3 domains (Fig. 3b,c, last lane) and indicating that the polymer could be released from each construct despite the fact that processivity is observed for wild-type Bcs3 (Extended Data Fig. 8).

To determine the relevance of the two predicted amphipathic α-helices and the SH3b domain, we performed C-terminal truncations in Bcs3 (Fig. 3b, constructs 11–14). Interestingly, the deletion of the last 53 amino acids (Fig. 3b; Bcs3_2–₁₁₆₂, construct 11) comprising the entire amphipathic α-helical bundle (α56-α59 in Supplementary Data 1) improved the yield of purified protein by fourfold (20 mg l⁻¹ (Bcs3) versus 86 mg l⁻¹ (Bcs3_2–₁₁₆₂)). Because truncations can influence the ability of capsule polymerases to start the reaction de novo²⁹, activity tests were performed in the presence and absence of acceptor. Both Bcs3_2–₁₁₆₂ and Bcs3_2–₁₁₂₂ maintained their enzymatic activity, indicating that the catalytic machinery is not dependent on the two predicted amphipathic α-helices (Fig. 3d). As expected, the observed polymers are slightly shorter in the presence of acceptor, indicating specific elongation and a certain control over the product length, which would be beneficial for the biotechnological use of the enzyme^29,30 (Extended Data Fig. 8). Notably, Bcs3_2–₁₀₉₃ and Bcs3_2–₁₀₆₈, in which the SH3b domain is disrupted, were completely inactive, strongly supporting the mechanistic role of the SH3b domain for Hib capsule polymer recognition.

N-terminal truncations were performed to further advance our understanding of the CriT domain. CriT and CrpP were separated between α35 and α36, which are part of the N-terminal HB-1 of CriT (Fig. 2a). The loop connecting these structural elements is of variable length (Supplementary Data 1) or even missing (Fig. 3a) in CriT homologs identified by BLAST²¹. The resulting constructs Bcs3_2–631, Bcs3_632–₁₁₆₂ and Bcs3_632–₁₂₁₅ were inactive alone but active if combined in trans (Fig. 3b,e, constructs 15–17), demonstrating that the separated enzymes indeed maintain their ability to generate polymer and that HB-1 of CriT can be partly reduced in size. Since Bcs3_632–₁₁₆₂ was purified with improved quality if compared to Bcs3_632–₁₂₁₅ (Supplementary Fig. 2c), Bcs3_632–₁₁₆₂ was N-terminally truncated to narrow down the minimal active CriT domain. The resulting Bcs3_643–₁₁₆₂, Bcs3_653–1162 and Bcs3_664–₁₁₆₂ (Fig. 3b, constructs 18–20) were combined in trans with Bcs3_2–631. The ability to initiate chain elongation de novo was only observed for Bcs3_2–631 + Bcs3_632–1162, with Bcs3_2–631 + Bcs3_643–1162 still being able to elongate priming acceptor oligosaccharides (Fig. 3e, 15 + 17, 15 + 18).

The rationale for separating CroT and CrpP was based on a comparison of Bcs3 with the capsule biosynthesis enzymes of Escherichia coli K18 (Extended Data Fig. 1b). The Hib ([→3)-β-d-Ribf-(1→1)-d-Ribitol-(5→OPO₃→]_n) and E. coli K18 ([→2)-β-d-Ribf-(1→2)-d-Ribitol-(5→OPO₃→]_n) capsules only differ with regard to how their constituents are linked. We identified separate open reading frames for CroT and CrpP/CriT in the K18 capsule genes cluster (Fig. 3b and Extended Data Fig. 1b). An alignment of Bcs3 with CroT-K18 and CrpP-CriT-K18 demonstrates that the region comprising α21 and α22 (amino acids 348–379) of Bcs3 is missing in E. coli K18, indicating that it might not be crucial for activity (Supplementary Data 3). Consequently, we designed Bcs3_2–354, Bcs3_2–379, Bcs3_354–631 and Bcs3_378–631 (Fig. 3b, constructs 21–24) and tested them in all possible combinations together with the CriT domain Bcs3_632–1162. The activity was only visible in the presence of the acceptor and when constructs Bcs3_2–378 + Bcs3_354–631 were combined (Fig. 3f, 17 + 24 + 21), indicating that the truncated region is important for the function of both CroT and CrpP under in vitro conditions.

Discussion

Gram-negative bacteria use ABC-transporter-dependent assembly systems for the expression of polymers in which charged carbohydrate moieties (sialic acid, 3-deoxy-d-manno-oct-2-ulosonic acid (Kdo) and glucuronic acid) or phosphate residues alternate with noncharged carbohydrates (group 2 for E. coli nomenclature)³¹. Polymers exclusively containing sugar units are generated by enzymes, adopting one of the two most abundant GT folds: GT-A or GT-B^3,26,32,33. Polymerases assembling a heteropolymer, in which monosaccharides are bridged by phosphodiester linkages^29,34, belong to a protein family called stealth³⁵. Polyol-phosphate-containing polymers are assembled by TagF-like enzymes^15,18,27,30. Structural data for capsule polymerases embedded in group 2 systems are limited to the chondroitin polymerase K4CP from E. coli K4 (ref. ³²) and the polysialyltransferase from Mannheimia haemolytica serotype A2 (ref. ³³; MhPST). With Bcs3, we present the structure of a multi-enzyme machine that generates a complex heteropolymer and combines three catalytic domains that are new in the context of capsule biosynthesis.

Our structural data support a mechanistic model in which the catalytic cycle that adds one repeating unit to the nascent chain consists of three steps as follows: (1) CriT transfers Ribf-5P from PRPP to the C1’ OH group of the nonreducing end ribitol of the growing chain, (2) CrpP removes the phosphate group from the 5 position of the now terminal Ribf-5P transferred by CriT and (3) CroT catalyzes the transfer of ribitol-phosphate from CDP-ribitol to the 3 position of the ribose ring released by CrpP (Extended Data Figs. 9 and 10). Even though the dimeric structural arrangement of Bcs3 allows the nascent chain to be elongated according to two scenarios (Extended Data Fig. 9), the arrangement of CriT, CrpP, CroT and SH3b supports a model in which two reaction centers inside the concave side of Bcs3 are formed, comprising two triads of enzymes—CriT/CrpP’/CroT’ and CriT’/CrpP/CroT (Extended Data Figs. 6a and 9c). The fact that constructs 15 (CroT/CrpP) and 17 (CriT) can form a polymerase (Fig. 3e) that is even active de novo supports this hypothesis.

A polysaccharide-binding site could be identified through molecular dynamics simulations and a priming acceptor monosaccharide in the crystal structure of the hyaluronan synthase Cv-HAS from Chlorella virus³⁶. However, no structural evidence is available for extended binding sites or binding modules complexed with native polymer for bacterial group 2 capsule polymerases^32,33. The Hib capsule dimer found in Bcs3 is bound to an SH3b domain, which is located 20–40 Å away from the active sites, suggesting a tethering mechanism³⁷ that allows Bcs3 to stay attached to its acceptor while the nonreducing end is elongated (Extended Data Fig. 9). It is tempting to speculate that the SH3b domain allows a sliding motion to guide the polymer to the outside of the concave Bcs3 basket. The concave walls of Bcs3 are decorated with scattered patches of positively charged residues we suspect to participate in the channeling of the capsule polymer during the catalytic cycle, possibly via phosphate interactions. We hypothesize that this allows efficient elongation³⁸ and will likely contribute to the level of processivity³⁹ observed in Extended Data Fig. 8 and discussed in Supplementary Discussion.

In the living cell, most physiological processes are coordinated in networks. Colocalization of enzymes in compartments promotes higher performance of individual enzyme activities and reaction efficiency⁴⁰. The Bcs3 crystal structures reveal a complex architecture compatible with a multi-enzyme macromolecular machine that acts as a concave basket-like compartment to polymerize the Hib capsule polymer. The electrostatic surface potential of Bcs3 suggests that the concave surface—including the three active sites, the SH3b domain and the predicted amphipathic α-helices—is oriented toward the membrane. Interestingly, the exit points of the SH3b domains are approximately 70–80 Å apart, while ABC transporters exporting polymers similar to the Hib capsule have a diameter of approximately 60–70 Å^41,42. Combined with the observation that capsule polymers are synthesized in rafts⁴³, it is tempting to speculate that the Bcs3 architecture supports the bridging of two ABC-transporter complexes, thereby coupling capsule biosynthesis to export (Extended Data Fig. 10).

The identity of the activated precursor for the transfer of Ribf and the machinery for its incorporation into bacterial CPS has long been a matter of debate. We show that Ribf transfer catalyzed by Bcs3 is realized by combining a HAD-like phosphatase (CrpP) and a glycosyltransferase (CriT) of a yet undescribed GT fold. Typically, glycosyltransferases adopt Rossmann GT-A or GT-B folds and use NDP/NMP-sugars or lipid-linked donors as substrates²⁶. In contrast, CriT resembles PRPP-transferases involved in the formation of N-glycosidic linkages²² but uses PRPP as a substrate for the transfer of O-linked ribofuranose-5-phosphate, which is dephosphorylated by CrpP. During the final preparation of this study, a study was published⁴⁴ describing CrpP and CriT homologs, and the crystal structure of a HAD-like phosphatase and a putative ribofuranosyltransferase from Thermobacillus composti. A structural comparison with Bcs3 revealed considerable differences between the two structures, indicating structural variety in PRPP-dependent glycosyltransferases (Supplementary Fig. 18).

The closest homologs to the Hib polymer are complex Gram-positive WTAs, but polymerases generating complex WTA have not been identified yet¹². WTAs make upto 60% of the cell wall of Gram-positive pathogens, extend beyond the peptidoglycan layer and are thus in direct contact with the host immune system^12,45, making them attractive vaccine antigens^7,46, and WTA biosynthesis enzymes promising drug targets⁴⁷. Of the four WTA types (I–IV), only WTA I biosynthesis has been investigated thoroughly¹². We identified the homologs of the entire Bcs3 sequence in Gram-positive pathogens (Supplementary Data 1). Future studies will show if these homologs are part of WTA biosynthesis gene clusters. Notably, BLAST searches performed herein using CriT and CroT as queries could not identify homologs in the human proteome, corroborating that mammalian enzymes^22,48 with similar substrate usage are distinct from CriT and CroT and emphasizing the importance of the Bcs3 structure for the development of inhibitors against bacterial pathogens.

Enzymatic and chemo-enzymatic polymer synthesis is a promising strategy to provide antigens for glycoconjugate vaccine development^4,7. It omits pathogen culture and starts with highly pure precursors^27,49. Herein, the established multi-enzyme cascade is scalable, could be used to produce milligram amounts of polymer and allows a considerable measure of length control (Extended Data Figs. 7 and 8). The developed hydrolysis protocols yielded oligomers with free reducing ends that can be coupled through reductive amination and have the right size (for example, 10–25 repeating units) for vaccine development studies⁷. The single action transferases developed herein might even allow the step-wise build-up of the polymer according to established protocols³⁰. The structural and biochemical characterization of Bcs3, the identification of homologs in both Gram-negative and Gram-positive pathogens and the development of fermentation-free synthesis protocols represent important milestones toward understanding, manipulating and exploiting multi-enzyme machines that will allow new access to structurally complex surface glycans.

Methods

General cloning

A PreScission cleavage site was inserted into the modified p-Mal-c (New England BioLabs) vector pMBP-S3N10-csxA-His₆ (ref. ⁵¹) (tac) resulting in pMBP-PreScission-S3N10-csxA-His₆. The gene cps3B²⁷ was cloned via BamHI/XhoI into plasmid pMBP-PreScission-S3N10-csxA-His₆, replacing the csxA sequence and resulting in pMBP-PreScission-S3N10-cps3B-His₆. The genes bcs1 (OOD27571.1), bcs3 (OOD27573.1; Uniprot: Q2ERG0), rpiA (OOD27170.1), prsA (OOD25971.1) and rk (OOD27132.1) were amplified by PCR using heat-inactivated lysate of Hib strain ATCC 10211 (GenBank accession number MTGI00000000.1) as a template. The resulting PCR products were cloned into pMBP-S3N10-csxA-His₆ via BamHI/XhoI restriction sites or pMBP-PreScission-S3N10-cps3B-His₆ via restriction-free cloning⁵² replacing csxA and cps3B, respectively. Single amino acid mutants and truncations of Bcs3 were introduced according to ref. ⁵³ or by using the Q5 Site-Directed Mutagenesis Kit (New England BioLabs) according to the manufacturer’s guidelines. Constructs and primers used in this study are listed in Supplementary Tables 4 and 5.

Expression and purification of recombinant proteins

For the expression of recombinant proteins, E. coli M15 (pREP4) cells were transformed with expression plasmids and grown in 250–500 ml of PowerBroth medium (ATHENAES) supplemented with 50 μg ml⁻¹ carbenicillin at 37 °C. When the culture reached an optical density at 600 nm (OD₆₀₀) of 1.0, the culture was incubated at 4 °C for 30–45 min, Bcs3 expression was induced by the addition of 0.5 mM isopropyl β-d-thiogalactopyranoside (IPTG) and incubation was continued at 15 °C and 200 rpm overnight. Cells were collected by centrifugation at 6,000g for 10 min and resuspended in 10 ml of purification buffer (50 mM Tris, pH 8.0, 300 mM NaCl), containing EDTA-free protease inhibitors (Complete EDTA-free; Roche). Cells were sonicated (Sonorex sonication; Bandelin) for 10 cycles of 30 s each at 25–30% power and 50% amplitude, interrupted by 30 s of cooling on ice. Cell debris was separated by centrifugation at 27,000g for 30 min. The supernatant was filtered through a 0.8-µm cutoff membrane and protein was purified by affinity chromatography using a HisTrap column (1 ml bed volume; GE Healthcare) or MBPTrap column (1 or 5 ml bed volume; GE Healthcare), previously equilibrated in purification buffer. The enzyme was eluted using a linear gradient from 35 to 500 mM imidazole over 15 column volumes (HisTrap column) or with 10 mM maltose over 5 column volumes (MBPTrap column). Protein-containing fractions were pooled and applied to a HiPrep 26/10 Desalting column (GE Healthcare) equilibrated in purification buffer. The protein was concentrated using an Amicon centrifugal device (10,000–100,000 molecular weight cutoff, depending on protein size; Merck) and aliquots were snap-frozen in liquid nitrogen and stored at −80 °C. Aliquots of 10 µl of each purification step were analyzed by SDS-PAGE as previously described⁵¹. The crystallization construct MBP-Bcs3_2–1162-His₆ was purified using MBPTrap column as described above and shown in Supplementary Fig. 4. Protein-containing fractions were further purified by size exclusion chromatography (SEC) using a Superdex 200 10/300 GL column (GE Healthcare) equilibrated in purification buffer. PreScission digest was performed with 5.8 µg PreScission per milligram of recombinant protein overnight at 4 °C. After centrifugation at 15,000g for 5 min, the supernatant was applied to MBPTrap column to retain the MBP tag. The Bcs3-His₆-containing flowthrough was further purified using a HisTrap column as described above, followed by preparative SEC. The SEC column was equilibrated using a gel filtration marker kit for protein molecular weights of 29,000–700,000 (Sigma) according to the manufacturer’s guidelines. The protein was concentrated using an Amicon centrifugal device (50,000 molecular weight cutoff; Merck) to give 10 mg ml⁻¹ of protein, and aliquots were snap-frozen in liquid nitrogen and stored at −80 °C.

Enzymatic reactions, HPLC-AEC analysis and PAGE

Enzymatic reactions were carried out with 0.5 to 2 µM of the enzyme at 37 °C in a total volume of 20 to 30 µl of assay buffer (50 mM Tris, pH 8.0, 10 mM MgCl₂, 1 mM DTT (dithiothreitol)) containing 5 mM of each required substrate (ADP-ribose (Sigma), ATP (Roche), CTP (Sigma), NADPH (Roche), PRPP (Sigma), d-ribose (Sigma), ribose-1-phosphate (Sigma), ribose-5-phosphate (Sigma), ribulose-5-phosphate (Sigma) and UTP (Roche)). After 2 h of incubation or after overnight incubation, samples were analyzed by high-performance liquid chromatography-anion-exchange chromatography (HPLC-AEC) and high-percentage (25%) polyacrylamide gel electrophoresis (PAGE) as previously described¹⁸ with minor modifications—chromatograms were recorded using 5 µl sample and a linear elution gradient from 0% to 25% of mobile phase 2 (1 M NaCl, 20 mM Tris, pH 8.0) over 42 min. Data were collected using LCsolution version 1.25 SP4 (Shimadzu). Alcian blue-/silver-stained PA gels were scanned on an Amersham Imager 680 and colors were adjusted equally across the entire image to improve the visualization of Alcian blue. If in situ-synthesized ribose-5-phosphate was used as a substrate for further enzymatic reactions, 0.5–2 µM ribokinase was preincubated in a total volume of 30–50 µl assay buffer at 37 °C for 2 h and heat-inactivated at 80 °C for 5 min. The substrate concentration of d-ribose and ATP was 20 mM, resulting in 5 mM of ribose-5-phosphate in subsequent reactions.

Upscaling of polymer synthesis and purification

For the synthesis of 5–50 mg polymer, enzymes were incubated overnight at 37 °C in 3–30 ml of assay buffer (50 mM Tris, pH 8.0, 10 mM MgCl₂, 1 mM DTT (dithiothreitol)) supplemented with 5 mM of each substrate. In vitro-synthesized polymer was purified by AEC using a Mono Q 10/100 GL column (GE Healthcare) as described previously³⁰ with the minor adjustment that polymer-containing fractions were either dialyzed against water or thoroughly washed using an Amicon centrifugal device (30,000 molecular weight cutoff; Merck).

Preparation of oligomers

For the generation of oligomers, hydrolysis conditions for the Hib polymer (2.5 mg ml⁻¹) were tested in (1) 25 mM TFA at 37 °C or (2) 10 mM acetic acid at 70 °C. Samples of 10 µl were collected after 15, 30, 45, 60, 90, 120, 180, 255 and 330 min of reaction time and mixed with 10 µl of 2 M sucrose. The hydrolysis products were analyzed by Alcian blue/silver strained high-percentage (25%) PAGE as previously described¹⁸. Preparative hydrolysis of 50 mg polymer was performed by incubation with 25 mM TFA for 120 min at 37 °C. The reaction mixture was freeze-dried and dissolved in 20 ml buffer (50 mM Tris, pH 7.0, 100 mM NaCl, 10 mM MgCl₂). Dephosphorylation of hydrolysis products was performed by the addition of 200 U alkaline phosphatase (Quick CIP; New England BioLabs) for 75 min at 37 °C, followed by the addition of another 100 U of alkaline phosphatase and incubation for 75 min at 37 °C. Alkaline phosphatase was removed using an Amicon centrifugal device (50,000 molecular weight cutoff; Merck). The oligomer-containing filtrate was diluted to 140 ml and further purified by AEC (Mono Q 10/100 GL; GE Healthcare) using a linear gradient from 0 mM to 350 mM NaCl over 40 column volumes. After analysis by high-percentage PAGE, fractions were divided into 12 pools, dialyzed against water (Spectra/Por 7, 1,000 molecular weight cutoff; Roth) and then freeze-dried (Extended Data Fig. 7).

Dot blot/agglutination assay

Dot blot analysis of purified polymer was performed as previously described⁵¹. Hib polymer was detected using a rabbit anti-Hib agglutinating serum (Remel/Thermo Fisher Scientific; 1:1,000 or 1:2,000 dilution).

NMR analysis

All spectra were measured on a 600-MHz Bruker Avance III HD equipped with a ¹H/¹³C/¹⁵N/³¹P QXI probe at 298 K. Typically, samples were dissolved in 500 μl D₂O (100.0 atom%; Armar Chemicals) and measured in a 5-mm NMR standard tube. ¹H 1D spectra were recorded with eight transients and a recycle delay of 10 s. ³¹P 1D spectra were recorded with a recycle delay of 3 s and 352 transients. Standard ¹H-¹³C HSQC experiments from the Bruker library (hsqcedetgpsisp2.2) were recorded with 32 scans, 2,048 × 230 points and a recycle delay of 1.5 s. ¹H-¹H TOCSY spectra were measured with four scans, 2,048 × 360 points, a recycle delay of 1.5 s and a mixing time of either 80 ms or 12 ms. ¹H-¹³C HMBC spectra (Bruker pulse sequence: hmbclpndqf) were recorded with 32 scans, 4,096 × 512 points and a recycle delay of 1.5 s. ¹H-³¹P HMBC spectra (Bruker pulse sequence: hmbcgpndqf) were recorded with 16 scans, 4,096 × 320 points and a recycle delay of 1.5 s. All spectra were processed using Topspin 3.6 (Bruker) and analyzed using Sparky 3.111 and 3.115 (T. D. Goddard and D. G. Kneller, SPARKY 3, University of California, San Francisco, CA, USA). Proton chemical shifts were calibrated to 2,2-dimethyl-2-silapentane-5-sulfonic acid (DSS) using an external sample of 2 mM sucrose and 0.5 mM DSS in H₂O/D₂O (Bruker). Indirect chemical shift referencing was applied to ¹³C and ³¹P according to IUPAB⁵⁴ using the chemical shift referencing ratios of 0.251449530 and 0.404808636.

Bcs3 crystallization and data collection

The Bcs3-CMP complex was crystallized by mixing 0.25 µl of a protein solution (9.5 mg ml⁻¹ in 50 mM Tris, pH 8.0, 300 mM NaCl, 1 mM DTT) in the presence of 5 mM CMP with 0.25 µl of 0.04 M potassium phosphate monobasic, 16% (wt/vol) PEG 8000 and 20% (vol/vol) glycerol (JCSG + commercial screening, Molecular Dimensions). Crystals were frozen under liquid nitrogen. Complete X-ray diffraction datasets were collected at the beamline i24 using a beam transmission equal to 16.33% (Diamond Light Source Synchrotron). Bcs3-CMP crystallized in the P 2₁2₁2₁ space group with two molecules in the asymmetric unit and diffracted to a maximum resolution of 2.9 Å (Supplementary Table 3). Bcs3-DP2 was crystallized by mixing 0.25 µl of protein solution (9.5 mg ml⁻¹ in 50 mM Tris, pH 8.0, 300 mM NaCl, 1 mM DTT) with 0.25 µl of 0.06 M magnesium chloride hexahydrate, 0.06 M calcium chloride dihydrate, 0.1 M Imidazole, 0.1 M MES monohydrate, pH 6.5, 37.5% (vol/vol) MPB, 37.5% (vol/vol) PEG 1000 and 37.5% (vol/vol) PEG 3350 (Morpheus, Molecular Dimensions). The crystal was plunged frozen in liquid nitrogen. Complete X-ray diffraction datasets were collected at the beamline iO4 using a beam transmission equal to 100% (Diamond Light Source Synchrotron). Bcs3-DP2 crystallized in the P 1 2₁ 1 space group with four molecules in the asymmetric unit and diffracted to a maximum resolution of 3.6 Å (Supplementary Table 3). The dataset was processed in situ using the Automatic Software Pipeline available at Diamond Light Source Synchrotron, integrated and scaled through the xia2 system.

Bcs3 structure determination and refinement

The structure determination of the Bcs3-CMP complex was performed by molecular replacement methods implemented in Phaser⁵⁵ and the PHENIX suite⁵⁶. The initial coordinates of Bcs3 were generated by using a model predicted by AlphaFold as a search model¹⁹. The model rebuilding was carried out with Buccaneer⁵⁷ and the CCP4 suite⁵⁸. The final manual building was performed with Coot⁵⁹ and refinement with phenix.refine⁶⁰ and Refmac5 (ref. ⁶¹). The structure was validated by MolProbity⁶². The Bcs3-DP2 complex was solved by molecular replacement using the Bcs3-CMP complex as a search model. Data collection and refinement statistics are presented in Supplementary Table 3. Molecular graphics and structural analyses were performed with the UCSF Chimera package⁶³.

Structural analysis and sequence alignment

Homologs of Bcs3 were identified by BLAST²¹, aligned using Clustal Omega⁶⁴, MUSCLE⁶⁵ or Cobalt⁶⁶ and annotated using Jalview⁶⁷. Structure-based sequence alignment analysis was performed using Chimera⁶³. Protein pocket volume was calculated using HOLLOW⁶⁸. Z-score values were produced using DALI⁶⁹. Domain interface analysis was performed using PISA⁷⁰. Conserved and similar residues were labeled using the BoxShade server (http://embnet.vital-it.ch/software/BOX_form.html).

Molecular substrate docking

The substrate binding analysis of the active sites has been performed for CroT, CrpP and CriT using a mixed approach based on (1) the experimental location of phosphate groups and/or metals at the active site, (2) placement of the substrate by homology and (3) the minimization of the substrate docked ligand using USCF Chimera. For CroT, the modeling of CDP-ribitol in the active site was achieved firstly by placing CDP-glycerol from the complex TagF mutant N444H (PDB 3L7K)¹⁴ by structural superposition. Then, the docked CDP-glycerol was modified to CDP-ribitol using UCSF Chimera editing tools and following the orientation of CMP and glycerol found in the CroT active site. Finally, docking was performed using a tight search box to minimize the position of CDP-ribitol using Vina. For CrpP, the terminal 5-phospho-d-ribose end of a 5-phospho-d-Ribf-β-(1→1)-d-Ribitol-5-phosphate was placed in the active site using the phosphate found in the structure as a reference. A guided docking was performed following the orientation of substrates in CrpP homologs: (1) the structure of pyridoxal phosphate phosphatase in complex with a non-hydrolyzable pyridoxal 5-phosphate analog and Mg²⁺ (PDB code 5AES) and (2) the structure of phosphoserine phosphatase in complex with phosphoserine (PDB 1L7P). The docking of the moiety was oriented according to the substrates in the homologous structures and minimized in Chimera. Finally, for CriT, the placing of a PRPP-Mg²⁺ complex at the active site was guided by the position of the two phosphates found in the active site, and docked by homology extracting a ligand displaying an overall consensus orientation from the following four PRTase homologs in complex with PRPP-Mg²⁺: hypoxanthine-guanine phosphoribosyltransferase (PDB code 1FSG), adenine phosphoribosyltransferase (PDB code 1ZN7), orotate phosphoribosyltransferase (PDB code 1LH0) and uracil phosphoribosyltransferase (PDB code 1JLS). All dockings were thoroughly inspected and confronted with the protein surfaces to verify the absence of clashes.

Statistics and reproducibility

Alignments were performed once with each algorithm, but separate algorithms, for example, MUSCLE, Cobalt and Clustal omega, yielded highly similar results regarding the critical amino acids targeted in this study. Phyre2 homology modeling was performed in 2015, 2018, 2019, 2020 and 2021 to reevaluate the results and include potential new entries to the PDB. All submissions yielded highly similar results. The detection of Hib polymer (Fig. 2f) using agglutination sera was repeated two times with similar results and the polymer structure was confirmed by NMR. Qualitative analysis of polymer production by different Bcs3 constructs or construct combinations (Fig. 3c–f and Supplementary Fig. 10) was performed at least three times with similar results. Constructs were purified once, each purification was documented by SDS-PAGE and separate Coomassie-stained gels displaying all constructs were included in this study (Supplementary Figs. 2 and 10a). CDP-ribitol was purified at least three times with similar results. Bcs3 activity in the presence of purified CDP-ribitol was analyzed twice by HPLC and a representative sample was chosen to be displayed by PAGE (Supplementary Fig. 3d). The crystallization construct was purified twice with similar results using the protocol shown in Supplementary Fig. 4. Four enzymatic syntheses of Hib polymer (yielding 5 mg, 2 × 30 mg and 50 mg of Hib polymer) were performed with similar results, and representative documentation of these reactions and the purification of the polymer are shown in Extended Data Fig. 2 and 7. The effect of ATP and UTP on the enzyme cascade was analyzed twice by HPLC and Alcian blue staining after PAGE (Extended Data Fig. 7g). Test hydrolysis of Hib polymer (Extended Data Fig. 7j) and purification and visualization of oligomers (Extended Data Fig. 7k) were performed once. To analyze the elongation mechanism of polymerase constructs, at least five reactions were performed at different donor-to-acceptor ratios with highly consistent results (Extended Data Fig. 8).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The atomic coordinates and structure factors have been deposited in the Protein Data Bank, accession codes 8A0C for the Bcs3-CMP complex and 8A0M for the Bcs3-DP2 complex. Data collection and refinement statistics are presented in Supplementary Table 3. PDB IDs used in the analysis of this work include 1ZN7, 1LH0, 1FSG, 1JLS, 1L7N, 5AES, 3VAY, 1L7P, 3L7K, 4ZHT, 6JDT, 4X1T, 5LEO, 6RJE, 1R77, 7BNH and 7SHG. Accession codes for sequences used in this study are available in Supplementary Table 4 and Extended Data Fig. 1. NMR chemical shifts are presented in Supplementary Tables 1 and 2. Source data are provided in separate files with this paper.

References

Hallström, T. & Riesbeck, K. Haemophilus influenzae and the complement system. Trends Microbiol. 18, 258–265 (2010).
PubMed Google Scholar
Branefors-Helander, P. et al. Structural studies of the capsular antigen from Haemophilus influenzae type b. Acta Chem. Scand. 30b, 276–277 (1976).
Google Scholar
Willis, L. M. & Whitfield, C. Structure, biosynthesis, and function of bacterial capsular polysaccharides synthesized by ABC transporter-dependent pathways. Carbohydr. Res. 378, 35–44 (2013).
CAS PubMed Google Scholar
Micoli, F. et al. Glycoconjugate vaccines: current approaches towards faster vaccine design. Expert Rev. Vaccines 18, 881–895 (2019).
CAS PubMed Google Scholar
Verez-Bencomo, V. et al. A synthetic conjugate polysaccharide vaccine against Haemophilus influenzae type b. Science. 305, 522–525 (2004).
CAS PubMed Google Scholar
Ojo, L. R. et al. Global use of Haemophilus influenzae type b conjugate vaccine. Vaccine. 28, 7117–7122 (2010).
PubMed Google Scholar
del Bino, L. et al. Synthetic glycans to improve current glycoconjugate vaccines and fight antimicrobial resistance. Chem. Rev. 122, 15672–15716 (2022).
CAS PubMed PubMed Central Google Scholar
Satola, S. W., Schirmer, P. L. & Farley, M. M. Complete sequence of the cap locus of Haemophilus influenzae serotype b and nonencapsulated b capsule-negative variants. Infect. Immun. 71, 3639–3644 (2003).
CAS PubMed PubMed Central Google Scholar
van Eldere, J. et al. Region II of the Haemophilus influenzae type b capsulation locus is involved in serotype-specific polysaccharide synthesis. Mol. Microbiol. 15, 107–118 (1995).
PubMed Google Scholar
Zolli, M., Kobric, D. J. & Brown, E. D. Reduction precedes cytidylyl transfer without substrate channeling in distinct active sites of the bifunctional CDP-ribitol synthase from Haemophilus influenzae. Biochemistry 40, 5041–5048 (2001).
Kunduru, B. R., Nair, S. A. & Rathinavelan, T. EK3D: An E. coli K antigen 3-dimensional structure database. Nucleic Acids Res. 44, D675–D681 (2016).
CAS PubMed Google Scholar
Brown, S., Santa Maria, J. P. & Walker, S. Wall teichoic acids of gram-positive bacteria. Annu. Rev. Microbiol . 67, 313–336 (2013).
CAS PubMed Google Scholar
Kelley, L. A., Mezulis, S., Yates, C. M., Wass, M. N. & Sternberg, M. J. E. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 10, 845–858 (2015).
CAS PubMed PubMed Central Google Scholar
Lovering, A. L. et al. Structure of the bacterial teichoic acid polymerase TagF provides insights into membrane association and catalysis. Nat. Struct. Mol. Biol. 17, 582–589 (2010).
CAS PubMed Google Scholar
Litschko, C. et al. A new family of capsule polymerases generates teichoic acid-like capsule polymers in gram-negative pathogens. mBio 9, 16017 (2018).
Google Scholar
Burroughs, A. M., Allen, K. N., Dunaway-Mariano, D. & Aravind, L. Evolutionary genomics of the HAD Superfamily: understanding the structural adaptations and catalytic diversity in a superfamily of phosphoesterases and allied enzymes. J. Mol. Biol. 361, 1003–1034 (2006).
CAS PubMed Google Scholar
Rangarajan, E. S., Sivaraman, J., Matte, A. & Cygler, M. Crystal structure of d-ribose-5-phosphate isomerase (RpiA) from Escherichia coli. Proteins 48, 737–740 (2002).
CAS PubMed Google Scholar
Litschko, C., Budde, I., Berger, M. & Fiebig, T. Exploitation of capsule polymerases for enzymatic synthesis of polysaccharide antigens used in glycoconjugate vaccines. Methods Mol. Biol. 2183, 313–330 (2021).
CAS PubMed Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature. 596, 583–589 (2021).
CAS PubMed PubMed Central Google Scholar
Whisstock, J. C. & Lesk, A. M. SH3 domains in prokaryotes. Trends Biochem. Sci. 24, 132–133 (1999).
CAS PubMed Google Scholar
Boratyn, G. M. et al. BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 41, W29–W33 (2013).
PubMed PubMed Central Google Scholar
Hove-Jensen, B. et al. Phosphoribosyl diphosphate (PRPP): biosynthesis, enzymology, utilization, and metabolic significance. Microbiol Mol. Biol. Rev. 81, e00040-16 (2017).
PubMed Google Scholar
Scapin, G., Grubmeyer, C. & Sacchettini, J. C. Crystal structure of orotate phosphoribosyltransferase. Biochemistry 33, 1287–1294 (1994).
CAS PubMed Google Scholar
González-Segura, L., Witte, J. F., McClard, R. W. & Hurley, T. D. Ternary complex formation and induced asymmetry in orotate phosphoribosyltransferase. Biochemistry 46, 14075–14086 (2007).
PubMed Google Scholar
Kuznetsova, E. et al. Functional diversity of haloacid dehalogenase superfamily phosphatases from saccharomyces cerevisiae: biochemical, structural, and evolutionary insights. J. Biol. Chem. 290, 18678–18698 (2015).
CAS PubMed PubMed Central Google Scholar
Albesa-Jové, D., Giganti, D., Jackson, M., Alzari, P. M. & Guerin, M. E. Structure-function relationships of membrane-associated GT-B glycosyltransferases. Glycobiology 24, 108–124 (2014).
PubMed Google Scholar
Litschko, C. et al. Mix-and-match system for the enzymatic synthesis of enantiopure glycerol-3-phosphate-containing capsule polymer backbones from Actinobacillus pleuropneumoniae, Neisseria meningitidis, and Bibersteinia trehalosi. mBio 12, e0089721 (2021).
PubMed Google Scholar
Mayer, B. J. & Eck, M. J. SH3 domains: minding your p’s and q’s. Curr. Biol. 5, 364–367 (1995).
CAS PubMed Google Scholar
Fiebig, T. et al. Efficient solid-phase synthesis of meningococcal capsular oligosaccharides enables simple and fast chemoenzymatic vaccine production. J. Biol. Chem. 293, 953–962 (2018).
CAS PubMed Google Scholar
Budde, I. et al. An enzyme-based protocol for cell-free synthesis of nature-identical capsular oligosaccharides from Actinobacillus pleuropneumoniae serotype 1. J. Biol. Chem. 295, 5771–5784 (2020).
CAS PubMed PubMed Central Google Scholar
Whitfield, C., Wear, S. S. & Sande, C. Assembly of bacterial capsular polysaccharides and exopolysaccharides. Annu. Rev. Microbiol 74, 521–543 (2020).
CAS PubMed Google Scholar
Osawa, T. et al. Crystal structure of chondroitin polymerase from Escherichia coli K4. Biochem. Biophys. Res. Commun. 378, 10–14 (2009).
CAS PubMed Google Scholar
Lizak, C. et al. X-ray crystallographic structure of a bacterial polysialyltransferase provides insight into the biosynthesis of capsular polysialic acid. Sci. Rep. 7, 5842 (2017).
PubMed PubMed Central Google Scholar
Muindi, K. M. et al. Characterization of the meningococcal serogroup X capsule N-acetylglucosamine-1-phosphotransferase. Glycobiology 24, 139–149 (2014).
CAS PubMed Google Scholar
Sperisen, P., Schmid, C. D., Bucher, P. & Zilian, O. Stealth proteins: in silico identification of a novel protein family rendering bacterial pathogens invisible to host immune defense. PLoS Comput. Biol. 1, 0492–0499 (2005).
CAS Google Scholar
Maloney, F. P. et al. Structure, substrate recognition and initiation of hyaluronan synthase. Nature 604, 195–201 (2022).
CAS PubMed PubMed Central Google Scholar
May, J. F., Splain, R. A., Brotschi, C. & Kiessling, L. L. A tethering mechanism for length control in a processive carbohydrate polymerization. Proc. Natl Acad. Sci. USA 106, 11851–11856 (2009).
CAS PubMed PubMed Central Google Scholar
Kari, J. et al. Sabatier principle for interfacial (heterogeneous) enzyme catalysis. ACS Catal. 8, 11966–11972 (2018).
CAS Google Scholar
Yakovlieva, L. & Walvoort, M. T. C. Processivity in bacterial glycosyltransferases. ACS Chem. Biol. 15, 3–16 (2020).
CAS PubMed Google Scholar
Nussinov, R., Tsai, C. J. & Ma, B. The underappreciated role of allostery in the cellular network. Annu. Rev. Biophys. 42, 169–189 (2013).
CAS PubMed PubMed Central Google Scholar
Chen, L. et al. Cryo-electron microscopy structure and transport mechanism of a wall teichoic acid ABC transporter. mBio 11, e02749-19 (2020).
PubMed PubMed Central Google Scholar
Bi, Y., Mann, E., Whitfield, C. & Zimmer, J. Architecture of a channel-forming O-antigen polysaccharide ABC transporter. Nature 553, 361–365 (2018).
CAS PubMed PubMed Central Google Scholar
Phanphak, S. et al. Super-resolution fluorescence microscopy study of the production of K1 Capsules by Escherichia coli: evidence for the differential distribution of the capsule at the poles and the equator of the cell. Langmuir 35, 5635–5646 (2019).
CAS PubMed PubMed Central Google Scholar
Kelly, S. D. et al. The biosynthetic origin of ribofuranose in bacterial polysaccharides. Nat Chem. Biol. 18, 530–537 (2022).
CAS PubMed Google Scholar
Guerardel, Y. et al. Complete structure of the enterococcal polysaccharide antigen (EPA) of vancomycin-resistant enterococcus faecalis V583 reveals that EPA decorations are teichoic acids covalently linked to a rhamnopolysaccharide backbone. mBio 11, e00277–20 (2020).
CAS PubMed PubMed Central Google Scholar
van Dalen, R. et al. Do not discard Staphylococcus aureus WTA as a vaccine antigen. Nature 572, E1–E2 (2019).
PubMed Google Scholar
Gerlach, D. et al. Methicillin-resistant Staphylococcus aureus alters cell wall glycosylation to evade immunity. Nature 563, 705–709 (2018).
CAS PubMed Google Scholar
Kuwabara, N. et al. Crystal structures of fukutin-related protein (FKRP), a ribitol-phosphate transferase related to muscular dystrophy. Nat. Commun. 11, 303 (2020).
CAS PubMed PubMed Central Google Scholar
Oldrini, D. et al. Combined chemical synthesis and tailored enzymatic elongation provide fully synthetic and conjugation-ready neisseria meningitidis serogroup X vaccine antigens. ACS Chem. Biol. 13, 984–994 (2018).
CAS PubMed Google Scholar
Fiebig, T. et al. Molecular cloning and functional characterization of components of the capsule biosynthesis complex of Neisseria meningitidis serogroup A: toward in vitro vaccine production. J. Biol. Chem. 289, 19395–19407 (2014).
CAS PubMed PubMed Central Google Scholar
Fiebig, T. et al. Functional expression of the capsule polymerase of Neisseria meningitidis serogroup X: a new perspective for vaccine development. Glycobiology 24, 150–158 (2014).
CAS PubMed Google Scholar
Bond, S. R. & Naus, C. C. RF-Cloning.org: an online tool for the design of restriction-free cloning projects. Nucleic Acids Res. 40, W209–W213 (2012).
CAS PubMed PubMed Central Google Scholar
Liu, H. & Naismith, J. H. An efficient one-step site-directed deletion, insertion, single and multiple-site plasmid mutagenesis protocol. BMC Biotechnol. 8, 91 (2008).
PubMed PubMed Central Google Scholar
Markley, J. L. et al. Recommendations for the presentation of NMR structures of proteins and nucleic acids. IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy. J. Biomol. NMR 12, 1–23 (1998).
CAS PubMed Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
CAS PubMed PubMed Central Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010).
CAS PubMed PubMed Central Google Scholar
Cowtan, K. The Buccaneer software for automated model building. 1. Tracing protein chains. Acta Crystallogr. D Biol. Crystallogr. 62, 1002–1011 (2006).
PubMed Google Scholar
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 67, 235–242 (2011).
CAS PubMed PubMed Central Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr D. Biol. Crystallogr 66, 486–501 (2010).
CAS PubMed PubMed Central Google Scholar
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 68, 352–367 (2012).
CAS PubMed PubMed Central Google Scholar
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D Biol. Crystallogr. 67, 355–367 (2011).
CAS PubMed PubMed Central Google Scholar
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D. Biol. Crystallogr 66, 12–21 (2010).
CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
CAS PubMed Google Scholar
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
PubMed PubMed Central Google Scholar
Madeira, F. et al. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Res. https://doi.org/10.1093/nar/gkac240 (2022).
Article PubMed PubMed Central Google Scholar
Papadopoulos, J. S. & Agarwala, R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 23, 1073–1079 (2007).
CAS PubMed Google Scholar
Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009).
CAS PubMed PubMed Central Google Scholar
Ho, B. K. & Gruswitz, F. HOLLOW: generating accurate representations of channel and interior surfaces in molecular structures. BMC Struct. Biol. 8, 49 (2008).
PubMed PubMed Central Google Scholar
Holm, L. & Rosenström, P. Dali server: conservation mapping in 3D. Nucleic Acids Res. 38, W545–W549 (2010).
CAS PubMed PubMed Central Google Scholar
Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 372, 774–797 (2007).
CAS PubMed Google Scholar
Willis, L. M. & Whitfield, C. KpsC and KpsS are retaining 3-deoxy-D-manno-oct-2-ulosonic acid (Kdo) transferases involved in synthesis of bacterial capsules. Proc. Natl Acad. Sci. USA 110, 20753–20758 (2013).
CAS PubMed PubMed Central Google Scholar
McNulty, C. et al. The cell surface expression of group 2 capsular polysaccharides in Escherichia coli: the role of KpsD, RhsA and a multi-protein complex at the pole of the cell. Mol. Microbiol. 59, 907–922 (2006).
CAS PubMed Google Scholar
Rigg, G. P., Barrett, B. & Roberts, I.The localization of KpsC, S and T, and KfiA, C and D proteins involved in the biosynthesis of the Escherichia coli K5 capsular polysaccharide: evidence for a membrane-bound complex. Microbiology 144, 2905–2914 (1998).
CAS PubMed Google Scholar

Download references

Acknowledgements

This study was funded by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under project 412824531 (to T.F.), MHH internal impact orientated funds (LOM) to the Institute of Clinical Biochemistry, the MINECO/FEDER EU under contracts PID2019-105649RB-I00 and Severo Ochoa Excellence Accreditation SEV-2016-0644 (to M.E.G.), the Basque Government under contracts KK-2021-00034 and KK-2022/00107 (to M.E.G.), and NIH R01AI149297 (to M.E.G.). We acknowledge Diamond Light Source (DLS; proposal MX28360), ALBA synchrotron beamline BL13-XALOC (proposal MX2018093013) and Swiss Light Source (proposal 20201980) for providing access to synchrotron radiation facilities. We gratefully acknowledge all members of the Structural Glycobiology Group (IIS-BioCruces, Spain) and the Institute of Clinical Biochemistry for valuable scientific discussions. We thank Jan Faix for kindly providing PreScission protease and Jana Führing for feedback on the manuscript.

Funding

Open access funding provided by Medizinische Hochschule Hannover (MHH).

Author information

These authors contributed equally: Javier O. Cifuente, Julia Schulze, Andrea Bethe, Valerio Di Domenico.

Authors and Affiliations

Structural Glycobiology Laboratory, Biocruces Bizkaia Health Research Institute, Cruces University Hospital, Barakaldo, Spain
Javier O. Cifuente, Valerio Di Domenico, Cecilia D’Angelo & Marcelo E. Guerin
Structural Glycobiology Laboratory, Center for Cooperative Research in Biosciences (CIC bioGUNE), Basque Research and Technology Alliance (BRTA), Bizkaia Technology Park, Derio, Spain
Javier O. Cifuente, Cecilia D’Angelo, Alberto Marina & Marcelo E. Guerin
Institute of Clinical Biochemistry, Hannover Medical School, Hannover, Germany
Julia Schulze, Andrea Bethe, Christa Litschko, Insa Budde, Hauke Thiesler, Isabel Ramón Roth, Monika Berger, Rita Gerardy-Schahn & Timm Fiebig
Department of Applied Genetics and Cell Biology, University of Natural Resources and Life Sciences, Vienna, Austria
Lukas Eidenberger
Institute for Hygiene and Microbiology, University of Würzburg, Würzburg, Germany
Heike Claus
Department of Biosciences and Medical Biology, University of Salzburg, Salzburg, Austria
Mario Schubert
Ikerbasque Basque Foundation for Science, Bilbao, Spain
Marcelo E. Guerin

Authors

Javier O. Cifuente
View author publications
You can also search for this author in PubMed Google Scholar
Julia Schulze
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Bethe
View author publications
You can also search for this author in PubMed Google Scholar
Valerio Di Domenico
View author publications
You can also search for this author in PubMed Google Scholar
Christa Litschko
View author publications
You can also search for this author in PubMed Google Scholar
Insa Budde
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Eidenberger
View author publications
You can also search for this author in PubMed Google Scholar
Hauke Thiesler
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Ramón Roth
View author publications
You can also search for this author in PubMed Google Scholar
Monika Berger
View author publications
You can also search for this author in PubMed Google Scholar
Heike Claus
View author publications
You can also search for this author in PubMed Google Scholar
Cecilia D’Angelo
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Marina
View author publications
You can also search for this author in PubMed Google Scholar
Rita Gerardy-Schahn
View author publications
You can also search for this author in PubMed Google Scholar
Mario Schubert
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo E. Guerin
View author publications
You can also search for this author in PubMed Google Scholar
Timm Fiebig
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.O.C, V.D.D. and A.M. crystallized proteins and collected X-ray data; J.O.C. and V.D.D. performed alpha fold modeling; J.O.C., V.D.D. and M.E.G. solved and analyzed the crystal structure; J.O.C. and C.D. performed structural and sequence analyses; J.S. and T.F. analyzed capsule gene clusters to identify homologs of Bcs3; A.B. purified all proteins, polymer and substrates; J.S., A.B., C.L., I.B., I.R.R. and M.B. generated mutants and performed activity assays; L.E. and H.T. purified CDP-ribitol, H.C. provided the Hib reference strain, agglutination sera and expertise regarding Hib biology; M.S. performed and analyzed NMR experiments; T.F. identified Bcs3, performed homology modeling and sequence analysis, and designed mutants, truncations and the cascade synthesis; J.O.C., V.D.D. and M.E.G. proposed the mechanism for Bcs3; R.G.S. helped with data interpretation and manuscript drafting. M.E.G. and T.F. conceived the project and oversaw the experimental work; J.O.C., J.S., V.D.D., C.D., M.S., M.E.G. and T.F. generated figures, tables and artwork; J.O.C., J.S., M.S., M.E.G. and T.F. wrote the initial draft of the manuscript. All authors discussed the results, gave feedback on the manuscript and made substantial contributions to the final version.

Corresponding authors

Correspondence to Marcelo E. Guerin or Timm Fiebig.

Ethics declarations

Competing interests

All the authors declare no competing interests.

Peer review

Peer review information

Nature Chemical Biology thanks Mark von Itzstein, Helge Dorfmueller and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Schematic overview of capsule gene clusters and capsule structures from Hib and E. coli.

They encode ABC transporter-dependent capsule assembly systems (group 2 for E. coli nomenclature) that can be found in other important human and animal pathogens, like extraintestinal pathogenic E. coli, Neisseria meningitidis, Actinobacillus pleuropneumoniae, Campylobacter jejuni, and Pasteurella multocida³. a, b, Capsule gene clusters of (a) Hib and (b) E. coli strains are divided in conserved regions (gray boxes) and capsule-specific regions (white boxes), encoding proteins involved in the generation of a conserved glycolipid³¹ (green), proteins involved in polymer export (blue), and proteins involved in capsule polymer synthesis. The E. coli K5 capsule gene cluster is depicted as a reference for a well-characterized capsule biosynthesis system³ and the genes encoding the polymerase domains, which share no similarity with Bcs3, are indicated in orange. The bcs3 gene and genes harboring homologs of the CrpP and CriT domains are indicated in red. In E. coli, these genes are either paired with genes encoding CroT homologs in strains expressing capsules containing ribitol-phosphate (for example in K18). Or they are paired with proteins of unknown function, for which Phyre2¹³ homology modeling predicts (among others) weak similarity to Kdo-transferases. In both cases, the domains can be part of a single polypeptide or expressed separately. c, Capsule structures of Hib and E. coli strains as indicated in a, b, and capsule structures of E. coli strains that harbor ribofuranose in their capsule polymer, but for which sequence information is unavailable (extracted from the E. coli K antigen 3-dimensional structure database EK3D)¹¹.

Extended Data Fig. 2 Purification and NMR analysis of enzymatically synthesized polymer.

a, Hib polymer (5 mg) was purified from an enzyme reaction containing RpiA, Bcs1, and Bcs3 using preparative AEC. The remaining nucleotide substrates and nucleotide products (see Fig. 1c) were UV-detectable at 280 nm, eluted early from the column and could be separated from b, the UV-inactive Hib polymer, which was visualized by Alcian blue / silver staining after PAGE. Polymer containing fractions were pooled, dialyzed against water, and c, analyzed by 1D and 2D NMR. All correlations are in agreement with published literature (Supplementary Table 1). Correlations confirming the linkage of the moieties in the repeating unit are highlighted in red in the structure and indicated by dashed lines. HMBC: heteronuclear multiple bond correlation, HSQC: heteronuclear single-quantum coherence.

Source data

Extended Data Fig. 3 Local stereo views of CroT in complex with CMP, and of the SH3b domain in complex with polymer.

a, Model of CroT (orange ribbons) in complex with CMP, glycerol, and phosphate and the corresponding electron density map displayed at 1 RMSD. b, Local density map for the CMP, glycerol, and phosphate found in CroT. c, Model of selected SH3b domain (tan-colored ribbons) in complex with DP2 (dimer of Hib capsule polymer) displaying the overall density map at 1 RMSD. d, Density map for the DP2 ligand. The density for ribose, ribitol, and phosphate allows the unambiguous placing of the DP2 and helped determine the orientation of the polymer.

Extended Data Fig. 4 Local stereo views of the CrpP enzyme and CriT in complex with CMP.

a, Model of selected CrpP (yellow ribbons) and overall density map at 1 RMSD. b, The density map for the phosphate and Zn²⁺ found in the active site of CrpP was used as a reference for the polymer substrate docking. c, Model of selected CriT (red ribbons) displaying the overall density map at 1 RMSD. CriT has an overall size of 30 Å × 25 Å × 25 Å. The overall scaffold and the conformation of the CriT and CriT’ protomers were essentially preserved (RMSD of 0.77 Å for residues 592–1036). d, The density map for the two phosphates found in the active site was used as a reference for the docking of PRPP.

Extended Data Fig. 5 Bcs3 protomer dimerization contacts.

a-c, Contacts and ribbon representations are colored as follows: CroT (orange), CrpP (yellow), CriT (red), and SH3b domain (tan). In panel b, the dimer from the Bcs3-CMP complex is shown in two representations. Top: The protomer containing the CroT-CMP complex is shown in ribbons and the complementary protomer in a surface representation (up to the dashed line). Bottom: The protomer containing the CroT-CMP complex is shown in a surface representation and the complementary protomer is shown in ribbons (below the dashed line). The panels a and c present 90° rotations of the individual protomers in surface representation, disclosing the surface contacts between protomers by coloring contacting residues according to the enzyme they belong to. Note the conservation of the main interactions in the center of the protomers that correspond to interactions between CrpP and CriT’, and CrpP’ and CriT. The HB-1 of CriT mediates the interaction of CriT with the phosphatase CrpP and the SH3b domain of the same protomer, as well as with the ribose transferase CriT’ of the other protomer (see HB-1 in Fig. 2a-c). The HB-2 mediates the interaction of CriT with (i) CrpP’ and (ii) the C-terminal Rossmann-fold domain of CroT’ from the neighboring protomer. In contrast, HB-3 is mainly exposed to the solvent. Two encircled regions in the protomers, 1 and 1’ for CriT and CriT’ (a, c), and 2 and 2’ for CroT and CroT’ (a, c), display variability associated with the conformational changes in the CriT-CroT’ and CriT’-CroT interphase. These changes in the interaction correlate with the presence or absence of CMP in CroT. The contacts visualized above are mediated by the following structural elements (secondary structural elements are shown in Supplementary Data 1): (i) the α42 (residues 756–761) of CriT interacts with α11 (residues 207–210) and α14 (residues 240–248) of CroT, and (ii) α43 (residues 770–780) of CriT interacts with α11 (residues 207–210), the loop connecting β7-α11 (residues 196–206), and α17 (residues 300–303) of CroT. Furthermore, (i) α24 (residues 419–423) of CrpP interacts with the α34-α35 loop (residues 607–612), the α47-α48 loop (residues 923–928), and α48 (residues 929–946) of CriT, (ii) the long α25 (residues 425–443) of CrpP interacts with α39 (residues 722–725), the α39-α40 loop (residues 726–736), β22-α45 loop (residues 818–823), β23-β24 loop (residues 848–857), β24-α46 loop (863–869), and α48 (929–946) of CriT, (iii) the α26-α27 loop (residues 468–470) of CrpP interacts with the end of α36 (residues 638–669) of CriT, and (iv) α26 (residues 456–467) of CrpP interacts with the end of α36 (residues 638–669), β24 (858–862), β23-β24 loop (residues 848–857), and β24-α46 loop (863–869) of CriT.

Extended Data Fig. 6 Conformational comparison between both reaction center triads in the Bcs3-CMP complex crystal structure.

a, A surface and ribbon representation of a cross-section of the concave face of the Bcs3 dimer showing the two reaction centers, each containing one triangular active site. The reaction center triads are formed by CriT’/CrpP/CroT-CMP (left), and CriT/CrpP’/CroT’ (right). Colored arrows indicate the presence of grooves for the putative entrance of the polymer to the active sites of CroT (orange arrow), CriT (red arrow), and CrpP (yellow arrow). Observe conformational changes in the CriT-CroT’ and CriT’-CroT interphase between reaction centers, indicated by a dashed line. A closer interaction between CroT and CriT correlates with the occupation of the CroT active site by CMP. b, Local structural superposition of the CroT nucleotide-binding domains of the CroT-CMP complex (in orange) and CroT’ (in gray). The superposition demonstrates the relative open-to-close motion between the two CroT enzymes, with a more closed conformation for CroT-CMP and a more open conformation for CroT’, indicated by dashed arrows. c, Local structural superposition of CriT (in red) and CriT’ (in gray). The superposition demonstrates the relative, mostly unaffected conformation of the CriT enzymes, only presenting a slight movement for the CriT’ HB3 indicated by the dashed arrows. The above presented data suggest that conformational changes in the reaction centers are due to the entrance and release of substrates in the CroT enzymes, which likely undergo conformational changes during the catalytic cycle.

Extended Data Fig. 7 Development of a multi-enzyme cascade suitable for generating Hib polymer at mg-scale and production of oligomers.

See Supplementary Note for a detailed description of the results. a, Schematic of the enzyme cascade for the synthesis of Hib polymer starting from d-ribose (Rib). b-f, HPLC-AEC analysis of samples as indicated. Enzymatic activity of (b) RK and (c) PrsA using ATP, CTP or UTP as donor substrate. Enzymatic activity of (d) PrsA in presence of ADP. The reaction cascade (e, f) was started with (e) Rib or (f) Rib-5P using the indicated enzymes and their substrates as shown in a. g, Alcian blue stained polyacrylamide (PA) gel of polymer synthesized starting from Rib-5P (lane 1) or Rib phosphorylated by RK using UTP (lane 2) or ATP (lane 3), respectively. h, Purification of polymer from an up-scaled synthesis using preparative AEC. Remaining nucleotide substrates and nucleotide products (see a) are UV-detectable at 280 nm, eluted early from the column, and could be separated from i, the UV-inactive Hib polymer, which was visualized by Alcian blue staining after PAGE. j, Test hydrolysis of Hib polymer in 25 mM TFA (37 °C) or 10 mM acetic acid (70 °C). After the indicated time points, samples were taken and analyzed using PAGE followed by Alcian blue / silver staining. k, 50 mg of Hib polymer were hydrolyzed (25 mM TFA, 120 min, 37 °C), fragments were separated by preparative AEC, collected as 12 separate pools (pool 1 not shown), and visualized by Alcian blue / silver staining after PAGE. l, 1D NMR analyses of the indicated oligomer pools. The annotation of the signals is based on a comprehensive 2D NMR analysis and in agreement with published literature (see Supplementary Table 2 for chemical shifts and references). The ratio between the signal intensities of the anomeric protons from the non-reducing end β-Ribf and the internal β-Ribf in the polymer allows an estimation of the average degree of polymerization of the oligomers comprised in the pool. E.g. for pool 2: 1 + 7.25 = 8.25 → oligomers of pool 2 have an average chain length of app. eight repeating units. Alternatively, the H3 signal of the internal β-Ribf can be used instead of the anomeric signal.

Source data

Extended Data Fig. 8 Analysis of the chain length distribution generated by Bcs3 and Bcs3 truncations.

Reactions were performed with the constructs as indicated (numbering refers to Fig. 3b or Supplementary Fig. 2) at varying donor (CDP-ribitol, PRPP) to acceptor (pool 3, Extended Data Fig. 7) ratios and products were visualized after PAGE by Alcian blue staining. The donor to acceptor ratio (d/a) is indicated above each lane. a: acceptor substrate in a concentration detectable on a gel (6x higher than in a reaction with d/a of 5). a, b, Even at low d/a (50, 25, 10), products of considerable length are visible in reactions containing the C-terminally truncated constructs 11 (Bcs3_2–1162) and 12 (Bcs3_2–1122), which both contain an intact polymer binding domain. Product populations are disperse and small neglected acceptors can be detected, especially at a low d/a of for example 10 or 5 (d). Both observations are hallmarks of processive elongation³⁹ mediated by an extended acceptor binding site and are consistent with a model in which the polymer binding domain tethers the acceptor. c, With the intent to disrupt the efficiency and/or processivity of the Bcs3 machine, acceptor elongation was performed using the separated CroT, CrpP, and CriT domains in trans-complementation reactions. Indeed, products generated using the truncations 21 (Bcs3_2–378), 24 (Bcs3_354–631), and 17 (Bcs3_6312–1162) are smaller and of lower dispersity (c, d). Both observations are consistent with a more distributive elongation mode, putatively enforced by the domain separation and the fact that each domain carries an MBP tag, which, we hypothesize, prevents the concerted action of the domains to a certain degree. A dashed line indicates that samples not relevant for data presentation were excised.

Source data

Extended Data Fig. 9 A model of action for Bcs3.

The dimeric structural arrangement of Bcs3 allows the formation of a, a large reaction center in which the nascent chain is assembled by enzymes from both protomers - CriT/CrpP/CroT and CriT’/CrpP’/CroT’ - in a common cycle with all six enzymes working in sequential order. However, the elongated shape of the basket-like compartment, which places the CroT enzymes far away from each other, might prevent them from participating in the synthesis of a common polymeric thread. Moreover, this model would likely promote the two capsule polymers forming a coil after a few rounds of catalysis. b, Alternatively, two reaction centers are formed by different protomers - CriT/CrpP’/CroT’ and CriT’/CrpP/CroT - along the transversal longest axis across the main cleft of Bcs3. This model is supported by c, d, the fact that the active sites of the CriT/CrpP’/CroT’ and CriT’/CrpP/CroT triads are in the same plane, respectively, each separated by an angle of ca. 120°. It is worth noting that each reaction center is equidistant from the three active sites, forming a triangle (b-d, see also Extended Data Fig. 6a). The Bcs3-DP2 complex structure shows that the two chains attached to the corresponding SH3b domains each point to one of the reaction centers proposed by model 2. It is tempting to speculate that this structural arrangement of the active sites within a concave basket-like compartment promotes substrate channeling and facilitates the sequence of reactions, a characteristic of multi-enzyme complexes involved in efficient pathways.

Extended Data Fig. 10 Model of Bcs3 in the context of the ABC-transporter dependent capsule assembly system of Hib.

a, (1, 2) Capsule expression in ABC-transporter dependent assembly systems starts with the synthesis of a poly-Kdo linker attached to a (lyso)phosphatidylglycerol. This precursor is conserved in all Gram-negative bacteria utilizing the ABC-transporter dependent pathway and its assembly is catalyzed in E. coli by the β-Kdo transferases KpsS and KpsC³¹. KpsS transfers a single Kdo, which is then extended by KpsC. By homology, HcsB and HcsA were suggested to catalyze this reaction in Hib⁷¹. (3) One or several yet to be identified transition transferases were hypothesized to add the first repeating unit to the poly-Kdo, in order to create an acceptor for the (4) capsule polymerase, which assembles the capsule polymer and determines the serotype. The electrostatic surface potential of Bcs3 and its concave surface suggest an orientation in which the active centers of the enzyme face the inner membrane and/or the (5) ABC transporter-dependent export complex (BexA-D). The distance between the two SH3b binding domains might allow an enzymatic assembly in which each of the two chains generated by the Bcs3 dimer is exported by one ABC transporter complex. It is of note that interactions between the components of the Hib assembly system have not been experimentally proven. However, studies in E. coli present evidence for the formation of a multi-protein complex^72,73. Moreover, ATP-dependent translocation coupled with Bcs3 synthesis and channeling of two polymer threads by the SH3b domain through two transporters is hypothesized as an efficient scenario for capsule expression. To allow a concise presentation, enzymes providing nucleotide substrates (for example CMP-Kdo, CDP-ribitol) are not shown. b, Reaction cycle of Bcs3 for the addition of one repeating unit to the non-reducing end of the nascent Hib capsule polymer.

Supplementary information

Supplementary Information

Supplementary Tables 1–5, Supplementary Figs. 1–22 (Supplementary Figs. 19–22 contain supporting data for Supplementary Figs. 2, 3, 4 and 10), Supplementary Note and Supplementary Discussion.

Reporting Summary

Supplementary Data 1

Sequence alignment of Bcs3 homologs—A BLAST⁶⁹ search was performed using the amino acid sequence of full-length Bcs3 as query and hits with high sequence coverage were manually chosen, aligned using Cobalt⁷⁰ and annotated using Jalview⁷¹. Homologs from Gram-negative and Gram-positive pathogens are highlighted in dark gray (top half of the alignment) or light gray (bottom half), respectively. Database references are provided. The secondary structural elements extracted from a selected protomer of Bcs3-CMP are shown above the alignment. Amino acid residues with at least 70 % of identity are highlighted in black. A sequence logo plot is depicted below the alignment to provide a detailed overview of the general consensus at every position. Amino acids H96, H233, D386, D680 and E818, which were targeted in this study by site-directed mutagenesis, are shown in red. The consensus of the DUF5776 according to the conserved domain database⁵¹ for the SH3b domain is shown in blue, highlighting that this domain is present in the majority of identified homologs. Interestingly, the C-terminal alpha-helical bundle (α56-α59) is only present in homologs having a Gram-negative background. The C-terminus of the E. coli sequence was manually aligned to allow a gap between α53 and α56.

Supplementary Data 2

Sequence alignment of CroT homologs—Biochemically characterized CroT homolgoues from Gram-negative (gray) and Gram-positive (light gray) sources were aligned using MUSCLE⁷² and the alignment was annotated using Jalview⁷¹. Database references are provided. Homologs transferring ribitol-phosphate and glycerol-phosphate are indicated by full and empty circles, respectively^9,73–76. The secondary structural elements extracted from a selected CriT from Bcs3-CMP are shown above the alignment. Amino acid residues with >70 % of identity are highlighted in black. A sequence logo plot is depicted below the alignment to provide a detailed overview of the general consensus at every position. Amino acids H96 and H233, which were targeted in this study by site-directed mutagenesis, are shown in red. Glycine and proline residues putatively determining the enzymes’ preference for ribitol and glycerol, respectively, are highlighted in yellow.

Supplementary Data 3

Sequence alignment of Bcs3 with homologs from E. coli—Bcs3 was aligned using MUSCLE⁷² with the putative CroT-K18 (STI11141.1) and CrpP-CriT-K18 (STI11142.1) from E. coli K18 (see Extended Data Fig. 1), and with a homolog (STP42972.1) identified by BLAST (see Supplementary Data 1) in a non-serotyped E. coli strain, which expresses all three domains as single polypeptide and has high sequence identity with CroT-K18 and CrpP-CriT-K18. The secondary structural elements extracted from a selected protomer from Bcs3-CMP are shown above the alignment. The color code introduced for CroT, CrpP, CriT and SH3b in Figs. 1 and 3 was used to color the Bcs3 sequence. Amino acids targeted to separate the Bcs3 enzymes are highlighted in gray.

Source data

Source Data Fig. 1

Unprocessed gel and blot.

Source Data Fig. 3

Unprocessed gels.

Source Data Extended Data Fig. 2

Unprocessed gel.

Source Data Extended Data Fig. 7

Unprocessed gels.

Source Data Extended Data Fig. 8

Unprocessed gels.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cifuente, J.O., Schulze, J., Bethe, A. et al. A multi-enzyme machine polymerizes the Haemophilus influenzae type b capsule. Nat Chem Biol 19, 865–877 (2023). https://doi.org/10.1038/s41589-023-01324-3

Download citation

Received: 13 July 2022
Accepted: 31 March 2023
Published: 05 June 2023
Issue Date: July 2023
DOI: https://doi.org/10.1038/s41589-023-01324-3

This article is cited by

Transition transferases prime bacterial capsule polymerization
- Christa Litschko
- Valerio Di Domenico
- Timm Fiebig
Nature Chemical Biology (2024)
Basket of sweets
- Helge C. Dorfmueller
Nature Chemical Biology (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Bcs3 is the Hib capsule polymerase

PRPP is the ribose donor for the synthesis of Hib polymer

The overall architecture of Bcs3

Structural basis of CriT substrate specificity and catalysis

Structural basis of CrpP substrate specificity and catalysis

Structural basis of CroT substrate specificity and catalysis

Structural basis of Bcs3 Hib capsule polymer recognition

Molecular basis of Bcs3 enzymology

Discussion

Methods

General cloning

Expression and purification of recombinant proteins

Enzymatic reactions, HPLC-AEC analysis and PAGE

Upscaling of polymer synthesis and purification

Preparation of oligomers

Dot blot/agglutination assay

NMR analysis

Bcs3 crystallization and data collection

Bcs3 structure determination and refinement

Structural analysis and sequence alignment

Molecular substrate docking

Statistics and reproducibility

Reporting summary

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links