Structural basis for cooperative oxygen binding and bracelet-assisted assembly of Lumbricus terrestris hemoglobin

The iron-containing hemoglobins (Hbs) are essential proteins that serve as oxygen transporters in the blood. Among the various kinds of Hbs, the earthworm Hbs are the champions in carrying oxygen, owing not only to their large size but also to the unusually high cooperativity of ligand binding. However, the mechanisms of cooperative oxygen binding are still mostly unknown. Here we report the cryo-electron microscopy structure of Lumbricus terrestris Hb in its native, oxygenated state at 9.1 Å resolution, showing remarkable differences from the carbon monoxide-binding X-ray structure. Our structural analysis provides the first indication that the cooperative ligand binding of L. terrestris Hb requires tertiary and quaternary transitions in the heme pocket and a global subunit movement facilitated by intra-ring and inter-ring contacts. Moreover, the additional sinusoidal bracelet helps settle the long-standing debate about the additional electron densities absent in the X-ray crystal structure.

proposed by Vinogradov et al. 13 described the role of the central bracelet to act as linkers between globin subunits. The 3.5 Å resolution crystal structure revealed that the central subunits are formed by the N-terminal triple-stranded coiled coils of the linker chains. However, the smaller central cavity observed in some cryo-EM structures 6,14,15 and the sinusoidal pillars observed in the 14.9 Å cryo-EM structure 4 imply the plausible existence of additional central densities which are still absent in the X-ray structure.
In this paper, we report the 9.1 Å resolution cryo-EM structure of the entire L. terrestris Hb in the oxygenated form, which provides the first near-atomic resolution structure for this protein in its native state. By combining the 3.5 Å resolution crystal structure and flexible fitting procedures with our cryo-EM reconstruction, we construct a pseudo-atomic model for a high-resolution description of the subunit arrangement in the oxygenated state. Comparison of the conformations of L. terrestris Hb in the two functional states reveals tertiary and quaternary allostery in the heme pocket and an alteration of the overall size of the complex, which provide clues to the cooperative mechanism. Moreover, the electron densities of an additional sinusoidal bracelet are clearly visible in our cryo-EM data, and this discovery may account for a new assembly mechanism of the whole Hb complex.

Results
9.1 Å resolution cryo-EM map of L. terrestris Hb. To investigate the conformational change induced upon O2 binding, we carried out single particle cryo-EM analysis of L. terrestris Hb in its oxygenated state (Fig. 1a and 1b). No reference model was used during the reconstruction process, to avoid introducing model bias. The final 3D reconstruction is presented in Fig. 1, with one colored protomer. The map resolution was 9.1 Å according to the gold standard criterion (Supplementary Fig. 1). At 9.1 Å resolution, all 144 globin subunits and 36 linker chains can be unambiguously assigned. The 36 non-heme linker chains form a central core which acts as a scaffold covered by 144 O2-binding globins. Two unique dyad axes designated ''Q'' and ''P'' (Fig. 1b, 1c and 1d), oriented every 30° in the central plane, reflect the D6 symmetry of the complex. The higher resolution of our cryo-EM map allows a clear visualization of each α-helix in the N-terminal triple-stranded coiled coils. This allowed unambiguous assignment of the three linker chains, designated L1, L2, and L3 based on the known crystal structure. Interestingly, our cryo-EM map reveals additional electron densities in the N-terminal domain of the L1 and L2 linker chains.
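The D6 symmetry described above can be made concrete with a small sketch. It assumes the six-fold axis lies along z and that one in-plane dyad lies along x; these axis conventions, and all function names, are illustrative choices rather than anything stated in the paper. The sketch builds the 12 rotations of the group, checks that they are closed under multiplication, and relates them to the subunit counts in the text (12 protomers, each with a globin dodecamer and a linker heterotrimer).

```python
import math

def rot_z(deg):
    """Rotation by `deg` degrees about the six-fold (z) axis."""
    t = math.radians(deg)
    c, s = math.cos(t), math.sin(t)
    return ((c, -s, 0.0), (s, c, 0.0), (0.0, 0.0, 1.0))

def dyad_in_plane(deg):
    """180-degree rotation about an in-plane axis at `deg` degrees from x.

    For a unit axis u this is R = 2*u*u^T - I.
    """
    t = math.radians(deg)
    c2, s2 = math.cos(2 * t), math.sin(2 * t)
    return ((c2, s2, 0.0), (s2, -c2, 0.0), (0.0, 0.0, -1.0))

def matmul(a, b):
    """3x3 matrix product."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3))
        for i in range(3)
    )

def key(m):
    """Rounded, hashable form of a matrix for set membership tests."""
    return tuple(round(x, 6) for row in m for x in row)

def d6_operators():
    """Six rotations about z plus six in-plane dyads spaced every 30 degrees."""
    ops = [rot_z(60 * k) for k in range(6)]
    ops += [dyad_in_plane(30 * k) for k in range(6)]
    return ops

def is_closed(ops):
    """True if the product of any two operators is again in the set."""
    keys = {key(m) for m in ops}
    return all(key(matmul(a, b)) in keys for a in ops for b in ops)

PROTOMERS = len(d6_operators())      # one protomer per symmetry operator
GLOBINS_PER_PROTOMER = 12            # globin dodecamer
LINKERS_PER_PROTOMER = 3             # L1, L2, L3 heterotrimer

if __name__ == "__main__":
    print("operators:", PROTOMERS)
    print("globins:", PROTOMERS * GLOBINS_PER_PROTOMER)
    print("linkers:", PROTOMERS * LINKERS_PER_PROTOMER)
```

Under these assumptions the six in-plane dyads fall every 30°, consistent with the alternating Q and P axes described in the text.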
Structural fitting. Since it is difficult to interpret the entire map, we extracted the protomer from the complex (Fig. 2a and 2b) and rigid-body docked the CO-binding form of the L. terrestris Hb crystal structure into it. Because the rigid-body docking is performed using the entire protomer, the coiled coil domain matches well; however, significant discrepancies are observed in the globin subunits and the β-barrel domains. The overall cross-correlation coefficient is 0.70. The discrepancies between the O2-binding cryo-EM map and the CO-binding crystal structure suggest a conformational change induced by different ligand binding. To analyze the domain movements more accurately, we used the flexible fitting protocol Flex-EM 16 to dock the crystal structure into our EM map. This method maintains the connectivity between the domains and optimizes the position and orientation of each defined rigid body segment simultaneously 17 . The model for the protomer fits tightly into the density map (see Fig. 2c and 2d), except for some loop regions. The most remarkable discrepancies involve the extra densities in the N-terminal coiled coil domain. Compared to the crystal structure, the L1 linker chain is N-terminally extended by a unique tail domain (Fig. 2d), which finally forms a bracelet structure around the center of the assembled complex.
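As an illustration of the fit score quoted above, the following minimal sketch computes a normalized cross-correlation coefficient between two density maps sampled on the same grid and flattened to lists. It assumes the mean-subtracted (Pearson-type) definition; the paper does not state which variant was used, and the function name is hypothetical.

```python
import math

def cross_correlation(map_a, map_b):
    """Mean-subtracted cross-correlation of two equally sampled density maps.

    Both maps are flat sequences of voxel values on the same grid.
    Returns a value in [-1, 1]; 1 means the maps agree up to scale and offset.
    """
    if len(map_a) != len(map_b) or not map_a:
        raise ValueError("maps must be non-empty and of equal length")
    n = len(map_a)
    mean_a = sum(map_a) / n
    mean_b = sum(map_b) / n
    num = sum((a - mean_a) * (b - mean_b) for a, b in zip(map_a, map_b))
    var_a = sum((a - mean_a) ** 2 for a in map_a)
    var_b = sum((b - mean_b) ** 2 for b in map_b)
    den = math.sqrt(var_a * var_b)
    if den == 0.0:
        raise ValueError("a constant map has no defined correlation")
    return num / den

if __name__ == "__main__":
    experimental = [0.0, 0.2, 0.9, 1.0, 0.3, 0.1]   # toy voxel values
    simulated = [0.1, 0.1, 0.8, 0.9, 0.5, 0.0]      # toy model density
    print(round(cross_correlation(experimental, simulated), 3))
```

In practice the model density would first be simulated from the fitted atomic coordinates at the resolution of the map; that step is outside the scope of this sketch.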
After the protomer structure was flexibly docked into the density map, the whole map was symmetrically fitted while avoiding clashes between symmetrically placed molecules. The very high similarity between the density map and the flexible-fitting structure (cross-correlation coefficient ~0.94) allowed us to construct a pseudo-atomic model of the observed hexamer in the O2-binding state. With this approach, we were able to model the conformational change induced by O2 binding. The obtained O2-bound structure of L. terrestris Hb provides a different state that enables us to investigate in detail the effect of different ligand binding. After the flexible fitting procedure, the correlation coefficient increased to 0.94 (before flexible fitting ~0.7), which implies a conformational difference between the CO- and O2-binding states. The average root-mean-square deviation (RMSD) of the overall complex between the fitted model and the CO-binding X-ray structure was 6.6 Å. Figure 3 shows the comparison of the L. terrestris Hb models in the O2-bound and CO-bound states. Upon binding of the O2 molecules, each of the protomers moves outward along the quasi 3-fold axis, causing a radial expansion of the hexagonal bilayer complex. The fitted pseudo-atomic model has a diameter of 301 Å and a height of 193 Å, while the CO-bound state has a 288 Å diameter and 186 Å height. On the inside, the cavity of the hexamer after O2 binding is 47 Å in diameter and remains almost unchanged. Even though the inside cavity diameter of the CO-bound crystal structure still matches that of the O2-bound complex, the outside diameter is 13 Å narrower in the CO-binding state.
Each of the 12 protomers is composed of a globin dodecamer which binds to the head of a linker heterotrimer. The twelve globin subunits (abcd)3 in each protomer appear to group into six peanut-like subunit pairs (a-d, b-c) that create a pseudo three-fold symmetry (Fig. 2a), whereby each protomer is considered to be a trimer of tetramers. The L1, L2, and L3 linker chains are held together as a trimer by disulfide bonds and strong hydrophobic interactions 7 . Each linker chain is composed of a long N-terminal α helix, a low-density lipoprotein receptor (LDLR)-like domain 18 , and a typical eight-stranded β-barrel domain 7 . A nonhelical region divides the N-terminal α helix into a long coiled coil and a shorter one near the β-barrel domain. It was also known that linker chain L1 exhibits the longest inter-helical segment, which introduces a break in the crystal structure 7 . Thus, we refined the inter-helical loop by generating additional possible conformations 19 which fit our density map more accurately (Fig. 2c).
Central linker complex. The linker trimer has a long stalk at the center of L. terrestris Hb, formed from the triple-stranded coiled coil. The coiled coil domains provide the primary contacts between the one-twelfth protomers that form the overall hexagonal bilayer structure. At this resolution, the triple-stranded coiled coils are clearly evident. To show the arrangement of the 36 linker chains at the core of the complex, we masked out the globin subunits and a portion of the linker chains (Fig. 4a). We found that the two hexagonal rings are staggered, so that the stalks appear curved between the top and bottom rings when viewed from the Q dyad (Fig. 4b). The triple-stranded stalks, which project from the outside toward the main plane of the central linker complex (Fig. 4c), are arranged in an alternating manner. The angle between the top and bottom stalks is approximately 45° (Fig. 4b). Interestingly, our map also reveals additional densities at the N termini of the linker chains, which make the central cavity much smaller than that of the X-ray structure. The extra density forms a bracelet structure which connects all the coiled coil domains of the L1 and L2 linker chains in the assembled complex (Fig. 4a). These features are totally absent in the X-ray structure for unknown reasons 4 . The interactions formed by the additional electron densities observed in our cryo-EM map might be the forces that make the complex more compact and stable. The interdigitation between the extra density and the L1 and L2 coiled coils can be appreciated from the unwrapped planar map (Fig. 4a). As shown in the unrolled central linker complex, the extra density appears as a continuous sinusoid, which was presented as six sinusoidal pillars in the 14.9 Å cryo-EM structure 4 . To further investigate the changes that occurred in the coiled coil domain, we compared the extracted coiled coil structures from the O2-bound and CO-bound states.
By superimposing their structures, the local conformational changes within the subunit can be examined. One apparent structural change upon oxygenation is in the short coiled coil of L1 linker chain (see Supplementary Fig. 2b). The short coiled coil of L1 tilts toward the central plane with the hinge point near the inter-helical loop. The conformations of L2 and the long coiled coil of L1 which were connected by the sinusoidal bracelet are almost unchanged.
Interactions between protomers. Apart from the primary contacts made by the coiled coil domains, two distinct types of contact, inter-ring and intra-ring, are formed between neighboring protomers. The most extensive inter-ring contacts occur along the Q-dyad. As shown in Fig. 5a, the β-barrel domains of the L1 linker chains from two protomers pack together at the Q-dyad. The intra-ring contacts include interactions between globin dodecamers and the β-barrel domains of the L2 and L3 linker chains (Fig. 5b). The two hexagonal layers of L. terrestris Hb are partially staggered such that the architecture is more compact than that of the type II Hb 5 . Despite the closer packing of the two hexagonal layers, the globin subunits do not have direct inter-ring interactions. The β-barrel domain of the L1 linker chain provides the only inter-ring contacts, which could have an important role in coordinating inter-ring allostery (Supplementary Fig. 2a).
Conformational change in the heme pocket. The extracellular L. terrestris Hb comprises four unique heme-containing subunits, a, b, c, and d, in equal proportions. The four globin subunits exhibit the standard myoglobin fold, with seven α helices designated A through H (no D-helix) 8 . The heme group is contained between the E and F helices. Assembly of the globin dodecamer is dictated by four unique interfacial contacts, including two distinct EF dimer interfaces, one intra-tetramer interface and one inter-tetramer interface 8 . EF dimer pairing has been observed in all cooperative invertebrate hemoglobins to date 20 , in which an extensive dimeric interface forms from contacts involving the heme-containing E and F helices. To find the tertiary changes that occurred in the heme pocket, we compared each globin subunit from the O2-bound and CO-bound states. Their structures were superimposed for comparison and analysis (Fig. 6). In this way, we can examine the local tertiary changes within the subunit, independently of the quaternary movement.
The RMSD over each entire globin domain is small (less than 1 Å), which implies that most of the residues match well in the CO- and O2-binding states. However, the RMSD increases from the beginning of the F helix and has a local maximum of ~3.1 Å near the proximal histidine. In our results, one remarkable tertiary conformational change upon O2 binding is in helix F, which contains the proximal histidine that interacts with the heme group (Supplementary Movie 2). By interacting with the heme iron atom, helix F tilts upward from the heme plane upon O2 binding. Since we cannot determine the location of the heme group from the cryo-EM map at this resolution, the heme group is positioned relative to the CO-bound crystal structure. Interestingly, the tilt of helix F only occurs in subunit b and subunit d. In contrast, subunit a and subunit c show no obvious tertiary change between the two states.
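The per-residue comparison described above can be sketched as follows. The example assumes that the two globin structures have already been superimposed and reduced to one representative atom per residue (for instance Cα positions); the coordinates are toy values, not data from the paper, and the function names are hypothetical.

```python
import math

def per_residue_deviation(coords_a, coords_b):
    """Distance in Å between matched residues of two superimposed structures."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("structures must have the same, non-zero residue count")
    return [math.dist(p, q) for p, q in zip(coords_a, coords_b)]

def rmsd(coords_a, coords_b):
    """Root-mean-square deviation over all matched residues."""
    dev = per_residue_deviation(coords_a, coords_b)
    return math.sqrt(sum(d * d for d in dev) / len(dev))

def largest_deviation(coords_a, coords_b):
    """Index and size of the largest per-residue deviation."""
    dev = per_residue_deviation(coords_a, coords_b)
    idx = max(range(len(dev)), key=dev.__getitem__)
    return idx, dev[idx]

if __name__ == "__main__":
    # Toy coordinates: four residues, the third one displaced by 3 Å.
    state_co = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
    state_o2 = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 3.0), (11.4, 0.0, 0.0)]
    print(rmsd(state_co, state_o2))               # 1.5
    print(largest_deviation(state_co, state_o2))  # (2, 3.0)
```

A low overall RMSD can therefore coexist with a pronounced local maximum, which is the pattern reported for the F helix.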

Discussion
The focus of our research was to investigate the structural transitions induced by different ligand binding to the giant hexagonal-bilayer Hb of L. terrestris. The combination of the cryo-EM structure and the flexible fitting technique provides a good means for studying the tertiary and quaternary structure in a specific functional state. Earlier resonance Raman spectra 10 and small-angle X-ray scattering studies 11 suggested that the giant L. terrestris Hb lacks ligand-binding-induced tertiary and quaternary changes. On the contrary, our results indicate that the ligand-induced structural changes of L. terrestris Hb involve both tertiary and quaternary allostery, which may account for the unusually high cooperativity of O2 binding. Since the resonance Raman observations reflect the average behavior of all the heme pockets, it cannot be ruled out that a small number of heme environments undergo ligand-induced conformational change. Indeed, our results show that only the heme pockets in globin b and globin d undergo tertiary changes. The distinctive feature of these two subunits could be due to the tryptophan at position B10 that reaches into the heme binding pocket, making van der Waals contacts with the gas ligand 8 . It is reasonable to attribute the tertiary changes to the interaction between the tryptophan and the gas ligand. The structures shown indicate that the proximal histidine becomes detached from the heme iron. The proximal histidine in the F helix 21 is a strictly conserved residue in hemoglobin which helps to coordinate the heme iron. In fact, it is the only point of direct interaction between the heme and the protein 8 . The pioneering studies of Max Perutz highlighted the role of the proximal histidine in regulating the oxygen affinity by an allosteric effect 21,22 . According to Perutz's model, the proximal histidine comes closer to the heme plane in oxy- than in deoxyhemoglobin 23,24 .
Geometric analysis and Monte Carlo simulation indicate that the oxygenated conformation lies along the pathway between the deoxy and CO-binding states 25,26 . Moreover, analysis of Hb structures in the Protein Data Bank also reveals that the reaction pathway is T (deoxy) - R (O2 binding) - R2 (CO binding) 27,28 . Therefore, the movement of the F helix in our data is what would be expected as a difference between oxygenated and CO-liganded hemoglobins. The allosteric signal is transmitted through the heme iron to the proximal histidine, causing the tilting of helix F in globins b and d away from the heme plane. The structural transitions found here can explain the unusually high cooperativity of O2 binding in the L. terrestris Hb, which has long been an unsolved mystery. Our results show that the overall cooperativity of a hexagonal bilayer complex is caused by the plurality of subunit-subunit interactions in this assemblage, which is quite different from those of the vertebrate and other invertebrate Hbs. The earthworm Hb requires assembly of multiple copies of four distinct globin subunits, while the higher vertebrates invariably display the α2β2 tetrameric form 20 . The retention of only partial cooperativity by incomplete assemblies 29,30 implies that the full cooperativity relies on the presence of multiple subunit-subunit interactions in the hierarchy of the assembly. Besides the tertiary and quaternary allostery in the heme pocket, there is an alteration of the overall size of the complex. Upon oxygen binding, the allosteric signal is transmitted through the heme iron to the proximal histidine, causing the tilting of helix F. In the next step, the allosteric signal is transmitted to the linker via the most extensive contacts, which involve the β-barrel and LDLR-like domains 7 . One interesting interaction occurs between globin b and the LDLR-like domain.
The Arg B11 from the globin b subunit reaches into the LDLR-like domain to make very favorable interactions with Asp 88 7 (L1 numbering). The involvement of residue B11, next to B10 in the binding pocket, suggests that this contact could have implications for the allosteric regulation. Finally, the short helix of L1 tilts outward around the flexible hinge (inter-helical loop) (Supplementary Fig. 2b), and the β-barrel domain is pushed outward, causing the radial expansion of the whole complex (Supplementary Movie 1). The oxygenation causes a radial expansion of the hexagonal-bilayer complex while maintaining the rigidity of the central linker complex. Considering the error margins of fitting (~4 Å), there is still a ~9 Å radial expansion of the whole hemoglobin complex between our cryo-EM data (301 Å) in the O2-bound state and the known crystal structure (288 Å) in the CO-bound state 31 . This global subunit movement is found to be accomplished by the allosteric communications between intra-ring and inter-ring contacts. The intra-ring interactions primarily involve main chain hydrogen bonding between the β-barrel domains of L2 and L3 7 , while the inter-ring contacts involve the extensive packing of the β-barrel domains of L1 as seen in Fig. 5a. This emphasizes the critical role of linker chains in the allosteric behavior exhibited by this macromolecule. One interesting feature is the longest inter-helical segment of L1, which serves as a flexible hinge region (Supplementary Fig. 2b) and allows structural transitions of the short coiled coil relative to the longer one. The allosteric role of linker chains observed in our results provides a reasonable explanation for the smaller cooperativity of the dodecamer 11 and Riftia Hb 32 without linker chains.
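The size comparison in the text reduces to simple arithmetic, reproduced here as a small sketch. All input values are those quoted in the paper (outer diameters, heights, and the approximate fitting error); the variable names are illustrative, and subtracting the fitting error from the diameter difference is our reading of how the ~9 Å figure was obtained.

```python
# Dimensions quoted in the text, in Å.
DIAMETER_O2 = 301.0   # fitted pseudo-atomic model, O2-bound state
DIAMETER_CO = 288.0   # crystal structure, CO-bound state
HEIGHT_O2 = 193.0
HEIGHT_CO = 186.0
FIT_ERROR = 4.0       # approximate error margin of the fitting

def difference(o2_value, co_value):
    """Change in a dimension on going from the CO-bound to the O2-bound state."""
    return o2_value - co_value

diameter_change = difference(DIAMETER_O2, DIAMETER_CO)   # 13.0 Å
height_change = difference(HEIGHT_O2, HEIGHT_CO)         # 7.0 Å
conservative_change = diameter_change - FIT_ERROR        # 9.0 Å
relative_change = diameter_change / DIAMETER_CO          # fraction of the CO-bound diameter

if __name__ == "__main__":
    print(f"diameter change: {diameter_change:.1f} Å")
    print(f"height change: {height_change:.1f} Å")
    print(f"diameter change beyond fitting error: {conservative_change:.1f} Å")
    print(f"relative diameter change: {100 * relative_change:.1f} %")
```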
Another unanswered issue in the annelid Hb literature concerns the presence of the additional densities in the central linker complex. The so-called ''bracelet model'' 13 was proposed to explain the assembly of the quaternary structure by the central bracelet. Some of the early publications on L. terrestris Hb 6,12,15,33 provided some evidence of visible density in the center of the Hb macromolecule. The 3.5 Å resolution crystal structure revealed the interdigitation of 12 triple-stranded coiled coils near the center of the complex. The smaller central cavity and the sinusoidal pillars observed in the 14.9 Å cryo-EM structure 4 gave further support for the additional densities, which were absent in the crystal structure for unknown reasons. These findings, however, did not close the discussion because no detailed and complete structural evidence had been provided so far. Our 9.1 Å resolution cryo-EM structure clearly reveals the details of the additional sinusoidal bracelet (Supplementary Fig. 3), which further completes the structural interdigitation between the 12 triple-stranded coiled coils. Moreover, the two smallest RMSD values, in the L2 (2.25 Å) and L1 (2.52 Å) linker chains, between the fitted model and the X-ray structure strongly suggest that the extra densities play a structural role in reinforcing the central linker complex. The absolute requirement of the L1 linker chain for the Hb assemblage and the interchangeability between L1 and L2 33 also support the interdigitation between the extra densities (Fig. 4a) and the L1 and L2 linker chains.
This study yields important implications for the allosteric mechanisms of the giant hexagonal bilayer Hb based on structural comparison and analysis. The tertiary and quaternary allostery and the allosteric communications facilitated by intra-ring and inter-ring contacts underlie the requirement for distinct hemoglobin sequences and the non-heme linker chains. The flexibility of the β-barrel domain and the rigidity of the central linker complex, strengthened by the extra bracelet, clarify the dual functions of the linker chains. In addition, the radial expansion in the oxygenated state of the L. terrestris Hb complex was confirmed and analyzed by cryo-EM and the flexible fitting technique, which are good means to register protein dynamics in a native environment. It remains to be seen what constituents could account for the additional sinusoidal densities. One possibility is that the additional densities correspond to the N-terminal residues of the L1 and L2 linkers, which were not observed in the crystal structure. Structural analysis at higher resolution will be required to elucidate how the allosteric units drive global movement, and how the structural transitions alter the O2 binding affinity. The determination of the corresponding structures at specific functional states is a matter of vital importance.

Methods
Hemoglobin purification. The common earthworm L. terrestris was purchased from Carolina Biological Supply Co. (Burlington, NC). Live earthworms were washed in distilled water to remove dirt and mucus, dried off, and cut near the seventh body segment. The blood was collected via capillary tubes into a centrifuge tube containing 0.1 M Tris-HCl buffer, pH 7.0, 1 mM EDTA. The collected blood was centrifuged at 20,000 × g for 0.5 hr at 4 degrees Celsius to remove any cell debris and particulate matter. The supernatant was then centrifuged at 150,000 × g for 2 hr at 4 degrees Celsius. The red pellet was dissolved in the Tris-HCl buffer and centrifuged at 150,000 × g again. The final pellet was dissolved in 0.1 M Tris-HCl buffer, pH 7.0, 1 mM EDTA, 2 mM phenylmethanesulfonyl fluoride (PMSF) 34 .
Sample preparation for electron microscopy analysis. For cryo-EM, the Hb sample was dialyzed against a 50 mM Tris-HCl buffer, pH 7.2, 10 mM CaCl2, 10 mM MgCl2 to a concentration of 2 mg/ml. Approximately 3.5 μl of the sample dilution was applied onto the holey carbon side of each freshly glow-discharged Quantifoil copper grid (R2/2, Quantifoil Micro Tools GmbH, Jena, Germany). After blotting at room temperature for 3.5 s at 100% humidity in the Vitrobot (FEI, Netherlands), the grid was rapidly plunged into liquid ethane cooled by liquid nitrogen.
Electron microscopy. Grids of frozen hydrated samples were transferred into the electron microscope by using the Gatan 626 cryo-transfer system (Gatan, USA). The cryo specimens were imaged in a Tecnai F20 (FEI, Netherlands) electron microscope operated at an acceleration voltage of 200 kV. Low dose images were recorded at a nominal magnification of 80,000× using a Gatan UltraScan 4000 4k × 4k CCD camera (Gatan, USA) by an automated data acquisition system, Leginon 35 .
Image processing and 3D reconstruction. The quality of the CCD images was assessed from their power spectra. Micrographs with noticeable drift, charging or astigmatism were discarded. The defocus and astigmatism parameters of each micrograph were determined using the program CTFFIND3 36 , and the contrast transfer function (CTF) correction was done with the module CTF2D-FLIP from IMAGIC 37 . All particles free of contaminants and aggregates were semiautomatically selected using the EMAN2 tool e2boxer 38 . Approximately 6,000 particles were selected from 250 micrographs. Particles were then extracted into boxes of 400 × 400 pixels, band-pass filtered between 200 and 3 Å, and normalized to a constant mean and standard deviation using IMAGIC. To increase the speed of the alignment process, the pretreated particles were first coarsened to 200 × 200 pixels and band-pass filtered between 200 and 10 Å. Particles were translationally centered to the total sum of the dataset and classified by multivariate statistical analysis (MSA) in IMAGIC with approximately 7 images per class. The ten best class averages with characteristic views were selected as references for multi-reference alignment (MRA). After three rounds of the MSA/MRA cycle, an initial map was calculated from class averages by angular reconstitution with D6 symmetry imposed. Reprojections of the initial model were used as an anchor set for angular refinement; after angular reconstitution, a new 3D model was calculated. The angular refinement process was then repeated iteratively until the Euler angles stabilized. Subsequently, particle orientations were refined by several cycles of MRA, MSA and angular reconstitution until a reasonable convergence was achieved. Finally, the alignment parameters were refined on the un-coarsened dataset, including the high-frequency terms, by the same iterative cycle.
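The coarsening step mentioned above (400 × 400 to 200 × 200 pixels) can be illustrated with a minimal sketch. It assumes coarsening by averaging non-overlapping 2 × 2 pixel blocks; the text does not say which method IMAGIC applied, so this is only one plausible reading, and the function name is hypothetical.

```python
def bin2x2(image):
    """Halve both image dimensions by averaging non-overlapping 2x2 blocks.

    `image` is a list of equally long rows of pixel values with even
    height and width.
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("image dimensions must be even")
    return [
        [
            (image[r][c] + image[r][c + 1]
             + image[r + 1][c] + image[r + 1][c + 1]) / 4.0
            for c in range(0, cols, 2)
        ]
        for r in range(0, rows, 2)
    ]

if __name__ == "__main__":
    # A synthetic 400 x 400 "particle" whose pixel value is row + column.
    particle = [[float(r + c) for c in range(400)] for r in range(400)]
    coarse = bin2x2(particle)
    print(len(coarse), len(coarse[0]))   # 200 200
```

Halving the sampling doubles the pixel size, which is compatible with the coarser 10 Å low-pass limit used on the reduced images, and it cuts the number of pixels per particle by a factor of four.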
For the 3D reconstruction reported here, the reconstructed map was further refined by twelve cycles of the projection matching procedure implemented in IMAGIC with a progressively decreased angular increment (with a final value of only 1°). After each competitive projection matching, particles were classified into different defocus groups and the best 75% of the particles in each group (according to their cross-correlation coefficients) were selected for 3D reconstruction. The resolution of the structure was assessed according to the Fourier shell correlation (FSC) of two maps calculated separately from two halves of the dataset.
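Reading a resolution off an FSC curve can be sketched as below. The example assumes the 0.143 threshold that is commonly paired with the gold standard criterion; the paper names the criterion but does not state the threshold, so this value is an assumption. The curve in the example is synthetic and is not the curve from Supplementary Fig. 1.

```python
def resolution_at_threshold(freqs, fsc, threshold=0.143):
    """Resolution (Å) where the FSC curve first drops below `threshold`.

    `freqs` are spatial frequencies in 1/Å in ascending order and `fsc`
    the matching correlation values. The crossing point is found by
    linear interpolation between neighboring shells. Returns None if the
    curve never drops below the threshold.
    """
    if len(freqs) != len(fsc):
        raise ValueError("freqs and fsc must have the same length")
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= threshold > fsc[i]:
            frac = (fsc[i - 1] - threshold) / (fsc[i - 1] - fsc[i])
            crossing = freqs[i - 1] + frac * (freqs[i] - freqs[i - 1])
            return 1.0 / crossing
    return None

if __name__ == "__main__":
    # Synthetic curve: crosses 0.143 halfway between 0.10 and 0.15 1/Å.
    freqs = [0.05, 0.10, 0.15, 0.20]
    fsc = [0.90, 0.243, 0.043, 0.01]
    print(resolution_at_threshold(freqs, fsc))
```

For the synthetic curve the crossing falls at 0.125 1/Å, corresponding to a resolution of about 8 Å.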
Structure analysis and EM density fitting. The 3D volume was analyzed, segmented and visualized with UCSF Chimera 39 . The protomer of the CO-binding crystal structure (PDB code: 2GTL) was first rigid-body fitted into the segmented cryo-EM map with Chimera. The conformation was then refined by the simulated annealing molecular dynamics optimization protocol embedded in the Flex-EM software 16 . During the flexible fitting process, the secondary structural elements detected by RIBFIND 17 were displaced in the direction that maximized their cross-correlation with the cryo-EM density map. After the protomer structure was flexibly fitted into the cryo-EM density map, the whole map was symmetrically fitted with UCSF Chimera while avoiding clashes between neighboring subunits.