Crystallization of a human galectin-3 variant with two ordered segments in the shortened N-terminal tail

Flores-Ibarra, Andrea; Vértesy, Sabine; Medrano, Francisco J.; Gabius, Hans-Joachim; Romero, Antonio

doi:10.1038/s41598-018-28235-x

Download PDF

Article
Open access
Published: 29 June 2018

Crystallization of a human galectin-3 variant with two ordered segments in the shortened N-terminal tail

Andrea Flores-Ibarra¹^na1,
Sabine Vértesy²^na1,
Francisco J. Medrano¹,
Hans-Joachim Gabius² &
…
Antonio Romero ORCID: orcid.org/0000-0002-6990-6973¹

Scientific Reports volume 8, Article number: 9835 (2018) Cite this article

3678 Accesses
37 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Among members of the family of adhesion/growth-regulatory galectins, galectin-3 (Gal-3) bears a unique modular architecture. A N-terminal tail (NT) consisting of the N-terminal segment (NTS) and nine collagen-like repeats is linked to the canonical lectin domain. In contrast to bivalent proto- and tandem-repeat-type galectins, Gal-3 is monomeric in solution, capable to self-associate in the presence of bi- to multivalent ligands, and the NTS is involved in cellular compartmentalization. Since no crystallographic information on Gal-3 beyond the lectin domain is available, we used a shortened variant with NTS and repeats VII-IX. This protein crystallized as tetramers with contacts between the lectin domains. The region from Tyr101 (in repeat IX) to Leu114 (in the CRD) formed a hairpin. The NTS extends the canonical β-sheet of F1-F5 strands with two new β-strands on the F face. Together, crystallographic and SAXS data reveal a mode of intramolecular structure building involving the highly flexible Gal-3’s NT.

The choanoflagellate pore-forming lectin SaroL-1 punches holes in cancer cells by targeting the tumor-related glycosphingolipid Gb3

Article Open access 12 September 2022

The dynamic nature of netrin-1 and the structural basis for glycosaminoglycan fragment-induced filament formation

Article Open access 03 March 2023

Intrinsic tryptophan fluorescence spectroscopy reliably determines galectin-ligand interactions

Article Open access 14 August 2019

Introduction

The functional pairing of cellular glycoconjugates with tissue lectins is giving the unsurpassed structural variability of lipid/protein-linked glycans a physiological meaning¹. In fact, reading glycan-encoded messages by these endogenous effectors appears to underlie a wide array of cellular activities². At the molecular level, this pairing has a remarkably low degree of promiscuity, meaning that exclusively distinct glycoconjugates become binding partners for a particular endogenous lectin. This specificity for contact formation and of the nature of the ensuing trigger mechanisms that e.g. result in growth regulation does not only depend on mutual recognition of complementary binding sites. In addition, (i) the topological mode of glycan presentation, (ii) the lectin’s architecture and (iii) the arrangement of aggregates are emerging as factors that can contribute to the precision of the final outcome. Since respective protein families such as the C-type lectins and the ga(lactose-binding) lectins are organized in groups differing in the modular display around a common carbohydrate recognition domain (CRD)^3,4 a fundamental importance of this type of protein design for the lectins’ activity profile can be postulated. Consequently, this hypothesis has invigorated the efforts to achieve complete structural characterization of all proteins within a family.

Looking at vertebrate adhesion/growth-regulatory galectins, three types of protein design are known: (i) the non-covalently associated homodimer (proto type), (ii) the heterodimer connected by a linker (tandem-repeat type) and (iii) the trimodular chimera, uniquely represented by galectin-3 (Gal-3)^3,5,6. In the case of human Gal-3, the 21-amino-acid-long N-terminal stretch (NTS) with its two sites for serine phosphorylation is followed by nine non-triple-helical collagen-like Pro/Gly-rich repeats (I-IX), which harbor cleavage sites for diverse proteases⁷. The NTS and the nine repeats form the N-terminal tail (NT). In full-length Gal-3, it is connected to the C-terminal CRD that harbors two sites for c-Abl kinase-dependent tyrosine phosphorylation⁷. These three sections, i.e. NTS, collagen-like repeats and CRD (for sequences, see Supplementary Fig. S1), likely cooperate in a not yet clearly defined manner to account for this protein’s special role within the galectin network.

Inside the cell, Gal-3 can shuttle between cytoplasm and nucleus, a pathway involving import and export signals at the CRD^8,9,10. Transport to late endosomes critically depends on the NT¹¹. Serine phosphorylation in the NTS favors departure from the nucleus¹². Secretion to the extracellular environment proceeds via a non-classical pathway, which appears to involve the NTS and sections of the collagen-like repeat region^13,14,15,16. In solution, Gal-3 is monomeric, unless high concentrations are reached or bi- to multivalent ligands are present that serve as core for aggregation^{17,18,19,20,21,22,23,24,25,26,27}. An active role in this intermolecular assembly has been attributed, in varying extents, to both the NT and the CRD^28,29,30. NMR Spectroscopy data “clearly indicate that the recombinant NT… is unstructured in solution and exists as an interconverting mixture of conformations”, with evidence for “significantly reduced mobility values” in the NT region proximal to the CRD³¹. Transient intramolecular interactions occur, too. Their presence was first observed by NMR spectroscopy due to a shielding of nuclei³¹ and by electrospray ionization mass spectrometry due to a bimodal charge distribution³². Thereafter, NMR spectroscopy based on full-assignment work^33,34,35 and studies of binding of Gal-3-derived peptides to the ¹⁵N-labeled CRD³⁶ extended the respective body of evidence. Obviously, the NT is not an inert appendix of the CRD what explains the high interest in elucidating structural aspects of this region of the protein. In the words of a recent study, the status of our understanding of Gal-3 has been summarized that “functional multivalency therefore is somewhat of a mystery”³⁵. For this report, we have extended crystallographic analysis beyond the CRD despite the NT’s inherent dynamics by working with a variant with shortened length of this part.

Up to now, crystallographic data are available only for a truncated protein starting at Leu¹¹⁴ as monomer³⁷. No insights into properties of structurally ordered segments of the NT or a mode of CRD aggregation have so far been reported, most likely due to inherent flexibility of the full-length protein. Therefore, new routes were needed to address this issue. The stepwise deletion of collagen-like repeats from the tail by genetic engineering³⁸ is a means to reduce this impediment, and, indeed, crystallization of a shortened Gal-3 variant has recently been achieved³⁹. However, X-ray diffraction at 3.3 Å was insufficient for structural resolution of any part of the NT or oligomers so that further work on obtaining high-quality crystals had to be performed. Hereby, the detailed crystallographic data of ordered regions within the NT of the Gal-3[NTS/VII-IX] protein (for sequence details, please see Supplementary Fig. S1) became possible. This accomplishment led to a structural model that was set in relation to the experimentally determined shape in comparative small angle X-ray scattering (SAXS) studies of this protein, together with full-length Gal-3 and the variant with repeats IV-IX in its NT (Gal-3[NTS/IV-IX]). When combined, these investigations disclosed a conformation of two regions of the highly flexible NT seen in the crystal of Gal-3[NTS/VII-IX] and the overall shape of the three proteins in solution. Furthermore, the obtained data enabled us to suggest a molecular pattern of contacts favoring the formation of aggregates of Gal-3 into a tetramer under the given conditions driven by CRD-CRD interactions.

Results

Overall Folding and Quaternary Association of the Gal-3 Variant

Crystallographic data of Gal-3[NTS/VII-IX] at 2.2 Å resolution indicated the presence of 12 monomers in the asymmetric unit (see Supplementary Table S1 for data collection and refinement statistics). These monomers are arranged as three tetramers related by two-fold non-crystallographic symmetry axes, as can be seen in the self-rotation function previously reported³⁹. At the level of the monomer, the overall folding of the CRD maintains the typical β-sandwich topology of two β-sheets constituted by the antiparallel S1-S6/F1-F5 strands (Fig. 1). Lactose binds to the canonical site of each monomer (Fig. 2a).

The CRD is also the platform for homotypic aggregation. In this tetrameric arrangement, the monomers come into relative vicinity via their CRDs. Each tetramer (ABCD) is constituted by two dimers (AB and CD). The monomers of each dimer face each other by the concave surface of the CRDs, while the dimers interact by the side of the β-sandwich opposite to the NT and rotated around 90° respective to each other. The interactions at the monomer-monomer interface in each dimer (A to B and C to D) and at the dimer-dimer interface (AB and CD) are mainly hydrogen bonds. Two sets of interactions can be defined at these interfaces. One of them occurs at the respective interface (AB and CD) in a pairwise manner between Asn143 (A) and Asn153 (B) as well as between Arg162 (A) and Asn179 (B) (likewise for the CD pair). The interactions at the lateral interfaces (AC and BD) follow a similar mode of contact formation. In detail, inter-monomer contacts within the AC (and BD) interface are between Asn166 (A) and Arg183 (C) as well as between Arg168 (A) and Arg186 (C) (Fig. 2b). Other types of interactions between monomers, i.e. by protein-carbohydrate recognition, involve hydrogen bonds between amino acids of a monomer and of lactose bound to a different subunit. This type of network bridges residues Lys176 (monomer A) and Asn179 (monomer A) to the hydroxyl groups (O2 and O3) of the lactose unit in monomer B. The same type of interplay is operative in monomers C and D. Additional cross-interactions between Glu184 of monomers C and D and the O2′ from the lactose unit in monomers A and B, respectively, stabilize the tetramer (Fig. 2c). Owing to this kind of structural arrangement a channel-like cavity inside the tetramer is generated. It appears to be suited for accommodating glycans with N-acetyllactosamine (LacNAc) repeats, such epitopes are present in chains of glycoproteins such as the Gal-3 counterreceptor laminin and of glycosphingolipids such as (neo)lactotetraosyl (LNnT, LNT) ceramide.

Beyond disclosing these structural aspects of the CRD, X-ray diffraction offered the first crystallographic insights into the NT. In fact, additional electron density could be observed for regions of the NT in three of the 12 monomers. This information opened the way to describe the structure of ordered parts of this crystallographically so far uncharacterized section of Gal-3.

Ordered Sequence Stretches in the NT

The spatial arrangement of the electron density originating from the NT indicated its assignment to two different segments of the NT rather than resulting from conformers of the same region, as graphically explained in Supplementary Fig. S2. These two sections in Gal-3’s NT were readily distinguishable. This is especially the case for the continuous electron density in one of the monomers around the start of the CRD. It could be assigned to the sequence starting at Tyr101 up to Leu114 (Supplementary Figs S1, S2a,b). In the other two instances, the electron density could unambiguously be assigned by identifying the characteristic phenyl ring of Phe5 and the imidazole ring of His8 as diagnostic indicators. Thus, the sequence from Asn4 to Pro17 in the NTS is the second source of extra electron density (Supplementary Figs S1, S2c,d). As a consequence, two parts of the NT could structurally be characterized in detail.

The obtained data revealed that the region in the terminal section of repeat IX and the beginning of the CRD adopted a β-hairpin structure (Fig. 3a). It is stabilized by hydrogen bonds between backbone atoms of sequential residues and also by an interaction of Tyr107 with His217 of a symmetry-related neighbor (Fig. 3a, Table 1). The proximal segment of the repeat section and the first part of the CRD thus can establish this hairpin.

Table 1 Hydrogen bond interactions stabilizing the N-terminal tail.

Full size table

At the N-terminus, an ordered segment is detected. The NTS is arranged in a double-stranded anti-parallel β-sheet (Fig. 3b). It is stabilized by a network of hydrogen bonds (Fig. 3b, Table 1). As consequence, the residues Phe5-His8 are constituents of the first β-strand followed by the second β-strand (Gly13-Asn16), named F–1 and F0, respectively. Their presence and the observation that the F0 strand runs anti-parallel to the carboxy-terminal F1 strand combine to explain the extension of the β-sheet of the F1-F5 β-strands (Fig. 3b). The NTS appears in the crystal structure wedged between the F-faces of two separate molecules, one of them a symmetry-related molecule. The actual conformation, these two new β-strands belonging to the monomer in the asymmetric unit and not to the symmetry-related partner, was selected because the angle formed between the contacting strands is much lower (close to 0°) compared to the other conformation (approx. 45°), hereby maximizing the contact surface area and the number of contacts between the interacting strand F0 and the first strand of the F-face (F1) (Supplementary Fig. S3). In spite of our assignment and due to the inherent flexibility of the NT, this new element could belong to the symmetry-related monomer, or much less likely, to some other monomer from the asymmetric unit or a symmetry-related molecule. In any case, it is not possible to unambiguously assign the new element to any monomer, although the given assignment is the most likely scenario.

Obviously, in this spatial arrangement, flexibility of the NTS is reduced, with implications for the presentation of substrate site(s) for serine phosphorylation (please see below). Having hereby solved the structure of two segments of the NT, this information facilitated the development of a structural model of Gal-3 with an NT, albeit truncated.

Building a Structural Model for the Gal-3 Variant

The two newly identified ordered structures of the NT, i.e. at its start and its terminus, provide essential information on how intramolecular recognition between NT and CRD can give shape to the full-length protein. When added to the CRD core, the NTS is able to introduce a novel double-stranded antiparallel β-sheet at the F-face (Fig. 4a). The segment with the hairpin is located at the S-face (Fig. 4b). Superposing these two separate structures a model could be generated. It served as a platform to include the remaining part of the NT as in a puzzle to build full-length Gal-3 (Fig. 4c). Overall, the core is composed of a seven-stranded β-sheet (F–1 to F5) and a six-stranded β-sheet (S1-S6) (Fig. 4c). Due to lack of electron density the repeats VII and VIII could not be modeled and likely exhibit flexibility.

This model with its advanced description of the NT structure enables us to examine a mode of presentation of its two sites of phosphorylation. Looking at these target sites for functional post-translational modification, Ser6 and Ser12 are readily accessible in this fixed constellation (Fig. 5a). In more detail, the pocket with Ser6 is a groove that is complementary in shape with the active region of the casein kinase 1 (CK1) (Fig. 5b). The segment comprising repeats VII and VIII not present in the crystallographic structure and modeled by hand in one fixed structure might most likely not represent the entire conformational space, but with its inherent flexibility could adopt the proper conformation to dock to CK1 or undergo a mutual conformational adaptation when the two proteins approach for catalytic phosphorylation. Notably, the same kind of inspection was possible for the sites of tyrosine phosphorylation that reside in the second ordered stretch within the start region of the CRD. Tyr107 and Tyr118 that belong to the CRD are similarly accessible to the solvent and able to be phosphorylated by c-Abl kinase when adopting this crystallographically fixed structure. A further consequence of this model is to give an evidence-based idea of the shape of the full-length Gal-3. To test the validity of the model for actual shape parameters in solution we performed SAXS experiments. In addition to this variant and the full-length Gal-3, we studied an intermediate-length variant with six of the nine collagen-like repeats, i.e. Gal-3[NTS/IV-IX] (for sequence information, please see Supplementary Fig. S1), therefore with a longer NT than Gal-3[NTS/VII-IX].

Gal-3 Shape: SAXS Analysis vs. Model

The scattering curves of the full-length Gal-3 and the two variants with differently shortened NT, i.e. Gal-3[NTS/IV-IX] and Gal-3[NTS/VII-IX], are shown in Fig. 6a,b. Explicitly, the length of the natural NT was thus reduced by deletion of either three or six repeats, whereas the presence of the NTS was maintained. The analysis of the scattering curves gave R_g values of 3.54, 2.71 and 1.69 nm as well as maximum dimension (D_max) values of 13.60, 9.48 and 7.49 nm for the three proteins, respectively. The estimation of the molecular weight is in fair agreement with the respective value of each of the three proteins (Supplementary Table S2). The shape of their distance distribution function (inset of Fig. 6a), together with D_max values, indicates that Gal-3 has an elongated shape under these conditions. In comparison, the analysis of the scattering curves (Fig. 6a), together with the Kratky plots (Fig. 6b), came up with Gal-3[NTS/VII-IX] as the most structured protein, followed by a progressive loss of compactness in the Gal-3[NTS/IV-IX] variant and then in full-length Gal-3. Ab initio models were generated from the scattering curves (Fig. 6c). They characterize the overall shape of the three proteins (Supplementary Table S2).

The alignment of the herein reported crystal structure of Gal-3[NTS/VII-IX] with the sphere obtained by these ab initio calculations, using the program SUPCOMB⁴⁰ from the ATSAS package⁴⁰, revealed additional information on the SAXS envelope. This allowed us to generate a possible model of the flexible (missing) residues in the crystallographic model. Figure 7a presents a reasonable constellation, where the crystallographic model matches the calculated shape from the SAXS-based data. Considering Gal-3 as substrate for various types of processing, the phosphorylation sites and proteolytic cleavage sites are all accessible to the solvent in this model (Fig. 7b).

Discussion

Gal-3 has a unique architecture among the members of the galectin family. It underlies this lectin’s capacity to interact with diverse types of counterreceptors (glycans and proteins)⁴¹. Moreover, this protein harbors sites for post-translational modifications and proteolytic cleavage, the latter acting as a biochemical switch for controlling its capacity for lattice formation^36,38. In contrast to the 2-fold symmetric dimer organization of most proto-type galectins and the bivalent tandem-repeat-type galectins, the trimodular design also accounts for self-aggregation in the presence of multivalent ligands. Owing to the availability of a growing number of human galectins for testing, functional analysis has moved from work with a single protein to considering their activities as a network. Intriguingly, these efforts are unveiling intense cooperation between galectins. The case of Gal-1 and -3 is a focus of current research, in terms of antagonism^42,43 and positive cooperation⁴⁴. The competition for the same counterreceptor and the disparity in structural organization of the cross-linked galectin-glycoconjugate complexes, referred to as highly organized (homogeneous) vs heterogeneous aggregates²², are assumed to have a context-dependent bearing on cellular responses to galectin binding. These far-reaching physiological implications, with positive or negative consequences on tumor growth regulation or pathogenesis of autoinflammatory disorders^42,43,44, explain the enormous interest to clarify the structure of Gal-3 beyond the CRD.

The association of Gal-3 with natural counterreceptors exhibits a different behavior relative to a proto-type galectin. When labeled Gal-3 was probed with surface (microtiter plate well or sensor chip)-immobilized glycoprotein (laminin), binding data revealed positive cooperativity, even for a hamster Gal-3 variant without amino acids 1–93 of the NT^18,45. Since the collagen-like repeats are endowed with ability for self-aggregation, as shown for example by electron microscopy of rotary shadowed protein preparations³¹ and NMR spectroscopy of ¹⁵N-labeled protein^33,35, it is clear that the NT can be a biochemical means toward oligomer formation. The same end was inferred to be reached by mutual recognition between CRDs^28,29,30,34, and our crystal structure informs us about a quaternary arrangement via CRD association up to the level of defining the underlying hydrogen-bond pattern. That the nature of the CRD matters to yield aggregation has recently been documented⁴⁶.

In this study, we have been able to obtain crystallized sections of Gal-3 beyond the CRD and to define their structures. The highly dynamic structure, as seen in NMR-based analysis^31,33,34,35, can thus adopt conformers that limit the enormous flexibility in solution in distinct sections, allowing crystallization. Evidence for such a restricted conversion between conformers had first been traced in hamster Gal-3 for the region composed of NT and CRD segments³¹. In detail, two parts of the NT could be structurally characterized. This new information enabled us to see the entire CRD, a part of repeat IX and nearly all the NTS, which has so far not been characterized by crystallographic analysis of human Gal-3^37,47,48. These data provide details on the nature of interactions underlying transient contacts between the CRD and either the NTS or a part of repeat IX. Intramolecular contacts of the collagen-like repeats with the F-face had been inferred to occur previously by NMR spectroscopy of unlabeled and of isotopically labeled full-length and fully truncated Gal-3^31,34,35,36.

The information on both regions here identified by crystallography, when implemented into a structural model of Gal-3, offers the opportunity to construct an evidence-based model of full-length Gal-3. The documented possibility for a conformational stabilization of the NTS may be beneficial for its role in cellular compartmentalization¹⁵ and also for presentation of the two sites for serine phosphorylation. The generated extension of the β-sheet (F–1 and F0 strands) produces a remarkable degree of organization by the interaction of the NTS with the CRD, rationalizing the occurrence of a compact form of Gal-3. Clearly, studies on the other two vertebrate galectins with a N-terminal addition to the CRD, i.e. rat Gal-5 and galectin-related protein^5,49,50, are now warranted to reveal whether such comparatively short N-terminal extensions will also interact with the CRD.

When targeting natural glycans, Gal-3 has a high affinity for polyLacNAc chains^51,52. Crystal structural analysis of Gal-3 in complex with two respective tetrasaccharides (LNT, LNnT) revealed association to the reducing-end galactose unit that explains why Gal-3 can bind to α2,6-sialylated LacNAc oligomers^53,54. As Fig. 8 illustrates, such a LacNAc-based tetra- or hexasaccharide may act like a string for arraying CRDs. Since contact to the LNnT tetrasaccharide was reported to remove the NT “from the CRD by competition, triggering the release of this N-terminal domain”³³, LacNAc repeats can favor self-association via the NT and also via the CRD’s F-face³⁴, both now fully accessible. A cooperation of these two mechanisms and the oligomer arrangement described herein will likely let Gal-3 acquire the ability to generate more than one topological type of aggregate structure.

In summary, having applied the approach of engineering Gal-3 variants with stepwise-shortened NT³⁸, we describe here the first crystallographic evidence of a quaternary structure of a Gal-3 protein stabilized by CRD-CRD contacts. Additionally, a structural representation of two segments of the NT could be defined by crystallographic analysis aided by crystallographic contacts counteracting the high-level flexibility in solution. This feat encourages further work with variant Gal-3 proteins with different NT lengths to relate changes in this parameter to function, in the quest to crack the sugar code⁵⁵. The strategically combined study of hybrids constituted by the NT and a CRD different from that of Gal-3 such as the recently engineered Gal-3NT/8 N protein⁵⁶ and of further glycan ligands such as LacNAc oligomers or 3′-sulfated Lac, which strongly induced glycodendrimersome aggregation by Gal-3⁴⁶, will help to dissect the contributions of the two parts of Gal-3, i.e. NT and CRD, to self-association.

Methods

Proteins

Full-length Gal-3 and its two variants with a stepwise truncated NT were obtained by recombinant production in E. coli BL21 (DE3)-pLysS cells (Promega, Mannheim, Germany) using pET24a plasmid (Novogen, Darmstadt, Germany), purified by affinity chromatography on lactose-bearing Sepharose 4B obtained by conjugation of ligand to divinyl sulfon-activated resin, then precipitated by addition of ammonium sulfate up to 80% saturation and processed further as given in detail previously³⁹. Assessment of molecular integrity and purity by one- and two-dimensional gel electrophoresis, gel filtration and mass spectrometry were done as described³⁸.

Crystallization of Gal-3[NTS/VII-IX]

Crystallization trials were performed at 295 K using the sitting-drop vapor-diffusion method with commercial screening solutions including JBScreen Classic (Jena Bioscience, Jena, Germany), Wizard Classics I–III (Emerald Bio, Bainbridge Island, USA) and Index (Hampton Research, Aliso Viejo, USA) in 96-well sitting-drop plates (Swissci MRC; Molecular Dimensions, Suffolk, England). Drops were set up by mixing equal volumes (0.2 µl) of protein-containing solution at 12 mg/ml and reservoir solution using a Cartesian Honeybee System (Genomic Solutions, Irvine, USA) nano-dispenser robot and equilibrated against 50 µl reservoir solution³⁹. However, no crystals were obtained for either full-length Gal-3 or the Gal-3[NTS/IV-IX] variant in any of the conditions tested. Single well-diffracting crystals were obtained in 18% PEG 8 K, 100 mM Tris-HCl (pH 8.5) and 200 mM lithium sulfate. Crystals grew in approximately one month to an average size of 0.15 × 0.15 × 0.10 mm.

X-ray data collection and structure determination

For data collection, crystals were cryo-protected with a cryo-solution containing the reservoir supplemented with 12.5% (v/v) PEG 400 and flash-frozen in liquid nitrogen. X-Ray data collection experiments were performed at the ALBA Synchrotron (Cerdanyola del Vallès, Spain) BL13 XALOC beamline. The data were indexed and integrated using XDS⁵⁷, scaled and merged using Aimless^58,59. The structure was solved by molecular replacement using the Gal-3 CRD structure (PDB ID: 1A3K)³⁷ with Phaser⁶⁰. The initial model was first refined using Refmac⁶¹ and alternating manual building with Coot⁶². The final model was obtained by repetitive cycles of refinement; solvent molecules, lactose and sulfate molecules were added automatically and inspected visually for chemically plausible positions. The model was validated and analyzed by MolProbity⁶³, figures illustrating protein structure were drawn with PyMOL⁶⁴. Data processing and refinement statistics are listed in Supplementary Table S1. Plot of the average B-factors is shown in Supplementary Fig. S4.

Small-angle X-ray scattering (SAXS)

SAXS data were collected on BM29 at the European Synchrotron Radiation Facility (ESRF, Grenoble, France) using the BioSAXS robot and a Pilatus 1M detector (Dectris AG, Baden-Daettwil, Switzerland) with synchrotron radiation at a wavelength of λ = 0.1 nm and a sample-detector distance of 2.867 m⁶⁵. Each measurement consisted of 10 frames each of 1 s exposure of a 100 μL sample solution flowing continuously through a 1 mm diameter capillary. Buffer scattering was determined immediately before each measurement of the corresponding protein sample at 269 K. The scattering images obtained were spherically averaged, and the buffer scattering intensities subtracted using in-house software. Protein-containing solutions of Gal-3[NTS/VII-IX], Gal-3[NTS/IV-IX] and full-length Gal-3 were prepared at concentrations of 2, 4, 6, 8 and 10 mg/mL in 20 mM sodium/potassium phosphate buffer at pH 7.0 containing 150 mM NaCl, 4 mM β-mercaptoethanol and 5 mM lactose. Data points affected by aggregation, possibly induced by radiation damage, were excluded. Regularized indirect transforms of the scattering data were performed with the program GNOM⁴⁰ to obtain the radius of gyration (Rg) and P(r) functions of interatomic distances. Three-dimensional bead models that fitted with the scattering data were generated ab initio using the program DAMMIF⁴⁰. Multiple runs were performed to generate 20 independent model shapes that were combined and filtered to produce an averaged model using the program DAMAVER⁴⁰.

Modelling inside the SAXS envelope

The X-ray crystal structure of Gal-3[NTS/VII-IX] was superimposed over the SAXS-defined envelope using SUPCOMB⁴⁰. The connecting segment of about 30 residues between Pro17 and Tyr101, corresponding to the last few NTS residues and repeats VII and VIII of the Gal-3[NTS/VII-IX] variant was analyzed by two secondary-structure prediction servers, i.e. RaptorX⁶⁶ and I-TASSER⁶⁷. The predicted structure was an almost linear and long polypeptide chain. Thus, Coot⁶² was used to build a model interconnecting the visible parts of the NT from the crystal structure. The model was then subjected to different simulated annealing torsion-angle refinement protocols using CNS⁶⁸ with a multi-temperature approach method (3,000 K to 10,000 K) until the model reached convergence at 300 K. The stereochemical quality was then checked with MolProbity⁶³ showing reasonable scores, with no bad contacts and 85.8% of the residues in the most favoured regions of the Ramachandran plot. Docking of the structure of the new model for the Gal-3[NTS/VII-IX] variant generated by our data with CK1 kinase was performed using HADDOCK⁶⁹ (HADDOCK score of −95.7 ± 14.0; buried surface area of 3068.6 ± 207.7 Å²). Docking of the LacNAc-based saccharides on the CRD tetramer was performed using Autodock4⁷⁰.

References

Gabius, H.-J. & Roth, J. An introduction to the sugar code. Histochem. Cell Biol. 147, 111–117 (2017).
Article PubMed CAS Google Scholar
Manning, J. C. et al. Lectins: a primer for histochemists and cell biologists. Histochem. Cell Biol. 147, 199–222 (2017).
Article PubMed CAS Google Scholar
Kaltner, H. et al. Galectins: their network and roles in immunity/tumor growth control. Histochem. Cell Biol. 147, 239–256 (2017).
Article PubMed CAS Google Scholar
Mayer, S., Raulf, M. K. & Lepenies, B. C-type lectins: their network and roles in pathogen recognition and immunity. Histochem. Cell Biol. 147, 223–237 (2017).
Article PubMed CAS Google Scholar
Cooper, D. N. W. Galectinomics: finding themes in complexity. Biochim. Biophys. Acta 1572, 209–231 (2002).
Article PubMed CAS Google Scholar
Hirabayashi, J. (ed.) Recent topics on galectins. Trends Glycosci. Glycotechnol. 9, 1–180 (1997).
Hughes, R. C. Mac-2: a versatile galactose-binding protein of mammalian tissues. Glycobiology 4, 5–12 (1994).
Article PubMed CAS Google Scholar
Davidson, P. J. et al. Transport of galectin-3 between the nucleus and cytoplasm. I. Conditions and signals for nuclear import. Glycobiology 16, 602–611 (2006).
Article PubMed CAS Google Scholar
Li, S. Y. et al. Transport of galectin-3 between the nucleus and cytoplasm. II. Identification of the signal for nuclear export. Glycobiology 16, 612–622 (2006).
Article PubMed CAS Google Scholar
Nakahara, S., Hogan, V., Inohara, H. & Raz, A. Importin-mediated nuclear translocation of galectin-3. J. Biol. Chem. 281, 39649–39659 (2006).
Article PubMed CAS Google Scholar
Gao, X. et al. The two endocytic pathways mediated by the carbohydrate recognition domain and regulated by the collagen-like domain of galectin-3 in vascular endothelial cells. PLoS One 7, e52430 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Takenaka, Y. et al. Nuclear export of phosphorylated galectin-3 regulates its antiapoptotic activity in response to chemotherapeutic drugs. Mol. Cell. Biol. 24, 4395–4406 (2004).
Article PubMed PubMed Central CAS Google Scholar
Sato, S., Burdett, I. & Hughes, R. C. Secretion of the baby hamster kidney 30-kDa galactose-binding lectin from polarized and nonpolarized cells: a pathway independent of the endoplasmic reticulum-Golgi complex. Exp. Cell Res. 207, 8–18 (1993).
Article PubMed CAS Google Scholar
Mehul, B. & Hughes, R. C. Plasma membrane targetting, vesicular budding and release of galectin 3 from the cytoplasm of mammalian cells during secretion. J. Cell Sci. 110, 1169–1178 (1997).
PubMed CAS Google Scholar
Gong, H. C. et al. The NH₂ terminus of galectin-3 governs cellular compartmentalization and functions in cancer cells. Cancer Res. 59, 6239–6245 (1999).
PubMed CAS Google Scholar
Menon, R. P. & Hughes, R. C. Determinants in the N-terminal domains of galectin-3 for secretion by a novel pathway circumventing the endoplasmic reticulum-Golgi complex. Eur. J. Biochem. 264, 569–576 (1999).
Article PubMed CAS Google Scholar
Hsu, D. K., Zuberi, R. I. & Liu, F.-T. Biochemical and biophysical characterization of human recombinant IgE-binding protein, an S-type animal lectin. J. Biol. Chem. 267, 14167–14174 (1992).
PubMed CAS Google Scholar
Massa, S. M., Cooper, D. N. W., Leffler, H. & Barondes, S. H. L-29, an endogenous lectin, binds to glycoconjugate ligands with positive cooperativity. Biochemistry 32, 260–267 (1993).
Article PubMed CAS Google Scholar
Ochieng, J. et al. Structure-function relationship of a recombinant human galactoside-binding protein. Biochemistry 32, 4455–4460 (1993).
Article PubMed CAS Google Scholar
Mehul, B., Bawumia, S., Martin, S. R. & Hughes, R. C. Structure of baby hamster kidney carbohydrate-binding protein CBP30, an S-type animal lectin. J. Biol. Chem. 269, 18250–18258 (1994).
PubMed CAS Google Scholar
André, S., Liu, B., Gabius, H.-J. & Roy, R. First demonstration of differential inhibition of lectin binding by synthetic tri- and tetravalent glycoclusters from cross-coupling of rigidified 2-propynyl lactoside. Org. Biomol. Chem. 1, 3909–3916 (2003).
Article PubMed Google Scholar
Ahmad, N. et al. Galectin-3 precipitates as a pentamer with synthetic multivalent carbohydrates and forms heterogeneous cross-linked complexes. J. Biol. Chem. 279, 10841–10847 (2004).
Article PubMed CAS Google Scholar
Morris, S. et al. Quaternary solution structures of galectins-1, -3, and -7. Glycobiology 14, 293–300 (2004).
Article PubMed CAS Google Scholar
Nieminen, J., Kuno, A., Hirabayashi, J. & Sato, S. Visualization of galectin-3 oligomerization on the surface of neutrophils and endothelial cells using fluorescence resonance energy transfer. J. Biol. Chem. 282, 1374–1383 (2007).
Article PubMed CAS Google Scholar
Wang, G. N., André, S., Gabius, H.-J. & Murphy, P. V. Bi- to tetravalent glycoclusters: synthesis, structure-activity profiles as lectin inhibitors and impact of combining both valency and headgroup tailoring on selectivity. Org. Biomol. Chem. 10, 6893–6907 (2012).
Article PubMed CAS Google Scholar
Mauris, J. et al. Modulation of ocular surface glycocalyx barrier function by a galectin-3 N-terminal deletion mutant and membrane-anchored synthetic glycopolymers. PLoS One 8, e72304 (2013).
Article ADS PubMed PubMed Central CAS Google Scholar
Goodman, C. K. et al. Multivalent scaffolds induce galectin-3 aggregation into nanoparticles. Beilstein J. Org. Chem. 10, 1570–1577 (2014).
Article PubMed PubMed Central CAS Google Scholar
Kuklinski, S. & Probstmeier, R. Homophilic binding properties of galectin-3: involvement of the carbohydrate recognition domain. J. Neurochem. 70, 814–823 (1998).
Article PubMed CAS Google Scholar
Yang, R. Y., Hill, P. N., Hsu, D. K. & Liu, F.-T. Role of the carboxyl-terminal lectin domain in self-association of galectin-3. Biochemistry 37, 4086–4092 (1998).
Article PubMed CAS Google Scholar
Lepur, A., Salomonsson, E., Nilsson, U. J. & Leffler, H. Ligand induced galectin-3 protein self-association. J. Biol. Chem. 287, 21751–21756 (2012).
Article PubMed PubMed Central CAS Google Scholar
Birdsall, B. et al. NMR solution studies of hamster galectin-3 and electron microscopic visualization of surface-adsorbed complexes: evidence for interactions between the N- and C-terminal domains. Biochemistry 40, 4859–4866 (2001).
Article PubMed CAS Google Scholar
Kopitz, J. et al. Homodimeric galectin-7 (p53-induced gene 1) is a negative growth regulator for human neuroblastoma cells. Oncogene 22, 6277–6288 (2003).
Article PubMed CAS Google Scholar
Halimi, H. et al. Glycan dependence of galectin-3 self-association properties. PLoS One 9, e111836 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Ippel, H. et al. Intra- and intermolecular interactions of human galectin-3: assessment by full-assignment-based NMR. Glycobiology 26, 888–903 (2016).
Article PubMed PubMed Central CAS Google Scholar
Lin, Y.-H. et al. The intrinsically disordered N-terminal domain of galectin-3 dynamically mediates multisite self-association of the protein through fuzzy interactions. J. Biol. Chem. 292, 17845–17856 (2017).
Article PubMed CAS Google Scholar
Berbís, M. A. et al. Peptides derived from human galectin-3 N-terminal tail interact with its carbohydrate recognition domain in a phosphorylation-dependent manner. Biochem. Biophys. Res. Commun. 443, 126–131 (2014).
Article PubMed CAS Google Scholar
Seetharaman, J. et al. X-ray crystal structure of the human galectin-3 carbohydrate recognition domain at 2.1-Å resolution. J. Biol. Chem. 273, 13047–13052 (1998).
Article PubMed CAS Google Scholar
Kopitz, J. et al. Human chimera-type galectin-3: defining the critical tail length for high-affinity glycoprotein/cell surface binding and functional competition with galectin-1 in neuroblastoma cell growth regulation. Biochimie 104, 90–99 (2014).
Article PubMed CAS Google Scholar
Flores-Ibarra, A. et al. Preliminary X-ray crystallographic analysis of an engineered variant of human chimera-type galectin-3 with a shortened N-terminal domain. Acta Crystallogr. F71, 184–188 (2015).
Google Scholar
Franke, D. et al. ATSAS 2.8: a comprehensive data analysis suite for small-angle scattering from macromolecular solutions. J. Appl. Cryst. 50, 1212–1225 (2017).
Article CAS Google Scholar
Dawson, H., André, S., Karamitopoulou, E., Zlobec, I. & Gabius, H.-J. The growing galectin network in colon cancer and clinical relevance of cytoplasmic galectin-3 reactivity. Anticancer Res. 33, 3053–3059 (2013).
PubMed CAS Google Scholar
Kopitz, J. et al. Negative regulation of neuroblastoma cell growth by carbohydrate-dependent surface binding of galectin-1 and functional divergence from galectin-3. J. Biol. Chem. 276, 35917–35923 (2001).
Article PubMed CAS Google Scholar
Sanchez-Ruderisch, H. et al. Tumor suppressor p16^INK4a: downregulation of galectin-3, an endogenous competitor of the pro-anoikis effector galectin-1, in a pancreatic carcinoma model. FEBS J. 277, 3552–3563 (2010).
Article PubMed CAS Google Scholar
Weinmann, D. et al. Galectin-3 induces a pro-degradative/inflammatory gene signature in human chondrocytes, teaming up with galectin-1 in osteoarthritis pathogenesis. Sci. Rep. 6, 39112 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Barboni, E. A., Bawumia, S. & Hughes, R. C. Kinetic measurements of binding of galectin-3 to a laminin substratum. Glycoconj. J. 16, 365–373 (1999).
Article PubMed CAS Google Scholar
Xiao, Q. et al. Exploring functional pairing between surface glycoconjugates and human galectins using programmable glycodendrimersomes. Proc. Natl. Acad. Sci. USA 115, E2509–E2518 (2018).
Article PubMed PubMed Central CAS Google Scholar
Collins, P. M., Hidari, K. I. & Blanchard, H. Slow diffusion of lactose out of galectin-3 crystals monitored by X-ray crystallography: possible implications for ligand-exchange protocols. Acta Crystallogr. D63, 415–419 (2007).
Google Scholar
Saraboji, K. et al. The carbohydrate-binding site in galectin-3 is preorganized to recognize a sugar-like framework of oxygens: ultra-high-resolution structures and water dynamics. Biochemistry 51, 296–306 (2012).
Article PubMed CAS Google Scholar
Gitt, M. A. et al. Sequence and mapping of galectin-5, a ß-galactoside-binding lectin, found in rat erythrocytes. J. Biol. Chem. 270, 5032–5038 (1995).
Article PubMed CAS Google Scholar
García Caballero, G. et al. Galectin-related protein: an integral member of the network of chicken galectins. 1. From strong sequence conservation of the gene confined to vertebrates to biochemical characteristics of the chicken protein and its crystal structure. Biochim. Biophys. Acta 1860, 2285–2297 (2016).
Article PubMed CAS Google Scholar
Knibbs, R. N. et al. Carbohydrate-binding protein 35. II. Analysis of the interaction of the recombinant polypeptide with saccharides. J. Biol. Chem. 268, 14940–14947 (1993).
PubMed CAS Google Scholar
Hirabayashi, J. et al. Oligosaccharide specificity of galectins: a search by frontal affinity chromatography. Biochim. Biophys. Acta 1572, 232–254 (2002).
Article PubMed CAS Google Scholar
Ahmad, N. et al. Thermodynamic binding studies of cell surface carbohydrate epitopes to galectins-1, -3 and -7. Evidence for differential binding specificities. Can. J. Chem. 80, 1096–1104 (2002).
Article CAS Google Scholar
Collins, P. M. et al. Galectin-3 interactions with glycosphingolipids. J. Mol. Biol. 426, 1439–1451 (2014).
Article PubMed CAS Google Scholar
Gabius, H.-J. How to crack the sugar code. Folia Biol. (Praha) 63, 121–131 (2017).
Google Scholar
Ludwig, A. K. et al. Playing modular puzzle with adhesion/growth-regulatory galectins: design and testing of a hybrid to unravel structure-activity relationships. Protein Pept. Lett. 23, 1003–1012 (2016).
Article PubMed CAS Google Scholar
Kabsch, W. X. D. S. Acta Crystallogr. D66, 125–132 (2010).
Google Scholar
Evans, P. R. An introduction to data reduction: space-group determination, scaling and intensity statistics. Acta Crystallogr. D67, 282–292 (2011).
Google Scholar
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution. Acta Crystallogr. D69, 1204–1214 (2013).
Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D66, 213–221 (2010).
Google Scholar
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D67, 355–367 (2011).
Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D66, 486–501 (2010).
Google Scholar
Chen, V. B. et al. Molprobity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D66, 12–21 (2010).
Google Scholar
DeLano, W. L. http://www.pymol.org (2012).
Pernot, P. et al. Upgraded ESRF BM29 beamline for SAXS on macromolecules in solution. J. Sync. Rad. 20, 660–664 (2013).
Article CAS Google Scholar
Wang, S., Li, W., Liu, S. & Xu, J. RaptorX-Property: a web server for protein structure property prediction. Nucleic Acids Res. 44, W430–W435 (2016).
Article PubMed PubMed Central CAS Google Scholar
Yang, J. & Zhang, Y. Protein structure and function prediction using I-TASSER. Curr. Protoc. Bioinformatics 52, 5.8.1–5.815 (2015).
Article Google Scholar
Brünger, A. T. Version 1.2 of the crystallography and NMR system. Nat. Protoc. 2, 2728–2733 (2007).
Article PubMed CAS Google Scholar
van Zundert, G. C. P. et al. The HADDOCK2.2 webserver: user-friendly integrative modeling of biomolecular complexes. J. Mol. Biol. 428, 720–725 (2015).
Article PubMed CAS Google Scholar
Morris, G. M. et al. Autodock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Computational Chemistry 16, 2785–2791 (2009).
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank Drs B. Friday and A. Leddoz for inspiring discussions. This work was supported by the European Union’s Seventh Framework Program FP7/2007-2013 under REA grant agreement no. 317297 (“GLYCOPHARM“). AR acknowledges support from Ministry of Economy and Competitiveness through grant BFU2016-77835-R. We thank the staff of the XALOC beamline at ALBA (Spain) and the BM29 beamline at ESRF (France) for assistance. The atomic coordinates and structure factors (code 6FOF) have been deposited in the Protein Data Bank.

Author information

Andrea Flores-Ibarra and Sabine Vértesy contributed equally to this work.

Authors and Affiliations

Department of Structural and Chemical Biology, Centro de Investigaciones Biológicas, CSIC, Ramiro de Maeztu 9, 28040, Madrid, Spain
Andrea Flores-Ibarra, Francisco J. Medrano & Antonio Romero
Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstrabe 13, 80539, Munich, Germany
Sabine Vértesy & Hans-Joachim Gabius

Authors

Andrea Flores-Ibarra
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Vértesy
View author publications
You can also search for this author in PubMed Google Scholar
Francisco J. Medrano
View author publications
You can also search for this author in PubMed Google Scholar
Hans-Joachim Gabius
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Romero
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.-J.G. and A.R. conceived and designed the study, and they wrote the manuscript. S.V. provided reagents. A.F.-I. performed crystallography and SAXS experiments. F.J.M. analyzed and interpreted the SAXS data. A.R. carried out the structural study, analyzed the data and edited the manuscript. All authors approved the final version prior to submission.

Corresponding authors

Correspondence to Hans-Joachim Gabius or Antonio Romero.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Flores-Ibarra, A., Vértesy, S., Medrano, F.J. et al. Crystallization of a human galectin-3 variant with two ordered segments in the shortened N-terminal tail. Sci Rep 8, 9835 (2018). https://doi.org/10.1038/s41598-018-28235-x

Download citation

Received: 12 February 2018
Accepted: 19 June 2018
Published: 29 June 2018
DOI: https://doi.org/10.1038/s41598-018-28235-x

This article is cited by

Neuroinflammation and galectins: a key relationship in neurodegenerative diseases
- Eleazar Ramírez Hernández
- Beatriz Alanis Olvera
- Edgar Zenteno
Glycoconjugate Journal (2022)
Galectin-3: a key player in microglia-mediated neuroinflammation and Alzheimer's disease
- Yinyin Tan
- Yanqun Zheng
- Qingqing Yin
Cell & Bioscience (2021)
The marriage of chemokines and galectins as functional heterodimers
- Philipp von Hundelshausen
- Kanin Wichapong
- Kevin H. Mayo
Cellular and Molecular Life Sciences (2021)
The emerging role of galectins in (re)myelination and its potential for developing new approaches to treat multiple sclerosis
- Charlotte G. H. M. de Jong
- Hans-Joachim Gabius
- Wia Baron
Cellular and Molecular Life Sciences (2020)
Chicken lens development: complete signature of expression of galectins during embryogenesis and evidence for their complex formation with α-, β-, δ-, and τ-crystallins, N-CAM, and N-cadherin obtained by affinity chromatography
- Gabriel García Caballero
- Sebastian Schmidt
- Hans-Joachim Gabius
Cell and Tissue Research (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.