Journal home
Advance online publication
Current issue
Archive
Press releases
Supplements
Focus
Guide to authors
Online submissionOnline submission
Permissions
For referees
Free online issue
Contact the journal
Subscribe
Advertising
work@npg
naturereprints
About this site
For librarians
 
NPG Resources
Nature
Nature Cell Biology
Nature Reviews Molecular Cell Biology
The EMBO Journal
Nature Reports Avian Flu
NPG Subject areas
Biotechnology
Cancer
Chemistry
Clinical Medicine
Dentistry
Development
Drug Discovery
Earth Sciences
Evolution & Ecology
Genetics
Immunology
Materials Science
Medical Research
Microbiology
Molecular Cell Biology
Neuroscience
Pharmacology
Physics
Browse all publications
Article
Nature Structural Biology  9, 628 - 634 (2002)
Published online: 1 July 2002; | doi:10.1038/nsb817

A novel fucose recognition fold involved in innate immunity

Mario A. Bianchet1, Eric W. Odom2, Gerardo R. Vasta2 & L. Mario Amzel1

1 Department of Biophysics and Biophysical Chemistry, The Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore, Maryland 21205, USA.

2 Center of Marine Biotechnology, University of Maryland Biotechnology Institute, 701 East Pratt Street, Baltimore, Maryland 21202, USA.

Correspondence should be addressed to Mario A. Bianchet bianchet@juliet.med.jhmi.edu or L. Mario Amzel mario@neruda.med.jhmi.edu
Anguilla anguilla agglutinin (AAA), a fucolectin found in the serum of European eel, participates in the recognition of bacterial liposaccharides by the animal innate immunity system. Because AAA specifically recognizes fucosylated terminals of H and Lewis (a) blood groups, it has been used extensively as a reagent in blood typing and histochemistry. AAA contains a newly discovered carbohydrate recognition domain present in proteins of organisms ranging from bacteria to vertebrates. The crystal structure of the complex of AAA with a-L-fucose characterizes the novel fold of this entire lectin family, identifying the residues that provide the structural determinants of oligosaccharide specificity. Modification of these residues explains how the different isoforms in serum can provide a diverse pathogen-specific recognition.
Invertebrates and vertebrates have a self/nonself pattern recognition innate immunity system based on the detection of carbohydrates that decorate the surface of potential pathogens1, 2. Humoral and cell-associated carbohydrate-binding proteins from the host are responsible for the initial recognition step in the first line of internal defense, which leads to agglutination, endocytosis by phagocytic cells and the activation of protease cascades, resulting in clotting, melanization or complement-mediated killing of the pathogen. Fucolectins — lectins that bind fucose and share a characteristic sequence motif — have been described as immune-recognition molecules in both invertebrates and vertebrates, such as the horseshoe crab (Tachypleus tridentatus3) and the Japanese eel (Anguilla japonica4). The A. japonica fucolectin is induced by the presence of bacterial liposaccharides4. A closely related fucolection from serum of the European eel, Anguilla anguilla agglutininin (AAA), binds not only polysaccharides containing L-fucose and D-galactose methylether derivatives5 but also recognizes the H and Lea antigens6. This property eventually led to the determination of the structures of ABH and Lewis blood group oligosaccharide determinants7. Since then, AAA, together with unrelated fucose-binding lectins from plants, such as asparagus pea (Lotus tetragonolobus) and gorse seed (Ulex europaeus), have become useful reagents with wide application in hematology and glycobiology. Thus, the structure of an animal fucolectin−ligand complex is needed to rigorously address their fine specificity and biological role as host defense recognition molecules.

The structure of the AAA−fucose complex described here reveals that the protein has no similarities to any other structurally characterized fucose-recognizing protein and represents the fold of an entire lectin family. In addition, it reveals the architecture of the physiological trimer and allows the identification of the interactions that define the specificity of animal fucose-binding lectins and several related proteins. The presence of the AAA fold can be recognized in the sequence of several other proteins from organisms ranging from bacteria to vertebrates. Some contain the conserved sequence motif that we found characterizes binding to alpha-L-fucose. On the basis of the presence of a fully conserved fucose-binding motif, these proteins can be also expected to behave as fucolectins. The structure also identifies exposed loops around the binding site that contain higher sequence variability among the seven A. japonica fucolectins, suggesting how these loops modulate the fine specificity of lectin for different pathogens.

Despite having no significant sequence similarity, AAA has unexpected structural similarities with the C1 and C2 domains of human coagulation factor V8, a fungal galactose oxidase9 and a bacterial sialidase10 and with two proteins whose structures were recently determined, the APC10/DOC1 (a ubiquitin ligase)11 and the XRCC1 (part of the single-stranded DNA repair complex)12. The presence of a similar fold in proteins of coagulation and innate immunity can be taken as further support for the postulate of a common origin for these systems13.

Overall fold
A. anguilla agglutinin obtained from a commercial source was re-purified by affinity chromatography and crystallized with l-fucose present in the crystallization medium. The crystal structure was determined at 1.9 Å resolution by single isomorphous replacement using the anomalous scatter (SIRAS) signal of a gold derivative crystal (Table 1). In total, 158 residues, 3 ligands — 1 alpha-L-fucose, 1 anion (Cl-) and 1 cation (either Na+ or Ca2+) — and a 130 solvent molecules were built into the resulting high quality electronic density map.

Table 1. Crystallographic data collection, phasing and refinement statistics
Table 1 thumbnail

Full TableFull Table
AAA folds as a beta-barrel with jellyroll topology (Fig. 1a). The bulk of the fold consists of eight major antiparallel beta-strands arranged in two beta-sheets of five (beta2, beta3, beta10, beta6 and beta7) and three (beta11, beta5 and beta8) strands packed against each other. Two short antiparallel strands (beta4 and beta9) close one end of the barrel (bottom in Fig. 1a). The N- and C-terminal strands (beta1 and beta11) protrude 15 Å from this end of the barrel, forming an antiparallel two-stranded beta-sheet. Five loops connect the two main beta-sheets at the other end of the barrel (top in Fig 1a); we named these loops complementarity determining regions (CDR) 1−5 by analogy to the immunoglobulin nomenclature. The CDR loops encircle a heavily positively charged hollow that binds the carbohydrate (Fig. 1b). These loops are spanned by the following residue ranges: Gln 22−Ser 31 (CDR1), Arg 41−Cys 50 (CDR2), Ser 53−Asn 58 (CDR3), Arg 79−Ala 90 (CDR4) and Pro 138−Ser 142 (CDR5). Of the five loops, CDR1 protrudes farthest away from the barrel; at its exposed apex, the aliphatic segment of Glu 26 stacks over the aromatic ring of His 27, both reaching toward the central depression. On the other side of the hollow, CDR4 has two solvent-exposed Cys residues (Cys 82−Cys 83) that form a rare disulfide bridge between consecutive residues. At the side of the barrel, a loop-rich substructure, which includes CDR1 and CDR2, tightly binds a cation. A disulfide bridge between Cys 50 and Cys 146 links the N-terminus of beta11 with the C-terminus of CDR2, and an Arg 41−Glu 149 salt bridge attaches the loop between eta2 and eta3 to beta11. Both of these interactions hold the cation-binding substructure against the bulk of the fold. A disulfide bridge between Cys 108 and Cys 124 links strands beta7 and beta8, attaching the two main beta-sheets. An intrasheet salt bridge between Asp 64 and Arg 131 links the N-terminus of beta3 with the C-terminus of beta10 (in the front sheet, Fig. 1a). The two disulfide bridges, the two salt bridges and the cation clamp the structure together. The number of structural elements associated with the cation site involved in shaping CDR1 and CDR2 stresses the importance of the conformation of these two loops for the in vivo function of AAA.

Figure 1. The structure of the AAA in complex with alpha-L-fucose.
Figure 1 thumbnail

a, Ribbon diagram. The labeled secondary structure elements are represented as arrows (beta-strands) and helices (310-helices). b, Electrostatic potential energy surface. Electrostatic potential at the molecular surface is colored blue for positive values (>10 kT) and red for negative values (<-10 kT). c, Physiological trimer. This figure, as well as Figs 2, 4 and 5b, were prepared using BobScript41, Raster3D42 and GRASP43. The figure is colored from N- to C-terminus in a progression from blue to red. Bound fucose is shown as stick models. Color coding: nitrogen, blue; chlorine, green; oxygen, red; carbons, gold; and calcium, cyan.



Full FigureFull Figure and legend (120K)
Oligomeric state
Lectin oligomerization results in the association of binding sites that confers lectins their agglutination properties and their high affinity (multivalence) for the repetitive patterns of clustered glycans typically observed in bacterial pathogens14. There is no agreement in the literature about the oligomeric state of AAA or other eel serum fucolectins (that is, from A. rostrata and A. japonica); reported molecular weights of these proteins vary from 44 to 140 kDa (refs 4,15,16). Our size-exclusion chromatography experiments (data not shown) confirm early reports15, 16 of an oligomer size of 51 kDa. Reported experiments using SDS-PAGE suggested that the native oligomer consists of one or more subunits, some linked by disulfide bridges4, 15, 16. Recently, a tetrameric association — two dimers, with each dimer covalently linked by a disulfide bridge — has been proposed for A. japonica on the basis of the pattern of three bands (20 kDa, 40 kDa and 80 kDa) observed in nonreducing, reducing and reducing-heated SDS-PAGE, respectively4.

The crystal structure of AAA shows a large area of molecular interaction around a crystallographic three-fold axis, forming a tight trimer (Fig. 1c); only loose contacts are observed between trimers. Each monomer in this trimer buries 1,240 Å2 of solvent-accessible area, which is evenly divided between apolar (590 Å2) and polar (650 Å2) contributions. The burying of this large area suggests that this noncovalently associated trimer of 17 kDa monomers represents the AAA native oligomeric state of 51 kDa, as we observed in size-exclusion chromatography experiments (data not shown; see Methods). Contrary to the results of electrophoresis, no Cys residue participates in the observed trimer in the crystal structure of AAA. However, the Cys residues in the strained disulfide bridge (Cys 82−Cys 83) are solvent exposed and could participate in nonspecific intermolecular crosslinks, which would explain the covalent dimer observed in nonreducing SDS-PAGE. In light of the structure, however, the physiological relevance of such a dimer is questionable because the intramonomer disulfide bridge participates in fucose binding (see below) and an intermolecular disulfide bridge at this position would impede such binding.

Cation site
The cation is coordinated with seven oxygen atoms from the main chain or the side chain of six residues: Asn 35 (O), Asp 38 (Odelta1), Asn 40 (O), Ser 49 (O, Ogamma1), Cys 146 (O) and Glu 147 (Oepsilon1). Coordination distances (2.4−2.6 Å) and pentagonal bi-pyramidal geometry are compatible with two types of cations, Na+ and Ca2+, which are both present in the crystallization medium. We tentatively interpret this feature as a Ca2+ ion. Although the cation site is distant from the residues involved in carbohydrate binding, enhancement of carbohydrate affinity by calcium addition has been observed in related proteins3 (see below). This enhancement may result from possible changes induced by cation binding on the conformation of contiguous loops CDR1 and CDR2, which are involved in binding of the ligand.

Carbohydrate-binding site
The crystal structure of the complex shows the bound fucose resting on a highly positively charged depression encircled by the CDRs (Fig. 1b). AAA recognizes the fucose ring O5 and 3-OH and 4-OH hydroxyls using the Nepsilon of His 52 and the guanidinium groups of Arg 79 and Arg 86 (Fig. 2). A network of hydrogen bonds maintains this triad of residues in optimal positions to provide all the polar interactions to the carbohydrate: His 144 interacts with Arg 86, the main chain oxygen of Ser 53 interacts with His 52 and Asp 81 interacts with Arg 79 to form a salt bridge (Fig. 2). The triad of residues at the center of the binding site forms hydrogen bonds with the fucose axial 4-OH with perfect tetrahedral geometry. Completing the list of polar interactions between AAA and the fucose, Arg 79 hydrogen bonds to O5 and Arg 86 hydrogen bonds to the equatorial 3-OH. The network of hydrogen bonds, together with one of the disulfide bridge (Cys 82−Cys 83), makes CDR4 a rigid and wide loop. This loop provides four of the five polar interactions with the carbohydrate. Its sequence, RGDCCER, contains the RGD motif found in cell adhesion proteins17. The monosaccharide also makes key van der Waals contacts with the binding site: its ring atoms C1 and C2 rest over the CDR4 disulfide and its C6 docks loosely in a hydrophobic pocket stacking against the rings of His 27 (on top) and Phe 45. These residues, together with Leu 23 and Tyr 46 on the sides, form the binding pocket.

Figure 2. Fucose bound to the AAA-binding site.
Figure 2 thumbnail

Stereo view of the final model shows the 2Fo - Fc density around the bound carbohydrate in green. Residues and ligand are represented using the same representation as in Fig. 1. Broken cyan lines represent polar interaction between alpha-L-Fuc and protein residues.



Full FigureFull Figure and legend (51K)
AAA monosaccharide specificity
Analysis of AAA binding-site interactions suggests that the fucose-recognition motif is a His residue followed 24 residues downstream by the sequence RGDCCGER. This motif contains the three basic residues (His 52, Arg 79 and Arg 86) that recognize the axial 4-OH, equatorial 3-OH and O5 of the L-fucose. Also important is the recognition of the methyl group by the pocket shaped by CDR1 and CDR2, in particular by Leu 23 and the aromatic residues His 27, Phe 45 and Tyr 46, which form the hydrophobic lining of the pocket (Fig. 3a). Colitose (3-deoxy-L-fucose), a unique sugar present in E. coli liposaccharides (O-antigen), is a good candidate for a specific ligand of AAA.

Figure 3. Schematic representation of protein interactions with ligands.
Figure 3 thumbnail

a, Interactions observed in the complex. b, Interactions predicted between a D-galactose derivate and AAA.



Full FigureFull Figure and legend (9K)
Surprisingly, AAA also shows high affinity for D-galactose derivatives, such as 3-O-methyl-D-galactose and 3-O-methyl-6-deoxy-D-galactose (3-O-methyl-D-fucose)18. In these monosaccharides, the axial 4-OH group and the small hydrophobic group at their C3 are in topologically equivalent positions to the fucose C6. On the basis of the structure of the AAA−L-fucose complex, we expect that the axial hydroxyls of the galactose derivatives conserve the three hydrogen bonds with His 52, Arg 79 and Arg 86, and the 3-OMe groups occupying the hydrophobic pocket used by the fucose C6 (Fig. 3b). Besides, the larger methoxy group fits in the pocket even more tightly than the smaller methyl (C6). This, as well as interactions between Arg 86 and either O5, 6-OH or both, would compensate for the apparent loss of other interactions present in the AAA−L-fucose complex: O5−Arg 79 and 3-OH−Arg 86 (Fig. 3b).

AAA oligosaccharide specificity
AAA recognizes blood group oligosaccharides such as the fucosyl terminal in the H type 1 antigen (Fucalpha1-2 Galbeta1-3GlcNAcbeta1-3Galbeta1-4Glc) and in Lea antigen (Galbeta1-3[Fucalpha1-4]GlcNAcbeta1-3Galbeta1-4Glc) but not the Lex antigen (Galbeta1-4[Fucalpha1-3]GlcNAcbeta1-3Galbeta1-4Glc)6. In oligosaccharides containing these antigens, the alpha-linked fucose protrudes from the linear beta-linked carbohydrate backbone, making the fucose a suitable target of lectin recognition. The fucosyl terminal trisaccharides of Lex and Lea are rigid as a result of close packing between their fucose and galactose moieties19, 20. Crystallographic and NMR studies agree that the conformations of the protein bound carbohydrates are closely related to predicted low energy conformations in solution, although not always that with the lowest energy21, 22. The preferred conformations in solution of Lea and Lex terminal trisaccharides are with their GlcNAc (N-acetylglucosamine) bridging the partially juxtaposed Fuc (Fucose) and Gal (Galactose) moieties. In Lea, the GlcNAc moiety has the 2-N-acetyl group pointing toward the Gal moiety side and the 5-CH2OH group pointing toward the Fuc side of the trisaccharide. This is reversed in the Lex, because the Fuc and Gal moieties switch places (3-linkage to 4-linkage and vice versa) in linking to GlcNAc. In the fucosyl terminal trisaccharide of the H type 1 antigen (Fucalpha1-2Galbeta1-3GlcNAc), GlcNAc and the smaller Gal moieties switch places with respect to the ones in Lea and Lex trisaccharides.

Modeling experiments show that Lea and H trisaccharides can be docked into AAA, using L-fucose binding as a guide, with small adjustments of their glycoside angles and/or movements of CDR1. In these models, the trisaccharides present several epitopes to polar residues of the AAA. CDR1 residues Glu 26 and His 27 are in position to recognize the Gal equatorial hydroxyls (3-OH and 2-OH) and the oxygen of the 2-N-Acetyl group from GlcNAc in Lea (Fig. 4a), or the 6-OH and 4-OH groups from GlcNAc in H (Fig. 4b). In both cases, the OH group from Tyr 46 (CDR2) coordinates the glycosidic bond oxygen between the Gal and GlcNAc moieties. Furthermore, Asp 81 and Arg 79 (CDR4) are in position to interact with the 6-OH group from GlcNAc in Lea, and a water molecule (W2, Fig. 4b) may bridge Gal 4-OH and Asp 81 in H.

Figure 4. Model of the interactions between AAA and a terminal fucosyl trisaccharide.
Figure 4 thumbnail

a, Lea antigen. b, H-type 1. Residues and trisaccharides are shown using the same representation and color scheme of Fig. 1, with the exception of the carbon atoms (pink) of the ligand.



Full FigureFull Figure and legend (56K)
In Lex a bulkier group than the 5-hydroxymethylene, the 2-N-acetyl, points toward the Fuc side of the trisaccharide (against the protein), impeding the adjustment of the Gal moiety position to remove clashes with the CDR1. Although a large movement of the CDR1 could remove these clashes, it would require a large structural disruption of the loop. The rigidity of this loop, which is implied by the low mean temperature factor value (22 Å2) compared with the value of entire protein (28 Å2), makes such a movement unlikely and provides an explanation of why AAA lacks affinity for Lex.

Structural similarity with other proteins
The AAA fold has no similarity to other fucose-recognition proteins, including selectins (C-lectin fold) and plant fucolectins (legume lectin fold), and represents a novel fold within the entire lectin family23. However, a search for similar structures in the DALI database24 reveals three proteins with a fold similar to AAA, none with significant sequence similarity to AAA: the C1 and C2 repeats of blood coagulation factor V8 (FVa-C1 and -C2; FVa-C2 PDB entry 1CZT), the C-terminal domain of sialidase10 (CSIase and neuramidase; PDB entries 1EUT and 1EUU) and the N-terminal domain of galactose oxidase9 (NGOase; PDB entry 1GOF). Recently, other proteins with a similar fold and even lower sequence similarities have been reported: APC10/DOC1 (a ubiquitin ligase)11 and XRCC1 (part of the single-stranded DNA repair complex)12. An alignment of CSIase, NGOase, FVa-Ca and AAA sequences on the basis of the superposition of structurally similar domains indicates sequence identities with AAA in the 2−14% range (Fig. 5). This alignment shows that only three residues, a Pro, an Asp and an Arg, are strictly conserved; these are equivalent to AAA residues Asp 64, Pro 106 and Arg 131. In the four structures, the Pro residue is in different conformations, but the Asp and the Arg form an equivalent salt bridge. The core and the bottom (as viewed in Fig. 5b) of the beta-barrel are similar in all these structures. The loops at the top of the barrel, although varied in size and conformation, all encircle a central depression. This depression is shallower and has fewer aromatic residues in AAA. In NGOase and CSIase, the hollow conserves two members of the triad of basic residues that holds the axial hydroxyl of the ligand in AAA, the His and one Arg residue. The third partner, the second Arg residue, is replaced by a Glu in CSIase and an Asn residue in a distant position in NGOase. FVa-C2 has lost all the triad residues related to carbohydrate binding, rendering its pocket the most hydrophobic and the deepest.

Figure 5. Structure-based sequence alignment of AAA with three proteins that share the AAA fold.
Figure 5 thumbnail

a, Sequence alignment of AAA on the basis of b, the structural superposition of AAA (cyan) with CSIase (red), NGOase (blue) and FVa-C2 (green). Conserved residues are white in a magenta box; regions of residues with high similarity are boxed in blue and colored red. The positions of residues participating in polar interactions with the fucose are marked with a red star; residues that coordinate the cation are marked with green triangles. This sequence alignment figure and Fig. 6 were prepared with ESPrint44.



Full FigureFull Figure and legend (134K)
All three proteins conserve a large portion of the fold corresponding to the cation-binding site; two of them, NGOase and CSIase, show a bound Na+. CSIase, the galactose-binding domain of the bacterial sialidase, is the only one of the structures in addition to AAA that was determined with bound carbohydrate10. Although Gal 3-OH and 4-OH are described as interacting with His 539 Arg 572 and Glu 578 in a way that is homologous to what we see for AAA, the structure (PDB entry 1EUU) shows that many of the interactions with these residues are weak, in particular those with the axial 4-OH (distances >3.4 Å). Structural evidence that NGOase may bind carbohydrates arises from the presence of residues (His 40 and Arg 73) homologous to those involved in carbohydrate recognition in AAA (His 52 and Arg 79) and in CSIase (His 539 and Arg 572). These residues are absent in human FVa-C2, which has a well-documented affinity for phospholipids8.

AAA domains in other proteins
From bacteria to vertebrates, several proteins in the sequence database are homologous to or contain domains homologous to AAA (Fig. 6). One may assume that these sequences fold in a manner similar to that of AAA: with a 'fucolectin-like' carbohydrate-recognition domain (CRD). For example in bacteria, two proteins of Streptococcus pneumoniae TIGR425 (Entrez entries SP 2159 and NP 346573) have domains that are homologous to AAA. Interestingly, the C-terminus of one of these bacterial proteins (Entrez entry NP 346573) has a tandem of three AAA-like CRDs (Fig. 6). The N-terminal domains of the 'furrowed' receptor and CG9095 of Drosophila melanogaster26, the horseshoe crab tachylectin-4 (ref. 3) and the N-terminal domain of the Xenopus laevis27 pentraxin 1 fusion protein are involved in innate immunity and contain fucolectin domains4. AAA-like CRDs are found together with other domains such as the pentraxin domain in the Xenopus protein or the C-lectin type CRDs and short complementary control proteins (also known as consensus repeats or sushi domains)28, which form a complex receptor in the Drosophila proteins.

Figure 6.  Sequence alignments of AAA with proteins of similar sequence.
Figure 6 thumbnail

Sequences are selected from those present in the 12/2001 version of the NCBI nonredundant protein sequence database with >25% sequence identity with A. anguilla agglutinin (AAA). The sequences are from AAA; eFL1-7, A. japonica fucolectins (Entrez accession numbers: BAB0352(3-9); eFL1−4 residues 21−172 and eFL5−7 residues 32−182); S.pneum (1,2,3), S. pneumoniae TIGR4 (NP 346573; 1 is residues 611−741, 2 is residues 762-892 and 3 is residues 904-1034); Tach4, T. tridentatus tachylectin-4 (ref. 3); XPLN, X. laevis (P49263; residues 39-179); fw, D. melanogaster furrowed receptor (NP 511136; residues 65-203); and CG9095 D. melanogaster (AAF48429; residues 15−154). The color scheme is similar to that of Fig. 5, with the exception of the structural disulfide bridge Cys residues (green box).



Full FigureFull Figure and legend (127K)
Except for the Drosophila proteins, most of these sequences conserve a fucose-binding sequence motif: HX24RXDX4(R/K) (where X stands for any residue). Also highly conserved are the size and hydropathic profile of CDR2, which forms part of the hydrophobic pocket for the fucose methyl group. We expect that these sequences fold as fucolectin-like domains and have similar monosaccharide specificity for l-fucose. Another conserved motif is h2DGx, where h stands for a small hydrophobic residue (Val, Ala or Ile) and 'x' for a small hydrophilic residue (Asn, Asp or Ser). This motif, located in AAA just after the 310-helix eta3, forms the cation-binding site and provides three of the seven oxygens that bind the cation. Horseshoe crab tachylectin-4, which conserves this motif, shows a four-fold enhancement of its hemagglutination activity for A-type erythrocytes upon the addition of Ca2+; this enhancement is abolished by EDTA3. In supporting the function assigned to this motif, two of the other proteins of known structures that conserve this motif also show a bound cation (NGOase and CSIase; Fig. 5).

In the sequence alignment (Fig. 6), all non-eel sequences possess shorter CDR1, lacking five of the residues that form 310-helix eta2. This eliminates the loop protruding-feature observed in AAA, reducing or removing interactions between CDR1 and the putative oligosaccharide antennae and perhaps broadening the specificity spectra of these fucolectins to include Lex oligosaccharides. Organisms other than bacteria conserve the Cys residues involved in the two interstrand disulfide bridges of AAA (50−146 and 104−128).

Similar to the fucolectin from A. japonica, several isoforms of AAA are present in European eel serum16, 29. The isoform predominant in the crystal shows sequence identities with the seven fucolectins from A. japonica (eFL-1−7 in Fig. 6) in the 68−78% range. All fucolectins from the Anguilla genus (Fig. 6) conserve the residue triad associated with polar interactions with the fucose, showing complete conservation of His 52 and the CDR4 sequence. These fucolectins conserve the size of CDR1 and CDR2, although the loops have interesting sequence variations in residues associated with the fucose C6 pocket. CDR1 shows the greatest sequence variability; most eel fucolectins conserve polar residues at the apex of this loop (such as Glu 26 and His 27 in AAA), probably to interact with the third moiety of their oligosaccharide ligands. Also, most eel sequences conserve two aromatic CDR2 residues in the N-terminus of eta4 (such as Phe 45 and Tyr 46 in AAA) that form the C6 pocket. In the eFL-1 and eFL-5 isoforms, the residues in the apex of CDR1 are replaced with smaller residues, making this loop thinner and more flexible, and opening the C6 hydrophobic pocket to the solvent. The eFL-5 isoform maximizes changes in this pocket, replacing all residues involved in the pocket with smaller ones (Leu 23 by Ser, Glu 26 by Gly, His 27 by Gly, Phe 45 by Ser and Tyr 46 by Ser), perhaps conferring a broader specificity on eFL-5. Ser substitutions of Leu 23 and Phe 45 may provide additional polar interactions with hydroxyl groups, such as the 6-OH of a galactose-containing oligosaccharide.

A broader sequence alignment adds mostly adhesion domains of bacterial virulence factors (including sialidases, hyaluronidases, galactosidases and chitinases) and coagulation factors. This broad alignment shows two low sequence variability regions corresponding to beta3 (W[W/I/L][R/K/Q]hDh) and the turn between beta9 and b10 (RY). Three of these residues play roles in the structural stability of the fold; in AAA, these are Trp 61, which stacks in the core of the barrel, and Asp 64 and Arg 131, which form a salt bridge. However, the most conserved residues in the broad alignment are exposed and clustered in a small area of the protein surface (Trp 60, Asp 64, Arg 131 and Tyr 132 in AAA), suggesting the conservation of an interaction region with another protein or specific receptor.

Conclusions
The crystal structure of the AAA defines the fold of this type of fucose-specific lectin. It provides the basis for understanding its alpha-L-fucose specificity and allows the identification of this fold and prediction of function in many other proteins or protein domains of organisms ranging from bacteria to animals. Interestingly, the CRDs from AAA are one of only a few domains (R-lectins and C-lectins) present in animals that have also been found in bacteria30, 31, 32.

The structure of AAA is related to a large family of adhesion proteins and others from unrelated pathways (APC10/DOC1 and XRC11). The diversity of primary structures that share a similar tertiary structure suggests either convergent evolution or a high tolerance for mutations in this fold. However, the common function of these protein domains is to bind specific ligands, such as DNA nucleotides in the case of XRC11, cell surface glycoconjugates in the case of galactose oxidase and sialidase, or phospholipids of the mammalian cell membrane in the case of the coagulation factor V. NGOase and FVa-C1/2 have sequence similarities to members of the discoidin domain family33, most of which have been implicated in cell adhesion or developmental processes34. Discoidin I and II, proteins from the slime mold Dictyostelium discoideum, were initially described as lectins with a high affinity for galactose35. Thus, carbohydrate recognition may be the ancestral function of this domain, which may have occasionally evolved into a phospholipid-binding domain, as in the case of FVa-C2.

Because the function of lectins is to bind surface glycans in a multivalent fashion, the crystal structure of AAA−fucose complex provides the bases for understanding the function of this protein in innate immunity. The fucose-binding sites placed in one of the AAA trimer faces can crosslink fucose-carrying groups on the surface of a pathogen separated by at least 23 Å, about half of the distance observed in the mannose-binding lectin trimer (45 Å)36. Furthermore, the high positive charge of the binding site correlates with the activity against negatively charged polysaccharides present in bacterial membranes.

Although in silico modeling cannot substitute for direct observation of the binding of blood groups to AAA, it facilitates the interpretation of the carbohydrate specificity of AAA, suggesting the mechanism by which CDR1, CDR2 and CDR4 confer AAA its characteristic specificity for H type 1 and Lea antigens through additional interactions with the other moieties of these oligosaccharides. These interactions suggest a mechanism for modulating specificity through selected changes in these exposed loops and provides a rationale for the existence of the several isolectins observed in eel serum, which could represent molecules with fine-tuned specificity for different pathogens. Finally, the ubiquity of this fold warrants the use of F-lectin or F-type lectin fold as a common name for members of this lectin family.

Methods
Protein purification.
Anguilla anguilla agglutinin was purchased from Sigma. The lyophilized protein was resuspended in 10 mM phosphate, pH 7.2. The active lectin was re-purified by affinity chromatography on an L-fucose-Sepharose 6B matrix and eluted with 200 mM alpha-L-fucose. HPLC size exclusion chromatography was preformed in a Superose-12HR (10 times 300 mm2) (Amersham Biosciences) that was pre-equilibrated in 10 mM Tris-HCl, 10 mM CaCl2 and 100 mM NaCl, pH 7.5, buffer. Ovalbumin (Ov; 43 kDa) and bovine serum albumin (BSA; 67 kDa) were used as standards. Filtration was carried out at 4 °C at a flow rate of 0.4 ml min-1. The eluted protein was detected, by measuring the absorbance at 280 nm, at an elution volume of 12.3 ml, between the two standard elution volumes (12 ml for BSA and 12.8 ml for Ov).

Crystallization and data collection.
Crystals of AAA were grown in hanging drops of equal amounts of reservoir solution (40−50% (v/v) 2-methyl-2,4-pentanediol in pH 7−9) and protein solution (10 mg ml-1 protein, 5 mM alpha-L-fucose, 10 mM NaCl, 10 mM Tris-HCl, pH 7.5, and 1 mM CaCl2) equilibrated against a volume of 1 ml of reservoir solution. Rhombohedrally shaped crystals of eel lectin reached a size of 0.3 mm in 72 h at 18 °C. These crystals belong to the space group R32, with unit cell dimensions a = 65.5, b = 65.5 and c = 245.1 Å (Table 1).

Diffraction data from crystals frozen at 100 K were obtained using CuKalpha radiation from a rotating anode RU-H3R with an image plate detector Raxis IV++. All data were processed with CrystalClear (RIGAKU).

A gold heavy atom isomorphous derivative was obtained by overnight soaking of a crystal in mother liquor, with the addition of K2AuCl4 at a final concentration of 500 muM.

Structure determination.
A single gold heavy atom position, found by inspection of the difference Patterson map, was refined using MLPHARE37. The residual map calculated using phases from the single atom partial solution showed the presence of two additional minor sites that were added to the refinement. Heavy atom parameters were initially refined using only centric reflections to a resolution of 3 Å and finally refined including acentric reflections and anomalous differences. The final heavy atom model includes anisotropic temperature factors as parameters. Phases from SIRAS, calculated to 2.5 Å resolution (heavy atom derivative limit), were modified by solvent flattening and histogram matching, and extended to 1.9 Å using DM37. The resulting electron density map shows clear density for secondary structure elements, side chains and ligands. After identification of the correct enantiomer, using the right-handedness of consecutive beta-strands as an indicator, the initial model was built without difficulty using O38 in the high quality experimental map and refined with CNS39. The sequences for the N- and C-termini were taken from the work of Kelly16. The last three residues, which did not show interpretable density for their side chains, were modeled as Ala residues. For the rest of the residues, the tentative sequence was obtained by selecting on the basis of the electron density from the seven sequences of the isoforms of A. japonica fucolectin. In cases in which the density suggested a residue not present in those sequences, a new residue based in density shape and protein environment was assigned. It is interesting to note that, toward the end of the refinement, weak indications of alternative sequences could be observed in some cases, suggesting a minor presence in the crystal of other isolectins. The final structure has excellent geometry and average B-values compatible with the resolution of the data and no sigma cut-off in Fs (Table 1).

Modeling and database search.
Quanta (MSI Inc.) was used to model and dock the oligosaccharides. The search for homologous sequences was performed with BLASTP v2.2.1 at NCBI gene-bank in the nonredundant database (12/2001)40. Structural homology was searched using DALI24.

Coordinates.
Coordinates and structure factors have been deposited in the Protein Data Bank (accession code 1K12).

 Top
Received 13 February 2002; Accepted 20 May 2002; Published online: 1 July 2002.

REFERENCES
  1. Hoffmann, J.A., Kafatos, F.C., Janeway, C.A. & Ezekowitz, R.A. Phylogenetic perspectives in innate immunity. Science 284, 1313−1318 (1999). | Article | PubMed | ISI | ChemPort |
  2. Vasta, G.R., Quesenberry, M., Ahmed, H. & O'Leary, N. C-type lectins and galectins mediate innate and adaptive immune functions: their roles in the complement activation pathway. Dev. Comp. Immunol. 23, 401−420 (1999). | Article | PubMed | ISI | ChemPort |
  3. Saito, T., Hatada, M., Iwanaga, S. & Kawabata, S. A newly identified horseshoe crab lectin with binding specificity to O-antigen of bacterial lipopolysaccharides. J. Biol. Chem. 272, 30703−30708 (1997). | Article | PubMed | ISI | ChemPort |
  4. Honda, S., Kashiwagi, M., Miyamoto, K., Takei, Y. & Hirose, S. Multiplicity, structures, and endocrine and exocrine natures of eel fucose-binding lectins. J. Biol. Chem. 275, 33151−33157 (2000). | Article | PubMed | ISI | ChemPort |
  5. Springer, G.F. & Desai, P.R. Monosaccharides as specific precipitinogens of eel anti-human blood-group H(O) antibody. Biochemistry 10, 3749−3761 (1971). | Article | PubMed | ISI | ChemPort |
  6. Baldus, S.E. et al. Characterization of the binding specificity of Anguilla anguilla agglutinin (AAA) in comparison to Ulex europaeus agglutinin I (UEA-I). Glycoconj. J. 13, 585−590 (1996). | Article | PubMed | ISI | ChemPort |
  7. Morgan, W.T. & Watkins, W.M. Unravelling the biochemical basis of blood group ABO and Lewis antigenic specificity. Glycoconj. J. 17, 501−530 (2000). | Article | PubMed | ISI | ChemPort |
  8. Macedo-Ribeiro, S. et al. Crystal structures of the membrane-binding C2 domain of human coagulation factor V. Nature 402, 434−439 (1999). | Article | PubMed | ISI | ChemPort |
  9. Ito, N. et al. Novel thioether bond revealed by a 1.7 Å crystal structure of galactose oxidase. Nature 350, 87−90 (1991). | Article | PubMed | ISI | ChemPort |
  10. Gaskell, A., Crennell, S. & Taylor, G. The three domains of a bacterial sialidase: a beta-propeller, an immunoglobulin module and a galactose-binding jelly-roll. Structure 3, 1197−1205 (1995). | Article | PubMed | ISI | ChemPort |
  11. Wendt, K.S. et al. Crystal structure of the APC10/DOC1 subunit of the human anaphase-promoting complex. Nature Struct. Biol. 8, 784−788 (2001). | Article | PubMed | ISI | ChemPort |
  12. Marintchev, A. et al. Solution structure of the single-strand break repair protein XRCC1 N-terminal domain. Nature Struct. Biol. 6, 884−893 (1999). | Article | PubMed | ISI | ChemPort |
  13. Kairies, N. et al. The 2.0 Å crystal structure of tachylectin 5A provides evidence for the common origin of the innate immunity and the blood coagulation systems. Proc. Natl. Acad. Sci. USA 98, 13519−13524 (2001). | Article | PubMed | ChemPort |
  14. Rudd, P.M., Elliott, T., Cresswell, P., Wilson, I.A. & Dwek, R.A. Glycosylation and the immune system. Science 291, 2370−2376 (2001). | Article | PubMed | ISI | ChemPort |
  15. Bezkorovainy, A., Springer, G.F. & Desai, P.R. Physicochemical properties of the eel anti-human blood-group H(O) antibody. Biochemistry 10, 3761−3764 (1971). | PubMed | ISI | ChemPort |
  16. Kelly, C. Physicochemical properties and N-terminal sequence of eel lectin. Biochem. J. 220, 221−226 (1984). | PubMed | ISI | ChemPort |
  17. Pierschbacher, M.D., Hayman, E.G. & Ruoslahti, E. The cell attachment determinant in fibronectin. J. Cell. Biochem. 28, 115−126 (1985). | Article | PubMed | ISI | ChemPort |
  18. Springer, G.F. & Desai, P.R. The immunochemical requirement for specific activity and the physiological properties of eel anti-human blood-group H(O) 7 S Globulin. Vox Sang. 18, 551−554 (1970). | PubMed | ISI | ChemPort |
  19. Imberty, A. Oligosaccharide structures: theory versus experiment. Curr. Opin. Struct. Biol. 7, 617−623 (1997). | Article | PubMed | ISI | ChemPort |
  20. Imberty, A. et al. Computer simulation of histo-blood group oligosaccharides: energy maps of all constituting disaccharides and potential energy surfaces of 14 ABH and Lewis carbohydrate antigens. Glycoconj. J. 12, 331−349 (1995). | Article | PubMed | ISI | ChemPort |
  21. Bush, C.A., Martin-Pastor, M. & Imberty, A. Structure and conformation of complex carbohydrates of glycoproteins, glycolipids, and bacterial polysaccharides. Annu. Rev. Biophys. Biomol. Struct. 28, 269−293 (1999). | Article | PubMed | ISI | ChemPort |
  22. Perez, S. et al. Crystal and molecular structure of a histo-blood group antigen involved in cell adhesion: the Lewis x trisaccharide. Glycobiology 6, 537−542 (1996). | PubMed | ISI | ChemPort |
  23. Rini, J.M. & Lobsanov, Y.D. New animal lectin structures. Curr. Opin. Struct. Biol. 9, 578−584 (1999). | Article | PubMed | ISI | ChemPort |
  24. Holm, L. & Sander, C. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233, 123−138 (1993). | Article | PubMed | ISI | ChemPort |
  25. Tettelin, H. et al. Complete genome sequence of a virulent isolate of Streptococcus pneumoniae. Science 293, 498−506 (2001). | Article | PubMed | ISI | ChemPort |
  26. Adams, M.D. et al. The genome sequence of Drosophila melanogaster. Science 287, 2185−2195 (2000). | Article | PubMed | ISI |
  27. Seery, L.T., Schoenberg, D.R., Barbaux, S., Sharp, P.M. & Whitehead, A.S. Identification of a novel member of the pentraxin family in Xenopus laevis. Proc. R. Soc. Lond. B Biol. Sci. 253, 263−270 (1993). | PubMed | ISI | ChemPort |
  28. Meindl, A. et al. A gene (SRPX) encoding a sushi-repeat-containing protein is deleted in patients with X-linked retinitis pigmentosa. Hum. Mol. Genet. 4, 2339−2346. (1995). | PubMed | ISI | ChemPort |
  29. Horejsi, V. & Kocourek, J. Studies on lectins. XXXVI. Properties of some lectins prepared by affinity chromatography on O-glycosyl polyacrylamide gels. Biochim. Biophys. Acta 538, 299−315 (1978). | PubMed | ISI | ChemPort |
  30. Dodd, R.B. & Drickamer, K. Lectin-like proteins in model organisms: implications for evolution of carbohydrate-binding activity. Glycobiology 11, 71R−79R (2001). | Article | PubMed | ISI | ChemPort |
  31. Zhang, H. et al. Crystal structure of YbaK protein from Haemophilus influenzae (HI1434) at 1.8 Å resolution: functional implications. Proteins 40, 86−97 (2000). | Article | PubMed | ISI | ChemPort |
  32. Saukkonen, K., Burnette, W.N., Mar, V.L., Masure, H.R. & Tuomanen, E.I. Pertussis toxin has eukaryotic-like carbohydrate recognition domains. Proc. Natl. Acad. Sci. USA 89, 118−122 (1992). | PubMed | ChemPort |
  33. Baumgartner, S., Hofmann, K., Chiquet-Ehrismann, R. & Bucher, P. The discoidin domain family revisited: new members from prokaryotes and a homology-based fold prediction. Protein Sci. 7, 1626−1631 (1998). | PubMed | ISI | ChemPort |
  34. Vogel, W. Discoidin domain receptors: structural relations and functional implications. FASEB J. 13, S77−S82 (1999). | PubMed | ISI | ChemPort |
  35. Poole, S., Firtel, R.A., Lamar, E. & Rowekamp, W. Sequence and expression of the discoidin I gene family in Dictyostelium discoideum. J. Mol. Biol. 153, 273−289 (1981). | Article | PubMed | ISI | ChemPort |
  36. Weis, W.I., Drickamer, K. & Hendrickson, W.A. Structure of a C-type mannose-binding protein complexed with an oligosaccharide. Nature 360, 127−134 (1992). | Article | PubMed | ISI | ChemPort |
  37. Collaborative Computational Project, Number 4. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. D 50, 760−763 (1994). | Article | ISI |
  38. Jones, T.A., Zou, J.Y., Cowan, S.W. & Kjeldgaard, M. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A 47, 110−119 (1991). | Article | PubMed | ISI |
  39. Brünger, A. et al. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905−921 (1998). | Article | PubMed | ISI |
  40. Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389−402 (1997). | Article | PubMed | ISI | ChemPort |
  41. Esnouf, R.M. Further additions to MolScript version 1.4, including reading and contouring of electron density maps. Acta Crystallogr. D 55, 938−940 (1999). | Article | PubMed | ISI |
  42. Merrit, E.A. & Bacon, D.J. RASTER3D Methods Enzymol. 277, 505−524 (1997). | ChemPort |
  43. Nicholls, A., Sharp, K. & Honig, B. Protein folding and association: insights from the interfacial and thermodynamic properties of hydrocarbons. Proteins Struct. Func. Genet. 11, 281−286 (1991). | ISI | ChemPort |
  44. Gouet, P., Courcelle, E., Stuart, D.I. & Metoz, F. ESPript: analysis of multiple sequence alignments in PostScript. Bioinformatics 15, 305−308 (1999). | Article | PubMed | ISI | ChemPort |
 Top
Acknowledgments
We thank D. Leahy and G. Hart for critical reviewing of the manuscript. This research is supported by a grant from National Institute of General Medical Sciences to L.M.A. and a grant from the National Science Foundation to G.R.V.

Competing interests statement:  The authors declare that they have no competing financial interests.

FULL TEXT
Previous
Table of contents
Download PDFDownload PDF
Send to a friendSend to a friend
Save this linkSave this link
More articles like this

Open Innovation Challenges

naturejobs

Abstract
Methods
Figures & Tables
Acknowledgments
References
Export citation
Export references
natureproducts

Search buyers guide:

 
ADVERTISEMENT
 
Nature Structural & Molecular Biology
ISSN: 1545-9993
EISSN: 1545-9985
Journal home | Advance online publication | Current issue | Archive | Press releases | Supplements | For authors | Online submission | Permissions | For referees | Free online issue | About the journal | Contact the journal | Subscribe | Advertising | work@npg | naturereprints | About this site | For librarians
Nature Publishing Group, publisher of Nature, and other science journals and reference works©2002 Nature Publishing Group | Privacy policy