Many viral surface glycoproteins and cell surface receptors are homo-oligomers1,2,3,4, and thus can potentially be targeted by geometrically matched homo-oligomers that engage all subunits simultaneously to attain high avidity and/or lock subunits together. The adaptive immune system cannot generally employ this strategy since the individual antibody binding sites are not arranged with appropriate geometry to simultaneously engage multiple sites in a single target homo-oligomer. We describe a general strategy for the computational design of homo-oligomeric protein assemblies with binding functionality precisely matched to homo-oligomeric target sites5,6,7,8. In the first step, a small protein is designed that binds a single site on the target. In the second step, the designed protein is assembled into a homo-oligomer such that the designed binding sites are aligned with the target sites. We use this approach to design high-avidity trimeric proteins that bind influenza A hemagglutinin (HA) at its conserved receptor binding site. The designed trimers can both capture and detect HA in a paper-based diagnostic format, neutralizes influenza in cell culture, and completely protects mice when given as a single dose 24 h before or after challenge with influenza.
Circulating influenza viruses undergo constant antigenic drift 9,10,11, which renders acquired immunity against previous seasonal strains ineffective. The occasional adaptation of avian or swine viruses to human hosts12 poses a serious threat to global health. Resistance has evolved to widely used anti-viral drugs13,14, and seasonal vaccines are unlikely to provide much, if any, protection against newly emerging viruses. Thus, general or quickly adaptable solutions are needed for both diagnostic and therapeutic applications. The surface glycoprotein HA enables viral entry into host cells by binding to cell-surface, sialic-acid-containing glycans (Fig. 1a) and mediating fusion between the viral and host membranes in endosomal compartments. As the regions on HA involved in binding and fusion are highly conserved, they are attractive sites for binder design. The affinity of an individual receptor-binding site (RBS) for a sialic-acid moiety is low (Kd ∼1 mM)15, and small-molecule analogs and oligomers of sialic acid have similarly low affinities, preventing their use in clinical settings to date16. Trimerization of terminal sialic-acid moieties using flexible alkyl chains resulted in only weak affinity, likely due to the entropic cost of fixing the long flexible alkyl chains17. Broadly neutralizing antibodies targeting the stem and head regions of HA have been identified5,6,7,18; the former antibodies have broader binding specificity as they bind to a larger highly conserved patch, but the latter can neutralize more potently as the head region is more accessible19,20. Neutralizing antibodies use two related strategies to bind the head region: direct mimicry of the receptor using the carboxylic acid of an aspartic acid and backbone atoms to mimic the carboxylate and acetamido groups of sialic acid, respectively, or insertion of a large hydrophobic amino acid into the RBS pocket5 (Fig. 1b), with a backbone carbonyl group making interactions similar to those of the sialic-acid carboxylate6,7,21 (Supplementary Fig. 1). The heavy-chain CDR3 (HCDR3) of the C05 broadly neutralizing antibody, which inserts into the RBS, contains internal, self-stabilizing hydrogen bond interactions, similar to a β-hairpin (Fig. 1a), and provides extensive hydrogen bonding interactions with backbone atoms of the 130 loop of HA5 (Fig. 1b).
In a first step toward the design of a high-avidity trimeric flu inhibitor, we developed a computational approach to design small protein mimics of the C05 binding mode. We searched for close backbone alignments of the first and last residues in 7–15 residue segments of the C05 HCDR3 with pairs of positions in a set of 1,718 native proteins (Supplementary Table 1). At sites with close matches, the intervening segment in the scaffold protein was replaced with the antibody loop segment, the bonded geometry at the junctions between the introduced loop and the scaffold was idealized22, and the interactions within the designed protein and with HA (positioned as in the C05 co-crystal structure, Fig. 1d) were optimized in sequence design calculations favoring key interface contacts and overall shape complementarity. Designs with low computed monomer and HA interaction energies were selected for experimental characterization. This approach goes beyond previous efforts to graft antibody loops in using near-perfect geometric alignment with the variable parts of the CDR, explicitly modeling and optimizing the junctions between the scaffold and CDR, and optimizing the scaffold to support the CDR and make additional interactions with the target22,23.
Genes encoding the 80 selected designs were screened as a pool against H3 A/Hong Kong/1/1968 HA (H3 HK68) using yeast surface display24, while in parallel monitoring off-target binding (Fig. 1c and data not shown). Five designs specifically bound H3 HK68 HA; all contained grafts of short 7-8 residue fragments from the tip of the C05 loop; longer loop fragments likely require more extensive supporting contacts to prevent alternative loop configurations. Screening with a panel of HAs from different flu strains showed that these initial designs only bound H3 HA, which is not surprising since there is considerable sequence and structure divergence in the periphery of the sialic-acid binding site in other subtypes (Fig. 2a, Supplementary Fig. 2). Screening of simple random mutagenesis libraries resulted in only very limited extension of the binding spectrum (Supplementary Fig. 3).
To effectively target the diverse binding pockets presented by different HA molecules, we sought to match sequence variation in HA with complementary diversity in the designed binder focusing on the design with the smallest buried surface area, which we refer to as HSB (Fig. 2a,b). Selections were carried out either alternating between strains representing group 1 and group 2 HA subtypes (Supplementary Table 5), or against individual strains (Supplementary Fig. 4). The varying specificities of the resulting clones can be understood based on interactions with features that vary in different HA molecules (Fig. 2c–f, Supplementary Fig. 5 and Supplementary Table 6). As HSB.1C had the broadest binding spectrum, we proceeded with this variant and further optimized its stability (Supplementary Fig. 6), resulting in variant HSB.2.
Other than a domain swap observed previously for cystatin folds, the crystal structure of HSB.2 agrees closely with the design model (0.44 Å r.m.s. deviation, Supplementary Fig. 6a and Supplementary Table 2). To avoid the domain swap, we rigidified loop 2 (HSB.2A; Online Methods), and succeeded in solving the structure of HSB.2A in complex with H3 HK68 HA at 2.6 Å resolution (Fig. 3a–c and Supplementary Table 2). The structure confirms the designed mode of binding, with three HSB.2A molecules bound to the HA (Fig. 3a) and with all designed hydrogen bonds between the grafted loop and HA present (Fig. 3b,c). In the bound state, the contacting hairpin containing the grafted loop moves slightly away from the protein core (Supplementary Fig. 7b).
With a well-characterized small binding domain in hand, we proceeded to the second step of our avid-binder design strategy and designed trimeric versions of HSB with the three units of HSB bound to the three sialic-acid-binding sites on trimeric HA. We sampled translations and rotations of homo-trimers from the PDB (Supplementary Table 3) along the symmetry axis of the HA trimers (Fig. 3d), searching for proximity between the termini of naturally occurring trimers and the termini of HSB when bound to the HA. Charges and potentially clashing bulky residues at the newly formed interface with HA were replaced with neutral and smaller amino acids, respectively, and the gap between the two domains (trimerization unit and binder) was closed with a Gly-Ser linker. Nine designs were experimentally tested; we chose the one with the most monodisperse size-exclusion chromatography (SEC) trace (Tri-HSB.1C), which was also expressed at high levels (more than 100 mg/L culture). Trimerization of HSB.1C and HSB.2 reduced the off-rate for HA by more than 1,000-fold (Fig. 3f and Supplementary Fig. 8), resulting in sub- and single-digit nanomolar binding to a variety of H1, H2, H3, H9 and H12 strains (Supplementary Table 4; the approximately threefold decrease in the on-rate is likely due the fact each unit has to bind independently). Tri-HSB.2 binds H3 A/Victoria/361/2011 (Vic11), a close relative of current circulating strains, with low nanomolar affinity (the monomeric design does not give a binding signal; Fig. 2c). The trimer has a similar overall breadth in binding as the C05 IgG, with higher affinity than the IgG for several H2 and H12 strains and weaker affinity for several H3 strains (Supplementary Table 4). Because of its binding geometry, the IgG C05 can engage only one of three sialic-acid binding sites on a single HA molecule, but can achieve avidity by binding sites in two different HA trimers5.
Asymmetric reconstruction of negative-stain, single-particle electromicroscopic (EM) images of the Tri-HSB.1C–HA complex yielded clear density for the three trimerization domains and three binding domains (one of the connecting loops is not visible); the computational model of the trimer fits the density well (Fig. 3d and Supplementary Fig. 9). Small-angle X-ray scattering (SAXS) and hydrogen-deuterium exchange with mass spectrometry (HDX-MS) data were also consistent with the design model (Supplementary Figs. 10 and 11); HA loop 130, which extensively forms hydrogen bonds with the edge β-strand of Tri-HSB.2, is protected from deuterium exchange in the bound state, as are residues near the RBS (Supplementary Fig. 11).
We tested the utility of Tri-HSB for detecting HA in a paper-based assay similar to what might be used for diagnosis of infectious diseases at point-of-care sites or in home-testing kits25. Tri-HSB is effective both for capturing and detecting HA in a lateral flow assay, with a very clear band on the nitrocellulose strip and little or no background using either gold nanoparticles or diaminobenzidine (DAB) precipitation by conjugated horseradish peroxidase (HRP; Fig. 4a).
We next investigated whether Tri-HSB.2 could neutralize influenza virus in vitro and protect animals against infection. In vitro, Tri-HSB.2 potently neutralized human H3 HK68 (Fig. 4b) at subnanomolar concentrations with an IC50 < 150 pM (Fig. 4b), in the range of the apparent Kd. The weaker binder Tri-HSB.1C with an apparent Kd of 12 nM has an IC50 of 19 nM. To determine whether Tri-HSB.2 can protect against influenza in vivo, as a therapeutic or prophylactic, we administered a single intranasal dose of Tri-HSB.2 (3 mg/kg) to BALB/c mice either 24 h before or after challenge with a high dose (1,000-fold TCID50) of H3 HK68 virus. Both treatments fully protected mice from disease (Fig. 4c), while untreated control mice and mice treated with a non-binding double point mutant, Tri-HSB.2.ko, had significant disease and weight loss (>25%).
The adaptive immune system is capable of generating very high-affinity interactions with target proteins, but the fixed geometry of antibodies does not allow the positioning of the binding sites to maximally engage homomeric antigens. Our approach goes beyond elegant recent work using DNA linkers to optimize the spacing between Fab fragments for HIV binding26 by (1) optimizing both the distance and the relative orientation of the three displayed binding moieties using structure-based design, (2) bringing together three binding domains rather than two and (3) achieving the desired orientation with protein scaffolding rather than DNA, which has advantages for downstream applications. Our two-step protocol for optimally positioning mimics of antibody combining sites for maximally avid engagement of target homo-oligomers has considerable potential for both diagnostic and therapeutic applications. The power and applicability of our approach should continue to increase as additional crystal structures of antibodies in complex with both pathogen and homo-oligomeric surface receptors are elucidated.
Design of the RBS binding proteins. For a scaffold database, we curated 1,718 high-resolution, monomeric crystal structures from the PDB and relaxed them into the Rosetta energy function. Using Epigraft22, segments of varying lengths extracted from the heavy chain CDR3 of the C05 antibody (PDB ID 4FP8) ranging from Ser97 to Leu100K (or Ala100I) were aligned with each scaffold. If the r.m.s. distance between the joining backbone endpoints was less than 0.25 Å and diverged less than 1.0 Å when aligning the replaced scaffold fragment with the CDR fragment motif, grafting was attempted. All predicted interface residues of suitable scaffolds were substituted with alanine and, using small-fragment insertion and backbone minimization, we attempted to close broken bonds. For the top ten closed solutions, the remainder was briefly redesigned by Epigraft22. Both intra- and intermolecular interfaces were then extensively redesigned using three rounds of the most current RosettaDesign28 while also allowing rigid-body minimization, but maintaining key contacts (100A, 100C-F) of the grafted piece. The best scoring design for each successful graft solution was kept if hydrogen bonds to the HA 130 loop and Tyr98 were maintained and had a computed binding energy below −12 units. All residues that were changed and did not contribute to the computed ΔΔG of binding were reverted to their native identities. Source code is freely available to academic users through the Rosetta Commons agreement (https://rosettacommons.org).
Computation-based design of disulfide bridges. To identify possible locations for the incorporation of two cysteine residues to form a disulfide bridge in HSB variants without disturbing the backbone, we used RosettaRemodel29. Briefly, Cβ distances between all residue pairs were calculated and disulfide bridges were introduced, and the model was relaxed and minimized when the appropriate distances were identified.
Loop 2 Re-design. To prevent domain swapping and enable crystallization with HA, loop 2 of HSB was redesigned. Ideal phi-psi angles and loop truncations were sampled using blueprint files30 and sequence design was biased toward amino acid identities commonly seen within beta-sheet turns. Out of ten tested designs, a 3-residue truncated version, HSB.2A, was expressed well and ran as a mono-disperse peak on SEC.
Homology modeling. The query sequence was threaded onto the HA structure with the closest sequence similarity as identified by BLAST. The model was then subjected to several rounds of relaxation while constraining the backbone atoms to their original coordinates. We used the following PDBs as templates for homology models: chain A of 3sm5 for H1 SI and H1 NewCal, and chain A of 4hfu for H2 Ada.
HA sequence alignments and Shannon entropy. Non-redundant human influenza A H3 sequences for HA were downloaded from the Influenza Research Database (https://www.fludb.org/) and only sequences from complete genomes were extracted and aligned using Muscle31. Shannon entropy was calculated using method calls from the Python-based ProDy package32.
Design of trimeric binding units. Translations and rotations of 300 homo-trimeric structures were sampled along the symmetry axis of the HA trimers and scored on the basis of their proximity to either termini while avoiding clashes. Out of the successful solutions, we focused on small trimers of thermophilic organisms with compatible geometry for loop closure between the identified trimer and HSB. Residues of the trimer that could interfere with binding to HA were “shaved off,” for example, charges were designed to become neutral residues, and potentially clashing bulky residues were changed to smaller amino acids in FoldIt33. The original trimeric scaffolds were selected to have a minimal distance toward either terminus of HSB, which allows loop closure with a simple glycine-serine linker. Loops were closed via RosettaRemodel29. Blueprint files were generated so that 2-, 3-, 5- or 8-residue insertions between the trimerizing unit and HSB were modeled. Trimerization is based on the 1nza PDB coordinates. For initial experimental characterization, we fused the trimer adaptor to the HSB.1C variant. Different loop lengths for most successful design were cloned and expressed.
Design library. DNA encoding the designed proteins (Gen9 Inc.) flanked by homologous region of the pETCON plasmid was co-transformed with linearized plasmid (NheI/XhoI/BglII digested) into EBY100 cells24 and expressed as Aga2 fusions.
PCR-mutagenesis. The genes for HSB were subjected to an error-prone PCR amplification using the Metamorph Mutagenesis kit (Agilent). Briefly, using primers upGS, downMyc and 50 ng of total plasmid DNA as a template, 31 cycles of 94 °C, 54 °C for annealing and 72 °C for primer extension were performed. Genes were purified through gel extraction (Qiagen). As the first mutagenesis PCR resulted in less than 20% of the genes with at least one mutation, we repeated the amplification using the mutagenized DNA as a template, resulting in an error rate of an average of one to two mutations per gene as determined by ten randomly chosen clones.
Generation of the combinatorial libraries. The two combinatorial gene libraries (one to address the HA diversity and the second to introduce stabilizing mutations identified via selections at increasing temperatures) were generated through recursive assembly of 4 ultramers (IDT) containing degenerate codons. Two fragment pieces annealing (HSBlib.gII.frag1 or HSB.1Cdis.frag1 and HSBlib.gII.frag2a or HSB.1Cdis.frag2) were amplified through outer primers (HSBlib.gII.For or HSB.1CdisYFor and HSBlib.gII.Rev or HSB.1CcysYRev). To limit the library size for the HA diversity-addressing libraries, which increased due to additional diversity gained by the use of degenerate codons, we designed two gene libraries for fragment 1 (a and b). After assembly, the amplification reactions were combined. DNA was gel-purified before transformation.
Generation of site-saturated mutagenesis (SSM) library. All possible point mutations for each position were generated through overlap PCR. To simplify the preparation step and to generate standardized primers, we automated the primer design step with a custom Python script. We designed one shorter reverse primer and a longer forward primer that contained the mismatch encoded by NNK for each of the 87 positions. For the PCR generating the two overlapping DNA fragments, 25 ng of the plasmid DNA was used together with primers colonyF and colonyR. As template served DNA encoding HSB.1C with a disulfide bridge between position 8 and 40. The overlap fusing reaction for each position was performed by combining 1 μl of the two fragments followed by 30 cycles of amplification using the primer colonyF and colonyR primers and the same PCR conditions as the first reaction. All 87 overlap PCRs were checked for their success through agarose gel electrophoresis. Reactions (5 μl of each) were combined and gel extracted as a pool.
For library transformation, EBY100 (ref. 24) cells were transformed with 2 μg of the gene library together with 1 μg linearized pETCON (SalI, XhoI, NheI digested)27 using previously reported protocol34. Transformation resulted in 2–5 × 107 cells.
Library generation and yeast surface selections and titrations.
Yeast cells were induced for about 16–18 h at 22 °C, then washed once with PBS containing 0.1% BSA (PBSF). For the very first round of selections, unless noted, cells were typically incubated for 4 h at 4 °C with the indicated HA, before adding a fourth of the used concentration streptavidin conjugated to phycoerythrin (SAPE, Invitrogen) and 2 ng/ml anti-c-Myc antibody conjugated to fluorescein isothiocyanate (FITC) and incubating for another hour. For non-avid conditions in later sorts, cells were incubated for 2 h at 4 °C while rotating, unless noted otherwise, and then washed once with ice-cold PBSF before resuspending in 100 mL with 72 nM SAPE and 2 ng/ml anti-c-Myc antibody conjugated to FITC for 30 min on ice. Cells were washed once with 1 mL ice-cold PBSF and stored as pellets on ice before their final resuspension in 1 mL ice-cold PBSF followed by fluorescence-activated cell sorting (FACS). The combinatorial library was screened by sorting under a less stringent, avid condition at 1,000 nM with a 4:1 ratio of SAPE, followed by non-avid, more stringent selection at 500 nM, 125 nM and 50 nM H1 SI HA.
Library preparation and next-generation sequencing.
Plasmids were extracted from 5 × 107 yeast cells for each selection and starting pool as previously described35. After ExoI (NEB) and Lambda exonuclease (NEB) treatment, the Illumina sequencing primers sequence, flow cell adapters and selection-specific barcodes were added via two nested PCRs using NEBnext. They also add 12 entirely degenerate bases at the beginning of the forward and reverse read, ensuring adequate diversity for the Illumina base-calling algorithms. Amplicons were sequenced as 250 paired-end reads with 15% phiX at 10-pM concentration using a MiSeq sequencing system (Illumina). After quality control filtering35, we calculated enrichment for each sort by dividing the frequencies of 'selected' by ones seen in the starting pool.
Protein expression and purification of design variants.
For expression of the design variants in E. coli, genes were cloned into pET29b between the restriction sites NdeI and XhoI and transformed into BL21Star (Invitrogen). For expression, 10 ml of overnight culture grown in Terrific Broth (BD Difco) was used to start a 500-ml culture using Studier's autoinduction medium but replacing the N-Z-amine fraction with Terrific Broth (TB). Cells were grown for 8 h at 37 °C, before reducing the temperature to 18 °C for another 14–16 h. Cells were resuspended in 35 ml phosphate buffered saline (PBS, 150 mM NaCl and 25 mM phosphate buffer at pH 7.4) and lysed using a M110P Microfluidizer (Microfluidics). Insoluble cell debris was removed by centrifugation for 20 min at 40,000g. Supernatant was applied to gravity-flow columns containing 2 mL of Ni-NTA for each 500 ml of culture, washed with 50 ml PBS and 50 ml PBS containing 30 mM imidazole. Proteins were eluted with 20 ml of 250 mM imidazole in PBS. If necessary, proteins concentrated to 1.5 mg/ml using a Vivaspin 10 kD MWCO centrifugal concentrator (Sartorius Stedim) at 4,000g. For animal studies, proteins were purified using column and an imidazole gradient. The 400 mM elution fraction was subjected to further cleaning via SEC and further processed using bacterial endotoxin removal beads (Miltenyi Biotec).
Expression, purification and biotinylation of HA.
Recombinant HA was expressed, purified and biotinylated as previously described18. In brief, HAs were expressed as a fusion with a C-terminal Avi-tag, Foldon trimerization domain and hexahistidine tag in Hi5 cells. After 3 d expression at 28 °C, cells were removed by centrifugation and the culture supernatant was concentrated and buffer exchanged into 50 mM Tris pH 8.0, 300 mM NaCl, 10 mM imidazole before overnight incubation with Ni-NTA resin at 4 °C. The resin was washed with 50 mM Tris pH 8.0, 300 mM NaCl and 10 mM imidazole, and bound protein was eluted in the same buffer containing 250 mM imidazole. Proteins used in crystallographic or EM studies were digested using trypsin at a 1:1,000 mass ratio overnight at 4 °C and further purified by size-exclusion chromatography on a Superdex S200 16/90 column (GE) in 25 mM Tris pH 8.0, 150 mM NaCl. Hemagglutinin used in library selections and bio-layer interferometry were site specifically biotinylated by incubation with recombinantly expressed BirA in 100 mM Tris pH 8.0, 10 mM ATP, 10 mM magnesium acetate and 50 μM biotin, and purified by size-exclusion chromatography as above.
Optimization of HSB.1C.
After the introduction of a disulfide bridge to HSB.1C (see computational design methods above), we generated a site-saturation mutagenesis (SSM) library of all positions and sorted for binding after incubation with HA at 22 °C, 30 °C, 37 °C and 42 °C (Supplementary Fig. 5). SSM libraries were incubated with 5 nM H1 SI in 50 μl at 22 °C, 30 °C, 37 °C and 42 °C for 2 h while rotating, washed once with ice-cold PBSF and incubated on ice for suspending in 100 mL with 72 nM SAPE and 2 ng/ml anti-c-Myc antibody. After deep-sequencing of all pools, we used linear regression of the enrichment values for each sort at increasing temperatures to identify substitutions that enable better binding at higher temperatures and plotted these as a heat map using a custom Python script. Substitutions that were increasingly enriched over the temperature range were included in the design of a combinatorial library if they were not part of the interface with HA. The variant with the highest expression level in E. coli was selected, which also exhibited the most monodisperse SEC trace. This variant named HSB.2 and used for further analysis. It differs from HSB.1C at eight positions. The combinatorial library based on the identified mutations was screened by sorting at 5 nM H1 SI at 37 °C.
BioLayer interferometry (BLI).
Titrations were performed on an OctetRED96 BLI system (ForteBio, Menlo Park, CA) using streptavidin-coated biosensors. Sensors were equilibrated for 20 min in PBSTB buffer (PBS, 0.002% Tween 20, 0.01% BSA). Titrations were executed at 30 °C while rotating at 1,000 r.p.m.; baseline were set for 60 s. Each sensor was loaded with 7–20 nM biotinylated HA for 100 s. Data were fit to a 1:1 interaction. For measurements resulting in apparent Kd values below 1 nM, we report only as subnanomolar affinity, as the off-rates were too slow to measure accurately.
Crystallization of the unbound monomer and complex.
Initial crystal screening was carried out at the JCSG robotic crystallization facility using the Rigaku Crystalmation system. Crystals of HSB.2 were grown by sitting drop vapor diffusion in drops containing a protein and mother liquor at a 1:1 ratio with a well solution containing 15% PEG 8000, 0.1 M HEPES pH 7.9. Crystals were cryo-protected in mother liquor supplemented with 20% ethylene glycol before flash freezing in liquid N2. Diffraction data were collected at the Stanford Synchrotron Radiation Lightsource (SSRL) beam line 12-2. For the HA-HSB complex, H3 HK68 and HSB.2A were incubated at a 1:1.5 molar ratio for 1 h at 4 °C before the sample was concentrated to 10 mg/ml in 25 mM Tris pH 8.0, 150 mM NaCl. Crystals were grown with a well solution of 10% PEG 6000, 100 mM 2-(N-morpholino)ethanesulfonic acid (MES) pH 6.0 using the sitting drop vapor diffusion method and cryo-protected in well solution supplemented with 20% ethylene glycol before being flash frozen in liquid N2. Data were collected at APS beamline 23 ID-D (GM/CA @ APS) and processed with HKL2000 (ref. 36). Phaser37 was used for molecular replacement within Phenix using two search models; a single protomer of H3 HK68 HA from PDB ID 4FNK and a monomeric model of HSB.2. The resulting solution contained one HA trimer and two copies of HSB. Density for the third HSB copy was clearly observable at the third RBS and a copy was manually placed by superimposing an HA-HSB complex from within the model. The position of the manually placed copy was further optimized by rigid body fitting within the FO-FC map in COOT38 and rigid body refinement in Phenix39. The model was refined through iterative rounds of model building in COOT and refinement in Phenix. Several regions of the third copy of HSB are poorly ordered. The final model contains all regions of the protein with 2FO-FC density for backbone atoms at 1.0 σ. Glycans and waters were manually added and edited in COOT. The final models were assessed with quality metrics within the Phenix. refine interface and the JCSG quality control server, both of which utilize MolProbity40.
Samples were prepared by mixing purified HK68 HA with trimeric-HSB (Tri-HSB.1C) at a 1:2 molar ratio. The complex was then purified by size-exclusion chromatography on Superdex S200 16/60 column equilibrated with TBS (25 mM Tris pH 8.0, 150 mM NaCl) to separate excess HSB. A thin layer of nitrocellulose was floated onto 400-mesh copper grids followed by a thin layer of evaporated carbon. The complex was diluted to ∼60 μg/ml and placed on a freshly glow discharged grid using a Gatan Solarus. Grids were stained with a 3 μL droplet of Nano-W for 7.5 s, blotted and allowed to dry. All samples were treated using this standardized method. Images were collected at 52kX magnification on a Tecnai T12 spirit at 120 kV equipped with a Tietz F416 4kx4k CMOS camera, resulting in a final pixel size of 2.05 Å/pixel. The Leginon41 software package was used for all automated image acquisition. Areas of the grids with thin crystalline Nano-W resulted in the best-resolved particles. Particles were picked using DoGpicker through the Appion web-based software package and stacked into 128 pixel boxes. Particles were aligned using either Xmipp CL2D42 or IMAGIC43 to remove suboptimal particles. Final alignment was performed with ISAC44 and 16 classes were chosen to build an initial 3D asymmetric model using EMAN2 (ref. 45). Further model refinement with 14,956 raw particles was performed using EMAN.
Circular dichroism spectroscopy.
HSB.2 was tested for folding and its melting temperature using an Aviv 420 circular dichroism spectrometer (Aviv Biomedical, Lakewood, NJ). Protein was diluted to 1 mg/mL in PBS and ellipticity between 205 and 260 nm was measured. To obtain its melting temperature by measuring ellipticity at 220 nm was measured in 2 °C increments between 15° and 95 °C.
Protease resistance assay.
Before evaluating the neutralization potential of design variants, proteins were tested for its susceptibility to proteases as the cell-based neutralization assay contains high amounts of trypsin. Cleavage can occur either if a site is accessible in a loop area of if the protein is partially unfolded and exposes a site. Proteins to be tested were diluted to 1mg/mL in PBS to which trypsin was added with a final concentration of 0.005% (w/v) (Gibco 0.25% (w/v) trypsin/phenol red, Life Technologies, Carlsbad, CA) and incubated at 37 °C. Time points were taken by removing 6 μL aliquots of solution. Digestion was instantly quenched by mixing with 6 μL of 2 × Laemmli Sample Buffer (Bio-Rad) and incubating it for 2 min at 95 °C. Samples were loaded onto a 12% NuPage Bis-Tris gel (Life Technologies, Carlsbad, CA) and run 1 × MES buffer (50 mM MES, 50 mM Tris Base, 0.1% SDS, 1 mM EDTA, pH 7.3). Trypsin did not degrade monomeric or trimeric constructs over the monitored period of 24 h at 37 °C.
Influenza and BHA preparation.
X-31 (H3N2) influenza A virus (Charles River Laboratories) was used for production and purification of soluble bromelain-cleaved (BHA) as previously described46.
Small angle X-ray scattering (SAXS) and all-atom modeling.
Using a 1:3 molar excess of the inhibitor, Tri-HSB.2 and BHA were incubated at 4 °C overnight and complex was purified via size exclusion chromatography (SEC) directly followed by SAXS. SAXS measurements were conducted on beam line 4-2 at the Stanford Synchrotron Radiation Lightsource. The buffer for size exclusion was 150 mM NaCl, 0.02% NaN3 10 mM sodium phosphate, 1 mM EDTA at pH 7.5. The eluate from the column was passed through a capillary cell and 1 s X-ray exposures were collected every 5 s. Scattering data were scaled for the transmitted beam intensity at each exposure, and azimuthally averaged. The radius of gyration (RG) and I(0) for each frame were batch analyzed using autoRg, and frames with stable RG47 values were merged in primus48. The particle distance distribution function [P(r)] plots were generated from GNOM49 and used to calculate a real-space RG and maximum linear dimension (DMax) of each protein.
All-atom ensemble modeling of glycosylated trimer.
All-atom glycosylated models of unliganded H3 BHA (3HMG) and BHA-bound Tri-HSB.2 were generated using MODELER, as previously described46. Glycans were added and set as flexible residues during simulated annealing. For the ensembles of unbound Tri-HSB.2, the glycine linkers connecting the binding domains to the trimerization domain were left unrestrained to allow them to sample conformational space while the trimerization fold was restrained as a rigid-body in the template. As for the BHA bound to Tri-HSB.2, three different ensembles were created in which one, two or three binding domains of Tri-HSB.2 were bound to one, two or three HA1 subunits, respectively, within the BHA trimer while constraining Tri-HSB.2 trimerization domain to the rigid-body template. Consistency of the models with the experimental SAXS data was analyzed with the FoXS webserver50,51. The goodness of fit (χ value) for each model relative to the experimental SAXS data of the corresponding protein complex was used to identify the models most consistent with the scattering data. FoXS was also used for multiple ensemble searches (MES)52 of conformations within a collection of submitted ensembles with distinct conformations (at least 30 all-atom ensembles) to determine whether a single state or a combination of states was sufficient for minimizing the χ value.
Hydrogen deuterium exchange (HDX).
To establish an internal standard for intrinsic exchange rates, 1 μM of the tetra peptides PPPI and PPPF were added to unliganded H3 BHA and complexes (prepared as above). HDX reactions of each protein stock were initiated by diluting 3 μg of the protein or complex into deuterated buffer (10 mM HEPES, 150 mM NaCl, 0.02% NaN3) for a final pH of 7.5 and incubated at room temperature for 3 s, 20 s, 3.5 min or 30 min. All reactions were quenched with ice-cold quench buffer (100 mM TCEP (Tris(2-carboxyethyl) phosphine), 0.2% formic acid) for a final pH of 2.5, rapidly frozen in liquid nitrogen and stored at −80 °C until analysis. Undeuterated controls were performed as above with optima pure (Thermo-Fisher) water in place of D2O. To account for “in-exchange” during online-pepsin digestion (described in the mass spectrometry section below), a “zero” time point was made by diluting 3 μg of stock protein or complex into the quenched deuteration buffer and freezing as mentioned above. Fully deuterated “protein” stocks were prepared by denaturing BHA (H3) in 10 mM DTT, 3.5 M Gnd-HCl, 10 mM HEPES, 150 mM NaCl at 80 °C for 30 min, and 3 μg denatured of BHA was added to deuteration buffer containing PPPI and PPPF for 30 min at 40 °C. Fully deuterated “peptide” stocks were made by collecting the BHA pepsin peptides that were digested by the online-pepsin column and eluted from the liquid chromatography gradient used for mass spectrometry (explained in further detail below). Following elution, the BHA peptides were dried and resuspended in the fully deuterated denaturant buffer, and deuteration of the peptides was carried out identically.
Analysis of Tri-HSB.2 complex and identification of HA peptides. Undeuterated BHA peptides were identified as described46 on a Waters Synapt G1 HDMS. HDX samples were loaded onto a 250 μL steel line UPLC (ultra-pressure liquid chromatography) loop and injected onto 2.1 × 50 mm cartridge filled with POROS AL/20 resin containing immobilized porcine pepsin (Worthing Labs) at 200 μL/min with solvent C (2% acetonitrile (ACN), 0.1% FA, 0.04% tri-fluoroacetic acid (TFA) pH 2.5) and peptides were trapped onto 1.7 μm 1 × 17 mm ethylene bridge hybrid (BEH) reversed phase C18 (Waters) trap column in-line with the pepsin column for 5 min. The peptides were then resolved using a BEH 1.7 μm 1x100 mm C18 (Waters) reversed phase column running a gradient of 5% to 40% solvent B for 9 min (solvent A: 2% acetonitrile (ACN), 0.1% FA, 0.04% TFA pH 2.5; solvent B: 100% ACN, 0.1% FA). The injection loop, pepsin, trap and analytical columns were kept at a constant 1 °C within the Waters HDX manager module. During the main chromatographic gradient, the pepsin column is switched offline and the LEAP Technologies dual arm PAL was programed to perform a series of pepsin column washes with two 250 μL injections of 2M guanidine hydrochloride (GndHCl), 0.1% TFA pH 2.2; one 250 μL 10% FA and one 250 μL injection of 0.1% FA with 200 μL/min of solvent C. After the main LC gradient, the pepsin column is back-flushed with 0.1% FA, 0.04% TFA at 100 μL/min for 15 min. During this time, the trap column was washed to minimize sample carryover by cycling through one 250 μL injection of 10% FA, one 250 μL injection of 50% trifluoroethanol, one 250 μL injection 80% methanol, one 250 μL injection of a 2:1 mixture of ACN to isopropanol and one 250 μL injection of 80% ACN and re-equilibrated with solvent C for subsequent injections. Mass shifts of each resolved isotopic envelopes were calculated by fitting to a tailored binomial distribution within the semi-automated HX-Express v2 software53 (HX-Express.com). Experimental error was calculated using the s.d. from duplicate measurements. For glycoproteins, the exchange for the major glycoform is reported. Percent exchange was calculated for each time point relative to the “zero” and fully deuterated standards. In a few cases where the signal for the fully deuterated “protein” standard was insufficient (presumably due to aggregation before digestion or altered digestion efficiency) the fully deuterated “peptide” standard was used to calculate the percent exchange.
Cell neutralization assay.
To evaluate whether the designed inhibitors could indeed neutralize in vitro, virus was grown as a green fluorescent protein (GFP)-based system, in which the pb1 gene is replaced by gfp, previously described as PB1flank-eGFP viruses54. The trimeric inhibitor variants were titrated and incubated with indicated virus at 37 °C for 1 h to allow protein binding before cells54 (not recently retested for mycoplasma) were added. After an 18-h incubation, the GFP fluorescence intensity of the cells was measured and the signal above background was normalized to the negative control. Values are reported as percent infectivity remaining averaged over triplicate measurements.
Diagnostic application: detection of hemagglutinin on paper.
Protein solutions were filtered through a centrifugal filter (0.2 μm nylon membrane, VWR) before spotting. Capture proteins were immobilized to nitrocellulose membrane strips (GE FF80 HP, GE Healthcare) via a piezoelectric printer (sciFLEXARRAYER S3, Scienion AG) by depositing 12 spots (250 μm apart from each other) at 30 drops per spot and 450–500 pL per drop. Each test line received approximately 300 nL of total protein solution. The spotted membrane strips were assembled into a comb for ease of use and solutions were pre-loaded into 96-well plate (20 μL). Tri-HSB.1C was prepared at 0.62 mg/mL, and recombinant streptavidin (Abcam) at 1.0 mg/mL. Membranes were moved from one well to another to apply the following solutions for the detection of HA using Tri-HSB.1C as a capture reagent: (1) 100 nM recombinant HA, (2) 100 nM biotinylated HB36.6, and (3) Au-streptavidin. After each step, we washed with PBST (PBS+0.1% (v/v) Tween 20 (PBST) and 1% (w/v) BSA). Recombinant HA was from one of two influenza strains, as indicated: A/California/04/2009 (Influenza Reagent Resource (IRR)) or A/Solomon Islands/03/2006 (IRR). For the inverted set-up, biotinylated HB36.6 was immobilized to nitrocellulose by mutant streptavidin25 (Abcam) lined on the nitrocellulose membrane. For signal amplification, Tri-HSB.2 was conjugated to HRP. For the detection of HA the reagents were delivered by inserting the membrane into wells containing solutions in the following order: (1) 100 nM recombinant HA premixed with 10 nM bHB36.5 and 300 nM of Tri-HSB.2-HRP (2) 1 mg/mL biotin-BSA (Biovision, Inc.) (3) Amplification buffer with 0.0125% H2O2 and 122 mg/mL of DAB as the substrate for HRP.
Protein administration and influenza challenge.
Animal studies were approved by the University of Washington Institutional Animal Care and Use Committee. For in vivo efficacy of Tri-HSB.2, female BALB/c mice (Envigo) of 6–8 weeks of age were randomly assigned groups of 5 mice per cage by the housing facility. N = 10/group yields 80% power to see a significant difference in protection based on 100% infectivity in controls. Investigators were not blinded to the treatment group identification. Mice were anesthetized with 2.5% isoflurane and intranasally (IN) administered protein binder (Tri-HSB.2) at a dose of 3 mg/kg body weight at a final volume of 30 μL per mouse. Mice were anesthetized with 2.5% isoflurane and challenged IN with 1,000 TCID50 (50% tissue culture infectious dose) of X-31 virus (A/Hong Kong/X31/1968) in 30 μL of PBS. The mice were monitored daily for weight loss and survival until 14 d post-infection. Animals that lost more than 25% of their initial body weight were euthanized by carbon dioxide and cervical dislocation in accordance with our animal protocols.
Error bars, n values, and type of replicates for each experiment are defined in the respective section within Online Methods and figure legends. For comparison of the body weights of the animals for the influenza challenge, an unpaired Student's t-test was performed to determine the P-value.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Protein Data Bank
Protein Data Bank
This work was funded by DTRA grants HDTRA1-16-C-0029 and HDTRA1-11-1-0041 (D.B), Life Science Discovery Fund Grant # 9598385 (D.B.), NIH 5R01AI096184-05 (P.Y.) and the NIH/NIAID R21AI119258 (D.H.F.). E.M.S. was supported by a career development award by the NW regional center of excellence (NIAID). J.D.B. and K.A.H. were supported by NIH grant R01 GM102198, K.A.H. by an NRSA training grant (T32GM007270) and I.A.W. and S.M.B. by R56 AI117675 and the Skaggs Institute for Chemical Biology at TSRI. A.J.B. was supported by a grant from the National Science Foundation (DGE-1256082). We would like to thank Erika O. Saphire for the generous gift of Ebola GP. We thank L. Carter and her team for help with protein purifications, Y. Song for help with comparative modeling and K. Godin for help with the selections and library generation. Use of the Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, is supported by the US Department of Energy, Office of Science, Office of Basic Energy Sciences under Contract No. DE-AC02-76SF00515. The SSRL Structural Molecular Biology Program is supported by the DOE Office of Biological and Environmental Research, and by the National Institutes of Health, National Institute of General Medical Sciences (including P41GM103393). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of NIGMS or NIH. GM/CA@APS has been funded in whole or in part with Federal funds from the National Cancer Institute (ACB-12002) and the National Institute of General Medical Sciences (AGM-12006). This research used resources of the Advanced Photon Source, a US Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357.
Integrated supplementary information
Supplementary Figures 1–11 and Supplementary Tables 1–6