Crystal structures and molecular dynamics simulations of a humanised antibody fragment at acidic to basic pH

Tang, Jiazhi; Zhang, Cheng; Castillo, Nuria Codina; Lalaurie, Christophe J.; Gao, Xin; Dalby, Paul A.; Kozielski, Frank

doi:10.1038/s41598-023-42698-7

Download PDF

Article
Open access
Published: 28 September 2023

Crystal structures and molecular dynamics simulations of a humanised antibody fragment at acidic to basic pH

Jiazhi Tang¹,
Cheng Zhang²,
Nuria Codina Castillo²,
Christophe J. Lalaurie²,
Xin Gao³,
Paul A. Dalby² &
…
Frank Kozielski¹

Scientific Reports volume 13, Article number: 16281 (2023) Cite this article

1182 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Antibody-fragment (Fab) therapy development has the potential to be accelerated by computational modelling and simulations that predict their target binding, stability, formulation, manufacturability, and the impact of further protein engineering. Such approaches are currently predicated on starting with good crystal structures that closely represent those found under the solution conditions to be simulated. A33 Fab, is an undeveloped immunotherapeutic antibody candidate that was targeted to the human A33 antigen homogeneously expressed in 95% cases of primary and metastatic colorectal cancers. It is now used as a very well characterised testing ground for developing analytics, formulation and protein engineering strategies, and to gain a deeper understanding of mechanisms of destabilisation, representative of the wider therapeutic Fab platform. In this article, we report the structure of A33 Fab in two different crystal forms obtained at acidic and basic pH. The structures overlapped with RMSD of 1.33 Å overall, yet only 0.5 Å and 0.76 Å for the variable- and constant regions alone. While most of the differences were within experimental error, the switch linker between the variable and the constant regions showed some small differences between the two pHs. The two structures then enabled a direct evaluation of the impact of initial crystal structure selection on the outcomes of molecular dynamics simulations under different conditions, and their subsequent use for determining best fit solution structures using previously obtained small-angle x-ray scattering (SAXS) data. The differences in the two structures did not have a major impact on MD simulations regardless of the pH, other than a slight persistence of structure affecting the solvent accessibility of one of the predicted APR regions of A33 Fab. Interestingly, despite being obtained at pH 4 and pH 9, the two crystal structures were more similar to the SAXS solution structures obtained at pH 7, than to those at pH 4 or pH 9. Furthermore, the P6₅ crystal structure from pH 4 was also a better representation of the solution structures at any other pH, than was the P1 structure obtained at pH 9. Thus, while obtained at different pH, the two crystal structures may represent highly (P6₅) and lesser (P1) populated species that both exist at pH 7 in solution. These results now lay the foundation for confident MD simulations of A33 Fab that rationalise or predict behaviours in a range of conditions.

IgG1 and IgG4 antibodies sample initial structure dependent local conformational states and exhibit non-identical Fab dynamics

Article Open access 23 March 2023

Large-scale application of free energy perturbation calculations for antibody design

Article Open access 21 July 2022

Design Strategy to Create Antibody Mimetics Harbouring Immobilised Complementarity Determining Region Peptides for Practical Use

Article Open access 21 January 2020

Introduction

A33 Fab, is an undeveloped immunotherapeutic fully-humanised antibody candidate that was targeted to the human A33 antigen¹. The fully-humanized A33 Fab was developed by grafting the CDRs from a previous mAb candidate, into the variable region framework of a human antibody². A33 antigen is expressed in the epithelia of the lower gastrointestinal tract, as well as carcinoma lesions that originate from the rectal and colonic mucosa³. The A33 antigen expressed in metastatic colorectal cancers share 95% similarity⁴. Despite not progressing to a commercial release, A33 Fab is a highly characterised representative of the wider therapeutic Fab platform, due to its use as a testing ground for the development of analytics⁵, formulation strategies^6,7,8, protein engineering tools⁹, and to gain a deeper understanding of destabilising mechanisms^10,11,12. All of these studies have used the H/C226S variant which mutated cystein 226 in the heavy chain to serine, to eliminate intermolecular dimerisation¹⁰, and will be referred to simply as A33 Fab throughout.

As the conformational flexibility and kinetic stability of A33 Fab have become better understood through combined biophysical and computational approaches^9,10, and as a wide range of formulations (pH and ionic strength) and mutations have been experimentally characterised, the potential to use computational approaches such as all-atom molecular dynamics (MD) simulations with greater precision in the prediction of protein formulation behaviour has arisen. While small-angle X-ray scattering experiments, and initial MD simulations indicate important structural changes as a function of pH, the MD simulations have only been carried out using homology model structures as starting points which do not take into account any differences in structure induced by the pH. While those studies have been informative in helping to explain experimentally observed behaviours, the confidence with which such simulations can be used to predict new behaviours under different conditions or upon mutation, may depend on the accuracy of the starting structures and conditions used, and how well they represent the actual structure in those different conditions¹³. There are currently no crystal structures of A33 Fab under any conditions, or for any of the A33 Fab variants characterised experimentally. Therefore, to investigate whether the impact of pH on A33 Fab structure can be captured during crystallography, we obtained crystal structures at two different pH.

We crystallized the A33 Fab at both pH 9 and pH 4, which gave rise to structures from two different space groups, P1 and P6₅, that were solved to 2.5 and 2.2 Å resolution, respectively. These provide insights into the general structure of the A33 Fab as well as the six CDRs that originated from the A33 mAb. Initially, this was compared with structures of Certolizumab, a humanised therapeutic antibody developed from the same humanization framework but targeting a different antigen and elucidated the differences in their CDR- and switch regions. This ruled out any similarities between the Fab elbow angles observed between the two structures of A33 Fab, and those observed for Certolizumab when bound or unbound to antigen.

We then used the two crystal structures to explore the importance of differences in starting structure on the outcomes of MD simulations carried out at various pH. MD simulations starting from each of the crystal structures, evaluated the influence of starting structure on the simulations, and explored the dynamics and range of conformations accessible to A33 Fab at the respective crystallisation and neutral pH. These were further compared by using MD simulations to fit against experimental SAXS data obtained previously at various pH. Together, these showed that the simulation pH had a far greater influence on MD trajectories than did the starting structure. It also revealed that both of the crystal structures were highly representative of SAXS solution structures at pH 7, despite having been obtained in crystallisation conditions of pH 4 and pH 9.

Materials and methods

A33 Fab sample preparation

Expression, purification and buffer exchange of A33 Fab has been described previously⁹. Purified A33 Fab was further concentrated to 22.2 mg/ml in water with a 30 kDa Vivaspins (Generon Ltd., Bershire, UK). Prior to crystallization, samples were diluted with PIPES buffer (10 × stock of 50 mM PIPES pH 7.0 and 100 mM NaCl) and MilliQ water to a final concentration of 10 mg/ml, 15 mg/ml and 20 mg/ml, respectively.

Protein crystallization

Purified A33 Fab was used to screen for crystallization conditions using the Low Ionic Strength Screen Kit (Hampton Research). Hanging drops were set up in 24-well Linbro plates (Molecular Dimensions) with 4 μl protein solution, 2 μl buffer reagent and 5 μl precipitant reagent, which were equilibrated against 1 mL of 24% w/v PEG3350 reservoir as described in the Low Ionic Strength Screen handbook (Hampton Research). Drops were set up at both 277 K and 291 K.

Crystals grew in two different space groups. Triclinic crystals (Space group P1, 2.5 Å resolution) grew from 4 μl protein solution at 20 mg/ml mixed with 2 μl 50 mM Glycine pH 9.0 and 5 μl 16% w/v PEG3350 at 291 K. The crystals were cryoprotected with 60 mM Glycine pH 9.0, 19% PEG3350 and 15% Glycerol and flash-frozen in liquid nitrogen. Hexagonal crystals (Space group P6₅, 2.2 Å resolution) grew from 4 μl protein solution at 20 mg/ml mixed with 2 μl 50 mM Citric Acid pH 4.0 and 5 μl 8% w/v PEG3350 at 291 K. The crystals were cryoprotected with 60 mM Citric Acid pH 4.0, 10% PEG3350 and 15% Glycerol and flash-frozen in liquid nitrogen.

Data processing and structure determination

X-ray diffraction data were collected on Diamond beamline I04-1 for the hexagonal structure (P6₅) and I24 for the triclinic structure (P1) (Diamond Light Source, Harwell, UK). Data were indexed and integrated with iMOSFLM¹⁴. Data reduction and scaling was accomplished using SCALA¹⁵ within the CCP4 suite of programs¹⁶.

The first structure, which is in the triclinic form, was determined using phenix.phaser¹⁷. A chimeric model was applied as the probe for molecular replacement consisting of the heavy chain of the humanized RK35 antibody¹⁸ and the light chain of anti-ErbB2 Fab2C4¹⁹. The second structure in the hexagonal form was then solved based on a partially refined model obtained from the triclinic crystals. Refinements were carried out by phenix.refine¹⁷ iterated with real-space manual rebuild using Coot²⁰. The first rounds of refinements were done in phenix.refine with rigid body and update water functions applied. Once a sharp decrease in R_free value had been observed, simulated annealing and translation–libration–screw (TLS) parameterization were included separately for systematic comparison. Refined models were manually rebuilt in Coot and iteratively refined by phenix.refine. The final data collection parameters and refinement statistics are shown in Table 1.

Table 1 Data collection and structure refinement statistics for A33 Fab in space groups P1 and P6₅.

Full size table

Homology modelling

Some loops were too flexible to be resolved in the raw crystal structures and needed to be constructed. RosettaCM²¹ was used to fill those missing residues through a homology modelling protocol by applying the Fab A33 full-length amino acid sequences (SI) to the raw crystal structures of P1 and P6₅. For each homology modelling, more than 20,000 structures were generated after the RosettaCM Hybridize step. The highest scoring structure with all five disulfide bonds intact was selected for the subsequent RosettaCM Final relax step. In the Final relax step, more than 20,000 structures were obtained and the highest scoring structure with all five disulfide bonds intact was selected. Jobs were submitted to the UCL Myriad High-Performance Computing Facility (Myriad@UCL) with Rosetta Version 2018.48.60516-mpi.

Molecular dynamics

The Gromacs²² software was used to perform molecular dynamics, using the full-residue structures modelled based on P1 and P6₅ crystals. The PDB2PQR (Dolinsky et al. 2004) webserver was used to determine the protonation states of ionisable residues at pH 4.5, pH 7 and pH 9 for P1 structure, and pH 7 and pH 4 for P6₅ structure. The pH was maintained in simulations by the protonation state of the initial structure. Protonation states were not updated during the simulation. The simulation was carried out under OPLS-AA/L all-atom force field at 300 K and 1 bar. The protein molecule was placed within a cubic box with 1 nm distance between the protein and the box. The box was solvated with SPC/E waters, and Na⁺ and Cl⁻ ions to neutralise and provide ionic strength at 50 mM to the system. The system was then energy minimised and equilibrated around the solute protein, under an NVT (constant Number of particles, Volume, and Temperature) and an NPT (constant Number of particles, Pressure, and Temperature) ensemble. Production run was finally conducted for 100 ns. Six repeats were performed for each condition. Jobs were submitted to the UCL Myriad High-Performance Computing Facility (Myriad@UCL) using gromacs/2019.3/intel-2018.

Elbow angle calculation

The elbow angle of the Fab is the angle between the pseudo-twofold axes between the variable and constant domains. It is calculated using the Pymol (Version 2.3.2) elbow_angle.py script, setting the V_L domain as residue 1–109 and V_H domain as residue 1–117. The elbow angles were calculated for the P1 and P6₅ crystal structures, the free and binding states of the Certolizumab, as well as the trajectories from the molecular dynamics based on full-residue P1 and P6₅ structures.

SAXS curve fitting

The raw buffer-subtracted SAXS data of A33 Fab was derived from our previous work¹¹. SAXS data collected in vitro at pH 4.5, 7 and 9 were used to compare the fitting from the five MD conditions as shown in Table 2. The PDB structure from the homology modelling step and MD trajectories from the molecular dynamics step were converted to dcd file format and submitted to the SASSIE webserver²³ to obtain their corresponding SAXS curves, i.e. in-silico SAXS data. This was fitted against the in vitro SAXS data, using Chi-Square to indicate the fitness²³. The top 10 best-fit structures were extracted into PDB format, aligned in Pymol and RMSD calculated by Gromacs.

Table 2 The MD trajectories used to fit to their corresponding SAXS data.

Full size table

Results

Overall description of A33 Fab structure

A33 Fab is a fully-humanised fragment consisting of a γ heavy chain and a κ light chain. Each chain contains a variable domain (V_L and V_H) and a constant domain (C_L and C_H1). The variable framework sequence was derived from the human antibody LAY and substituted with murine-derived CDRs while the constant domain fully originated from human immunoglobulin constant regions¹. The elbow angles, defined as the intersection angle of the two pseudo-dyad axes (PDAs) between the variable domain and the constant domain, for the two structures solved in P1 and P6₅ space groups were 156° and 145° respectively, which implied flexibility of the switch region²⁴.

A33 Fab illustrates a canonical β-sandwich Ig fold within four domains (V_L, V_H, C_L and C_H1). Each domain has two layers of β-sheets, an inner- and an outer β-sheet. One canonical disulphide bridge was identified in each domain between the β-sheets layers (between _heavyC22-_heavyC96 in V_H, _lightC23-_lightC88 in V_L, _heavyC144-_heavyC200 in C_H and _lightC134-_lightC194 in C_L), which contributes to the stability of the Fab (Fig. 1). An inter-chain disulphide bridge, _lightC214-_lightC220, was not resolved due to the missing loop in the hinge region of the heavy chain. Seven inter-chain hydrogen bonds have been identified between the heavy and light chains, which are _heavyP102-_lightY36, _heavyF103-_lightT46, _heavyW106-_lightT46, _heavyQ39-_lightQ38, _heavyQ39-_lightQ38, _heavyP170-_lightS162 and _heavyP126-_lightS121. In addition to hydrophobic interactions, there are five inter-chain hydrogen bonds in variable regions while only two in the constant domains, illustrating tighter contacts within the variable regions.

Comparison of A33 Fab structures in different space groups

The triclinic crystals (P1) diffracted to 2.5 Å resolution with two highly similar copies (RMSD = 0.339 Å) of A33 Fab per asymmetric unit (AU), while the hexagonal crystals (P6₅) had only one copy per AU and diffracted to 2.2 Å. The overall structures in both crystal forms were similar but displayed slight differences (Fig. 2a). The main difference between the two structures was the elbow angle. Different compact patterns affected the intersection angle between the variable region and the constant region. The elbow angles shifted from 156° in the triclinic form to 145° in the hexagonal form, which implied flexibility of the switch region. However, the variable regions and the constant regions shared great similarity. The RMSD calculated between C^α-atoms of matched residues at 3D superposition of the two structures was 1.33 Å. In contrast, the RMSD decreased to only 0.5 Å and 0.76 Å for separately superposed variable- and constant regions, respectively.

Closer insight into the linkers between the variable and the constant regions revealed conformational differences between the two models in the heavy-chain switch regions. The side chains of _heavyS117 and _heavyS119 had different orientations and formed different hydrogen bonds with adjacent residues. Also, two unique hydrogen bonds were formed between _heavyA118 and adjacent residues in the hexagonal model. In contrast, the linker in the light chain was fixed by a set of hydrogen bonds, which granted it less flexibility (Fig. 2b,c). The PDB2PQR web server was used to calculate the pKa for ionisable linker residues. However, there were no discernible differences that would alter their protonation states between the P1 and P6₅ models in the linker region, indicating that protonation has a limited impact on the conformational differences observed. As such it remains unclear whether the structural differences in the switch linker region were as a result of the pH or the differences in crystal packing.

Two loops were missing in both structural models, which indicated high flexibility in this region. In the C_H domain the missing loop was composed of four residues (SKST, _heavyS132 to _heavyT135) in the hexagonal model and six residues (SKSTSG, _heavyS132 to _heavyG137) in the triclinic model. The hydrophilicity of the missing loop could facilitate interactions with solvents and thus, contribute to the flexibility of the loop. The other missing loop was located at the C-terminus of the heavy chain, which was also the hinge region of A33 Fab. The missing loop in this region contained 11 residues (KSCDKTHTSAA, _heavyK218 to _heavyA228).

Humanised complementarity determining regions (CDRs)

A33 Fab contains a highly humanised variable region composed of human variable framework regions (FWRs) from the LAY antibody and murine CDRs to minimise the formation of human anti-chimeric antibodies (HACAs)¹.

Here we aligned the A33 Fab for comparison with the peer therapeutic antibody Certolizumab which targets a different antigen²⁵. Certolizumab and A33 Fab have been developed from the same humanization scaffold with high sequence similarity (Supplementary, Fig. S1). The constant regions of A33 Fab and Certolizumab share high similarity. The RMSDs for C_H and C_L regions between A33 Fab and Certolizumab are 0.65 and 0.49 Å, respectively. By aligning the paratope regions, major differences can be found in CDR2 in V_L domains, CDR3 and CDR2 in V_H domains. Despite the large variation between CDRs, the FWRs remain highly similar (Fig. 3a,b), which implies the rigidity of the FWRs.

Molecular dynamics (MD) probed distinct Rg and APR/SASA at different pH

Two crystal structures for A33 Fab were obtained at different pH, namely the P1 form at pH 9 and the P6₅ form at pH 4. However, it is unknown as to how closely these structures match the solution structures of Fab at various pH derived from SAXS and/or MD simulations. Potentially the two crystal structures would also provide improved starting points for MD simulations at a range of pH. To investigate this, the two crystal structures were used to investigate their respective impact on MD simulations at different pH, and the range of solution structure conformations explored by each simulation. These were then also compared with SAXS data available from a previous study at each pH¹¹. This would shed light on the importance of the choice of starting structure on subsequent MD analyses at various pH, and also compare the impact of pH on the structures obtained through all three approaches.

The impact of starting structure and pH on the total protonation states of the protein are reported in Supplementary Table S1, and show that pH 7 leads to lower total charge (+ 7) as expected than for pH 4 (+ 17 to + 26). P6₅_pH4 (+ 26) carried a greater charge than P1_pH4.5 (+ 17), indicating some structural differences that affected surface ionisable groups and/or salt bridges. As shown in Fig. 4, all five conditions started with similar Rg values (2.44–2.46 nm) but diverged towards the end of simulations. At 100 ns, the two low pH conditions (P1_pH4.5, P6₅_pH4) both achieved the highest Rg at more than 2.51 nm, followed by the two neutral pH conditions (P1_pH7, P6₅_pH7) at 2.47–2.48 nm, and with the high pH condition (P1_pH9) having the lowest Rg at 2.46 nm. This corresponded well to our previous experimental SAXS data¹¹, where the Fab shared comparable Rg at pH 7 and pH 9, but had much higher Rg at pH 4.5. The similar ranking of end-point Rg at various pH was clearly the result of their respective protonation states in the MD simulation, and was unaffected by any initial crystallographic bias from the two crystal structures of P1 and P6₅.

The increased Rg at low pH suggested a possible conformational instability of the A33 Fab, that exposed more aggregation-prone regions (APR) to the solvent and resulted in tenfold faster in vitro aggregation at pH 4 compared to at pH 9⁵. Therefore, we evaluated all seven APRs of A33 Fab predicted previously⁵ and their different responses to the various pH stresses, based on the current simulations starting from the two different crystal structures (Fig. 5a). This showed that at the end of the simulation (80–100 ns), P1_pH9 had the lowest Solvent Accessible Surface Area (SASA) in nearly all the APRs, while P1_pH4.5 and/or P6₅_pH4 saw increased SASA at four APRs located at residues _heavyV173-V188 (C_H1), _lightT129-F139 (C_L), _lightL47-A51 (V_L) and _lightT31-Y36 (V_L). This implies an increased Fab light chain volatility under low-pH stresses.

It is noted that for C_H1 residues _heavyV173-V188 and C_L residues _lightT129-F139, the end-point SASA for P6₅ simulations were clearly higher. For C_L residues _lightT129-F139, this was likely due to the fact that the starting SASA (0 ns) of P6₅ C_L residues _lightT129-F139 were already 25% higher than those of P1 (Fig. 5d). By contrast, C_H1 residues _heavyV173-V188 had comparable starting SASA between P1 and P6₅ (Fig. 5e). Therefore, for C_L residues _lightT129-F139 there appeared to be a persistent difference between the P6₅ and P1 structures within this region. The distribution of SASA and elbow angles for each simulation in the final 20 ns, compared to their starting structures (Fig. 6) reveals that both P1 and P6₅ simulations moved towards decreased elbow angles, and increased SASA for C_L residues _lightT129-F139. However, P1 SASA and elbow angles remained offset from those of P6₅, by a similar amount found between their starting structures. Indeed the P1 simulations appeared to have only just converged on and then passed the P6₅ starting position in the final 20 ns of the 100 ns simulations. Thus, the P6₅ starting structure was already a better representation of the solution structure at pH 4 for residue _lightT129-F139, than was P1. However, it is expected that for the C_L residues _lightT129-F139, the P1 trajectories would also eventually converge with the P6₅ trajectories at each respective pH given longer simulations. Only for this specific APR did the solvent accessibility depend on the starting structure used.

In addition, the evolution of APR residues _lightT31-Y36 can be more clearly visualised in Fig. 5b,c. This shows that the end points converged for P1_pH4.5 and P6₅_pH4 in this region (Fig. 5c), while P1_pH9 retained its divergence from P6₅_pH4 (Fig. 5b), indicating the tendency towards increased exposure of this APR at low pH. Furthermore, the initial conformations of P1 and P6₅ were already good representatives of their respective crystallographic pH for this APR, as the SASA from MD simulations at the same pH did not change significantly.

RMSD, RMSF and elbow angle did not reveal crystal structure and pH dependency

RMSD (Fig. 7), elbow angle (Fig. 8) and RMSF (Supplementary Fig. S2) did not reveal significant differences at the end of the MD performed at the different pH. Initial changes seen in the first 3 ns are typical of the system continuing to equilibrate into the solution conditions as seen previously⁵ and can be ignored. RMSD was relatively insensitive to either pH or starting structure conditions compared to Rg or the SASA of certain APRs. Residue-level RMSF was analysed at each 10 ns interval for the four Fab domains independently, and was largely unaffected by either pH or starting structure. Exceptions, shown for the final 10 ns interval in Supplementary Fig. S2, were at residues 349–350 and 404–410 in the C_H1 domain for which RMSF values were higher for P6₅ simulations than for P1 simulations (regardless of pH). These two solvent exposed regions interact in the structure close to the C-terminal hinge end of the molecule and bury a significant portion of the APR _lightT129-F139. This increased flexibility at residues 349–350 and 404–410 within the P6₅ simulations therefore accounts for the increased SASA for APR _lightT129-F139 also observed in these simulations.

Also, the low pH appeared to increase RMSF within a small region at residues 199–204 in the C_L domain for both the P1 and P6₅ simulations. This demonstrates again that the starting structure did not significantly influence the simulations, giving consistent effects due to changes in the pH.

For the elbow angle, while the starting angles varied in the range 147°–151°, simulations from all conditions converged the elbow angles to around 139°–145° in the last 20 ns. P1_pH9 and P6₅_pH4 with initial elbow angles of 151° and 148.8° respectively, were simulated at their respective crystallisation pH conditions as shown in Fig. 8d. In the P1_pH9 structure, the angle dropped to 141° at 10 ns, before rising back to 146° at 20 ns, then settling out at between 140° and 145°. By comparison, the P6₅_pH4 structure witnessed a much smaller decrease in elbow angle, with a short-lived excursion to 148.8° at 62.5 ns, but mostly remaining within the 138°–145° range. This implied that the P6₅ crystal structure resolved at pH 4 had an elbow angle more similar to those in solution at any pH, while the elbow angle for P1 was possibly due to a conformation that is only rarely populated in solution, even at pH 9, and yet became stabilised by crystallisation at pH 9. It also suggests that for the elbow angle at least, the starting structure is much less critical than the influence of pH configuration on the end-points of MD simulations, and that longer simulations would likely remove any influence of starting structure. This does however, highlight that where the aim is to use MD simulation data taken from throughout the trajectory to evaluate protein structure and dynamics, then the starting structure does influence the outcome initially, and so care should be taken to run longer simulations, or to compare multiple starting points.

SAXS curve fitting revealed representative MD conformations at pH 7

The fitness of the MD trajectories to the in vitro experimental SAXS data is shown in Fig. 9. The two pH 7 conditions gave the best fits of around 0.02 chi-square, followed by the pH 9 condition with 0.05 chi-square, while the pH 4/4.5 simulations gave the lowest fitness of 0.10 chi-square. It is clear that the ranking in the SAXS fitting heavily depended on the pH, regardless of the initial crystal structure used, and that the simulations at pH 4.5 still had a long way to go until they accessed the structures observed by SAXS experiments at low pH. Clearly the low-pH solution structures were also very different to the crystal structure P6₅ obtained at pH 4, as well as to P1 obtained at pH 9.

The top 10 best-fits for each condition were highlighted as stars in Fig. 9. This shows that for the good-fit trajectories (pH 7 and pH 9), the best fits all occurred during the first half of the MD simulations. However, most of the best-fits arose in a single repeat for each condition, and occurred as outliers in the data within a 20 ns timeframe. In other words they were obtained from relatively rare events in the MD simulations, accessed as rapid fluctuations from a structural state that itself only existed for specific transient periods of approximately 20 ns. Again this type of event clustering was similar for both P1 and P65-based simulations, and so for application to SAXS fitting, the specific starting structure used in MD simulations was not a critical factor.

Even though P1 and P6₅ were crystallised at pH 9 and pH 4, respectively, neither one of them outperformed the other by revealing conformations during MD simulations with better fitting to SAXS data. At any given pH, the simulations from the two different starting structures had essentially converged within the 100 ns timeframe sampled, and so were unlikely to lead to any differences if sampled for longer or with more advanced sampling methods.

For each MD condition, the chi-square fits were also plotted against Rg for all the trajectory frames (Supplementary Fig. S3a–e), to determine whether a minimum was reached which would indicate that sufficient sampling of states had been achieved to correctly model the SAXS data. Only the two pH 7 simulations reached such a minimum in the Chi-square vs Rg plot, with the simulation from P6₅ at pH7 reaching the best fit overall. As no minimum was observed for pH 4/4.5 or 9, then either much longer simulations would be needed or otherwise more advanced sampling methods to fully resolve the best fit structures from the in vitro SAXS data at those pH. Indeed, better fits to the SAXS data were obtained in our previous work by increasing the temperature to 380 K, which enabled the simulations to explore more extensive structural changes in the MD¹¹.

Discussion

Over the past decades, significant efforts have been undertaken into the development of therapeutic antibodies against CRC targeting human antigen A33. Various A33 Fab variants have been developed and multiple clinical trials have been carried out^3,26. However, various A33 Fab variants have failed in clinical trials due to the normal gut localization and intrinsic stability issues of the variants^11,26. To our knowledge, the structure of A33 Fab or its variants has never been reported. Here we determined the structure of the highly characterised A33 Fab mutant, H/C226S, in two crystal forms.

The structural analysis of A33 Fab in two different crystal forms disclosed the intrinsic rigidity of four separate regions (V_L, C_L, C_H1, C_L) and the flexibility of the switch region. The different compact patterns and the interactions with solvents altered the elbow angles between the two pseudo-dyad axes from 156° (triclinic form) to 145° (hexagonal form), which led to a noticeable RMSD between the structures obtained from the two crystal forms. In contrast, the canonical β-sandwich Ig folds contributed to the rigidity of both variable and constant regions, which led to the high similarity of separate domains in different crystal forms.

We hypothesised that the major differences between the A33 Fab and Certolizumab were located at the variable region. Certolizumab was developed from the same humanization scaffold as A33 Fab, possessing a similar variable region framework and the same constant region protein sequence. To our surprise, the variable regions of these two Fabs shared higher similarities than expected (RMSD 0.91 Å). Only the V_H CDR3 and V_H CDR2 of both Fabs varied significantly, while the rest of the CDRs had similar conformations even with different protein sequences, which suggested the key roles of V_H CDR3 and V_H CDR2 in the identification between human A33 and TNFα. The FWRs were also similar between the two structures despite the sequence differences, which implied that the FWRs offer good support to the CDRs while being neutral to scaffolding for antibody structural integrity. The other main difference was the elbow angle. According to the previously published Certolizumab structure, the elbow angle of Certolizumab changed 9° during antigen binding, from 138° (free) to 129° (binding)²⁵. In contrast, the elbow angles of P1 and P6₅ structures of A33 Fab were 156° and 145° respectively. Certolizumab and Fab A33 possess the same protein sequence in switch and constant regions while the different elbow angles implied flexibility in the switch region. It has been reported that the flexibility of elbow angle is vital for the binding affinity of Fabs to antigens by optimizing antibody conformational dynamics and adaptation to antigen structure^27,28. In conclusion, the scaffold of the A33 Fab is optimal to support CDRs targeting different antigens and has the potency to be applied to other therapeutic candidates from an engineering perspective. However, the pH-induced changes in elbow angle for A33 Fab do not appear to relate to the elbow-angle changes induced by ligand binding in Certolizumab.

The two structures could not resolve two flexible regions, including residues _heavyS132 to _heavyT135 and _heavyK218 to _heavyA228. Thus, homology modelling was conducted to predict the missing residues, followed by molecular dynamics to study their impact on the conformational changes at different pH.

Overall, the structural differences between P6₅ and P1 were small in terms of their impact on MD simulations, as reflected by their comparable SAXS fitting and overall RMSD regardless of pH (Supplementary Fig. S3f) when comparing the top 10 best-fits. In addition, most of the APR solvent exposure was not impacted by the starting structure during the simulations. The only observed impacts of the starting structures were:

(a)
The solvent exposure of APR _lightT129-F139, which remained different between P1 and P6₅ simulations after 100 ns. P1 simulations lagged behind the P6₅ simulations but after 100 ns they had at least passed through the solvent exposure (and elbow angles) found in the P6₅ starting structure.
(b)
P6₅ achieved slightly better fits than P1 at pH 7 and pH 4.5, as it was already closer in structure to the solution forms.

Finally, it was interesting that the crystal structures P1 and P6₅ obtained at pH 9 and pH 4, respectively, were not biased towards the solution structures indicated by SAXS under the equivalent conditions. However, both of the starting structures were already reasonable approximations of the pH 7 SAXS structure (Chi-square around 0.04), with P6₅ marginally better than P1. This is an indication that crystallisation stabilised against the partial denaturations found in acidic or alkaline solutions, and instead selected out structures closer to those found at pH 7, despite the crystallisation at pH 9 or pH 4.

Successful development of protein therapeutics depends critically on achieving stability under a range of conditions. Starting from the crystal structure of A33 Fab in the P1 space group, we have previously compared the impacts of low pH and high temperature stresses on the structure of a humanized antibody fragment (Fab) A33, using atomistic molecular dynamics simulations⁵. An extensive dataset now exists for the stability and aggregation behaviour of A33 Fab variants and formulations, from which a deeper understanding of these mechanisms can be potentially gained by exploring MD simulations in more detail. The crystal structures of human A33 Fab, and the present finding that the specific starting structure chosen actually has little influence on the MD simulations at a range of pH, especially beyond the first 80 ns, will now provide a solid support for this testing ground.

Data availability

Coordinates for the A33 Fab models 7NC0 and 7NFA have been deposited and released in the PDB. Other data reported here are available on request to the corresponding author.

References

King, D. et al. Preparation and preclinical evaluation of humanised A33 immunoconjugates for radioimmunotherapy. Br. J. Cancer 72, 1364–1372 (1995).
Article PubMed PubMed Central CAS Google Scholar
King, D. J., Adair, J. R. & Owens, R. J. (Google Patents, 2001).
Welt, S. et al. Phase I study of anticolon cancer humanized antibody A33. Clin. Cancer Res. 9, 1338–1346 (2003).
PubMed CAS Google Scholar
GarinChesa, P. et al. Organ-specific expression of the colon cancer antigen A33, a cell surface target for antibody-based therapy. Int. J. Oncol. 9, 465–471 (1996).
PubMed CAS Google Scholar
Zhang, C. et al. Comparison of the pH-and thermally-induced fluctuations of a therapeutic antibody Fab fragment by molecular dynamics simulation. Comput. Struct. Biotechnol. J. 19, 2726–2741 (2021).
Article PubMed PubMed Central CAS Google Scholar
Barata, T. S., Zhang, C., Dalby, P. A., Brocchini, S. & Zloh, M. Identification of protein–excipient interaction hotspots using computational approaches. Int. J. Mol. Sci. 17, 853 (2016).
Article PubMed PubMed Central Google Scholar
Zhang, H. & Dalby, P. A. Stability enhancement in a mAb and Fab coformulation. Sci. Rep. 10, 1–11 (2020).
Article Google Scholar
Zhang, H. & Dalby, P. A. Stability convergence in antibody coformulations. Mol. Pharm. 19, 4098–4110 (2022).
Article PubMed PubMed Central CAS Google Scholar
Zhang, C. et al. Computational design to reduce conformational flexibility and aggregation rates of an antibody fab fragment. Mol. Pharm. 15, 3079–3092 (2018).
Article PubMed CAS Google Scholar
Chakroun, N., Hilton, D., Ahmad, S. S., Platt, G. W. & Dalby, P. A. Mapping the aggregation kinetics of a therapeutic antibody fragment. Mol. Pharm. 13, 307–319 (2016).
Article PubMed CAS Google Scholar
Codina Castillo, N. Stability and Aggregation-Prone Conformations of an Antibody Fragment Antigen-Binding (Fab) (University College London, 2019).
Google Scholar
Zhang, H. et al. Machine learning reveals hidden stability code in protein native fluorescence. Comput. Struct. Biotechnol. J. 19, 2750–2760 (2021).
Article PubMed PubMed Central CAS Google Scholar
Wan, S., Sinclair, R. C. & Coveney, P. V. Uncertainty quantification in classical molecular dynamics. Phil. Trans. R. Soc. A 379, 20200082 (2021).
Article ADS PubMed PubMed Central CAS Google Scholar
Battye, T. G. G., Kontogiannis, L., Johnson, O., Powell, H. R. & Leslie, A. G. iMOSFLM: A new graphical interface for diffraction-image processing with MOSFLM. Acta Crystallogr. D Biol. Crystallogr. 67, 271–281 (2011).
Article ADS PubMed PubMed Central CAS Google Scholar
Evans, P. Scaling and assessment of data quality. Acta Crystallogr. D Biol. Crystallogr. 62, 72–82 (2006).
Article ADS PubMed Google Scholar
Collaborative, C. P. The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 50, 760 (1994).
Article ADS Google Scholar
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: Recent developments in Phenix. Acta Crystallogr. Sect. D Struct. Biol. 75, 861–877 (2019).
Article ADS CAS Google Scholar
Apgar, J. R. et al. in MAbs. 1302–1318 (Taylor & Francis).
Vajdos, F. F. et al. Comprehensive functional maps of the antigen-binding site of an anti-ErbB2 antibody obtained with shotgun scanning mutagenesis. J. Mol. Biol. 320, 415–428 (2002).
Article PubMed CAS Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 (2010).
Article ADS PubMed PubMed Central CAS Google Scholar
Song, Y. et al. High-resolution comparative modeling with RosettaCM. Struct. 21, 1735–1742 (2013).
Article CAS Google Scholar
Abraham, M. J. et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25 (2015).
Article ADS Google Scholar
Curtis, J. E., Raghunandan, S., Nanda, H. & Krueger, S. SASSIE: A program to study intrinsically disordered biological molecules and macromolecular ensembles using experimental scattering restraints. Comput. Phys. Commun. 183, 382–389 (2012).
Article ADS CAS Google Scholar
Stanfield, R. L., Zemla, A., Wilson, I. A. & Rupp, B. Antibody elbow angles are influenced by their light chain class. J. Mol. Biol. 357, 1566–1574 (2006).
Article PubMed CAS Google Scholar
Lee, J. U., Shin, W., Son, J. Y., Yoo, K.-Y. & Heo, Y.-S. Molecular basis for the neutralization of tumor necrosis factor α by certolizumab pegol in the treatment of inflammatory autoimmune diseases. Int. J. Mol. Sci. 18, 228 (2017).
Article PubMed PubMed Central Google Scholar
O’Donoghue, J. A. et al. 124I-huA33 antibody uptake is driven by A33 antigen concentration in tissues from colorectal cancer patients imaged by immuno-PET. J. Nucl. Med. 52, 1878–1885 (2011).
Article PubMed CAS Google Scholar
Henderson, R. et al. Selection of immunoglobulin elbow region mutations impacts interdomain conformational flexibility in HIV-1 broadly neutralizing antibodies. Nat. Commun. 10, 1–14 (2019).
Article Google Scholar
Lord, D. M. et al. in MAbs. 444–452 (Taylor & Francis).

Download references

Acknowledgements

We would like to thank Diamond Light Source for beamtime (proposal mx23853), and the staff of beamlines I04-1 and I24 for assistance during data collection. We are grateful for funding support to Jiazhi Tang from the Chinese Scholarship Council. We thank UCB Pharma for providing the A33 Fab. We also gratefully acknowledge the Engineering and Physical Sciences Research Council (EPSRC) for funding (EP/N025105/1) that supported Cheng Zhang.

Author information

Authors and Affiliations

UCL School of Pharmacy, 29-39 Brunswick Square, London, WC1N 1AX, UK
Jiazhi Tang & Frank Kozielski
Department of Biochemical Engineering, UCL, Bernard Katz Building, Gower Street, London, WC1E 6BT, UK
Cheng Zhang, Nuria Codina Castillo, Christophe J. Lalaurie & Paul A. Dalby
Division of Biosciences, Department of Structural and Molecular Biology, UCL, London, WC1E 6BT, UK
Xin Gao

Authors

Jiazhi Tang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Nuria Codina Castillo
View author publications
You can also search for this author in PubMed Google Scholar
Christophe J. Lalaurie
View author publications
You can also search for this author in PubMed Google Scholar
Xin Gao
View author publications
You can also search for this author in PubMed Google Scholar
Paul A. Dalby
View author publications
You can also search for this author in PubMed Google Scholar
Frank Kozielski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.T. set up the crystallization screen, harvested the crystal, and solved the A33 Fab structure. C.Z. purified the protein, conducted the in silico simulation and helped with SAXS analyses. N.C., C.L. and X.G. conducted the SAXS experiments and analyses. F.K. and P.D. designed and supervised research. J.T. and C.Z. wrote the manuscript.

Corresponding authors

Correspondence to Paul A. Dalby or Frank Kozielski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tang, J., Zhang, C., Castillo, N.C. et al. Crystal structures and molecular dynamics simulations of a humanised antibody fragment at acidic to basic pH. Sci Rep 13, 16281 (2023). https://doi.org/10.1038/s41598-023-42698-7

Download citation

Received: 27 March 2023
Accepted: 13 September 2023
Published: 28 September 2023
DOI: https://doi.org/10.1038/s41598-023-42698-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.