AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination

Terwilliger, Thomas C.; Liebschner, Dorothee; Croll, Tristan I.; Williams, Christopher J.; McCoy, Airlie J.; Poon, Billy K.; Afonine, Pavel V.; Oeffner, Robert D.; Richardson, Jane S.; Read, Randy J.; Adams, Paul D.

doi:10.1038/s41592-023-02087-4

Published: 30 November 2023

Nature Methods volume 21, pages 110–116 (2024)

Abstract

Artificial intelligence-based protein structure prediction methods such as AlphaFold have revolutionized structural biology. The accuracies of these predictions vary, however, and they do not take into account ligands, covalent modifications or other environmental factors. Here, we evaluate how well AlphaFold predictions can be expected to describe the structure of a protein by comparing predictions directly with experimental crystallographic maps. In many cases, AlphaFold predictions matched experimental maps remarkably closely. In other cases, even very high-confidence predictions differed from experimental maps on a global scale through distortion and domain orientation, and on a local scale in backbone and side-chain conformation. We suggest considering AlphaFold predictions as exceptionally useful hypotheses. We further suggest that it is important to consider the confidence in prediction when interpreting AlphaFold predictions and to carry out experimental structure determination to verify structural details, particularly those that involve interactions not included in the prediction.

Main

Protein structure predictions using AlphaFold¹, RoseTTAFold² and related methods³ are far more accurate than previous generations of prediction algorithms⁴, bringing much closer to reality the biological understanding that could be derived from knowing the three-dimensional structures of all macromolecules^{1,2,5,6,7,8,9}. AlphaFold predictions have already been made available for 200 million individual protein sequences to further drug discovery and protein engineering and to understand biology¹⁰. A question that immediately arises is to what extent these predictions can substitute for experimental structure determinations^11,12.

Both experimentally determined protein structures and predicted models have important limitations^11,13,14. Proteins are flexible and dynamic, and their distributions of conformations depend on temperature, solution conditions and binding of ligands or other proteins (including crystal contacts in the case of crystallography)¹⁵. A model of a high-resolution crystal structure can accurately represent the dominant conformation(s) present in a crystal in a particular environment¹¹, but the structure may differ under another set of conditions¹⁴. Artificial intelligence (AI)-based models can in many cases be very accurate; however, they do not yet take into account the presence of ligands, covalent modifications or environmental factors, and take protein–protein interactions and multiple conformations into account in a limited way^1,2,16,17.

The accuracy of a prediction is typically assessed by how closely it matches a structure in the Protein Data Bank¹⁸ (PDB) with the same sequence, and there are many ways to make such a comparison⁴. Using comparisons that focus on local accuracy, predictions obtained with AlphaFold have been assessed as having ‘atomic accuracy’¹⁹, having accuracies competitive with ‘the best experimental results’⁴ and being of comparable quality to an experimental crystal structure⁷. It has been argued that AlphaFold predictions might be more accurate than estimated by comparison with models in the PDB, or even more accurate than the deposited models, because the deposited models are poorly defined in some places⁴. This reasoning notes that side-chain positions and loops are sometimes not clear in crystallographic electron density maps²⁰, and in such cases a difference between an AlphaFold prediction and a deposited model would not indicate an error in the prediction. On the other hand, analyses carried out by the DeepMind team and others show that AlphaFold predictions vary substantially in their global and local agreement with deposited models and also in their coverage at the highest levels of confidence^1,11,21, with only 36% of residues in the human proteome²² and 73% of residues in Escherichia coli modeled with very high confidence²³. Of course, many of the proteins in the human proteome that have low-confidence AlphaFold predictions are likely to contain regions that are intrinsically disordered^24,25 that would also often not be revealed by experimental methods.

Here, we address the accuracies of AlphaFold predictions by assessing how well they agree with experimental data²⁶. We put these results into context by examining how closely one crystal structure in the PDB can typically be reproduced by another crystal structure containing the same components, but crystallized in a different space group (resulting in different crystal contacts).

Results

Comparing AlphaFold predictions with density maps

We used a set of crystallographic electron density maps determined without reference to deposited models as standards for evaluation of AlphaFold predictions. The density maps were obtained²⁷ using iterated AlphaFold prediction and model rebuilding with X-ray crystallographic data deposited in the PDB. For the present work we selected a high-quality subset of 102 models and maps from this analysis consisting of those that had free R values of 0.30 or better. The density maps in our analysis do not have any bias towards deposited models, as no information from deposited structures was used to compute these maps. Therefore, if features of a prediction are incompatible with the density maps and different from the deposited model, they are likely to be incorrect representations of the actual molecule in the crystal.

AlphaFold predictions are produced with residue-specific confidence metrics (pLDDT, the predicted value of the local distance difference test), which are estimates of the local accuracy of the prediction¹. Residues with pLDDT values of greater than 90 are considered to be predicted with very high confidence and those with values of 70 or greater have moderate-to-high confidence.

Figure 1 compares AlphaFold predictions, experimental density maps and corresponding deposited models (predictions were superimposed on the deposited models). All the residues shown in Fig. 1 were predicted with very high confidence (pLDDT > 90) and the density maps range in resolution from 1.1 Å to 1.6 Å.

**Fig. 1: Comparison of details of AlphaFold predictions with density maps.**

Figure 1a shows an example of an AlphaFold prediction that superimposes closely on the corresponding density map (PDB entry 7waa; ref. ²⁸). For comparison, Fig. 1e shows the deposited model along with the same density map. The overall map–model correlation for the superimposed AlphaFold prediction is 0.72 and the root mean squared (r.m.s.) C_α difference from the deposited model is 0.5 Å.

Figure 1b shows a prediction for PDB entry 7s5L (ref. ²⁹) which contained high-confidence regions that did not match the density map. The main chain corresponding to residues N137 through F142 matches the density map poorly. In contrast, the deposited model matches the map very closely (Fig. 1f). The overall map–model correlation for the superimposed prediction is 0.44, much lower than that for the 7waa prediction shown in Fig. 1a, and the r.m.s. C_α difference from the deposited model is 2.1 Å.

Figure 1c shows an example of a prediction that does not match the density map but which might still represent a plausible conformation of the molecule. The prediction for PDB entry 7t26 (ref. ³⁰) does not superimpose on the density near P101 and D102, while the deposited model does (Fig. 1g). The density map is less clear in this region than in other parts of the map. A break in main-chain density at D102 suggests that the chain adopts multiple conformations in this region. It is possible that the conformation in the AlphaFold prediction could be one of these alternative conformations, though not a dominant one as it does not appear in the density map.

Figure 1d illustrates a case where the AlphaFold prediction is distorted relative to the density map (PDB entry 7naz). Residues in the vicinity of V156 match the density closely (Fig. 1d), while residues near L235 are shifted relative to the map. For comparison, the deposited model matches the map closely throughout the region shown (Fig. 1h).

Figure 2a (open bars) shows the overall compatibility of 102 AlphaFold predictions with their corresponding density maps, as measured by map–model correlation. The mean map–model correlation for AlphaFold predictions (open bars) after superimposing them on corresponding deposited models was 0.56, substantially lower than the mean map–model correlation of deposited models to the same maps of 0.86 (hatched bars).

**Fig. 2: Overall comparison of AlphaFold predictions with density maps and deposited models.**

Distortion and domain movement in AlphaFold predictions

Figure 1d illustrated that an AlphaFold prediction can be somewhat distorted relative to the actual structure. To determine whether this occurs for many AlphaFold predictions, we ‘morphed’ each AlphaFold prediction to make it more similar to the deposited model (Methods). This process reduces differences between predictions and deposited models that arise from either distortion or alternate locations of domains within chains. After morphing each predicted model, the predictions agree more closely with the electron density maps (Fig. 2a, solid bars, mean map correlation of 0.67 versus 0.56 before morphing), but still much less closely than the deposited models (Fig. 2a, hatched bars, mean map correlation of 0.86).

If two models are related by a long-range distortion or alternate locations of domains, inter-atomic distances that are short will be similar in the two models, while those that are long will differ. We quantified this relationship by comparing inter-atomic distances in predicted models with matching distances in deposited models and examining the median differences as a function of distance. Figure 2b shows that this median inter-atomic distance deviation between deposited models and moderate-to-high-confidence parts of AlphaFold predictions (pLDDT above 70) is about 0.1 Å for atom pairs that are close (between 4 Å and 8 Å apart) and increases to 0.7 Å for distant atom pairs (48 Å to 52 Å), indicating a typical distortion of about 0.5–1 Å over this range of distances. As a reference, we analyzed 926 pairs of high-resolution structures in the PDB that had identical sequences but were obtained in different crystallographic space groups (so that crystal contacts influencing conformation would differ). Figure 2b shows that atom pairs in these matching structures had distances that differed by a median of 0.1 Å for nearby residues and 0.4 Å for distant ones, about half the values found for AlphaFold predictions.
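
The distance-dependent comparison described above can be sketched in a few lines. This is a minimal illustration and not the code used for Fig. 2b: the function name, the use of the first model's distance to assign each pair to a bin, and the toy coordinates are all assumptions made for the example.

```python
import math
from statistics import median


def distance_deviation_by_bin(coords_a, coords_b, bin_edges):
    """Median |d_a - d_b| over matched atom pairs, grouped by pair distance.

    coords_a, coords_b: equal-length lists of (x, y, z) tuples for matched
    atoms in the two models (hypothetical input format).
    bin_edges: increasing distances in angstroms; a pair is assigned to the
    bin that contains its distance in model A (an assumption of this sketch).
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("models must contain the same matched atoms")
    bins = {(lo, hi): [] for lo, hi in zip(bin_edges[:-1], bin_edges[1:])}
    n_atoms = len(coords_a)
    for i in range(n_atoms):
        for j in range(i + 1, n_atoms):
            d_a = math.dist(coords_a[i], coords_a[j])
            d_b = math.dist(coords_b[i], coords_b[j])
            for (lo, hi), values in bins.items():
                if lo <= d_a < hi:
                    values.append(abs(d_a - d_b))
                    break
    # Bins that received no atom pairs are omitted from the result.
    return {edges: median(values) for edges, values in bins.items() if values}


# Toy case: model B is model A stretched by 1% along x, a pure long-range distortion.
model_a = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0), (50.0, 0.0, 0.0)]
model_b = [(1.01 * x, y, z) for x, y, z in model_a]
deviations = distance_deviation_by_bin(model_a, model_b, [4, 8, 48, 52])
```

In the toy case the deviation grows with pair distance, which is the signature of long-range distortion that the analysis looks for; purely local errors would instead give similar deviations in every bin.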

As a third method of assessing distortion and differences in domain relationships in AlphaFold predictions, we compared them with the corresponding models from the PDB, calculating the r.m.s. deviation (r.m.s.d.) of C_α atoms both before and after applying the distortion field described above. For this analysis we used all 215 structures analyzed in our previous work³¹. Figure 2c shows the distribution of C_α r.m.s.d. values for the AlphaFold predictions; the median r.m.s.d. is 1.0 Å. After applying the distortion field, the median r.m.s.d. is reduced to 0.4 Å (Fig. 2d, the median r.m.s.d. distortion applied was 0.6 Å). For matching pairs of structures in the PDB crystallized in different space groups, the median C_α r.m.s.d. was only 0.6 Å, and this could be reduced to 0.4 Å by applying a distortion field (median r.m.s. distortion applied of 0.2 Å). Overall, the C_α coordinates in AlphaFold predictions are considerably more different from PDB entries than deposits of high-resolution structures of the same molecule in different space groups are from each other (median r.m.s.d. of 1.0 Å versus 0.6 Å), and a substantial part of this difference consists of long-range distortion.

Comparing AlphaFold side-chain predictions with experimental maps

As illustrated in Fig. 1, AlphaFold predictions often contain at least some regions that are similar to deposited structures, but even in these regions many details often differ. We used the 102 electron density maps described above along with deposited models to evaluate side-chain conformations (the locations of atoms in side chains relative to the atoms in the main chain that they are connected to), an important local feature of a structural model. To analyze the local side-chain structure and remove confounding effects from domain shifts or distortions, we grafted the side chain from each residue in an AlphaFold prediction onto the main-chain atoms of the corresponding residue of the deposited model. This yielded a composite model with the main-chain coordinates of the deposited models and side-chain conformations corresponding to the AlphaFold predictions.

Figure 3a shows a local portion of PDB entry 7vgm, and Fig. 3b shows the AlphaFold prediction superimposed on the deposited model. Figure 3c shows the same region with the grafted side chain and the composite model. The positions of several of the side chains in the AlphaFold model (for example, R32, D62, E530, E533, R494) are different from those in the deposited model. Figure 3d shows the deposited model for 7vgm along with the density map obtained for PDB entry 7vgm, and Fig. 3e shows the AlphaFold model superimposed on the same density map. Even though the density map was obtained with the AlphaFold prediction and without reference to the deposited model, all the side chains in the deposited model match the map closely. In contrast, side chains in the AlphaFold prediction that were different from those in the deposited model do not match the density map, both before (Fig. 3e) and after (Fig. 3f) grafting, indicating that these side-chain conformations are likely to be incorrect.

**Fig. 3: Comparison of AlphaFold side-chain predictions with density map for PDB entry 7vgm.**

We carried out this side-chain grafting procedure for 102 AlphaFold predictions and the corresponding deposited models. For each pair of side chains, we examined the agreement between atomic positions in that side chain and the corresponding optimized density map. We identified pairs in which the AlphaFold side-chain prediction differed substantially from the deposited model (r.m.s.d. of side-chain atoms > 1.5 Å). Then, based on estimates of the uncertainty of density values in each map and of the number of independent points sampled by side-chain atomic positions in that map, we identified AlphaFold side-chain predictions that differed from the deposited model and were highly unlikely (P < 0.01) to be as compatible with the density map as the deposited model. We considered these AlphaFold side-chain predictions to be incompatible with the experimental data.

Overall, we found that 20% of the side chains in moderate-to-high-confidence residues of AlphaFold predictions and not involved in crystal contacts had different conformations than in the corresponding deposited model (at least 1.5-Å r.m.s.d.), and one-third of these (7% overall) were clearly incompatible with the experimental data. As the number of clearly incompatible residues identified by our method is a lower-bound estimate, we expect that the actual level of disagreement between AlphaFold predictions and conformations of the molecules in the crystals is somewhere between the 7% that are clearly incompatible with the data and the 20% that differ from the deposited models.

To put the fraction of side-chain positions in AlphaFold predictions that are incompatible with the experimental data into perspective, we carried out a similar analysis, but using the set of high-resolution structures from the PDB containing the same components but crystallized in a different space group. For these tests we used experimentally based density maps (2mFo-DFc maps³²) calculated using one model from each pair. Here, only 6% of the side chains differed by at least 1.5-Å r.m.s.d., and only 2% were in conformations that were experimentally incompatible with the corresponding conformations from the other set. Therefore, at a detailed level as well as an overall level, the differences between AlphaFold predictions and these crystal structures are substantially greater than for pairs of crystal structures determined in different space groups.

We then analyzed whether the 7% of residues in AlphaFold predictions that were incompatible with experimental data included residues with functional importance. We extracted all the residues that were explicitly mentioned in the 49 publications describing the 102 analyzed structures, yielding a total of 733 named residues. Of these, 53 (7%) were among the residues we identified as being incompatible with experimental data, the same percentage that we found for all residues. For example, residues R32, D62, R497 and E533 in Fig. 3 are all in this group of functional residues that are incompatible with experimental data.

As functionally important residues are constrained by evolution, it might have been expected that the evolutionary covariation that forms a central element of AlphaFold prediction¹⁹ would be stronger than average, leading to improved prediction accuracy. On the other hand, these same residues are more conserved than average³³, possibly balancing that effect. In our small sample, we do not see a substantial effect either way; rather, we find that side chains for residues in AlphaFold predictions with functional importance are about as likely to be incompatible with experimental data as other side chains.

Using confidence (pLDDT) to estimate errors in AlphaFold predictions

As AlphaFold predictions can differ substantially from corresponding experimental models, straightforward methods to estimate coordinate uncertainties of these predictions would be useful. As a first step, we superimposed AlphaFold predictions on corresponding deposited models and determined the distance between the C_α atoms in the predicted and deposited models, as well as the confidence (pLDDT) for the predicted C_α atom.

Figure 4a shows the distribution of prediction errors for various ranges of the confidence measure. For comparison, the dashed line in Fig. 4a shows the distribution of differences between matching C_α atoms in pairs of structures containing the same components but crystallized in different space groups. The median prediction error for very high-confidence (pLDDT > 90) residues was 0.6 Å, while for residues with pLDDT between 80 and 90 it was 1.1 Å, and for those between 70 and 80 it was 1.5 Å (Table 1). By comparison, matching C_α atoms in pairs of structures in different space groups differed by a median of 0.3 Å. Figure 4b shows that morphing one member of each pair as described above reduces the differences over all confidence ranges, but differences between matching pairs of structures in the PDB are reduced similarly.

**Fig. 4: Distribution of prediction errors for ranges of AlphaFold prediction confidence.**

Table 1 Median prediction error and percentage with prediction error over 2 Å by AlphaFold confidence

The relevance of the median coordinate errors found above depends on what the coordinates are going to be used for^13,14. If coordinates are intended for use in comparing distantly related structures to infer evolutionary and structural relationships, where typical differences among structures may be large (for example, 2–3 Å), median coordinate errors of 1.1 Å may have little effect on the analysis. On the other hand, the same coordinate errors might substantially affect an analysis involving docking of a ligand to identify specific protein–ligand interactions.

We note that the distributions in Fig. 4 do not resemble the Maxwell–Boltzmann distribution expected for random three-dimensional Gaussian errors (there is an excess kurtosis of over 200 for errors in prediction versus an expected value of 0.1). The distributions have a small fraction of values that are very large (long tails in the distributions), so describing uncertainties in terms of r.m.s. errors may not ordinarily be effective. Instead, it may be more useful to note the median errors described above as a measure of typical errors, and to also take into account the percentage of instances where the error is very large (that is, completely wrong). The definition of very large errors will depend on the situation, but often atomic positions that deviate by more than 2 or 3 Å are of limited value.
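
The expected value quoted above can be checked from the moments of the Maxwell–Boltzmann distribution, and the same statistic can be computed for any list of observed errors. The sketch below is illustrative only; the function names are hypothetical and the sample estimator is the simple population form, which may differ from the estimator used in the analysis.

```python
import math


def maxwell_excess_kurtosis():
    """Excess kurtosis of the Maxwell–Boltzmann distribution.

    Uses the raw moments of the length r of an isotropic three-dimensional
    Gaussian error with unit variance per axis:
    E[r] = 2*sqrt(2/pi), E[r^2] = 3, E[r^3] = 8*sqrt(2/pi), E[r^4] = 15.
    """
    m1 = 2.0 * math.sqrt(2.0 / math.pi)
    m2 = 3.0
    m3 = 8.0 * math.sqrt(2.0 / math.pi)
    m4 = 15.0
    variance = m2 - m1 ** 2
    # Fourth central moment expressed through the raw moments.
    central4 = m4 - 4.0 * m1 * m3 + 6.0 * m1 ** 2 * m2 - 3.0 * m1 ** 4
    return central4 / variance ** 2 - 3.0


def sample_excess_kurtosis(values):
    """Population-form excess kurtosis of a list of error magnitudes."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    if m2 == 0.0:
        raise ValueError("all values are identical; kurtosis is undefined")
    return m4 / m2 ** 2 - 3.0


expected = maxwell_excess_kurtosis()  # approximately 0.108
```

A few very large errors among many small ones drive the sample value far above this reference, which is why a heavy-tailed error distribution is poorly summarized by an r.m.s. value.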

For the structures analyzed here, about 10% of C_α atoms with pLDDT over 90 are found to be in error by over 2 Å, along with 22% of those with pLDDT between 80 and 90, 33% of those between 70 and 80, and 77% of those with pLDDT under 70 (Table 1). For comparison, just 5% of C_α atoms in the matched pairs of structures in the PDB crystallized in different space groups we analyzed differ by over 2 Å.
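
The two summary statistics reported in Table 1 (median error and percentage of errors over 2 Å per confidence range) can be tabulated as follows. This is a minimal sketch with made-up input values; the record format, the function names and the treatment of values falling exactly on a bin boundary are assumptions.

```python
from statistics import median


def plddt_bin(plddt):
    """Assign a pLDDT value to one of the confidence ranges used in the text.

    Boundary handling (for example, a pLDDT of exactly 90) is an assumption.
    """
    if plddt > 90.0:
        return ">90"
    if plddt >= 80.0:
        return "80-90"
    if plddt >= 70.0:
        return "70-80"
    return "<70"


def summarize_errors(records, large_error=2.0):
    """Median error and percentage of large errors for each confidence range.

    records: iterable of (plddt, ca_error) pairs, with ca_error the distance
    in angstroms between matched C-alpha atoms after superposition.
    """
    grouped = {}
    for plddt, error in records:
        grouped.setdefault(plddt_bin(plddt), []).append(error)
    summary = {}
    for label, errors in grouped.items():
        n_large = sum(1 for e in errors if e > large_error)
        summary[label] = {
            "n": len(errors),
            "median": median(errors),
            "percent_over": 100.0 * n_large / len(errors),
        }
    return summary


# Illustrative values only, not data from the study.
toy_records = [(95, 0.3), (92, 0.5), (91, 2.5), (85, 1.0), (65, 1.0), (60, 4.0)]
toy_summary = summarize_errors(toy_records)
```

Reporting the median together with the percentage of large errors, rather than a single r.m.s. value, keeps a small number of grossly wrong positions from dominating the description of typical accuracy.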

The extent of agreement between AlphaFold predictions and experimental data found here is consistent with results of the uncertainty quantification carried out by DeepMind during the development of AlphaFold¹. That analysis showed that estimated model accuracy (pLDDT) was an unbiased predictor of actual model accuracy (LDDT), and that the correlation between pLDDT estimates and actual LDDT was about 0.76. The uncertainty quantification further estimated that 7% (for pLDDT > 90) to 30% (for 70 < pLDDT < 90) of side chains have a χ₁ angle deviation of at least 40°. Such a deviation typically leads to an r.m.s.d. of side-chain atoms of over 1.5 Å. In our analysis, the average pLDDT was 94, with 12% of residues having a pLDDT between 70 and 90. Therefore, the errors estimated in AlphaFold development are generally consistent with our observation that between 7% and 20% of side chains with pLDDT of 70 or above are incompatible with experimental data.

Conclusions

While AlphaFold predictions are often astonishingly accurate (for example, Fig. 1a), we find that many parts of AlphaFold predictions are incompatible with experimental data from corresponding crystal structures. In particular, our results show that AlphaFold predictions are not better representations of the contents of a crystal than the models deposited in the PDB, as the deposited models agree much more closely with experimental data where the predicted and deposited models differ. Our results also show that even very high-confidence AlphaFold predictions differ from corresponding models deposited in the PDB by about twice as much as pairs of high-resolution structures in the PDB that were crystallized in different space groups, indicating that AlphaFold predictions are in error by more than the amount that might be expected due to flexibility. We note that as AlphaFold prediction does not take into account the presence of ligands, ions, covalent modifications or environmental conditions, it cannot be expected to correctly represent the many details of protein structures that depend on these factors.

A confidence metric (pLDDT) is produced for each AlphaFold prediction. This confidence metric was examined in detail by the DeepMind team and was shown to be unbiased (equally likely to be too low or too high) and to have a good correlation to the LDDT metric that it estimates (Pearson’s correlation of 0.76)¹. This confidence metric can therefore be a very useful residue-specific indicator of the accuracy of a prediction. For the structures examined here, the parts of AlphaFold predictions that had very high confidence (pLDDT > 90, 86% of residues in the analysis) were generally quite accurate (median C_α coordinate difference from deposited model of 0.6 Å). It is important to note, however, that about 10% of residues predicted with very high confidence differed from the deposited model by over 2 Å (Table 1).

Despite their limitations, AlphaFold predictions are already changing the way that hypotheses about protein structures are generated and tested^1,2,5,6. Indeed, even though not all parts of AlphaFold predictions are accurate, they provide plausible hypotheses that can suggest mechanisms of action and allow the design of experiments with specific expected outcomes. Using these predictions as starting hypotheses can also greatly accelerate the process of experimental structure determination^27,34,35. AlphaFold predictions often have very good stereochemical characteristics, making them excellent hypotheses for local structural features. For example, for the 102 structures analyzed here, the mean percentage of residues with ‘favored’ Ramachandran configurations was 98%, greater than that of the corresponding deposited models (97%), and the mean percentage of side-chain conformations classified as outliers was just 0.2%, compared with 1.5% for deposited models³¹. Such AlphaFold predictions with highly plausible geometry could be used in later stages of experimental structure determination as potential conformations for segments of structure that are not fully clear in experimental density maps.

All these capabilities are very likely just the beginning of an age of increasingly broad use of AI methods in structural biology¹². AI approaches will surely be extended from proteins to include nucleic acids, ligands, covalent modifications, environmental conditions, interactions among all these entities and multiple structural states. The accuracies of these predictions and of the uncertainties associated with them are very likely to improve continuously as additional factors are included and as databases of sequence and structural information expand. The resulting predictions will be increasingly useful structural hypotheses that will form a solid foundation for experimental and theoretical analyses of biological systems.

Methods

Experimental data, models, AlphaFold predictions and density maps

We used the results of our automated structure redeterminations³¹ for crystallographic PDB deposits in this work. The structures in that study were chosen based on the method of structure solution (single-wavelength anomalous diffraction), used as a proxy for relatively challenging structure determinations. The anomalous data were not used in our structure redeterminations, that is, the Bijvoet pairs were averaged. All the unique, protein-containing structures in a 6-month period (December 2021 to May 2022) were analyzed (215 structures). Structures were determined with molecular replacement using trimmed AlphaFold predictions³⁶ as search models, followed by iterative model rebuilding and AlphaFold prediction²⁷. In this work we use the initial AlphaFold predictions (made without templates) and the final density-modified electron density maps³⁷ from those analyses. Except as noted, in this work we used only structures yielding a free R value of 0.30 or lower (102 structures) to ensure that the density-modified electron density maps used as a reference were of high quality.

Model morphing with a distortion field

We used a morphing procedure based on a smoothed distortion field³⁸ to modify one model to make it globally more similar to another model, while retaining local differences. In this procedure any point in space has an associated shift vector, the shift that is to be applied to any atom located at that point in space. This association of a vector to each point in space amounts to a shift or distortion field. To create a smoothly varying distortion field relating a pair of structures, we first create an exact distortion field that maps one structure onto the other; then this field is smoothed.

First, the two structures are superimposed. Then a set of positions in space and corresponding shift vectors is created, with the positions in space y_i corresponding to C_α atom coordinates in one structure, and the shift vectors v_i corresponding to the differences between matching C_α atoms in the two structures. At this point, each of these positions in space has the property that if the associated shift vector is added, it will match the corresponding C_α atom coordinate in the other structure. This exact distortion field is defined only at the C_α atom coordinates of the first structure.

Then we create a smoothed distortion field v(x) that is defined at any point in space x by averaging all the shift values in the exact distortion field, weighting individual shifts v_i with a weight w_i based on the distances between their positions in space y_i and that point x,

$$w_{i}=\exp\left(-\lVert \mathbf{y}_{i}-\mathbf{x}\rVert^{2}/u^{2}\right),$$

where the scaling factor u determines the distance over which smoothing occurs, typically set to 15 Å.
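
The smoothed distortion field can be sketched directly from this definition. The code below is an illustration and not the Phenix implementation: the function names are hypothetical, and dividing by the sum of the weights is our reading of "averaging" in the description above.

```python
import math


def smoothed_shift(x, positions, shifts, u=15.0):
    """Smoothed shift v(x): weighted average of the exact shifts v_i.

    positions: C-alpha coordinates y_i of the model being morphed.
    shifts: exact shift vectors v_i (target minus moving C-alpha position).
    Weights are w_i = exp(-|y_i - x|^2 / u^2); normalizing by the sum of
    the weights is an assumption of this sketch.
    """
    weights = [math.exp(-math.dist(y, x) ** 2 / u ** 2) for y in positions]
    total = sum(weights)
    if total == 0.0:
        # Point is too far from every C-alpha atom for any weight to register.
        return (0.0, 0.0, 0.0)
    return tuple(
        sum(w * v[k] for w, v in zip(weights, shifts)) / total for k in range(3)
    )


def morph(atoms, ca_moving, ca_target, u=15.0):
    """Apply the smoothed field to every atom of the (superimposed) moving model."""
    shifts = [
        tuple(t[k] - m[k] for k in range(3)) for m, t in zip(ca_moving, ca_target)
    ]
    morphed = []
    for atom in atoms:
        v = smoothed_shift(atom, ca_moving, shifts, u)
        morphed.append(tuple(atom[k] + v[k] for k in range(3)))
    return morphed
```

Because each atom receives an average of shifts from C_α atoms within roughly u of it, neighboring atoms move almost identically. Rigid-body-like domain shifts and gradual distortions are therefore removed, while local differences such as an alternative loop or side-chain conformation are largely retained.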

Analytical procedures

Map–model correlations for predicted models were calculated after superposition on the corresponding deposited models.

For structures with more than one chain, only the first chain was included for each structure in comparisons.

Side-chain grafting

The grafting procedure was carried out using the model_building.graft_side_chains method in Phenix. This function identifies matching residues in two models and then uses the coordinates of atoms in the main chain for a residue in one model to position the main-chain and side-chain atoms in a matched residue from another. We excluded residues with low confidence (pLDDT < 70, 2% of the total residues), and residues that participate in crystal contacts (any atom in the residue within 6 Å of any atom in a symmetry-related molecule, 23% of all residues).

Choice of examples of AlphaFold predictions with varying relationships to density maps

The goal of Fig. 1 is to illustrate four situations that occurred among the AlphaFold predictions that we examined. We noticed four distinct situations (prediction agrees exceptionally well with density map, prediction does not match density map, prediction does not match density map but might be correct, prediction is distorted relative to the map). We then chose one example of each type that was as clear as possible and that contained only very high-confidence parts of these predictions to display.

Evaluation of compatibility of side-chain positions with density maps

We identified side-chain conformations in AlphaFold predictions that were incompatible with corresponding electron density maps as cases where the predicted side-chain conformation matched the density map much more poorly than the deposited model and differed substantially from that found in the corresponding deposited model. To focus on the side-chain conformation separately from the overall location and orientation of each residue, we used the side-chain grafting procedure described above to orient the main chain of each residue from an AlphaFold prediction to match the main chain of the corresponding residue in the deposited model. We considered side chains to differ substantially if the r.m.s.d. of side-chain atoms beyond the C_β atom was greater than 1.5 Å.
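
The 1.5-Å criterion can be expressed as a short check on a pair of grafted residues. This is a minimal sketch under stated assumptions: residues are represented as dictionaries from standard PDB atom names to coordinates (a hypothetical input format), only heavy atoms are supplied, and atoms are matched by name without accounting for symmetric side chains.

```python
import math

# Main-chain atoms (including the terminal OXT) and C-beta are excluded,
# so only side-chain atoms beyond C-beta enter the comparison.
MAIN_CHAIN_AND_CB = {"N", "CA", "C", "O", "OXT", "CB"}


def side_chain_rmsd(residue_a, residue_b):
    """R.m.s.d. over side-chain atoms beyond C-beta present in both residues.

    Returns None when there are no such atoms (for example, Gly or Ala).
    """
    names = sorted((set(residue_a) & set(residue_b)) - MAIN_CHAIN_AND_CB)
    if not names:
        return None
    squared = sum(math.dist(residue_a[n], residue_b[n]) ** 2 for n in names)
    return math.sqrt(squared / len(names))


def differs_substantially(residue_a, residue_b, threshold=1.5):
    """True when the side-chain r.m.s.d. exceeds the threshold (in angstroms)."""
    rmsd = side_chain_rmsd(residue_a, residue_b)
    return rmsd is not None and rmsd > threshold
```

A production version would also need to treat chemically equivalent atoms (for example, the two carboxylate oxygens of Asp or a flipped Phe ring) as interchangeable, which this name-based matching does not do.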

We then identified incompatible AlphaFold side-chain conformations as those that were highly unlikely (P < 0.01) to be as compatible with the density map as the deposited model. This probability was estimated from the uncertainty of density values in each map and the number of independent points sampled by side-chain atomic positions in that map. To obtain the uncertainty of density values, we calculated the r.m.s. difference between Fobs and Fcalc maps obtained from the phenix.refine³⁹ software using the deposited model and crystallographic data to calculate the maps. To estimate the number of independent points sampled by side-chain atomic positions for a particular side chain, we counted the number of side-chain atoms that could be selected where each atom is separated from all others by at least the resolution of the data.

As an example of this procedure, for the 7vgm case shown in Fig. 3, the mean electron density map value at atoms in the side chain of residue R32 in the deposited model was 2.8, and the mean density for the side chain from the AlphaFold prediction was 0.1 (the map is normalized to have a mean of zero and r.m.s. of 1). These side chains differed by an r.m.s.d. of 3.9 Å, and the six side-chain atoms corresponded to approximately four unique positions in the map (four positions that are each separated from the others by at least the resolution of the map). The normalized map had an estimated uncertainty of 0.8 (based on agreement between the calculated and observed structure factor amplitudes), leading to a probability of P < 10⁻¹⁰ that the AlphaFold prediction is actually in better agreement with the map than the deposited model.
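The numbers in this example can be reproduced approximately with a simple Gaussian model. The text does not give the exact formula used, so the following is a sketch under stated assumptions: the uncertainty of a mean over n independent points is taken as the map uncertainty divided by the square root of n, and the probability is the one-sided Gaussian tail for the observed difference in mean density. The function name is hypothetical.

```python
import math

def prob_prediction_fits_as_well(mean_deposited, mean_predicted,
                                 sigma_map, n_independent):
    """One-sided Gaussian tail probability for the difference in mean density.

    Assumption: the standard error of the mean density is
    sigma_map / sqrt(n_independent).
    """
    standard_error = sigma_map / math.sqrt(n_independent)
    z = (mean_deposited - mean_predicted) / standard_error
    return 0.5 * math.erfc(z / math.sqrt(2.0))

# Values quoted in the text for residue R32 of 7vgm.
p = prob_prediction_fits_as_well(2.8, 0.1, 0.8, 4)
```

Under these assumptions the difference of 2.7 corresponds to about 6.75 standard errors, and `p` falls below 10⁻¹⁰, consistent with the value quoted above.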

Graphics software used

ChimeraX⁴⁰ was used for graphics display.

Control experiments and limitations

Our analysis of side-chain conformations is based on the premise that the backbone conformation of the deposited model is largely correct. However, it is possible that the backbone is systematically distorted at residues with incorrect rotamers, as the main-chain atom positions might compensate for errors in the side chain. We checked for this scenario by refitting the side chains for all 102 structures, and applying a ‘backrub’ correction to the main chain to correct for these distortions if necessary⁴¹. A repeat of our analysis, skipping the 4% of side chains where a backrub correction was applied (C_β shift⁴¹ of more than 0.2 Å), yielded very similar results, with 18% of residues differing in side-chain orientation and again 7% overall clearly incompatible with experimental data.

We also checked for the possibility that backbone conformations might differ in the two models for some residues, making the grafting procedure inappropriate. We repeated our analysis, removing all residues where the Ramachandran angles differed in the two structures by more than 30° (10% of all residues). Once again, the results were similar, with 17% of residues differing in side-chain orientation and 7% overall clearly incompatible with experimental data.
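A filter of this kind has to treat angles as periodic, so that 175° and -175° count as 10° apart rather than 350°. The sketch below shows one way to do this. It assumes that a residue is excluded when either φ or ψ differs by more than the cutoff, which the text does not specify, and the function names are hypothetical.

```python
def angle_difference(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    diff = (a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)

def backbone_differs(phi_psi_a, phi_psi_b, cutoff=30.0):
    """True if phi or psi differs by more than `cutoff` degrees (assumed rule)."""
    return any(angle_difference(a, b) > cutoff
               for a, b in zip(phi_psi_a, phi_psi_b))
```

With this rule, a residue with (φ, ψ) of (-60°, 175°) in one model and (-60°, -175°) in the other is retained, because the ψ angles differ by only 10°.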

Our test set of residues (from 102 PDB entries for most analyses, 215 for some) is a small fraction of those in the entire PDB, so it could be useful to analyze a larger, more representative set. Most of the residues in our analysis had very high confidence, with 86% having pLDDT values above 90, 10% from 80 to 90, 2% from 70 to 80 and 2% under 70. In contrast, in the AlphaFold prediction of the human proteome²⁵, only 36% of residues had pLDDT values above 90, and 42% were under 70. The small fraction of residues with predictions under 80 may lead to some uncertainty in the error estimates for moderate- and low-confidence predictions in Table 1. The median r.m.s.d. between AlphaFold predictions and deposited models in the PDB in our analysis (1.0 Å; Fig. 2c) was considerably lower than that obtained in a large-scale analysis of recent structures by DeepMind¹ (2.3 Å for all C_α atoms, 1.5 Å excluding the largest 5% of differences), perhaps due to the high confidence in prediction in our sample.
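The confidence distribution of any set of predictions can be tabulated in the same four ranges. The sketch below assumes that per-residue pLDDT values have already been extracted into a list (AlphaFold writes them to the B-factor column of its coordinate files). The handling of values exactly on a boundary is an assumption, as the text does not state it, and the function name is hypothetical.

```python
def plddt_fractions(plddt_values):
    """Fraction of residues in each pLDDT range used in the text."""
    if not plddt_values:
        raise ValueError("no pLDDT values supplied")
    counts = {"above 90": 0, "80 to 90": 0, "70 to 80": 0, "under 70": 0}
    for value in plddt_values:
        if value > 90:
            counts["above 90"] += 1
        elif value >= 80:      # assumed: boundaries belong to the higher bin below 90
            counts["80 to 90"] += 1
        elif value >= 70:
            counts["70 to 80"] += 1
        else:
            counts["under 70"] += 1
    total = len(plddt_values)
    return {label: count / total for label, count in counts.items()}
```

Comparing these fractions with those of a reference set, such as the human proteome predictions, shows whether error estimates derived from one set are likely to transfer to the other.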

As we wanted to estimate the accuracy of the 200 million predictions made with the standard version of AlphaFold, we did not remove entries that might be better predicted with a multimer version of AlphaFold¹⁶. For example, PDB entry 7e1d is a domain-swapped dimer⁴² that was predicted by AlphaFold to be a compact chain.

In some instances, domain-swapping or other incorrect connections between domains resulted in very large differences between predictions and deposited models. Therefore, we attempted to reduce the effect of these outlier structures by quoting median values where possible.
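The effect of a single outlier on the mean, and the robustness of the median, can be seen in a small example. The r.m.s.d. values below are invented for illustration and are not taken from our data.

```python
import statistics

# Hypothetical r.m.s.d. values in Å; the last mimics a domain-swapped outlier.
rmsd_values = [0.8, 0.9, 1.0, 1.1, 25.0]

mean_rmsd = statistics.mean(rmsd_values)      # pulled upward by the outlier
median_rmsd = statistics.median(rmsd_values)  # unaffected by the outlier
```

Here the median remains at the central value of 1.0 Å, whereas the mean is several times larger because of the one outlier.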

We used a local installation of AlphaFold for our predictions and did not use templates from the PDB in prediction, which could reduce the accuracy of the predicted models. However, a comparison of our AlphaFold predictions with those in the AlphaFold database¹⁰, which included templates in prediction, indicates that this effect is likely to be small. We identified 81 models in the AlphaFold database that corresponded to the first chain in one of our 102 analyses. The median C_α atom r.m.s.d. between our initial predicted models³¹ and the corresponding chains in the AlphaFold database was just 0.54 Å. The predictions from the AlphaFold database had a median r.m.s.d. of 1.15 Å compared with deposited models; our predictions without templates also had a median r.m.s.d. of 1.15 Å.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Input data for deposited models were taken from the Protein Data Bank. The 102 accession codes used were as follows: 7e0m, 7fhr, 7v6p, 7Ljh, 7p3a, 7v38, 7v3b, 7o9p, 7rLz, 7qdv, 7ewj, 7rw4, 7waa, 7kdx, 7fiu, 7n3v, 7ptb, 7dtr, 7aoj, 7rc2, 7tcr, 7wja, 7vnx, 7x8v, 7raw, 7rpy, 7aov, 7tb5, 7t8L, 7vwk, 7ne9, 7nqd, 7s5L, 7wbk, 7x77, 7e3z, 7f0o, 7v1q, 7etx, 7ety, 7ecd, 7dxn, 7eyj, 7e4d, 7wsj, 7fi3, 7wnn, 7vgm, 7eio, 7v9n, 7tvc, 7Lbk, 7e6v, 7b3n, 7bLL, 7djj, 7dms, 7dqx, 7drh, 7dri, 7e1d, 7e85, 7edc, 7ejg, 7es4, 7esi, 7eus, 7ew8, 7exx, 7f2a, 7fjg, 7kzh, 7Lsv, 7mku, 7naz, 7ncy, 7nxg, 7o51, 7o5y, 7oc3, 7oom, 7oq6, 7qs4, 7rm7, 7t7j, 7tbs, 7tem, 7tfq, 7tj1, 7tL5, 7tmu, 7tog, 7toj, 7trv, 7trw, 7tt9, 7twc, 7tzp, 7unn, 7w3s, 7wdq, 8cuk. All models are downloadable from the PDB with links such as https://files.rcsb.org/download/7tzp.pdb or (for larger models that are not available in this format) https://files.rcsb.org/download/7tzp.cif. We used the Phenix tool fetch_pdb to download models and crystallographic data for each structure. Predicted models, rebuilt models and density-modified map coefficients are available at https://phenix-online.org/phenix_data/terwilliger/alphafold_crystallography_2022/ along with a spreadsheet that contains all the raw data and analyses described in our previous work³¹ and described here. The directory terwilliger/alphafold_crystallography_2022/ contains a README file describing the contents of the site, the spreadsheet and a data directory with one compressed archive for each structure containing models and crystallographic data files. This directory also contains a compressed archive (alphafold_crystallography.tgz) containing all the data and all the scripts used to create the spreadsheet.
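The download links follow a regular pattern, so they can be generated from the accession codes. The sketch below builds the two URLs described above for a given code. Lower-casing the code is an assumption made here, because some codes in the list are written with a capital L for legibility; the function name is hypothetical and no network access is performed.

```python
def rcsb_download_urls(accession_code):
    """Return candidate download URLs (PDB format first, then mmCIF) for one entry."""
    code = accession_code.lower()  # assumed: links use lower-case codes, as in 7tzp
    base = "https://files.rcsb.org/download/"
    return [f"{base}{code}.pdb", f"{base}{code}.cif"]

urls = rcsb_download_urls("7tzp")
```

A caller would try the first URL and fall back to the second for larger models that are not available in PDB format.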

Code availability

All code for the Phenix version of the AlphaFold2 Colab is freely available on GitHub at https://github.com/phenix-project/Colabs. All code for Phenix is available at phenix-online.org.

References

Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021).
Lin, Z. et al. Evolutionary-scale prediction of atomic level protein structure with a language model. Science 379, 1123–1130 (2023).
Kryshtafovych, A., Schwede, T., Topf, M., Fidelis, K. & Moult, J. Critical assessment of methods of protein structure prediction (CASP)—round XIV. Proteins 89, 1607–1617 (2021).
Callaway, E. ‘The entire protein universe’: AI predicts shape of nearly every known protein. Nature 608, 15–16 (2022).
Thornton, J. M., Laskowski, R. A. & Borkakoti, N. AlphaFold heralds a data-driven revolution in biology and medicine. Nat. Med. 27, 1666–1669 (2021).
van Breugel, M., Rosa e Silva, I. & Andreeva, A. Structural validation and assessment of AlphaFold2 predictions for centrosomal and centriolar proteins and their complexes. Commun. Biol. 5, 312 (2022).
Subramaniam, S. & Kleywegt, G. J. A paradigm shift in structural biology. Nat. Methods 19, 20–23 (2022).
Ourmazd, A., Moffat, K. & Lattman, E. E. Structural biology is solved—now what? Nat. Methods 19, 24–26 (2022).
Hassabis, D. AlphaFold reveals the structure of the protein universe. DeepMind www.deepmind.com/blog/alphafold-reveals-the-structure-of-the-protein-universe (2022).
Shao, C., Bittrich, S., Wang, S. & Burley, S. K. Assessing PDB macromolecular crystal structure confidence at the individual amino acid residue level. Structure 30, 1385–1394 (2022).
Goulet, A. & Cambillau, C. Present impact of AlphaFold2 revolution on structural biology, and an illustration with the structure prediction of the bacteriophage J-1 host adhesion device. Front. Mol. Biosci. 9, 907452 (2022).
Moore, P. B., Hendrickson, W. A., Henderson, R. & Brunger, A. T. The protein-folding problem: not yet solved. Science 375, 507 (2022).
Acharya, K. R. & Lloyd, M. D. The advantages and limitations of protein crystal structures. Trends Pharmacol. Sci. 26, 10–14 (2005).
Fraser, J. S. et al. Accessing protein conformational ensembles using room-temperature X-ray crystallography. Proc. Natl Acad. Sci. USA 108, 16247–16252 (2011).
Evans, R. et al. Protein complex prediction with AlphaFold-Multimer. Preprint at https://doi.org/10.1101/2021.10.04.463034 (2022).
Stein, R. A. & McHaourab, H. S. SPEACH_AF: sampling protein ensembles and conformational heterogeneity with Alphafold2. PLoS Comput. Biol. 18, e1010483 (2022).
wwPDB Consortium. Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res. 47, D520–D528 (2018).
Jumper, J. & Hassabis, D. Protein structure predictions to atomic accuracy with AlphaFold. Nat. Methods 19, 11–12 (2022).
van Beusekom, B., Joosten, K., Hekkelman, M. L., Joosten, R. P. & Perrakis, A. Homology-based loop modeling yields more complete crystallographic protein structures. IUCrJ 5, 585–594 (2018).
Hryc, C. F. & Baker, M. L. AlphaFold2 and CryoEM: revisiting CryoEM modeling in near-atomic resolution density maps. iScience 25, 104496 (2022).
Porta-Pardo, E., Ruiz-Serra, V., Valentini, S. & Valencia, A. The structural coverage of the human proteome before and after AlphaFold. PLoS Comput. Biol. 18, e1009818 (2022).
Akdel, M. et al. A structural biology community assessment of AlphaFold2 applications. Nat. Struct. Mol. Biol. 29, 1056–1067 (2022).
Dunker, A. K. et al. Intrinsically disordered protein. J. Mol. Graph. Model. 19, 26–59 (2001).
Tunyasuvunakool, K. et al. Highly accurate protein structure prediction for the human proteome. Nature 596, 590–596 (2021).
Flower, T. G. & Hurley, J. H. Crystallographic molecular replacement using an in silico-generated search model of SARS-CoV-2 ORF8. Protein Sci. 30, 728–734 (2021).
Terwilliger, T. C. et al. Improved AlphaFold modeling with implicit experimental information. Nat. Methods 19, 1376–1382 (2022).
Zhang, Q. et al. Re-sensitization of mcr carrying multidrug resistant bacteria to colistin by silver. Proc. Natl Acad. Sci. USA 119, e2119417119 (2022).
Burkhardt, I., de Rond, T., Chen, P. Y.-T. & Moore, B. S. Ancient plant-like terpene biosynthesis in corals. Nat. Chem. Biol. 18, 664–669 (2022).
Hobbs, S. J. et al. Phage anti-CBASS and anti-Pycsar nucleases subvert bacterial immunity. Nature 605, 522–526 (2022).
Terwilliger, T. C. et al. Accelerating crystal structure determination with iterative AlphaFold prediction. Acta Crystallogr. D Struct. Biol. 79, 234–244 (2023).
Read, R. J. Improved Fourier coefficients for maps using phases from partial structures with errors. Acta Crystallogr. A 42, 140–149 (1986).
Bartlett, G. J., Porter, C. T., Borkakoti, N. & Thornton, J. M. Analysis of catalytic residues in enzyme active sites. J. Mol. Biol. 324, 105–121 (2002).
McCoy, A. J., Sammito, M. D. & Read, R. J. Implications of AlphaFold2 for crystallographic phasing by molecular replacement. Acta Crystallogr. D Struct. Biol. 78, 1–13 (2022).
Barbarin-Bocahu, I. & Graille, M. The X-ray crystallography phase problem solved thanks to AlphaFold and RoseTTAFold models: a case-study report. Acta Crystallogr. D Struct. Biol. 78, 517–531 (2022).
Oeffner, R. D. et al. Putting AlphaFold models to work with phenix.process_predicted_model and ISOLDE. Acta Crystallogr. D Struct. Biol. 78, 1303–1314 (2022).
Terwilliger, T. Maximum-likelihood density modification. Acta Crystallogr. D Biol. Crystallogr. 56, 965–972 (2000).
Cowtan, K., Metcalfe, S. & Bond, P. Shift-field refinement of macromolecular atomic models. Acta Crystallogr. D Struct. Biol. 76, 1192–1200 (2020).
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 68, 352–367 (2012).
Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
Davis, I. W., Arendall, W. B., Richardson, D. C. & Richardson, J. S. The backrub motion: how protein backbone shrugs when a sidechain dances. Structure 14, 265–274 (2006).
Bennett, M. J., Choe, S. & Eisenberg, D. Domain swapping: entangling alliances between proteins. Proc. Natl Acad. Sci. USA 91, 3127–3131 (1994).

Acknowledgements

We acknowledge funding from the Lawrence Berkeley National Laboratory (grant no. DE-AC02-05CH11231, P.D.A.), the National Institutes of Health (grant no. GM063210, P.D.A., J.S.R., R.J.R., T.C.T.), the Wellcome Trust (grant no. 209407/Z/17/Z, R.J.R.) and the Phenix Industrial Consortium (P.D.A.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Authors and Affiliations

New Mexico Consortium, Los Alamos, NM, USA
Thomas C. Terwilliger
Los Alamos National Laboratory, Los Alamos, NM, USA
Thomas C. Terwilliger
Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Dorothee Liebschner, Billy K. Poon, Pavel V. Afonine & Paul D. Adams
Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
Tristan I. Croll, Airlie J. McCoy, Robert D. Oeffner & Randy J. Read
Department of Biochemistry, Duke University, Durham, NC, USA
Christopher J. Williams & Jane S. Richardson
Department of Bioengineering, University of California, Berkeley, CA, USA
Paul D. Adams

Authors

Thomas C. Terwilliger
Dorothee Liebschner
Tristan I. Croll
Christopher J. Williams
Airlie J. McCoy
Billy K. Poon
Pavel V. Afonine
Robert D. Oeffner
Jane S. Richardson
Randy J. Read
Paul D. Adams

Contributions

T.C.T., P.D.A., R.J.R. and J.S.R. conceptualized the study and were responsible for the methodology. T.C.T., A.J.M., B.K.P., P.V.A., T.I.C., C.J.W., D.L. and R.D.O. performed investigations. T.C.T. was responsible for visualization. T.C.T., P.D.A., R.J.R. and J.S.R. were responsible for funding acquisition and project administration and supervised the study. T.C.T. wrote the original draft of the manuscript. T.C.T., P.D.A., R.J.R., J.S.R., A.J.M., B.K.P., P.V.A., T.I.C., C.J.W., D.L. and R.D.O. reviewed and edited the manuscript.

Corresponding author

Correspondence to Thomas C. Terwilliger.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks Sriram Subramaniam and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Arunima Singh, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Terwilliger, T.C., Liebschner, D., Croll, T.I. et al. AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat Methods 21, 110–116 (2024). https://doi.org/10.1038/s41592-023-02087-4

Received: 30 January 2023
Accepted: 11 October 2023
Published: 30 November 2023
Issue Date: January 2024
DOI: https://doi.org/10.1038/s41592-023-02087-4
