Main

Antibodies (Abs) opsonize and neutralize viruses1, working as potent bio-pharmaceuticals in clinical treatments2. An individual is estimated to have around 10⁸ different Abs3 and produces on the order of 10²⁰ Abs in response to viral infections4. Among them, only a small fraction can opsonize, and an even smaller fraction can neutralize, the infecting virus. The majority of these Abs are ‘unseen’: we are blind to their neutralizability with any antigen (Ag) before conducting wet-lab experiments (Fig. 1a). Besides natural Abs, de novo synthetic Abs are also unseen and must be demonstrated experimentally before clinical use. The conventional experiments, including phage display5, enzyme-linked immunosorbent assay (ELISA)6 and pseudovirus assays7, are resource intensive and time consuming8. We seek to develop accurate and fast computational methods for preliminary screening, to reduce blindness and improve foresight for the wet experiments and to accelerate the discovery of novel therapeutic Abs9.

Fig. 1: Motivation and workflow.
figure 1

a, Unseen Abs are those whose interactability with ‘any’ Ag has not been experimentally demonstrated. b, For a seen Ab (Ab2), the backpropagation from its known interaction data can establish the high-quality representation, inferring its interactability with other Ags (for example, Ag2 in a). For an unseen Ab (Ab3), the interactions with Ags are unknown, resulting in failure to learn the representation. c, We construct a relation graph to bridge unseen and seen Abs, in which the nodes represent Abs, the nodes’ attributes are Ab representation and the edges’ weights are the quantified relation among Abs. d, By applying GCNs on the relation graph, the relation among Abs can be quantified and therefore unseen Abs’ representation can be learned and optimized from relational seen Abs in training. e, We demonstrate our methods in HIV, influenza, dengue and SARS-CoV-2 (from our own wet data). We also illustrate our method’s rich interpretability in SARS-CoV-2. This can imply a relation among variants of SARS-CoV-2 from the perspective of Ab–Ag neutralization effects. We accordingly recommended probable broad-spectrum Abs against new variants of SARS-CoV-2 (Omicron).

According to the prediction tasks, studies related to Ab–Ag interaction prediction can be categorized into three main groups: (1) predicting Ab–Ag binding sites, (2) discriminating Ab–Ag binders/non-binders and (3) predicting Ab–Ag neutralization/non-neutralization effects. Given an Ab–Ag binding pair, some studies predicted the binding sites (Parapred10, Fast-Parapred and AG-Fast-Parapred11, PECAN12 and PInet13). Given Ab–Ag pairwise instances, others discriminated binders and non-binders14,15, which is considered the upstream task of predicting binding sites. We note that binding Abs may not neutralize but instead only opsonize pathogens. Opsonization is an indirect anti-viral process, in which Abs bind pathogens as markers to facilitate phagocytosis by macrophages, while neutralization is a direct anti-viral process, in which Abs directly block the attachment of pathogens to host tissues16. In this study, we focus on predicting Ab–Ag neutralization effects.

The methods related to Ab–Ag interaction prediction can be further classified by input: (1) sequence based and (2) structure based. When predicting binding sites, the sequence-based methods combined local neighbourhood and entire sequences (Parapred10) and applied cross-modal attention on Ab and Ag residues (Fast-Parapred and AG-Fast-Parapred11), while the structure-based methods employed graph convolutional networks (GCNs) on Ab and Ag structures (PECAN12) and extracted geometrical features from point clouds derived from structures (PInet13). When classifying binders/non-binders, Mason et al. applied convolutional neural networks (CNNs) on Ab sequences14, and DLAB15 implemented CNNs on crystal or modelled structures. DLAB found that highly accurate crystal structures could enhance performance, whereas modelled structures failed to achieve strong discrimination between binders and non-binders, probably because ‘structure modelling’ and ‘interaction prediction’ were performed in succession and errors in the former were exacerbated in the latter. We note that obtaining highly accurate crystal structures through wet-lab experiments is also laborious and costly, while amino acid sequences are easily and widely accessible in the real world. Additionally, large-scale sequence data can enhance the applicability of methods. Therefore, we propose a sequence-based method, facilitating real-world applications.

Predicting unseen Abs’ neutralizability from amino acid sequences poses two challenges. (1) We face the well-known cold-start problem, that is, an unseen Ab’s neutralization with ‘any’ Ag is unknown. Existing methods learn Ab representation by backpropagating errors from known Ab–Ag interactions (Fig. 1b), which is not applicable to unseen Abs owing to the lack of interaction instances. (2) The expressivity and adaptability of static features for representing Abs and Ags can be limited. Although various protein descriptors exist, for example, k-mer frequency counting (kmer), position-specific scoring matrices (PSSMs) and the protein–protein basic local alignment search tool (BlastP), the feature space can be high dimensional and the features are pre-computed and static; they are unsupervised, not optimized in the training process and probably not optimal for a specific supervised learning task.

To overcome these challenges, we propose a deep Ab–Ag interaction algorithm, named DeepAAI. Our DeepAAI can learn the representation of unseen Abs from seen Abs by constructing two adaptive relation graphs that connect Abs and Ags, respectively, and applying Laplacian smoothing (in GCNs) in the representation of unseen and seen Abs. In the two relation graphs, the nodes represent Abs and Ags, the node attributes are the learned representations of Abs and Ags, and the edge weights are the quantified relation among Abs and among Ags, respectively. Figure 1c shows the Ab relation graph.

Rather than using those high-dimensional and static features directly, DeepAAI applies a neural network to project the original features into a low-dimensional and high-expressivity feature space, in which the representations are used to serve as the node attributes and further quantify the edge weights. The node attributes and the edge weights are not static but dynamically optimized towards the downstream tasks, predicting neutralization effects and estimating 50% inhibition concentration (IC50) values (Fig. 1d). Thereby, the Ab and Ag relation graphs are task oriented and adaptively constructed, predicting the optimal relations among Abs and Ags.

We then predict unseen Abs’ neutralizability by applying GCNs on the relation graphs, conducting Laplacian smoothing between unseen and seen Abs’ representation as transductive learning. Consequently, the unseen Abs’ representation can be learned from the relational seen Abs’ representation and optimized in the training process, guaranteeing that the unseen Abs’ neutralizability can be inferred in a semi-supervised manner.

Additionally, we note that Ab–Ag neutralization is determined by both global and local features. The global features of Abs and Ags determine whether an interaction occurs, while the local features of amino acids at the interface directly affect the affinities. Therefore, besides the adaptive relation graph that learns global features among Abs and Ags, we also adopt a CNN module to learn local features inside an Ab and Ag.

The performance of DeepAAI is demonstrated on the unseen Abs of various viruses, including human immunodeficiency virus (HIV), severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), influenza and dengue (Fig. 1e). Furthermore, as it does not require knowledge on Ab and Ag structures, DeepAAI is friendly to real-world applications. Additionally, the adaptively constructed relation graphs have rich interpretability. The Ab relation graphs imply similarity in Ab neutralization reactions (similar binding regions). The Ag relation graphs indicate relations among different variants of a virus. We accordingly recommend probable broad-spectrum Abs against new variants of a virus.

Results

DeepAAI

DeepAAI has two neural network modules, an adaptive relation graph convolutional network (AR-GCN) and a CNN module14, which learn global representation among Abs/Ags and local representation inside an Ab/Ag, respectively (Fig. 2a).

Fig. 2: DeepAAI.
figure 2

a, DeepAAI consists of an AR-GCN module and a CNN module, learning global representation among Abs/Ags and local representation inside an Ab/Ag, respectively. The AR-GCN adaptively constructs two relation graphs by quantifying relation among Abs and among Ags and learns Ab and Ag representation from relation. Mason’s CNN architecture is also adopted to extract local features from amino acid sequences. The AR-GCN and CNN modules are used to learn both Ab and Ag representations in the same feature space, facilitating Ab and Ag representation fusion. b, The neural network structure of the AR-GCN module. c, The neural network structure of the CNN module.

Relation graph module (AR-GCN)

The AR-GCN adaptively constructs two relation graphs by quantifying the relation among Abs and Ags and then learns Ab and Ag representations by applying GCNs on the two relation graphs. We hypothesize that two Abs participating in similar neutralization effects should be given a close relation, which can be quantified by the two Abs’ representation (equation (1)),

$${R}_{{\mathrm{Ab1}}-{\mathrm{Ab2}}}={{{\mathcal{F}}}}({H}_{{\mathrm{Ab1}}},{H}_{{\mathrm{Ab2}}})$$
(1)

where HAb1 and HAb2 are the two Abs’ representations, RAb1−Ab2 is the relation between Ab1 and Ab2, and \({{{\mathcal{F}}}}\) is a function to quantify relation.

Before quantifying the relation among Abs, we devise two fully connected (FC) layers (with activation functions), which non-linearly transform kmer and PSSMs into a low-dimensional feature space. The non-linear transformation can flexibly learn representation from biological similarity (kmer) and evolutionary information (PSSMs), thereby enriching the relation quantification. The relation has the following properties:

  1. Symmetric: RAb1−Ab2 = RAb2−Ab1.

  2. The absolute value is no more than 1: −1 ≤ RAb1−Ab2 ≤ 1.

  3. The self-loop relation equals 1: RAb1−Ab1 = 1.

Consequently, we construct a relation graph among Abs. A GCN operation is then applied on the relation graph, working as Laplacian smoothing in Abs’ representation (Supplementary Information). Figure 2b describes the neural network structure of AR-GCN.
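
As a minimal illustration (not the authors' released implementation), the sketch below quantifies pairwise relations by cosine similarity of learned Ab representations and checks the three properties listed above; the tensor sizes are arbitrary placeholders.

```python
import torch
import torch.nn.functional as F

def relation_matrix(h):
    """Pairwise Ab relations quantified as cosine similarity of representations h (n x d)."""
    h_unit = F.normalize(h, dim=1)   # scale each representation to unit length
    return h_unit @ h_unit.T         # R[i, j] = cos(h_i, h_j)

h = torch.randn(5, 32)               # 5 antibodies with 32-dimensional representations
R = relation_matrix(h)
assert torch.allclose(R, R.T, atol=1e-6)                    # property 1: symmetric
assert R.abs().max() <= 1 + 1e-6                            # property 2: |R| <= 1
assert torch.allclose(R.diag(), torch.ones(5), atol=1e-5)   # property 3: self-loop relation = 1
```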

CNN module

The CNN module includes one-hot encoding, 1D convolution, maximum pooling, flatten and an FC layer, aiming at learning the local features of an Ab or Ag sequence (Fig. 2c). The kernel size is only two, making this module specifically focus on local feature extraction.

Fusion

Importantly, the AR-GCN and CNN module are also applied in Ag representation learning. Embedding Abs and Ags in the same feature space can facilitate their representation fusion. The fusion is conducted by addition and dot product with a balance coefficient, which is also learnable to avoid human-experience-based settings. Finally, two FC layers are used to predict neutralization effects and estimate IC50 values, respectively. For details on DeepAAI, see Methods.

Performance on HIV

In Methods we describe the details of the HIV dataset curation. We randomly sample 45 Abs from all 242 Abs to serve as unseen Abs, involving 3,301 Ab–Ag pairwise instances in the unseen test set (Fig. 3a). The 45 unseen Abs have no instance that is similar to any instance of the seen Abs (BlastP < 90%). Considering that our task is to predict neutralization effects of Ab–Ag pairwise instances, we define two Ab–Ag pairwise instances as being similar when they have similar Abs (BlastP ≥ 90%), similar Ags (BlastP ≥ 90%) and the same neutralization or non-neutralization effects. Figure 3b shows the multiplicative product of the Ab and Ag BlastP scores between every two Ab–Ag pairs after we remove similar instances.

Fig. 3: Results on the HIV unseen Abs.
figure 3

a, The numbers of Ab–Ag pairwise instances and unique Abs. b, The multiplicative product of BlastP scores of Abs and Ags between every two instances in the total of 27,738 Ab–Ag pairs after we remove similar instances (BlastP ≥ 90%). We zoom in on part of the figure. The diagonal line represents self-relation (equal to 1). c, The performances of neutralization prediction. d, The performances of IC50 estimation. In c and d, the performances are evaluated 20 times in 20 different random seeds. The box plots show median, first and third quartiles, minimum and maximum. Outliers are classified as being 1.5 times outside the interquartile range. The best variants and best baseline models were compared via Mann–Whitney U test (two sided). e, The runtime of every epoch. f, The scatter plots of the penultimate layer’s embedding after PCA. g, The predicted neutralization probabilities in heat maps.

Predicting unseen Abs’ neutralizability

Figure 3c compares the performances of neutralization prediction on the unseen HIV Abs, and Supplementary Table 1a presents the numeric results. In these methods, kmer and PSSM represent the global features while sequence (seq) denotes local features. The three DeepAAI variants—DeepAAI (kmer + seq), DeepAAI (PSSM + seq) and DeepAAI (kmer + PSSM + seq)—outperform all eight baseline methods in accuracy, F1 score, precision–recall area under the curve (PR-AUC) and Matthews correlation coefficient (MCC) with statistical significance (P < 0.05). We also note no statistical significance among the three DeepAAI variants. The variants DeepAAI (kmer + PSSM) and DeepAAI (seq), which reflect the effectiveness of global and local feature extraction, respectively, perform better than the baseline methods but worse than the above three variants that combine global and local features. The results show that combining global and local features is indispensable in predicting Abs’ neutralization with Ags. In the area under the receiver operating characteristic curve (ROC-AUC), only DeepAAI (kmer + seq) outperforms the other methods. Mason’s CNN architecture beats the other baseline methods but loses to DeepAAI.

The results demonstrate that the proposed DeepAAI outperforms the baseline methods and that combining global and local features is indispensable for predicting unseen Abs’ neutralization effects on Ags.

Predicting unseen Abs’ IC50

Figure 3d and Supplementary Table 1b show the performances of IC50 estimation on the unseen Abs. Compared with all the baseline methods, both DeepAAI (PSSM + seq) and DeepAAI (kmer + PSSM) have superior performances in the mean squared error (MSE) and the mean absolute error (MAE). In MSE, DeepAAI (kmer + PSSM) performs better than DeepAAI (PSSM + seq). AG-Fast-Parapred architecture is the best baseline method.

Runtime

Figure 3e compares the runtime of every epoch on an NVIDIA GeForce GTX 1080 Ti GPU, in which 22,359 Ab–Ag pairwise instances are learned. Compared with the baseline methods, DeepAAI is computationally inexpensive because it avoids time-consuming recurrent neural networks and attention algorithms.

Visualization

We transform the values in the penultimate layer to a two-dimensional space by principal component analysis (PCA), which describes the learned representation of Ab–Ag pairwise instances and gives a view of what the methods have learned (Fig. 3f). DeepAAI has higher intra-class similarity and better inter-class boundaries, while the best baseline method (Mason’s CNN architecture) mixes the neutralization and non-neutralization instances. Figure 3g shows the predicted probabilities in heat maps; DeepAAI achieves heat maps similar to the experimentally validated results.
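
A sketch of this visualization step is shown below; the variable names `penultimate` and `labels` are placeholders (not from the released code), and random values stand in for the actual activations.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

# penultimate: (n_instances, d) activations of the layer before the output head
# labels: (n_instances,) 0 = non-neutralizing, 1 = neutralizing
penultimate = np.random.rand(200, 64)          # placeholder values
labels = np.random.randint(0, 2, 200)

coords = PCA(n_components=2).fit_transform(penultimate)   # project to 2D
for cls, name in [(0, 'non-neutralizing'), (1, 'neutralizing')]:
    pts = coords[labels == cls]
    plt.scatter(pts[:, 0], pts[:, 1], s=8, label=name)
plt.xlabel('PC1')
plt.ylabel('PC2')
plt.legend()
plt.show()
```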

Predicting seen Abs’ neutralizability and IC50

Although we know seen Abs’ neutralization only with some Ags, we can still predict their neutralization effects with other Ags. As Supplementary Fig. 1 shows, DeepAAI (kmer + seq) and DeepAAI (kmer + PSSM + seq) win in the neutralization prediction, while DeepAAI (kmer + PSSM) surpasses the others in the IC50 estimations.

Label-shuffled control

As an extensive study, we further experiment on the label-shuffled data. The results in Supplementary Table 2 show that, without true knowledge, DeepAAI cannot perform normally, indirectly demonstrating that DeepAAI does not make random predictions but learns valuable knowledge.

Performance on SARS-CoV-2

This experiment investigates whether DeepAAI can be applied to SARS-CoV-2. We collect the Ab–Ag neutralization and non-neutralization instances and Abs’ sequences from the Coronavirus Antibody Database (CoVAbDab)17. Owing to the absence of IC50 values in CoVAbDab, we discriminate only Ab–Ag neutralization and non-neutralization effects. We also have our own wet-lab data as the unseen test data, which were collected from a convalescent individual18. Figure 4a shows the numbers of Ab–Ag pairwise instances and unique Abs.

Fig. 4: Results on the SARS-CoV-2 Abs.
figure 4

a, The numbers of Ab–Ag pairwise instances and unique Abs. b, The performance of our wet-lab Abs on SARS-CoV-2, which are evaluated 20 times in 20 different random seeds. The box plots show median, first and third quartiles, minimum and maximum. Outliers are classified as being 1.5 times outside the interquartile range. The comparisons were carried out via Mann–Whitney U test (two sided) with no adjustment. The upward pointing arrows (↑) mean the higher the better. c, Difference between DeepAAI and BlastP.

Figure 4b shows the performances on our wet-lab Abs of SARS-CoV-2. DeepAAI (kmer + seq) outperforms Mason’s CNN architecture (the best baseline method) by 0.05, 0.13, 0.13, 0.03 and 0.11 in accuracy, F1 score, ROC-AUC, PR-AUC and MCC, respectively. We report only DeepAAI (kmer + seq), owing to its steady performance, and Mason’s CNN architecture, owing to its advantage over the other seven baseline methods in the neutralization prediction on HIV.

Figure 4c shows the predictions by DeepAAI and BlastP. For the BlastP baseline, we assume an Ab neutralizes an Ag when the average BlastP score between the Ab and the known neutralizing Abs is higher than that between the Ab and the known non-neutralizing Abs. The results show that the dynamic relation quantification adapts to the supervised task of neutralization/non-neutralization prediction better than the unsupervised sequence alignment in BlastP.
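
A minimal sketch of this BlastP decision rule, assuming the pairwise BlastP scores to the known neutralizing and non-neutralizing Abs have already been computed (the score values below are made up for illustration):

```python
import numpy as np

def blastp_predicts_neutralization(scores_to_neutralizing, scores_to_non_neutralizing):
    """Predict neutralization when the query Ab is, on average, more similar
    (by BlastP score) to the known neutralizing Abs than to the non-neutralizing ones."""
    return float(np.mean(scores_to_neutralizing)) > float(np.mean(scores_to_non_neutralizing))

# Example with hypothetical scores
print(blastp_predicts_neutralization([82.0, 91.5, 77.3], [60.2, 55.8, 71.0]))  # True
```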

DeepAAI’s interpretation

The relation graphs have rich interpretability. The Ab relation graphs imply the similarity in Ab neutralization reactions (similar binding regions). The Ag relation graphs indicate the relation among the different variants of a virus. Moreover, we recommend probable broad-spectrum Abs against a virus’s new variant.

The relation graph reflects binding regions

In the wet-lab experiments of our previous study, we performed competition ELISAs to determine whether our isolated neutralizing Abs had overlapping or non-overlapping epitopes in the receptor-binding domain (RBD) of S protein (Supplementary Fig. 2). We found that our neutralizing Abs could bind to four groups of five distinct epitopes on the RBD. Therefore, the neutralizing Abs were divided into four mutually exclusive groups, namely RBD groups I–IV in our previous study18.

We compare the quantified relations between neutralizing Abs that belong to the same group and those that belong to different groups (inter). As Fig. 5a shows, the quantified relations between two Abs that belong to the same group are significantly higher than those between Abs that belong to different groups (inter). We exclude group I because only one neutralizing Ab belongs to it and therefore we cannot perform a t-test. This finding shows that the Ab relation can predict an unseen Ab’s binding regions in the virus by examining the unseen Ab’s relation to all the seen Abs that have different binding regions.

Fig. 5: DeepAAI’s interpretability in SARS-CoV-2.
figure 5

a, The relations between two Abs binding to the same RBD group (II, III or IV) are significantly higher than those between Abs binding to different RBD groups (inter). The box plots show median, first and third quartiles, minimum and maximum. The comparisons were carried out via t-test (two sided) without adjustment. n = 15, 21, 36 and 54 in groups II, III, IV and inter, respectively. b, The average closeness among the SARS-CoV-2 variants. Delta has the lowest average closeness (0.36) to the other variants (excluding self-closeness). Omicron has the lowest self-closeness (0.84), indicating greater difference among its subvariants. c, The three most important 3-mers in the heavy sequences of our wet-lab Abs. d, The three most important 3-mers in the light sequences of our wet-lab Abs. e, DeepAAI recommends the 50 most probable Abs that could neutralize Omicron, five of which (in bold) have been previously demonstrated.

Differences among the virus variants implied by relation graph

Figure 5b shows the quantified relation among the SARS-CoV-2 variants. From the perspective of Ab–Ag neutralization effects in DeepAAI, Delta is thought of as the most different variant, which accords with the fact that Delta’s symptoms differ from those associated with the original strain (wild type). Furthermore, the values along the diagonal imply the difference among a variant’s subvariants and sequences from different sources. Omicron is quantified to have the lowest self-relation (0.84) by DeepAAI, indicating greater difference among its subvariants.

Important kmers

Figure 5c,d shows the sequence logos of the three most important ‘3-mers’ in the heavy and light sequences of the SARS-CoV-2 Abs collected from our wet experiments. We record the top three 3-mers in the heavy and light sequences with the highest weights in the two-layer neural network of AR-GCN that projects the kmers into a low-dimensional and high-expressivity feature space. In the heavy chains, the most important 3-mer is located at the tail, and the second and third ones are consecutive from the 44th to 47th amino acids (Fig. 5c). In the light chains, the most important 3-mer is located near the middle, and the second and third ones are also close to each other (Fig. 5d).
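
One possible way to reproduce this kind of ranking is sketched below, scoring each k-mer feature by the magnitude of its outgoing weights in the first projection layer; the absolute-weight-sum aggregation is our assumption, not necessarily the exact rule used in DeepAAI.

```python
import numpy as np

def top_kmers(w_kmer, feature_names, k=3, top_n=3):
    """Rank k-mer features by the summed absolute weight of their row in the first
    FC projection layer (w_kmer: n_features x hidden_dim) and return the top_n
    features of length k."""
    importance = np.abs(w_kmer).sum(axis=1)          # one score per k-mer feature
    candidates = [i for i, name in enumerate(feature_names) if len(name) == k]
    ranked = sorted(candidates, key=lambda i: importance[i], reverse=True)
    return [feature_names[i] for i in ranked[:top_n]]
```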

Broad-spectrum Ab recommendation

We also explore probable broad-spectrum Abs that could neutralize the Omicron variant. Among the total 2,587 SARS-CoV-2 Abs, DeepAAI recommends the 50 most probable Abs (Fig. 5e), 5 of which have been demonstrated previously19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35.

Performances on influenza and dengue

This experiment investigates whether the knowledge learned from Ab–Ag interaction instances of HIV can help to predict Abs’ neutralizability against influenza and dengue. We freeze the AR-GCN and CNN modules and train the final FC layers in transfer learning. The numbers of Ab–Ag pairwise instances and unique Abs are described in Fig. 6a. In Methods, we show the details of dataset curation and transfer learning. In this experiment, all the collected Ab–Ag pairwise instances are neutralizing. As negative sampling may coincidentally bring in neutralizing instances of broad-spectrum Abs, we do not adopt negative sampling to generate non-neutralizing instances but instead use the unseen HIV Abs as non-neutralizing Abs, given Abs’ high specificity. Therefore, we focus on recall, that is, the fraction of positive instances that were correctly predicted (Fig. 6b).

Fig. 6: Results on influenza and dengue Abs.
figure 6

a, The numbers of Ab–Ag pairwise instances and unique Abs. b, The performance on the unseen Abs of influenza and dengue. The performances are evaluated 20 times in 20 different random seeds. The box plots show median, first and third quartiles, minimum and maximum. Outliers are classified as being 1.5 times outside the interquartile range. The comparisons were carried out via Mann–Whitney U test (two sided) with no adjustment. The upward pointing arrows (↑) mean the higher the better.

DeepAAI (kmer + seq) (the best variant on HIV) significantly outperforms Mason’s CNN architecture (the best baseline method on HIV) by 0.10 in recall on influenza. On dengue, both DeepAAI (kmer + seq) and Mason’s CNN architecture perform well and there is no statistically significant difference between them.

Discussion

In this study, we propose DeepAAI to predict unseen Abs’ neutralizability with Ags. DeepAAI achieves outstanding performances on a variety of viruses, including HIV, SARS-CoV-2, influenza and dengue. On the basis of the adaptively constructed relation graphs in DeepAAI, we can infer the similarity in Ab neutralization reactions (similar binding regions) and the relation among the different variants of a virus from the perspective of Ab–Ag neutralization effects, and recommend probable broad-spectrum Abs against a new variant of a virus (Omicron).

As it does not require knowledge on Ab and Ag structures, DeepAAI is friendly to real-world applications. DeepAAI can be used by biologists in two successive steps: (1) predicting Ab–Ag neutralizing/non-neutralizing effects as preliminary screening and then (2) estimating IC50 values to prioritize the subsequent wet-lab validation experiments. We provide a web service of DeepAAI, and the data and codes are freely available.

In this study, we did not use modelled structures because we intended to avoid the two-step prediction of structures and neutralization, in which errors in the former could be exacerbated in the latter. In the future, we may integrate the feature extraction modules of Ab and Ag structure prediction into Ab–Ag interaction prediction if more crystal structures or precisely modelled structures become available.

Methods

Problem definition

The amino acid sequences of an Ab and an Ag are denoted as B = (b1, b2, . . . , bm) and G = (g1, g2, . . . , gn), respectively. The objective is to discriminate neutralization/non-neutralization (classification), \({{{{\mathcal{F}}}}}_{{\mathrm{bin}}}(B,G)=\left\{\begin{array}{l}0,\,{{\mbox{non-neutralization}}}\,\\ 1,\,{{\mbox{neutralization}}}\,\\ \end{array}\right.\), and estimate IC50 values (regression), \({{{{\mathcal{F}}}}}_{{\mathrm{reg}}}(B,G)=\,{{\mbox{IC}}}_{50}\,\).

Data

HIV data

Collect HIV data: Algorithm 1 illustrates the pseudo-code for collecting the HIV data. Note that the non-neutralizing pairwise data of HIV are experimentally demonstrated rather than negatively sampled.

Algorithm 1 The process of the HIV dataset collection.

Require: the data source, that is, the Compile, Analyze and Tally NAb Panels (CATNAP36) at the Los Alamos HIV Database (LANL37)

Ensure: the neutralizing or non-neutralizing Ab–Ag pairwise instances in amino acid sequences

 1: Extract the total assay that pairs Abs and Ags, denoted as T;

 2: Extract the sequences of the heavy and light chains in T, denoted as H and L, respectively;

 3: Unify the forms of H and L to the fragment variable (Fv)—remove constant-heavy-1 (CH1) from H and constant-light (CL) from L when they are in the form of antigen-binding fragment (Fab);

 4: Extract Ag sequences in T, denoted as V;

 5: Pair the H, L and V based on T;

 6: Remove the duplicated pairs in H, L and V;

 7: Remove the pairs that have ‘not available’ (N/A) values in H, L or V;

 8: Collect the IC50 values for the paired H, L and V;

 9: Average IC50 values for any pair that has more than one reported IC50 value;

 10: Set the cut-off at IC50 = 10 μg ml⁻¹ and consider IC50 < 10 μg ml⁻¹ neutralization and IC50 ≥ 10 μg ml⁻¹ non-neutralization;

 11: Return the HIV dataset.
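
Steps 9 and 10 can be sketched with pandas as follows; the column names ('heavy', 'light', 'antigen', 'ic50') are hypothetical, and the cut-off follows the 10 μg ml⁻¹ rule above.

```python
import pandas as pd

def label_hiv_pairs(assays: pd.DataFrame, cutoff: float = 10.0) -> pd.DataFrame:
    """assays: one row per reported measurement, with hypothetical columns
    ['heavy', 'light', 'antigen', 'ic50'] (IC50 in ug/ml)."""
    # Step 9: average IC50 over duplicate Ab-Ag pairs
    pairs = assays.groupby(['heavy', 'light', 'antigen'], as_index=False)['ic50'].mean()
    # Step 10: IC50 < cutoff -> neutralizing (1), otherwise non-neutralizing (0)
    pairs['label'] = (pairs['ic50'] < cutoff).astype(int)
    return pairs
```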

Split unseen and seen Abs: We randomly take 57 Abs to serve as the unseen Abs and the others as the seen Abs. As 12 of the 57 Abs have instances similar to the seen Abs’ instances, we remove them and take the remaining 45 Abs to serve as the unseen Abs. Two Ab–Ag pairwise instances are considered to be similar when they have similar Abs (BlastP ≥ 90%), similar Ags (BlastP ≥ 90%) and the same neutralization/non-neutralization effects. We then split the seen Abs’ Ab–Ag interaction instances into training, validation and seen test sets. We also remove instances in the seen test set that are similar to any instance in the training and validation sets. We include both the unseen and seen Abs in the Ab relation graph before training. We split the seen Abs’ instances, remove similar instances in the seen test set and train the models 20 times with 20 different random seeds. When different seeds are used, the numbers of Ab–Ag pairwise instances and unique Abs in the seen test set vary. Figure 3a shows the data for seed 18.
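
A sketch of the similarity filter used in this split, assuming a hypothetical helper blastp_identity(seq_a, seq_b) that returns a percentage identity:

```python
def is_similar(inst_a, inst_b, blastp_identity, threshold=90.0):
    """Two Ab-Ag instances are 'similar' when both their Abs and their Ags align
    with >= 90% BlastP identity and their neutralization labels agree."""
    return (blastp_identity(inst_a['ab'], inst_b['ab']) >= threshold
            and blastp_identity(inst_a['ag'], inst_b['ag']) >= threshold
            and inst_a['label'] == inst_b['label'])

def filter_similar(candidates, reference, blastp_identity):
    """Drop candidate instances that are similar to any reference (training) instance."""
    return [c for c in candidates
            if not any(is_similar(c, r, blastp_identity) for r in reference)]
```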

CoVAbDab SARS-CoV-2 data

The SARS-CoV-2 data are collected from the CoVAbDab17. The collected data include both neutralizing and non-neutralizing Ab–Ag pairwise instances. The sequences of the SARS-CoV-2 variants are collected from the National Center for Biotechnology Information38. The curated dataset includes the SARS-CoV-2 variants of the wild type, Alpha, Beta, Gamma, Delta and Omicron. For each variant, the sequences of the subvariants from different sources differ. Therefore, we randomly take 5 sequences for each variant except Omicron, for which we take all 11 sequences.

We take the Omicron variant as ‘unseen Ags’. To suggest probable broad-spectrum Abs against the Omicron variant, we exclude the Ab–Ag pairwise instances of the Omicron variant in training but include the Omicron variant (unseen Ags) in the Ag relation graph and the Omicron Abs (unseen Abs) in the Ab relation graph as transductive learning.

Wet-lab SARS-CoV-2 data

In our previous study18, we found a convalescent individual with potent IgG neutralizing activity to SARS-CoV-2 from the hospital volunteers. The volunteer recruitment and the blood draws were performed at the Zhoushan Hospital under a protocol approved by the Zhoushan Hospital Research Ethics Committee (2020-003). Experiments related to all human samples were performed at the School of Basic Medical Sciences, Fudan University under a protocol approved by the institutional ethics committee (2020-C007).

We characterized the Ab responses and isolated monoclonal Abs from the individual’s memory B cells. Consequently, we obtained 36 Abs with confirmed amino acid sequences. The wet-lab Abs are also included in the Ab relation graph as unseen Abs for evaluation. We also conducted ELISA, pseudovirus assays, single-cell sorting and cloning, among other experiments, and found 17 Abs that are neutralizing and 19 that are non-neutralizing to the wild type of SARS-CoV-2. We identified the 17 neutralizing Abs’ binding regions in the RBD, which are used in Fig. 5a. For more details on our wet-lab data, please refer to ref. 18.

Influenza and dengue data

We collect influenza and dengue data from the Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB)39. All the collected Ab–Ag pairwise instances are positive (neutralizing). We do not adopt negative sampling to generate non-neutralizing instances because negative sampling may coincidentally bring in neutralizing instances of broad-spectrum Abs. Considering the high specificity of Abs, we use the unseen HIV Abs as non-neutralizing Abs for influenza and dengue, but exclude the seen HIV Abs because they have been used to train the HIV models and using them could lead to overfitting to these Abs in transfer learning. We ensure that the numbers of neutralization and non-neutralization instances are equal. Finally, we split the data into training and unseen test sets, remove similar instances in the unseen test sets (BlastP ≥ 90%) and include the unseen and seen Abs in the relation graph of Abs.

Features

Amino acid sequences

Amino acid sequences transparently describe amino acids and their sequential positions. We use one-hot encoding.

kmer

The kmer contains two basic characteristics of biological sequences, monomer component information and entire sequence information40, revealing the distribution of entire characteristics and measuring biological similarity for discrimination41. We use k = 1, 2, 3, which generates 21 (20 amino acids + unknown), 21² and 21³ dimensions, respectively. We abandon k = 4 because it generates too many dimensions (194,481), which tends to degrade the algorithm. We then remove the kmer features with a frequency less than 0.05 to prevent overfitting and accelerate training. Finally, a 2,764-dimensional kmer vector remains.
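
As an illustration of the raw kmer encoding (before the frequency filtering described above), the sketch below counts k-mers over a 21-letter alphabet for k = 1, 2, 3, giving 21 + 21² + 21³ = 9,723 dimensions; the ordering of features and the normalization by sequence length are our assumptions.

```python
from itertools import product

ALPHABET = list('ACDEFGHIKLMNPQRSTVWY') + ['X']   # 20 amino acids + unknown

def kmer_vector(seq, ks=(1, 2, 3)):
    """Raw k-mer frequency vector: 21 + 21^2 + 21^3 = 9,723 dimensions for k = 1, 2, 3."""
    seq = ''.join(c if c in ALPHABET else 'X' for c in seq)   # map unknown residues to 'X'
    vec = []
    for k in ks:
        counts = {''.join(p): 0 for p in product(ALPHABET, repeat=k)}
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
        total = max(len(seq) - k + 1, 1)                      # number of k-mers in the sequence
        vec.extend(counts[key] / total for key in sorted(counts))
    return vec
```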

PSSM

The PSSMs reveal evolutionary information and have been successfully applied to improve the performance of various predictors of protein attributes42. We use the Uniref50 database with POSSUM (the position-specific scoring matrix-based feature generator for machine learning)42 to generate PSSMs, encoding evolutionary information in a 420-dimensional vector. Note that using the Uniref50 database will not cause information leakage, since the supervision information comes from the output (that is, Ab–Ag neutralizing/non-neutralizing effects and IC50 values) rather than the input (that is, the sequences of Abs and Ags).

Baseline methods

In this study, we use four types of baseline methods: (a) sequence-based architectures that predict binding sites: Parapred architecture10, Fast-Parapred architecture11 and AG-Fast-Parapred architecture11; (b) sequence-based models that predict protein–protein interactions: PIPR architecture43 and ResPPI architecture44; (c) classic sequential models: Bi-LSTM-Attention45 and TextCNN46; and (d) a sequence-based model that has been demonstrated to be effective in wet-lab experiments: Mason’s CNN architecture14.

For the methods in group (a), Parapred architecture, Fast-Parapred architecture and AG-Fast-Parapred architecture, the original task is to predict the binding site given Ab–Ag binding pairs. We keep their network structures and inputs (amino acid sequences) but modify the prediction tasks from binding sites to neutralization and IC50. For Mason’s CNN architecture, we add an Ag extraction module and an Ab–Ag embedding fusion module, because the original Mason’s CNN learns only Ab features and is specific to one Ag (without fusing various Ags). We follow the other implementation details in the cited papers.

DeepAAI

AR-GCN module

First, we use a learnable embedding layer to project the kmer and PSSM vectors non-linearly into a low-dimensional feature space (equation (2)).

$$\begin{array}{lll}{H}_{{\mathrm{kmer}}}&=&{\sigma }_{{\mathrm{ELU}}}\left({X}_{{\mathrm{kmer}}}{W}_{{\mathrm{kmer}}}\right),\\ {H}_{{\mathrm{PSSM}}}&=&{\sigma }_{{\mathrm{ELU}}}\left({X}_{{\mathrm{PSSM}}}{W}_{{\mathrm{PSSM}}}\right),\end{array}$$
(2)

where σELU refers to the activation function of exponential linear unit (ELU); Xkmer and XPSSM represent the vectors of kmer and PSSM, respectively; Wkmer and WPSSM denote the weights of the FC layers for kmer and PSSM, respectively; and Hkmer and HPSSM are the outputs.

In DeepAAI (kmer + PSSM + seq), we concatenate the representations of kmer and PSSM by HAb = Hkmer ∥ HPSSM. HAb then flows into another FC layer with a hyperbolic tangent (Tanh) activation to further learn the node representation. By calculating the cosine similarity (composed of instance normalization and inner product), we obtain the relation between two Abs as follows.

$$\begin{array}{lll}{H}_{{\mathrm{Ab1}}}&=&{\sigma }_{\tanh }\left({H}_{{\mathrm{Ab1}}}{W}_{{\mathrm{FC}}}\right),\\ {H}_{{\mathrm{Ab2}}}&=&{\sigma }_{\tanh }\left({H}_{{\mathrm{Ab2}}}{W}_{{\mathrm{FC}}}\right),\\ {R}_{{\mathrm{Ab1}}-{\mathrm{Ab2}}}&=&{\mathrm{cosine}}\_{\mathrm{similarity}}({H}_{{\mathrm{Ab1}}},{H}_{{\mathrm{Ab2}}})\\ &=&{\mathrm{Inst}}\_{\mathrm{Norm}}({H}_{{\mathrm{Ab1}}})\cdot {\mathrm{Inst}}\_{\mathrm{Norm}}({H}_{{\mathrm{Ab2}}}),\end{array}$$
(3)

where \({\sigma }_{\tanh }\) refers to the activation function of Tanh; HAb1 and HAb2 are the two Abs’ representations; WFC denotes the weights of the FC layer; Inst_Norm refers to instance normalization; ⋅ is the inner product; and RAb1−Ab2 is the relation between Ab1 and Ab2. Two GC layers are then implemented (equation (4)).

$$\begin{array}{lll}\hat{A}&=&{\tilde{D}}^{-1/2}\tilde{A}{\tilde{D}}^{-1/2},\\ {H}_{\zeta 1}&=&{\sigma }_{{\mathrm{ELU}}}\left(\hat{A}{H}_{{\mathrm{Ab}}}{W}_{\zeta 0}\right),\\ {H}_{\zeta 2}&=&{\sigma }_{{\mathrm{ELU}}}\left(\hat{A}{H}_{\zeta 1}{W}_{\zeta 1}\right),\end{array}$$
(4)

where \(\tilde{A}\) is the adjacency matrix (including self-loops); \(\tilde{D}\) is the degree matrix of \(\tilde{A}\), modified to ensure positive values; \(\hat{A}\) is the symmetrically normalized \(\tilde{A}\); HAb is the Ab representation; Wζ0 and Wζ1 are the weights of the first and second graph convolutional layers, respectively; and Hζ1 and Hζ2 represent the embedding vectors after the first and second graph convolutional layers, respectively. The AR-GCN’s final embedding (HARGCN) is the sum of HAb, Hζ1 and Hζ2 as follows:

$${H}_{{\mathrm{ARGCN}}}={H}_{{\mathrm{Ab}}}+{H}_{\zeta 1}+{H}_{\zeta 2}\,{{\mbox{.}}}\,$$
(5)
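
A minimal PyTorch sketch of the AR-GCN module (equations (2)–(5)) is given below; the hidden sizes, the use of cosine similarity as the relation function and the absolute-value degree normalization are illustrative assumptions rather than the exact released implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ARGCN(nn.Module):
    """Sketch of the AR-GCN module (equations (2)-(5)); layer sizes are illustrative."""
    def __init__(self, kmer_dim=2764, pssm_dim=420, hidden=64):
        super().__init__()
        self.fc_kmer = nn.Linear(kmer_dim, hidden)        # eq. (2) projection of kmer
        self.fc_pssm = nn.Linear(pssm_dim, hidden)        # eq. (2) projection of PSSM
        self.fc_rel = nn.Linear(2 * hidden, 2 * hidden)   # eq. (3) Tanh projection
        self.gc1 = nn.Linear(2 * hidden, 2 * hidden)      # eq. (4) first GC layer
        self.gc2 = nn.Linear(2 * hidden, 2 * hidden)      # eq. (4) second GC layer

    def forward(self, x_kmer, x_pssm):
        h = torch.cat([F.elu(self.fc_kmer(x_kmer)),
                       F.elu(self.fc_pssm(x_pssm))], dim=1)        # node attributes H_Ab
        z = torch.tanh(self.fc_rel(h))
        a_tilde = F.normalize(z, dim=1) @ F.normalize(z, dim=1).T  # cosine-similarity relations
        d = a_tilde.abs().sum(dim=1).clamp(min=1e-6)               # positive "degrees"
        a_hat = a_tilde / torch.sqrt(d[:, None] * d[None, :])      # D^-1/2 A D^-1/2
        h1 = F.elu(self.gc1(a_hat @ h))                            # first GC layer
        h2 = F.elu(self.gc2(a_hat @ h1))                           # second GC layer
        return h + h1 + h2                                         # eq. (5): residual sum
```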

CNN module

A CNN module conducts 1D convolution on the one-hot encoding of amino acid sequences, in which the channels, kernel size, stride and padding are 64, 2, 1 and 1, respectively, making this module focus on local feature learning. After the activation function (ReLU) and dropout (rate 0.5), maximum pooling and flatten are implemented. An FC layer is finally used to output the representation (32 × 1).
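
A sketch of this CNN module in PyTorch is shown below; the maximum sequence length, the pooling window and the batch layout are assumptions, while the channel, kernel, stride, padding, dropout and output sizes follow the description above.

```python
import torch
import torch.nn as nn

class LocalCNN(nn.Module):
    """Sketch of the CNN module: 1D convolution (64 channels, kernel 2, stride 1,
    padding 1) over one-hot encoded sequences, ReLU, dropout 0.5, max pooling,
    flatten and an FC layer that outputs a 32-dimensional representation."""
    def __init__(self, n_symbols=21, max_len=300, channels=64, out_dim=32):
        super().__init__()
        self.conv = nn.Conv1d(n_symbols, channels, kernel_size=2, stride=1, padding=1)
        self.drop = nn.Dropout(0.5)
        self.pool = nn.MaxPool1d(kernel_size=2)           # pooling window is an assumption
        self.fc = nn.Linear(channels * ((max_len + 1) // 2), out_dim)

    def forward(self, x_onehot):          # x_onehot: (batch, n_symbols, max_len)
        x = self.drop(torch.relu(self.conv(x_onehot)))
        x = self.pool(x)
        return self.fc(x.flatten(start_dim=1))
```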

Fusion

To embed both Abs and Ags into the same feature space, we use addition and the dot product with a balance coefficient to fuse the Ab and Ag representations. Two FC layers are then adopted to complete the neutralization prediction and IC50 estimation.

$$\begin{array}{lll}{H}_{{\mathrm{ARGCN}}}&=&\,{H}_{{\mathrm{ARGCN}}-{\mathrm{Ag}}}\parallel {H}_{{\mathrm{ARGCN}}-{\mathrm{Ab}}},\\ {H}_{{\mathrm{local}}}&=&\,{H}_{{\mathrm{local}}-{\mathrm{Ag}}}\parallel {H}_{{\mathrm{local}}-{\mathrm{Ab}}},\\ H&=&\,({H}_{{\mathrm{ARGCN}}}+{H}_{{\mathrm{local}}})+\alpha ({H}_{{\mathrm{ARGCN}}}\odot {H}_{{\mathrm{local}}}),\\ {\hat{Y}}^{({\mathrm{prob}})}&=&\,{\sigma }_{{\mathrm{sigmoid}}}(H{W}_{a}),\\ {\hat{Y}}^{({\mathrm{IC50}})}&=&\,H{W}_{b},\end{array}$$
(6)

where HARGCN−Ag, HARGCN−Ab, Hlocal−Ag and Hlocal−Ab denote the AR-GCN and local (Mason’s CNN architecture) embeddings of Ags and Abs; HARGCN, Hlocal and H denote the concatenated AR-GCN embedding, the concatenated local embedding and the fused embedding, respectively; ⊙ is the Hadamard product; α is the balance coefficient, which is automatically learned; σsigmoid is the sigmoid activation function; Wa and Wb are the FC layers’ weights; and \({\hat{Y}}^{({\mathrm{prob}})}\) and \({\hat{Y}}^{({\mathrm{IC50}})}\) are the predicted neutralization probabilities and the estimated IC50 values, respectively.
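
A sketch of this fusion and the two prediction heads (equation (6)) follows; the embedding dimension and the initial value of α are assumptions.

```python
import torch
import torch.nn as nn

class FusionHead(nn.Module):
    """Sketch of equation (6): concatenate Ag and Ab embeddings from each branch,
    fuse by addition plus a learnable-weighted Hadamard product, then predict the
    neutralization probability and the IC50 value with two FC layers."""
    def __init__(self, dim):
        super().__init__()
        self.alpha = nn.Parameter(torch.tensor(0.5))   # learnable balance coefficient
        self.fc_prob = nn.Linear(2 * dim, 1)
        self.fc_ic50 = nn.Linear(2 * dim, 1)

    def forward(self, h_argcn_ag, h_argcn_ab, h_local_ag, h_local_ab):
        h_argcn = torch.cat([h_argcn_ag, h_argcn_ab], dim=1)   # H_ARGCN = Ag || Ab
        h_local = torch.cat([h_local_ag, h_local_ab], dim=1)   # H_local = Ag || Ab
        h = (h_argcn + h_local) + self.alpha * (h_argcn * h_local)
        return torch.sigmoid(self.fc_prob(h)), self.fc_ic50(h)
```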

Loss function

The two downstream tasks (binary neutralization prediction and IC50 estimation) are conducted separately. For predicting neutralization, the loss function is formulated as equation (7) shows.

$${{{{\mathcal{L}}}}}_{a}=-\mathop{\sum}\limits_{v\in {{{\mathcal{V}}}}}({y}_{v}^{({\mathrm{bin}})}{\mathrm{ln}}({\hat{y}}_{v}^{({\mathrm{prob}})})+(1-{y}_{v}^{({\mathrm{bin}})}){\mathrm{ln}}(1-{{\hat{y}}^{({\mathrm{prob}})}}_{v}))+{\lambda }_{a}\left\Vert \tilde{A}\right\Vert ,$$
(7)

where \(-{\sum }_{v\in {{{\mathcal{V}}}}}({y}_{v}^{({\mathrm{bin}})}{\mathrm{ln}}({\hat{y}}_{v}^{({\mathrm{prob}})})+(1-{y}_{v}^{({\mathrm{bin}})}){\mathrm{ln}}(1-{\hat{y}}_{v}^{({\mathrm{prob}})}))\) is the cross-entropy loss, and \({y}_{v}^{({\mathrm{bin}})}\) and \({\hat{y}}_{v}^{({\mathrm{prob}})}\) are the true label and the predicted probability of v, respectively. \(\tilde{A}\) is the adjacency matrix of the virtual graph (including self-loops), \(\left\Vert \tilde{A}\right\Vert\) is the sum of the absolute values in \(\tilde{A}\), serving as a penalty term, and λa is an adjustable hyper-parameter used to balance the two losses.

When estimating IC50, we calculate the loss function as equation (8) shows.

$${{{{\mathcal{L}}}}}_{b}=\mathop{\sum}\limits_{v\in {{{\mathcal{V}}}}}{({y}_{v}^{{\mathrm{IC50}}}-{\hat{y}}_{v}^{{\mathrm{IC50}}})}^{2}+{\lambda }_{b}\left\Vert \tilde{A}\right\Vert ,$$
(8)

where \({\sum }_{v\in {{{\mathcal{V}}}}}{({y}_{v}^{{\mathrm{IC50}}}-{\hat{y}}_{v}^{{\mathrm{IC50}}})}^{2}\) is the regressive loss in terms of MSE and \({y}_{v}^{{\mathrm{IC50}}}\) and \({\hat{y}}_{v}^{{\mathrm{IC50}}}\) are the true and predicted IC50 values of v, respectively. \(\tilde{A}\) is the adjacency matrix of the virtual graph (including self-loops), \(\left\Vert \tilde{A}\right\Vert\) is the sum of the absolute values in \(\tilde{A}\) as a penalty term, and λb is an adjustable hyper-parameter used to balance the two losses.
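
The two losses (equations (7) and (8)) can be sketched as below; the values of λa and λb are placeholders, not the tuned hyper-parameters.

```python
import torch
import torch.nn.functional as F

def loss_neutralization(y_prob, y_bin, a_tilde, lam_a=1e-4):
    """Equation (7): cross-entropy summed over instances plus the penalty on the
    sum of absolute values of the adjacency matrix (lam_a is an illustrative value)."""
    bce = F.binary_cross_entropy(y_prob, y_bin.float(), reduction='sum')
    return bce + lam_a * a_tilde.abs().sum()

def loss_ic50(y_pred, y_true, a_tilde, lam_b=1e-4):
    """Equation (8): squared error summed over instances plus the same adjacency penalty."""
    mse = F.mse_loss(y_pred, y_true, reduction='sum')
    return mse + lam_b * a_tilde.abs().sum()
```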

Transfer learning from HIV to influenza and dengue

Inspired by natural language processing, we treat biological sequences (especially amino acid sequences) as meaningful protein languages. Therefore, the representation of amino acid fragments (such as kmers) is expected to improve the reliability and stability of a prediction model by pre-training the model on a large amount of relevant data and transferring the knowledge to a target domain. Transfer learning can reduce the dependence on the amount of target-domain data. Considering that HIV, influenza and dengue are all viruses and that HIV has accumulated enough Ab–Ag interaction data, we conduct transfer learning from HIV to influenza and dengue.

DeepAAI can be divided into three parts: AR-GCN, CNN and the final FC layers with fusion. The AR-GCN and CNN modules are used to extract features from Abs and Ags. The final FC layers learn to predict neutralizing/non-neutralizing effects based on the features extracted by the AR-GCN and CNN modules. We freeze the parameters in the AR-GCN and CNN modules when we conduct transfer learning. Nonetheless, we fine-tune the final FC layers, considering that different viruses have different neutralizing mechanisms.
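
A sketch of the freezing step for transfer learning; the attribute names (`argcn`, `cnn`) and the optimizer settings are assumptions about how the model object is organized.

```python
import torch

def prepare_transfer(model, lr=1e-3):
    """Freeze the AR-GCN and CNN feature extractors; only the remaining parameters
    (the final FC layers with fusion) stay trainable."""
    for module in (model.argcn, model.cnn):       # hypothetical attribute names
        for p in module.parameters():
            p.requires_grad = False
    trainable = [p for p in model.parameters() if p.requires_grad]
    return torch.optim.Adam(trainable, lr=lr)
```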

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.