Extensive antibody search with whole spectrum black-box optimization

Tučs, Andrejs; Ito, Tomoyuki; Kurumida, Yoichi; Kawada, Sakiya; Nakazawa, Hikaru; Saito, Yutaka; Umetsu, Mitsuo; Tsuda, Koji

doi:10.1038/s41598-023-51095-z

Download PDF

Article
Open access
Published: 04 January 2024

Extensive antibody search with whole spectrum black-box optimization

Andrejs Tučs¹^na1,
Tomoyuki Ito²^na1,
Yoichi Kurumida^3,7,
Sakiya Kawada²,
Hikaru Nakazawa²,
Yutaka Saito^1,3,4,5,7,
Mitsuo Umetsu^2,4 &
…
Koji Tsuda^1,4,6

Scientific Reports volume 14, Article number: 552 (2024) Cite this article

1596 Accesses
11 Altmetric
Metrics details

Subjects

Abstract

In designing functional biological sequences with machine learning, the activity predictor tends to be inaccurate due to shortage of data. Top ranked sequences are thus unlikely to contain effective ones. This paper proposes to take prediction stability into account to provide domain experts with a reasonable list of sequences to choose from. In our approach, multiple prediction models are trained by subsampling the training set and the multi-objective optimization problem, where one objective is the average activity and the other is the standard deviation, is solved. The Pareto front represents a list of sequences with the whole spectrum of activity and stability. Using this method, we designed VHH (Variable domain of Heavy chain of Heavy chain) antibodies based on the dataset obtained from deep mutational screening. To solve multi-objective optimization, we employed our sequence design software MOQA that uses quantum annealing. By applying several selection criteria to 19,778 designed sequences, five sequences were selected for wet-lab validation. One sequence, 16 mutations away from the closest training sequence, was successfully expressed and found to possess desired binding specificity. Our whole spectrum approach provides a balanced way of dealing with the prediction uncertainty, and can possibly be applied to extensive search of functional sequences.

Accurate structure prediction of biomolecular interactions with AlphaFold 3

Article 08 May 2024

Highly accurate protein structure prediction with AlphaFold

Article Open access 15 July 2021

Machine learning designs new GCGR/GLP-1R dual agonists with enhanced biological potency

Article Open access 16 May 2024

Introduction

Machine learning-based protein design strategies can be classified into two categories according to how the training dataset is collected: computational^1,2 and experimental^3,4,5. In the former category, protein structures of interest are predicted by a structure predictor such as AlphaFold2⁶ or RoseTTafold⁷. The designed proteins are computationally evaluated on the basis of closeness to desired topology² or simulated binding affinities to target proteins¹. In the latter category, the dataset is made by synthesizing as many protein variants as possible and evaluated by experimental means.⁴ To this aim, deep mutational screening⁸ is often used in combination with random mutagenesis. However, the number of sequences available for training is far smaller than computational data collection³.

Prediction uncertainty is a central topic in automatic design with small data⁹. The prediction at a point in a sample abundant region is more certain than that in a sample scarce region. Bayesian optimization¹⁰, a popular automatic design algorithm, treats uncertainty as opportunity, because its recommendation scores such as expected improvement increase as uncertainty increases. LaMBO¹¹ and guided diffusion models¹² belong to this type of adventurous approach. In contrast, the methods based on the domain of applicability¹³ (DoA) restrict recommendation within sample abundant regions, regarding uncertainty as risk. Given an automatic design task at hand, it is difficult to choose one from these two approaches, because the outcome depends on the accuracy of predicted scores and their uncertainty estimates.

In our approach, we consider the whole spectrum of adventurous and conservative design strategies and try all of them collectively based on the framework of multi-objective optimization¹⁴ (Fig. 1). First, variations of the training datasets are created by subsampling. A machine learning model is trained by the multiple training datasets, yielding an ensemble of predictors. A multi-objective optimization problem¹⁴ is postulated such that an objective function is the average of all prediction scores and the other is their standard deviation (i.e., prediction instability). By applying a solver¹⁴, one can identify several solutions at the Pareto front, i.e., the set of non-dominated solutions where no objective function can be improved without sacrificing the other at each solution. These solutions range from high average, low stability ones to low average, high stability ones. In biological sequence design problems, the solutions correspond to candidate sequences for wet-lab validation. Since only a few sequences can be validated, domain experts are asked to select some of them. Our approach can produce a diverse list of sequences in comparison to the naïve approach where only one predictor is used and the sequences are ranked by its prediction score. A practical advantage of our approach is that additional objective functions such as developability measures can easily be incorporated to the multi-objective problem.

We apply our approach to discover VHH (Variable domain of Heavy chain of Heavy chain) antibodies against galectin-3, which is a potential therapeutic target for cancer treatment and diagnostic biomarkers for several diseases, including heart failure and cancers¹⁵. 12,737 labeled sequences are obtained by deep mutational screening and five predictors are trained by subsampling the dataset. We applied our multi-objective optimization software MOQA¹⁶ to optimize the following three objectives: (1) the average of the prediction scores, (2) the standard deviation of them, (3) solubility predicted by NetSolP¹⁷. Notice that MOQA is based on quantum annealing¹⁸ and deep learning¹⁹, and was previously applied to antimicrobial peptide design¹⁶. See Supplementary Information about the algorithm of MOQA. NetSolP solubility score was chosen due to high accuracy enabled by protein language models¹⁷. From 19,778 generated sequences, five were chosen based on our quality and novelty criteria. The five were synthesized and evaluated in wet lab experiments, and one sequence, which is distinctly different from the training sequences, was found to possess desired binding affinity and specificity. This result implies that our method called WS-MOQA enables far-reaching search of functional sequences.

Results

Data preparation

To collect the training set, we performed four rounds of phage display biopanning²⁰ to select VHH sequences bound to galectin-3. For the phage display library, the complementarity determining regions of CDR1 (13 aa), CDR2 (10 aa), and CDR3 (16 aa) in VHH (PDB ID: 3DWT) were randomized with degenerate codons designed to mimic an amino acid frequency of antibody CDRs²¹, and the variants were displayed on phages (Fig. 2). The prepared initial phage library (library size: 8.6 × 10⁷) was used to perform biopanning against galectin-3. In the biopanning, the following steps were iterated four times: (1) removal of non-specifically bound phages (N.S. phages), (2) interaction with antigen and elution of target-binding phages (eluted phages), (3) infection of the selected phage into E. coli, and 4) amplification of phages (Fig S1). The N.S. phages and eluted phages in the fourth round, were subjected to deep sequencing analysis with the MiSeq platform. Raw sequences were filtered and trimmed according to their quality, and the forward reads were merged with the corresponding reverse reads. As a result, we obtained 178,883 reads in N.S. phages and 133,179 reads in eluted phages. As the training set, we retained sequences of more than three reads of sequence in either N.S. phages or eluted phages (12,737 sequences). The binding score of each sequence is defined as

$$ Score = \frac{frequency\;in\;eluted\;phages\;at\;the\;4th\;round}{{frequency\;in\;N.S.\;phages\;at\;the\;4th\;round}} $$

Sequences with Score ≥ 5 were considered to bind (i.e., positive class), while sequences with Score < 1 were unbound (i.e., negative class).

Whole spectrum black-box optimization

From the training set, five datasets are created by first splitting the negative sequences into five equal-sized subsets and combining each of them with the positive sequences. Each dataset was used to train bidirectional LSTM using PARROT python package²². This model outputs the probability of the input sequence being in the positive class, which is used as our prediction score. The model had one hidden layer with hidden vector size 10. It was trained for 25 epochs with the learning rate set 0.001. One-hot encoding was used for sequence representation. Our multi-objective optimization problem is concerned about the prediction score average, the standard deviation and solubility predicted by NetSolP¹⁷. 20,000 VHH sequences were generated by MOQA in ten independent runs (each with different random initialization). Figure 3 shows the development of the three objective functions and their Pareto hypervolume²³. It is found that all functions are optimized close to saturation. After removing duplicates, 19,778 sequences remained. See Fig. S2 for tSNE visualization of the training sequences and generated sequences.

For wet-lab validation, we selected the sequences satisfying the following constraints. (1) The sequence length is 39, that is equal to the combined length of CDRs; (2) it does not contain repetitive amino acids of length 5 or more; (3) the average score is above 0.6; (4) solubility scores from CamSol²⁴ are above 0.0; (5) the CDR1, CDR2 and CDR3 sequences differ from the wild-type by at least three residues. As a result, the five sequences shown in Table 1 were selected. Notably, all of them were distinctly different from the training sequences. The Hamming distance from each sequence to its closest training sequence was between 16 and 23.

Table 1 Five VHHs selected for wet-lab validation. The Hamming distance from the closest training sequence is shown as well.

Full size table

Validation of selected VHHs

The expression vectors bearing the gene of the five VHH variants (VHH1832, VHH1834, VHH1835, VHH1836, VHH1837) were prepared and E. coli BL21(DE3) cells were transformed with the vectors. The five variants generated in E. coli were purified by means of immobilized metal ion affinity chromatography (IMAC) and size exclusion chromatography (SEC). The expression levels of the WT, VHH1836, and VHH1837 were 11 mg/L-broth, 6 × 10^–2 mg/L-broth, and 8 × 10^–3 mg/L-broth, respectively. The designed sequences yielded lower expression than WT, but they nevertheless formed monomers (Fig. S3). The fractions of monomers were collected and used to measure the binding affinity to galectin-3 by means of enzyme-linked immunosorbent assay (ELISA). As a result, VHH1836 bound specifically to the plates where galectin-3 was immobilized, but not to the plates without galectin-3 (Fig. 4a). VHH1836 showed concentration-dependent binding, while WT did not bind to the galectin-3 at the same concentration range (800 nM, 400 nM, and 200 nM of VHHs). In addition, VHH1836 showed little binding to other proteins such as streptavidin, lysozyme from chicken egg, bovine serum albumin, and human serum albumin (Fig. 4b). The CD spectrum of VHH1836 shows that the VHH forms a beta-rich secondary structure like immunoglobulin-fold (Fig. S4). The CD spectra of VHH1836 differed largely from those of WT probably due to the structural change of the CDR loop structure.

Discussion

Our methodological contribution is the proposal of including stability as an additional objective function. In our paper, we used MOQA but our idea can be applied to any multi-objective optimization solver, such as CMA-ES ¹⁴, LaMBO¹¹ and guided diffusion models¹². Formulating the stability-activity balancing as multi-objective optimization, one does not need to manually set the balancing constant as done by Tran et al.²⁵. Covering the Pareto front with solutions helps to improve diversity, but it may not be realistic when the number of objective functions is more than three. Our approach is reminiscent of bagging²⁶, where multiple prediction models are created by subsampling the training dataset and the average of prediction scores are used for making decisions for new examples. However, our approach aims to improve black-box optimization, while the prediction accuracy is the primary concern in bagging. We used random splitting to derive the stability, but other sampling methods such as bootstrapping or sampling with replacement may be used.

Our approach, whole spectrum black-box optimization, creates diverse solutions that balance activity and stability. It was successfully applied to the task of finding innovative antibodies that are distinctly different from training sequences. Our results suggest that it has the potential to be applied to a wide range of biological, chemical, and pharmaceutical design problems.

Materials and method

Library construction

The biopanning was conducted basically according to previous report^4,5. CDR1, CDR2, and CDR3 in 3DWT were randomized using degenerate codons reflecting an amino acid frequency of antibody CDRs for training data. M13 phage libraries displaying VHH variants with a size of 8.6 × 10⁷ were prepared. Colony-forming units (5.0 × 10¹¹) from an M13 phage library displaying VHH variants were exposed to magnetic beads (Dynabeads MyOne Streptavidin T1; Thermo Fisher Scientific, MA, USA) for 60 min at room temperature (negative selection in Fig S2) and centrifuged to separate supernatant and magnetic beads. The phages bound to the beads were collected as N.S. phages. For target preparation, 2 µM galectin-3 in PBS was incubated with magnetic beads for 30 min at 4 °C. The supernatant was incubated with galectin-3–immobilized magnetic beads for 60 min at room temperature, and the beads were washed 10 times with PBS with 0.05% Tween-20 for 5 min each wash. Bound phages were eluted with 100 µL of triethylamine and neutralized with 300 µL of 0.5 M Tris-HCl (pH 6.8). Log-phase E. coli TG-1 cells were incubated overnight at 37 °C with 200 µL of the eluted phages in 2× YT agar medium containing 100 µg/mL ampicillin and 1% (w/v) glucose. Cells grown on the plates were used to prepare phage particles for the next round.

Sample preparation and deep sequencing

After fourth round of biopanning, polyclonal plasmid DNAs were extracted with phenol–chloroform from N.S. phages and eluted phages. The extracted DNAs were used for the first polymerase chain reaction (PCR) to amplify VHH library fragments with the primers containing an annealing region for the second PCR primers. The PCR products were purified by using 1.5% agarose gel and a Qiaex II Gel Extraction Kit (20051; Qiagen, Hilden, Germany) and subjected to the second PCR to attach adapter sequences containing TruSeq DNA CD Indexes. The resulting fragments were purified as above, quantified by using a Qbit™ 1× dsDNA HS Assay Kit (Q33231; Thermo Fisher Scientific), and pooled in equal amounts. The quality of the libraries was checked by using an Agilent 2100 Bioanalyzer (G2939B; Agilent Technologies, CA, USA). The prepared sample was sequenced on the MiSeq platform (Illumina, CA, USA) by using a MiSeq Reagent Kit v3 (15043895; Illumina) with 2 × 300 bp paired end reads.

Preparation of proposed VHHs and galectin-3

The gene of VHHs were each inserted into the Nco I–Sac II site of the pRA vector, which included FLAG and poly-histidine tags²⁷. E. coli BL21(DE3) cells were transformed with the constructed expression vectors, grown overnight at 28 °C on LB agar, and then cultured in 2× YT broth; both media contained 100 µg/mL ampicillin. Isopropyl-β-d-thiogalactopyranoside (IPTG) was added to a final concentration of 1 mM at OD₆₀₀ = 0.8, and the cells were shaken at 160 rpm for overnight at 28 °C. IPTG was added to the flask to a final concentration of 1 mM. The cells were shaken at 160 rpm at 20 °C overnight. The cells were harvested by centrifugation, resuspended in phosphate-buffered saline (PBS), and sonicated. Insoluble matter was removed by centrifugation. Variants were purified from the supernatants by IMAC (Ni Sepharose™ 6 Fast Flow; Cytiva, IL, USA) and SEC (HiLoad 26/600 Superdex 75 pg; Cytiva, IL, USA). The procedure of the preparation of galectin-3 was described previously⁴.

Enzyme-linked immunosorbent assay (ELISA)

Fifty microliters of 4 µg/mL NeutrAvidin (Thermo Fisher Scientific) in PBS were incubated in the wells of a 96-well polystyrene enzyme-linked immunosorbent assay microplate (655061; Greiner, Austria) for 60 min, then 150 µL of 3% (w/v) skim milk in PBS was added to the wells, and the plates were incubated for a further 30 min for blocking. After washing each well with PBS three times, 50 µL of 10 µg/mL biotinylated galectin-3 solution was added and the wells were incubated for 30 min. The wells were washed again with PBS, and VHH solution was added and the wells were incubated for 30 min. After washing each well with PBS/0.05% Tween20 three times, the wells were incubated for 40 min at room temperature with horseradish peroxidase-conjugated mouse anti-FLAG monoclonal antibody (1:10,000; A8592, Sigma Aldrich). After washing each well with PBS/0.05% Tween20 three times, 50 µL of 3,3′,5,5′-tetramethylbenzidine solution (1-step Ultra TMB-ELISA Substrate Solution; Thermo Fisher Scientific) was added and the wells were incubated for 10 min at room temperature. After incubation, 50 µL of 2 M H₂SO₄ was added to each well and absorbance at 450 nm was measured with a Synergy H4 Hybrid Multimode Microplate Reader (BioTek Japan, Tokyo, Japan). In the case of measuring target specificity, 50 µL of 4 µg/mL streptavidin, lysozyme, bovine serum albumin, and human serum albumin were incubated in the wells of a 96-well polystyrene ELISA microplate before skim milk blocking and addition of VHHs.

Circular dichroism (CD) spectra

CD spectra were measured with a J-820 CD spectrometer (Jasco, Japan) in a 1.0-mm-long quartz cuvette, as follows: band width 1.0 nm, resolution 0.1 nm, response 8 s, scan speed 2 nm/min. The concentrations of purified VHHs variants were 5 µM.

Data availability

The code and datasets are available at Github repository, https://github.com/tucs7/nanobody_MOQA.

References

Bennett, N. R. et al. Improving de novo protein binder design with deep learning. Nat. Commun. https://doi.org/10.1038/s41467-023-38328-5 (2023).
Article PubMed PubMed Central Google Scholar
Yeh, A.H.-W. et al. De novo design of luciferases using deep learning. Nature 614, 774–780. https://doi.org/10.1038/s41586-023-05696-3 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Johnston, K. E. et al. Machine Learning for Protein Engineering. arXiv:2305.16634. https://ui.adsabs.harvard.edu/abs/2023arXiv230516634J (2023).
Ito, T. et al. Selection of target-binding proteins from the information of weakly enriched phage display libraries by deep sequencing and machine learning. mAbs. https://doi.org/10.1080/19420862.2023.2168470 (2023).
Article PubMed PubMed Central Google Scholar
Ito, T. et al. Combination informatic and experimental approach for selecting scaffold proteins for development as antibody mimetics. Chem. Lett. 50, 1867–1871. https://doi.org/10.1246/cl.210443 (2021).
Article CAS Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. https://doi.org/10.1038/s41586-021-03819-2 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876. https://doi.org/10.1126/science.abj8754 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Fowler, D. M. & Fields, S. Deep mutational scanning: A new style of protein science. Nat. Methods 11, 801–807. https://doi.org/10.1038/nmeth.3027 (2014).
Article CAS PubMed PubMed Central Google Scholar
Terayama, K., Sumita, M., Tamura, R. & Tsuda, K. Black-box optimization for automated discovery. Acc. Chem. Res. 54, 1334–1346. https://doi.org/10.1021/acs.accounts.0c00713 (2021).
Article CAS PubMed Google Scholar
Shahriari, B., Swersky, K., Wang, Z., Adams, R. P. & de Freitas, N. Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 104, 148–175. https://doi.org/10.1109/jproc.2015.2494218 (2016).
Article Google Scholar
Stanton, S. et al. Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders. arXiv:2203.12742. https://ui.adsabs.harvard.edu/abs/2022arXiv220312742S (2022).
Gruver, N. et al. Protein Design with Guided Discrete Diffusion. arXiv:2305.20009. https://ui.adsabs.harvard.edu/abs/2023arXiv230520009G (2023).
Sutton, C. et al. Identifying domains of applicability of machine learning models for materials science. Nat. Commun. https://doi.org/10.1038/s41467-020-17112-9 (2020).
Article PubMed PubMed Central Google Scholar
Miettinen, K. Nonlinear Multiobjective Optimization (Kluwer Academic Publishers, 1999).
Google Scholar
Girard, A. & Magnani, J. L. Clinical trials and applications of galectin antagonists. Trends Glycosci. Glycotechnol. 30, SE211–SE220. https://doi.org/10.4052/tigg.1744.1SE (2018).
Article Google Scholar
Tučs, A. et al. Quantum annealing designs nonhemolytic antimicrobial peptides in a discrete latent space. ACS Med. Chem. Lett. 14, 577–582. https://doi.org/10.1021/acsmedchemlett.2c00487 (2023).
Article CAS PubMed PubMed Central Google Scholar
Thumuluri, V. et al. NetSolP: Predicting protein solubility in Escherichia coli using language models. Bioinformatics 38, 941–946. https://doi.org/10.1093/bioinformatics/btab801 (2022).
Article CAS PubMed Google Scholar
Johnson, M. W. et al. Quantum annealing with manufactured spins. Nature 473, 194–198. https://doi.org/10.1038/nature10012 (2011).
Article ADS CAS PubMed Google Scholar
Baynazarov, R. & Piontkovskaya, I. Artificial Intelligence and Natural Language Communications in Computer and Information Science. 139–150 (2019).
Qi, H., Ma, M., Lai, D. & Tao, S.-C. Phage display: An ideal platform for coupling protein to nucleic acid. Acta Biochim. Biophys. Sin. 53, 389–399. https://doi.org/10.1093/abbs/gmab006 (2021).
Article CAS PubMed Google Scholar
Kruziki, M. A., Bhatnagar, S., Woldring, D. R., Duong, V. T. & Hackel, B. J. A 45-amino-acid scaffold mined from the PDB for high-affinity ligand engineering. Chem. Biol. 22, 946–956. https://doi.org/10.1016/j.chembiol.2015.06.012 (2015).
Article CAS PubMed PubMed Central Google Scholar
Griffith, D. & Holehouse, A. S. PARROT is a flexible recurrent neural network framework for analysis of large protein datasets. eLife https://doi.org/10.7554/eLife.70576 (2021).
Article PubMed PubMed Central Google Scholar
Couckuyt, I., Deschrijver, D. & Dhaene, T. Fast calculation of multiobjective probability of improvement and expected improvement criteria for Pareto optimization. J. Glob. Optim. 60, 575–594. https://doi.org/10.1007/s10898-013-0118-2 (2013).
Article MathSciNet Google Scholar
Sormanni, P., Aprile, F. A. & Vendruscolo, M. The CamSol method of rational design of protein mutants with enhanced solubility. J. Mol. Biol. 427, 478–490. https://doi.org/10.1016/j.jmb.2014.09.026 (2015).
Article CAS PubMed Google Scholar
Tran, D. P. et al. Using molecular dynamics simulations to prioritize and understand AI-generated cell penetrating peptides. Sci. Rep. https://doi.org/10.1038/s41598-021-90245-z (2021).
Article PubMed PubMed Central Google Scholar
Breiman, L. Bagging predictors. Mach. Learn. 24, 123–140. https://doi.org/10.1007/bf00058655 (1996).
Article Google Scholar
Asano, R. et al. Efficient construction of a diabody using a refolding system: Anti-carcinoembryonic antigen recombinant antibody fragment. J. Biochem. 132, 903–909. https://doi.org/10.1093/oxfordjournals.jbchem.a003303 (2002).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work is supported by NEDO project, Development of the Key Technologies for the Next-Generation Artificial Intelligence/Robots, and JST ERATO JPMJER1903.

Author information

These authors contributed equally: Andrejs Tučs and Tomoyuki Ito.

Authors and Affiliations

Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Japan
Andrejs Tučs, Yutaka Saito & Koji Tsuda
Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
Tomoyuki Ito, Sakiya Kawada, Hikaru Nakazawa & Mitsuo Umetsu
Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
Yoichi Kurumida & Yutaka Saito
RIKEN Center for Advanced Intelligence Project, RIKEN, Tokyo, 103-0027, Japan
Yutaka Saito, Mitsuo Umetsu & Koji Tsuda
AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, Japan
Yutaka Saito
Center for Basic Research on Materials, National Institute for Materials Science (NIMS), Tsukuba, Japan
Koji Tsuda
Department of Data Science, School of Frontier Engineering, Kitasato University, Sagamihara, Japan
Yoichi Kurumida & Yutaka Saito

Authors

Andrejs Tučs
View author publications
You can also search for this author in PubMed Google Scholar
Tomoyuki Ito
View author publications
You can also search for this author in PubMed Google Scholar
Yoichi Kurumida
View author publications
You can also search for this author in PubMed Google Scholar
Sakiya Kawada
View author publications
You can also search for this author in PubMed Google Scholar
Hikaru Nakazawa
View author publications
You can also search for this author in PubMed Google Scholar
Yutaka Saito
View author publications
You can also search for this author in PubMed Google Scholar
Mitsuo Umetsu
View author publications
You can also search for this author in PubMed Google Scholar
Koji Tsuda
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.T, K.T. and M.U. conceived the idea and designed the research. A.T. and K.T. developed the algorithm. T.I., S.K., Y.K., Y.S., H.N. and M.U. developed the experimental methodology and performed the wet-lab experiments. A.T., T.I. and K.T. wrote the manuscript.

Corresponding authors

Correspondence to Mitsuo Umetsu or Koji Tsuda.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tučs, A., Ito, T., Kurumida, Y. et al. Extensive antibody search with whole spectrum black-box optimization. Sci Rep 14, 552 (2024). https://doi.org/10.1038/s41598-023-51095-z

Download citation

Received: 29 August 2023
Accepted: 30 December 2023
Published: 04 January 2024
DOI: https://doi.org/10.1038/s41598-023-51095-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.