A high-throughput pipeline for design and selection of peptides targeting the SARS-Cov-2 Spike protein

Wolfe, Monica; Webb, Sean; Chushak, Yaroslav; Krabacher, Rachel; Liu, Yi; Swami, Nathan; Harbaugh, Svetlana; Chávez, Jorge

doi:10.1038/s41598-021-01225-2

Download PDF

Article
Open access
Published: 05 November 2021

A high-throughput pipeline for design and selection of peptides targeting the SARS-Cov-2 Spike protein

Scientific Reports volume 11, Article number: 21768 (2021) Cite this article

3308 Accesses
13 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Rapid design, screening, and characterization of biorecognition elements (BREs) is essential for the development of diagnostic tests and antiviral therapeutics needed to combat the spread of viruses such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). To address this need, we developed a high-throughput pipeline combining in silico design of a peptide library specific for SARS-CoV-2 spike (S) protein and microarray screening to identify binding sequences. Our optimized microarray platform allowed the simultaneous screening of ~ 2.5 k peptides and rapid identification of binding sequences resulting in selection of four peptides with nanomolar affinity to the SARS-CoV-2 S protein. Finally, we demonstrated the successful integration of one of the top peptides into an electrochemical sensor with a clinically relevant limit of detection for S protein in spiked saliva. Our results demonstrate the utility of this novel pipeline for the selection of peptide BREs in response to the SARS-CoV-2 pandemic, and the broader application of such a platform in response to future viral threats.

SARS-CoV-2 detection using a nanobody-functionalized voltammetric device

Article Open access 23 May 2022

Systematic evaluation of IgG responses to SARS-CoV-2 spike protein-derived peptides for monitoring COVID-19 patients

Article Open access 22 January 2021

Multiplexed detection of viral antigen and RNA using nanopore sensing and encoded molecular probes

Article Open access 14 November 2023

Introduction

Since its discovery in late 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has grown into a worldwide pandemic, due largely in part to a high rate of transmission (with R₀ estimates as high as 5.7¹) and asymptomatic presentation (current estimates report 17% of cases are asymptomatic^2,3). Delays in the development of reliable diagnostic tests, especially for point-of-care detection of acute infection, have made controlling the spread of the virus difficult, further contributing to the severity of the pandemic. Currently, molecular assays such as reverse transcription polymerase chain reaction (RT-PCR) are the gold-standard for identification of acute stage infection, however these techniques require specialized laboratory instruments and lengthy experimental procedures with slow turn-around time⁴. Antigen tests are able to yield results quicker than RT-PCR tests, however, issues with sensitivity limit their utility in the asymptomatic population⁵.

SARS-CoV-2 is the third coronavirus (CoV) in the past two decades to cause severe illness in humans. Consistent with other CoVs, the virus consists of a single positive strand RNA genome that encodes four structural proteins: spike (S), envelope (E), matrix (M), and nucleocapsid (N). Of these, the S protein, which consists of the S1 and S2 subunits, has been identified as the key structural protein responsible for initiating host cell attachment and membrane fusion^6,7. SARS-CoV-2 entry into host cells is facilitated by the S protein’s receptor binding domain (RBD) located on the S1 subunit, which binds to the angiotensin-converting enzyme 2 (ACE2) receptor located on epithelial cells lining the respiratory tract⁸.

Viral surface proteins, such as the S protein, are promising targets for both virus detection and neutralization. Early in the pandemic, the crystal structure of ACE2 and SARS-CoV-2 RBD was solved (PDB code: 6M17⁶) and used to identify a region within the α1 helix of the ACE2 protein directly involved in binding to the S protein RBD^9,10,11. Molecular dynamics simulations demonstrated that a short 23-mer peptide fragment from ACE2 (named SBP1) could bind to the SARS-CoV-2 RBD with micromolar affinity (K_D = 1.3 µM)¹². Since then, numerous studies have confirmed that neutralizing antibodies and nanobodies from Covid-19 patients also bind primarily to the RBD^13,14, solidifying the spike protein as a target for both detection and therapeutic efforts.

Peptides present an interesting opportunity in the development of new point-of-care diagnostics against SARS-CoV-2. Peptides have been used previously for the detection of viral surface proteins and their low production cost, superior stability, high target affinity, small size, and ease of modification make them attractive in the anti-viral space¹⁵. The availability of mature in silico tools to model three-dimensional protein interactions provides a platform to rapidly design and optimize potential binding sequences for a specific target protein. These tools have been used to identify miniproteins capable of blocking the interaction between SARS-CoV-2 RBD and ACE2¹⁶. Computational approaches to peptide design have also been employed to design peptide binders with higher predicted SARS-CoV-2 RBD affinity than the ACE2-derived SBP1^17,18, however a lack of simple and high-throughput experimental platforms for testing these peptides has, for the most part, prevented their validation in vitro. One approach taken to bridge this gap has been the application of a combinatorial affinity selection—mass spectrometry platform to screen and identify peptide binders with nanomolar affinity to SARS-CoV-2 RBD¹⁹. Peptide microarrays offer an alternative approach for rapid screening of large peptide libraries. Custom arrays are available from commercial vendors and can be imaged on a variety of commonly available scanners without the need for specialized training or instrumentation. In response to the SARS-CoV-2 outbreak, microarray technology has been leveraged extensively. The ability to print the entire viral proteome as short overlapping peptide fragments has enabled researchers to build linear epitope landscapes of the SARS-CoV-2 proteins, rapidly identify immunoreactive epitopes, and isolate virus-specific antibodies from patient sera^{20,21,22,23,24}. Though antibody cross-reactivity between SARS-CoV-2 and other human coronaviruses is common, several unique immunodominant epitopes of the SARC-CoV-2 spike and nucleocapsid proteins have been identified and may serve as capture antigens for future COVID-19 diagnostic antibody tests^21,22. Albeit less common, the same microarray technology can also be adapted to screening large de novo peptide libraries for binding to protein targets. To our knowledge, the combination of computational peptide design and microarray screening has not been previously reported for viral protein targets.

Here we demonstrate the use of a rapid and high-throughput platform to experimentally screen for binding of in silico designed peptides to SARS-CoV-2 S protein using commercially available microarray technology. Identified binders were further validated with traditional ELISA and biolayer interferometry (BLI) methods and found to have nanomolar affinity for the S protein, thus confirming the utility of computational peptide-protein design and the ability of the microarray platform to identify efficient binders. Additionally, we integrated the top peptides into an electrochemical based sensor to demonstrate their real-world applicability in saliva-based antigen tests.

Results and discussion

Computational design of SARS-CoV-2 S protein binding peptides

Computational design of peptides requires the generation of tertiary structures in order to predict binding to a target of interest. Analysis of the neutralizing sites on the SARS-CoV-2 S protein indicated that most monoclonal antibodies (mAbs) targeted the receptor binding domain (RBD) of the S protein^25,26, while only one targeted the N-terminal domain (NTD)²⁷. Cryo-electron microscopy structure analysis of the S protein trimer revealed that the RBD can be in an open or closed state²⁸. Based on structural information, Barnes et al.²⁶ identified four different classes of RBD-targeting antibodies depending on whether they bound to the ACE2 binding domain, and whether the S1 RBD was in the closed or open state. Therefore, in the computational design of binding peptides we modeled three different structures of spike protein: S protein trimer in the open form (PDB ID: 6VYB), S protein trimer in the closed form (PDB ID: 6VXX), and RBD of the S protein (PDB ID: 6LZG).

Two approaches were used to generate a library of peptides. In the first approach, we used a fragment from the N-terminal α1 helix of the human ACE2 receptor, shown to be directly involved with binding to the RBD of the S protein^9,12. Ten 18-mer peptide sequences spanning the length of the original ACE2 fragment were generated and their tertiary structures were modeled from the crystal structure of the RBD/hACE2 complex (PDB ID: 6LZG). These peptides were subsequently docked to the RBD of the spike protein to generate protein-peptide complexes for sequence optimization.

In the second approach, we designed a set of peptides from a pool of random sequences. Since short peptides are very flexible and typically lack a distinct conformation in their unbound state, an ensemble of 20 structures was generated for every peptide sequence, as described in the Methods section. These peptides were docked to the S1 subunit of the S protein trimer in its open and closed forms, as well as to the monomer RBD structure, with no constraints on the location of the binding site. Analysis of docking complexes revealed several hot spots for peptide binding on both the NTD and RBD. About 300 protein-peptide complexes (with peptides bound to each the NTD and RBD) were selected for sequence optimization.

Sequence optimization was performed as an iterative process where mutations were introduced into every peptide sequence and kept if they resulted in an improved binding score. The most important, but also most difficult, part of the proposed approach was to correctly score and rank the protein-peptide complexes based on their binding affinity. Current scoring functions use different simplifications to enable search through both the sequence and conformational space. To account for these differences, we used the consensus score of two different scoring functions (ddg provided by Rosetta and ZRANK²⁹ developed at Boston University) to rank sequences based on predicted binding affinity to the S protein. Using the consensus scores generated by computational analysis, we selected a library of 2,376 unique peptide sequences for screening. This library consisted of 10 wild-type ACE2 variants (as shown in Fig. 2a), 800 ACE2-optimized sequences as described in approach 1, and 1566 S protein binding sequences as described in approach 2.

Microarray screening of designed peptides

One critical capability missing from current efforts to design binding sequences against SARS-CoV-2 is the ability to test candidates in a simple and high-throughput format. Here, we applied a fast and simple microarray-based screening pipeline to select S protein binding peptides from our in silico designed library. All reagents used in this pipeline were commercially available and required no special modifications or equipment, thereby allowing for easy adoption in other laboratories. Since the S protein trimer was not commercially available at the time of screening, we screened for binding to the SARS-CoV-2 S1 subunit, which contains the receptor binding domain (RBD) and N-terminal domain (NTD)—two of the binding hot-spots identified during computer-based docking studies.

The top-ranking peptide sequences identified from in silico design were printed on a custom peptide microarray with side-by-side duplicates. The library of 2,376 sequences fit on a 1 × 2 design where two copies of the array were printed onto a single slide. Each subarray was exposed to biotinylated SARS-CoV-2 S1 protein at a single concentration between 2 and 50 µg/mL (or buffer-only control) and binding sequences were identified following incubation with streptavidin conjugated fluorescent dye, as shown in Fig. 1a.

One advantage of this system is that processing of multiple arrays can easily be completed in a single day, allowing for rapid screening of the entire library under multiple experimental conditions. Representative images of the array are shown in Fig. 1b following exposure to 50, 10, and 5 µg/mL of S1 protein. In addition to the control spots, visible as bright and dark spots around the perimeter, a large number of apparent binders are visible on the left third of the array. These spots correspond to the portion of the library (P2-P811) designed from the ACE2 receptor fragment, as described in approach 1. Not surprisingly, the design of peptides from a known binding sequence resulted in numerous “hits” with a range of binding affinities, as observed by the variation in fluorescent intensity from this region. Following data normalization, these sequences also had the highest proportion of binders with 115 of 800 sequences showing a Z-score greater than 1.95 (Fig. 1c) at the highest S1 protein concentration tested.

Perhaps most interesting, however, is that the ten wild-type ACE2 variants (P2-P11), used as the starting sequences for optimization in approach 1 (Fig. 2a), showed very little binding to S1 protein. Despite the low signals, there was a trend for increased binding from peptides overlapping the central region of the original ACE2 sequence. In fact, P7, which contained 9 of the 11 binding residues present in the original fragment (Fig. 2b), showed the highest binding to S1 protein, although it never achieved a normalized binding signal greater than 1.3. The increase in binding signal obtained from sequences derived from the wild-type ACE2 variants highlights the importance of the sequence optimization process for improving target binding affinity.

Additionally, a handful of binding sequences from the pool of random sequences were observed to have high affinity to the S protein (P812-P2377, right two-thirds of array in Fig. 1b). Importantly, these sequences were obtained de novo with no inputs from a priori binding characterization. After data normalization, a total of 5 sequences were identified with a Z-score greater than 1.95 at the highest S1 protein concentration tested. Taking into consideration normalized binding signal across all S1 protein concentrations tested, we selected 14 peptides for further characterization: 10 from the pool of ACE2 mutants, 3 from the pool of S1 modeled sequences, and one non-binding sequence (P481) as a negative control for comparison. A heatmap of Z-scores for each of the selected peptides across all S1 protein concentrations tested is shown in Fig. 2c with corresponding sequences in Fig. 2d. All sequences selected had a normalized binding signal greater than 2 on the 50 µg/mL S1 protein array, except for P481 (the non-binding sequence). Importantly, not including time for array production, the entire screening process could be completed in less than one week, showing that potential binding sequences can be identified rapidly in response to emerging targets.

Peptide binding characterization

To confirm and further characterize the binding observed on the microarray, we turned our attention to more traditional methods. We started by setting up an ELISA-like assay to measure binding of biotinylated peptides to immobilized SARS-CoV-2 S1 protein, as described in the Methods section. We validated this assay with a 23-mer control sequence (SBP1) from the N-terminal alpha helix of the ACE2 receptor, which had been shown to bind the SARS-CoV-2 S protein RBD with micromolar affinity⁷. In our ELISA-like assay, biotinylated SBP1 had a calculated K_D of 2.2 µM to the SARS-CoV-2 S1 protein while a 12-mer truncated version of the peptide (SBP2) showed no measurable binding (Fig. 2e). Using the same assay format, we also screened our selected peptide sequences for binding to S1 protein. Of the 14 sequences selected from the microarray, 13 were able to be synthesized with an N-terminus biotin attached via a PEG4 spacer (Fig. 2d). Testing of the peptides in the assay identified four lead candidates with stronger binding to the S1 protein compared to the benchmark peptide, SBP1 (Fig. 2e). As expected, P481 showed no measurable binding signal to S1 protein. Notably, all four of the lead peptide binders were from the pool of ACE2 derived sequences.

Next, we used bio-layer interferometry (BLI) to further characterize the binding interactions between our top four peptide candidates and the S1 protein. This assay allows for the measurement of kinetic parameters and the immobilization strategy is more relevant to our endpoint sensing platform (described below). Here, biotinylated peptides were immobilized onto streptavidin coated biosensor tips and dipped into wells containing free S1 protein in solution. The presence of the biotin tag on the N-terminus of the peptide sequence forces the peptide into a specific orientation when immobilized, which may limit accessibility to the target protein, but is required for use as a diagnostic. The association and dissociation curves from serial dilutions of the S1 protein are shown for each of the peptides in Fig. 3a–d. After global 1:1 curve fitting, the dissociation constants (K_D) for all peptides were determined to be between 100 and 250 nM (Fig. 3e), with P89 having the highest affinity (K_D = 124 nM). While the top 4 peptides identified in this study were selected for binding to SARS-CoV-2 S protein, preliminary selectivity screening against other viral proteins indicated that some cross-reactivity may be present and needs to be further evaluated (Supplemental Fig. S1).

We also performed computer modeling to identify the potential binding sites of these peptides on the SARS-COV-2 S protein. The 3D structures of the peptide sequences were generated using the Rosetta package and peptide docking to the RBD was performed using the ZDOCK, as described in the Methods section. As shown in Fig. 3f, the predicted binding site for peptides P89 and P100 encompasses residues Y449–Q493 and overlaps with the ACE2-binding site³². Potential binding sites for the two other peptides are located outside of the ACE2-binding region. Peptide P180 binds to the residues Y380–F392 of the S protein, which was identified as the binding site for the S304 and CR3022 antibodies³¹, while P168 binds to residues E340–K356 of the RBD, consistent with the binding site of the S309 antibody.

Electrochemical impedance spectroscopy (EIS) for detection of SARS-CoV-2 S protein

To demonstrate the application of these peptides as the sensing element for the detection of SARS-CoV-2 S protein, we integrated them into an electrochemical-based sensor to detect S protein from spiked saliva. The peptides were immobilized on self-assembled monolayer modified gold electrodes and alterations in the charge transfer resistance (ΔRct) were used to quantify the binding signal by electrochemical impedance spectroscopy (EIS). Results for peptide P180 are shown in Fig. 4a (detailed results in Supplemental Fig. S2). EIS results were obtained in phosphate buffer solution (0.1 M, pH 7.2) containing 10 mM K₃[Fe(CN)₆]/K₄[Fe(CN)₆], with the frequency scanned from 100 kHz to 0.1 Hz at the potential of the redox probe (0.135 V vs. Ag/AgCl). Charge transfer resistance (Rct) values were obtained by fitting the EIS spectra using Randell circuit model provided by Solartron. The plot in Fig. 4b of the signal (ΔRct) for varying S protein levels (0.05–10 µg/mL) that are present in a background of 10 µg/mL Ricin as the non-specific control protein shows a steady rise in signal due to specific binding to the S protein that leads to progressive blockage of the electron transfer. To show the potential of this sensor platform to function in saliva matrices, the S-protein was spiked at varying levels into 50% saliva. The plot in Fig. 4c shows a linear rise in the signal (ΔRct) as a function of S protein concentration in saliva. We assess the limit of detection for this sensor as ~ 0.1 µg/mL in phosphate-buffered saline (PBS) and ~ 0.2 µg/mL in saliva matrices, based on signals that are at least three standard deviations above a negative control protein (10 µg/mL Ricin in PBS, Fig. 4b) and no target in saliva (Fig. 4c). We are currently developing alternate sensor paradigms based on nanoporous gold electrodes³³ and redox signal amplification to improve sensitivity³⁴, however, the current limit of detection would be relevant for detection of SARS-CoV-2 in clinical samples^35,36.

Conclusions

The ability to rapidly develop diagnostic and therapeutic agents to address emergent viral threats would provide the means to monitor and prevent the spread of a disease, protecting the most vulnerable population. In this work, we developed a rapid pipeline to design peptide sequences with the potential to bind the S protein using computational methods and a simple and fast screening method based on peptide microarray technology. The screening and data analysis to identify peptide binder candidates takes less than a week to be executed and is based on commercially available instrumentation and materials. Finally, we demonstrated that the microarray platform, which mimics most sensor platforms in which a bio-recognition element is anchored to a surface, is capable of identifying peptide sequences that can be rapidly integrated in an electrochemical sensor with a clinically relevant limit of detection. We believe the main advantage of this approach is the general applicability to other targets of interest. Depending on the emerging threat landscape, we believe that SARS-CoV-2 variants or even completely unrelated viruses could be addressed. Furthermore, our lab is currently working to expand the pipeline for selection of peptide BREs for a wider variety of targets of interest to the Department of Defense. We acknowledge there are still some limitations that need to be addressed. Mainly (i) the time required for computational design, (ii) the need for a crystal structure for the target protein, (iii) the limited length of peptides that can be synthesized on the array (18-mers), and (iv) the time to receive the arrays after ordering (4–6 weeks). As structure prediction models continue to evolve, we expect the computational component of this research to become much faster and less reliant on the availability of crystal structures. The knowledge of existing binding epitopes on target proteins greatly increases the chance of success during the computational design and sequence optimization stage, resulting in the identification of efficient binders in a single round of screening. We acknowledge that current peptide microarray technology limits the ability to test longer sequences, which may provide more complex structures with better binding affinities. Shorter peptides have been shown capable of achieving high affinity binding but may require multiple rounds of optimization, thereby extending the time required for candidate selection due to lengthy array printing time. We believe that further advances in microarray technologies will overcome these issues providing a more versatile option for peptide design and testing to address new viral threats in a timely fashion.

Materials and methods

Computer modeling

Two methods were explored to generate a peptide library for the SARS-CoV-2 S1 protein. In the first we used a 27-mer peptide fragment derived from the N-terminal alpha-helix of the human ACE2 receptor, shown to be in direct contact with the SARS-CoV-2 S protein RBD during binding of viral particles. The sequence of this fragment was split into ten overlapping 18-mer peptide sequences and tertiary structures were predicted from the crystal structure of the RBD/hACE2 complex (PDB ID: 6LZG). These peptides were subsequently docked to the S protein and used as the initial set of protein-peptide complexes for sequence optimization.

In the second approach, we began with a random 18-mer peptide sequence. An ensemble of 3D structures was generated for every sequence to account for their flexibility. Initially, 1000 s of tertiary structure models for the target sequence were created using the AbinitioRelax protocol from the Rosetta package³⁷. The models were clustered and structures from five clusters with the lowest energy were selected as input structures for the replica exchange Molecular Dynamics (REMD) simulations²⁹. The REMD simulations were performed with the Amber16 suite of biomolecular simulation programs^20,38. Eight replicas distributed over a temperature range from 270 to 600 K were used. The implicit water simulations for each replica were initially equilibrated for 200 ps at the corresponding temperature followed by 5 ns of production run. The time step was set to 2 fs and SHAKE was applied to constrain the bonds connecting hydrogen atoms. The temperature exchanges were attempted every 1 ps and 500 snapshots from the production run were used for cluster analysis. Clustering was performed with the Cpptraj module based on the pairwise backbone-atom only root-mean square deviations (RMSD). Saved snapshot conformations were clustered into ten clusters and a representative from four clusters with the lowest energy were taken for the docking. A total of 20 structures for every peptide sequence were used in the docking calculations.

The rigid docking of the initial peptides to the spike protein was performed using the ZDOCK package developed at Boston University³⁹. Both protein and peptides were considered as rigid and a global search of the rotational and translational space was performed without any constraint on the location of the binding sites. For every peptide structure, the program generated and clustered 2000 docking conformations from which the top 4 protein-peptide complexes were selected for sequence optimization with the Rosetta package.

Peptide sequence optimization used the Monte Carlo-based RosettaScripts protocol that included a random single base mutation, side chain rotamer repacking, and backbone and side chain torsion minimization. The scoring metrics were calculated using Rosetta’s scoring functions: binding energy of the complex ddg and interface buried solvent accessible surface area sasa. The mutation was accepted if the scoring metrics of the mutated sequence were better than the starting sequence. For every protein-peptide complex input, the program screened 10,000 peptide sequence mutants in order to generate the top 10 protein-peptide complexes with optimized binding.

Two different scoring functions were used to rank the generated complexes: ddg scoring function provided by the Rosetta package and ZRANK scoring function²⁹ developed at Boston University. The designed protein-peptide complexes were ranked using consensus score and peptide sequences with the highest scores were selected and processed for microarray screening.

Microarray screening

The library of 2,376 sequences was printed onto a custom PEPperCHIP peptide microarray (PEPperPrint GmbH, Heidelberg, Germany). 18-mers were printed in numerical order with side-by-side duplicates. Positive (hemagglutinin (HA)) and negative (G residue) control sequences were printed around the perimeter of the array for quality control assessment and grid alignment. Two copies of each array were printed on a single microarray slide. To screen for protein binding, each array was blocked (Rockland Blocking Buffer) for 1 h and then incubated with biotinylated SARS-CoV-2 S1 protein, (SARS-CoV-2 S1, His, Avitag #S1N-C82E8, Acro Biosystems, Newark, DE) in binding buffer (1 × PBS + 0.05% tween20 + 0.5% BSA) for 1 h. Between each step the array was washed 3 times for 1 min each in wash buffer (1 × PBS + 0.05% tween20) to remove any non-specific protein binding. Binding of biotinylated protein was detected by incubating the array with streptavidin conjugated to Cy5 dye (SA-Cy5, 1:20 dilution) while control spots were detected with an anti-HA antibody conjugated to Cy5 dye (anti-HA-Cy5, 1:2000 dilution, PEPperPRINT). Both were added in binding buffer for 30 min. The array was scanned on an Agilent SureScan microarray scanner in the red channel (640 nm) with 3 µm resolution and 20-bit dynamic range. Tiff image files were loaded into image analysis software (Mapix v8.5 by Innopsys), overlaid with grid file (.gal) to designate spot location, and median fluorescence intensity for each spot was extracted and averaged across replicates. Spots with visual artifacts (scratches, dust, etc.) were excluded from analysis. In order to compare binding intensities across different arrays and account for experimental variations in signal, sequence intensities were converted into Z-scores, using Eq. (1) shown below^20,38:

$$Z{-}score=({intensity}_{P}-{mean\,intensity}_{{P}_{1}\ldots {P}_{n}})/{SD}_{{P}_{1}\ldots {P}_{n}}$$

(1)

where P is any peptide sequence on the array and P1…Pn represents the aggregate measure of all peptides on the array.

Peptide synthesis

Selected peptides were synthesized by a commercial vendor (Peptide2.0) with an N-terminal biotin attached by a PEG4 spacer and HPLC purified to achieve > 98% reported purity. Peptides were resuspended in 100% DMSO and diluted into PBS to achieve a stock concentration of 1 mM. Sonication and gentle warming were used to improve solubility in the event the peptides precipitated during dilution.

Peptide binding validation with ELISA

An in-house ELISA-like assay was developed to measure binding of biotinylated peptides to immobilized SARS-CoV-2 S1 protein (His-tag, #S1N-C52H3, Acro Biosystems). Briefly, proteins diluted in 1 × PBS were adsorbed (0.2 µg per well) to a 96-well plate overnight at 4 °C. The following day, the plate was blocked (Blocking Buffer, Rockland) for 1 h at RT. The plate was washed 3 times with wash buffer in between each step. Biotinylated peptides were diluted in binding buffer and incubated on the plate, in duplicate, at RT for 1 h. Binding was detected with streptavidin-HRP (1:200 in binding buffer) followed by TMB substrate for colorimetric development. Signal development was stopped with the addition of 2 M sulfuric acid and absorbance values were read on a SpectraMax Paradigm plate reader (450 nm, Molecular Devices). Raw absorbance values were normalized to MIN and MAX wells on each plate. Binding curves were graphed in GraphPad Prism (v8.4) and binding constants (K_D) estimated with non-linear curve fitting (one-site specific binding with hill slope). Data points represent the mean and standard deviation of replicate wells.

Binding characterization using bio layer interferometry (BLI)

A ForteBio Octet RED96e Bio-Layer Interferometry system (ForteBio, CA) was used to characterize peptide-protein binding kinetics for each of the top four SARS-CoV-2 S protein binding peptides. All reactions were completed at 30 °C and 1000 rpm. Biotinylated peptides were immobilized onto pre-hydrated streptavidin (SA) biosensor tips (Forte Bio). Peptide loading time and concentration was optimized to obtain optimal loading density (as previously described³⁹). Tips loaded with peptide were baselined in an appropriate buffer (matched to protein solution), and then dipped into wells containing SARS-CoV-2 S1 protein (His-tag, #S1N-C52H3, Acro Biosystems) serially diluted in binding buffer to obtain the association constant (K_a). Double referencing was completed by subtracting off background signal from a peptide only sensor (no protein) and reference sensors (no peptide) to account for non-specific interactions of protein. After association, the tips were dipped back into the baseline well to obtain the dissociation constant (K_dis). Curve fitting was completed with ForteBio Biosystems using a global fitting algorithm and 1:1 binding model to obtain dissociation constants (K_D).

Disclaimer

The views and opinions presented herein are those of the authors and do not necessarily represent the views of the US Department of Defense or its Components.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Sanche, S. et al. High contagiousness and rapid spread of severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. 26, 1470–1477 (2020).
Article CAS Google Scholar
Mizumoto, K., Kagaya, K., Zarebski, A. & Chowell, G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. Eurosurveillance 25, 2000180 (2020).
Article Google Scholar
Byambasuren, O. et al. Estimating the extent of asymptomatic COVID-19 and its potential for community transmission: Systematic review and meta-analysis. Off. J. Assoc. Med. Microbiol. Infect. Dis. 5, 223–234 (2020).
Google Scholar
Carter, L. J. et al. Assay techniques and test development for COVID-19 diagnosis. ACS Cent. Sci. 6, 591–605 (2020).
Article CAS Google Scholar
Nagura-Ikeda, M. et al. Clinical evaluation of self-collected saliva by quantitative reverse transcription-PCR (RT-qPCR), Direct RT-qPCR, reverse transcription-loop-mediated isothermal amplification, and a rapid antigen test to diagnose COVID-19. J. Clin. Microbiol. 58, e01438 (2020).
Article CAS Google Scholar
Yan, R. et al. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science 367, 1444–1448 (2020).
Article ADS CAS Google Scholar
Wang, Q. et al. Structural and functional basis of SARS-CoV-2 entry by using human ACE2. Cell 181, 894-904.e9 (2020).
Article CAS Google Scholar
Ziegler, C. G. K. et al. SARS-CoV-2 receptor ACE2 is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues. Cell 181, 1016-1035.e19 (2020).
Article CAS Google Scholar
Han, Y. & Král, P. Computational design of ACE2-based peptide inhibitors of SARS-CoV-2. ACS Nano 14, 5143–5147 (2020).
Article CAS Google Scholar
Renzi, F. & Ghersi, D. ACE2 fragment as a decoy for novel SARS-Cov-2 virus. bioRxiv https://doi.org/10.1101/2020.04.06.028647 (2020).
Article PubMed PubMed Central Google Scholar
Alagumuthu, M., Rajpoot, S. & Baig, M. S. Structure-based design of novel peptidomimetics targeting the SARS-CoV-2 spike protein. Cell. Mol. Bioeng. 14, 177–185. https://doi.org/10.1007/s12195-020-00658-5 (2020).
Article CAS PubMed Central Google Scholar
Zhang, G. et al. Investigation of ACE2 N-terminal fragments binding to SARS-CoV-2 Spike RBD. bioRxiv https://doi.org/10.1101/2020.03.19.999318 (2020).
Article PubMed PubMed Central Google Scholar
Valdes-Balbin, Y. et al. Molecular aspects concerning the use of the SARS-CoV-2 receptor binding domain as a target for preventive vaccines. ACS Cent. Sci. https://doi.org/10.1021/acscentsci.1c00216 (2021).
Article PubMed PubMed Central Google Scholar
Wu, Y. et al. Identification of human single-domain antibodies against SARS-CoV-2. Cell Host Microbe 27, 891-898.e5 (2020).
Article CAS Google Scholar
Al-Azzam, S. et al. Peptides to combat viral infectious diseases. Peptides 134, 170402 (2020).
Article CAS Google Scholar
Cao, L. et al. De novo design of picomolar SARS-CoV-2 miniprotein inhibitors. Science 370, 426–431 (2020).
Article ADS CAS Google Scholar
Sitthiyotha, T. & Chunsrivirot, S. Computational design of 25-mer peptide binders of SARS-CoV-2. J. Phys. Chem. B 124, 10930–10942 (2020).
Article CAS Google Scholar
Chowdhury, S. M. et al. Antiviral Peptides as promising therapeutics against SARS-CoV-2. J. Phys. Chem. B 124, 9785–9792 (2020).
Article CAS Google Scholar
Pomplun, S. et al. De novo discovery of high-affinity peptide binders for the SARS-CoV-2 spike protein. ACS Cent. Sci. 7, 156–163 (2021).
Article CAS Google Scholar
Wang, H. et al. SARS-CoV-2 proteome microarray for mapping COVID-19 antibody interactions at amino acid resolution. ACS Cent. Sci. 6, 2238–2249 (2020).
Article CAS Google Scholar
Mishra, N. et al. Immunoreactive peptide maps of SARS-CoV-2. Commun. Biol. 4, 1–7 (2021).
Google Scholar
Holenya, P. et al. Peptide microarray-based analysis of antibody responses to SARS-CoV-2 identifies unique epitopes with potential for diagnostic test development. Eur. J. Immunol. 51, 1839–1849 (2021).
Article CAS Google Scholar
Yang Li, A. et al. Linear epitope landscape of the SARS-CoV-2 Spike protein constructed from 1,051 COVID-19 patients. Cell Rep. 34, 108915 (2021).
Article Google Scholar
Musicò, A. et al. SARS-CoV-2 epitope mapping on microarrays highlights strong immune-response to N protein region. Vaccines 9, 35 (2021).
Article Google Scholar
Piccoli, L. et al. Mapping neutralizing and immunodominant sites on the SARS-CoV-2 spike receptor-binding domain by structure-guided high-resolution serology. Cell 183, 1024-1042.e21 (2020).
Article CAS Google Scholar
Barnes, C. O. et al. SARS-CoV-2 neutralizing antibody structures inform therapeutic strategies. Nature 588, 682–687 (2020).
Article ADS CAS Google Scholar
Chi, X. et al. A neutralizing human antibody binds to the N-terminal domain of the Spike protein of SARS-CoV-2. Science 369, 650–655 (2020).
Article ADS CAS Google Scholar
Walls, A. C. et al. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell 181, 281-292.e6 (2020).
Article CAS Google Scholar
Pierce, B. & Weng, Z. ZRANK: Reranking protein docking predictions with an optimized energy function. Proteins 67, 1078–1086 (2007).
Article CAS Google Scholar
Yuan, M. et al. A highly conserved cryptic epitope in the receptor binding domains of SARS-CoV-2 and SARS-CoV. Science 368, 630–633 (2020).
Article ADS CAS Google Scholar
Sugita, Y. & Okamoto, Y. Replica-exchange molecular dynamics method for protein folding. Chem. Phys. Lett. 314, 141–151 (1999).
Article ADS CAS Google Scholar
Bradley, P., Misura, K. M. S. & Baker, D. Biochemistry: Toward high-resolution de novo structure prediction for small proteins. Science 309, 1868–1871 (2005).
Article ADS CAS Google Scholar
Liu, Y. et al. Minimum bactericidal concentration of ciprofloxacin to Pseudomonas aeruginosa determined rapidly based on pyocyanin secretion. Sens. Actuators B 312, 127936 (2020).
Article CAS Google Scholar
Liu, Y. et al. Electrofabricated biomaterial-based capacitor on nanoporous gold for enhanced redox amplification. Electrochim. Acta 318, 828–836 (2019).
Article CAS Google Scholar
Lee, J. H. et al. A novel rapid detection for SARS-CoV-2 spike 1 antigens using human angiotensin converting enzyme 2 (ACE2). Biosens. Bioelectron. 171, 112715 (2021).
Article CAS Google Scholar
Wölfel, R. et al. Virological assessment of hospitalized patients with COVID-2019. Nature 581, 465–469 (2020).
Article ADS Google Scholar
Chen, R. & Weng, Z. Docking unbound proteins using shape complementarity, desolvation, and electrostatics. Proteins 47, 281–294 (2002).
Article CAS Google Scholar
Cheadle, C., Vawter, M. P., Freed, W. J. & Becker, K. G. Analysis of microarray data using Z score transformation. J. Mol. Diagn. 5, 73–81 (2003).
Article CAS Google Scholar
Lou, X., Egli, M. & Yang, X. Determining functional aptamer-protein interaction by biolayer interferometry. Curr. Protoc. Nucleic Acis Res. 67, 1–25. https://doi.org/10.1002/cpnc.18 (2016).
Article Google Scholar

Download references

Acknowledgements

The authors thank the AFRL 711th Human Performance Wing for supporting this research. Funding was provided by the Coronavirus Aid, Relief, and Economic Security (CARES) Act and by the AFOSR grant FA2386-18-1-4100.

Author information

Rachel Krabacher
Present address: Materials & Manufacturing Directorate, Air Force Research Laboratory, Wright-Patterson Air Force Base, Dayton, OH, 45433, USA

Authors and Affiliations

711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson Air Force Base, Dayton, OH, 45433, USA
Monica Wolfe, Sean Webb, Yaroslav Chushak, Rachel Krabacher, Svetlana Harbaugh & Jorge Chávez
UES, Inc., Dayton, OH, 45432, USA
Monica Wolfe & Sean Webb
Henry M. Jackson Foundation, Dayton, OH, 45433, USA
Yaroslav Chushak
Department of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
Yi Liu & Nathan Swami

Authors

Monica Wolfe
View author publications
You can also search for this author in PubMed Google Scholar
Sean Webb
View author publications
You can also search for this author in PubMed Google Scholar
Yaroslav Chushak
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Krabacher
View author publications
You can also search for this author in PubMed Google Scholar
Yi Liu
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Swami
View author publications
You can also search for this author in PubMed Google Scholar
Svetlana Harbaugh
View author publications
You can also search for this author in PubMed Google Scholar
Jorge Chávez
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.C. designed and carried out computational modeling experiments. R.K. developed initial microarray screening protocols. M.W. and S.W. designed and carried out the BRE screening and binding characterization experiments. Y.L. carried out the EIS sensor experiments. N.S., S.H., and J.C. were involved in planning, discussion, and supervision of the project. All authors contributed to writing and editing of the manuscript.

Corresponding author

Correspondence to Jorge Chávez.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wolfe, M., Webb, S., Chushak, Y. et al. A high-throughput pipeline for design and selection of peptides targeting the SARS-Cov-2 Spike protein. Sci Rep 11, 21768 (2021). https://doi.org/10.1038/s41598-021-01225-2

Download citation

Received: 18 August 2021
Accepted: 22 October 2021
Published: 05 November 2021
DOI: https://doi.org/10.1038/s41598-021-01225-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.