Introduction

Human papillomaviruses (HPVs) are ubiquitous double-stranded DNA (dsDNA) viruses belonging to family papillomaviridae. HPV induces hyperplastic, papillomatous, and verrucous squamous cell lesions in the skin and at various mucosal sites in a wide range of hosts, including humans and exploits the cellular machinery for their own purposes. Of more than 200 genotypes that have been described till date1, some are categorized as high-risk (HR-HPVs) types (HPV 16, 18, 31, 33. 35, 39, 45, 51, 52, 56, 58, 59, 68, 73 and 82) which are associated with genital and other epithelial cancers, and others as low-risk (LR-HPVs) types (HPV6, 11, 40, 42, 43, 44, 54, 61, 70, 72, 81, CP6108), which are responsible for benign tumors and genital warts2,3,4. Worldwide, it is one of the most common sexually transmitted infections, causing approximately 5% of all cancer cases5,6. Out of 200 HPVs, mainly HPV 16 and HPV 18, are the crucial etiological agents for several epithelial cell malignancies.

HPVs are small, non-enveloped, circular dsDNA viruses with ~ 8 kb genome containing three regions, late proteins (L1 and L2), early proteins (E1, E2, E4, E5, E6, and E7) and upstream regulatory region (URR). Out of 6 early proteins, E5, E6, and E7 play a crucial role in the transformation of cell. Two early oncoproteins, E6 and E7, are constantly expressed in almost all cervical cancer cells and identified as ideal targets for the immunotherapy of HPV-associated cancers7,8.

The World Health Organization (WHO) prequalified three vaccines against HPV i.e., a bivalent, a quadrivalent and nonavalent vaccine are being administered to encounter HPV 16 & HPV 18 infection which are responsible for approximately 70% of global cervical cancer cases (https://www.who.int/teams/immunization-vaccines-and-biologicals/diseases/human-papillomavirus-vaccines-(HPV). The quadrivalent vaccine having an additional impact on LR-HPV 6 & HPV 11 and showed efficacy in preventing anogenital warts caused by HPV 6 & 11 (https://www.who.int/immunization/hpv/vaccines/en/). However, none of them have therapeutic effects9,10. Due to the high prevalence and mortality with HPV associated cervical cancer, an effective therapeutic HPV vaccine or a vaccine having both prophylactic and therapeutic properties is urgently required for the clearance of the virus from the host.

Peptide-based vaccines have certain advantages as they are easy to produce and transport, have high selectivity, multivalency competence and have accessibility to the epitope. With the advancement in genome sequencing techniques, potential B and T cell epitopes can be predicted and have a promising outlook for the development of the peptide-based vaccine against any infectious agent or disease and/or cancers. At present, a branch of bioinformatics, immunoinformatics, used for the identification of effective T- & B- cell immunogenic epitopes from the antigenic peptide by different software which aids us to classify cost and labor effective immunogenic peptides by eluding the non-immunogenic sequence. These methods aid as low-cost by saving the cost of synthetic peptide expense of and working time11,12.

The polymorphism in host genetics can impact the immune response to a pathogen in the target population. In view of the polymorphic nature of major histocompatibility complex (MHC) class I and II alleles they bind towards specific repertoire, short peptides (epitopes), from the processed pathogen and presentation of these complexes to T cells is very important for the development of effective cytotoxic T lymphocytes (CTL) response against the pathogen13. The present study focused on the development of the effective chimeric vaccine having both prophylactic and therapeutic effects against the two HR-HPV (HPV 16 & 18) by reverse vaccinology approach. Here we have analyzed the L1, E5, E6 and E7 proteins of both HPV 16 and 18 by employing various algorithms for the prediction of the best T-cell and B-cell epitopes. Next, we have constructed a multi-epitope chimeric construct and evaluated all physiochemical parameters along with its antigenicity and allergenicity. Furthermore, we have done in silico cloning and checked the stability of our construct through MD simulation. The complete design and methods were used in this study are depicted in Fig. 1.

Figure 1
figure 1

An overview of steps used in designing the chimeric vaccine construct.

Results

T lymphocyte epitopes

Since epitopes on T cells are bound to MHCs, hence their interaction can be modeled accurately on the basis of certain algorithms. Both MHC-I & MHC-II epitopes are needed to make an immunogenic vaccine construct. Here, we have predicted the potent MHC-I & MHC-II epitopes using IEDB server on the default parameters.

MHC-I epitopes prediction

For MHC-I, the prediction of the epitopes of L1, E5, E6 & E7 proteins of HPV 16 & 18 has been done for the 27 reference alleles. The epitopes having a percentile score less than 0.5 have been short listed. The epitopes having the antigenicity score greater than 0.4 were considered as probable antigens.

HPV 16

A total of 155, 12, 22 and 17 epitopes having the percentile score < 0.5 in L1, E5, E6 & E7 proteins of HPV 16, respectively, has been selected for further analysis. These predicted epitopes were further analyzed for the antigenicity score and MHC-I immunogenicity. On analysis, 18, 03, 05 & 04 were found to have antigenicity score 0.4 and predicted as probable antigen for L1, E5, E6, & E7 protein, respectively (Table 1).

Table 1 MHC-I epitopes of L1, E5, E6, & E7 proteins of HPV 16.
HPV 18

A total of 155, 13, 38 and 9 epitopes having the percentile score < 0.5 in L1, E5, E6 & E7 proteins of HPV 18, respectively, has been selected for further analysis. These predicted epitopes were further analyzed for the antigenicity score and MHC-I immunogenicity. The epitopes having an antigenicity score greater than 0.4 and probable antigen. On analysis, 19, 09, 02 & 01 were found to have antigenicity score 0.4 and predicted as probable antigen for L1, E5, E6, & E7 protein, respectively (Table 2).

Table 2 MHC-I epitopes of L1, E5, E6, & E7 proteins of HPV 18.

MHC-II epitopes prediction

For MHC-II, the prediction of the epitopes of L1, E5, E6 & E7 proteins of HPV 16 & 18 has been made for the 7 allele HLA reference set. The epitopes having a percentile score less than 0.5 are short-listed. A total of 22 and 24 epitopes of the L1 & E5 protein of HPV 16 were predicted for the 7 allele HLA reference set however, no predication has been made for the E6 & E7 protein of the HPV 16. For HPV 18, L1, E5, E6 & E7 protein showed the 36, 27, 06, 04 epitopes having the percentile score less than 0.5. These predicted epitopes of HPV 16 & HPV 18 were further analyzed for the interferon- γ (IFN-γ) score (positive IFN-γ Score). In HPV 16, out of 22 and 24 epitope of the L1 & E5 protein, only 09, & 03 epitopes had the positive IFN-γ Score (Table 3). For HPV 18, out of 36, 27, 06, 04 epitopes, only 07, & 03 epitopes of L1 & E5 protein and none for E6 & E7 protein had the positive IFN-γ Score (Table 4).

Table 3 MHC-II epitopes of L1 and E5 proteins of HPV 16.
Table 4 MHC-II epitopes of L1 & E5 proteins of HPV 18.

Linear B cell epitopes

The B cell epitopes for L1, E5, E6 & E7 proteins of HPV 16 & 18 were predicted by using the ABCpred server (Tables 5 and 6). The final construct was subjected to ABCpred for the prediction of the potential B cell epitope at a cut-off of 0.8 and 24 potential epitopes were predicted in the multi-epitope chimeric vaccine construct (Table 7). The most potential B cell epitopes were LQFIFAAYKFGFPDTS, PGSIFYHAGSSRLLTV, & PGTAFTVYVFCFLLPM having score more than 0.90, 0.89 & 0.88, respectively. The B cell epitopes are important in the antigenic determinants and recognized by receptors on B lymphocytes.

Table 5 B Cell epitopes of L1, E6, & E7 proteins of HPV 16.
Table 6 B Cell epitopes of L1, E5, E6, & E7 proteins of HPV 18.
Table 7 Predicted B cell epitopes in vaccine construct.

Multi-epitope chimeric vaccine construction and its characterization

The potent epitopes of L1, E5, E6 & E7 proteins of HPV 16 & 18 were selected and joined with specific linkers for the adjuvant, MHC class I and MHC class II epitopes. In the vaccine construct, adjuvant β-defensin, 45 amino acid long, was at the N terminal and linked to the MHC-I alleles with EAAAK linker. The MHC-I epitopes of L1, E5, E6 & E7 proteins of HPV 16 & 18 were linked together with the AAY linker, on the other hand MHC-II epitopes were linked with GPGPG linkers and the final construct was prepared. Final vaccine construct has the 411 amino acid along with the addition of the adjuvant and linkers. The schematic representation of the prepared construct along with the secondary and tertiary structure shown in Figs. 2 and 3, respectively.

Figure 2
figure 2

Schematic representation of the prepared multi-epitope chimeric vaccine construct having adjuvant, MHC-I & MCH-II epitopes linked with EAAAK, AAY & GPGPG linkers respectively.

Figure 3
figure 3

Predicted 2D and 3D structure of the construct, (a). Predicted 2D structure of the multi-epitope chimeric vaccine construct, (b). Predicted 3D structure of the multi-epitope chimeric vaccine construct by the online I-TASSER server (https://zhanglab.ccmb.med.umich.edu/I-TASSER). The 3D structure of the protein was visualized in UCSF Chimera version1.13rc https://www.rbvi.ucsf.edu/chimera. (c). Ramachandran Plot analysis of the multi-epitope chimeric vaccine construct.

Physiochemical properties of multi-epitope chimeric vaccine construct favored for vaccine production

The physiochemical properties of the multi-epitope chimeric vaccine construct were analyzed using the ProtParam server. The 411 amino acid of this construct has a molecular weight of around 46,246.53 g/mol and theoretical pI was 8.67, depicting it as basic nature. The construct is thermostable as its aliphatic index is 91.07 and average half-life in 30 h in-vivo (in mammalian reticulocytes), whereas > 20 & > 10 h in yeast & Escherichia coli (E. coli), respectively.

Allergenicity and antigenicity

The allergenic property of the multi-epitope chimeric vaccine construct was predicted by AllerTOP v2.0 and found to be non-allergenic behavior. The predicted antigenicity score of this construct was 0.5883 by using Vaxijen v2.0 at a threshold value of > 0.4%, which shows the antigenic nature of the construct.

The multi-epitope chimeric vaccine construct favored secondary and tertiary structure

The 411 amino acid long multi-epitope chimeric vaccines construct was analyzed for secondary structure, out of 411 amino acids, 200 (48.66%) formed alpha helix, 69 (16.78%) β-strands and coil were formed by 142 (34.54%) (Fig. 3a). I-TASSER uses 10 threading templates which showed good Z-score values (ranging from 0.62 to 3.54) and predicted 5 models. The model was selected on the basis of a high C score (higher value indicates higher confidence) which typically ranges from − 5 to 2 (Fig. 3b). The selected model was analyzed for the Ramachandran plot using the SAVES server and found 93.8% of residues in the most favored regions and additional allowed regions (Fig. 3c).

Toll-like receptor (TLR)-3 and TLR4 established the interaction with multi-epitope chimeric vaccine construct

The multi-epitope chimeric vaccine construct with the selected epitopes was docked with TLR3 (2A0Z) and TLR4 (3FXI) and showed good interaction. The docked structure along with molecular interactions between docking complex of vaccine construct and TLR3 & TLR4 were visualized in the LigPlot (Fig. 4a,b).

Figure 4
figure 4

Molecular docking between multi-epitope chimeric vaccine construct and TLR 3 & TLR 4 receptors using the PatchDock (https://bioinfo3d.cs.tau.ac.il/PatchDock/php.php) server, (a). Depiction of the docked conformation and interaction of the TLR 3 & multi-epitope chimeric vaccine construct in Ligplot analysis indicating hydrogen bonding and hydrophobic interactions between the construct and TLR3, (b). Depiction of the docked conformation and interaction of the TLR 4 & multi-epitope chimeric vaccine construct in Ligplot analysis indicating hydrogen bonding and hydrophobic interactions between the construct and TLR4.

MD simulation studies show a stable interaction between ligand and its receptor

The molecular dynamics analysis was performed to access the binding interactions and flexibility of the binding site. MD simulations were done for 100 ns on each complex (total 200 ns) and the stability of the simulation was evaluated using root mean square deviation (RMSD). The RMSD values reveal the structural changes that occurred during the MD. The RMSD plots for all proteins indicated that each system got stabilized quickly and then remained stable throughout the simulation time as evidenced by the movement of the RMSD curve within the 2 Å (Fig. 5a,b). These plots suggested that each system was quite stable for further study.

Figure 5
figure 5

MD Simulation Results. (a). RMSD curve for TLR3-chimeric vaccine construct complex, (b). RMSD curve for TLR4-chimeric vaccine construct complex, (c). RMSF curve for TLR3-chimeric vaccine construct complex, (d). RMSF curve for TLR4-chimeric vaccine construct complex. The beta sheets of the TLR3 and TLR4 show limited flexibility. The flexibility of the interface binding residues is also less indicating stable interactions, (e). The complex of TLR3-chimeric vaccine construct complex generated by the VMD software version 1.9357, (f). The complex of TLR4-chimeric vaccine construct complex generated by the VMD software version 1.9357, Three most stable salt bridges are shown for both of the complexes. The salt bridges anchor the two proteins by stable salt-bridge interactions. The complex of TLR3-chimeric vaccine construct showed stronger salt-bridge interactions as measured by occupancy indicating it may be comparatively more stable than TLR4-chimeric vaccine construct complex.

The root-mean-square fluctuation (RMSF) study was done to get an insight into the flexibility of individual amino acids during the simulation. The amino acid residues taking part in important interactions are generally constrained and show less flexibility as compared to other amino acid residues in the complex. It can be seen from RMSF plots (Fig. 5c,d) that the beta-sheets of TLR3 and TLR4 are showing less fluctuations (wavy pattern) due to strong hydrogen bonding between them. The other part of HPV is showing greater fluctuations. A comparison of the two plots indicates that the complex with TLR3 is showing comparatively lesser fluctuations indicating better binding among the interaction partners and a stronger complex.

The chimeric vaccine construct-TLR3 salt-bridge analysis revealed residues at the binding interface show strong interactions especially ARG348_chimeric vaccine construct-GLU533_TLR3, ASP246_chimeric vaccine construct-LYS41_TLR3, ARG380_chimeric vaccine construct-GLU306_TLR3 were found to be quite stable with percent occupancy of 72, 63 and 55 respectively. It can be seen in Fig. 5e that these salt bridges anchor the two proteins at three distinct points. In case of HPV-TLR4 complex, the binding interactions were less strong as compared with the chimeric vaccine construct-TLR3 complex. The ARG598_TLR4-GLU330_chimeric vaccine construct, ASP453_TLR4-LYS26_chimeric vaccine construct, GLU154_TLR4-ARG36_chimeric vaccine construct interactions showed however some stabilizing interactions with > 40% occupancy. The salt-bridges are shown in Fig. 5f. The results clearly showed the complex HPV-TLR3 is much more stable than chimeric vaccine construct-TLR4.

In silico cloning of the construct in pcDNA3.1/V5/His-Topo vector

The vaccine candidate was processed for the codon optimization for the maximal level of protein expression in E. coli using Jcat server and the optimized vaccine candidate had 1233 nucleotide and the GC content of 67.8%, which showed the high expression of the vaccine candidate. The plasmid vector was prepared using the SnapGene software into pcDNA3.1/V5/His-Topo (Fig. 6).

Figure 6
figure 6

In silico cloning of multi-epitope chimeric vaccine (shown in green) in pcDNA3.1/V5/His-TOPO/LacZ vector.

Immune simulation analysis shows an effective cytokine and antibody response

The result of the C-ImmSim studies showed the effective immune responses of the vaccine candidate which showed the high level of the IgM and also showed the increased level of the immunoglobulin (IgG1 + IgG2, IgM and IgG + IgM) expression and T helper cell and cytotoxic T cells. The production of the cytokines & interleukins was identified along with increased B- & T-cells (Fig. 7).

Figure 7
figure 7

Immune stimulation using in silico C-ImmSim server after antigen 3 dose injection of vaccine construct61, (a). Production of the immunoglobulin after antigen injections (black vertical lines); detailed subclasses are showed as coloured peaks, (b). Evolution of B-cell populations, (c). Th cell population evolution, (d). T-cytotoxic cell populations, (e). Production of cytokines & interleukins.

Discussion

Cervical cancer caused by HPV has reached to its alarming stage and become a public threat globally14. The current management of this disease is not satisfactory and mainly depends on three prophylactic vaccines15. So far, no therapeutic vaccine is available against HPV. Hence, a vaccine having both prophylactic and therapeutic properties will help not only to prevent the HPV infection but should be used to treat the established infection.

Nowadays, immunoinformatics, a branch of bioinformatics, offers new tools for the identification and design of epitopes against specific antigens, which could be used as an ideal target for the development of vaccine against specific viral or pathogenic infections16,17. Immunoinformatics, helps in the identification of effective epitopes which can activate both cell-mediated or humoral immunity against specific viral or pathogenic infections. In 2003, Adu-Bobie et al., developed the first vaccine against Neisseria meningitides by using immunoinformatics approach18. Many research articles showed the importance of the highly immunogenic epitopes for CD8 + and CD4 + CTLs is required in a vaccine for persuading a strong immune response19,20. The peptide (generally 8–11 residues) presented by the MHC-I molecules intracellularly recognized by CD8 + CTLs, whereas the MHC-II recognized extracellular-originated peptides (generally 10–30 residues) by CD4 + Helper T lymphocytes (HTLs). The ideal peptide length for MHC-I and MHC-II is 9 residue & 12–16 residue, respectively. The strong immune response generated by a vaccine depends on the interaction of peptide-MHC complex (pMHC) and T cell receptor19,21. Also, This reverse vaccinology approach has various advantages due to its less time-consuming property, cost-effectiveness, more accuracy and safety. Therefore, most of the research has been done taking this approach in different organisms such as viruses, bacteria, and parasites, for the identification and development of multi-epitope vaccines22,23,24,25,26,27,28,29.

In the present study, the epitopes for the MHC-I & MHC-II were predicted for the L1, E5, E6 & E7 proteins of two highly oncogenic strains of HPV i.e. HPV 16 & HPV 18. These selected proteins of HPV 16 & HPV 18 play an important role in the viral structure, cell transformation and immune evasion to induce malignancy in the cells30,31,32. The chimeric vaccine construct prepared in the study has also shown 16 potential B cell epitopes when submitted to ABCpred server. The most potent epitopes were LQFIFAAYKFGFPDTS, PGSIFYHAGSSRLLTV, & PGTAFTVYVFCFLLPM having score more than 0.90, 0.89 & 0.88, respectively. Further we have made multi-epitope chimeric construct with the help of adjuvants and linkers with appropriate position.

The final vaccine construct has the molecular weight of 46.24 kDa with chemical formula C2177H3223N529O564S13 and has a theoretical isoelectronic point (pI) was 8.67, which suggested the basic nature of the prepared vaccine construct. The aliphatic index and grand average of hydropathicity (GRAVY) of the construct were 91.07 and 0.186, respectively, which suggested that protein is thermostable & hydrophilic in nature. Sympathetic physiochemical properties of the vaccine construct suggested it as a functional vaccine candidate. Knowing the structural properties like secondary & tertiary structure of the vaccine candidate is important for a vaccine candidate. On analyzing, the vaccine candidate has 48.66% alpha-helix, 16.78% β-strands and 34.54% coil. The protein or peptide having alpha helix & coiled coils are important as structural antigens as they can be recognized by antibodies. The 3D structure of the vaccine construct showed the 93.8% of residues in the most favoured regions and additional allowed regions, which indicate the acceptable quality of the predicted vaccine candidate. The multi-epitope chimeric construct was antigenic and non-allergic suggesting its efficacy for the induction of robust immune responses without any allergic responses. The docking results of vaccine construct with TLR3 & TLR4 showed a high binding affinity towards TLR3 & TLR4.

Further, the immune simulation showed an increased T & B cell production. In addition, the B-cells were present for several months along with the stimulated T helper cells. The expression of cytokines showed the increased level of IFN-γ from first injection and gradually reached to the peak level after repeated injection i.e., exposure of the antigen. The increased level of the T helper cells showed the efficient production of the immunoglobulin (Ig) to support humoral response. The expression of the vaccine construct is important in the suitable host. The vaccine construct was optimized and has 67.79% of GC content and the codon CAI-value of the improved sequence was 0.95, which indicates favourable expression like E.coli (Rosano, G. L. & Ceccarelli, E. A. et al., 2014). The optimized sequence was cloned in the pcDNA3.1/V5/His-Topo for efficient expression.

Earlier, several attempts have been made for the development and identification of multi-epitope vaccines against HPV such as the use of virus-like particles (VLPs) based on L1 protein33 and L2-based vaccination34,35. In another study by Negahdaripour et al., predicted epitopes from the HPV 16 L2 protein36,37 while other studies used L1 protein of HPV 16 for epitope prediction38. Overall, most of the studies were performed using E6 and E7 protein as key molecules for epitope prediction due to various reasons.

The search for a better vaccine candidate is still under consideration. Earlier, most of the studies were done by targeting E6 and E7 proteins of HR-HPV 16 and 18. However, none of them reached clinical trials. The majority of the previous studies have focused on the E6, E7 oncoproteins and L1 capsid protein of HPV. However, apart from the oncogenic E6, E7 protein and highly immunogenic L1 protein, E5 is crucial in HPV pathogenesis. E5, along with E6 and E7 is responsible for the transformation of normal cells, downregulation of MHC I expression, protein trafficking through ER and preventing the acidification of endosomes39,40,41,42,43 Recently, Namvar et al. predicted epitopes from the E5 and E7 proteins of HPV 16/18/31/4544 implying the future potential of targeting HPV E5 protein.

Here, for the first time we have analyzed and taken the major oncoproteins (E6 and E7), highly immunogenic L1 capsid protein and also, E5 protein from both the oncogenic strains of HPV (HPV16 &18) and prepared a multi-epitope chimeric vaccine through in silico approach. Our multi-epitope construct has peptides like CYSLYGTTL, ISEYRHYCY, KLPDLCTEL, LLMGTLGIV and TLQDIVLHL that have been experimentally validated45,46,47,48,49. In addition, the first HPV 16 E7 predicted epitope in our study RAHYNIVTF, was also recognized by Feltkamp et al. 1993 who were the pioneers of HPV epitope studies50.

Taken together, our results showed the chimeric construct has the ability to induce both cell-mediated and humoral response and consisting all immunogenic, physicochemical and structural properties which are required for ideal vaccine design. The immune stimulation analysis suggested the efficiency of our construct to initiate an effective cytokine response and B cell response. Finally, our multi-epitope chimeric construct can be used in both prophylactic and therapeutic purposes in in-vitro/in-vivo studies against HPV infection. In addition, further validation will be required for the use of our construct in combination with other cancer immunotherapy approaches to target HPV-induced malignancies.

Material and method

Retrieving of protein sequences

The amino acid sequences of L1, E5, E6 & E7 proteins of HPV 16 (NP_041332; NP_041330; NP_041325 & NP_041326) and HPV 18 (NP_040317; NP_040315; NP_040310; & NP_040311) were retrieved from the National Center for Biotechnology Information (https://www.ncbi.nlm.nih.gov/).

T-cell epitopes prediction

The epitopes for MHC-I & MHC-II alleles from retrieved protein sequences of L1, E5, E6 & E7 proteins of HPV 16 & 18 were predicted by using Immune Epitope Database (IEDB): analysis resource (http://www.iedb.org/) supported by National Institute of Allergy and Infectious Diseases, a component of the National Institutes of Health in the Department of Health and Human Services. The default parameters were used for the prediction of the epitopes using the IEDB server. The percentile score (lower the percentile value higher the binding affinity of the predicted epitope) was chosen less than 0.5 for MHC-I alleles and 0.5 for MHC-II alleles.

Prediction of MHC Class I immunogenicity

The identified MHC class I epitopes were further examined and confirmed for the immunogenicity by using another tool i.e. IEDB MHC Class I immunogenicity (http://tools.iedb.org/immunogenicity/). This tool only analyzed and validated on 9 mer peptides.

Prediction of antigenicity & IFN-γ inducing epitope

Vaxijen, an online tool was used to check the antigenicity in predicted epitopes51. Further, to confirm whether selected epitopes have the ability to induce IFN-γ, we have used IFN epitope server (https://crdd.osdd.net/raghava/ifnepitope/predict.php) in this study52. In our analysis, we have performed Motif and SVM hybrid algorithms and IFN-γ versus non IFN-γ model for prediction52.

Preparation and Physiochemical properties of the vaccine construct

A chimeric vaccine construct of HPV 16 and HPV 18 having MHC class I & II predicted immunogenic and conserved epitopes were analyzed together with linkers to get the final vaccine construct using β-defensin adjuvant. It has been well established the role and properties of β-defensin as an adjuvant against viral infections24,53, which was associated with EAAAK linker at N terminal. The AAY linker was associated with MHC- I epitopes while MHC-II epitopes were linked with GPGPG linkers. The Protparam server was used to compute and validate the physiochemical properties of the final vaccine construct.

Prediction of B-cell epitopes

In the reverse vaccinology approach, a successful vaccine must also induce a strong B-cell mediated humoral immune response. Hence, it is of utmost to show that chimeric vaccine construct is able to induce protective humoral immunity. Here, for B cell epitope prediction the artificial neural network-based ABCpred server (http://www.imtech.res.in/raghava/abcpred/ABCsubmission.html) was employed. All the parameters were setup in default conditions but selected the epitope having score more than 0.854.

Prediction of Allergenicity

Proteins and peptides are well known to induce allergenic reaction. We have used an online tool AllerTOP v2.0 (https://www.ddg-pharmfac.net/AllerTOP/) to check the allergenicity of the chimeric vaccine construct.

Prediction of secondary and tertiary structure

The Psipred, which is an online server was used to predict the secondary structure of prepared chimeric vaccine construct which mainly uses primary amino acid sequences in specific manner (http://bioinf.cs.ucl.ac.uk/psipred/). For the prediction of the tertiary structure of the chimeric vaccine construct, the online freely available server I-TASSER server (https://zhanglab.ccmb.med.umich.edu/I-TASSER/) was used which utilizes sequence-to-structure-to-function paradigm. The last five community-wide CASP experiments ranked I-TASSER server as the top server for the prediction of protein structure24,55.

Validation of the tertiary structure

PDBsum, a pictorial database was used to validate the tertiary structure which is based on to generate Ramachandran plot (http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/Generate.html) that gives an at-a-glance impression of each 3D structure contents submitted in the PDB. Finally, the validation of the 3D structure is vital for the protein model, as it can detect potential errors in modeled 3D structure.

Molecular docking of vaccine construct with TLR3 and TLR4

One of the most crucial steps in reverse vaccinology is to check and predict the chimeric vaccine construct efficacy by molecular docking. Here we have docked TLR 3 (2A0Z) & TLR4 (3FXI) with prepared chimeric vaccine construct by using PatchDock (https://bioinfo3d.cs.tau.ac.il/PatchDock/php.php) server. This server is very efficient and fast and works on specific algorithms. Another server FireDock was used to refine and for the re-scoring of the results obtained from docking (http://bioinfo3d.cs.tau.ac.il/FireDock/php.php). This server generates better results with the lowest global energy. Finally, the best-docked model was used and selected and a visualizing tool UCSF Chimera was used for biocomputing, visualization and informatics for analysis56.

MD simulation

In the current study, the preliminary topology and coordinates for the complexes were generated in VMD version 1.9357. The complexes were prepared and solvated in a rectangular water box (TIP3P) with a buffering distance of 10 Å. Ions were added to ensure the electro-neutrality of the solvated system. SETLE algorithm was used for the water molecules model system. The associated system topology and coordinates were generated by applying charmm34 force field parameters for MD simulation. The MD simulations were executed using NAMD version1.958. Prior to the simulation, the system was properly minimized with a stepwise minimization protocol. Firstly, the water molecules and ions were minimized, followed by hydrogen atoms and the side chains of the complex. The side chains were minimized for 100,000 steps while the backbone atoms and the bond lengths of hydrogen atoms were kept fixed. Thereafter, all the atoms were allowed to relax freely and the whole system was energy-minimized for 100,000 steps with nominal restraints on C-alpha atoms and DNA backbone atoms (10 kcal/mol) to prevent any abrupt change in structure. Subsequently, an equilibration protocol was followed where the system was heated gradually from 0–310 K in steps of 30 K with a canonical ensemble (NVT). At each step, a 20 picosecond (ps) simulation was run to allow the system to adjust to the temperature. Once the system attained 310 K, an isobaric and isothermic ensemble (NPT) was applied for a period of 100 ps with a constant pressure of 1.0 bar using Langevin dynamics0. Finally, the applied restraints on C-alpha atoms and DNA were removed and the system was equilibrated for 1 ns at 310 K using Langevin piston coupling algorithm. During the whole simulation, the Particle Mesh Ewald sum algorithm (PME) algorithm was used to calculate the long-range electrostatic interactions, the hydrogens were constrained using SHAKE algorithm. After equilibration, a production run was done on each complex for 100 ns. The analyses of the MD trajectories were performed to get an insight on the structure and dynamic behavior of all complexes. The trajectories were analysed for root mean square deviation (RMSD), root mean square fluctuation (RMSF), hydrogen bonds, and salt bridges in VMD.

In silico cloning and optimization of designed vaccine candidate

Java Codon Adaptation Tool (JCat) server (http://www.jcat.de/) was used for codon optimization and reverse translation to check translation and efficiency of cloning of the multi-epitope vaccine construct. The protein expression level was assessed by using codon adaptation index (CAI) and the percentage GC content generated in JCat output59. The optimized sequence of the vaccine was then cloned into pcDNA3.1/V5/His-TOPO/LacZ vector using DNASTAR (https://www.dnastar.com).

Immune simulation

In order to characterize the effective immune response generated by the prepared vaccine construct, an online simulation server, C-ImmSim (http://150.146.2.1/C-IMMSIM/index .php), was used61. This server uses a position-specific scoring matrix (PSSM) to predict immune epitope and predict the immune interaction by using machine learning. It predicted both cellular and humoral immune in mammalian immune system60,61. The server was used on the default parameter with simulation volume 50 and the simulation steps 1000 with three injections of the predicted vaccine construct at an interval of 4 weeks.