Article | Open | Published:

Exploring Leptospiral proteomes to identify potential candidates for vaccine design against Leptospirosis using an immunoinformatics approach

Scientific Reportsvolume 8, Article number: 6935 (2018) | Download Citation

Abstract

Leptospirosis is the most widespread zoonotic disease, estimated to cause severe infection in more than one million people each year, particularly in developing countries of tropical areas. Several factors such as variable and nonspecific clinical manifestation, existence of large number of serovars and asymptomatic hosts spreading infection, poor sanitation and lack of an effective vaccine make prophylaxis difficult. Consequently, there is an urgent need to develop an effective vaccine to halt its spread all over the world. In this study, an immunoinformatics approach was employed to identify the most vital and effective immunogenic protein from the proteome of Leptospira interrogans serovar Copenhageni strain L1-130 that may be suitable to stimulate a significant immune response aiding in the development of peptide vaccine against leptospirosis. Both B-cell and T-cell (Helper T-lymphocyte (HTL) and cytotoxic T lymphocyte (CTL)) epitopes were predicted for the conserved and most immunogenic outer membrane lipoprotein. Further, the binding interaction of CTL epitopes with Major Histocompatibility Complex class I (MHC-I) was evaluated using docking techniques. A Molecular Dynamics Simulation study was also performed to evaluate the stability of the resulting epitope-MHC-I complexes. Overall, this study provides novel vaccine candidates and may prompt further development of vaccines against leptospirosis.

Introduction

Leptospirosis is the most widespread zoonosis in the world and emerging as a major public health concern1,2. The global incidence of this tropical disease has been estimated over 1 million cases of severe infection in human amounting to nearly 60,000 deaths annually3. It is caused by pathogenic species of Leptospira and can get transmitted to human by direct contact with reservoir hosts or via exposure to surface water or soil contaminated with their urine2,4. Both wild and domestic animals can serve as reservoir hosts of Leptospira; however, animals such as rodents, pigs, cows, dogs and horses are the most common hosts and sources of infection to humans. Leptospirosis is predominantly an occupational disease where agricultural workers, veterinarians and mineworkers are mainly at risk because of their exposure to contaminated water, soil and infected animals during their regular activities5. The clinical symptoms of leptospirosis in humans are diverse, ranging from mild fever, chills, flu-like illness, headache, muscle aches, to acute disease form known as Weil’s syndrome6. The acute form is characterized by multiple organ complications, including acute renal and hepatic failure, cardiovascular collapse, jaundice, meningitis, pneumonitis and pulmonary haemorrhage, which in turn can lead to death7,8. Also, the disease has a major economic impact on the agricultural industry and companion animals, since it affects the livestock inducing abortions, infertility, stillbirths, reduced milk production and death, especially in developing countries8,9.

Presently, lack of proper therapeutics and vaccine against leptospirosis is increasing the burden of this disease day by day globally. The vaccination against leptospirosis in human populations may prove to be the most feasible approach for controlling the disease. Although, for over 100 years, whole cell inactivated and attenuated vaccines have been used for agricultural and companion animals and in some countries, also being used in human populations. But, due to their adverse effects, short-term immunity and insufficiency in inducing cross-protection, they have not been implemented globally10,11. Leptospira comprises more than 250 antigenically distinct serovars among pathogenic species12,13. This antigenic diversity of pathogenic Leptospira species makes up a challenge for the researchers to develop effective and cross-reactive vaccines.

In last two decades, classical research approach is being used to identify protein targets towards the development of subunit and recombinant vaccines against leptospirosis14. The current research for developing recombinant and subunit vaccines are mostly focused on leptospiral motility, outer-membrane proteins (OMPs), lipoproteins, lipopolysaccharides (LPSs) and virulence factors15. These proteins have been recognized as playing major role in the interaction of pathogens with host cells and possibly associated with pathogenesis; hence, is the major focus of current vaccine research. Among these, significant protection in the hamster model has been reported with several outer membrane proteins, including LipL32 and the leptospiral immunoglobulin-like proteins (Lig)16,17,18. However, protective efficacy of these candidates was limited and also had failed to induce cross-protection and sterile immunity. Therefore, a highly conserved target that can stimulate both humoral and cell mediated immunity against leptospirosis is crucial for the development of an effective vaccine. The current status of leptospiral vaccine development demonstrates that there is an urgent need for the discovery of new effective vaccine candidates to provide immunity against majority of serovars18.

The availability of omics and immunological data, and advances in the computational algorithms have improved the efficiency of vaccine development process by accelerating the research towards the identification of dominant immunogen and thereby potential epitope candidates19,20,21. Various studies have shown that epitope-driven vaccines could effectively stimulate protective immune responses against diverse pathogens, such as influenza virus, human immunodeficiency virus, hepatitis B virus, and hepatitis C virus22,23,24. As a matter of fact, identification of B-cell and T-cell epitopes is a crucial and noteworthy step for the epitope-based vaccine development. Immunoinformatics is now becoming ubiquitous in the field of vaccine development which utilizes genome and proteome based information and offers high level of confidence for the prediction of potential vaccine candidates25. Recently, the approach has been widely accepted for screening the effective immunogens for potential vaccine design of infectious diseases.

In the current study, with the help of immunoinformatics approach, whole proteome of Leptospira interrogans serovar Copenhageni strain L1-130 (LIC) was screened for the most immunogenic and conserved outer membrane (OM) proteins. Subsequently, various B-cell and T-cell epitopes were obtained that could induce protective humoral and cellular immune responses and may be characterized as effective vaccine candidates. Identifying the binding interaction between epitope and major histocompatibility complex (MHC) molecules is considered as the first step to vaccine design, as T-cell immunogenicity is correlated with the binding strength of epitopes and MHC molecule26. Therefore, these predicted epitopes were modelled and docked with MHC class I molecule and later on, their post-docking interaction analysis helped in the selection of optimal candidates for the development of peptide vaccines against leptospirosis.

Results

This study aims to identify a cross-reactive and conserved potential vaccine candidate with the help of a comprehensive bioinformatics approach. In silico approach may prove as a beneficial and directive approach, whereas conventional methods focus more on pathogen cultivation and protein extraction, where testing of these proteins on a large scale is expensive and time-consuming27,28. Several in silico vaccine candidates have been reported by researchers which were known to produce promising preclinical and clinical trial results29,30.

In the present study, putative antigenic protein has been identified, of which B-cell (linear and conformational) and T-cell epitopes (HTL and CTL) have been predicted for the designing of peptide vaccines against leptospirosis.

Identifying the highest antigenic protein

The selection of optimal immunogen is the first step for vaccine design; hence, to identify the most probable antigenic protein, the whole proteome of LIC constituting a total of 3654 proteins was analysed using VaxiJen v2.0 server. An overall score depicting antigenicity for each protein sequence was evaluated which indicated their potentiality to induce immune response; from which, 21 proteins having highest antigenicity score (>1.0) were selected for further analysis (Supplementary Dataset Table S1).

Identification of Outer Membrane Protein (OMP)

It is generally envisaged that subcellular localization of a protein plays a vital role in determining its functionality. In Gram-negative bacteria, OMPs have diverse functions and were known to be involved in the interaction between bacterial cells and their host31. Moreover, in pathogenic bacteria, OMPs are proven to be the most promising vaccine candidates, due to its interaction with the host immune cells32; and hence, identification of OMPs are crucial for a reliable and rapid development of vaccine. Our analysis identified the subcellular localization of all 21 highly antigenic proteins as mentioned in the methods section, from which two proteins with UniProt ID: Q75FL0 and Q72PD2 were predicted to be OMP (Supplementary Table S2). Of these, protein Q72PD2 was uncharacterised and hence not considered for further analysis. Protein Q75FL0 has been annotated as lipoprotein and located in the outer membrane of LIC; therefore, selected as a candidate immunogen to accomplish the epitope based vaccine design.

Primary and secondary structure determination

Q75FL0, the most probable antigenic protein was analysed for its physicochemical properties and secondary structural characteristics. The results revealed the total length of protein as 717 amino acids with molecular weight of 73950.78 Daltons and theoretical Isoelectric point (PI) of 5.39 (Table 1). The instability index (II) was computed to be 32.21, which implies that the sequence of protein is stable. The sequence has about 59 negatively charged residues (Aspartic acid + Glutamic acid) and 48 positively charged residues (Arginine + Lysine). The amino-acid composition revealed that the protein has 10,261 atoms comprising Carbon (3181), Hydrogen (5058), Nitrogen (896), Oxygen (1121) and sulphur (5). The aliphatic index was calculated as 77.22. The grand average of hydropathicity (GRAVY) was calculated to be negative (−0.328). This negative value indicates the hydrophilic nature of protein and most of the residues to be located on the surface; hence this protein tends to have better interaction with other proteins. The secondary structure analysis of protein revealed that the protein is dominated by random coils (55.09%) followed by extended strand (23.15%), alpha helix (12.69%) and beta turns (9.07%). The calculated secondary structure parameters are shown in Table 2 and a plot for each residue position versus its probability score for being in helix, strand, turn and coil in Fig. 1.

Table 1 Different physico-chemical properties of lipoprotein Q75FL0 of LIC.
Table 2 Secondary structure analysis of lipoprotein Q75FL0 through SOPMA.
Figure 1
Figure 1

The probability graph of occurrence of helix, sheet, turn and coil in the secondary structure of the antigenic lipoprotein Q75FL0 of LIC. Here, the graph indicates the values of probability for each secondary structure at all amino acid positions of the protein.

Homology modelling and tertiary structure refinement

Based upon iterative threading assembly and simulation method, I-TASSER server33 generated five 3D models for the protein sequence and ranked all the model based on their C-scores. C-score values measure similarity between the query and template based on the significance of threading template alignment and the query coverage parameters. Typically, C-score values lie in between [−5 to 2], where a higher value denotes a model with a higher confidence and correct topology. The top ranked 3D model yielded C-score value of 0.16 which indicates that the model is having a good topology. The structure of the top ranked model with its functional domain (LurC domain: 439 − 661) is shown in Fig. 2(A). In addition to C-score, I-TASSER predicted up to ten closest structures in PDB and ranked them on the basis of TM-score and the root mean square deviation (RMSD) of atomic positions related to the best template used for 3D modelling. The closest protein structure and quality assessment parameters for the modelled structure are shown in Table 3. The top ranked model was refined using GalaxyRefine server34 and generated five refined models. Of these, the top ranked structure was selected on the basis of Ramachandran plot (80.3% in favoured region). Consequently, the quality of refined model was evaluated by using PROCHECK, ProSA-Web and ModFold6. PROCHECK calculates the steriochemical quality of the protein and depicted Ramachandran plot as shown in Fig. 2(B). The Ramachandran plot analysis of refined structure revealed that 80.3% of residues were located in most favoured region followed by 15.4% in allowed and 1.8% in generously allowed region, while only 2.5% of the residues were in disallowed region. However, ProSA-Web calculated Z-score of −2.97 indicating the model was not in the range of native protein conformation (Fig. 2(C)). Furthermore, the ModFold6 sever was used to evaluate the overall quality of the model (Table 3).

Figure 2
Figure 2

Predicted three-dimensional structure and validation of lipoprotein Q75FL0. (A) Three dimensional structure of the predicted model with the domain (LurC domain shown in red colour). (B) Ramachandran plot of refined 3D model showing 80.3%, 15.4%, 1.8% and 2.5% residues in the most favoured, additional allowed, generously allowed and disallowed region respectively. (C) Z-Score plot for 3D structure of model showing −2.97.

Table 3 Protein structurally close to the model in the PDB and Quality assessment of the model.

Identification of linear B-cell epitopes

The identification and characterization of B-cell epitopes in target antigen is a key step in the epitope-based vaccine design. The Kolaskar and Tongaonkar’s method35 of the Immune Epitope Database (IEDB) Analysis Resource predicts the antigenic peptides by analysing the physicochemical properties of amino acid residues and their abundance in experimentally determined antigenic epitopes. The result revealed that the protein sequence of 717 aa has 26 antigenic peptides falling in the range of 6-22 amino acids length (Table 4). In addition, maximum residual score for each amino acid residue was also predicted. Out of 717 amino acids, 343 amino acids have residual score ≥ 1.008. Proline and Valine at the 400th and 401st positions, found in the antigenic peptide 394KYEVLLPVAAVPT406, was identified as having the highest antigenic residual score of 1.208. It should be noted that the epitopes 20MKKILILLIALSFAVFGCSHK40 and 42KGILLPFLTLLNQ54 were recognized as allergic to human; henceforward, they could not be considered as vaccine candidates. Fortunately, within 617WAILVPGA624 and 5YSSSFILIIKKG16 epitopes, some residues were also predicted as conformational as well as CTL epitopes, so can be considered as good candidates for peptide vaccine design. Moreover, the result indicated that the average antigenic propensity score of the predicted epitope was 1.008 while the minimum and maximum score was 0.855 and 1.208 respectively. The graphical representation of predicted antigenic residues based on the sequence position (X-axis) and antigenic propensity (y-axis) are shown in Supplementary Fig. S1 (Supporting Information). The detailed information of predicted epitopes including their conservancy and allergenicity are shown in Table 4.

Table 4 Antigenic linear B-cell epitopes of lipoprotein Q75FL0 with their conservancy and allergenicity.

Since potential B-cell epitopes have several key features, including surface accessibility, fragment flexibility and hydrophilicity which are crucial for predicting B-cell epitopes, these were analysed by different methods implemented in IEDB. The surface accessibility prediction showed that the maximum surface probability value of predicted peptides was calculated as 5.282 at amino acid residues from 320 to 325 with the sequence of hexapeptide 320TDKQSK325, where 323Q is a surface residue, while the minimum surface probability score was 0.043 for the peptides 23ILILLI28, where 25I is the surface residue. Peptides with threshold value > 1.0 have high probability to be located on the surface36. The Graphical representation of predicted surface accessible residues on the basis of their sequence position (x-axis) and surface probability (y-axis) are shown in Supplementary Fig. S2 (Supporting Information).

Surface flexibility of peptides is also an important feature for predicting antigenic peptides, as experimental data have shown that the antigenic regions of peptide that interact with antibody are probably more flexible and also well suited for choosing cross-reacting peptide37. Based upon the temperature factor or B factor of Cα atom, Karplus and Schulz flexibility method of IEDB predicted the flexible regions on the protein. The analysis showed that the maximum flexibility value was 1.166 at amino acid position 164 to 170 with a sequence of GSSSSSG, while the minimum flexibility score was 0.891 for the peptide 703AAVAYIL709. Peptides with low B-factor value are predicted to have well-organized structure. The result of predicted surface flexible regions is shown in Supplementary Fig. S3 (Supporting Information).

The Parker hydrophilicity scale method38 was employed to identify the hydrophilic peptides in the protein sequence as discussed in the method section. The maximum hydrophilicity score calculated by this method was 7.4 with a peptide sequence of 406TDTDKDG412; however, the minimum score was calculated as −7.243 for peptide sequence of 24LILLIAL30. The graphical representation of predicted hydrophilic residues on the basis of their sequence position (x-axis) and surface hydrophilicity (y-axis) are shown in Supplementary Fig. S4 (Supporting Information).

Structure-based Epitope Prediction

In order to find conformational B-cell epitope in 3D structure, Ellipro39 was used. This tool predicts the epitopes based on the geometrical properties of the protein structure and it discriminates predicted epitopes from non-epitopes on the basis of known protein antibody complex. The conformational B-cell epitopes with a protrusion index (PI) value above 0.7 were selected. The score (PI) reflects the percentage of protein atoms that extend beyond the molecular bulk and are responsible for antibody binding39. The highest probability of a conformational epitope was computed as 85.5% (PI score: 0.855). The Amino acid residues present in conformational epitopes, the number of residues and their scores are depicted in Table 5, whereas the graphical representations are shown in Fig. 3.

Table 5 Predicted conformational B-cell epitopes of protein.
Figure 3
Figure 3

3D representation of conformational B-cell epitopes (AC) of protein. The predicted epitope residues are denoted by blue colour (ball & stick), and the rest of the residues are in red (Cartoon).

Identification of Helper T Lymphocyte (HTL) cell epitopes

HTL is crucial for inducing and generating an efficient humoral or cytotoxic T-cell response; therefore, in order to find the peptides that may trigger the MHC-II restricted T-cell response, the NetMHCIIpan 3.1 server40 was utilised. Prediction was made for Human Leukocyte Antigen-DR (HLA-DR) alleles and only strong binder (SB) epitopes having IC50 value < 50 nM with high binding affinity to HLA-DR, were considered. As a result, a total of 33 SB T-cell epitopes for the query sequence were predicted and are shown in Supplementary Dataset Table S3. It has been known that the binding strength of HTL epitope to the HLA-DR is a key factor in immunogenicity of the T-cell epitope and a good T-cell epitope candidate should interact with maximum number of HLA alleles41,42. Therefore, based on the highest number of HLA-DR binding alleles, the top 10 epitopes were selected as putative HTLs (Table 6). Of these, epitope sequence 17YRVMKKILI25 interacting with highest number of HLA-DR alleles (336 alleles) can be considered as a good candidate for subunit-vaccine design. On the other hand, the peptide sequence 48FLTLLNQDA56 interacting with 120 alleles was predicted to be allergenic to human; hence, could not be considered for vaccine design. Moreover, the conservancy of all selected epitopes were found in the range of 2.17 to 95.65%, representing 46 serovars of pathogenic Leptospira spp. The epitope 565IVFNSPVKK573 interacting with 132 HLA-DR alleles was predicted to be at the highest conservancy level (i.e. conserved among 44 serovars). Details of predicted HTL epitopes along with their binding HLA-DR alleles are shown in Supplementary Dataset Table S3.

Table 6 Top 10 HTL epitope with their allergencity and conservancy, selected on the basis of maximum number of HLA-DR binding alleles.

CTL Epitope prediction

Cytotoxic-T-lymphocytes (CTLs) are critically one of the vital instigators of cellular immunity and play an important role in eliminating the infected cells. Hence, to identify the potential T-cell epitope that is recognized by CD8+ T-cell and stimulate both long-lasting and exclusive cytotoxic immune response, NetCTL 1.2 server43 was employed. This server identifies the epitope candidates by using artificial neural network and calculates a combined score for a peptide sequence based on their MHC-I binding affinity, proteasomal C-terminal cleavage and TAP transport efficiency all together43. Herein, a total of 12 peptide sequences were predicted as CTL epitopes whose prediction scores were greater than 0.75 (Table 7). Of these, epitopes 438STVAYEDLY446, 299QIGSIPFTY308 and 619ILVPGAWKY627 have also been predicted to be antigenic and were conserved among 46 serovars of pathogenic species, which suggest that they could be promising vaccine candidates. In addition, they were predicted to have positive immunogenicity, wherein the positive score of immunogenicity signifies the high potentiality to stimulate strong CTL response. The peptide sequence 526SSSDLNLGI534 was predicted as antigenic for human, so cannot be considered for vaccine design. The details of predicted CTL epitopes with their IEDB immunogenicity score, conservancy value and allergenicity are shown in Table 7.

Table 7 Predicted CTL epitopes.

Molecular Docking of CTL-epitopes with HLA-A*0201

Molecular docking was performed to determine binding affinities between all the predicted CTL epitopes and HLA-A*0201 (as discussed in the methods section). Out of 12 predicted CTL epitopes, 11 CTL epitopes excluding the allergenic one i.e. SSSDLNLGI were docked to MHC class I HLA-A*0201. The analysis revealed that out of 11, only four predicted CTL epitopes (STVAYEDLY, ILVPGAWKY, QIGSIPFTY and KTALGSYPY) showed strong binding affinities in terms of global energy and attractive van der Waals energy (vdW) ranging from – 61.00 to − 48.99 kcal/mol and − 28.32 to − 23.35 kcal/mol respectively (Table 8 and Supplementary Table S4). Of these, three epitopes (STVAYEDLY, ILVPGAWKY and QIGSIPFTY) were found to contain antigenic amino acid residues, positive IEDB immunogenicity score and high degree of conservancy (Table 7). The presence of these properties can lead an epitope to be a promising peptide vaccine candidate. Moreover, seven epitopes (NSDSSSNAT, GTSYKDWYK, VSDNEGHIL, YSSSFILII, VTDLTTKTV, YLDSNNFPW and WVASNGTSY) have shown poor binding affinities in terms of global energy ranging from – 39.30 to – 15.27 kcal/mol as tabulated in the Supplementary Table S4. We found that the docking energies of aforementioned epitopes (lowest global energy –39.30 kcal/mol) are nowhere close to those of top three epitopes (STVAYEDLY, ILVPGAWKY and QIGSIPFTY; highest global energy –48.99 kcal/mol).

Table 8 CTL epitopes and HLA-A*0201 interactions.

Furthermore, the post-docking analysis result revealed the presence of five hydrogen bonds in STVAYEDLY-HLA-A*0201 complex within a distance of 3.68 Å; thus, pointing out the stability of the docked complex (the details are available in Table 8). Likewise, three hydrogen bonds were detected between ILVPGAWKY-HLA-A*0201, QIGSIPFTY-HLA-A*0201 and KTALGSYPY-HLA-A*0201 complex within a distance of 3.96, 2.869 and 3.85 Å respectively (Table 8 and Supplementary Table S4). Moreover, the reliability of the docked complex seems to be well-preserved by the formation of hydrogen bonds. Unfortunately, peptide STVAYEDLY, QIGSIPFTY and KTALGSYPY were not found to be binding in the binding groove of HLA-A*0201; while, ILVPGAWKY binds within the groove of HLA-A*0201. Overall analysis of the result showed that ILVPGAWKY epitope can be considered as a potential vaccine candidate for the epitope-driven vaccine design, as predicted to have lowest global energy and also binding within the groove of HLA-A*0201, which point out the stability of docked complex. In addition, this peptide was found to be conserved among 44 serovars of pathogenic Leptospira. Molecular interactions of the top three docked complexes with CTL-epitopes and HLA-A*0201 are shown in Fig. 4. The detailed analysis of other docked complexes is shown in supplementary Table S4. Further, the molecular interaction analysis of these predicted CTL epitopes (other eight CTL epitoes) docked to HLA-A*0201 protein are shown in supplementary Fig. S5. Also, the binding mode of each epitope to HLA-A*0201 proteins for all 11 complexes are shown in supplementary Fig. S6. Thus including KTALGSYPY, all epitopes having weaker binding were treated as negative controls and used for the stability analysis along with the top three epitopes meeting additional criteria.

Figure 4
Figure 4

Molecular interaction analysis of predicted CTL epitopes docked to MHC-I molecule. (A) STVAYEDLY-HLA-A*0201 docking complex interacting with five hydrogen bonds; (B) ILVPGAWKY-HLA-A*0201 complex interacting with three hydrogen bond; (C) QIGSIPFTY-HLA-A*0201 complex making three hydrogen bonds. The residues forming H-bonds are labelled in magenta and green colour for HLA protein and epitope respectively. Each residue participating in H-bonding are coloured with Atom type (Red: Oxygen, Blue: Nitrogen).

Molecular dynamics simulations

The stability of epitope-MHC I docked complexes was further studied by molecular dynamics (MD) simulation using GROMACS v2016.3. During MD simulation, all peptides moved from their original docking site and made new favourable interactions (top three epitopes are shown in Table 8 and other eight epitopes in Supplementary Table S4). While peptide STVAYEDLY retained its original docking conformation, others moved to adopt a slightly different, yet docked, conformations. Whereas YLDSNNFPW did not retain its original confirmation (Supplementary Fig. S7). The Root Mean Square Deviations (RMSDs) of top three docked complexes (ILVPGAWKY-HLA-A*0201, QIGSIPFTY-HLA-A*0201 and STVAYEDLY-HLA-A*0201) after simulation with respect to the complexes before simulation were 1.06, 1.36 and 0.94 Å respectively.

Discussion

The global incidence of leptospirosis is increasing year by year, from an initial estimate of approximately 500,000 cases in 199944, to over a million of severe cases in humans, resulting in ~60,000 fatalities in 20153. To overcome this disease burden, there is an urgent need of improved preventive measures against the disease. Vaccination is one of the most effective means to efficiently, rapidly and affordably improve the public health and the most feasible way to eradicate this infectious disease. The search for effective vaccines to prevent leptospirosis has been on-going for many decades15. Despite this, the development of broadly effective vaccines against leptospirosis remains desirable and yet challenging task due to the wide array of antigenic diversity among pathogenic species12. The currently available vaccines against leptospirosis consist of whole-cell inactivated and formalin-killed leptospires (bacterin). However, these vaccines often show severe side-effects and are unable to stimulate cross-protection against different serovars and hence, their efficacy is limited. Therefore, current vaccine research is mostly focused on peptide and subunit vaccines as compared to whole organism vaccines because subunit vaccines contain specific immunogenic components of the pathogens responsible for the infection rather than the whole pathogen, which may result in severe side-effects. In Leptospira, vaccine targets include OMPs, lipoproteins and transmembrane proteins. Indeed, the most promising vaccine candidate so far described is the surface protein Lig, while OM LipL32 is the most studied leptospiral protein15. However, the efficacy of these vaccine candidates was limited and failed to induce cross-protective immunity. Therefore, the identification of other, more conserved, immunogenic OM proteins would be highly desirable for the development of cross-protective vaccine against leptospirosis. It is well-known that in Leptospira, OMPs exhibit high level of conservancy and are associated with pathogenesis; therefore, likely to be the most promising and successful candidates for peptide vaccines.

This study aims to screen and scrutinize the most antigenic OMP of the LIC, one of the most studied pathogenic Leptospira strains14, and to predict the possible antigenic B-cell and T-cell epitopes for epitope-based or peptide vaccine development by using in silico proteome wide-screening strategy. Several researchers have used in silico approach for identifying and designing of vaccine candidates45,46,47,48,49,50 and some of them achieved promising clinical trial results (for example ref.30). Screening using in vitro assays further reduces the number of vaccine candidates and hence, the number of laboratory animals required for efficacy testing. With immunoinformatics approaches, it is now feasible to screen the entire antigenic repertoire of a pathogen that could progress the discovery of potential vaccine candidates and may eventually improve existing vaccines. In our study, 21 proteins were predicted as highly immunogenic (antigenicity score > 1). Of these, two proteins (Q75FL0 and Q72PD2) were found to be located on the outer membrane. The Q72PD2 protein was annotated as hypothetical protein; hence, was excluded for further analysis. BLASTP analysis revealed that Q75FL0 is 100% identical to LruC domain-containing protein of LIC with 97% of query coverage and hence, may be characterised as LruC protein. The protein LruC was formerly described as leptospiral recurrent uveitis-associated protein C51,52. Experimentally, LruC protein was proven to be an OM lipoprotein and may have a role in pathogenesis of leptospiral Uveitis52. In addition, the LruC lipoprotein was found to be conserved among pathogenic Leptospira species. Thus, lipoprotein Q75FL0 could be the most promising new vaccine candidate for leptospirosis because of common and important features, including OM localization, conservation, and eliciting antibody production in patients52,53. Vaccination, or immunization works by stimulating antigen specific B-cells or CTLs and HTLs immune response. Consequently, B-cells, HTLs and CTLs epitope were predicted in the Q75FL0 lipoprotein. An effective peptide-based vaccine should contain both B-cell and T-cell epitopes to be able to elicit humoral and cellular immunity respectively. Several researchers have identified combined B-cell and T-cell epitopes of Leptospiral OMPs for diagnosis and vaccine purpose (see for example refs54,55). In our study, a number of peptides were predicted as comprising both B-cell and T-cell epitopes including 5YSSSFILIIKKG16, 301GSIPFTYNTVQTIPLNLVVTD321, 434AEGVSTVAYEDLYPSA449, 522TKTVSSSD529, 576LGSYPYDIFIKVI588 and 617WAILVPGA624, thus could induce humoral as well as cell mediated immunity and hence, can be considered for the development of peptide vaccines against leptospirosis. Furthermore, surface accessibility, surface flexibility as well as hydrophilicity for the B-cell epitopes have also been predicted in the current study. B-cell based vaccines provide antibody-mediated immunity which can be easily overwhelmed by surge of antigens. However, HTL plays a crucial role in inducing vital humoral or CTL responses and confer long-term immunity; hence, critical requirements for effective vaccine design. The response to T-cell epitopes is restricted by HLA proteins. HLAs are highly polymorphic i.e. the frequency of expression of different HLA types varies in different ethnic human populations. Therefore, to elicit broad immune responses in different human populations, the HLA specificity of T-cell epitopes must be considered as major criteria for screening of the epitopes56. Consequently, the epitope candidate should bind to the maximum number of HLA alleles to get more population coverage. Hence, in this study 10 HTL epitopes that bind to the maximum number of HLA alleles were selected as putative HTL epitope candidates. The main adaptive immunity against bacteria was thought to be primarily humoral i.e. mediated by B-cell or CD4+ cell8. However, humoral immunity is not far enough to completely clean the infection, cell-mediated immunity is needed to induce cell death and completely destroy the bacterial habitat. Although pathogenic Leptospira is not considered as a typical intracellular pathogen, indeed some bacterial proteins may be able to escape from the phagolysosome and reach to the cytosol of host cells and are exposed to the host CD8+ T-cells response, as reviewed in ref.57. Though, recent studies have reported that cell-mediated immunity is involved in the protective immune response stimulated by the Leptospira pathogen or vaccines58,59. In the current study, one of the CD8+ restricted CTL epitope, 619ILVPGAWKY627 having high degree of conservancy among 46 serovars of pathogenic Leptospira, has also been predicted as antigenic site of linear B-cell and showing significant binding interaction with HLA-A*0201 protein; thus, considerably enhancing the possibility of this peptide to be a vaccine candidate. As per our knowledge, this immunoinformatics study represents novel vaccine candidates that will further aid in the development of improved vaccines for leptospirosis.

Conclusion

Leptospirosis has emerged as a major concern globally and reasons for a large number of deaths in tropical regions of the world. Despite of that, the present therapeutic strategy available is very sporadic and unable to handle this alarming disease. The immunoinformatics based screening of vaccine target is a promising strategy to accelerate the vaccine development process and could conceivably be used as a cost-effective medical intervention for emerging infectious diseases. Our study starts with the identification of highly immunogenic and conserved outer membrane protein followed by the identification of B-cell, HTL and CTL epitopes. The vaccine candidates identified in the current study are highly conserved among 46 serovars of pathogenic Leptospira, and have not yet been assessed as vaccine candidates; and hence, could be worthy of further investigation as novel vaccine candidates. Furthermore, experimental studies will be required for immunogenicity testing, in vitro and in animal models to validate their efficacy as vaccine candidates against leptospirosis.

Methods

Protein sequence retrieval

Whole proteome of the LIC, encoding 3654 proteins was retrieved from the Universal Protein (UniProt) database (Proteome ID: UP000007037) (http://www.uniprot.org/proteomes/) in FASTA format and used for further analysis. UniProt is a comprehensive resource for protein sequences and annotation information which provide functional information about proteins with accuracy and consistency.


Prediction of highest antigenic protein

Antigenicity refers to the ability of an antigen to induce the immune response. Hence, to find the highest antigenic protein, all protein sequences were submitted to VaxiJen v2.0 server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with default parameters, which was developed for the prediction of potent antigen and subunit vaccines with accuracy of 70 to 89%. All the antigenic proteins with highest antigenicity score (>1.0) were selected for further evaluation.


Prediction of subcellular localization

It is important to scrutinize the subcellular localization of a protein, as immunogenic protein have to be easily recognized by the immune cells in order to stimulate immune response, one of the primary criteria for designing a vaccine candidate. Outer membrane proteins are surface-exposed which is easily recognised by the host immune system and possibly associated with pathogenesis60. Therefore, protein sequences with antigenic score >1.0 were subjected to CELLO v.2.5 server61,62 (http://cello.life.nctu.edu.tw/) to retrieve outer membrane protein.


Homology modelling and structure analysis

Antigenicity or the function of a protein correlates with the structural features of the protein; hence, to analyse the target protein sequence, ProtParam server (http://web.expasy.org/protparam/) and SOPMA server (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page = /NPSA/npsa_sopma.html) were used with default parameters. ProtParam tool allows the computation of various parameters that decide the stability and functional characteristics of the protein to some extent and SOPMA computes the secondary structural features of the protein. The three-dimensional (3D) structure of outer membrane lipoprotein were predicted using the I-TASSER server33,63. I-TASSER generated five alternative 3D models of protein and assigned confidence score (C-score) for each model that infers the quality of the structure. The modelled protein with the highest C-score was refined and subjected for its quality assessment. The 3D model was refined by using GalaxyRefine server (http://galaxy.seoklab.org/). This server refines the modelled structure by reconstructing side-chain conformations followed by repacking and dynamics simulations to repeatedly relax the structure. GalaxyRefine has been evaluated as one of the best method to improve the local quality of the structure. This server can improve local and global quality of the models generated by structure prediction servers such as I-TASSER. Furthermore, to evaluate the refined model, quality assessment of the model was done by using three different servers viz. PROCHECK64, ProSA-Web65,66, and ModFold667. PROCHECK was used to analyse the stereochemical quality of the model by evaluating the Ramachandran plot of the protein structure; whereas, ProSA-Web and ModFold evaluated the overall quality of the model. ProSA-Web calculated the overall quality score of the model by analysing their atomic coordinates which is frequently employed in protein tertiary structure validation. ModFold calculates the p-value and assign a degree of confidence (poor, low, medium, high and cert) of the model depending on the p-value. The 3D structure of protein was visualized using PyMol68.


Linear and Conformational B-cell epitope prediction

B-cell epitope is the main antigenic region of an antigen which are recognized by the B-cell receptors of the immune system and are able to induce humoral immune response, which cause the B-lymphocytes to differentiate into antibody-secreting plasma and memory cells69. B-cell epitopes can be categorized as a linear (continuous) and conformational (discontinuous) based on their spatial structure. The Kolaskar & Tongaonkar method at Immune Epitope Database (IEDB) analysis resource (http://tools.iedb.org/main/bcell/) was applied to predict linear B-cell epitopes. The accuracy of this method to predict epitope is about 75%35. Flexibility, surface accessibility and hydrophilic properties are also important characteristics of B-cell epitopes70; hence, to predict these properties, Emini surface accessibility36, Karplus and Schulz Flexibility37 and Parker hydrophilicity38 prediction methods were employed respectively with default parameters of IEDB analysis resource.

ElliPro (http://tools.immuneepitope.org/toolsElliPro/) from IEDB analysis resource was used for prediction of the conformational B-cell epitopes with minimum score value set at 0.70, while the maximum distance was set as default. This method predicts epitopes based upon solvent-accessibility and flexibility39. Three different algorithms are implemented in this resource including approximation of the protein shape71, protrusion index (PI) of residues72 and neighbouring residues clustering based on their PI values.


Helper T-cell (HTL) epitope prediction

Activation of HTL is prerequisite for inducing an efficient antibody response or Cytotoxic T-lymphocyte (CTL) response through both cytokine secretion and dendritic cell sensitization73,74,75. The binding of a T cell receptor to an epitope complexed with major histocompatibility complex (MHC) class II molecule can result in activation of T-cell. Hence, in order to predict MHC class II restricted HTL epitopes, the protein sequences were submitted to NetMHCIIpan 3.1 server (http://www.cbs.dtu.dk/services/NetMHCIIpan/) with threshold value set as 0.5% and 2% for strong binding peptides (SB) and weak binding peptides (WB), respectively to determine the binding affinities of epitopes and MHC-II allele. NetMHCIIpan is one of the most accurate prediction server that covers all human leucocyte antigen (HLA) class II molecules based on artificial neural network algorithm. Here, the strong binder epitopes with the maximum number of binding HLA-DR alleles were selected as putative epitope candidates.


Prediction of Cytotoxic T-lymphocyte (CTL) epitopes

Consistent predictions of CTL epitopes are very important for the coherent vaccine design. Hence, the presence of CTL epitopes in the amino acid sequence of selected protein was predicted using NetCTL.1.2 server (http://www.cbs.dtu.dk/services/NetCTL), with default parameters. This server predicts epitopes by integrating predictions of MHC class I binding, proteasomal C-terminal cleavage and the TAP transport efficiency. The MHC class I binding and proteasomal C-terminal cleavage were predicted by the artificial neural network while a weight matrix was used to predict the TAP transport efficiency.

Moreover, except for a strong binding affinity, the peptides with strong immunogenicity are more probable CTL epitopes than those with weak immunogenicity. Therefore, the immunogenicity of candidate epitopes was evaluated using IEDB immunogenicity prediction tool (http://tools.immuneepitope.org/immunogenicity/) with default parameters.


Allergenicity assesment

The allergenicity of the predicted epitopes was analysed using AllerHunter server (http://tiger.dbs.nus.edu.sg/AllerHunter), which is based on support vector machine (SVM) and pair-wise sequence similarity. AllerHunter predicts allergen in addition to non-allergen with high sensitivity and specificity, and efficiently distinguish allergens and non-allergens from allergen-like non-allergen sequences, which make AllerHunter a very constructive tool for allergen predictions.


Conservancy analysis

In order to evaluate homologs of the selected proteins within different serovars of pathogenic Leptospira species, BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins) was performed against proteome of 47 serovars of pathogenic species. Protein sequences with >70% of identity and 40% query coverage were considered as homologs. Of these 47 serovars, the query protein was found to have their homologs among 46 serovars. Furthermore, conservancy of predicted epitope was evaluated among screened homologs (46 serovars) by using epitope conservancy analysis tool at the IEDB analysis resource (http://tools.immuneepitope.org/tools/conservancy). This tool calculates the degree of conservancy of an epitope within a provided protein sequence, set at different degree of identities. The degree of conservancy is defined as the portion of protein sequences that contain the epitope at a specified identity level.


3D structure of CTL-epitopes

The 3D structures of all the predicted CTL epitopes excluding the allergenic one i.e. SSSDLNLGI, were modelled with the PEP-FOLD3 server76, using 200 simulation runs. First the PEP-FOLD3 server clustered different conformational models and then sorted them using the sOPEP energy value. Consequently, the best ranked model was selected to analyse the interactions with selected Class I MHC molecule.


Molecular Docking studies

A docking study was performed to ensure the interaction between HLA class I molecules and our predicted CTL epitopes using the PatchDock rigid-body docking server77,78. Since HLA-A*0201 is one of the most frequent MHC class I alleles in most of the human populations;79,80,81 the best ranked CTL peptide models were docked with HLA-A*0201 (PDB ID: 4U6Y). PatchDock rigid-body server computes complexes with good molecular shape complementarity based on geometry of the molecules. Furthermore, the docking results were refined using FireDock (Fast Interaction Refinement in Molecular Docking) server82,83. It produces 10 best solutions for final refinement. The refined models were based on the binding score. This score includes Atomic contact energy, Van Der Waals interaction, partial electrostatics and estimations of the binding energy. Furthermore, the hydrogen bonding interaction of the docked structures were analysed with the molecular visualization tool UCSF Chimera 1.11.284 and PyMOL68.


Molecular dynamics simulations

Molecular dynamics simulations were performed to check the stability of epitopes-HLA-A*0201 allele docked complex using the GROMACS v2016.3 software85. For each of the docked complexes, a production simulation of 5 ns at 300 K temperature and 1 bar pressure was obtained after carrying out stepwise energy minimization and equilibration protocol of the solvated systems with TIP3P water model. Further, trajectory analysis was performed to investigate H-bonding and Root Mean Square Deviation (RMSD).

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. 1.

    Vijayachari, P., Sugunan, A. P. & Shriram, A. N. Leptospirosis: an emerging global public health problem. J. Biosci. 33, 557–569 (2008).

  2. 2.

    Adler, B. Leptospira and Leptospirosis. Curr Topics Microbiol. 387, 1–293 (2015).

  3. 3.

    Costa, F. et al. Global morbidity and mortality of leptospirosis: a systematic review. PLoS Negl. Trop. Dis. 9, e0003898 (2015).

  4. 4.

    Reis, R. B. et al. Impact of environment and social gradient on Leptospira infection in urban slums. PLoS Negl Trop Dis. 2 (2008).

  5. 5.

    Levett, P. N. Leptospirosis. Clin Microbiol Rev. 14, 296–326 (2001).

  6. 6.

    Haake, D. A. & Levett, P. N. Leptospirosis in Humans. Curr Top Microbiol Immunol. 387, 65–97 (2015).

  7. 7.

    Evangelista, K. V. & Coburn, J. Leptospira as an emerging pathogen: a review of its biology, pathogenesis and host immune responses. Future Microbiol. 5, 1413–1425 (2010).

  8. 8.

    Faine, S. B., Adher, B., Bolin, C. & Perolat, P. in: Leptospira and Leptospirosis, second ed. MedSci, Medbourne, (Australia, 1999).

  9. 9.

    Bharti, A. R. et al. Peru-United States Leptospirosis Consortium, Leptospirosis: a zoonotic disease of global importance. Lancet Infect. Dis. 3, 757–771 (2003).

  10. 10.

    Ellis, W. A. Animal leptospirosis. Curr Top Microbiol Immunol 387, 99–137 (2015).

  11. 11.

    Adler, B. Vaccines against leptospirosis. Curr Top Microbiol Immunol. 387, 251–72 (2015).

  12. 12.

    Levett, P. N. Systematics of leptospiraceae. Curr Top Microbiol Immunol. 387, 11–20 (2015).

  13. 13.

    Bourhy, P., Collet, L., Brisse, S. & Picardeau, M. Leptospira mayottensis sp. nov., a pathogenic species of the genus Leptospira isolated from humans. Int J Syst Evol Microbiol. 64, 4061–4067 (2014).

  14. 14.

    Dellagostin, O. A. et al. Recombinant vaccines against leptospirosis. Hum Vaccin. 7, 1215–1224 (2011).

  15. 15.

    Wang, Z., Jin, L. & Węgrzyn, A. Leptospirosis vaccines. Microbial Cell Factories. 6, 39 (2007).

  16. 16.

    Conrad, N. L. et al. LigB subunit vaccine confers sterile immunity against challenge in the hamster model of leptospirosis. PLoS Negl Trop Dis. 11, e0005441 (2017).

  17. 17.

    Seixas, F. K. et al. Evaluation of different ways of presenting LipL32 to the immune system with the aim of developing a recombinant vaccine against leptospirosis. Can J Microbiol. 53, 472–479 (2007).

  18. 18.

    Grassmann, A. A., Souza, J. D. & McBride, A. J. A universal vaccine against leptospirosis: are we going in the right direction? Front Immunol. 8, 256 (2017).

  19. 19.

    De Gregorio, E. & Rappuoli, R. Vaccines for the future: learning from human immunology. Microb. Biotechnol. 5, 149–155 (2012).

  20. 20.

    Patronov, A. & Doytchinova, I. T-cell epitope vaccine design by immunoinformatics. Open Biol. 3, 120139 (2013).

  21. 21.

    Yang, X. & Yu, X. An introduction to epitope prediction methods and software. Rev. Med. Virol. 19, 77–96 (2009).

  22. 22.

    Staneková, Z. & Varečková, E. Conserved epitopes of influenza A virus inducing protective immunity and their prospects for universal vaccine development. Virol J. 7, 351 (2010).

  23. 23.

    Sominskaya, I. et al. Construction and immunological evaluation of multivalent hepatitis B virus (HBV) core virus-like particles carrying HBV and HCV epitopes. Clin Vaccine Immunol. 17, 1027–1033 (2010).

  24. 24.

    He, L. et al. Approaching rational epitope vaccine design for hepatitis C virus with meta-server and multivalent scaffolding. Sci Rep. 5, 12501 (2015).

  25. 25.

    Dellagostin, O. A. et al. Reverse Vaccinology: An Approach for Identifying Leptospiral Vaccine Candidates. Int. J. Mol. Sci. 18, 158 (2017).

  26. 26.

    Lazarski, C. A. et al. The kinetic stability of MHC class II: peptide complexes is a key parameter that dictates immunodominance. Immunity 23, 29–40 (2005).

  27. 27.

    Rappuoli, R. Reverse vaccinology. Curr. Opin. Microbiol. 3, 445–450 (2000).

  28. 28.

    Soria-Guerra, R. E., Nieto-Gomez, R., Govea-Alonso, D. O. & Rosales-Mendoza, S. An overview of bioinformatics tools for epitope prediction: Implications on vaccine development. J Biomed Inform. 53, 405–414 (2015).

  29. 29.

    David, M. N. & Flower, D. R. Harnessing bioinformatics to discover new vaccines. Drug Discov. Today 12, 389–395 (2007).

  30. 30.

    De Groot, A. S. & Rappuoli, R. Genome-derived vaccines. Expert Rev. Vaccines 3, 59–76 (2004).

  31. 31.

    Lin, J., Huang, S. & Zhang, Q. Outer membrane proteins: key players for bacterial adaptation in host niches. Microbes and Infection. 4, 325–331 (2002).

  32. 32.

    Rodrı´guez-Ortega, M. J. et al. Characterization and identification of vaccine candidate proteins through analysis of the group A Streptococcus surface proteome. Nature Biotechnology. 24, 2 (2006).

  33. 33.

    Zhang, Y. I-TASSER server for protein 3D structure prediction. BMC Bioinformatics. 9, 40 (2008).

  34. 34.

    Heo, L., Park, H. & Seok, C. GalaxyRefine: Protein structure refinement driven by side-chain repacking. Nucleic Acids Res. 41, W384–W388 (2014).

  35. 35.

    Kolaskar, A. & Tongaonkar, P. C. A semi-empirical method for prediction of antigenic determinants on protein antigens. FEBS Lett. 276, 172–174 (1990).

  36. 36.

    Emini, E. A., Hughes, J. V., Perlow, D. S. & Boger, J. Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide. J. Virol. 55, 836–839 (1985).

  37. 37.

    Karplus, P. & Schulz, G. Prediction of chain flexibility in proteins. Naturwissenschaften 72, 212–213 (1985).

  38. 38.

    Parker, J., Guo, D. & Hodges, R. New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. Biochemistry 25, 5425–5432 (1986).

  39. 39.

    Ponomarenko, J. et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics. 9, 1 (2008).

  40. 40.

    Karosiene, E. et al. NetMHCIIpan-3.0, a common panspecific MHC class II prediction method including all three human MHC class II isotypes, HLA-DR, HLA-DP and HLA-DQ. Immunogenetics. 65, 711–24 (2013).

  41. 41.

    Lazarski, C. A. et al. The kinetic stability of MHC class II peptide complexes is a key parameter that dictates immunodominance. Immunity 23, 29–40 (2005).

  42. 42.

    Weber, C. A. et al. T cell epitope: Friend or Foe? Immunogenicity of biologics in context. Adv. Drug Deliv. Rev. 61, 965–976 (2009).

  43. 43.

    Larsen, M. V. et al. Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction. BMC Bioinformatics 8, 1 (2007).

  44. 44.

    WHO. Leptospirosis worldwide, 1999. Wkly Epidemiol Rec. 74, 237–242 (1999).

  45. 45.

    Mirza, M. U. et al. Towards peptide vaccines against Zika virus: Immunoinformatics combined with molecular dynamics simulations to predict antigenic epitopes of Zika viral proteins. Sci Rep. 6, (2016).

  46. 46.

    Khatoon, N., Pandey, R. K. & Prajapati, V. K. Exploring Leishmania secretory proteins to design B and T cell multi-epitope subunit vaccine using immunoinformatics approach. Sci Rep. 7, (2017).

  47. 47.

    Hasan, M. A., Khan, M. A., Datta, A., Mazumder, M. H. & Hossain, M. U. A comprehensive immunoinformatics and target site study revealed the corner-stone toward Chikungunya virus treatment. Mol Immunol. 65, 189–204 (2015).

  48. 48.

    Shi, J. et al. Epitope-Based Vaccine Target Screening against Highly Pathogenic MERS-CoV: An in silico Approach Applied to Emerging Infectious Diseases. PLoS ONE. 10, e0144475 (2015).

  49. 49.

    Fonseca, C. T. et al. Identification of immunodominant epitopes of Schistosoma mansoni vaccine candidate antigens using human T cells. Mem Inst Oswaldo Cruz. 99, 63–66 (2004).

  50. 50.

    Iwai, L. K. et al. In silico prediction of peptides binding to multiple HLA-DR molecules accurately identifies immunodominant epitopes from gp43 of Paraccocidioides brazilienses frequently recognized in primary peripheral blood mononuclear cell responses from sensitized individuals. Mol Med. 9, 209–219 (2003).

  51. 51.

    Grassmann, A. A. et al. Discovery of novel leptospirosis Vaccine candidates Using reverse and structural Vaccinology. Frontiers in Immunology. 8, 463 (2017).

  52. 52.

    Verma, A. et al. Antibodies to a Novel Leptospiral Protein, LruC, in the Eye Fluids and Sera of Horses with Leptospira-Associated Uveitis. Clin Vaccine Immunol. 19, 452–456 (2012).

  53. 53.

    Koizumi, N. & Watanabe, H. Leptospirosis vaccines: past, present, and future. J Postgrad Med. 51, 210–214 (2005).

  54. 54.

    Lin, X., Sun, A., Ruan, P., Zhang, Z. & Yan, J. Characterization of Conserved Combined T and B Cell Epitopes in Leptospira interrogans Major Outer Membrane Proteins OmpL1 and LipL41. BMC Microbiology 11, 21 (2016).

  55. 55.

    Lin, X. et al. Identification of Immunodominant B- and T-Cell Combined Epitopes in Outer Membrane Lipoproteins LipL32 and LipL21 of Leptospira interrogans. Clinical and Vaccine Immunology 17, 778–783 (2010).

  56. 56.

    Maenaka, K. & Jones, E. Y. MHC superfamily structure and the immune system. Curr Opin Struct Biol. 9, 745–753 (1999).

  57. 57.

    Fraga, T. R., Barbosa, A. S. & Isaac, L. Leptospirosis: Aspects of Innate Immunity, Immunopathogenesis and Immune Evasion from the Complement System. Scandinavian Journal of Immunology 73, 408–419 (2011).

  58. 58.

    Brown, R. A. et al. Comparison of three different leptospiral vaccines for induction of a type 1 immune response to Leptospira borgpetersenii serovar Hardjo. Vaccine 21, 4448–4458 (2003).

  59. 59.

    Naiman, B. M. et al. Evaluation of type 1 immune response in na€ıve and vaccinated animals following challenge with Leptospira borgpetersenii serovar Hardjo: involvement of WC1(þ) gammadelta and CD4 T cells. Infect. Immun. 70, 6147–6157 (2002).

  60. 60.

    David, A. H. et al. Leptospiral Outer Membrane Proteins OmpL1 and LipL41 Exhibit Synergistic Immunoprotection. Infection and immunity. 67, 6572–6582 (1999).

  61. 61.

    Yu, C. S., Lin, C. J. & Hwang, J. K. Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions. Protein Science. 13, 1402–1406 (2004).

  62. 62.

    Yu, C. S., Lin, C. J. & Hwang, J. K. Prediction of protein subcellular localization. Proteins: Structure. Function and Bioinformatics. 64, 643–651 (2006).

  63. 63.

    Yang, J. & Zhang, Y. Protein structure and function prediction using I-TASSER. Curr Protoc Bioinformatics. 52, 1–15 (2015).

  64. 64.

    Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 26, 283–291 (1993).

  65. 65.

    Wiederstein, M. & Sippl, M. J. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 35, 407–410 (2007).

  66. 66.

    Sippl, M. J. Recognition of errors in three-dimensional structures of proteins. Proteins. 17, 355–362 (1993).

  67. 67.

    Maghrabi, A. H. A. & McGuffin, L. J. ModFOLD6: an accurate web server for the global and local quality estimation of 3D protein models. Nucleic Acids Res. 45, 416–421 (2017).

  68. 68.

    DeLano, W. L. The PyMOL Molecular Graphics System. DeLano Scientific LLC, San Carlos, CA, USA (2002).

  69. 69.

    Nair, D. T. et al. Epitope recognition by diverse antibodies suggests conformational convergence in an antibody response. J. Immunol. 168, 2371–2382 (2002).

  70. 70.

    Fieser, T. M., Tainer, J. A., Geysen, H. M., Houghten, R. A. & Lerner, R. A. Influence of protein flexibility and peptide conformation on reactivity of monoclonal anti-peptide antibodies with a protein alpha-helix. Proc. Natl. Acad. Sci. USA 84, 8568–8572 (1987).

  71. 71.

    Taylor, W., Thornton, Jt & Turnell, W. An ellipsoidal approximation of protein shape. J. Mol. Graph. 1, 30–38 (1983).

  72. 72.

    Thornton, J. M., Edwards, M. S., Taylor, W. R. & Barlow, D. J. Location of ‘continuous’ antigenic determinants in the protruding regions of proteins. EMBO J. 5, 409–413 (1986).

  73. 73.

    Chen, K. & Kolls, J. K. T Cell–Mediated Host Immune Defenses in the Lung. Annu Rev Immunol. 31, 605–633 (2013).

  74. 74.

    Shedlock, D. J. & Shen, H. Requirement for CD4 T-cell help in generating functional CD8 T-cell memory. Science. 300, 337–339 (2003).

  75. 75.

    Rosa, D. S. et al. A DNA vaccine encoding multiple HIV CD4 epitopes elicits vigorous polyfunctional, long-lived CD4+ and CD8+ T cell responses. PLoS One. 6 (2011).

  76. 76.

    Lamiable, A. et al. PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex. Nucleic Acids Res. 44, W449–W454 (2016).

  77. 77.

    Duhovny, D., Nussinov, R. & Wolfson, H. J. Efficient unbound docking of rigid molecules. In Gusfield et al. Ed. Proceedings of the 2’nd Workshop on Algorithms in Bioinformatics(WABI) Rome, Italy, Lecture Notes in Computer Science 2452, 185–200, Springer Verlag (2002).

  78. 78.

    Schneidman-Duhovny, D., Inbar, Y., Nussinov, R. & Wolfson, H. J. PatchDock and SymmDock: servers for rigid and symmetric docking. Nucleic Acids Res. 33, W363–367 (2005).

  79. 79.

    Hildesheim, A. et al. Association of HLA class I and II alleles and extended haplotypes with nasopharyngeal carcinoma in Taiwan. J Natl Cancer Inst. 94, 41780–41789 (2002).

  80. 80.

    Pelte, C. et al. Random screening of proteins for HLA-A*0201-binding nine-amino acid peptides is not sufficient for identifying CD8 T cell epitopes recognized in the context of HLA-A*0201. J Immunol. 172, 6783–6789 (2004).

  81. 81.

    Rivoltini, L. et al. Binding and presentation of peptides derived from melanoma antigens MART-1 and glycoprotein-100 by HLA-A2 subtypes. Implications for peptide-based immunotherapy. J Immunol. 156, 3882–3891 (1996).

  82. 82.

    Andrusier, N., Nussinov, R. & Wolfson, H. J. FireDock: fast interaction refinement in molecular docking. Proteins 69, 139–159 (2007).

  83. 83.

    Mashiach, E., Schneidman-Duhovny, D., Andrusier, N., Nussinov, R. & Wolfson, H. J. FireDock: a web server for fast interaction refinement in molecular docking. Nucleic Acids Res. 36, W229–W232 (2008).

  84. 84.

    Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comp. Chem. 25, 1605–1612 (2004).

  85. 85.

    Berendsen, H. J., van der Spoel, D. & van Drunen, R. GROMACS: a message-passing parallel molecular dynamics implementation. Comput. Phys. Commun. 91, 43–56 (1995).

Download references

Acknowledgements

This work was supported by Gujarat State Biotechnology Mission (GSBTM), Department of Science & Technology (DST), Government of Gujarat, India (www.btm.gujarat.gov.in), Grant number - GSBTM/GIBS/2016/27. KSL, SK and VV would like to thank GSBTM for providing the fellowships and research grant. We would like to thank Mr. Sugandh Kumar (Institute of Life Sciences, Bhubaneswar) for his help in submission of protein modelling job at I-TASSER server. Also, we would like to thank MD. Afzal Ansari for his help in improving the quality of result images.

Author information

Author notes

    • Kumari Snehkant Lata
    • , Swapnil Kumar
    • , Vibhisha Vaghasia
    • , Priyanka Sharma
    • , Shivarudrappa B. Bhairappanvar
    • , Subhash Soni
    •  & Jayashankar Das

    Present address: Gujarat Biotechnology Research Centre (GBRC), Department of Science & Technology, Government of Gujarat, Gandhinagar, 382011, India

Affiliations

  1. Gujarat Institute of Bioinformatics, Gujarat State Biotechnology Mission, Department of Science & Technology, Government of Gujarat, Gandhinagar, 382011, India

    • Kumari Snehkant Lata
    • , Swapnil Kumar
    • , Vibhisha Vaghasia
    • , Priyanka Sharma
    •  & Shivarudrappa B. Bhairappanvar
  2. Gujarat State Biotechnology Mission, Department of Science & Technology, Government of Gujarat, Gandhinagar, 382011, India

    • Subhash Soni
    •  & Jayashankar Das

Authors

  1. Search for Kumari Snehkant Lata in:

  2. Search for Swapnil Kumar in:

  3. Search for Vibhisha Vaghasia in:

  4. Search for Priyanka Sharma in:

  5. Search for Shivarudrappa B. Bhairappanvar in:

  6. Search for Subhash Soni in:

  7. Search for Jayashankar Das in:

Contributions

K.S.L., J.D., P.S. and S.B. conceived and designed the experiments. K.S.L., S.K. and V.V. performed the experiments. K.S.L. and S.K. analysed the data. S.S. provided laboratory assistance or infrastructure support. K.S.L., S.K., P.S. and J.D. wrote the manuscript. J.D., S.B., P.S. and S.S. provided valuable insight and advice. All authors read and approved the manuscript.

Competing Interests

The authors declare no competing interests.

Corresponding author

Correspondence to Jayashankar Das.

Electronic supplementary material

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/s41598-018-25281-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.