Introduction

Immediately upon ejaculation, semenogelins (secreted by seminal vesicle) form a coagulum after coming in contact with zinc ions. Semenogelin fibers create a dense network to restrict the motility of spermatozoa; however, it has been suggested that it may involve motility restriction methods beyond simple physical hindrance. Complete arrest of the mobility of sperm flagellum suggests that the semenogelins inhibit spermatozoa motility by associating with a cell surface component localized on the flagellum of each spermatozoon1,2,3. Semen liquefaction occurs within 5–20 minutes of ejaculation. The semenogelins initiate their own degradation by chelating zinc ions as the latter activates a network of kallikrein related peptidases4, resulting in the dissolution of semen coagulum and activation of sperm progressive motility5,6,7. Prostate specific antigen (PSA) or kallikrein related peptidase 3 (KLK3) is one of the most abundant proteins in the secretion of normal human prostate epithelium and seminal plasma8. PSA is an androgen dependent 30KDa glycoprotein with chymotrypsin like enzymatic activity9 and plays a major role in the fragmentation of seminal vesicle secreted proteins (semenogelins). It has been suggested that in addition to facilitating coagulum liquefaction, PSA might activate a motility-activating peptide10.

Studies on semen liquefaction have shown that PSA degrades semenogelins preferentially at specific sites11, 12. The semenogelins are considered to be the precursor molecules, whose degradation yields a number of polypeptides that have different biological functions, such as increasing sperm hyaluronidase activity13, hyper-polarization of sperm plasma membrane14, 15, anti-bacterial activity16, and prevention of sperm capacitation, O2.− synthesis and hyperactivated motility2, 10, 15. Furthermore, they bind and/or interact with a number of proteins such as fibronectin2, 17, CD5218, protein C inhibitor19, heparin20, and participate in the formation of a macromolecular complex with clusterin, lactotransferrin and eppin21. Semenogelins and their degradation products are also thought to affect sperm fertility by increasing thyrotropin releasing hormone like action, promoting zinc shuttling and inhibin like activities22. Therefore, by facilitating semenogelins degradation, PSA serves functions that are important for sperm fertility, apart from releasing motile sperm from semen coagulum.

Optimal pace of semenogelins degradation is critical for fertility as they must fragment for sperm release and their presence is important for inhibiting premature capacitation. As mentioned above, PSA is the chief peptidase behind semenogelins degradation and generation of active peptides. Studies till date viewed PSA from semen liquefaction point of view, looking for correlation between PSA level and sperm motility23. Since PSA mediated semenogelin degradation serves functions beyond sperm release, its activity may affect fertility even if PSA level or semen liquefaction appears to be normal. PSA is encoded by a gene that spans 12850 bp region on chromosome 1924. Genetic variations in KLK3 gene could affect its activity and hence the degradation of semenogelins and the generation of active peptides, ultimately affecting fertility. In order to understand the contribution of KLK3 genetic variations to infertility risk, we re-sequenced its complete coding region in 875 infertile and 290 fertile men. We identified a total of twenty-eight substitutions, of which five appear to be strong risk factors for male infertility.

Materials and Methods

Sample collection

We recruited 875 infertile men and 290 fertile controls from the King George’s Medical University (KGMU), Lucknow and the Institute of Reproductive Medicine (IRM), Kolkata. The study was approved by the Institutional Human Ethics Committee of the Central Drug Research Institute (CDRI), Lucknow. All experiments were performed in accordance with the relevant guidelines and regulations of the Institutional Ethics Committee. A verbal explanation of the nature of study was given to participants while taking their informed written consent.

The inclusion criteria for infertile patients was based on infertility persisting longer than one year and absence of any obvious fertility problem in the partner (menstruation and ovulation). A detailed clinical workout on the female partner was taken as the absence of any abnormality in her and to narrow down the problem to the male partner. Sperm count and motility in the case group were between 0 and 200 (average = 54.4) and between 0 and 85 (average = 7.4%), respectively. Male individuals exhibiting obstruction to sperm release, varicocele, endocrine imbalance, infection of accessory glands and human immunodeficiency virus positivity were excluded from the study. Semen analysis was performed after an abstinence of 3–7 days. The patients pool consisted of individuals with oligozoospermia (N = 68), azoospermia (N = 279), asthenozoospermia (N = 246) or normozoospermia (N = 149), uncategorized (N = 133), but experiencing infertility after at least one year of unprotected intercouse constituted the infertile group. The patients were identified from their visits to the clinic on their own or by referal. Most of the patients had been trying for parenthood for the last more than three years. The controls were recruited following the criteria of confirmed paternity. Semen samples for all control samples were not obtained, but confirmed paternity in the last two years was taken as a proof of their fertility. Sperm count and motility in the control group were between 35 and 180 (average = 89.1), and between 34 and 85 (average = 68.7%), respectively. The study subjects (cases and controls) were of Indo-European ethnicity with an average age of 34.13 ± 6.16 years. The average age was 33.11 for the case group and 35.15 for the control group.

Genomic DNA Isolation and DNA sequencing

Genomic DNA was isolated from the peripheral blood samples of subjects using phenol-chloroform isoamyl method as described previously25. Sequence of the KLK3 gene was retrieved from the Ensembl database (Gene ID: ENSG00000142515), and primers for the coding region were designed using the primer-blast tool available at NCBI. Primers were custom synthesised by Eurofins, Bangalore, India. PCR amplification was carried out as previously described26 with details provided in Table 1. Amplicons were treated with Exo-Sap (Exonuclease I and Shrimp Alkaline Phosphatase, ExoSAP-IT; USB Corporation, Cleveland, OH, USA) to remove unutilized primers and dNTPs as per the manufacturer’s protocol. Direct DNA sequencing using BigDyeTM chain termination chemistry was performed on ABI 3730 DNA analyzer (Applied Biosystems, USA)27. Multiple alignment and sequence analysis were done using Auto Assembler Software (Applied Biosystems, USA).

Table 1 Details of primers and PCR coditions used for the amplification of KLK3 exons.

Total protein and t- PSA level in seminal plasma

Semen samples were centrifuged first at low speed (5000 rpm) for 10 minutes at 4 °C and later at high speed (12000 rpm) for 10 minutes at 4 °C for obtaining the seminal plasma from infertile men. Total protein content in the seminal plasma was assessed by Bradford method. Seminal plasma was diluted 1000 times for estimation of t-PSA using an ELISA based kit from Weldon Biotech (Cat No: t-PSA 118WB). Absorbance was measured on µQuant (Bio-Tek Instruments Inc.) and analyzed using KC Junior software.

Statistical analysis

Chi square analysis was used to compare genotypes frequency between fertile and infertile men using the VassarStats Online Calculator (http://faculty.vassar.edu/lowry/VassarStats.html). Odds ratio (OR) was calculated using the dominant model of analysis for all the substitutions. Linkage disequilibrium between each pairwise combination of SNPs and haplotype frequencies were calculated using the Haploview software (Version 4.2) (http://www.broadinstitute.org)28. A P-value of less than 0.05 was considered to be statistically significant.

In silico analysis

Variant effect prediction analysis was done using the VEP tool available at the Ensembl database (www.ensembl.org). PolyPhen (Polymorphism Phenotyping) (http://genetics.bwh.harvard.edu/pph2/) and SIFT (Sorting Intolerant from Tolerant) (http://siftdna.org/www/Extended_SIFT_chr_coords_submit.html) scores were used for the prediction of functional impact of the non-synonymous substitutions. PolyPhen and SIFT scores predict the effect of an amino acid substitution on structure and function of a protein, using sequence homology, proximity of the substitution to predicted functional domains or structural features, and physicochemical similarity between alternate amino acids.

Data availability

All data generated or analysed during this study are included in this published article.

Results

Sequence analysis identified twenty-eight substitutions (Figure 1 and Table 2), of which eight were called mutations (present in <1% frequency) and rest were labelled as SNPs (present in >1% frequency). Variant effect prediction revealed six to be missense variants, five to be synonymous variants, thirteen to be intronic variants, one to be a splice region variant, one to be a 5′UTR variant, one to be 3′UTR variants, and one to be a downstream gene variant with reference to the transcript ENST00000326003. Upon further investigation, two of the intronic substitutions were found to be missense with reference to other transcripts (ENST00000597483 and ENST00000596185). Out of twenty-eight substitutions, eight were novel that had not been catalogued in the dbSNP or ESP databases. Nomenclature of the novel substitutions was done following the guidelines of the Human Genome Variation Society (HGVS) (http://www.hgvs.org/mutnomen/). The non-synonymous substitutions, c.529 C > G, c.548 T > A, c.554 C > T, rs61752561, rs2003783, rs17632542, rs266881, and rs73932617 resulted in p.His177Asp (ENST00000326003), p.Val183Glu (ENST00000326003), p.Ala185Val (ENST00000326003), p.Asp102Asn (ENST00000326003), p.Leu132Ile (ENST00000326003), p.Ile179Thr (ENST00000326003), p.Pro41Gln (ENST00000596185), and p.Glu174Lys (ENST00000597483) changes, respectively (Table 2). In silico analysis using Polyphen predicted none of these to be functionally ‘damaging’ and in silico analysis using SIFT predicted p.Glu174Lys, and p.Ile179Thr to be ‘deleterious’.

Figure 1
figure 1

Electropherograms of the substitutions identified.

Table 2 Minor allele frequency distribution of KLK3 gene substitutions in infertile and fertile men.

Comparison based on fertility status

Minor allele frequencies for each variation are detailed in Table 2. Eight substitutions (rs11573, rs1135766, rs73932617, c.206 + 56 G > A, c.529 C > G, c.554 C > T, c.631–175 C > A, TMP_ESP_19_51361381, c.631–116 T > C, rs61736309, and c.*648 G > A) were observed exclusively in infertile men and one (c.548 T > A) exclusively in fertile men. Genotype distributions for six SNPs (rs266881, rs174776, c.206 + 235 T > C, rs1810020, rs266875 and rs35192866) were significantly different between fertile and infertile men according to 2 by 3 contingency (Table 3, dominant model). Five of these substitutions (rs266881, rs174776, rs1810020, rs266875, rs35192866,) increased the risk of infertility, while one (c.206 + 235 T > C) was protective. The frequency of ‘CA + AA’, ‘TC + CC’, ‘AA + AG’, ‘GG + GA’ and ‘CC + CT’ genotypes for SNPs rs266881 (OR = 2.88, P = <0.0001), rs174776 (OR = 1.91, P =<0.0001), rs1810020 (OR = 2.08, P = 0.034), rs266875 (OR = 1.44, P = 0.016) and rs35192866 (OR = 4.48, P = 0.025), respectively, were significantly higher in the infertile group as compared to the fertile group. On the contrary, the frequency of ‘TC + CC’ genotype for c.206 + 235 T > C (OR = 0.44, P = 0.002) was higher in fertile controls. At least four of these correlations were confirmed by 2 × 3 contingency table analysis as well.

Table 3 Statistical comparison of the genotype distribution of identified SNPs between infertile and fertile.

Linkage disequilibrium and haplotype analysis

Out of 28 SNPs, eight SNPs (rs266881, rs2292186, rsc.206 + 275 T > A, rs12946, rs61752561, rs2003783, rs1810020, and rs17632542) qualified for the haploview analysis. Three SNPs (rsc.−4C > T, rs200846924, and rsc.554 C > T) had failed due to low maf (MAF <0.001). Five SNPs (rs2053087, rs174776, rsc.206 + 235 T > C, rsc.548 T > A, and rs266875) failed the HWE test (<0.001). Six SNPs (rs35192866, rs73932617, rs45588133, rsc.631–175 C > A, rs61736309, and rsc.*648 G > A) failed the % genotypes test (cut off value = 75%). Six SNPs (rs11573, rs1135766, rsTMP_ESP_19_51361381, rsc.529 C > G, rs2659122 and rs6998) failed both the HWE test and % genotype test. Therefore, LD analysis was performed on the reamining eight SNPs. We performed LD analysis using four gamete rule method, which created one block with six SNPs (rsc.206 + 275 T > A, rs12946, rs61752561, rs2003783, rs1810020, and rs17632542) showing strong LD (Fig. 2A). Haplotype analysis of this block revealed that TCGCGT haplotype had the highest frequency (f = 0.614) among other haplotypes (TCGCAT = 0.296; TTGAGT = 0.028; TTGCGT = 0.024; TCGCGC = 0.018) (Fig. 2B). Distribution of these haplotypes between cases and controls revealed no significant difference (Table 4).

Figure 2
figure 2

LD analysis using four gamete rule method showing LD (A) and haplotypes (B).

Table 4 Haplotype analysis based on four gamete rule method.

LD analysis by solid spine of LD method created three blocks. Block 1 contained 2 SNPs (rs2292186, rsc.206 + 275 T > A), block 2 contained three SNPs (rs12946, rs61752561, rs2003783,) and block 3 contained 2 SNPs (rs1810020, and rs17632542) (Fig. 3A). Haplotype analysis of these blocks revealed that block 1, 2 and 3 had the highest frequency of haplotypes CT (f = 0.869), CGC (f = 0.929), and GT (f = 0.679), respectively (Fig. 3B). Further, the distribution of haplotypes between cases and controls revealed no significant difference (Table 5).

Figure 3
figure 3

LD analysis by solid spine of LD method showing LD (A) and haplotypes (B).

Table 5 Haplotype analysis based on solid spine of LD method.

Estimation of t-PSA in seminal plasma

We quantified total PSA in seminal plasma of 96 infertile men and correlated its concentration with semen liquefaction time and sperm motility. We did not find any significant correlation of t-PSA level in seminal plasma with either motility (0.085) or liquefaction time (−0.062).

Discussion

Since long, PSA is well known to be the chief executor of the process of semen liquefaction, which releases the mass of entangled spermatozoa to achieve active motility and initiate their journey towards the ovum. Complete or partial failure of semen liquefaction would result in the loss of sperm motility, causing or contributing to infertility. Men with reduced sperm motility had low seminal fluid PSA29, 30 and a study on Swedish men showed a direct association between PSA level in the seminal fluid and sperm motility in normal male population23. In a large number of infertility cases, PSA is produced in sufficient quantity and semen liquefaction takes place within 5–20 minutes; this may exclude PSA as a possible cause of infertility in these cases. However, studies in the last two decades have pointed out that PSA may have a long trail of its impact of sperm functions and fertility that go beyond semen liquefaction. This starts in the vagina (site of semen deposition) with the first step in the form of semenogelins fragmentation, releasing sperm. Hereafter, semenogelins and their fragments are thought to affect sperm fertility by increasing thyrotropin releasing hormone like action, promoting zinc shuttling and inhibin like activities22.

The whole of seminal plasma contents are left behind once sperm make their way into the uterus. However, semenogelin like peptide fragments have been reported in sperm fractions in a number of studies2, 17, 31,32,33. A 19 kDa protein (probably from semenogelin processing) was found at the periphery of detergent-treated human sperm nuclei31. Further, the high binding capacity of semenogelins and their fragments for Zn2+ promotes the shuttling of Zn2+ to sperm nucleus, where it plays essential role in DNA stability2, 17. An interesting study found a 21 kDa protein (identified as semenogelins I precursor) in spermatozoa that was found at higher concentration in asthenozoospermic infertile men33. Among other evidences in support of numerous functions of semenogelins in sperm fertility, a recent study provided unequivocal evidence that semenogelins in fact cross the sperm plasma membrane to serve intracellular functions such as the inhibition of capacitation10. The study also reported that the levels of semenogelins drop fast at the time of sperm capacitation with a rise in ROS generation.

The above functions of semenogelins are dependent on PSA and other KLKs, making them important for fertility. Erroneous processing of semenogelins could have impact on sperm motlity/fertility even if semen liquefaction appears normal. It is possible that higher level of sperm semenogelins in some infertile men33 and its slow degradation32 could delay or prevent capacitation. Therefore, optimal activity of PSA and other KLKs is critical for sperm fertility. Among studies on KLK genes, Lee and Lee (2011) genotyped KLK2 SNPs (+255 G > A, rs2664155) in 218 infertility cases and 220 fertile controls and found a significant correlation of the polymorphisms with male infertility34. Savblom et al.35 reported the association of few SNPs in the hKLK2 and PSA genes with seminal and serum levels of KLK2 and PSA levels35. Similarly, a previous study reported a strong association of KLK7 polymorphisms with semen hyperviscosity, with a higher incidence in infertile cases36.

We identified 28 substitutions, out of which five (rs266881, rs174776, rs1810020, rs266875, rs35192866) associated with increased risk of infertility, while one (c.206 + 235 T > C) was protective. LD analysis suggested three blocks of SNPs, but haplotype analysis revealed no significant difference between cases and controls. SNPs (rs266881, rs174776) that fall in the intronic region of transcript ENST00000326003 may affect regulatory functions by as yet unknown mechanisms. Substitution at rs266881 results in a non-synonymous change in the transcript ENST00000596185 and increases the risk of infertility. This is the first study reporting the association of KLK3 SNPs with male infertility; however, the functional significance of these polymorphisms remains to be worked out. The genetic variations in kallikreins in relation to their impact on male fertility is in infantile stage and further studies on other candidate kallikreins are required. We did not find a correlation between PSA concentration and semen liquefaction/sperm motility; nevetheless, a previous study reported a significant correlation between PSA level and sperm motility in normal Swedish men23 and another study reported a similar correlation in infertile individuals37. This suggests that PSA level could affect sperm motility, but the effect of PSA activity on sperm motility and fertility has not been assessed.

In a nutshell, the increasing understanding of the functions of semenogelins and their petides in sperm fertility makes PSA far more than imporant for male fertility than previously thought. Eight of twenty-eight substitutions we observed had not been reported in the dbSNP and ESP databases before. Out of twenty-eight, only five SNPs correlated with increased infertility risk in our population; however, studies on other populations would help in identification of the most common risk factors for male infertility. SNPs rs266881, rs174776, rs1810020, rs266875 and rs35192866 affect the risk of male infertility and merit further investigation in other populations. Nevertheless, there were other SNPs which were observed solely in infertile cases, but their absence in controls may be a chance event. Therefore, KLK3 analysis in infertile individuals from ethnically different populations is strongly recommended. The findings of the present study would open up new horizons for investigation of KLK3′s importance in male fertility. Kallikrein related peptidases are so important in fertility that a host of them are found in cervical-vaginal fluid as well38. Further studies on semenogelins and PSA may reveal far unanticipated roles that they play in sperm functions and male fertility.