Abstract
In this work, we sequentially extracted water (CSPw)- and alkali (CSPa)-soluble protein fractions from glandless cottonseed. SDS-Gel electrophoresis separated CSPw and CSPa to 8 and 14 dominant polypeptide bands (110–10 kDa), respectively. Liquid chromatography-electrospray ionization-tandem mass spectrometry identified peptide fragments from 336 proteins. While the majority of peptides were identified as belonging to vicilin and legumin storage proteins, peptides from other functional and uncharacterized proteins were also detected. Based on the types (unique peptide count) and relative abundance (normalized total ion current) of the polypeptides detected by mass spectrometry, we found lower levels (abundance) and types of legumin isoforms, but higher levels and more fragments of vicilin-like antimicrobial peptides in glandless samples, compared to glanded samples. Differences in peptide fragment patterns of 2S albumin and oleosin were also observed between glandless and glanded protein samples. These differences might be due to the higher extraction recovery of proteins from glandless cottonseed as proteins from glanded cottonseed tend to be associated with gossypol, reducing extraction efficiency. This work enriches the fundamental knowledge of glandless cottonseed protein composition. For practical considerations, this peptide information will be helpful to allow better understanding of the functional and physicochemical properties of glandless cottonseed protein, and improving the potential for food or feed applications.
Similar content being viewed by others
Introduction
With about 28 × 109 kg or 124 million bales of cotton produced annually, cotton plant (Gossypium hirsutum L.) is an economically important crop known for fiber, cottonseed oil, and protein contents1,2,3. Currently, cotton fiber and cottonseed account for 85–90% and 10–15% of the value of the crop, respectively, while 150 kg of cottonseed is produced for every 100 kg fiber ginned. The cottonseed consists of approximately 16% oil, 45% meal, 25% hull, and 8% linters. The major components of defatted meal are carbohydrates and protein. Cottonseed protein comprises multiple polypeptides (i.e., various protein fractions) including seed storage proteins and proteins with other biological functions4,5. Cottonseed protein has great potential as a component of value-added industrial products and bio-active functional materials. The potential added-value products of cottonseed protein isolates include, but are not limited to, bio-based adhesives6,7,8,9,10, bioplastics and films11,12, antioxidant fractions/peptides13,14, and antibiotic peptides15.
The traditional variety of cottonseed contains the toxic terpenoid gossypol and is labeled “glanded cottonseed” as gossypol is deposited in scattered tissue structures called glands. Research efforts have been made to produce a new type of “glandless” cottonseed in which there is only trace gossypol content. Gossypol is a terpenoid produced and stored in the pigment glands of the typical cotton plant (about 1 weight percent (wt%) of dry matter of cottonseed)16. Thus, the traditional variety of cottonseed is also labeled as “glanded” cottonseed17,18. The presence of toxic gossypol limits cottonseed to mainly non-food applications1. Research has been directed towards eliminating or reducing gossypol from cottonseed (i. e., below 450 ppm free gossypol) to mitigate the toxic effects of gossypol. One notable genetically modified cultivar, TAM66274, has ultra-low gossypol cottonseed16. Another interesting genetically engineered cultivar is NuMex COT 15 GLS’ Glandless Cotton17. Cheng et al.19,20 demonstrated that the glandless cottonseed protein isolates could serve as wood adhesives. He et al.21 reported that glandless cottonseed protein-based adhesives had similar bonding performances as the glanded counterpart. In the latest study, He et al.14 comparatively examined the antioxidant activities of the water-soluble fractions of both glandless and glanded cottonseed protein. Cao et al.18,22 reported ethanol extracts from both glanded and glandless cottonseed kernels contained some bioactive ingredients.
Continued characterization of glandless cottonseed will enable improved understanding and food and feed utilization of glandless cottonseed protein. In this work, we isolated and separated glandless cottonseed proteins into water- and alkali-soluble fractions, analyzed their polypeptide profiles, and compared them to the peptide features of glanded cottonseed protein fractions.
Results and discussion
Polypeptide band features of glandless CSPw and CSPa on gradient SDS-PAGE
SDS-gel electrophoresis separated CSPw and CSPa into 8 and 14 discernable polypeptide bands, respectively (Fig. 1, full-length gels and blots are included in a Supplementary Information file). The polypeptide bands ranged from over 100 to about 11 kDa with CSPa, but just over 50 to 10 kDa for CSPw. Among the 14 CSPa bands, Ca 5 and Ca 6 at 55 and 45 kDa were most abundant. Together, they accounted for 43% of the relative abundance of the total protein load (Fig. 2). In contrast, the three most abundant bands in the CSPw sample (Cw 5, 6 and 7) were about 20, 15 and 13 kDa. Together, they accounted for 45% of the relative abundance of the total CSPw protein load. These observations were generally consistent with previous data of glandless cottonseed proteins14,23.
Similar protein band patterns with minor differences were observed when comparing glandless to glanded protein extract. There were more CSPa bands, but less CSPw bands in the glandless sample when compared to a glanded sample with 7 CSPa and 12 CSPw bands4. The main difference in the glandless CSPa bands was the four molecular weight bands from 60 to 110 kDa (Ca 1 to 4) which were not observed in either the glanded CSPa fraction4,24 or in glanded whole cottonseed protein isolates5,24,25. The presence of gossypol could result in modified SDS-PAGE protein patterns due to the chemical reactivity of gossypol with proteins. Gossypol is known to react with the free ε-amino group of lysine26 , likely promoting the formation of protein aggregates that were more difficult to extract. In fact, protein yield was 2.5-fold greater from the glandless meal than that from the glanded meal27. The higher extraction efficiency not only increased the quantity of protein biomass, but likely increased the diversity of observed proteins.
The glandless CSPa and CSPw extracts were also resolved on SDS-PAGE with the reducing agent dithiothreitol (DTT) to identify proteins containing disulfide bonds (Fig. 1). The CSPa extract band pattern was clearly altered by the inclusion of DTT in the loading buffer. The migration of several lower molecular weight bands (e. g., Ca10, 11, 13, and 14) in the CSPa sample appeared to be altered by DTT. While none of the relatively higher molecular weight bands in CSPw around 50 kDa appeared to change following the addition of DTT, the migration of several lower molecular weight bands (e. g., Cw 4, 5, 7 and 8) in the CSPw fraction were altered. With the DTT treatment, the relative abundance of new bands appearing in the range of 9–15 kDa made up 18% and 23% of relative protein abundance, respectively, for CSPa and CSPw (Fig. 2). Ma et al.25 reported that, in the presence of the reducing agent β-mercaptoethanol, many additional minor bands appeared between 14 and 35 kDa, compared to SDS-PAGE of their glanded cottonseed protein samples in the absence of a reducing agent. The emergence of these minor bands under reducing condition suggests the presence of intermolecular disulfide bonds in cottonseed protein isolates. It is hard to distinguish the 58/59 kDa DTT sensitive bands in the gel in Fig. 1. We are not aware of any previous research on the relevant SDS-PAGE band patterns under reducing conditions for glandless cottonseed protein samples. Like our work, Delgado et al.23 presented the SDS-PAGE of water-, 0.5 M NaCl-, 70% ethanol-,and 0.1 NaOH-soluble fractions of glandless cottonseed protein without reducing agent treatment. More research is needed to find out if the less impact of reducing treatment on 58/59 kDa bands is due to much less disulfide-bonded proteins presented in the glandless samples, or some other causes.
Peptide and protein profiles of CSPw and CSPa
The unique peptide count and their quantitative values are listed in spreadsheet SupTable1A and SupTable1B of the Supplementary Dataset File, respectively. In contrast to 2319 unique peptides and 70 proteins identified in the glanded sample4, there was a total of 5115 unique peptides that were matched to 336 proteins in the glandless extract. Therefore, the peptide diversity is wider for the glandless extracted than that for the glanded samples. The differences in observed peptide content provide support, but do not conclusively prove, the idea that extraction of proteins from glandless meal was more efficient than from the glanded meal27.
The molecular weight of proteins in the glandless cottonseed extracts ranged from 13 to 289 kDa, and several individual bands were compared to gene products or putative gene products of Gossypium arboretum, G. hirsutum or unspecified Gossypium species via UniPro (https://www.uniprot.org/uniprot/) and/or Conserved Domains in NCBI-BLASTp (https://blast.ncbi.nlm.nih.gov/Blast.cgi). Among them, 156 proteins appeared exclusively in CSPw bands, but only 19 proteins appeared exclusively in CSPa bands. There were 85 protein polypeptides that appeared only once in a unique CSPw band, and 22 proteins that appeared in all 8 CSPw bands. On the other hand, there were 78 protein polypeptides appearing uniquely in a single CSPa band, but 6 protein polypeptides that appeared in all 14 CSPw bands.
These proteins were functionally classified into 11 categories as protein storage, transporters, signal transduction, cell structure, transcription, translation, protein biosynthesis, protein metabolism, energy metabolism, antimicrobial activity, defense/stress, carbohydrate metabolism, fatty acid metabolism and protein species of unknown functions (Fig. 3). Biosynthesis/modification (17.4%), translation regulation (11.7%), transcription regulation (12.8%) and energy metabolism (17.1%) categories accounted for more than half of the proteins identified.
Peptides from the top 10 proteins accounted for about 25–53% of the protein biomass of a glandless CSPw band, and this was much lower than the percentages (84–100%) of any glanded CSPw band (Tables 1 and 2). This difference indicated a more diverse protein composition in glandless CSPw than in glanded CSPw. The percentages of the top 10 protein in glandless CSPa bands were generally high (> 90%), except that Ca12 (50.5%) and Ca14 (59%) were comparable to those in glanded CSPa bands. Vicilin proteins accounted for approximately 80% of glandless CSPa proteins including Ca12 (41.11%) and Ca14 (50.22%). Between the two, the abundance of vicilin-like antimicrobial peptides28 was higher in CSPa (4.52–12.93%) than in CSPw (2.65–6.36%). Legumins (2.39–12.45%) were the second most abundant proteins appeared in all 14 glandless CSPa bands, but not in top 10 of glandless CSPw. Indeed, legumin A and B accounted for only 0–0.88% and 0–1.9% of abundance, respectively, in glandless CSPw proteins which found in 7 and 6 of 8 CSPw bands, respectively. This observation was different from that of glanded CSPw fraction in which legumin A and B were the 1st and 4th most abundant, with a combined 22–53% of protein biomass (Table 1A).
The cotton 2S albumin was also observed in both CSPw and CSPa fractions of glandless cottonseed protein, but was not in the top 10 list of the glanded CSPa fraction. In contrast, another storage protein, the late embryogenesis abundant protein D-19, appeared in glanded CSPa sample. The lipid bodies-related storage protein oleosin with molecular weights of 16.4 and 18.2 kDa was also found in both CSPw and CSPa fractions, but with lower abundances than other storage proteins and was not in the top 10 for either cottonseed sample.
Peptides from three proteins of known function (cytosolic phosphoglycerate kinase, embryonic DC-8, annexin D2-like protein) and two proteins of known function (asparticase and 60S ribosomal L10a-1-like protein) were found in the top 10 of glandless CSPw and CSPa, respectively. Interestingly, none of them were found in the 90 proteins identified in glanded cottonseed samples. Similarly, two proteins of known function appeared in CSPw (transcription initiation factor TFIID subunit 1-B-like protein, and protein lin-54) and were not found in the total list of 336 proteins of glandless cottonseed. However, two (heat shock protein 70, and elongation factor 1-alpha) of the three functional proteins in the top 10 of glanded CSPa, were observed, but were not in the top 10, in the 336 proteins of glandless cottonseed.
Comparison of peptide fragments of legumins in glandless and glanded proteins
Peptide fragments of legumin storage proteins were identified in multiple bands in both CSPw and CSPa from glandless and glanded cottonseed, and were further analyzed to compare the number of observed peptides in each band (Fig. 4). The molecular weight of legumin A is approximately 58 kDa. While legumin A peptides were identified in all glandless CSPa bands from 100 to 9 kDa, the percentage of sequence coverage and number of peptide fragments were generally higher in high molecular weight gel bands than in low molecular CSPa bands. This observation indicated that legumin A of glandless cottonseed was mainly present in CSPa bands as whole protein, multiple and/or long peptide fragments. In contrast, there were just 1, 2 or 3 peptide fragments appearing in 7 of the 8 CSPw bands, suggesting degraded legumin A peptides were present in the CSPw fraction of glandless cottonseed protein. The distribution pattern of legumin B (approximately 59 kDa) was similar to that of legumin A. However, the number of peptide fragments and percent of sequence coverage of legumin B were lower than legumin A in glandless cottonseed. In contrast, peptides of both legumin A and B from glanded cottonseed were distributed in both CSPw and CSPa bands with similar patterns. The high sequence coverage suggested the two legumins were present as whole proteins or nearly full-length polypeptides in the CSPa and CSPw fractions of glanded cottonseed proteins.
Multiple bands from a single protein could be due to three mechanisms: (1) degradation during protein isolation and post treatment, (2) contamination from gels, and (3) indigenous isoforms (including post-translational protein modification)4. While the first and second mechanisms are possible, the systematic difference in the sequence coverage patterns observed between glandless and glanded cottonseed protein suggested the presence of isoforms. In the investigation of genomically biased accumulation of seed storage proteins in allopolyploid cotton, Hu et al.29 proposed that legumin isoforms may be formed through a series of modifications, including proteolytic cleavage and peptide degradation. Specifically, they found 83 spots as legumin A, and 27 spots as legumin B out of 155 identified in a 2-D SDS-PAGE image of cottonseed protein. Isoform analysis through peptide mapping indicated that the 30-kDa polypeptide of legumin A derived from the C-terminal fragment of the 58 kDa pre-propolypeptide. Legumin isoforms are commonly distributed at molecular mass of 30 kDa, 17–20 kDa, and 11–12 kDa as legumin A and at molecular weights of 11–13 kDa as legumin B. Therefore, the differences in legumins between glandless and glanded cottonseed proteins may indicate that lower levels (abundance) and/or types of legumin isoforms were present in glandless cottonseed. It remains to be determined if legumin protein expression could be impacted or not by the genetic engineering of the glandless Upland cotton (G. hirsutum L.) cultivar, which carried the incomplete dominant glandless Gl2e allele introduced from G. barbadense17.
Comparison of vicilin peptide fragments in glandless and glanded proteins
Like legumins, vicilins are primary storage proteins of cottonseed with multiple types and isoforms29,30. Similar to glanded samples, five types of vicilin proteins were identified in glandless cottonseed protein with a wide range of molecular weights (100–9 kDa). Two vicilin-like antimicrobial peptides possess potential antifungal activity in plant development in cropping management and function against important yeasts in medical mycology for medical application31,32. The distribution of these peptides in glandless and glanded cottonseed proteins was compared (Fig. 5). No apparent difference in peptide sequence or protein coverage pattern was observed between glandless and glanded vicilin proteins. However, in contrast to legumins, both vicilin-like antimicrobial peptides 2–1 and 2–2 seemed more abundant in glandless samples than in the corresponding glanded CSPa and CSPw samples. As the glandless cotton cultivar is depleted in gossypol in cottonseed, the increasing level of vicilin-like antimicrobial peptides may be a compensation mechanism to offset the beneficial role of gossypol’s antimicrobial properties during germination and young plantlet development of the glandless cotton32,33.
Compared to the distribution of antimicrobial peptide 2–2 fragments, more antimicrobial 2–1 peptides were identified in SDS-PAGE bands in both glandless and glanded cottonseed proteins. Interestingly, the peptide 2–1 fragments were concentrated in the C-terminus, but more SDS-PAGE bands were observed to contain N-terminal fragments in vicilin-like antimicrobial peptide 2–2. Chung et al.34 separated cottonseed protein into four major cotton seed basic protein fractions by a series of chromatographic procedures, and tested their in vitro antifungal activity. They suggested that those small molecular weight cysteine-rich vicilin-like peptides (approximately 16–20 kDa) with in vitro antifungal activity were from an approximately 70 kDa pre-proprotein. Furthermore, those vicilin-like antibacterial proteins (9–11 kDa) corresponded to N-terminally truncated forms of cysteine-rich small vicilin proteins (e.g., the 16.3 kDa protein). Their work seems consistent with our observation of the presence of both C-terminal and N-terminal enriched peptide 2–1 and 2–2. Hu et al.29 found that small isoforms of vicilin A (49, 35, 11, and 17 kDa) and vicilin B (49 and 17 kDa) were derived from the 70 kDa vicilin A and vicilin B pre-propolypeptides through the cleavage of signal peptides together with the N-terminal fragments, respectively, and further proposed that protein modifications (e.g., glycosylation, phosphorylation, acetylation, and methylation) likely contributed to the formation of these vicilin isoforms. Thus, there is rational and physiological justification for increased antimicrobial vicilin-like peptide production through post-translational cleavage of vicilin preprotein from glandless cotton (lacking in gossypol). The increased relative amount of antimicrobial proteins regarding glandless cottonseed protein samples would enhance its value in post-harvest products such as nutrient foods, food additives, and animal feed with better shelf life14,23,35,36 although the antimicrobial effect in the processed industrial products needs to be further confirmed.
Comparison of peptide fragments of 2S albumin and oleosin in glandless and glanded protein
Both 2S albumin and oleosin are storage proteins. Peptides matching with 2S albumin were found in all 22 CSPa and CSPw bands of glandless cottonseed protein, but only 9 of 19 bands of glanded samples (Fig. 6). Generally, the peptide strings (1–4 fragments) from glandless cottonseed protein were longer than the peptide sequence coverage of the glanded peptide fragments. In addition, the 2S albumin peptides from glanded cottonseed samples were only observed in the C-terminus of the protein. Sharing the homology to potent antifungal proteins from cheeseweed seeds, these small 2S albumin peptide fragments could potentially be used by cotton to form multiple antimicrobial molecules against a wide variety of pathogens37 during cottonseed germination.. By the way, more oxidation and carbamidomethyl (CAM) modification were observed with the 2S albumin fragments in glandless bands (up to 9 modifications in glandless sample vs 6 or less modifications in glanded sample shown in green in Fig. 6. However, CAM was a deliberate post-translational modification introduced to cysteine residues by reacting with iodoacetamide during sample preparation so that it was not indicative of a plant directed effort. The oxidation modifications could also part of sample preparation and not likely to be plant directed.
There were two types of lipid body-related oleosins with molecular weights of 16.4 kDa and 18.2 kDa in glandless samples, but only one type in glanded cottonseed protein (Fig. 6). They were relatively less abundant (not in the top 10), and only found in a few SDS-PAGE bands. The fragments from the 16.4 kDa oleosin were from the C-terminus of the protein in both glandless and glanded cottonseed samples even though the observed peptides were different. In contrast, 5 of 6 bands from the N-terminus of the 18.2 kDa oleosin were observed. Tzen et al.38 reported that the difference between maize 16 kDa and 18 kDa olesin proteins can be explained by an extra amino acid sequence of 2 kDa at the C-terminus of the 18 kDa. Santiago et al.39 found that coconut oleosin was composed of various isoforms with molecular weights of 48.78, 35.12, 32.39, 28.15, 22.06, and 16.13 kDa from SDS-PAGE analysis. Similarly, the possible presence of multiple isoforms of cottonseed oleosin can be seen in Fig. 5 B and D. The olesin protein in glandless cottonseed was dominated by the 18.4 kDa form, while the majority, if not all, of glanded oleosin was the 16.2 kDa form. Oleosins mainly act as oil body structural proteins and are important for the biogenesis, stability, and dynamics of the organelle40. Further research on the two oleosins in the two types of cottonseeds may shed new light into their roles in metabolism and cotton development.
Conclusions
This work reported the polypeptide profile of water and alkali soluble protein fractions of glandless cottonseed. SDS-Gel electrophoresis separated CSPw and CSPa to 8 and 14 dominant protein bands, respectively. LC–ESI–MS/MS of the 22 protein bands identified a total of 5115 unique peptides that were matched to 336 proteins. While the majority of peptides were matched to vicilin and legumin storage proteins, peptides from other functional and uncharacterized proteins were also detected. Compared to the polypeptide profile of the glanded cottonseed protein in literature, glandless cottonseed protein possessed relatively lower levels (abundance) and types (diversity) of legumin isoforms, but higher levels and more fragments of vicilin-like antimicrobial peptides. This work enriched the fundamental knowledge of glandless cottonseed protein composition, thus being helpful in better understanding the functional and physicochemical properties of glandless cottonseed protein for food and feed applications.
Materials and methods
Cottonseed protein products
Defatted cottonseed meal from glandless variety “NuMex 15 GLS” was provided by Cotton, Inc. (Cary, NC, USA)41. A two-step extraction and precipitation procedure was applied to obtained the water soluble (CSPw) and alkali soluble (CSPa) glandless cottonseed protein fractions from the defatted meal24,42. Briefly, the water- (CSPw) and alkali-(CSPa) soluble protein fractions in the defatted cottonseed meal were sequentially extracted by water (naturally around pH 7.0) and 0.015 M NaOH, and then precipitated at pH 4.0 and 7.0, respectively, by pH adjustment with 1 M HCl.. Both fractions were freeze-dried and kept at -20 °C until use. It should be aware that the sequential extraction of the cottonseed protein was a relative measure of different fractions in defatted meal. This is because, prior to defatting, cottonseeds, like other oilseeds, are generally subjected to a “cooking” process which might make some protein denatured and aggregated, affecting their relative solubility43,44. The protein contents of CSPw and CSPa were about 79% and 97%, respectively, converted from the total N content using d using a conversion factor of 5.545. To detect the presence of protein disulfide bonds dithiothreitol (DTT, Sigma-Aldrich, St. Louis, MO, USA) at 4 mM final concentration was added to CSPw and CSPa samples (2 µg) in 50 mM ammonium bicarbonate buffer (pH 8.0) at 37 °C for 15 min. Prior to trypsin digestion and LC–MS/MS analysis, reduced protein samples were alkylated in the dark with 15 mM iodoacetamide in 50 mM ammonium bicarbonate buffer (pH 8.0) for 30 min at room temperature. Iodoacetamide was then quenched by the addition of 4 mM DTT.
Sodium dodecyl sulphate–polyacrylamide gel electrophoresis (SDS-PAGE)
CSPw and CSPa samples (2 µg/lane) were solubilized in 4X NuPAGE LDS Sample Buffer (pH 8.5) (ThermoFisher Scientific, USA), with or without DTT as above, and heated at 55 °C for 5 min. Following a brief centrifugation step to remove particulates, samples were electrophoresed for 90 min at constant voltage (125 V) on Novex 10–20% Tricine Mini Protein Gels (ThermoFisher Scientific, USA) using an XCell SureLock Mini-Cell (ThermoFisher Scientific, USA) apparatus with prestained Precision Plus molecular weight standards (Bio-Rad, USA)46. Following electrophoresis, gels were rinsed and stained with SimplyBlue SafeStain (ThermoFisher Scientific, USA) according to manufacturer’s instructions. Gels were visualized and protein band intensities were quantified using the 680 nm channel on a Li-Cor Odyssey infrared scanner and Image Studio software (LI-COR Biosciences, USA). The relative band intensity of a band was calculated by the percentage of that specific band intensity out of the sum of all band intensity values, and used to represent the relative protein content of that band in the total protein mass).
Liquid chromatography–electrospray ionization–tandem spectrometry (LC–ESI–MS/MS)
Another set of CSPw and CSPa samples (10 µg/lane for six lanes each sample) were subjected to SDS-PAGE under the same conditions as described above. Individual bands excised from the multiple SDS-PAGE lanes were pooled and subjected to in-gel trypsin digestion42. The fragments in the digestion samples were analyzed by LC–ESI–MS/MS by UAB Mass Spectrometry/Proteomics Shared Facility (University of Alabama at Birmingham, Birmingham, Alabama, USA). The data were acquired with Bruker UltraFlex III MALDI ToF/ToF. The tandem mass spectral data generated were processed with SEQUEST and searched by Mascot against protein databases. The quantitative values of the normalized total ion current (TIC) of the peptide MS fragments were used as a relative measurement of the observed peptide in the gel bands. Scaffold (version Scaffold_4.11.0, Proteome Software Inc., Portland, OR) was used to validate MS/MS based peptide and protein identifications. The molecular function of individual proteins was searched via UniPro (https://www.uniprot.org/uniprot/) and/or Conserved Domains in NCBI-BLASTp (https://blast.ncbi.nlm.nih.gov/Blast.cgi). In addition to the specific functions, per the online search results, these proteins were grouped into 11 categories (Gene Ontology terms) for general identification.
Experimental design and statistical analysis
Relative abundance of the major protein fractions separated by SDS-PAGE were calculated as their band intensities relative to the total band intensity of each lane. The lanes were measured and data were presented as the average with standard deviation (n = 3).
Same gel bands of six SDS-PAGE lanes were pooled as one sample for LC–ESI–MS/MS. For MS data treatments, peptide identifications were accepted if they could be established at greater than 80.0% probability by the Peptide Prophet algorithm with Scaffold delta-mass correction. Protein identifications were accepted if they could be established at greater than 99.0% probability and contained at least 2 identified peptides. Protein probabilities were assigned by the Protein Prophet algorithm47. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony.
References
Cheng, H. N., He, Z., Ford, C., Wyckoff, W. & Wu, Q. A review of cottonseed protein chemistry and non-food applications. Sustain. Chem. 1, 256–274 (2020).
Zubair, M. et al. A combinational approach of enhanced methanol production and double Bt genes for broad spectrum insect resistance in transgenic cotton. Mol. Biotechnol. 61, 663–673 (2019).
Rojo-Gutiérrez, E., Buenrostro-Figueroa, J., López-Martínez, L., Sepúlveda, D. & Baeza-Jiménez, R. Valorisation of Agro-industrial Residues—Volume II: Non-Biological Approaches 63–82 (Springer, 2020).
He, Z., Zhang, D. & Cao, H. Protein profiling of water and alkali soluble cottonseed protein isolates. Sci. Rep. 8, 9306. https://doi.org/10.1038/s41598-41018-27671-z (2018).
Singh, A. & Kaur, A. Comparative studies on seed protein characteristics in eight lines of two Gossypium species. J. Cotton Res. 2, 6. https://doi.org/10.1186/s42397-42019-40024-42393 (2019).
Li, J. et al. Assessment and application of phosphorus/calcium-cottonseed protein adhesive for plywood production. J. Clean. Prod. 229, 454–462 (2019).
Cheng, H. N., Ford, C. V. & He, Z. Evaluation of polyblends of cottonseed protein and polycaprolactone plasticized by cottonseed oil. Int. J. Polymer Anal. Character. 24, 389–398 (2019).
Cheng, H. N., Ford, C. V., Dowd, M. K. & He, Z. Soy and cottonseed protein blends as wood adhesives. Ind. Crop. Prod. 85, 324–330 (2016).
Liu, M., Wang, Y., Wu, Y., He, Z. & Wan, H. “Greener” adhesives composed of urea-formaldehyde resin and cottonseed meal for wood-based composites. J. Clean. Prod. 187, 361–371 (2018).
He, Z., Cheng, H. N. & Nam, S. Comparison of the wood bonding performance of water-and alkali-soluble cottonseed protein fractions. J. Adhes. Sci. Technol. https://doi.org/10.1080/01694243.2020.1850612,1-18 (2021).
de Oliveira Filho, J. G. et al. Active food packaging: Alginate films with cottonseed protein hydrolysates. Food Hydrocolloids 92, 267–275 (2019).
Yue, H. et al. On the improvement of properties of bioplastic composites derived from wasted cottonseed protein by rational cross-linking and natural fiber reinforcement. Green Chem. https://doi.org/10.1039/D0GC03245J (2020).
Song, W. et al. Antioxidant and antibacterial activity and in vitro digestion stability of cottonseed protein hydrolysates. LWT 118, 108724 (2020).
He, Z., Zhang, D. & Olanya, O. M. Antioxidant activities of the water-soluble fractions of glandless and glanded cottonseed protein. Food Chem. 325, 126907 (2020).
Mohammadrezaei, M., Navidshad, B., Gheisari, A. & Toghyani, M. Cottonseed meal bioactive peptides as an alternative to antibiotic growth promoters in broiler chicks. Int. J. Peptide Res. Therap. 1–12 (2020).
Rathore, K. S. et al. Ultra-low gossypol cottonseed: Selective gene silencing opens up a vast resource of plant-based protein to improve human nutrition. Crit. Rev. Plant Sci. 39, 1–29 (2020).
Zhang, J. et al. Registration of ‘NuMex COT 15 GLS’ glandless cotton. J. Plant Regist. 10, 223–227 (2016).
Cao, H., Sethumadhavan, K. & Bland, J. M. Isolation of cottonseed extracts that affect human cancer cell growth. Sci. Rep. 8, 1–12 (2018).
Cheng, H. N., Dowd, M. K. & He, Z. Investigation of modified cottonseed protein adhesives for wood composites. Ind. Crop. Prod. 46, 399–403 (2013).
Cheng, H. N., Wyckoff, W., Dowd, M. K. & He, Z. Evaluation of adhesion properties of blends of cottonseed protein and anionic water-soluble polymers. J. Adhes. Sci. Technol. 33, 66–78 (2019).
He, Z., Cheng, H. N., Chapital, D. C. & Dowd, M. K. Sequential fractionation of cottonseed meal to improve its wood adhesive properties. J. Am. Oil Chem. Soc. 91, 151–158 (2014).
Cao, H. & Sethumadhavan, K. Cottonseed extracts and gossypol regulate diacylglycerol acyltransferase gene expression in mouse macrophages. J. Agric. Food Chem. 66, 6022–6030 (2018).
Delgado, E. et al. Characterization of soluble glandless cottonseed meal proteins based on electrophoresis, functional properties, and microscopic structure. J. Food Sci. 84, 2820–2830 (2019).
He, Z., Cao, H., Cheng, H. N., Zou, H. & Hunt, J. F. Effects of vigorous blending on yield and quality of protein isolates extracted from cottonseed and soy flours. Mod. Appl. Sci. 7(10), 79–88 (2013).
Ma, M. et al. Physicochemical and functional properties of protein isolate obtained from cottonseed meal. Food Chem. 240, 856–862 (2018).
Conkerton, E. J. & Frampton, V. L. Reaction of gossypol with free ϵ-amino groups of lysine in proteins. Arch. Biochem. Biophys. 81, 130–134 (1959).
Dowd, M. K. & Hojilla-Evangelista, M. P. Preparation and characterization of protein isolate from glandless and glanded cottonseed. ACS Symp. Ser. 1144, 343–357 (2013).
Goyal, R. K. & Mattoo, A. K. Multitasking antimicrobial peptides in plant development and host defense against biotic/abiotic stress. Plant Sci. 228, 135–149 (2014).
Hu, G. et al. Genomically biased accumulation of seed storage proteins in allopolyploid cotton. Genetics 189, 1103–1115 (2011).
Shutov, A. D., Kakhovskaya, I. A., Braun, H., Baumlein, H. & Muntz, K. Legumin-like and vicilin-like seed storage proteins: Evidence for a common single-domain ancestral gene. J. Mol. Evolut. 41, 1057–1069 (1995).
Vieira Bard, G.C. et al. Vicilin‐like peptides from Capsicum baccatum L. seeds are α‐amylase inhibitors and exhibit antifungal activity against important yeasts in medical mycology. Peptide Sci. 102, 335–343 (2014).
Xie, Z., Saha, N. & Chlan, C. Antimicrobial activity of a cys-rich peptide derived from a Centrosema virginianum vicilin. Am. J. Plant Sci. 7, 92–107 (2016).
Gao, W. et al. The gland localized CGP1 controls gland pigmentation and gossypol accumulation in cotton. Plant Biotechnol. J. 18, 1573 (2020).
Chung, R.P.-T., Neumann, G. M. & Polya, G. M. Purification and characterization of basic proteins with in vitro antifungal activity from seeds of cotton, Gossypium hirsutum. Plant Sci. 127, 1–16 (1997).
Swiatkiewicz, S., Arczewska-Wlosek, A. & Jozefia, D. The use of cottonseed meal as a protein source for poultry: An updated review. World Poultry Sci. J. 72, 473–484 (2016).
Abdallh, M. et al. Energy and protein utilisation by broiler chickens fed diets containing cottonseed meal and supplemented with a composite enzyme product. Br. Poultry Sci. 103, 5102–5117 (2020).
Wang, X., Bunkers, G. J., Walters, M. R. & Thoma, R. S. Purification and characterization of three antifungal proteins from cheeseweed (Malva parviflora). Biochem. Biophys. Res. Commun. 282, 1224–1228 (2001).
Tzen, J. T., Lai, Y.-K., Chan, K.-L. & Huang, A. H. Oleosin isoforms of high and low molecular weights are present in the oil bodies of diverse seed species. Plant Physiol. 94, 1282–1289 (1990).
Santiago, M.R. & Devanadera, M.K.P. Microencapsulation of Pediococcus spp. with coconut oil bodies and oleosin for improved viability and enhanced targeted delivery in the gastrointestinal tract. Philippine Agric. Sci. 99 (2016).
Shao, Q., Liu, X., Su, T., Wang, P. & Ma, C. New insights into the role of seed oil body proteins in metabolism and plant development. Front. Plant Sci. 10, 1568 (2019).
He, Z., Zhang, H. & Olk, D. C. Chemical composition of defatted cottonseed and soy meal products. PLoS ONE 10(6), e0129933 (2015).
He, Z., Uchimiya, M. & Cao, H. Intrinsic fluorescence excitation-emission matrix spectral features of cottonseed protein fractions and the effects of denaturants. J. Am. Oil Chem. Soc. 91, 1489–1497 (2014).
Chen, N., Zhao, M., Chassenieux, C. & Nicolai, T. Thermal aggregation and gelation of soy globulin at neutral pH. Food Hydrocolloids 61, 740–746 (2016).
Dowd, M.K. Cotton. 2nd edn. In Agronomy Monograph, Vol. 57. (eds. D.D. Fang & R.G. Percy) 745–781 (ASA, CSSA, and SSSA, 2015).
Ezeagu, I., Petzke, J., Metges, C., Akinsoyinu, A. & Ologhobo, A. Seed protein contents and nitrogen-to-protein conversion factors for some uncultivated tropical plant seeds. Food Chem. 78, 105–109 (2002).
Mattison, C. P., Grimm, C. C. & Wasserman, R. L. In vitro digestion of soluble cashew proteins and characterization of surviving IgE-reactive peptides. Mol. Nutr. Food Res. 58, 884–893 (2014).
Nesvizhskii, A. I., Keller, A., Kolker, E. & Aebersold, R. A statistical model for identifying proteins by tandem mass spectrometry. Anal. Chem. 75, 4646–4658 (2003).
Acknowledgements
We thank Kyoko Kojima and Brandon Young (University of Alabama at Birmingham CCC MS & Proteomics Shared Facility) for help on liquid chromatography-electrospray ionization-tandem mass spectrometric analysis and protein identifications. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. USDA is an equal opportunity provider and employer.
Author information
Authors and Affiliations
Contributions
Conceived and designed the experiments: Z.H. Performed the experiments: Z.H., C.P.M., D.Z., C.G. Analyzed the data: Z.H., C.P.M., D.Z. Wrote the manuscript: Z.H. Manuscript editing: Z.H., C.P.M. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
He, Z., Mattison, C.P., Zhang, D. et al. Vicilin and legumin storage proteins are abundant in water and alkali soluble protein fractions of glandless cottonseed. Sci Rep 11, 9209 (2021). https://doi.org/10.1038/s41598-021-88527-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-88527-7
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.