Revealing Corynebacterium glutamicum proteoforms through top-down proteomics

Melo, Reynaldo Magalhães; de Souza, Jaques Miranda Ferreira; Williams, Thomas Christopher Rhys; Fontes, Wagner; de Sousa, Marcelo Valle; Ricart, Carlos André Ornelas; do Vale, Luis Henrique Ferreira

doi:10.1038/s41598-023-29857-6

Download PDF

Article
Open access
Published: 14 February 2023

Revealing Corynebacterium glutamicum proteoforms through top-down proteomics

Reynaldo Magalhães Melo¹,
Jaques Miranda Ferreira de Souza¹,
Thomas Christopher Rhys Williams²,
Wagner Fontes¹,
Marcelo Valle de Sousa ORCID: orcid.org/0000-0002-2871-4430¹,
Carlos André Ornelas Ricart¹ &
…
Luis Henrique Ferreira do Vale ORCID: orcid.org/0000-0002-4113-5978¹

Scientific Reports volume 13, Article number: 2602 (2023) Cite this article

1816 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Corynebacterium glutamicum is a bacterium widely employed in the industrial production of amino acids as well as a broad range of other biotechnological products. The present study describes the characterization of C. glutamicum proteoforms, and their post-translational modifications (PTMs) employing top-down proteomics. Despite previous evidence of PTMs having roles in the regulation of C. glutamicum metabolism, this is the first top-down proteome analysis of this organism. We identified 1125 proteoforms from 273 proteins, with 60% of proteins presenting at least one mass shift, suggesting the presence of PTMs, including several acetylated, oxidized and formylated proteoforms. Furthermore, proteins relevant to amino acid production, protein secretion, and oxidative stress were identified with mass shifts suggesting the presence of uncharacterized PTMs and proteoforms that may affect biotechnologically relevant processes in this industrial workhorse. For instance, the membrane proteins mepB and SecG were identified as a cleaved and a formylated proteoform, respectively. While in the central metabolism, OdhI was identified as two proteoforms with potential biological relevance: a cleaved proteoform and a proteoform with PTMs corresponding to a 70 Da mass shift.

The Archaeal Proteome Project advances knowledge about archaeal cell biology through comprehensive proteomics

Article Open access 19 June 2020

Proteomic consequences of TDA1 deficiency in Saccharomyces cerevisiae: Protein kinase Tda1 is essential for Hxk1 and Hxk2 serine 15 phosphorylation

Article Open access 27 October 2022

Differential proteomic analysis by SWATH-MS unravels the most dominant mechanisms underlying yeast adaptation to non-optimal temperatures under anaerobic conditions

Article Open access 18 December 2020

Introduction

The bacterium Corynebacterium glutamicum is an industrial workhorse capable of producing a broad range of biomolecules from a large pool of substrates¹ and is currently the best option for production of L-alpha amino acids². Amino acid production represents a multi-billion dollar market³, with an annual production of approximately 10 million tons worldwide², with l-glutamate and l-lysine corresponding to 3.3 million tons/year and 2.2 million tons/year, respectively². Besides amino acid production, C. glutamicum is also utilized in other biotechnological processes, such as the production of heterologous proteins⁴, organic acids⁵, carotenoids, isobutanol⁶, polymers⁷, and more recently, its potential in bioremediation processes has been investigated⁸.

Recent developments in proteomics have demonstrated the importance of post-translational modifications (PTMs) in several species of bacteria⁹. In C. glutamicum, the importance of PTMs was investigated previously revealing the crucial effect of oxoglutarate dehydrogenase inhibitor (OdhI) phosphorylation in the production of l-glutamate^10,11. More recently, mass spectrometry proteomics of C. glutamicum showed the influence of l-glutamate-producing conditions on the succinylation and acetylation of C. glutamicum metabolic proteins, including OdhI¹².

There are currently two main proteomics approaches that are referred to as bottom-up and top-down. Briefly, in the bottom-up approach proteins are cleaved by a protease (usually trypsin) and the subsequent peptides are analyzed by mass spectrometry, whilst in the top-down procedure, proteins are examined in their intact forms¹³. As a consequence of protease digestion, the information from several protein regions can be lost in bottom-up approaches, including PTMs. Furthermore, in this approach protein identity is inferred based on peptide identification, and this may cause ambiguity in cases in which peptides may belong to more than one protein¹³. In contrast, the top-down approach allows the identification of the full sequence of proteins. Moreover, analysis of intact forms of proteins permits the identification of proteoforms, defined as different forms of proteins derived from the same gene¹⁴. Such proteoforms can be generated by amino acid residue substitution, proteolytic cleavage, alternative splicing, and several types of PTMs¹⁵. Furthermore, different proteoforms of the same protein can have different function and affect biological processes of an organism¹⁶. The power of top-down proteomics has been applied to other bacteria¹⁷. For instance, top-down proteomics of Escherichia coli and the identification of proteoforms has been performed¹⁸. Despite the evidence of the relevance of PTMs in C. glutamicum metabolism, no top-down proteomic studies have been published to date. Here we carried out the first top-down proteomics analysis of C. glutamicum, using a precursor tolerant approach to identify and reveal PTMs and proteoforms.

Results and discussion

As expected, a wide range of proteoform molecular masses could be seen in the pre-fractionated intracellular proteins, but GELFrEE was efficient in separating large proteoforms from smaller forms (Fig. 1A). Fractions 0, 1, 2, 3, 4, 5, 7, 9 and 11 displayed proteoforms below 50 kDa, and for this reason were chosen for subsequent LC–MS/MS analysis. Despite the clear presence of proteins in the 30–50 kDa range, as shown by SDS-PAGE (Fig. 1A), LC–MS analysis detected proteoforms predominantly below 30 kDa (supplementary Fig. S1). The reduced number of observed proteoforms larger than 30 kDa may be due to known challenges for the identification of denatured large proteoforms such as the signal-to-noise ratio reduction occurring with increases in proteoform molecular weight¹⁹.

Global post-translational modifications profile

The top-down proteomic analysis of C. glutamicum generated 5127 proteoform spectrum matches (PrSMs), providing the identification of 1125 different proteoforms related to 273 different proteins (supplementary Table 1). Approximately 65% (177 proteins) of the total protein number and 47% of PrSMs (2423 PrSMs) were identified with mass shifts (Δm), mass differences between the expected precursor mass and observed precursor mass, which suggests the presence of PTMs. Putative PTMs were annotated in supplementary Table 1 using the entire Unimod database, however, this information should be used with caution, since spectra were not manually verified and the software considered rounded mass shifts for comparisons, generating a tolerance of approximately 0.5 Da. There are software available for spectra visualization and further evaluation of PTMs, which includes ProSight Lite²⁰ and TopMSV²¹. Analysis of all PrSMs with Δm revealed a broad diversity of Δm, suggesting the presence of different PTMs (Fig. 1B).

Some of the most frequent Δm identified here were also reported as highly present in Escherichia coli through bottom-up proteomics using an unbiased identification strategy²². In C. glutamicum, the mass shifts most frequently identified were 16 Da (putative oxidation, 10.15% of modified PrSMs), −18 Da (putative dehydration, 2.97% of modified PrSMs), and 32 Da (possible double oxidation, 1.32% of modified PrSMs). Interestingly, the proportion of E. coli identified peptide spectrum matches (PSMs) with -18 Da (3.8%) and 32 Da (0.8%) Δm were very similar to those found in this study. In contrast, the proportion of PSMs identified with 16 Da (24%) Δm in E. coli was greater than in C. glutamicum. However, it is worth mentioning that the high number of PrSMs identified with 15 Da Δm (6.11% of modified PrSMs) may be caused by miss identification of the 16 Da Δm, as will be further discussed in the next topics.

Proteins identified with Δm were analyzed using the String Cytoscape app, after clustering proteins according to their interaction nodes, each cluster was submitted to overrepresentation analysis and the most representative terms were related to different gene ontology terms such as protein-containing complexes, electron transfer activity and CYTH domain (CyaB, thiamine triphosphatase), pyrimidine metabolism, transmembrane helix, biosynthesis of secondary metabolites and thioredoxin domain. Moreover, eight ribosomal PrSMs presented two Δm, indicating the presence of at least two concomitantly occurring PTMs (supplementary Fig. S2).

Beyond functional annotation, we analyzed the number of N-terminal acetylation and some frequently identified Δm: 16 Da, 15 Da, 28 Da, 266 Da, -18 Da, and 32 Da (Fig. 2A). These Δm suggest the presence of oxidation (UnimodAC: 35, Δm = 15.994915 Da), deamidation followed by a methylation (UnimodAC: 528, Δm = 14.999666 Da), formylation (UnimodAC: 122, Δm = 27.994915 Da), sodium dodecyl sulfate (SDS) adducts²³, dehydration (UnimodAC: 23, Δm = −18.010565 Da), and persulfide (UnimodAC: 421, Δm = 31.972071 Da). The 28 Da mass shift may also correspond to di-methylation (Unimod: 36, Δm = 28.031300 Da). Despite being difficult to differentiate between formylation and di-methylation, the higher number of Δm close to 27.99 Da and 27.98 Da (supplementary Table 2) leans the inference towards formylation events. The amino acid residue localization of the 28 Da mass shift was not possible without ambiguity, as can be seen by the absence of this mass shift in Fig. 2B. A possible reason for this is that the suggested formylation events were identified mainly at the N-terminal of proteins, and its assignment was always between a few N-terminal residues of the protein, as in the case of the protein MscL (supplementary Fig. 7). When the assignment of the mass shift was in more than one amino acid residue, the information was not plotted to avoid ambiguity. The suggested oxidation events related to the 16 Da Δm is further supported by a large number of methionine residues modified by this mass shift (Fig. 2B). On the other hand, the 15 Da Δm seems to be sometimes misinterpreted, since the most frequently identified residue with this modification was also methionine. However, some glutamic acid residues were identified with this Δm, suggesting that deamidation followed by methylation can still be present in some cases, but are easily confused with oxidations (Fig. 2B). The 32 Da mass shift identified in some proteins could be related to persulfide PTM, however, this modification occurs in cysteine and aspartic acid, and none of these could be observed in the unambiguously identified residues (Fig. 2B). Other possibility would be two oxidations, supported by the methionine residues identified with this mass shift. On the other hand, the possibility of persulfide modification should not be excluded, mainly in proteins in which the modified amino acid residue could not be unambiguously identified. Protein oxidation may have great relevance in biological processes, however, it is important to note that it may also be an artefact caused by sample handling²⁴. Moreover, several Δm were approximately 266 Da, suggesting the presence of adducts resulting from sodium dodecyl sulfate (SDS). The presence of SDS artifacts was also reported in an intact proteomic analysis of E. coli, and suggested to be caused by incomplete removal of the detergent²⁵. Regarding the −18 Da Δm, it may result from dehydration events and was predominantly identified in serine residues (Fig. 2B). These predominant mass shifts may also be related to other putative PTMs, such as tyrosine oxidation to 2-aminotyrosine (UnimodAC: 342, Δm = 15.010899 Da) for the mass shifts of 15 Da. These possibilities can be observed separately for each identified PrSM in supplementary Table 1. N-terminal acetylation was the most frequently identified PTM (Fig. 2A); on the other hand, the 42 Da mass shift, which suggests lysine acetylation (UnimodAC: 1, Δm = 42.010565 Da) was not frequently detected (supplementary Table 2). In identifying proteoforms, the N-terminal acetylation was used as a variable modification. Therefore, proteoforms with lysine acetylation near the N-terminal without fragments to explain the 42 mass shift to a lysine residue could be misidentified with a N-terminal acetylation.

Overrepresentation analysis of proteins belonging to the six most frequent modifications allowed the identification of terms related to ribosomal proteins, predominant in proteins with Δm of −18 Da, 15 Da, 16 Da, and N-terminal acetylated proteins (Fig. 3). Prokaryotic ribosomal proteins are described to be commonly methylated and acetylated²⁶. On the other hand, Δm of 28 Da and 266 Da were overrepresented for terms related to membrane proteins. In addition, proteins identified with 28 Da mass shift showed overrepresentation for the secretion system and protein export (Fig. 3).

We identified 13 proteins that presented more than 15 proteoforms and most of them were ribosomal (supplementary Fig. S3). Moreover, when considering proteins with more than 9 proteoforms, the cellular component annotation also revealed membrane proteins (supplementary Fig. S3). In despite of the great number of proteoforms, it is worth mentioning that some of them may be boosted by SDS adducts. For example, 50S ribosomal protein L7/L12 (Q8NT28) proteoforms were identified with Δm of 266 Da, 532 Da, and 401 Da, probably representing the addition of one SDS adduct, two SDS adducts, and two SDS adducts plus the loss of the initiator methionine residue. The presence of SDS adducts in proteins is not desirable, however, the identification of these proteins is important to avoid misidentifications and to assign proteins that were only identifiable through adducted forms²⁵.

In order to contribute with more accurate data regarding C. glutamicum biotechnological and metabolic processes, some PrSMs related or potentially involved with these functions were manually characterized through inspection of MS1 and MS2 Spectra (Table 1).

Table 1 Proteoforms with potential metabolism regulation function and of biotechnological interest.

Full size table

Membrane proteins

Secretion system protein (SecG) (Fig. 4A) and large-conductance mechanosensitive channel (MscL) (supplementary Fig. S4) were identified with 28 Da mass shifts. The PrSMs fragments of these proteoforms presented ions supporting the identification of a mass shift of 28 Da, always near the N-terminus, and the mass errors of fragments containing this Δm were extremely low. An example of MS1 and MS2 inspection is demonstrated with SecG (Q8Z469) (Fig. 4A). In this figure, the large number of b-ions supporting the 27.9883 mass shift (b4, b5, b6, b7, b8, b9, b10, b11 and b12) represented by the blue lines in the protein sequence is shown, and it can be seen that the mass shift is localized to a very narrowed region of three amino acid residues, due to the several b-ions contained in this region.

The 28 Da mass shift near the protein N-terminus of these proteins suggests the presence of N-terminal formylation (Unimod: 122, Δm = 27.994915 Da) in the identified proteoforms. Recent evidence has suggested the N-terminal formylation of methionine as a signal for protein degradation. This mechanism has been hypothesized as a quality control of protein translation in bacteria²⁷. Both the SecG and MscL formylated PrSMs mentioned above were identified with the presence of N-terminal methionine. This suggests a possible mechanism of degradation of membrane proteins, including some with promising biotechnological applications. For example, the SecG protein is part of the Sec protein export pathway. There is a growing interest in the capacity of C. glutamicum to express and secrete heterologous proteins of biotechnological interest^4,28. Furthermore, MscL and MscS are mechanosensitive channels, known for reacting to osmotic stress. Another mechanosensitive channel of C. glutamicum (MscCG; P42531) plays a major role in l-glutamate efflux²⁹.

Top-down proteomics is efficient in identifying cleaved proteoforms. For example, mepB, a membrane protein related to the metalloendopeptidase (Q8NS24) was identified by a portion of its sequence with precursor mass of 14.464 kDa. This mass corresponds to the loss of 97 amino acid residues from its N-terminal region (Fig. 4B). In contrast, a mepB cleavage site was supposed to be between Ala43 and Ala44, after a putative signal peptide or transmembrane helix³⁰. Furthermore, according to pfam (https://pfam.xfam.org/), mepB has a domain belonging to the M23 metallopeptidase family (MEROPS) [130–226]. A well characterized member of MEROPS is the LytM protein from Staphylococcus aureus, a metallopeptidase involved in autolysis³¹. It was demonstrated that cleavage in the N-terminal chain causes its peptidase activity to be activated³². Moreover, MEROPS proteins have specificity to peptidoglycan polyglycine regions, some with suggested cell wall metabolism activity³³. In agreement, the C. glutamicum mepB gene was described as part of the MtrAB regulon, a two component system implicated in osmoregulation and cell wall metabolism control³⁴. Although it is not clear whether mepB is secreted or membrane bound, it has been suggested that its activity would occur extracitoplasmically³⁰. Considering this, C. glutamicum mepB may have an important role in cell wall metabolism and/or heterologous protein secretion integrity, and its activity is likely regulated by cleavage of its N-terminal.

Tricarboxylic acid cycle and glutamate metabolism

C. glutamicum is widely used in the industrial production of amino acids, especially l-glutamate, which is produced in millions of tons per year³⁵. The tricarboxylic acid (TCA) cycle is an important step in l-glutamate production by C. glutamicum. It is well established that the decrease of 2-oxoglutarate dehydrogenase complex (ODHC) activity occurs in conditions that induce l-glutamate production^36,37. Furthermore, ODHC activity was found to be regulated by the phosphorylation status of oxoglutarate dehydrogenase inhibitor (OdhI, Q8NQJ3). Thus, the phosphorylated proteoform of OdhI is unable to interact with ODHC, whilst, unphosphorylated OdhI interacts with ODHC inhibiting 2-oxoglutarate conversion to succinyl-CoA¹¹ (Fig. 5A). Moreover, the decreased ODHC activity causes an enhanced production of l-glutamate from 2-oxoglutarate³⁷ (Fig. 5A). Congruently, OdhI phosphorylation status is affected by methods that induce l-glutamate production¹¹.

In this study, seven proteoforms of OdhI were identified. Some of them seemed to be caused by sample preparation or ionization process, such as oxidation (Δm = 16 Da) and dehydration (Δm = −18 Da). Conversely, two proteoforms were identified with potential biological relevance: one of them with N-terminal cleavage of three residues and another one with Δm of 70 Da (supplementary Fig. S5). Although these two proteoforms were confidently identified by TopPic, a low number of matching fragments covered the modified regions, lowering our confidence regarding their identifications, principally concerning their location in the sequence. On the other hand, in agreement with one of the identified proteoforms, a bottom-up proteomic analysis in a preliminary study conducted by our group suggested the presence of Δm of 70 Da in the N-terminal peptide of OdhI (data not shown).

Recently, the 70 Da mass shift was identified in the GM-CSF heterologous expressed protein in Escherichia coli system. This modification was hypothesized to be a result of crotonaldehyde formed during oxidative stress by lipid peroxidation. The aldehyde reacts with the protein N-terminus or lysine residues, resulting in the Δm of 70 Da³⁸. Another possible PTM for this mass shift is the butyrylation of lysine (UnimodAC: 1289, Δm = 70.041865 Da). In the present study, the exact site of the modification resulting in the 70 Da mass shift in OdhI could not be identified due to the complexity of the spectra and limited fragmentation by MS/MS. Therefore, other modifications such as 5 methyl group additions (14 Da), and acetylation (42 Da) followed by formylation (28 Da) could not be excluded. As aforementioned, ODHC inactivation arises from the interaction of unphosphorylated OdhI and the OdhA subunit of ODHC. The T14 OdhI phosphorylation inhibits this interaction, resulting in ODHC activation¹⁰. OdhI has a fork-head associated (FHA) domain responsible for the recognition of a phosphothreonine residue. Furthermore, this domain is the region that interacts with the OdhA subunit of ODHC, resulting in its inactivation. Moreover, it was demonstrated that the phosphorylation of the OdhI threonine residue causes drastic changes in its conformation, leading to an auto inhibition of its function, consequently activating ODHC³⁹. More recently, it was reported that K142 succinylation also affects the OdhI–ODHC interaction, hampering the inhibition of ODHC with impacts on C. glutamicum l-glutamate production^12,40.

Furthermore, as an acetylation site at K52 of OdhI was also described^12,40, we investigated such a modification in addition to N-terminal formylation (27.9949 Da + 42.0106 Da) as a possible cause of production of the 70 Da OdhI proteoform. However, these modifications resulted in a considerable loss of matched fragment peaks (data not shown). Therefore, it is improbable that this was the source of such a mass shift. Moreover, the suggested region for the 70 Da Δm is near to the known T14 phosphorylation site of OdhI, consequently raising the question if it could affect its interaction with ODHC. OdhI phosphorylation at T14 is mainly catalyzed by C. glutamicum PknG¹⁰. Considering the recognition capacity of the OdhI FHA domain of its phosphothreonine and the close location of this phosphorylation site with the 70 Da Δm proteoform, this modification may affect OdhI inhibition of ODHC, resulting in its activation or inactivation (Fig. 5B). A possible mechanism is the obstruction of OdhI phosphorylation by the presence of the 70 Da PTM, allowing the FHA domain interaction and consequently inhibition of ODHC (Fig. 5B, purple box). Another hypothesis is that the 70 Da modification of OdhI, even having T14 phosphorylated residue, interferes with the FHA domain recognition of the phosphothreonine, resulting in the inhibition ODHC by OdhI (Fig. 5B, purple box). Moreover, the 70 Da mass shift may even behave similarly to the phosphorylated residue, inducing the conformational change of OdhI, inhibiting it and activating ODHC (Fig. 5B, green box). It is less likely that the N-terminal tripeptide truncation causes similar effects, however its influence in OdhI cannot be discarded (Fig. 5B). Furthermore, these alterations may influence l-glutamate production in these bacteria, as a consequence of ODHC regulation (Fig. 5B). Despite these potentially relevant effects of OdhI putative proteoforms, further studies must be performed to investigate the importance and validity of these modifications in OdhI function.

Another protein with relevance to the glutamate production process is the heavy-metal-associated domain (HMA) containing protein (Q8NL68, HMADP). Its corresponding transcript was identified as up-regulated in a series of glutamate overproduction conditions, however its function in this process remains unclear⁴¹. Here we identified three different proteoforms of this protein. One of them presented Δm of −2 Da (supplementary Fig. S6), suggesting the presence of a disulfide bond (UnimodAC: 2020, Δm = −2.015650 Da), amino acid residue substitution (UnimodAC: 1217, Δm = −2.015650 Da, Val- > Pro or UnimodAC: 1145, Δm = −1.997892 Da, Met- > Glu) or didehydro (UnimodAC: 401, Δm = −2.015650 Da) in the modified region. Moreover, several PrSMs were identified with Δm of approx. 30 Da. Inspection of its spectra resulted in the confirmation of this mass shift, but it is more likely that it resulted from two PTMs, one of 14 Da and other of 16 Da (supplementary Fig. S6), possibly corresponding to methylation and oxidation, respectively. Furthermore, an isotopic envelope could be detected near this identified proteoform (Δm = 30 Da) with the intact mass of the −2 Da modification of HMADP (supplementary Fig. S6, precursor in blue).

The two putative modifications of HMADP proteoforms are close together in the protein sequence (supplementary Fig. S6). Moreover, no canonical proteoform could be identified. These suggest that the presence of methylation may affect disulfide bond formation. The HMA sequence is found in bacterial proteins conferring toxic heavy metal resistance⁴². To clarify C. glutamicum HMADP function, the Q8NL68 protein sequence was blasted against the UniprotKB reference proteomes plus Swiss-Prot database with default parameters in Uniprot, resulting in similarity to copper chaperone and heavy metal transport/detoxification proteins (supplementary Table 3). The transcripts for the copper-responsive two-component system, CopRS, were shown to be up-regulated in C. glutamicum during penicillin induction of glutamic acid production⁴³. Recently, it was demonstrated that copper can induce glutamic acid production by C. glutamicum, however in a lower amount compared to typical treatments, such as penicillin or biotin limitation⁴⁴. Despite this, the role of HMADP in L-glutamate production remains unclear, however it appears to be a stress response related protein, and it may therefore be regulated or regulate other proteins through PTMs associated with oxidative stress.

Stress response related proteins

Less common Δm values were observed in a peroxiredoxin (Q8NMS6). One proteoform with a mass shift of 154 Da was identified and N-terminal acetylation was also detected (supplementary Fig. S7). Inspection of the MS/MS spectrum allowed us to identify fragments that support the 154 Da mass shift, however, the N-acetylation could not be narrowed down to a shorter sequence region, making it unclear if it was a series of methylation events or indeed an acetylation (supplementary Fig. S7). Considering only one PTM, the mass shift of 154 Da may be caused by three events: addition of glycerophosphate (UnimodAC: 419, Δm = 154.003110 Da), decanoyl (UnimodAC: 449, Δm = 154.135765 Da) or 4-oxo-2-nonenal (ONE) (UnimodAC: 721, Δm = 154.099380 Da). The ONE protein modification is a lipid peroxidation product, caused by reactive species interactions with membrane lipids, suggesting this protein may be localized near the cell membrane of C. glutamicum. This molecule can be added to nucleophile amino acid residues⁴⁵. Observing the proposed region of modification, the presence of ONE at suggested site is unlikely, however, near this location are three possible modification sites, C51, K48, and C46 (supplementary Fig. S7). This peroxiredoxin was described as a peroxiredoxin Q with an important role in the oxidative stress response, and capable of reducing H₂O₂, t-BOOH, cumene hydroperoxide, and peroxynitrite. Interestingly, the C46 proposed to suffer the 154 Da mass shift is the catalytic cysteine of this peroxiredoxin⁴⁶. This suggests this modification may have an important impact on its catalytic activity, and consequently in several C. glutamicum oxidative stress responses. Moreover, proteins involved in oxidative stress regulation are often susceptible to oxidative modifications as a process to regulate redox activity in the cell²⁴. This evidence supports the 154 Da mass shift as a ONE modification in C. glutamicum peroxiredoxin. However, other possibilities cannot be discarded. Another peroxiredoxin proteoform with a Δm of 186 Da supports the identification of the 154 mass shift in this protein and is likely caused by a second PTM in its sequence of approximately 32 Da. In agreement, near the precursor fragmented for the identification of the 186 Da Δm proteoform, there was an isotopic envelope corresponding to the loss of around 32 Da (supplementary Fig. S7). This suggests the presence of a PTM of 32 Da in addition to the putative ONE modification. Mass differences of 32 Da are usually due to two oxidations in different methionines, but the methionines in this protein sequence are closer to the C-terminal, where several fragments were identified without this mass shift. Another possibility for this mass shift would be a dihydroxylation of cysteine (UnimodAC: 425, Δm = 31.989829 Da). There are two cysteines near the modified region (supplementary Fig. S7). Despite the pKa of free cysteine being around 8.6, the presence of positively charged residues near the cysteine decreases it by 3–4 units, supporting the oxidation of its thiol group⁴⁷. In agreement, there is an arginine (R54) and lysine (K47) near C51. Moreover, the C51 is the resolving residue and was suggested to be very important in the catalytic process of this peroxiredoxin⁴⁶.Two oxidations of cysteines result in the formation of sulfinic acid, which is an irreversible modification and signal for protein degradation⁴⁷. Therefore, there is a good chance that the peroxiredoxin protein of C. glutamicum undergoes oxidative modifications which could regulate its activity and integrity. Furthermore, the suggested modification by a ONE peroxidation suggests a possible role of this protein in cell membrane repair.

Another thioredoxin (Q8NLG6, trxB1), described as thiol-disulfide isomerase and thioredoxins, was identified by two proteoforms, both with N-terminal cleavage of 26 amino acid residues. Interestingly, one proteoform was identified with approximately −2 Da mass shift in a region near two cysteines (supplementary Fig. S8), suggesting the formation of a disulfide bond. Another proteoform was identified with a Δm of 29.88 Da with ambiguous possibility of modifications (supplementary Fig. S8), such as two oxidations plus a disulfide bond, or methylation followed by oxidation. Moreover, both proteoforms were identified by good quality MS/MS spectra with several fragments representing the two modified forms (supplementary Fig. S8). This thioredoxin was demonstrated to be responsive to disulfide stress, regulated by the SigM sigma factor in C. glutamicum⁴⁸. Thioredoxins are known to act as a repair system of oxidized cysteine residues, through its CxxC motif. Its cysteine residues undergo oxidation, forming a disulfide bond, producing the reduced form of cysteine residues in the target protein⁴⁷. The presence of a −2 Da Δm identified near this motif of trxB1 reinforces that it is a disulfide bond. Moreover, the identification of oxidized and reduced proteoforms of a thioredoxin indicates the possibility of comparative studies in quantifying these proteoforms and estimate the redox status of the cell.

Top-down proteomic analysis of the industrial workhorse Corynebacterium glutamicum revealed several new putative PTMs of this bacterium related to different biological processes. More precisely, 1125 proteoforms were identified, from 273 proteins. Moreover, membrane proteins and proteins involved in translation seem to be heavily susceptible to PTMs. Proteins relevant to biotechnological and metabolic processes were identified by new proteoforms, which may imply new regulation strategies. For example, the OdhI protein is involved in amino acid production and was identified by a new proteoform suggesting a putative butyrylation or crotonaldehyde peroxidation, that may affect its inhibition of ODHC, therefore influencing glutamate production. The endoproteinase mepB protein was also identified with a new proteoform, cleavage of 93 amino acids residues of its N-terminal, suggesting an undescribed mechanism of its activation in C. glutamicum with possible effects on cell wall metabolism. SecG membrane protein, a subunit of protein secretion system had a putative N-terminal formylation, suggesting a degradation signaling with possible consequences in protein secretion. Another membrane protein was identified with a putative N-terminal formylation, MscL, which may influence metabolite efflux. A peroxiredoxin was identified with a putative ONE modification near its catalytic cysteine residue, suggesting a role in oxidative stress responses. Another stress response protein was identified with N-terminal cleavage, that may influence disulfide stress response. The HMADP protein, implicated in glutamate producing conditions and metal transport/detoxification, was identified by two proteoforms, one with a putative disulfide bond and another with putatives methylation and oxidation, suggesting possible effects in glutamate production and/or metal detoxification. The influence of these proteoforms in C. glutamicum biological processes should be validated and further investigated. For instance, a recent study used genetic engineering of C. glutamicum OdhI, introducing mimic amino acid residues, to investigate OdhI acetylation and succinylation influence on its interaction with ODHC⁴⁰. Another possibility would be comparative analysis coupling quantitative top-down proteomics with metabolomics, observing differences in proteoform abundances correlated with metabolite abundance changes.

Methods

Strain and growth conditions

C. glutamicum ATCC 13032 was grown in tryptic soy agar (17 g/L pancreatic digest of casein, 3 g/L papaic digest of soybean, 5 g/L NaCl, 2.5 g/L K₂HPO₄, 2.5 g/L glucose monohydrate, 1.5 g/L agar) and pre-culture was performed overnight in tryptic soy broth under 170 rpm agitation at 30 °C. Afterwards, the pre-culture was used to inoculate three flasks containing CGXII media [20 g/L of (NH₄)₂SO₄, 5 g/L of Urea, 40 g/L of glucose, 1 g/L of KH₂PO₄, 1 g/L of K₂HPO₄, 0.25 g/L of MgSO₄^.7H₂O, 42 g/L of MOPS, 10 mg/L of CaCl₂, 10 mg/L of FeSO₄^.7H₂O, 10 mg/L of MnSO₄^.7H₂O, 1 mg/L of ZnSO₄^.7H₂O, 0.2 mg/L of CuSO₄, 0.02 mg/L of NiCl₂^.6H₂O, 0.2 mg/L of biotin, 0.03 mg/L of protocatechuic acid]⁴⁹ [start optical density (OD) = 1.0]. The samples were grown for 36 h under 170 rpm agitation at 30 °C. Growth measurements were assessed by OD₆₀₀ using a spectrophotometer CLARIOstar (BMG LABTECH, Germany).

Sample preparation for LC–MS/MS

C. glutamicum cultures were harvested and centrifuged at 12,000×g for 10 min at 21 °C. Pellets were washed with CGXII medium minus glucose, by resuspending the pellet and centrifuging at 12,000×g for 10 min at 21 °C. Cells were then resuspended in lysis buffer (100 mM Tris–HCl, 2 mM DTT, SDS 4%, cOmplete™ EDTA-free Protease Inhibitor Cocktail from Roche, Basel, Switzerland), followed by physical disruption through maceration in liquid nitrogen. After maceration, the solution was centrifuged at 20,000×g for 15 min at 21 °C to separate insoluble particles, and the supernatant with soluble intracellular proteins was stored at −80 °C until further use. Proteins were then quantified by Qubit™ (Invitrogen). A pooled sample was prepared using equal protein quantities of each replicate and 500 µg of the pooled sample was fractionated by Gel-eluted Liquid Fraction Entrapment Electrophoresis—GELFrEE⁵⁰, using a homemade cartridge with 12% resolving gel and 4% stacking gel. First, the sample was diluted in sample buffer 2× (Tris–HCl 0.125 M, SDS 4%, glycerol 20%, DTT 0.1 M, bromophenol blue 0.01%), then it was submitted to electrophoresis using a running buffer (Tris–HCl 0.025 M, Glycine 0.192 M, SDS 0.1%) and a constant current of 10 mA. Fractions were collected according to time (in minutes) after bromophenol blue elution (0 min). Subsequent collections were as follows: 1, 2, 3, 4, 5, 7, 9, 11, 13, 15, 20, 25, 30, 45, 60, 75, 90, and 120 min. The molecular mass range of proteins from each fraction was assessed by SDS-PAGE⁵¹. GELFrEE fractions were then submitted to methanol/chloroform/water precipitation for SDS removal⁵². Briefly, four volumes of methanol were added to the fractions, followed by addition of 1 volume of chloroform and 3 volumes of water, with 30 s of vortexing subsequent to the inclusion of each solution. After water addition and vortex, the solutions were submitted to centrifugation at 20,000×g for 10 min at 21 °C, resulting in the formation of two layers, with proteins floating between them. Then, the superior layer was removed, with care to prevent disturbing the protein pellet, and 3 volumes of methanol were added, followed by gentle mixing and centrifugation at 20,000×g for 10 min at 21 °C. Subsequently, the supernatant was discarded, followed by a second wash with 3 volumes of methanol as described before, and the resulting pellet was dried in a laminar flow bio-hood. After drying, pellets were resuspended in 5% acetonitrile and 0.1% formic acid and stored in −80 °C before LC–MS/MS.

LC–MS/MS

Three technical replicates of GELFrEE fractions 0, 1, 2, 3, 4, 5, 7, 9 and 11 were submitted to top-down analysis on a nano-UHPLC Dionex Ultimate 3000 system coupled to Orbitrap Elite™ mass spectrometer (Thermo Scientific, Bremen, Germany) (LC–MS/MS). Analytical (30 cm, 75 µM I.D.) and trap (4 cm, 100 µM I.D.) columns packed with PLRPS 1000 A, 5 µM (Agilent, California, USA) were used for reverse phase separation. In the analytical column a tip of approximately 1 cm was pulled using a P-2000 instrument (Sutter, California, USA), to be used as emitter in the mass spectrometer. Both columns were kept at room temperature, controlled at approximately 22 °C. Samples were loaded using a flowrate of 3 µL/min for 10 min, under the isocratic condition of 5% Acetonitrile and 0.1% formic acid, then a gradient under flow of 0.230 µL/min was used to elute proteoforms from the column for MS analysis. The gradient was composed of solutions A (formic acid 0.1%) and B (acetonitrile, formic acid 0.1%) and was created by: 5% B (0–10 min), 20% B (10–55 min), 55% B (55–60 min), 85% B (60–80 min) and 5% B (80–90 min). Acquisitions were made during the 90 min gradient in positive mode, and the top-2 most abundant ions were fragmented by stepped high-energy collisional dissociation (HCD) applying normalized collision energy of 25, using two steps, with collision energy width of 10, resulting in two separated fragmentations of ions with normalized collision energy of 20 and 30, and analysis of all resulting fragments together in the Orbitrap. Isolation width of precursors was 25 m/z, using a dynamic exclusion duration of 30 s. Source-induced dissociation (SID) was used with 15 eV in MS1 and the first MS2, whilst for the second most intense ion, it was set to 35 eV. Resolutions were 240,000 and 120,000 for MS1 and MS2, respectively. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE⁵³ partner repository with the dataset identifier PXD038038.

Proteoform identification and characterization

Thermo raw files were converted to MzML format using the MSConvert tool⁵⁴. Identification and characterization of proteoforms were done using the TopPic suite software tool, version 1.4⁵⁵. Briefly, spectra were deconvoluted using TopFD with default parameters followed by identifications against a C. glutamicum ATCC 13032 reference proteome (11-2020), obtained from UniProt (https://www.uniprot.org/). Two mass shifts between −500 Da and 500 Da were allowed in the identification and the N-terminal forms were set as “NONE” (No modifications), “NME” (N-terminal methionine excision), “NME-ACETYLATION” (N-terminal methionine excision and N-terminal acetylation), and “M-ACETYLATION” (N-terminal acetylation). A false discovery rate strategy was adopted with a decoy database, resulting in the identification of only proteoforms with FDR below 1% at the proteoform spectrum matches (PrSMs) level. Proteoform characterization and annotation were followed by visual interpretation of TopMSV²¹ files. Detected proteins’ MW evaluation and retention times were performed using VisioProt-MS⁵⁶.

Data analysis and functional annotation

Proteoforms identification data were analyzed in R 4.0.2 (R Core Team 2020) using the packages tidyverse⁵⁷ and ggplot2⁵⁸. Briefly, the number of identified mass shifts (Δm) was counted and the frequency of the most common Δm were defined based on rounded values. Then, the number of amino acid residues modified by each Δm was defined. Functional annotation and overrepresentation analysis were performed using the String Cytoscape app^59,60, and DAVID⁶¹. Visual interpretation of DAVID overrepresented terms was also carried out using R computer language through the creation of a bubble plot with the tools present in ggplot2.

Data availability

Mass spectra raw files are available via the Proteomics IDEntifications Database Archive (PRIDE, https://www.ebi.ac.uk/pride/archive/, ID: PXD038038).

References

Becker, J. & Wittmann, C. Bio-based production of chemicals, materials and fuels—Corynebacterium glutamicum as versatile cell factory. Curr. Opin. Biotechnol. 23, 631-640 (2012).
Sanchez, S., Rodríguez-Sanoja, R., Ramos, A. & Demain, A. L. Our microbes not only produce antibiotics, they also overproduce amino acids. J. Antibiot. (Tokyo) 71, 26–36 (2018).
Article CAS Google Scholar
Becker, J. & Wittmann, C. Industrial Microorganisms: Corynebacterium glutamicum. Ind. Biotechnol. https://doi.org/10.1002/9783527807796.ch6 (2016).
Freudl, R. Beyond amino acids: Use of the Corynebacterium glutamicum cell factory for the secretion of heterologous proteins. J. Biotechnol. 258, 101–109 (2017).
Article CAS PubMed Google Scholar
Okino, S., Inui, M. & Yukawa, H. Production of organic acids by Corynebacterium glutamicum under oxygen deprivation. Appl. Microbiol. Biotechnol. 68, 475–480 (2005).
Article CAS PubMed Google Scholar
Yamamoto, S., Suda, M., Niimi, S., Inui, M. & Yukawa, H. Strain optimization for efficient isobutanol production using Corynebacterium glutamicum under oxygen deprivation. Biotechnol. Bioeng. 110, 2938–2948 (2013).
Article CAS PubMed Google Scholar
Matsumoto, K., Kitagawa, K., Jo, S. J., Song, Y. & Taguchi, S. Production of poly(3-hydroxybutyrate-co-3-hydroxyvalerate) in recombinant Corynebacterium glutamicum using propionate as a precursor. J. Biotechnol. 152, 144–146 (2011).
Article CAS PubMed Google Scholar
Ray, D. et al. The soil bacterium, Corynebacterium glutamicum, from biosynthesis of value-added products to bioremediation: A master of many trades. Environ. Res. 213, 113622 (2022).
Article CAS PubMed Google Scholar
Carabetta, V. J. & Hardouin, J. Editorial: Bacterial post-translational modifications. Front. Microbiol. 13, 1–2 (2022).
Article Google Scholar
Niebisch, A., Kabus, A., Schultz, C., Weil, B. & Bott, M. Corynebacterial protein kinase G controls 2-oxoglutarate dehydrogenase activity via the phosphorylation status of the OdhI protein. J. Biol. Chem. 281, 12300–12307 (2006).
Article CAS PubMed Google Scholar
Kim, J., Hirasawa, T., Saito, M., Furusawa, C. & Shimizu, H. Investigation of phosphorylation status of OdhI protein during penicillin- and Tween 40-triggered glutamate overproduction by Corynebacterium glutamicum. Appl. Microbiol. Biotechnol. 91, 143–151 (2011).
Article CAS PubMed Google Scholar
Mizuno, Y. et al. Altered acetylation and succinylation profiles in Corynebacterium glutamicum in response to conditions inducing glutamate overproduction. Microbiologyopen 5, 152–173 (2016).
Article CAS PubMed Google Scholar
Catherman, A. D., Skinner, O. S. & Kelleher, N. L. Top Down proteomics: Facts and perspectives. Biochem. Biophys. Res. Commun. 445, 683–693 (2014).
Article CAS PubMed PubMed Central Google Scholar
Smith, L. M. & Kelleher, N. L. Proteoform: A single term describing protein complexity. Nat. Methods 10, 186–187 (2013).
Article CAS PubMed PubMed Central Google Scholar
Schaffer, L. V. et al. Identification and quantification of proteoforms by mass spectrometry. Proteomics 19, 1–15 (2019).
Google Scholar
Carbonara, K., Andonovski, M. & Coorssen, J. R. Proteomes are of proteoforms: Embracing the complexity. Proteomes 9, 38 (2021).
Gault, J. et al. Complete posttranslational modification mapping of pathogenic Neisseria meningitidis pilins requires top-down mass spectrometry. Proteomics 14, 1141–1151 (2014).
Article CAS PubMed PubMed Central Google Scholar
McCool, E. N. et al. Deep top-down proteomics using capillary zone electrophoresis-tandem mass spectrometry: Identification of 5700 proteoforms from the Escherichia coli proteome. Anal. Chem. 90, 5529–5533 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fornelli, L. & Toby, T. K. Characterization of large intact protein ions by mass spectrometry: What directions should we follow?. Biochim. Biophys. Acta-Proteins Proteom. 1870, 140758 (2022).
Article CAS PubMed Google Scholar
Fellers, R. T. et al. ProSight Lite: Graphical software to analyze top-down mass spectrometry data. Proteomics 15, 1235–1238 (2015).
Article CAS PubMed PubMed Central Google Scholar
Choi, I. K., Jiang, T., Kankara, S. R., Wu, S. & Liu, X. TopMSV: A web-based tool for top-down mass spectrometry data visualization. J. Am. Soc. Mass Spectrom. https://doi.org/10.1021/jasms.0c00460 (2021).
Brown, C. W. et al. Large-scale analysis of post-translational modifications in E. coli under glucose-limiting conditions. BMC Genomics 18, 1–21 (2017).
Fridriksson, E. K., Baird, B. & McLafferty, F. W. Electrospray mass spectra from protein electroeluted from sodium dodecylsulfate polyacrylamide gel electrophoresis gels. J. Am. Soc. Mass Spectrom. 10, 453–455 (1999).
Article CAS PubMed Google Scholar
Lennicke, C. et al. Redox proteomics: Methods for the identification and enrichment of redox-modified proteins and their applications. Proteomics 16, 197–213 (2016).
Article CAS PubMed Google Scholar
Dai, Y. et al. Elucidating Escherichia coli proteoform families using intact-mass proteomics and a global PTM discovery database. J. Proteome Res. 16, 4156–4165 (2017).
Article CAS PubMed PubMed Central Google Scholar
Suh, M.-J., Hamburg, D.-M., Gregory, S. T., Dahlberg, A. E. & Limbach, P. A. Extending ribosomal protein identifications to unsequenced bacterial strains using matrix-assisted laser desorption/ionization mass spectrometry. Proteomics 5, 4818–4831 (2005).
Article CAS PubMed PubMed Central Google Scholar
Piatkov, K., Vu, T., Hwang, C.-S. & Varshavsky, A. Formyl-methionine as a degradation signal at the N-termini of bacterial proteins. Microb. Cell 2, 376–393 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lee, M. J. & Kim, P. Recombinant protein expression system in Corynebacterium glutamicum and its application. Front. Microbiol. 9, 2523 (2018).
Nakamura, J., Hirano, S., Ito, H. & Wachi, M. Mutations of the Corynebacterium glutamicum NCgl1221 gene, encoding a mechanosensitive channel homolog, induce l-glutamic acid production. Appl. Environ. Microbiol. 73, 4491–4498 (2007).
Article CAS PubMed PubMed Central ADS Google Scholar
Brocker, M., Mack, C. & Bott, M. Target genes, consensus binding site, and role of phosphorylation for the response regulator MtrA of Corynebacterium glutamicum. J. Bacteriol. 193, 1237–1249 (2011).
Ramadurai, L. & Jayaswal, R. K. Molecular cloning, sequencing, and expression of lytM, a unique autolytic gene of Staphylococcus aureus. J. Bacteriol. 179, 3625–3631 (1997).
Article CAS PubMed PubMed Central Google Scholar
Odintsov, S. G., Sabala, I., Marcyjaniak, M. & Bochtler, M. Latent LytM at 1.3 Å resolution. J. Mol. Biol. 335, 775–785 (2004).
Grabowska, M., Jagielska, E., Czapinska, H. & Bochtler, M. High resolution structure of an M23 peptidase with a substrate analogue. Nat. Publ. Gr. https://doi.org/10.1038/srep14833 (2015).
Bott, M. & Brocker, M. Two-component signal transduction in Corynebacterium glutamicum and other corynebacteria: On the way towards stimuli and targets. Appl. Microbiol. Biotechnol. 94, 1131–1150 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hirasawa, T. & Shimizu, H. Recent advances in amino acid production by microbial cells. Curr. Opin. Biotechnol. 42, 133–146 (2016).
Article CAS PubMed Google Scholar
Kawahara, Y., Takahashi-Fuke, K., Shimizu, E., Nakamatsu, T. & Nakamori, S. Relationship between the glutamate production and the activity of 2-oxoglutarate dehydrogenase in brevibacterium lactofermentum. Biosci. Biotechnol. Biochem. 61, 1109–1112 (1997).
Article CAS PubMed Google Scholar
Shimizu, H. et al. Effects of the changes in enzyme activities on metabolic flux redistribution around the 2-oxoglutarate branch in glutamate production by Corynebacterium glutamicum. Bioprocess Biosyst. Eng. 25, 291–298 (2003).
Article CAS PubMed Google Scholar
Sandberg, M. W., Bunkenborg, J., Thyssen, S., Villadsen, M. & Kofoed, T. Characterization of a novel + 70 Da modification in rhGM-CSF expressed in E. coli using chemical assays in combination with mass spectrometry. Amino Acids. https://doi.org/10.1007/s00726-021-03004-9 (2021).
Barthe, P. et al. Dynamic and structural characterization of a bacterial FHA protein reveals a new autoinhibition mechanism. Structure 17, 568–578 (2009).
Article CAS PubMed Google Scholar
Komine-Abe, A. et al. Effect of lysine succinylation on the regulation of 2-oxoglutarate dehydrogenase inhibitor, OdhI, involved in glutamate production in Corynebacterium glutamicum. Biosci. Biotechnol. Biochem. 81, 2130–2138 (2017).
Article CAS PubMed Google Scholar
Kataoka, M. et al. Gene expression of Corynebacterium glutamicum in response to the conditions inducing glutamate overproduction. Lett. Appl. Microbiol. 42, 471–476 (2006).
Article CAS PubMed Google Scholar
Bull, P. C. & Cox, D. W. Wilson disease and Menkes disease: New handles on heavy-metal transport. Trends Genet. 10, 246–252 (1994).
Article CAS PubMed Google Scholar
Hirasawa, T., Saito, M., Yoshikawa, K., Furusawa, C. & Shmizu, H. Integrated analysis of the transcriptome and metabolome of Corynebacterium glutamicum during penicillin-induced glutamic acid production. Biotechnol. J. 13, 1700612 (2018).
Article Google Scholar
Ogata, S. & Hirasawa, T. Induction of glutamic acid production by copper in Corynebacterium glutamicum. Appl. Microbiol. Biotechnol. 105, 6909–6920 (2021).
Article CAS PubMed Google Scholar
Doorn, J. A. & Petersen, D. R. Covalent adduction of nucleophilic amino acids by 4-hydroxynonenal and 4-oxononenal. Chem. Biol. Interact. 143–144, 93–100 (2003).
Article PubMed Google Scholar
Su, T. et al. A thioredoxin-dependent peroxiredoxin Q from Corynebacterium glutamicum plays an important role in defense against oxidative stress. PLoS ONE 13, e0192674 (2018).
Article PubMed PubMed Central Google Scholar
Ezraty, B., Gennaris, A., Barras, F. & Collet, J. F. Oxidative stress, protein damage and repair in bacteria. Nat. Rev. Microbiol. 15, 385–396 (2017).
Article CAS PubMed Google Scholar
Nakunst, D. et al. The extracytoplasmic function-type sigma factor SigM of Corynebacterium glutamicum ATCC 13032 is involved in transcription of disulfide stress-related genes. J. Bacteriol. 189, 4696–4707 (2007).
Article CAS PubMed PubMed Central Google Scholar
Keilhauer, C., Eggeling, L. & Sahm, H. Isoleucine synthesis in Corynebacterium glutamicum: Molecular analysis of the ilvB-ilvN-ilvC operon. J. Bacteriol. 175, 5595–5603 (1993).
Article CAS PubMed PubMed Central Google Scholar
Tran, J. C. & Doucette, A. A. Multiplexed size separation of intact proteins in solution phase for mass spectrometry. Anal. Chem. 81, 6201–6209 (2009).
Article CAS PubMed Google Scholar
Laemmli, U. K. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227, 680–685 (1970).
Article CAS PubMed ADS Google Scholar
Wessel, D. & Flügge, U. I. A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 138, 141–143 (1984).
Article CAS PubMed Google Scholar
Perez-Riverol, Y. et al. The PRIDE database resources in 2022: A hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 50, D543–D552 (2022).
Article CAS PubMed Google Scholar
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kou, Q., Xun, L. & Liu, X. TopPIC: A software tool for top-down mass spectrometry-based proteoform identification and characterization. Bioinformatics 32, 3495–3497 (2016).
Article CAS PubMed PubMed Central Google Scholar
Locard-Paulet, M. et al. VisioProt-MS: Interactive 2D maps from intact protein mass spectrometry. Bioinformatics 35, 679–681 (2019).
Article PubMed Google Scholar
Wickham, H. et al. Welcome to the Tidyverse. J. Open Source Softw. 4, 1686 (2019).
Article ADS Google Scholar
Wickham, H. ggplot2. https://doi.org/10.1007/978-0-387-98141-3 (Springer, 2009).
Doncheva, N. T., Morris, J. H., Gorodkin, J. & Jensen, L. J. Cytoscape StringApp: Network analysis and visualization of proteomics data. J. Proteome Res. 18, 623–632 (2019).
Article CAS PubMed Google Scholar
Szklarczyk, D. et al. STRING v11: Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2019).
Article CAS PubMed Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Financiadora de Estudos e Projetos (FINEP) and Fundação de Apoio à Pesquisa do Distrito Federal (FAP-DF, Process N.: 00193.00001484/2021-91 and 0193001195/2016).

Author information

Authors and Affiliations

Laboratory of Protein Chemistry and Biochemistry, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
Reynaldo Magalhães Melo, Jaques Miranda Ferreira de Souza, Wagner Fontes, Marcelo Valle de Sousa, Carlos André Ornelas Ricart & Luis Henrique Ferreira do Vale
Laboratory of Plant Biochemistry, Department of Botany, Institute of Biology, University of Brasilia, Brasilia, Brazil
Thomas Christopher Rhys Williams

Authors

Reynaldo Magalhães Melo
View author publications
You can also search for this author in PubMed Google Scholar
Jaques Miranda Ferreira de Souza
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Christopher Rhys Williams
View author publications
You can also search for this author in PubMed Google Scholar
Wagner Fontes
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Valle de Sousa
View author publications
You can also search for this author in PubMed Google Scholar
Carlos André Ornelas Ricart
View author publications
You can also search for this author in PubMed Google Scholar
Luis Henrique Ferreira do Vale
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.M.M. performed the bacterial growth and prepared the sample for mass spectrometry. R.M.M. and T.C.R.W. analyzed the data. J.M.F.S., W.F., M.V.S., and L.H.F.V. performed the mass spectrometer maintenance and injection of samples. C.A.O.R., R.M.M., T.C.R.W., and L.H.V.F. conceived the experiments. All authors revised the manuscript.

Corresponding author

Correspondence to Luis Henrique Ferreira do Vale.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Melo, R.M., de Souza, J.M.F., Williams, T.C.R. et al. Revealing Corynebacterium glutamicum proteoforms through top-down proteomics. Sci Rep 13, 2602 (2023). https://doi.org/10.1038/s41598-023-29857-6

Download citation

Received: 07 December 2022
Accepted: 11 February 2023
Published: 14 February 2023
DOI: https://doi.org/10.1038/s41598-023-29857-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.