Introduction

As an advancement in biological and recombinant technologies, peptide drugs have become an important class of clinical drugs because of their efficacy. However, their inherent instability must be addressed in terms of peptide drug use and transport. To overcome these issues, a method that can precisely quantify a drug’s purity is of central importance. Compared with traditional quantification methods such as immunoassay and liquid chromatography - Ultraviolet (LC-UV), the mass balance method is advantageous because of its high precision and the traceability of its results to SI units. As a basic guideline for the establishment of chemical reference standards, the mass balance method has long been recommended by the WHO1 and was adopted by the European Pharmacopoeia and International Pharmacopoeia2,3. In addition to its ability to determine the content of only the main constituent, the mass balance method can also be used to analyze and quantify impurities in peptide drugs. In general, amino acid dehydration has been used to synthesize peptide drugs, and mistakes such as omission, loss, or increase of the amino acids may occur during this process; peptide decomposition may also create impurities. Although some peptide impurities are similar to the principal component in terms of composition and construction, their chemistries are different. Therefore, it is necessary to identify and quantify peptide impurities. In addition, when their impurities are quantified separately, peptide drugs can be quantified more precisely.

Immunologists perform amino acid analysis by liquid chromatography-isotope dilution mass spectrometry (LC-IDMS) or liquid chromatography-mass spectrometry (LC-MS). Due to their incompatibility with molecules larger than approximately 10 kDa, protein quantification by these methods usually requires proteolytic digestion of large analytes with an enzyme such as trypsin to cleave it into a set of smaller peptides, one of which, the signature peptide, is subsequently used as a surrogate for the protein for quantification4. In the above methods, synthetic peptides may be used as internal or external standards to quantify the cleaved peptides. Toth et al.5 demonstrated the applicability of an integrated immobilized enzyme reactor-liquid chromatography-tandem mass spectrometry (IMER-LC-MS/MS) platform for the quantitation of apolipoproteins in serum using synthetic target peptides as internal standards. Van den Broek et al.6 evaluated the tryptic digestion efficiency for implementation in quantitative clinical chemistry proteomics using a synthetic stable-isotope-labeled standard. Bronsema et al.7 summarized the different options for internal standardization in the absolute targeted quantification of protein biopharmaceuticals by LC-MS/MS using synthetic stable-isotope-labeled peptides as internal standards. For these applications, the purity of the synthetic peptides is essential for quantification.

In this paper, we chose synthetic glucagon consisting of 29 amino acids as an example and adopted the mass balance method to determine its purity. This method is a way to accurately quantify sample impurities such as organic and inorganic compounds and moisture, among others, and then deduct the mass of the impurities from the total sample mass. Isotope dilution mass spectrometry (IDMS) was used to quantify the peptide impurities in our sample. IDMS is suitable as a definitive method because it does not depend on the sample recovery, has high precision, and can be tested for bias and unknown interferences8. IDMS methods involve adding a labeled version of the analyte as an internal standard to the sample, followed by sample processing and subsequent measurement of the peak area ratio of unlabeled to labeled analyte by LC-MS. IDMS is recognized as a primary method9,10,11 and has been widely used to establish reference measurement procedures for biomarkers such as creatinine12,13,14, glucose15,16,17, cholesterol18,19,20, urea21,22, uric acid23,24 and triglycerides8,25. IDMS is also a highly accurate quantitative method. Compared with common quantitative methods, IDMS has numerous advantages, such as low uncertainty, high precision, high accuracy, and results that can be traced to SI units26.

The purity of synthetic glucagon, as determined by LC-UV, is reportedly higher than 98%. However, this purity is inaccurate based on comparison with the result obtained from the mass balance method. Using the mass balance method, we accurately quantified the impurities in synthetic glucagon separately and then determined the glucagon content by subtracting the masses of all impurities from the total sample mass.

Results

Concentration of TFA,water and inorganic ions in synthetic glucagon

Five concentrations of pure TFA were used for this analysis, with high linearity (R2 = 0.9999) as shown in Fig. 1. The accurately calculated TFA concentration was an approximately intermediate value within the five TFA concentrations in the gradient. After calculation, the content of TFA in the synthetic glucagon was 103.03 mg/g. The results of the TFA contents in the sample are shown in Table 1.

Figure 1
figure 1

Five-point TFA standard curve.

Table 1 Concentration of TFA and Water in Synthetic Glucagon.

In this experiment, the Karl Fischer titration method was used to determine the moisture in the sample. The principle is that the Karl Fischer reagent in the electrolysis cell of the instrument is injected into the aqueous sample when the equilibrium is reached, and the water participates in the redox reaction of iodine and sulfur dioxide. In the presence of pyridine and methanol, the iodine consumed by the reaction is again electrolyzed at the anode so that the redox reaction continues until the water is completely consumed. For multiple measurements, the data with a relative standard deviation (RSD) of three consecutive measurements within 3% was taken as the experimental result. The water content was 50.2 mg/g. The results of the water contents in the sample are shown in Table 1.

In this study, Cl, Fe, Si, Ca in synthetic glucagon were detected, the content of all inorganic ions is less than 0.1%, so all can be ignored. The contents of the main inorganic ions in the sample are shown in Table 2.

Table 2 Concentrations of the Main Inorganic Ions in Synthetic Glucagon.

Concentration of impurity peptides in synthetic glucagon

The impurities cannot be discriminated by LC-UV because the RTs of the peptide impurities with similar amino acid sequences to that of the main component are close to the RT of the main component. The peaks of impurities with low concentrations in the sample are small and frequently masked by the main component peak. Therefore, the impurities were qualitatively analyzed based on m/z differences by LC-MS in this study. The m/z values of all peptide impurities were different from the m/z values of each other and of the main component. The method was then used for qualitative analysis in an international data comparison. The results of the main peptide impurities in the sample are shown in Table 3. We synthesized nine impurity peptides by their amino acid sequence and confirmed the purity of each synthetic peptide by isotope dilution mass spectrometry, which is a guarantee for accurate quantitation of impurity peptides. We selected three amino acids (Ala, Leu, and Phe) to determine the purity of each peptide. These three amino acids are reference materials, ensuring the accuracy and traceability of the results. Using Leu as an example, the concentration of amino acids after hydrolysis was calculated as follows:

$${c}_{Leu}=\frac{P{P}_{H}{m}_{s}[{R}_{sample}({I}_{1}-{I}_{2})-({I}_{1}{R}_{2}-{I}_{2}{R}_{1})]}{m({R}_{1}-{R}_{2})}$$
(1)

where P is the purity of Leu, PH is the hydrolysis efficiency of the peptide, ms is the mass of isotopically labeled Leu, Rsample is the Leu/isotopically labeled Leu area ratio in the sample solution, I1 is the Leu/isotopically labeled Leu mass ratio in the lower-level standard solution, I2 is the Leu/isotopically labeled Leu mass ratio in the higher-level standard solution, R1 is the Leu/isotopically labeled Leu area ratio in the lower-level standard solution, R2 is the Leu/isotopically labeled Leu area ratio in the higher-level standard solution, and m is the sample mass.

Table 3 Main Peptide Impurities in Synthetic Glucagon Determined by LC-MS.

The mass fraction of the total peptide calculated from Leu (PPep, Leu) was determined as follows:

$${P}_{{\rm{Pep}},{\rm{Leu}}}=\frac{c{}_{{\rm{Leu}}}M_{{\rm{Pep}}}}{{N}_{{\rm{Leu}}}{M}_{{\rm{Leu}}}}$$
(2)

where MPep is the molecular weight of the peptide impurity, MLeu is the molecular weight of Leu, and NLeu is the number of Leu residues in the peptide impurity.

The mass fractions of the total peptide calculated for Phe and Ala (PPep,Phe and PPep,Ala) were obtained using equations similar to the above two equations. Then, the mass fraction of the total peptide was calculated as follows:

$${P}_{{\rm{Pep}}}=\frac{{P}_{{\rm{Pep}},{\rm{Leu}}}+{P}_{{\rm{Pep}},{\rm{Phe}}}+{P}_{{\rm{Pep}},{\rm{Ala}}}}{3}$$
(3)

According to Eq. (3), the uncertainty in the content of the synthetic peptide impurity can be determined as follows:

$${u}_{pep}=\frac{\sqrt{{{u}_{leu}}^{2}+{{u}_{phe}}^{2}+{{u}_{ala}}^{2}}}{3}$$
(4)

where upep is the uncertainty in the peptide impurity content. The numbers 1 to 9 represent the nine peptide impurities, and their purities are shown in Table 4.

Table 4 Information of the Nine Peptide Impurities found in Synthetic Glucagon.

The mass fraction of synthetic glucagon

Since inorganic impurities can be neglected, the mass fraction of glucagon was determined as follows:

$${w}_{glucagon}=1000-{w}_{w}-{w}_{TFA}-\sum {w}_{xi}$$
(5)

where Wglucagon (mg/g) is the mass fraction of glucagon determined using the mass balance method, WW (0.502 mg/g) is the mass fraction of water in the glucagon sample, WTFA (103.03 mg/g) is the mass fraction of TFA in the glucagon sample, and Wxi (0.0112 mg/g) is the mass fraction of peptide impurities in the glucagon sample.

In this way, the content of glucagon in the sample was determined to be 896.36 mg/g.

$${u}_{{({\rm{w}}}_{glu})}=\sqrt{{u}_{({w}_{w})}{}^{2}+{u}_{w{(}_{IM})}{}^{2}+{u}_{w{(}_{TFA})}{}^{2}}$$
(6)

where \({u}_{{({\rm{w}}}_{glu})}\) is the uncertainty in the glucagon content, \({u}_{({w}_{w})}\) is the uncertainty in the water content, \({u}_{w{(}_{IM})}\) is the uncertainty in the contents of all peptide impurities, and \({u}_{w{(}_{TFA})}\) is the uncertainty in the TFA content. Using Eq. (6), the uncertainty in the glucagon content was determined to be 0.683 mg/g.

The purity of synthetic glucagon reported by the manufacturer is 983.72 mg/g, and the result determined using the mass balance method was 896.36 mg/g, resulting in a difference of 87.36 mg/g. We also analyzed the synthetic glucagon solution by LC-UV and obtained a purity of over 970 mg/g, with small visible impurities. This result is similar to that of the manufacturer. Thus, the result obtained by determining the content of only the main constituent and not the sample impurities using LC-UV is not accurate because some impurities do not absorb ultraviolet radiation, while others may have absorption features that overlap with those of the main constituent. Compared to LC-UV, the mass balance method is an absolute quantitative method that could be traced to SI units.

The Pharmacopeia of US and Europe lists several requirements for impurities in peptide drugs. First, the impurity must be quantified if its content is >1%; second, the impurity should be analyzed qualitatively if its content is >0.5%; and finally, the presence of an impurity must be reported if its content is >0.1%. According to these requirements, the water and TFA reported herein as impurities in synthetic glucagon should be quantified. The reason that such guidelines would require reporting for such small contents of peptide impurities is possibly because the chosen synthetic peptide possesses a small molecular weight and simple structure. Extremely small mass peptide impurities in peptide drugs would have a significant effect on the efficacy, especially for sensitive patients. On the other hand, the determination of very low levels of peptide impurities can also reflect the level of development of proteins and peptides drugs. The peptide impurities that exist in some peptide drugs have high molecular weights and complex structures, which may not be overlooked when using the mass balance method for their quantification.

Conclusions

We developed a mass balance method to quantify synthetic peptides by accurately determining the content of impurities. Mass balance provides a reference method for the qualitative and quantitative analysis of impurities in peptide drugs and contributes to current research efforts aimed at determining how impurities affect efficacy. The type and content of impurities in peptide drugs are closely related to their production methods, processes, and the structure of the drug itself. This study investigated the impurities of synthetic glucagon by solid-phase method. Generally speaking, the impurities produced under the same process conditions are the similar. This research has good reference significance for the determination of peptide drug impurities in the solid-phase synthesis process production plant. For the quantification of unknown impurities, the structure of the impurities must be determined before quantification, otherwise accurate quantification cannot be achieved under current technical conditions. In addition, in this work, we accurately quantified synthetic peptides that could be used as target peptides for determining protein purity. The bias could be decreased by determining the purity of the synthetic peptide using the mass balance method before quantifying the protein using the synthetic peptide as an internal or external standard.

Methods

Experimental

Reagents and chemicals

Synthetic glucagon and nine peptide impurities were acquired from GenScript Biological Technology (Nanjing, China). Deionized water was obtained from a Milli-Q Integral system (resistivity = 18.2 MΩ/cm). Three amino acids (phenylalanine (Phe, 99.4%), alanine (Ala, 99.4%), and leucine (Leu, 99.1%)) were purchased from Sigma (Tokyo, Japan), and three labeled amino acids (C- and N-labeled Phe, C- and N-labeled Ala, and C- and N-labeled Leu, each with a purity of 99%) were purchased from Cambridge Isotope Laboratories Inc. (Andover, USA). Acetonitrile was supplied by Merck KGaA (Darmstadt, Germany). Formic acid was supplied by Fisher Scientific (Shanghai, China). Hydrochloric acid was supplied by Beijing Chemical Works (Beijing, China). Karl Fischer (KF) reagents were obtained from Sigma-Aldrich (Shanghai, China). Trifluoroacetic acid (TFA, purity = 99.9%) was obtained from J&K Scientific (Beijing, China). HCl (analytical grade) was purchased from Beijing Chemical Works (Beijing, China).

Apparatus

An electrothermostatic blast oven (DHG-9140A) from Yiheng Scientific (Shanghai, China) and a vacuum drying oven (VC 50) from Salvis Vacuucenter (Schweiz, Switzerland) were used for polypeptide chain hydrolysis. An Orbitrap FusionTM LumosTM TribridTM mass spectrometer from Thermo Fisher Scientific was used for sequence identification of peptide impurities. A Dionex™ ICS-5000+ capillary tube high-pressure ion chromatography (HPIC) system from Thermo Fisher Scientific (Waltham, MA, USA) was used for IC. A 6410 triple quadrupole mass spectrometer from Agilent Technologies (Santa Clara, USA) was used for mass spectrometric detection. A Karl Fischer coulometer was obtained from Mettler Toledo (Shanghai, China). An Agilent 1200 instrument was obtained from Agilent Technologies (Santa Clara, USA). An Agilent 8900 inductively coupled plasma mass spectrometer (ICP-MS) (Santa Clara, USA) was used for ICP-MS analysis.

Determination of TFA in synthetic glucagon

In this study, we used the external standard method to quantify the TFA in the synthetic peptides. First, the pure TFA used in the external standard method was quantified by the nuclear magnetic method. Then, the pure TFA was configured into a series of gradient concentration solutions by accurate weighing and detected by ion chromatography to draw a standard curve. The content of TFA in the synthetic peptide was calculated from the standard curve. The purity of the TFA standard determined by nuclear magnetic resonance spectroscopy was 992 mg/g, which could be traced to SI units (acesulfame potassium reference material as the internal standard). The conditions of 19F-qNMR for determining the purity of TFA are shown below:

Ofloxacin (85~95 mg) was accurately weighed in an airtight bottle; 30 mL of DMSO was added; and 21~29 mg of TFA was injected by syringe into the bottle. After the ofloxacin dissolved, the solution was transferred to an NMR tube. A capillary containing D2O was inserted for locking onto the magnetic field.

Instrument: Bruker 400 Ultrashield Plus

Pulse angle: 30 degrees

Relaxation delay: 5 s

Scan number: 128

Offset of pulse: −96.4 ppm

Spectral width: 237.1643 ppm

Size of FID: 65536

Relaxation delay: 5 s

The content of TFA in the sample was quantified using pure TFA having a determined purity. The ion chromatogram of TFA in the sample was compared with that of TFA in a standard solution, which qualitatively confirmed the TFA impurity in synthetic glucagon. We first prepared the sample solution at an appropriate concentration using ultrapure water. A series of TFA standard solutions with a mass concentration gradient was prepared from a TFA stock solution using ultrapure water. The gradient solutions were then detected by IC to yield a standard curve between the concentration and peak area. The TFA standard solution and the sample solution are simultaneously detected by the IC, and then, the TFA of the sample is accurately quantified using the plotted standard curve and the peak area generated by the sample.

Chromatography: columns, Dionex IonPac AG19 anionic guard column (50 mm × 4 mm) and IonPac AS19 anion-exchange separation column (250 mm × 4 mm); column temperature, 30 °C; leachate, 55 mM KOH; isocratic elution, 1.0 mL/min; injection volume, 25 µL; and detector, automatically renewable inhibitory conductance detector (ASRS, 4 mm, and curb., 137 mA).

Determination of the water in synthetic glucagon

In this experiment, the Karl Fischer titration method was used to determine the moisture in the sample. To avoid the influence of moisture in the air, we determined the amount of water in synthetic glucagon in a glovebox. The humidity of the glovebox was controlled within a range of 1% by aerating with high-purity nitrogen. Three samples placed inside of a glovebox at room temperature for 1.5 h were direct added to a KF reaction vessel. Then, the samples were used for detection under the following conditions: electrolysis rate, “normal”; polarization current, 2 µA; end voltage, 100 mV; and minimum titration time, 180 s. The drift was determined over 180 s. The recovery of the electrode without a diaphragm using Coulomat AG was 104.51% (0.54%). For multiple measurements, the data with a relative standard deviation (RSD) of three consecutive measurements within 3% was taken as the experimental result.

Determination of the inorganic ions in synthetic glucagon

The inorganic ions in glucagon were determined by inductively coupled plasma mass spectrometry. A certain weight sample was added to tube of 2 ml for ICP-MS. The following ICP-MS parameters were used:

RF power: 750–850 W

Carrier gas flow rate: 0.95 L min-1

Sampling depth: 6 mm

Cones: Nickel

Cell gas flow rate: 2 mL min-1 helium

Extraction lens: −100 V to −125 V

Octopole bias: −16 V

Quadrupole bias: −14 V

Measured isotopes/dwell time: 79Br (0.1 s), 81Br (0.1 s)

Optional plasma gas: N2 at 55 psi

Determination and quantification of peptide impurities in synthetic glucagon

The sample was analyzed by LC-UV, and the resulting chromatogram (Fig. 2) revealed impurities in the sample. Since the differences in the amino acid sequences of the peptide impurities and the main component are very small, their retention times (RTs) are very similar, and all of the peptide impurities cannot be chromatographically separated from the main component. Therefore, we analyzed the sample by LC-MS, which differentiates the impurities by the m/z values. The conditions were used:

Figure 2
figure 2

Chromatogram of a synthetic glucagon solution analyzed by LC.

Instrument: Agilent 6410 Triple Quard LC/MS

Chromatographic column: Agilent Zorbax SB-Aq 3.5 µm, 2.1 × 150 mm

Mobile phase: A: 0.8 mmol/L perfluorooctanoic acid + 0.05% trifluoroacetic

acid aqueous solution B: acetonitrile

Injection volume: 10 µL

Column temperature: 30 °C

Gradient elution at a flow rate of 0.2 mL/min was set as follows: 1% to 30% B over 5 min, 30% to 40% B over 5 min, 40% to 50% B over 5 min, 50% to 80% B over 1 min, 80% B for 3 min, 80% to 1% B over 3 min, 1% B for 9 min.

The MS1 spectra were analyzed by Protein Discovery software (Thermo Fisher, version1.4). Nine (Table 4) of the peptide impurities that had contents over 0.1% were selected for quantification.

Then, we synthesized nine impurity peptides according to their amino acid sequences and then determined the purities of the nine synthesized peptides by amino acid hydrolysis. In the hydrolysis experiment, three amino acids (Ala, Leu, and Phe) were used to determine the purity of each peptide. These three amino acids are all reference materials, which ensure the accuracy and traceability of the analysis results. A solution of the three amino acids of known weight is added to a known weight of synthetic peptide solution, and the resulting solution is hydrolyzed at 110 °C for 24 h. After filtering, the hydrolyzed solution was detected by MS. The three amino acids are well separated and have the same RT as their corresponding C- and N-labeled amino acids. The results are shown in Fig. 3. By calculating the peak area ratio between the amino acids hydrolyzed from the peptide impurities and the amino acids of known weight, the amino acid content in solution is obtained. Then, we determine the purities of the nine peptides used as calibrants for the quantification according to their amino acid sequences. The detection conditions for the three amino acids were as follows: Ala, m/z 90.0 → 44.1; Leu, m/z 132.0 → 86.1; and Phe, m/z 166.0 → 120.0. The detection conditions for the three labeled amino acids were as follows: L-Ala, m/z 94.0 → 47.1; L-Leu, m/z 139.0 → 92.1; and L-Phe, m/z 176.0 → 129.0.

Figure 3
figure 3

Mass spectra of the amino acids hydrolyzed from the peptide impurities.

We then quantified the impurity peptides in the sample using synthetic peptides of known purities. A concentration gradient of solutions of each synthetic peptide was prepared, and these solutions were separately subjected to mass spectrometry to obtain a regression curve of the concentration of each impurity peptide. Then, the glucagon sample solution was prepared, and the nine impurity peptides in the sample were separately detected by mass spectrometry. The peak area of each impurity peptide is brought into a regression curve corresponding to the pure peptide to calculate the impurity peptides of the glucagon sample. Since the synthetic impurity peptides are detected by mass spectrometry, the optimal conditions for mass spectrometry need to be determined. Each impurity peptide was configured as a solution, and the condition parameters were optimized by mass spectrometry. The m/z of each peptide was determined by mass spectrometry in the SCAN mode. Using the SIM mode, the Fragmentor of each parent ion is optimized to determine the best Fragmentor. The mass spectrometric conditions for detecting the impurity peptides in the sample are shown in Table 5.

Table 5 Mass Spectrometric Condition Information of the Nine Peptide Impurities Found in Synthetic Glucago.