One gene can give rise to many functionally distinct proteoforms, each of which has a characteristic molecular mass. Top-down mass spectrometry enables the analysis of intact proteins and proteoforms. Here members of the Consortium for Top-Down Proteomics provide a decision tree that guides researchers to robust protocols for mass analysis of intact proteins (antibodies, membrane proteins and others) from mixtures of varying complexity. We also present cross-platform analytical benchmarks using a protein standard sample, to allow users to gauge their proficiency.
Mutations, polymorphisms, RNA processing and post-translational modifications (PTMs) such as acetylation, methylation and phosphorylation can lead to a single gene producing many functionally distinct ‘proteoforms’1. These proteoforms can have different effects on important biological processes, including gene regulation, cell signaling and protein activity; consequently, the ability to characterize these species is essential for an understanding of the biological response to disease. The identity of a proteoform can often be inferred2 from an accurate experimentally determined intact mass3. One can increase the sensitivity of intact-mass-based proteoform identification by determining the relative abundance of a particular amino acid by using isotopic labeling, by using mass similarities to cluster proteoforms into gene families and by reducing the search space using sample-specific search databases2. Localizing PTMs, and in some cases the definitive proteoform identifier, requires tandem mass spectrometry (MSn) analysis. The measurement of intact protein mass followed by MSn has been coined ‘top-down’ mass spectrometry4,5,6,7,8, with its origins in Fenn and colleagues’ discovery that large biomolecules could be ionized9 and fragmented10,11,12 using electrospray ionization (ESI)-MS. Top-down MS protocols, unlike widely used bottom-up protocols13,14, do not require endoproteinase digestion before analysis, do not conflate proteoforms and tend to complement native MS analysis.
One advantage of ESI over the alternative ‘soft’ ionization method, matrix-assisted laser desorption-ionization (MALDI), is that ESI imparts more charge per protein. This enables the mass determination of large biomolecules using mass analyzers with moderate mass-to-charge ratio upper limits (for example, m/z ≤ 4,000), which happen to offer the highest resolving power. Higher charge per molecular mass also facilitates gas-phase fragmentation and, therefore, the characterization of primary sequence and PTMs by MSn (refs. 15,16). Because of this superior fragmentation and the ability to interface with liquid chromatography (LC) systems, ESI is used for most top-down MS experiments. Projects requiring rapid MS analysis17, the ability to analyze hundreds of proteins in a single spectrum, protein imaging capabilities, or less signal suppression by common protein buffer components18 may be better suited for MALDI-MS.
Compared to bottom-up workflows, top-down approaches provide additional layers of information, including detecting modifications that are removed or scrambled19 during peptide sample preparation (for example, S-thiolation), elucidating functional relationships (for example, cross-talk) between PTMs on the same protein molecule, characterizing drug–target interactions, observing important modifications on biopharmaceuticals, and identifying and quantifying distinct proteoforms that would have been convoluted by endoproteinase digestion20,21,22,23,24. In addition, sample preparation for intact protein MS comprises fewer steps than bottom-up approaches and does not require chemical modification (for example, reduction and alkylation), thereby reducing the number of experimental artifacts25. Current top-down sample cleanup methods (for example, protein precipitation26 and molecular weight cut-off (MWCO) ultrafiltration) are not applicable to all sample types or downstream MS analyses. The demand for robust, generally applicable methods for intact protein MS is the most common request made to members of the Consortium for Top-Down Proteomics27,28 (http://topdownproteomics.org/).
Our goal here is to address this unmet need, by providing a guide to enable users with all levels of expertise to acquire high-quality intact protein mass spectra by ESI-MS. First we discuss signal suppression associated with common buffer components and biotherapeutic excipients. This provides the rationale for most failed intact MS measurements and, in addition, a path to designing MS-compatible buffers. Then, we present a decision tree based on sample composition and experimental goals, which guides users to a best-practices protocol and corresponding benchmark data.
Origins of signal suppression and signal spreading
Biological, biochemical and biotherapeutic sample preparations usually contain numerous interfering substances (for example, salts, detergents, chaotropes and buffers) that lead to signal suppression during ESI-MS analysis. To provide a theoretical context, we describe the two major drivers of the quality of intact protein (positive ion) ESI-MS and how these are affected by interfering substances. The first driver of quality is the formation of desolvated protein ions, which can be understood in terms of a few critical steps during the ESI process16,29,30. Interfering substances generally affect the ESI process after the formation of nanodroplets at the Rayleigh charge limit. Two salient, often opposing, processes that occur within these nanodroplets are the partitioning of net charge toward the droplet surface and the minimization of solvation energy. Polar species such as salts and native proteins partition toward the droplet interior to optimize solvation energy; their ionization, therefore, requires evaporation of solvent molecules16. Hydrophobic species such as detergent monomers and unfolded proteins migrate to the droplet surface to optimize solvation energy and, in a faster process that requires less energy, evaporate or are ejected. Many of the techniques presented here for reducing signal suppression can be rationalized within the framework above. For example, organic solvents that decrease surface tension should promote the ionization of both polar and nonpolar analytes; detergents partition to the surface, where they can outcompete analytes for a limited number of protons; organic solvents and acids that unfold proteins should promote ejection-based ionization; native MS (nMS) requires greater desolvation energy and is more sensitive to polar contaminants.
The second driver of the quality of intact protein MS is signal spreading (that is, the distribution of the signal from a single proteoform across multiple channels), which increases with protein mass. Each channel has its own respective noise; consequently, the cumulative noise increases proportionally to the number of channels. The ESI process promotes signal spreading, via adduct formation, by increasing the concentrations of interfering substances and proteins. Heavy isotopes and charge states further distribute signal intensity across multiple channels; the former can be mitigated by isotope depletion31. Here we describe experimental techniques that minimize signal spreading (increase signal-to-noise ratio, or S/N), including using nMS to reduce the number of charge states, and the use of volatile salts (for example, ammonium acetate) or purification to minimize the effects of alkaline salts.
Signal suppression by common buffer components
Using the intact protein standard mixture (ubiquitin, myoglobin, trypsinogen and carbonic anhydrase) established by the National Resource for Translational and Developmental Proteomics (NRTDP) (http://nrtdp.northwestern.edu/protocols/), we evaluated common buffer components (Fig. 1) to quantify the concentration required for 50% signal suppression during direct infusion ESI. By analogy to half-maximum inhibitory concentration (IC50) nomenclature, we termed this metric the half-maximum suppression concentration (SC50) (Fig. 1, Supplementary Fig. 1). At their typical concentrations, all common buffer additives suppressed ESI signal considerably. Consistent with the mechanisms of ESI ionization described above, detergents produced the most signal suppression, less volatile (for example, metallic) salts produced intermediate suppression and volatile components lowest suppression. Additional details of the experimental parameters used here are provided in Supplementary Notes 1 and 2.
The SC50 values given in Fig. 1 allow users to design MS-compatible buffers. In addition, the SC50 and buffer composition serve as the entry point into the decision tree outlined below, leading users to the appropriate protocol. Although the trends in SC50 values reported here should generally be consistent across MS platforms, parameter-dependent variations in the reported values are likely (in particular, flow rate, voltages, temperatures, and pressures that affect ionization and desolvation). Here, for example, we calculated SC50 obtained by direct infusion using a standard microflow ESI source (about a microliter per minute), but nano-ESI (less than a microliter per minute) is less affected by salts because of the order-of-magnitude decrease in initial droplet size32,33.
Intact protein MS (IPMS) decision tree
The IPMS decision tree (Fig. 2) directs practitioners to a protocol or a combination of protocols based on buffer composition, the number of proteins in the sample, and whether native or denaturing conditions are to be used. Consider, for example, a purified protein in phosphate-buffered saline (PBS). Based on the 1.5 mM SC50 exhibited by NaCl (Fig. 1c) and the 137 mM NaCl present in PBS, a protein sample in PBS requires a 91-fold dilution to achieve 50% of the potential MS signal. Therefore, if the protein concentration is greater than 90 µM and salt adducts will not impede data analysis, the sample can be diluted following Protocol 1. Otherwise, sample cleanup by ultrafiltration using spin cartridges with a MWCO-membrane is recommended following Protocol 2.
Interest in certain PTMs (for example, metallation) or protein complex quaternary structure would dictate the use of native MS methods following Protocol 4b; otherwise the denaturing MS Protocol 4a is recommended. Depending on the complexity of the sample, additional separation techniques such as GELFrEE may be required (Supplementary Fig. 2). The objective of this decision tree is to provide a proven workflow for any sample, not to rule out alternative methods. For example, depending on sample stability, user expertise and available resources, precipitation (Protocol 3), size exclusion ‘spin cartridges’, or LC (Protocol 5) could be suitable alternatives to MWCO ultrafiltration. All protocols and benchmarks referenced by the decision tree and alternative methods are summarized below and further detailed in Supplementary Notes 1−5 and Supplementary Protocols 1−5.
Protein standards and benchmarks
To promote standardization and allow users to benchmark their own data using readily available proteins, we provided representative results for each protocol using the following commercially available standards: (i) the NRTDP intact protein standard mixture (see Supplementary Note 1 for preparation instructions), (ii) NIST monoclonal antibody reference material 8671 (NISTmAb), containing humanized IgG1ĸ in 12.5 mM L-histidine, 12.5 mM L-histidine HCl (pH 6.0), and (iii) Sigma bacteriorhodopsin from Halobacterium salinarum (B0184). Benchmarks for mass accuracy depend upon the instrumentation platform and have been reviewed3,34,35,36,37,38,39. Rules of thumb include requiring 10 p.p.m. accuracy for modern Fourier transform MS and 20 p.p.m. accuracy for modern quadrupole time-of-flight (QTOF) MS. We suggest the use of ProForma notation40 for standardized proteoform nomenclature, and note that the PeptideMass tool (https://web.expasy.org/peptide_mass/) can be used to calculate the mass of a given sequence or of proteoforms contained in the UniProt database.
Protocol 1: sample preparation by dilution of interfering substances
Consistent with the mechanisms of ESI and signal spreading detailed above, common buffer components render proteins undetectable by MS (Fig. 3). Minimally complex, concentrated protein solutions can often be analyzed by direct infusion, following dilution to ~1 µM final protein concentration in the appropriate sample buffer. Users should consider using this protocol if dilution can decrease the concentration of a given interfering substance below its SC50 value (Fig. 1, Supplementary Protocol 1). Assuming a practical upper limit of ~10 mM protein concentration, this protocol is potentially applicable to any of the components listed in Fig. 1. As detailed above, however, nMS utilizes an ESI process that is more sensitive to many interferents, including salts. Consequently, dilution is less likely to adequately improve nMS. Protocol 4 describes methods to dilute native proteins into whichever solution will be used to introduce samples to the MS. However, mass spectra obtained by this method have the lowest S/N of any of the protocols described here and may contain adducts.
Protocol 2: sample preparation using MWCO ultrafiltration
We recommend remediating nonvolatile salt adducts by buffer exchange into a solution of volatile salts. The MWCO of the ultrafiltration device should not exceed half the molecular mass of any given protein in a sample to prevent possible sample loss. No particular pH is optimal for all proteins, but pH extremes should be avoided, as should pH that is equivalent to a protein’s pI, where protein solubility is at a local minimum41. We recommend using ammonium acetate throughout these protocols owing to its volatility and ability to act as a stabilizing background electrolyte during ESI42. Ammonium acetate provides maximal buffering around pH 4.75 (acetate) and 9.25 (ammonium), and results in a neutral pH upon dissolving in water (approximate pH 6.5−7). Before adding protein sample, MWCO-ultrafiltration devices should be rinsed with the appropriate buffer. Additional details for this method can be found in Supplementary Protocol 2.
Protocol 2a, soluble proteins
On the basis of the protein masses in the NRTDP intact protein standard, we recommend using a MWCO of 3 kDa according to the manufacturer’s instructions. The protein preparation should be subjected to three (1:20 dilution) buffer exchanges into 10 mM ammonium acetate (pH 6.5) using a MWCO-ultrafiltration device, followed by an additional three exchanges into 2.5 mM ammonium acetate (pH 6.5) (exemplary data in Fig. 3, Supplementary Figs. 3 and 4, see also Supplementary Protocol 2a and Supplementary Note 3). Denaturing and nondenaturing samples can then be diluted and introduced to the MS as described below in Protocol 4a.
Protocol 2b, native membrane proteins
Membrane proteins are estimated to account for 23% of the total human proteome and represent ~60% of targets for currently approved drugs43,44. The mass analysis of native, intact membrane proteins can further provide key information regarding stoichiometry, ligand binding and lipid association. A typical analysis of a membrane protein complex requires either size-exclusion chromatography (SEC) or MWCO ultrafiltration to remove alkali salt adducts while maintaining the detergent used to solubilize the protein (Supplementary Protocol 2b)45. This differs fundamentally from the MWCO ultrafiltration used during filter-aided sample preparation (FASP) to improve the bottom-up proteomics analysis of membrane proteins, which removes detergents46,47. For users interested in native membrane proteins, we recommend the protocols of Robinson and coworkers45. Their protocols are based on comprehensive optimization and include a complete list of non-ionic detergents compatible with MS and detailed sample preparation considerations. We demonstrate an example application of Robinson and coworkers’ protocols for the native tetramer of Aquaporin Z (AqpZ) from Escherichia coli (Supplementary Fig. 5).
Protocol 3: sample preparation using protein precipitation
Common precipitation protocols use organic solvents to agglomerate proteins while leaving small molecules, including salts and detergents, solubilized. Whereas MWCO ultrafiltration using Protocol 2a does not rescue protein signal from a preparation containing harsh surfactants (for example, SDS and Triton), precipitation of proteins following Protocol 3 does (Fig. 3, Supplementary Protocol 3). A volume ratio of 1:1:4:3 of aqueous protein sample:chloroform:methanol:water is recommended to precipitate proteins26. The supernatant is removed by aspiration, and the precipitated pellet can be further washed with one more addition and removal of methanol. Pellets are resolubilized for 15 minutes at −20 °C using a small volume of 80% (v/v) formic acid (~25% of the starting volume) and are then diluted to the starting sample volume with HPLC-grade water or a solution of volatile salts (for example, ammonium acetate)48. As an alternative method, acetone precipitation has the distinct advantage of leaving many proteins folded. This method, however, has been shown to modify proteins with +98 Da adducts49, requires longer incubation at −20 °C (at least 1 h), requires that all steps be performed at or below 0 °C to maximize resolubilization, and can be compromised by detergents.
Protocol 4a: denaturing direct-infusion MS
Denaturing direct-infusion ESI mass spectra can usually be obtained by introducing samples to the MS in a mixture of 49.95% HPLC-grade acetonitrile, 49.95% HPLC-grade water, and 0.1% formic acid (v/v). A 60:35:5 ratio of HPLC-grade methanol:water:acetic acid may be used as an alternative and, in some cases, can improve S/N9,50. As described above, the use of these organic solvents and acids results in efficient ionization from a droplet’s surface, often allowing MS analysis to be performed using instrumentation parameters typically used for peptides. A more detailed description of instrument parameters for the Bruker SolariX FT-ICR MS used during denaturing direct infusion studies is found in Supplementary Protocol 4a.
Protocol 4b: native direct-infusion MS
Although native MS protocols may not necessarily produce folded ions that match exactly to their in-solution structures, they can be used to achieve accurate mass measurements of native structures and complexes51. Consequently, native direct-infusion MS can provide unique structural information, including the characterization of labile PTMs, metal-binding sites, noncovalent interactions with small molecules, and protein tertiary and quaternary structure. Detergent-free samples can be infused directly in aqueous 2.5 mM ammonium acetate52, the same solution used in the final stage of Protocol 2a (concentrations of ammonium acetate up to 500 mM can even be used).
Figure 4 compares mass spectra of carbonic anhydrase in denatured and native states, with the intensity of the base peak in the native sample being about twofold higher than that of the denatured sample. This comparison was repeated in four additional labs on six different instruments to illustrate the possible range of relative intensities (Supplementary Fig. 6, Supplementary Protocol 4b). Membrane protein complexes with MS-compatible detergents can be infused directly from the final solution described in Protocol 2b45. To observe native membrane proteins, detergent ions must be removed from the protein–micelle complex by increased collisional activation. This may be achieved through an increase in collision voltage applied to the source or the collision cell (typically 50−200 V), but it could require additional critical parameters that are described in detail by Robinson and coworkers, and in part in Supplementary Protocol 4b45,53,54.
Protocol 5: intact protein analysis using LC-MS
Ionization suppression by excipients and by other proteins generally makes the analysis of multiple proteins and proteoforms by direct infusion intractable. For example, many ‘high-purity’ proteins (as judged by SDS-PAGE) contain numerous proteoforms that cannot be reliably detected and quantitatively assessed without up-front separation55,56. Liquid phase separation approaches, including LC (for example, reversed-phase (RP), size-exclusion, ion exchange, chromatofocusing) and capillary electrophoresis techniques (for example, capillary zone electrophoresis, capillary isoelectric focusing) can remove excipients and provide the resolving power for deep characterization of proteins. As directed in the decision tree (Fig. 2), separation of particularly complex samples (>100 proteins) requires an additional dimension of separation before LC-MS. Supplementary Fig. 2 shows the use of GELFrEE separation prior to LC, which fractionates samples on the basis of protein molecular weights and has resulted in the largest number of characterized proteoforms to date57.
Protocol 5a: LC-MS of soluble proteins
RP-LC is recommended for all samples containing more than five unique proteins but is also a viable option for samples with fewer proteins, provided they do not contain high salt concentrations (>1 M) or harsh detergents. The recommended reversed-phase LC protocol is described in Supplementary Protocol 5a and at http://nrtdp.northwestern.edu/protocols/.
Figure 5 demonstrates that sufficient intact MS signal was attained, and four unique chromatographic peaks were observed, using Protocol 5a with a PLRP-S stationary phase (1,000-Å pore size, 5-µm particle size) on a Dionex UPLC coupled to a Thermo Orbitrap Elite. We provide benchmarks for this standard operating procedure (SOP), as well as for additional data acquired using Monolithic and C4 stationary phases, for six widely used platforms (Waters Xevo G2-S QTOF, Supplementary Fig. 7; Bruker Impact II QTOF and Bruker SolariX FT-ICR, Supplementary Fig. 8; Thermo Orbitrap Elite, Thermo Orbitrap Fusion Lumos, and Thermo Orbitrap QE-HF, Supplementary Figs. 9 and 10). To allow users to compare their performance with that of experienced operators using instruments that are operating within specifications, we report S/N for the platforms used here (Fig. 5). However, instrument vendors use proprietary, non-standardized techniques to preprocess data, display data and determine S/N, and, as a result, our data cannot be used for a cross-platform comparison. As an example of a viable alternative method that is notably better suited for proteoforms with similar mass and RP-LC retention (for example, deamidation), we provide a separation of the same protein mix using capillary zone electrophoresis (Supplementary Fig. 11).
Protocol 5b: intact membrane protein LC-MS
Denaturing LC-MS of intact membrane proteins is not straightforward because of their inherent hydrophobicity58,59. Whitelegge et al. provided the earliest example of denaturing LC-MS of membrane proteins using high concentrations of mobile phase additives and demonstrated that ESI of membrane proteins could achieve the 0.01% mass accuracy benchmark established for ESI of soluble proteins58. For thorough reviews of the current state of membrane protein analysis via LC-MS60,61 and the corresponding protocols, we direct readers to refs. 60,61,62.
Current denaturing LC-MS methods for membrane proteins use either size-exclusion63,64 or reversed-phase separation. Owing to the ease of implementation across a variety of MS platforms, we suggest analysis via reversed-phase LC-MS using a polystyrene-divinyl benzene co-polymer stationary phase (PLRP-S, 300 Å, Agilent). We do not recommend the use of long chain bonded stationary phases such as C8 and C18, as membrane proteins are likely to be retained on the column. As an example, we solubilized enriched bacteriorhodopsin from H. salinarum (Sigma B0184) in 88% formic acid to separate the protein from lipid contaminants. To avoid the risk of formic acid adduction (+28 Da), samples are immediately injected onto the column and solvent exchanged to much lower acid concentrations (0.1%). In the case of membrane protein preparations containing high enough concentrations of lipid contaminants to confound analysis or damage the column, we recommend precipitation following Protocol 3 before analysis. Proteins are eluted using an increasing gradient of 49.95% acetonitrile, 49.95% isopropanol, 0.1% formic acid. Figure 6 shows the analysis of denatured bacteriorhodopsin of H. salinarum following this protocol. Although elution efficiency for some integral proteins may fall well below 100%, PLRP-S columns can be regenerated with 90% formic acid injections. This protocol was performed in five labs on five different instrument platforms (Supplementary Fig. 12, Supplementary Protocol 5b). An example of an alternative LC-MS method using a more common stationary phase (ZORBAX RRHD 300SB-C3) is provided for aquaporin Z in Supplementary Fig. 5b.
Special methodological considerations for intact antibody mass spectrometry
With the increasing development of biotherapeutics and biosimilars in the pharmaceutical industry, and an increasingly stringent route to regulatory approvals, there is a growing need for intact antibody MS. Every protocol presented here can be applied to the analysis of intact antibodies (Fig. 3, Supplementary Fig. 13, Supplementary Note 4). However, as antibodies are relatively large and signal spreading increases in proportion to protein size, we recommend against the use of Protocol 1 (dilution) for any regulatory filing.
The IPMS decision tree (Fig. 2) guides practitioners of all levels toward broadly applicable methods to obtain high-quality intact mass spectra from any protein sample. The protocols described here have been scrutinized and optimized in over ten expert intact protein MS labs, and successfully applied in laboratories without experience in intact protein MS. We hope that these protocols will enable any research group to adopt intact protein mass analysis.
The accurate mass measurement of an intact protein is the sine qua non of top-down mass spectrometry, which can characterize how proteoforms interact and identify PTMs that are lost in other analyses. High-throughput top-down analysis of whole proteomes has proven successful in the unambiguous identification of hundreds of proteins and proteoforms from a single biological sample65 and revealed prevalent yet previously uncharacterized biologically relevant modifications66. Quantitative top-down proteomics has been used to identify disease-relevant differences in protein levels, an encouraging step forward in the field of proteomics-based personalized medicine67. Additionally, by using native mass spectrometry following the top-down workflow, one can observe previously unknown protein–protein interactions, protein–ligand binding, protein–cofactor association and protein-complex stoichiometry, and assess their relationships to important biological pathways68. We believe that by starting with intact mass analysis, using these intact protein MS protocols coupled to top-down MS analysis, and by identifying proteoforms rather than proteins, scientists can gain new insights into the human proteome. We also hope that these protocols serve as a starting point for users to push, even further, the current limits of high-molecular-weight mass spectrometry.
All general protocols are available as Supplementary Protocols.
Smith, L. M. & Kelleher, N. L. Proteoform: a single term describing protein complexity. Nat. Methods 10, 186–187 (2013).
Shortreed, M. R. et al. Elucidating proteoform families from proteoform intact-mass and lysine-count measurements. J. Proteome Res. 15, 1213–1221 (2016).
Mann, M. & Kelleher, N. L. Precision proteomics: The case for high resolution and high mass accuracy. Proc. Natl Acad. Sci. USA 105, 18132 (2008).
Shaw, J. B. et al. Complete protein characterization using top-down mass spectrometry and ultraviolet photodissociation. J. Am. Chem. Soc. 135, 12646–12651 (2013).
Ge, Y. et al. Top down characterization of larger proteins (45 kDa) by electron capture dissociation mass spectrometry. J. Am. Chem. Soc. 124, 672–678 (2002).
Siuti, N. & Kelleher, N. L. Decoding protein modifications using top-down mass spectrometry. Nat. Methods 4, 817–821 (2007).
Ge, Y., Rybakova, I. N., Xu, Q. G. & Moss, R. L. Top-down high-resolution mass spectrometry of cardiac myosin binding protein C revealed that truncation alters protein phosphorylation state. Proc. Natl Acad. Sci. USA 106, 12658–12663 (2009).
Toby, T. K., Fornelli, L. & Kelleher, N. L. Progress in top-down proteomics and the analysis of proteoforms. Annu. Rev. Anal. Chem. 9, 499–519 (2016).
Fenn, J. B., Mann, M., Meng, C. K., Wong, S. F. & Whitehouse, C. M. Electrospray ionization for mass spectrometry of large biomolecules. Science 246, 64–71 (1989).
Loo, J. A., Edmonds, C. G. & Smith, R. D. Tandem mass spectrometry of very large molecules: serum albumin sequence information from multiply charged ions formed by electrospray ionization. Anal. Chem. 63, 2488–2499 (1991).
Reid, G. E. & McLuckey, S. A. ‘Top down’ protein characterization via tandem mass spectrometry. J. Am. Soc. Mass Spectrom. 37, 663–675 (2002).
Alexandrov, M. L. et al. Extraction of ions from solutions under atmospheric pressure as a method for mass spectrometric analysis of bioorganic compounds. Rapid Commun. Mass Spectrom. 22, 267–270 (2008).
McCormack, A. L. et al. Direct analysis and identification of proteins in mixtures by LC/MS/MS and database searching at the low-femtomole level. Anal. Chem. 69, 767–776 (1997).
Zhang, Y., Fonslow, B. R., Shan, B., Baek, M.-C. & Yates, J. R. Protein analysis by shotgun/bottom-up proteomics. Chem. Rev. 113, 2343–2394 (2013).
Kelleher, N. L. et al. Top down versus bottom up protein characterization by tandem high-resolution mass spectrometry. J. Am. Chem. Soc. 121, 806–812 (1999).
Konermann, L., Ahadi, E., Rodriguez, A. D. & Vahidi, S. Unraveling the mechanism of electrospray ionization. Anal. Chem. 85, 2–9 (2013).
Lohnes, K. et al. Combining high-throughput MALDI-TOF mass spectrometry and isoelectric focusing gel electrophoresis for virtual 2D gel-based proteomics. Methods 104, 163–169 (2016).
Fenselau, C. & Demirev, P. A. Characterization of intact microorganisms by MALDI mass spectrometry. Mass Spectrom. Rev. 20, 157–171 (2001).
Auclair, J. R. et al. Artifacts to avoid while taking advantage of top-down mass spectrometry based detection of protein S-thiolation. Proteomics 14, 1152–1157 (2014).
Auclair, J. R. et al. Post-translational modification by cysteine protects Cu/Zn-superoxide dismutase from oxidative damage. Biochemistry 52, 6137–6144 (2013).
Ayaz-Guner, S., Zhang, J., Li, L., Walker, J. W. & Ge, Y. In vivo phosphorylation site mapping in mouse cardiac troponin I by high resolution top-down electron capture dissociation mass spectrometry: Ser22/23 are the only sites basally phosphorylated. Biochemistry 48, 8161–8170 (2009).
Martinez-Garcia, E. et al. The MMSET histone methyl transferase switches global histone methylation and alters gene expression in t(4;14) multiple myeloma cells. Blood 117, 211–220 (2011).
Wu, D., Struwe, W. B., Harvey, D. J., Ferguson, M. A. J. & Robinson, C. V. N-glycan microheterogeneity regulates interactions of plasma proteins. Proc. Natl Acad. Sci. USA 115, 8763 (2018).
Doll, S. & Burlingame, A. L. Mass spectrometry-based detection and assignment of protein posttranslational modifications. ACS Chem. Biol. 10, 63–71 (2015).
Chait, B. T. Mass spectrometry: bottom-up or top-down? Science 314, 65 (2006).
Wessel, D. & Flügge, U. I. A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 138, 141–143 (1984).
Fornelli, L. et al. Top-down proteomics: where we are, where we are going? J. Proteomics 75, 3–4 (2017).
Dang, X. et al. The first pilot project of the Consortium for Top-Down Proteomics: a status report. Proteomics 14, 1130–1140 (2014).
Cech, N. B. & Enke, C. G. Practical implications of some recent studies in electrospray ionization fundamentals. Mass Spectrom. Rev. 20, 362–387 (2001).
Metwally, H., McAllister, R. G. & Konermann, L. Exploring the mechanism of salt-induced signal suppression in protein electrospray mass spectrometry using experiments and molecular dynamics simulations. Anal. Chem. 87, 2434–2442 (2015).
Compton, P. D., Zamdborg, L., Thomas, P. M. & Kelleher, N. L. On the scalability and requirements of whole protein mass spectrometry. Anal. Chem. 83, 6868–6874 (2011).
Karas, M., Bahr, U. & Dülcks, T. Nano-electrospray ionization mass spectrometry: addressing analytical problems beyond routine. Fresenius J. Anal. Chem. 366, 669–676 (2000).
Juraschek, R., Dulcks, T. & Karas, M. Nanoelectrospray—more than just a minimized-flow electrospray ionization source. J. Am. Soc. Mass Spectrom. 10, 300–308 (1999).
Evershed, R. P., Robertson, D. H., Beynon, R. J. & Green, B. N. Application of electrospray ionization mass spectrometry with maximum-entropy analysis to allelic ‘fingerprinting’ of major urinary proteins. Rapid Commun. Mass Spectrom. 7, 882–886 (1993).
Ferrige, A. G. et al. Disentangling electrospray spectra with maximum entropy. Rapid Commun. Mass Spectrom. 6, 707–711 (1992).
Maquin, F., Schoot, B. M., Devaux, P. G. & Green, B. N. Molecular weight determination of recombinant interleukin 2 and interferon gamma by electrospray ionization mass spectroscopy. Rapid Commun. Mass Spectrom. 5, 299–302 (1991).
Loo, J. A., Udseth, H. R. & Smith, R. D. Peptide and protein analysis by electrospray ionization-mass spectrometry and capillary electrophoresis-mass spectrometry. Anal. Biochem. 179, 404–412 (1989).
Morris, H. R. et al. High sensitivity collisionally-activated decomposition tandem mass spectrometry on a novel quadrupole/orthogonal-acceleration time-of-flight mass spectrometer. Rapid Commun. Mass Spectrom. 10, 889–896 (1996).
Kelleher, N. L., Senko, M. W., Little, D. P., O’Connor, P. B. & McLafferty, F. W. Complete large-molecule high-resolution mass spectra from 50-femtomole microvolume injection. J. Am. Soc. Mass Spectrom. 6, 220–221 (1995).
LeDuc, R. D., Schwammle, V., Shortreed, M. R. & Cesnik, A. J. ProForma: a standard proteoform notation. J. Proteome Res. 17, 1321–1325 (2018).
Shaw, K. L., Grimsley, G. R., Yakovlev, G. I., Makarov, A. A. & Pace, C. N. The effect of net charge on the solubility, activity, and stability of ribonuclease Sa. Protein Sci. 10, 1206–1215 (2001).
Konermann, L. Addressing a common misconception: ammonium acetate as neutral pH “buffer” for native electrospray mass spectrometry. J. Am. Soc. Mass Spectrom. 28, 1827–1835 (2017).
Uhlén, M. et al. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Overington, J. P., Al-Lazikani, B. & Hopkins, A. L. How many drug targets are there? Nat. Rev. Drug Discov. 5, 993 (2006).
Laganowsky, A., Reading, E., Hopper, J. T. S. & Robinson, C. V. Mass spectrometry of intact membrane protein complexes. Nat. Protoc. 8, 639–651 (2013).
Wisniewski, J. R., Zougman, A., Nagaraj, N. & Mann, M. Universal sample preparation method for proteome analysis. Nat. Methods 6, 359–362 (2009).
Erde, J., Loo, R. R. O. & Loo, J. A. Enhanced FASP (eFASP) to increase proteome coverage and sample recovery for quantitative proteomic experiments. J. Proteome Res. 13, 1885–1895 (2014).
Doucette, A. A., Vieira, D. B., Orton, D. J. & Wall, M. J. Resolubilization of precipitated intact membrane proteins with cold formic acid for analysis by mass spectrometry. J. Proteome Res. 13, 6001–6012 (2014).
Güray, M. Z., Zheng, S. & Doucette, A. A. Mass spectrometry of intact proteins reveals +98 u chemical artifacts following precipitation in acetone. J. Proteome Res. 16, 889–897 (2017).
Smith, R. D., Loo, J. A., Edmonds, C. G., Barinaga, C. J. & Udseth, H. R. New developments in biochemical mass spectrometry: electrospray ionization. Anal. Chem. 62, 882–899 (1990).
Leney, A. C. & Heck, A. J. R. Native mass spectrometry: what is in the name? J. Am. Soc. Mass Spectrom. 28, 5–13 (2017).
Hernández, H. & Robinson, C. V. Determining the stoichiometry and interactions of macromolecular assemblies from mass spectrometry. Nat. Protoc. 2, 715 (2007).
Lössl, P., Snijder, J. & Heck, A. J. R. Boundaries of mass resolution in native mass spectrometry. J. Am. Soc. Mass Spectrom. 25, 906–917 (2014).
Gault, J. et al. High-resolution mass spectrometry of small molecules bound to membrane proteins. Nat. Methods 13, 333–336 (2016).
Belov, A. M. et al. Analysis of proteins, protein complexes, and organellar proteomes using sheathless capillary zone electrophoresis - native mass spectrometry. J. Am. Soc. Mass Spectrom. 28, 2614–2634 (2017).
Belov, A. M. et al. Complementary middle-down and intact monoclonal antibody proteoform characterization by capillary zone electrophoresis - mass spectrometry. Electrophoresis 39, 2069–2082 (2018).
Tran, J. C. & Doucette, A. A. Multiplexed size separation of intact proteins in solution phase for mass spectrometry. Anal. Chem. 81, 6201–6209 (2009).
Whitelegge, J. P., Gundersen, C. B. & Faull, K. F. Electrospray-ionization mass spectrometry of intact intrinsic membrane proteins. Protein Sci. 7, 1423–1430 (1998).
le Coutre, J. et al. Proteomics on full-length membrane proteins using mass spectrometry. Biochemistry 39, 4237–4242 (2000).
Whitelegge, J. P. HPLC and mass spectrometry of intrinsic membrane proteins. In HPLC of Peptides and Proteins: Methods and Protocols (ed. Aguilar, M.-I.) 323–339 (Springer, 2004).
Souda, P., Ryan, C. M., Cramer, W. A. & Whitelegge, J. Profiling of integral membrane proteins and their post translational modifications using high-resolution mass spectrometry. Methods 55, 330–336 (2011).
Whitelegge, J. P. Integral membrane proteins and bilayer proteomics. Anal. Chem. 85, 2558–2568 (2013).
Howery, A. E. et al. A designed inhibitor of a CLC antiporter blocks function through a unique binding mode. Chem. Biol. 19, 1460–1470 (2012).
Whitelegge, J. P. et al. Toward the bilayer proteome, electrospray ionization-mass spectrometry of large, intact transmembrane proteins. Proc. Natl Acad. Sci. USA 96, 10695–10698 (1999).
Cleland, T. P. et al. High-throughput analysis of intact human proteins using UVPD and HCD on an Orbitrap mass spectrometer. J. Proteome Res. 16, 2072–2079 (2017).
Ansong, C. et al. Top-down proteomics reveals a unique protein S-thiolation switch in Salmonella Typhimurium in response to infection-like conditions. Proc. Natl Acad. Sci. USA 110, 10153–10158 (2013).
Ntai, I. et al. Applying label-free quantitation to top down proteomics. Anal. Chem. 86, 4961–4968 (2014).
Skinner, O. S. et al. Top-down characterization of endogenous protein complexes with native proteomics. Nat. Chem. Biol. 14, 36 (2017).
J.N.A. was supported in part by NIH R01 NS065263 and ALSA 18-IIA-420 in collaboration with Biopharmaceutical Analysis Training Lab at Northeastern University. Y.G. was supported by NIH R01 GM117058 and S10 OD018475 and acknowledges the UW-Madison Human Proteomics Program Mass Spectrometry Facility (initially funded by the Wisconsin Partnership Funds). A.R.I. was supported by the NIH awards 1R01GM120272 and R01CA218500 and acknowledges Thermo Fisher Scientific and SCIEX for their support. J.P.W.’s lab was supported in part by the NIH grant P30DK063491. J.A.L. acknowledges the NIH (R01 GM103479, S10 RR028893, S10 OD018504) and the US Department of Energy (DE-FC-02-02ER63421). J.L.L. acknowledges the postdoctoral program at Amgen, Inc. C.L. acknowledges support from the NIH Ruth L. Kirschstein National Research Service Award (NRSA; T32 GM007185). Work done by N.L.K. was carried out in collaboration with the National Resource for Translational and Developmental Proteomics under NIH Grant P41GM108569 from the National Institute of General Medical Sciences. The NRTDP would like to thank I. Ntai for assistance in data collection. L. Schachner is a Howard Hughes Medical Institute Gilliam Fellow. Research reported in this publication was supported by a fellowship associated with the Chemistry of Life Processes Predoctoral Training Program T32GM105538 at Northwestern University.
Authors with company affiliations include J.L.L. (Amgen), K.C.A. (Alnylam), R.S. (Biogen), J.J.W. (Bruker), I.D.G.C. (Amgen), P.O.D. (Eastwoods Consulting) and Y.O.T. (Spectroswiss).
Peer review information: Allison Doerr was the primary editor on this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Integrated supplementary information
These components are outlined in Fig. 1c. The x-axis represents an increasing concentration of interfering substance [C] and the y-axis represents the fraction of signal lost. Each spectrum was collected in triplicate. S/N was calculated as described in the Online Methods. Standard deviation of each data point was calculated and used to produce error bars. The least-squares fitting (LSF) calculation is included to show the quality of fit to the equation.
Supplementary Figure 2 Fractionation of human whole-cell lysate prior to top-down mass spectrometry.
Human colorectal cancer cells were lysed and constituent proteins quantified by the methods described by Anderson et. al.1 Aliquots of lysate containing 400 µg were precipitated in acetone, resuspended in 1% SDS containing 50 mM DTT, and resolved on 8% T (a.) or 10% T (b.) gel-eluted liquid fraction entrapment electrophoresis (GELFrEE) cartridges following the respective manufacturer’s protocols (GELFrEE 8100 Fractionation System, Expedeon, Inc.). Upon collection of MW-based fractions, 10 µL aliquots were resolved by SDS-PAGE and visualized by AgNO3 stain2 to gauge protein content and quality of resolution. (a,b) Note that the MW ranges of f1 (purple box) and subsequent fractions differ depending on the GELFrEE cartridge selected. While 8% cartridges (a.) are recommended for quantitative high-throughput top-down MS applications or analysis of higher-MW proteins, 10% (b.) cartridges provide superior resolution in the 5-30 kDa MW range for qualitative high-throughput applications. 1. Anderson, L.C. et al. Identification and Characterization of Human Proteoforms by Top-Down LC-21 Tesla FT-ICR Mass Spectrometry. J Proteome Res 16, 1087-1096 (2017). 2. Shevchenko, A., Wilm, M., Vorm, O. & Mann, M. Mass spectrometric sequencing of proteins silver-stained polyacrylamide gels. Anal Chem 68, 850-858 (1996).
Supplementary Figure 3 Antibody Buffer and Gentle Elution Buffer Ablate MS Signal; MWCO-Ultrafiltration and Precipitation Rescue Signal.
Buffers included (a.) Thermo Gentle Elution Buffer (containing molar salt concentration), and (b.) Antibody buffer (10 mM Arginine, 10 mM Tris HCl, 10 mM histidine, 10 mM potassium phosphate, 10 mM citric acid, pH 5.5). All the above spectra were obtained using a Bruker SolariX FT-ICR mass spectrometer, 9.4T.
Supplementary Figure 4 Sample Preparation of Protein Mixture following Protocol 3 (MWCO-Ultrafiltration).
These samples were analyzed by direct infusion on a (a.) Waters Xevo G2-S QTOF and a (b.) Thermo Q Exactive Plus.
Native spectrum was acquired on a Waters Synapt G1 Q-TOF with nanoESI via direct infusion while denatured spectra was acquired on a Waters Synapt G1 Q-TOF via nanoESI-LC-MS. The native sample (a.) and the denatured sample (b.) was acquired at a concentration of 10 µM. *Denotes the most abundant charge state. 24268.7 Da is the deconvoluted mass of the unmodified AqpZ monomer. Formylated AqpZ was also detected with a mass of 24,296.4 Da. ‡Five native tetramer masses were observed corresponding to five unique combinations of formylated and unformylated monomers.85
Both denaturing and native analysis of carbonic anhydrase was run on a (a.) Thermo Q Exactive HF MS (b.) Bruker 15T SolariX FT-ICR MS, (c.) Bruker 12T SolariX FT-ICR MS, (d.) Bruker maXis II ETD Q-TOF, (e.) Bruker 15T SolariX FT-ICR MS, (f.) Waters Synapt G2Si MS.
Samples were prepared following the given SOP and separated using a PLRP-S (top panel of a, b, and c) or a C4 (bottom panel of a, b, and c) stationary phase. (a.) The final concentrations of each protein loaded onto the column were; 14 pmol ubiquitin, 49 pmol trypsinogen, 109 pmol myoglobin, and 64 pmol carbonic anhydrase. (b.) The final concentrations of each protein loaded onto the column were; 1.4 pmol ubiquitin, 4.9 pmol trypsinogen, 10.9 pmol myoglobin, and 6.4 pmol carbonic anhydrase. (c.) The final concentrations of each protein loaded onto the column were; 0.14 pmol ubiquitin, 0.49 pmol trypsinogen, 1.09 pmol myoglobin, and 0.64 pmol carbonic anhydrase.
Supplementary Figure 8 LC-MS of protein standard mixture run on Waters nanoAcquity coupled to a Bruker QTOF and a Bruker FT-ICR MS.
Samples were prepared following the given SOP and separated using PLRP-S on a Waters nanoAcquity coupled to (a.) a Bruker impact II QTOF and (b.) a Bruker SolariX FT-ICR MS.
Supplementary Figure 9 LC-MS of protein standard mixture run on a Dionex UPLC coupled to three different orbitrap mass spectrometers.
Samples were prepared following the given SOP and separated on a Dionex UPLC coupled to (a.) a Thermo Orbitrap Elite (monolithic stationary phase, (b.) a Thermo Orbitrap Fusion Lumos (PLRP-S stationary phase), (c.) a Thermo Orbitrap QE-HF (PLRP-S stationary phase).
Supplementary Figure 10 LC-MS of protein standard mixture run on a Dionex UPLC coupled to a Thermo Orbitrap Fusion Lumos.
Samples were prepared following the given SOP, separated on a Dionex UltiMate 3000 RSLCNano System using PLRP-S stationary phase, and analyzed on a Thermo Fusion Lumos. The final concentrations of each protein loaded onto the column were; 0.14 pmol ubiquitin, 0.49 pmol trypsinogen, 1.09 pmol myoglobin, and 0.64 pmol carbonic anhydrase.
Separated using a prototype CESI-8000 Plus (AB SCIEX) used with a Neutral OptiMS cartridge.
Supplementary Figure 12 LC MS of Halobacterium salinarum prepared following Supplemental Protocol 5b.
Proteins were separated using a PLRP-S stationary phase (300 Å pore size, 3 µm bead size) and analyzed on a (a.) Waters nanoAcquity interfaced with a Bruker SolariX FT-ICR MS (b.) Waters H-Class Acquity UPLC interfaced with a Waters Xevo G2-S QTOF (c.) Thermo Scientific Vanquish interfaced with a Thermo Orbitrap Q Exactive (d.) Agilent 1290 interfaced with a Thermo Orbitrap Exactive Plus.
Supplementary Figure 13 LC MS of NIST Antibody on a Waters UPLC-QTOF system using C4 stationary phase.
These results demonstrate that antibody sample clean-up for intact MS analysis can be achieved without any additional steps required.
Supplementary Figs. 1–13, Supplementary Notes 1–5, and Supplementary Protocols 1–5