High sensitivity analysis of nanogram quantities of glycosaminoglycans using ToF-SIMS

Hook, Andrew L.; Hogwood, John; Gray, Elaine; Mulloy, Barbara; Merry, Catherine L. R.

doi:10.1038/s42004-021-00506-1

Download PDF

Article
Open access
Published: 14 May 2021

High sensitivity analysis of nanogram quantities of glycosaminoglycans using ToF-SIMS

Communications Chemistry volume 4, Article number: 67 (2021) Cite this article

3753 Accesses
14 Citations
94 Altmetric
Metrics details

Subjects

Abstract

Glycosaminoglycans (GAGs) are important biopolymers that differ in the sequence of saccharide units and in post polymerisation alterations at various positions, making these complex molecules challenging to analyse. Here we describe an approach that enables small quantities (<200 ng) of over 400 different GAGs to be analysed within a short time frame (3–4 h). Time of flight secondary ion mass spectrometry (ToF-SIMS) together with multivariate analysis is used to analyse the entire set of GAG samples. Resultant spectra are derived from the whole molecules and do not require pre-digestion. All 6 possible GAG types are successfully discriminated, both alone and in the presence of fibronectin. We also distinguish between pharmaceutical grade heparin, derived from different animal species and from different suppliers, to a sensitivity as low as 0.001 wt%. This approach is likely to be highly beneficial in the quality control of GAGs produced for therapeutic applications and for characterising GAGs within biomaterials or from in vitro cell culture.

Shotgun ion mobility mass spectrometry sequencing of heparan sulfate saccharides

Article Open access 20 March 2020

Salt-free fractionation of complex isomeric mixtures of glycosaminoglycan oligosaccharides compatible with ESI-MS and microarray analysis

Article Open access 12 November 2019

A streamlined pipeline for multiplexed quantitative site-specific N-glycoproteomics

Article Open access 19 October 2020

Introduction

Glycosaminoglycans (GAGs) are polysaccharides found within cells, within the pericellular space and as a part of the extracellular matrix (ECM). GAGs regulate biological processes, such as self-renewal, differentiation, growth, inhibition, microbial invasion and defence, with their broad structural diversity and differential localisation accommodating specific interactions with hundreds of binding proteins^1,2. The complexity of GAGs, including chain length (polymerisation machinery), modification (epimerisation and sulphation of the hydroxyl groups at various positions on the saccharide units) and core protein attachment is orchestrated by enzyme mediated synthesis and allows for GAGs to have greater information carrying capacity than the more commonly studied biological polymers, nucleic acids and proteins.

The five sulfated GAGs, heparin, heparan sulphate (HS), chondroitin sulphate (CS), dermatan sulphate (DS) and keratan sulphate (KS) are synthesised attached to protein cores as proteoglycans, unlike non-sulfated hyaluronan (HA) which is extruded into the pericellular space³. Heparin, in the form of a pure polysaccharide released from its core protein, is a globally used anticoagulant and antithrombotic and is currently being considered for anti-inflammatory indications such as chronic obstructive pulmonary disease⁴. Other GAG types are now also increasingly being applied clinically, for example, as treatments for cancer and osteoarthritis, as anti-viral therapies⁵ and to support wound healing^6,7,8,9. The rapid and sensitive structural characterisation of GAGs is critical to maintain the standardisation and safety of these animal-derived biomolecules for medical use, as was highlighted by the contamination of heparin samples with over-sulfated CS (OSCS) that led to patient hypotension and death¹⁰. The ongoing biosecurity of heparin is a significant concern to healthcare systems around the world, necessitating continued efforts to improve heparin analysis and provide synthetic production routes.

Typically, chemical analysis of pharmaceutical GAGs is achieved using nuclear magnetic resonance (NMR) and high performance liquid chromatography (HPLC) methods^11,12,13,14. Simple ¹H-NMR has been shown to detect 0.1 wt% contaminating OSCS within heparin¹⁵, whilst HPLC achieved a limit of detection of 0.03 wt% for OSCS in heparin and remains the gold standard analysis technique for heparin characterisation¹⁴. However, some of these approaches require >10 mg of sample, as well as specialised equipment and expert analysis and, therefore, suffer from low throughput¹⁶. Mass spectrometry plays a leading role in GAG glycomics utilising soft ionisation techniques such as electrospray ionisation¹⁷, however, analysis of whole sulfated GAGs remains difficult¹⁸. This is particularly problematic for the characterisation of heparin as whole-molecule analysis is necessary to detect inter-species contamination of porcine-derived material used for medical applications¹⁹. If porcine sources become limited, for example as a consequence of recent outbreaks of African Swine Fever²⁰, the relatively poor detection of non-porcine material (a limit of detection (LOD) of approximately 2 wt% for detecting a bovine contamination in porcine heparin²¹) is unlikely to be sufficient to protect supplies.

For biomaterial applications requiring surface analysis, X-ray photoelectron spectroscopy has been favoured due to quantitative readouts but is unable to resolve the subtle chemical difference between different GAGs²². Time-of-flight secondary ion mass spectrometry (ToF-SIMS) is a promising approach for GAG analysis as spectral acquisition is rapid (≈20 s per sample) and can be applied to whole molecules without the need for purification or enzymatic digestion. ToF-SIMS has been applied to assess the modification of sugars at surfaces but has typically been limited to mono- or di-saccharides^{23,24,25,26,27,28}. Studies of larger polysaccharides, typically heparin or HA, focussed on low mass fragments that have limited utility to discriminate between the different GAG types^{29,30,31,32,33,34,35}.

In this study ToF-SIMS was used to analyse a microarray containing all six GAG types (analytical preparations of HS, CS, DS, KS, HA, porcine mucosal (PM) heparin, and clinical grade heparin from porcine mucosa, bovine mucosa and bovine lung). Together with principal component analysis (PCA) and partial least square (PLS) regression, this approach was used to chemically distinguish between the different GAG classes in a semi-quantitative manner, whilst notably being able to discern differences between heparin samples derived from different animal sources and different manufacturer batches. The combination of high throughput analysis with high chemical sensitivity indicated the feasibility of this method for quality control of pharmaceutical heparin, detecting possible process related impurities as well as contaminants, and for enabling the surface analysis of GAG-modified materials to facilitate the development of GAG-functional biomaterials.

Results

GAG microarray analysis

Arrays of GAG solutions were prepared using ink-jet printing (Fig. 1a) onto poly-L-lysine (PLL)-coated glass slides, selected for the ability of PLL to adhere GAGs due to ionic interactions, and possible other supramolecular interactions such as hydrogen binding (Supplementary Note 1; Supplementary Figs. 1–2)³⁶. Ink-jet printing also enabled the rapid generation of GAG mixtures via in-spot mixing (Supplementary Note 2; Supplementary Figs. 3–4, Supplementary Table 1). Microarrays enable the rapid assessment of libraries of molecules, require small amounts of material (ng) and are compatible with high throughput surface analysis³⁷. Microarrays have been widely used to assess DNA, proteins and their analogues (oligonucleotides and peptides)^{38,39,40,41,42,43}. Glycan and GAG microarrays have also been used in alternative applications, but not previously for high-throughput GAG structural analysis^44,45. In total, approximately 160 ng of material was deposited per spot. Resultant arrays were assessed by bright field microscopy and ToF-SIMS (Fig. 1b, c). All printed spots appeared to be both physically and chemically distinct (Fig. 1c and d). The absence of sulphate signal (SO₄⁻) for HA samples suggested no carry-over between print runs (Fig. 1c). Regions of interest for each spot were determined from the SO₄⁻ ion image to enable extraction of spectra for each sample (Fig. 1d, e). A typical spectrum from porcine mucosa (PM) derived heparin exhibited high intensity ions associated with sulphate (SO₂⁻, SO₃⁻, C₃HSO₅⁺) and amide (CN⁻, CNO⁻) groups as well as highly oxygenated fragments (C₃H₃O₂⁻, C₂O₃⁺) (Fig. 1d). Ions associated with the sulphate group were absent from a typical spectrum taken from a HA sample (Fig. 1e).

**Fig. 1: Preparation and analysis of a GAG microarray. GAGs were derived from porcine mucosa (PM), bovine mucosa (BM) or bovine lung (BL). If unspecified, samples are derived from PM.**

Differentiation of GAGs using principal component analysis

Each sample typically had approximately 900 different ions (both positive and negative). To effectively assess the differences between samples, principal component analysis was used to reduce the dimensionality of the multispectral dataset. Additionally, a sparse dataset was generated to remove uninformative variables not associated with variance between sample types. This is important for the high-dimension dataset where PCA results are difficult to interpret and the sample eigenvectors are not always consistent estimators whilst regression approaches are susceptible to over-fitting⁴⁶. A number of approaches have been used to develop sparsity for PCA⁴⁷, including recursive feature selection⁴⁸. In this study, recursive feature addition was used to generate a sparse dataset using the maximisation of the distance between the means of the sample sets as a selection criteria. Recursive feature elimination was then used, using the minimisation of the overlap between 95% confidence ellipses from different sample sets as a selection criterion, to select features that would differentiate between samples with sufficient confidence (Supplementary Figs. 5–6, Supplementary Table 2). To avoid over-fitting, the sample sets were split into training and test sets at a 7:3 ratio (training:test). Test samples were required to fall within the 95% confidence ellipse associated with the principal components describing the variance between samples. The final sparse dataset was further tested for its ability to robustly assess the differences between samples by ensuring sample sets remained separated with multiple randomly generated training/test sets. Creation of a sparse dataset by this method resulted in 83.5% of the variance captured by PCA to be associated with the difference between the biochemically similar GAGs CS and DS, giving confidence that this approach could also work for a broader set of materials (Supplementary Fig. 7).

The utility of PCA with a sparse dataset to identify differences in GAG samples was assessed for 5 different medical grade PM-derived heparin samples, bovine lung- (BL) and bovine mucosa- (BM) derived heparin, OSCS, CS and a heparin sample contaminated with 1 wt% OSCS from the heparin crisis. A sparse dataset containing 12 different ions was selected. By considering the scores for PC2 and PC3, the CS samples and contaminated heparin sample were all successfully differentiated from all other heparin samples (Supplementary Fig. 8).

As the ultimate assessment of this approach, samples of each of the 6 GAG types were analysed together to assess whether each sample could be chemically discerned. Without sparsity, PCA was able to successfully differentiate between the KS, HA and OSCS samples, with and without variance scaling (Supplementary Fig. 9a, b). However, separation of the other GAG samples was not achieved, particularly between the different heparin sample sets. The scree plot indicated 1–9 PCs captured variance not associated with noise (Supplementary Fig. 9c). After generation of a sparse dataset, the variance captured by the first 6 principal components (PCs) increased from 78% to 89% due to the removal of features that corresponded to variance not associated with differences between sample set (Supplementary Fig. 9c, d). A total of 48 features were selected for the final sparse dataset that corresponded to the minimum number of features required to produce a high (>0.25) mean average area fraction of ellipses not overlapping.

PCA of the sparse dataset was able to successfully separate all 16 GAG samples to 95% confidence (Figure2). Scores plots for PCs 1-2 showed clustering of the 6 main types of GAG (Fig. 2a). Further separation of the different types of heparin including separation of heparin from PM, BM or BL and different batches of heparin from PM was achieved by considering PCs 3–6 (Fig. 2b, c). Hierarchical cluster analysis was used to classify the different samples based upon their Euclidean distance. The outcome of this unsupervised classification approach is shown as a dendrogram (Fig. 2d), where samples that are most similar are positioned together. In all cases, samples were clustered within their correct sample group, including the test set, with the exception of single replicates of two heparin PM batches and one replicate of the heparin BM sample.

Only those ions with a possible assignment based upon the elemental composition of GAGs were selected (C, O, H, N, S). Each ion was assigned a loading for each PC, shown in Supplementary Table 3. Possible assignments for each of the 48 ions for the key PCs is listed in Supplementary Table 3. A number of ions likely associated with sulphate groups were selected, including ions SN⁺ and SNO₂⁻, as well as larger ions such as C₁₀H₁₁SO₄⁻. This suggests that part of the variance captured by the PCA was associated with the sulfation patterns on the GAGs. Ions likely associated with di- and tri- saccharides, such as C₁₈H₃₃SO₅⁺, C₁₈H₃₈O₉⁺, were also selected. Further interpretation of the relation between the ions identified and the GAG structures is limited due to the relatively low mass resolution of the ToF mass analyser.

To test the capability of ToF-SIMS analysis to chemically distinguish between samples in a more complex biological environment, each of the 6 GAG types were added to a fibronectin (FN) solution, printed as a microarray and analysed by ToF-SIMS. FN is a common component of biological ECMs as well as serum and plasma. After generating a sparse dataset, the multispectral data was assessed for its ability to distinguish between the different samples using PCA and hierarchical cluster analysis. All 6 GAG types were chemically differentiated from each other, and from pure FN (Fig. 3a), where all samples, including the test sets, were successfully categorised using hierarchical cluster analysis (Fig. 3a). Possible assignments for the 18 ions selected for this sparse dataset and their loadings are shown (Supplementary Table 4). Similar to the model without FN, ions containing sulphate groups (CHSO⁻, C₃HSO₂⁻) were present in the model. Most of the ions selected were small in nature and likely derived from a monosaccharide. This may be due to a reduced yield of higher molecular weight ions associated with GAGs from within a protein matrix. Additionally, ions likely associated with FN (CH₄N⁺) were also selected.

**Fig. 3: Summary of multivariate analysis of mixed GAG samples.**

Quantification of spiked samples using partial least square regression

To assess the sensitivity of the analysis methodology to adulteration, a PM heparin was spiked with increasing concentrations of either OSCS, BM heparin or BL heparin. The ToF-SIMS spectral data was correlated with the fraction of spiking agent using PLS regression, as has been done previously for correlating water contact angle or protein adsorption with ToF-SIMS data^49,50. Initially, a sparse dataset was selected for each sample set by least absolute shrinkage and selection operator (LASSO) to minimise over-fitting by removal of uninformative features⁵¹. The number of latent variables used was selected based upon the minimisation of the root mean square error of cross validation (Supplementary Fig. 11). Plots of the measured fractions of spiking agent and those predicted from the ToF-SIMS data using the PLS model are shown in Supplementary Fig. 12. A high correlation (R² > 0.94) between measured and predicted values was observed for samples spiked with either OSCS or heparin BM, suggesting that the ToF-SIMS data was able to distinguish differences in samples down to 0.001 wt%. This was confirmed by PCA of the same samples, which demonstrated separation between non-spiked samples and the samples spiked at 0.001 wt% to 95% confidence (Supplementary Fig. 13, Supplementary Tables 5–7). A weaker correlation (R² = 0.88) was observed for the samples spiked with BL heparin (Supplementary Fig. 12). PCA of these samples showed that the non-spiked sample and the 0.001 wt% sample could not be separated to 95% confidence (Supplementary Fig. 13b). However, a linear response between the measured and predicted fraction of BL heparin was observed down to 0.01 wt%, suggesting that the analysis was sensitive to this concentration. Similar R² values were observed for the training (70%) and test (30%) sets for all models, suggesting there was no over-fitting.

The PLS models were used to predict the fraction of the contaminant in each of the different heparin samples initially assessed by PCA (Fig. 3b–d). The amount of OSCS predicted in all heparin samples was below 0.001%, with the exception of the analytical grade heparins, which had predicted OSCS fractions of 0.0009 and 0.002 wt%. High predicted fractions (≈100 wt%) were predicted for the OSCS sample, whilst the sample with a known OSCS adulteration of 1 wt% had a predicted OSCS fraction of 0.6 wt%. Quantitative analysis of samples using ToF-SIMS data is limited by matrix effects⁵². Therefore, the PLS model was only applicable to samples analysed within the same matrix environment as the training data. For predictions of the bovine-derived heparin content, low values (1 × 10⁻⁵ wt%) were predicted for the porcine-derived heparin samples with the exception of the analytical grade samples and sample 4, which all had predicted values of approximately 1 × 10⁻³ wt%. The lower purity of the analytical grade heparin is expected, and our results indicate low levels of OSCS contamination. The presence of bovine-derived heparin in sample 4 was unexpected, but does coincide with lower levels of anticoagulant activity observed for this sample (Supplementary Table 8).

For completion, the PLS models were also applied to unrelated GAG types, shown in Fig. 3b–d. Although it is possible to suggest that high levels of OSCS were predicted in the DS and CS samples, the predicted values are unreliable as the models were trained for the detection of specific contaminants in heparin. The successful separation of each of the 16 GAGs by PCA suggests that, in principle, PLS models of pairwise mixtures of the other GAG types could be prepared.

Each of the features selected for the PLS regression was assigned a regression coefficient (RC) that informed how strongly it influenced the model and whether it was associated with the contaminant or PM heparin. Tables of possible assignments for the ions selected for each model and their associated RCs are shown in Supplementary Tables 9–11. For each model, ions likely derived from mono- and di-saccharides were associated with PM heparin (having a negative RC) including C₁₃H₂₉S₂NO₄⁺, C₁₆H₂₉N₂O₇⁻ and C₃₂H₅₆N₃O₁₂⁻. Furthermore, ions containing sulphate groups, such as CH₄SNO₂⁻, C₅HSNO₆⁻ and C₂SNO₃⁻ were also selected, suggesting the model includes information both about the disaccharide sequence of the heparin molecules and the sulfation pattern. The ions associated with the spiked GAGs (OSCS, BM heparin and BL heparin) included ions likely representative of the disaccharide sequence (C₂₀H₃₇SN₂O₅⁻, C₁₂H₂₉N₂O₇⁺ and C₂₂H₄₃N₂O₇⁺) or sulfation pattern (KC₅SNO⁻, C₃H₅SNO₃⁻, C₂H₃S⁺) of the spiked GAGs. There is large uncertainty regarding the ion assignments, particularly for large ions, due to the mass resolution of the ToF analyser. The suggestions provided are based upon structures that match GAG stoichiometry and have a minimal deviation between the measured and theoretical values.

Assessment of heparin activity

The anticoagulant action of heparin is chiefly due to its ability to potentiate the serine protease inhibitor antithrombin, a protein normally present in plasma. Assays of antithrombin mediated inhibition of the clotting factors thrombin (factor IIa) and of factor Xa, using purified proteins, are used to determine the potency of clinical grade heparin in International Units (IU)/mg. The Activated Partial Thromboplastin Time (APTT) is a plasma-based method for measuring anticoagulant activity.

The specific activities of five heparin samples were measured by these three methods and the results are summarised in Supplementary Table 5. A number of ions were found to significantly (p < 0.001) correlate linearly (Pearson’s r > 0.75) with each of the measures of activity, as shown in Supplementary Fig. 14. The origin of these ions is not known but the correlations suggest that they arise from structural factors that determine anticoagulant activity, either very specifically in terms of the rare pentasaccharide motif that determines affinity for antithrombin⁵³, or in more general terms such as overall degree of sulfation. This link between the surface chemistry as measured by ToF-SIMS and a quantitative measure of biological activity is unexpected.

Discussion

Analytical characterisation of GAGs underpins multiple aspects of current GAG-related research, including the understanding of their fundamental biological roles. GAGs are already important pharmaceutical compounds (as discussed above for heparin) and are increasingly being used for various therapeutic applications as well as being incorporated into biomaterials for improved biofunctionality^54,55,56,57. Mass spectrometry techniques focus on the analysis of oligosaccharides for the purposes of sequencing^58,59. The use of ToF-SIMS to analyse GAG samples on an arrayed platform provides a methodology by which small quantities (< 200 ng) of hundreds of different GAGs could be analysed within a short time window (3–4 h). The resultant spectra were derived from the whole molecules and did not require any pre-digestion or pre-labelling of material. The analysis was informative of the GAG disaccharide sequence, sulfation pattern and biological activity and enabled discernment between all 6 different GAG types investigated.

The high throughput and sensitivity achievable by this system is also important for the quality control of GAGs within healthcare settings to ensure patient safety. For the most widely used GAG in medicine, heparin, recent problems in pharmacovigilance have been the spur to develop a battery of orthogonal tests to ensure identity, purity and high specific bioactivity¹². Besides the detection of contaminants, whether introduced accidentally or as deliberate adulteration, it is necessary to monitor impurities in heparin that can arise both from co-purification of related compounds such as chondroitin and dermatan sulphates and from minor chemical modifications arising in the manufacturing process^60,61. Whole molecule analysis is desirable to be able to detect contaminants and process-related impurities in active pharmaceutical ingredient of GAG-based products, for example, detection of mixed-species heparin. Whilst whole molecule analysis of GAGs has been achieved⁶², the approaches are typically slow, require large amounts of samples and lack sensitivity. We applied our protocol to pharmaceutical grade heparin derived from different animal species and from different suppliers. Our approach allowed for the clear identification of heparin samples in terms of species of origin, and highly sensitive detection of contaminants spiked into PM heparin, including a sensitivity of 0.001 wt% of the addition of OSCS, the contaminant associated with the heparin crisis, and to 0.01 wt% for BL heparin in PM heparin. This approach is likely to be highly beneficial in the quality control of GAGs produced for therapeutic applications and for characterising GAGs within biomaterial systems or from in vitro cell culture.

The use of multivariate analysis approaches was necessary to interrogate the multi-dimensional ToF-SIMS datasets. PCA has been widely used to assess the variance within ToF-SIMS datasets and was used here to be able to capture the variance between different GAG samples, whilst PLS regression demonstrated that the fraction of a spiked GAG could be predicted from the ToF-SIMS spectra. Creation of sparse datasets was important to avoid over-fitting data as well as to remove uninformative features. For PCA, recursive feature selection identified the ions that captured the variance between the different GAGs including within a more complex biological environment containing fibronectin.

Implementation of the approach described as either a tool for basic research or as a quality control methodology for heparin manufacturer would require for the data readouts to be reached without intervention of expert users. The data models established in this study would provide a useful system that future samples could be applied to, with the possibility to identify unknown GAGs (PCA) or detect contamination within a sample (PLS). Unsupervised approaches like hierarchical cluster analysis provide a mechanism by which useful readouts can be obtained without any user intervention. The models can also easily be further expanded and made more robust through the addition of further control samples, whilst models focussing on a single GAG type are also easily achievable. The approach therefore, has broad applicability and can be readily adapted to various GAG-based applications.

Methods

Materials

HS Na salt from porcine mucosa (Iduron), DS Na salt from porcine mucosa (Average Mw = 41,000, Iduron), CS B Na salt from porcine mucosa (Sigma-Aldrich), HA Na salt from Streptococcus equi (Mw = 15,000–30,000, Sigma-Aldrich), heparin from porcine mucosa (Mw = 5000, Fisher Scientific) were used as received. GAGs were prepared as standard solutions of 5 mg/ml in ultrapure water (Purelab Ultra, ELGA LabWater). Heparin samples received from the NIBSC heparin archive. KS Na salt was derived from bovine corneal. Fibronectin was derived from bovine plasma (Sigma-Aldrich lot#101M7012V). Poly-L-lysine coated slides (Poly-Prep, Sigma-Aldrich), aminoalkylsilane functionalised slides (Silane-Prep, Sigma-Aldrich), tissue culture polystyrene (TCPS, Nunclon Delta, ThermoFisher Scientific), allylamine plasma polymer coated polystyrene (EpranEx, BD Biosciences) and bare glass slides (Corning) were used as received.

Microarray preparation

Arrays were prepared using an s11 sciFLEXARRAYER dispensing system (Scienion) using a glass piezo dispense capillary (P-2020, Scienion). Drop volumes were ≈ 300 pL, as measured using the drop shape analyser tool (Scienion) prior to each run. Print runs were conducted at a relative humidity of 65 % at room temperature. GAG solutions were diluted to 2–5 mg/ml in a polypropylene 384-well plate (Corstar) in ultra-pure water (18.2 MΩ.cm) with or without 1 mg/ml fibronectin. For in-spot mixing, 0–150 nL of water was printed and subsequently GAGs were dosed into the water droplets to facilitate mixing prior to surface adsorption. The nozzle was flushed with 250 μL of water whilst the outside of the nozzle was washed with copious amounts of water between printing different samples.

ToF-SIMS analysis

Time-of-flight secondary ion mass spectrometry measurements were conducted using a ToF-SIMS IV (IONTOF GmbH, Münster, Germany) instrument operated using a 25 keV Bi₃⁺ primary ion source exhibiting a pulsed target current of >0.3 pA. Samples were scanned at a pixel density of 512 pixels per mm, with fifteen shots per pixel over a given area. An ion dose of 2.45 × 10¹ ions per cm² was applied to each sample area ensuring that static conditions were maintained throughout. Both positive and negative secondary ion spectra were collected (mass resolution of >7000 at m/z = 29). Owing to the non-conductive nature of the samples, charge compensation was applied in the form of a low energy (20 eV) electron floodgun. Patch areas of 0.5 × 0.5 mm were acquired at a resolution of 256 × 256 pixels by rastering the primary ion beam over the patch using a ‘random raster’ path sequence. Patch areas were sequentially acquired over the entire microarray using programmed stage movements through the macro-raster stage function. The patch areas were combined into a mosaic image, allowing all patches to be processed together. A peak list was produced using the peak search tool (SurfaceLab 6, IONTOF), minimum counts set to 100, maximum background set to 0.8. To ensure the peak search tool had successfully identified peaks, all ions of interest were visually inspected. Regions associated with each polymer spot were then extracted and recalibrated, and the peak list was applied to produce an individual spectrum for each polymer. In total, 412 positive and 460 negative ion peaks were identified. Peak assignments were achieved using a custom built Visual Basic Application algorithm (PeakAssigner v2.6)⁶³. Only peaks with a chemical assignment derived from C, S, O, N and H within 100 ppm were used for PCA.

Light microscopy

Phase contrast microscopy images were acquired using an Olympus IX51 microscope using a 40× objective, NA = 0.13. The microscope was equipped with a Smart Imaging System (IMSTAR) using Fluo/LightVision software (v6.04 K).

Principal component analysis

A microarray of samples was initially prepared to enable a large number of samples to be rapidly assessed. The arrays were analysed by ToF-SIMS and spectra were obtained for each sample. Datasets were variance scaled and mean-centred and replicate measurements were split into training (70%) and test (30%) sets.

Principal component analysis (PCA) was conducted using the function ‘pca’ within Matlab R2018a (9.4.0.813654) on the full dataset. The scree plots were used to identify the total number of latent variables associated with meaningful variance by fitting a linear curve to high latent variable values (typically 15–20) and observing where the variance explained departed from linearity for lower numbers of latent variables. A sparse dataset was then created by recursive feature addition using the minimisation of confidence ellipse overlap as a weighting from the scores plots of relevant latent variables. After each feature addition the resulting models were checked to ensure the test datasets fell within the 95% confidence limits of the training datasets. The minimum number of variables required to achieve the minimum amount of ellipse overlap was selected. PCA was conducted on the final sparse dataset to assess differences between samples. The function ‘linkage’ within Matlab 2018a (9.4.0.813654) was used for hierarchical cluster analysis using the scores of relevant latent variables from the final PCA.

All data processing was conducted using a custom-built Matlab project.

Partial least square regression

Partial least square regression was conducted in Matlab R2018a (9.4.0.813654) using the plsregress function, that utilises the SIMPLS algorithm. The ToF-SIMS spectra, variance scaled and mean-centred prior to analysis, was used as a set of predictors whilst the concentration of spiked agent was used as the response. Replicate measurements were split into training (70%) and test (30%) sets. Sparse datasets were produced by LASSO using the lassoglm function, with the lambda value selected based upon the minimisation of the standard error. The final PLS models used the sparse datasets, and the number of latent variables was selected as the minimisation of the root mean square error of cross-validation.

Data availability

Data used to produce figures is available at the University of Nottingham data repository https://doi.org/10.17639/nott.7105. The Matlab project used for data processing is available at github https://github.com/fishhooky/PCABundle.

References

Kamhi, E., Joo, E. J., Dordick, J. S. & Linhardt, R. J. Glycosaminoglycans in infectious disease. Biol. Rev. 88, 928–943 (2013).
Article PubMed Google Scholar
Varki, A. Biological roles of glycans. Glycobiology 27, 3–49 (2017).
Article CAS PubMed Google Scholar
Ricard-Blum, S. & Lisacek, F. Glycosaminoglycanomics: where we are. Glycoconj. J. 34, 339–349 (2017).
Article CAS PubMed Google Scholar
Shute, J. K. et al. Inhaled nebulised unfractionated heparin improves lung function in moderate to very severe COPD: a pilot study. Pulm. Pharmacol. Therapeutics 48, 88–96 (2018).
Article CAS Google Scholar
Chandra, N. et al. Sulfated glycosaminoglycans as viral decoy receptors for human adenovirus type 37. Viruses 11, 247 (2019).
Baum, C. L. & Arpey, C. J. Normal cutaneous wound healing: clinical correlation with cellular and molecular events. Dermatologic Surg. 31, 674–686 (2005).
Article CAS Google Scholar
Gotte, M. & Yip, G. W. Heparanase, hyaluronan, and CD44 in cancers: a breast carcinoma perspective. Cancer Res. 66, 10233–10237 (2006).
Article PubMed Google Scholar
McAlindon, T. E., LaValley, M. P., Gulin, J. P. & Felson, D. T. Glucosamine and chondroitin for treatment of osteoarthritis - a systematic quality assessment and meta-analysis. JAMA 283, 1469–1475 (2000).
Article CAS PubMed Google Scholar
Blackhall, F. H., Merry, C. L. R., Davies, E. J. & Jayson, G. C. Heparan sulfate proteoglycans and cancer. Br. J. Cancer 85, 1094–1098 (2001).
Article CAS PubMed PubMed Central Google Scholar
Liu, H. Y., Zhang, Z. Q. & Linhardt, R. J. Lessons learned from the contamination of heparin. Nat. Prod. Rep. 26, 313–321 (2009).
Article CAS PubMed PubMed Central Google Scholar
Keire, D. A. et al. Characterization of currently marketed heparin products: key tests for quality assurance. Anal. Bioanal. Chem. 399, 589–591 (2011).
Article CAS Google Scholar
Szajek, A. Y. et al. The US regulatory and pharmacopeia response to the global heparin contamination crisis. Nat. Biotechnol. 34, 625–630 (2016).
Article CAS PubMed PubMed Central Google Scholar
Spelta, F. et al. SAX-HPLC and HSQC NMR spectroscopy: orthogonal methods for characterizing heparin batches composition. Front. Med. 6, 78 https://doi.org/10.3389/fmed.2019.00078 (2019).
Trehy, M. L., Reepmeyer, J. C., Kolinski, R. E., Westenberger, B. J. & Buhse, L. F. Analysis of heparin sodium by SAX/HPLC for contaminants and impurities. J. Pharm. Biomed. Anal. 49, 670–673 (2009).
Article CAS PubMed Google Scholar
Beyer, T. et al. Quality assessment of unfractionated heparin using H-1 nuclear magnetic resonance spectroscopy. J. Pharm. Biomed. Anal. 48, 13–19 (2008).
Article CAS PubMed Google Scholar
Zhang, Z. et al. Analysis of Pharmaceutical Heparins and Potential Contaminants Using H-1-NMR and PAGE. J. Pharm. Sci. 98, 4017–4026 (2009).
Article CAS PubMed Google Scholar
Song, Y., Zhang, F. & Linhardt, R. J. Analysis of the glycosaminoglycan chains of proteoglycans. J. Histochem. Cytochem. 69, 121–135 (2021).
Li, L., Zhang, F., Zaia, J. & Linhardt, R. J. Top-down approach for the direct characterization of low molecular weight heparins using LC-FT-MS. Anal. Chem. 84, 8822–8829 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kouta, A. et al. Comparative pharmacological profiles of various bovine, ovine, and porcine heparins. Clin. Appl. Thrombosis Hemost. 25, 1–9 (2019).
Vilanova, E., Tovar, A. M. F. & Mourao, P. A. S. Imminent risk of a global shortage of heparin caused by the African Swine Fever afflicting the Chinese pig herd. J. Thrombosis Haemost. 17, 254–256 (2019).
Article Google Scholar
Monakhova, Y. B. & Diehl, B. W. K. Combining H-1 NMR spectroscopy and multivariate regression techniques to quantitatively determine falsification of porcine heparin with bovine species. J. Pharm. Biomed. Anal. 115, 543–551 (2015).
Article CAS PubMed Google Scholar
Nietzold, C. et al. Surface chemical characterization of model glycan surfaces and shelf life studies of glycan microarrays using XPS, NEXAFS spectroscopy, ToF-SIMS and fluorescence scanning. Appl. Surf. Sci. 459, 860–873 (2018).
Article CAS Google Scholar
Chevolot, Y., Bucher, O., Leonard, D., Mathieu, H. J. & Sigrist, H. Synthesis and characterization of a photoactivatable glycoaryldiazirine for surface glycoengineering. Bioconjugate Chem. 10, 169–175 (1999).
Article CAS Google Scholar
Davies, M. C. et al. The application of time-of-flight SIMS for the surface characterization of polymer latex particles prepared with immobilized sugar residues. J. Colloid Interface Sci. 161, 83–90 (1993).
Article CAS Google Scholar
Dietrich, P. M. et al. Multimethod chemical characterization of carbohydrate-functionalized surfaces. J. Carbohydr. Chem. 30, 361–372 (2011).
Article CAS Google Scholar
Wendeln, C., Heile, A., Arlinghaus, H. F. & Ravoo, B. J. Carbohydrate microarrays by microcontact printing. Langmuir 26, 4933–4940 (2010).
Article CAS PubMed Google Scholar
Cheng, F., Shang, J., Ratner, D. M. & Versatile, A. Method for Functionalizing Surfaces with Bioactive Glycans. Bioconjugate Chem. 22, 50–57 (2011).
Article CAS Google Scholar
Scurr, D. J. et al. Surface characterization of carbohydrate microarrays. Langmuir 26, 17143–17155 (2010).
Article CAS PubMed Google Scholar
Burzava, A. L. S. et al. Affinity binding of EMR2 expressing cells by surface-grafted chondroitin sulfate B. Biomacromolecules 18, 1697–1704 (2017).
Article CAS PubMed Google Scholar
Charbonneau, C., Liberelle, B., Hebert, M. J., De Crescenzo, G. & Lerouge, S. Stimulation of cell growth and resistance to apoptosis in vascular smooth muscle cells on a chondroitin sulfate/epidermal growth factor coating. Biomaterials 32, 1591–1600 (2011).
Article CAS PubMed Google Scholar
D’Sa, R. A. et al. Protein, cell and bacterial response to atmospheric pressure plasma grafted hyaluronic acid on poly(methylmethacrylate). J. Mater. Sci. 26, 1–13 (2015).
Konig, U. et al. Heparinization of a biomimetic bone matrix: integration of heparin during matrix synthesis versus adsorptive post surface modification. J. Mater. Sci.-Mater. Med. 25, 607–621 (2014).
Article PubMed CAS Google Scholar
Robinson, D. E. et al. Plasma polymer and biomolecule modification of 3D scaffolds for tissue engineering. Plasma Process. Polym. 13, 678–689 (2016).
Article CAS Google Scholar
Shard, A. G. et al. X-ray photoelectron spectroscopy and time-of-flight SIMS investigations of hyaluronic acid derivatives. Langmuir 13, 2808–2814 (1997).
Article CAS Google Scholar
West, R. H. et al. Correlation of the surface chemistries of polymer bioactive coatings, with their biological performances. J. Mater. Sci.-Mater. Med. 6, 63–67 (1995).
Article CAS Google Scholar
Teixeira, R., Reis, R. L. & Pashkuleva, I. Influence of the sulfation degree of glycosaminoglycans on their multilayer assembly with poly-L-lysine. Colloids Surf. B-Biointerfaces 145, 567–575 (2016).
Article CAS PubMed Google Scholar
Urquhart, A. J. et al. High throughput surface characterisation of a combinatorial material library. Adv. Mater. 19, 2486–2491 (2007).
Article CAS Google Scholar
Berrade, L., Garcia, A. E. & Camarero, J. A. Protein microarrays: novel developments and applications. Pharm. Res. 28, 1480–1499 (2011).
Article CAS PubMed Google Scholar
Bodrossy, L. & Sessitsch, A. Oligonucleotide microarrays in microbial diagnostics. Curr. Opin. Microbiol. 7, 245–254 (2004).
Article CAS PubMed Google Scholar
Foong, Y. M., Fu, J. Q., Yao, S. Q. & Uttamchandani, M. Current advances in peptide and small molecule microarray technologies. Curr. Opin. Chem. Biol. 16, 234–242 (2012).
Article CAS PubMed Google Scholar
Romanov, V. et al. A critical comparison of protein microarray fabrication technologies. Analyst 139, 1303–1326 (2014).
Article CAS PubMed Google Scholar
Shi, H. H. et al. A review: Fabrications, detections and applications of peptide nucleic acids (PNAs) microarray. Biosens. Bioelectron. 66, 481–489 (2015).
Article CAS PubMed Google Scholar
Heller, M. J. DNA microarray technology: devices, systems, and applications. Annu. Rev. Biomed. Eng. 4, 129–153 (2002).
Article CAS PubMed Google Scholar
Rillahan, C. D. & Paulson, J. C. Glycan microarrays for decoding the glycome. In Annual Review of Biochemistry (Kornberg, R. D., Raetz, C. R. H., Rothman, J. E., Thorner, J. W., eds.) 797–823 (2011).
Zong, C. L. et al. Heparan sulfate microarray reveals that heparan sulfate-protein binding exhibits different ligand requirements. J. Am. Chem. Soc. 139, 9534–9543 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ma, Z. M. Sparse principal component analysis and iterative thresholding. Ann. Stat. 41, 772–801 (2013).
Google Scholar
Zou, H., Hastie, T. & Tibshirani, R. Sparse principal component analysis. J. Computational Graph. Stat. 15, 265–286 (2006).
Article Google Scholar
Wang, Y. & Wu, Q. Sparse PCA by iterative elimination algorithm. Adv. Computational Math. 36, 137–151 (2012).
Article Google Scholar
Taylor, M. et al. Partial least squares regression as a powerful tool for investigating large combinatorial polymer libraries. Surf. Interface Anal. 41, 127–135 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hook, A. L., Williams, P. M., Alexander, M. R. & Scurr, D. J. Multivariate ToF-SIMS image analysis of polymer microarrays and protein adsorption. Biointerphases 10, 019005 (2015).
Article PubMed CAS Google Scholar
Chen, G. N. et al. Identification of different tumor states in nasopharyngeal cancer using surface-enhanced Raman spectroscopy combined with Lasso-PLS-DA algorithm. Rsc Adv. 6, 7760–7764 (2016).
Article CAS Google Scholar
Vanden Eynde, X., Bertrand, P. & Penelle, J. “Matrix” effects in ToF-SIMS analyses of styrene-methyl methacrylate random copolymers. Macromolecules 33, 5624–5633 (2000).
Article CAS Google Scholar
Lindahl, U. What else can ‘heparin’ do? Haemostasis 29, 38–47 (1999).
CAS PubMed Google Scholar
Farrugia, B. L., Lord, M. S., Melrose, J. & Whitelock, J. M. The role of heparan sulfate in inflammation, and the development of biomimetics as anti-inflammatory strategies. J. Histochem. Cytochem. 66, 321–336 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ryan, C. N. M. et al. Glycosaminoglycans in tendon physiology, pathophysiology, and therapy. Bioconjugate Chem. 26, 1237–1251 (2015).
Article CAS Google Scholar
Salbach, J. et al. Regenerative potential of glycosaminoglycans for skin and bone. J. Mol. Med. 90, 625–635 (2012).
Article PubMed Google Scholar
Ayerst, B. I., Merry, C. L. R. & Day, A. J. The good the bad and the ugly of glycosaminoglycans in tissue engineering applications. Pharmaceuticals 10, 54 (2017).
Staples, G. O. & Zaia, J. Analysis of glycosaminoglycans using mass spectrometry. Curr. Proteom. 8, 325–336 (2011).
Article CAS Google Scholar
Zaia, J. Glycosaminoglycan glycomics using mass spectrometry. Mol. Cell. Proteom. 12, 885–892 (2013).
Article CAS Google Scholar
Kellenbach, E., Sanders, K., Michiels, P. J. A. & Girard, F. C. H-1 NMR signal at 2.10 ppm in the spectrum of KMnO4-bleached heparin sodium: identification of the chemical origin using an NMR-only approach. Anal. Bioanal. Chem. 399, 621–628 (2011).
Article CAS PubMed Google Scholar
Mourier, P. A. J., Guichard, O. Y., Herman, F. & Viskov, C. Heparin sodium compliance to USP monograph: Structural elucidation of an atypical 2.18 ppm NMR signal. J. Pharm. Biomed. Anal. 67-68, 169–174 (2012).
Article CAS PubMed Google Scholar
Zhao, Y. L., Abzalimov, R. R. & Kaltashov, I. A. Interactions of intact unfractionated heparin with its client proteins can be probed directly using native electrospray ionization mass spectrometry. Anal. Chem. 88, 1711–1718 (2016).
Article CAS PubMed Google Scholar
Hook, A. L. & Scurr, D. J. ToF-SIMS analysis of a polymer microarray composed of poly(meth)acrylates with C6 derivative pendant groups. Surf. Interface Anal. 48, 226–236 (2016).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Andrew Hook kindly acknowledges the University of Nottingham for provision of his Nottingham Research Fellowship. Catherine Merry kindly acknowledges support from the National Centre for the Replacement Refinement and Reduction of Animals in Research (NC/T2T0219). Prof. Ulf Lindahl is kindly acknowledged for the provision of keratan sulphate samples.

Author information

Authors and Affiliations

Advanced Materials and Healthcare Technology, University of Nottingham, Nottingham, UK
Andrew L. Hook
National Institute for Biological Standards and Control, Potters Bar, UK
John Hogwood & Elaine Gray
Institute for Pharmaceutical Science, King’s College London, Franklin-Wilkins Building, Stamford Street, London, UK
Elaine Gray & Barbara Mulloy
Stem Cell Glycobiology Group, Biodiscovery Institute, Faculty of Medicine and Health Sciences, University of Nottingham, Nottingham, UK
Catherine L. R. Merry

Authors

Andrew L. Hook
View author publications
You can also search for this author in PubMed Google Scholar
John Hogwood
View author publications
You can also search for this author in PubMed Google Scholar
Elaine Gray
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Mulloy
View author publications
You can also search for this author in PubMed Google Scholar
Catherine L. R. Merry
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.L.H. and C.L.R.M. conceived the study. A.L.H. acquired funding, developed the methodology, performed experiments and analysed the data. JH performed heparin activity studies. A.L.H., J.H., E.G., B.M. and C.L.R.M. contributed to the writing of the paper.

Corresponding author

Correspondence to Andrew L. Hook.

Ethics declarations

Competing interests

A.L.H. and C.L.R.M. declare the following competing interest: Patent application 2104133.0. Inventors: A.L.H. and C.L.R.M. Applicant: University of Nottingham. The remaining authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hook, A.L., Hogwood, J., Gray, E. et al. High sensitivity analysis of nanogram quantities of glycosaminoglycans using ToF-SIMS. Commun Chem 4, 67 (2021). https://doi.org/10.1038/s42004-021-00506-1

Download citation

Received: 13 January 2021
Accepted: 07 April 2021
Published: 14 May 2021
DOI: https://doi.org/10.1038/s42004-021-00506-1

This article is cited by

Evaluation of the relative potential for contact and doffing transmission of SARS-CoV-2 by a range of personal protective equipment materials
- Xuan Xue
- Christopher M. Coleman
- Morgan R. Alexander
Scientific Reports (2022)
Comprehensive structural assignment of glycosaminoglycan oligo- and polysaccharides by protein nanopore
- Parisa Bayat
- Charlotte Rambaud
- Régis Daniel
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.