ReTimeML: a retention time predictor that supports the LC–MS/MS analysis of sphingolipids

Allwright, Michael; Guennewig, Boris; Hoffmann, Anna E.; Rohleder, Cathrin; Jieu, Beverly; Chung, Long H.; Jiang, Yingxin C.; Lemos Wimmer, Bruno F.; Qi, Yanfei; Don, Anthony S.; Leweke, F. Markus; Couttas, Timothy A.

doi:10.1038/s41598-024-53860-0

Download PDF

Article
Open access
Published: 22 February 2024

ReTimeML: a retention time predictor that supports the LC–MS/MS analysis of sphingolipids

Michael Allwright¹^na1,
Boris Guennewig¹^na1,
Anna E. Hoffmann^2,3,
Cathrin Rohleder^2,3,4,
Beverly Jieu²,
Long H. Chung⁵,
Yingxin C. Jiang⁵,
Bruno F. Lemos Wimmer²,
Yanfei Qi⁵,
Anthony S. Don⁶,
F. Markus Leweke^2,3,4^na2 &
…
Timothy A. Couttas²^na2

Scientific Reports volume 14, Article number: 4375 (2024) Cite this article

832 Accesses
6 Altmetric
Metrics details

Subjects

Abstract

The analysis of ceramide (Cer) and sphingomyelin (SM) lipid species using liquid chromatography–tandem mass spectrometry (LC–MS/MS) continues to present challenges as their precursor mass and fragmentation can correspond to multiple molecular arrangements. To address this constraint, we developed ReTimeML, a freeware that automates the expected retention times (RTs) for Cer and SM lipid profiles from complex chromatograms. ReTimeML works on the principle that LC–MS/MS experiments have pre-determined RTs from internal standards, calibrators or quality controls used throughout the analysis. Employed as reference RTs, ReTimeML subsequently extrapolates the RTs of unknowns using its machine-learned regression library of mass-to-charge (m/z) versus RT profiles, which does not require model retraining for adaptability on different LC–MS/MS pipelines. We validated ReTimeML RT estimations for various Cer and SM structures across different biologicals, tissues and LC–MS/MS setups, exhibiting a mean variance between 0.23 and 2.43% compared to user annotations. ReTimeML also aided the disambiguation of SM identities from isobar distributions in paired serum-cerebrospinal fluid from healthy volunteers, allowing us to identify a series of non-canonical SMs associated between the two biofluids comprised of a polyunsaturated structure that confers increased stability against catabolic clearance.

Q-RAI data-independent acquisition for lipidomic quantitative profiling

Article Open access 07 November 2023

Deep-profiling of phospholipidome via rapid orthogonal separations and isomer-resolved mass spectrometry

Article Open access 17 July 2023

Large-scale lipid analysis with C=C location and sn-position isomer resolving power

Article Open access 17 January 2020

Introduction

Sphingolipids (SLs) are a diverse family of structural and signalling lipids that comprise a broad range of biological functions crucial to normal physiology, cell signalling and trophic support^1,2,3. SLs are defined by their sphingoid backbone, which in mammals consists predominantly of an 1, 3-dihydroxy, 18-carbon, mono-unsaturated sphingosine (d18:1), with variations to this long chain base also studied, including saturated dihydrosphingosine or sphinganine (d18:0), the di-unsaturated sphingadiene (d18:2), as well as mono- (m18:X) and trihydroxy (t18:X) configurations^2,4.

Ceramide (Cer) is the central intermediate of the sphingolipid pathway, that consists of a sphingoid base amide-linked to a fatty acid of variable length, hydroxylation and degree of unsaturation^5,6. Modifications to the C1 hydroxyl group of Cer allow for the formation of the more complex glycosylated SLs (e.g., cerebrosides, gangliosides), and the phosphorylated sphingomyelins (SM), which are the most abundant SL class in the plasma membrane of eukaryotes⁷. SM is frequently studied alongside Cer, owing to the functional importance of Cer-SM balance in cellular and inflammatory processes, as well as in ordered domain function (e.g., lipid rafts)^8,9,10,11. Liquid chromatography-tandem mass spectrometry (LC–MS/MS) is the conventional approach for the analysis of Cer and SM lipids, as their fragmentation following positive ionisation ([M + H]⁺) generates characteristic ions of their sphingoid backbone (e.g., m/z 264 for d18:1) and side groups (e.g., m/z 184 for the choline headgroup of SM) to facilitate their identification (Fig. 1). Their fragmentation can also elucidate structural features of the fatty acyl and sphingoid base chain, including length and degree(s) of unsaturation (Fig. 1), which can help determine the precise position of carbon–carbon double bonds with specialised instruments^4,12.

Specific precursor–product ion transitions can be employed in selective (SRM) or multiple (MRM) reaction monitoring experiments to improve the accuracy and reproducibility of SL analysis^13,14,15, and are readily adapted into commercial (e.g., LipidSearch, LipidBlast, LipidAnnotator) and freeware toolboxes for the development of automated workflows (including setup, data acquisition and validation), thereby making their analysis amendable to non-experts^16,17,18. However, resolving SL identities remains complex, as variations in headgroups, carbon backbone length and sites of unsaturation increase the probable number of variants with similar transitions, including isobaric and isomeric species¹⁹, that cannot be distinguished even with high-resolution instruments²⁰. In addition, new structural species are continually being identified^21,22, adding to this growing complexity.

Notwithstanding, the resolution of closely related SLs, including those with similar structures or nominal mass, can be achieved through LC separation, augmenting the accuracy of MS/MS assignments^23,24,25. SL identities can be further validated by matching their chromatographic separation or retention time (RT) in complex samples against pure external standards or internal (deuterated) compounds (Fig. 2)²⁶. Though this unequivocally resolves their identities, purchasing standards for all SLs of interest is not feasible, given many are not commercially available and the financial burden of purchasing copious numbers of SL variants.

Advancements in computational predictions have vastly improved RT estimation across various lipids (including SLs) and small molecules, for both targeted (SRM/MRM) and global lipid (untargeted) analyses^{27,28,29,30,31,32,33,34}. However, numerous selective modifications to lipid separation (e.g., sample preparation, appropriate solvents, flow rate, chromatography matrix) are made to improve their detection and avoid batch effects. This can impose limitations on consistent RT estimations as modelling is often confined to comparable experimental conditions, including tissue type, potential class-specific biases and/or requires frequent retraining and validation^30,33,34. This has precluded the capacity for robust or straightforward integration of RT estimates in lipid identification tools and experimental analyses.

To overcome these limitations in the assessment of SLs, Mass versus Relative Elution Time (MRET) profiling can be employed³⁵, plotting two-dimensional data of the SLs nominated precursor mass (m/z), against their elution time (RT) (Fig. 3). Standards and controls (quality or internal) are utilised as “points of reference”, with the generated 2D plots used to visually extrapolate the position of other SLs within that class, using the knowledge of their known mass and characteristic fragmentation ions (i.e,. sphingoid backbone, C1 head group) as molecular descriptors to determine the RT of unknowns (Fig. 3). Recognising MRET patterns specific to a given SL family (Fig. 3B), as well as structural characteristics (e.g., degree of unsaturation, Fig. 3C,D), allows for the elimination of major interferences, resulting from ions of indistinguishable precursor and product fragmentation.

Presently, MRET profiling is manually performed. In this study, we aimed to automate the process by developing a bespoke, web-based tool entitled ReTimeML, aptly named to describe the task at hand—a calculated Remeasurement of the Time, on column, to predict the elution of (sphingo)lipids. ReTimeML was constructed through Machine-Learned regression of user RT assignments, collected from our database of LC–MS/MS analyses on Cer and SM lipids, and literature sources. Herein, we verified ReTimeML’s capacity to accurately annotate (> 99% accuracy) Cer and SM RTs, compared to expert-user assignments, across multiple tissues and LC–MS/MS experimental conditions. Notably, ReTimeML was successfully applied to aid the identification of noncanonical Cer and SM structures, resolve ion interferences, and guide the accurate annotations for Cer and SM expressional differences in cerebrospinal fluid (CSF) and paired serum collected from the same healthy volunteers (HVs).

Results

Regression model development and selection

We assessed the capacity of the nominated regression algorithms (Supplementary Data 1) to learn from descriptor information for identifying Cer and SM lipids (e.g., precursor mass, fragmentation) and user RT annotations of previous work (Supplementary Data 2). RT data was sequentially increased to determine the optimal training sample size, broken down by molecular features, for which regression model performance (evaluated on validation data) yielded coefficient of determination (R²) and root mean squared error (RMSE) values at the acceptance thresholds (R² > 0.9, RMSE < 0.25), with the fulfilment of both criteria imperative. As anticipated, performance increased across all models when augmenting the training set size (Fig. 4). Lasso (alpha = 0.001) and ridge regression (alpha = 0.4) outperformed the other machine-learned models (Fig. 4), and were assigned as the optimal regression algorithms for Cer and SM RT estimations, respectively. Though comparable, lasso achieved a slightly more favourable R² value for Cer (lasso: 0.930; ridge: 0.929, n = 9), while ridge regression appeared to be more beneficial for SM estimations (lasso: 0.915; ridge: 0.928, n = 6). Lasso yielded marginally lower RMSE values for both Cer (lasso: 0.091; ridge: 0.102, n = 9) and SM (lasso: 0.132; ridge: 0.178, n = 6), when applying the same number of datasets to concurrently meet our R² > 0.9 prerequisites, though both were well-below the acceptable RMSE < 0.25 threshold (Fig. 4, Supplementary Data 2).

Performance evaluation and model robustness

The performance of ReTimeML’s lasso and ridge regression modelling was verified in four, independently performed, LC–MS/MS analyses on frequently observed d18:0, d18:1 and d18:2 Cer and SM lipid species (Tables 1 and 2). LC–MS/MS for Cer and SM species were performed on various tissues/fluids of rodents and humans, and different chromatography conditions (including an isocratic vs. gradient comparison). Cer and SM species for which experimental RTs were known (i.e., calibrators, internal/quality controls, experimentally determined), were assigned as points of reference (‘Train’), with the remaining unknown RTs extrapolated for each experiment (‘Test’). ReTimeML’s output provides users with an RT list for all Cer/SMs of interest, including those listed as references, which can be downloaded as a .csv file or directly copied into Excel or a similar spreadsheet. An MRET profile plot is also generated that displays the position of each calculated SL, organised into different degrees of SL structural unsaturation. Representative figures of ReTimeML’s output are illustrated in Fig. 5A and B. MRET plots were also manually constructed from user-validated assignments (Supplementary Fig. 1), with their eluting order supporting ReTimeML’s output, as well as previous literature on the separation of these SLs under reverse-phase conditions^35,36,37. ReTimeML’s estimations displayed exceptional agreement when compared to user-determined RTs (Fig. 5C,D). ReTimeML predicted the RTs of 192 Cer and SM species, across the four LC–MS/MS experiments, with an average and median prediction error of 7.6 and 3.6 s, respectively, with each validation experiment achieving R² > 0.99 when comparing ReTimeML estimations to experimentally determined RTs (Fig. 5E–H). Of the ReTimeML predicted RTs, 14 deviated more than 3% from user assignments, with errors of this magnitude occurring only when RTs were estimated under isocratic conditions (Tables 1 and 2). For gradient RT estimations, 23 of 142 deviated more than 1%, with only 2 of these estimates exceeding a 2% variance from user assignments (Tables 1 and 2).

Table 1 Precursor-product ions used for the analysis of ceramides and comparison between ReTimeML estimations to user-defined RTs.

Full size table

Table 2 Precursor–product ions used for the analysis of sphingomyelins and comparison between ReTimeML estimations to user-defined RTs.

Full size table

Accuracy threshold assessment

ReTimeML’s performance in our verification studies prompted us to evaluate how its modelling accuracy responds to variations in the number and/or type (i.e., structure) of reference material employed. This was performed to ascertain the minimum requirement of reference material (‘Train’) that will achieve, on average, appropriate levels of accuracy (< 3% deviation from experimentally determined values). To investigate this potential constraint, we undertook a random sampling of our user-defined RTs from the verification LC–MS/MS experiments (Tables 1 and 2), which were then utilised as operational ‘Train’ values to evaluate their impact on ReTimeML’s subsequent extrapolation of unknowns. Nominated values were incrementally increased to assess the effects of fatty acyl chain length within a sphingoid base (e.g., d18:1/16:0 → d18:1/24:0), alongside structural variations from degree(s) of unsaturation (d18:0/XX:0, d18:1/XX:0, d18:1/XX:1, d18:2/XX:0 and d18:2/XX:1). In all random samplings, the internal control was maintained, with ReTimeML requiring a minimum of two points of reference to begin the estimation of unknowns.

Regardless of the selected RT material used, ReTimeML’s estimations on gradient LC–MS/MS experiments consistently outperformed isocratic measures, requiring fewer ‘Train’ points to achieve appropriate levels of accuracy (Fig. 6). ReTimeML estimations for gradient assessments were deemed suitable upon the employment of three structural variants (e.g., d18:0/XX:0, d18:1/XX:0 and d18:1/XX:1) as reference material, with average deviations from user-RTs ranging from 0.39 to 0.58% for Cer and 0.3–2.71% for SM (Fig. 6A–C,E–G). Increasing the number of reference points beyond this value does not greatly improve RT accuracy, with the exception of SMs assessed in mouse liver tissue where additional fatty acyl chains of the three structural variants improved accuracy to 1% and below (Fig. 6G). For isocratic LC–MS/MS, a larger number of references (n = 5–7) were required to achieve appropriate levels of accuracy (< 3%), with ReTimeML unable to attain a variance from user-RTs below ~ 1.5% (Fig. 6D,H). These results mirrored the similar numbers of ‘Train’ values and model accuracy achieved in our verification experiments (Tables 1 and 2). In all assessments, increasing the number of fatty acyl chain ‘Train’ points does not as drastically improve RT accuracy when compared to increasing the number of structural variants.

ReTimeML SL resolution in the absence of fragmentation distinguishment.

ReTimeML estimations help clarify the RTs of SLs where the occurrence of both isobaric and isotope distributions interferes with the correct peak identification in the total ion chromatogram (TIC), and cannot be resolved by supporting fragmentation data. This was particularly notable for di-unsaturated SMs that can correspond to either d18:1/XX:1 or d18:2/XX:0 in our MRM analyses, as the m/z 184 transition is insufficient to differentiate these isobars (Fig. 7A–C). To aid their resolution, secondary scanning of the m/z 262 (d18:2) and 264 (d18:1) sphingoid base was performed (Fig. 7D–I, Table 2). Sphingoid fragment selection proved useful in resolving SM(d18:2/XX:0) species (Fig. 7D–F) but presented complications in deriving the identities of SM (d18:1/XX:1) species, as the prominent peaks displayed with m/z 264 scans either did not align with MRET principles (Fig. 7G–H) or displayed multiple peaks, leading to further disambiguation of the correct identity (Fig. 7I). ReTimeML-guided RTs helped to either exclude (Fig. 7G,H) or correctly annotate (Fig. 7I) the SM (d18:1/XX:1) peaks in TICs. Peaks that did not align to ReTimeML estimates for m/z 264 transitions were attributed to isotope interference from mono-unsaturated hexosylceramides (HexCer (d18:1/XX:0), as illustrated (Fig. 7J–L). HexCer (d18:1/XX:0) species give rise to an [M + 1] isotopic ion that can interfere with the [M + H]⁺ ion of SM (d18:1/XX:1) (Fig. 7M–R). A similar comparison of the isotopic distribution and ReTimeML RT estimation helped to resolve SM (d18:2/XX:1) and SM (d18:1/XX:2), the latter of which were not assessed as their signal-to-noise ratios (S/N) were below the limit of detection (S/N < 3, data not shown).

Correlation of Cer and SM profiles between CSF and serum

ReTimeML-guided Cer/SM lipids were assessed in our paired CSF and serum samples, with both fluids collected from the same HVs. This resulted in the structural characterisation and quantification of 48 Cer and SM lipid species. Demographics for the 51 HVs included in this analysis are described in Supplementary Table 1 and their concentrations (pmol/mL) for each Cer and SM lipid quantified in both the CSF and serum are listed in Supplementary Data 3. Participants were recruited in Cologne, Germany and considered healthy at the time of body fluid withdrawal, with no close relatives having a psychiatric disorder. Participants were comprised of 30 women (58.8%) and 21 men (41.2%), average age of 27.3 (SD = 6.6) years, BMI of 23.0 (SD = 3.4) and were of Caucasian ethnicity, with the exception of two participants of African and Asian origin. Cer and SM profiles were comparable to previously reported CSF and serum concentrations of these SLs in healthy/control cohorts^{38,39,40,41,42}. SMs displayed predominately higher concentrations compared to Cer, with d18:1 being the most prominent sphingoid backbone for both SL classes, regardless of body fluid type (Fig. 8A,B, Supplementary Data 3).

The comparative similarities in their CSF-serum profiles (Fig. 8A,B) prompted us to evaluate the respective associations between these two body fluids. First, pairings were treated independently by matching every SL concentration identified per individual in the CSF to that of its corresponding value in the serum. This resulted in a total of 606 individualised Cer, and 945 SM, CSF-serum pairings (Fig. 8C,D), with Pearson analysis revealing significant (p < 1 × 10⁻¹⁵) positive associations between the biofluids for Cer (r = 0.466) and SM (r = 0.844).

Mean CSF-serum correlations for each characterised Cer and SM structure were evaluated next. From the 48 Cer and SM structures identified, 41 recorded a sufficient number of CSF-serum parings (N > 10) to be subjected to a correlated coefficient analysis (r), with statistical values (p values) adjusted for multiple comparisons using the FDR approach of Benjamini, Krieger, and Yekutieli (Q < 0.05). As a result, 5 SLs were defined as significantly correlating between the CSF and serum (Fig. 8E). Notably, all of these significant associations were comprised of noncanonical, sphingadiene backbone, with 4 SMs displaying positive associations (SM (d18:2/16:1), SM (d18:2/18:1), SM (d18:2/20:1) and SM (d18:2/22:1), alongside Cer (d18:2/16:0) which showed a significant inverse correlation between the two body fluids. Correlation coefficients and statistical values for all Cer and SM associations are summarised in Supplementary Table 2.

Discussion

The objective of our study was to develop a user-friendly, analytically robust, tool for estimating RTs of the major d18:X SLs classes for Cer and SM, utilising our database of MRET behaviour in prior LC–MS/MS experiments, supplemented with literature sources, as the framework for building our model. Presently, no commercially available software or freeware incorporates RT data into their descriptor information to aid the identification of SLs in LC–MS/MS. However, it is strongly recommended that SL elution order be considered to reduce the likelihood of incorrect annotation in their identities¹⁶. The lack of RT incorporation is partly due to modelling overreliance on the specifics of the separation system used, with previous studies emphasising the need for future models to (1) be readily adaptable to differing experimental conditions and; (2) require as few reference values as possible^28,32,43.

Applying MRET principles³⁵, we demonstrate how ReTimeML is able to correctly assign various d18:0, d18:1 and d18:2 Cer/SM lipids based upon an understanding of their carbon chain length, degree of unsaturation and the C1 headgroup. We chose to focus on d18:X sphingoid bases as these comprise the most abundant SLs in mammalian organisms⁴⁴. A similar approach has been developed for modelling the RTs of glycerophospholipids, assessing their equivalent carbon number (i.e., acyl chain composition), and expanding this across the different classes (e.g., phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine)⁴⁵. Both approaches have the distinct advantage of displaying no bias towards an MS-system (e.g., triple quadrupole (QqQ) or high-resolution orbitrap). This could have an added benefit for resolving Cer and SM lipids from global/untargeted LC–MS/MS analyses that employ the appropriate conditions for their separation⁴⁶, particularly since high-resolution MS systems are without the added precision from MRM specifications to reduce TIC complexity. Though comparable, ReTimeML’s strength lies in its combination of learned material from previous experimental data, alongside the assignment of ‘real’ values (i.e., user-defined references, standards, internal controls) to enhance the decision-making process. This enabled ReTimeML to delineate common patterns for a given SL class, irrespective of LC–MS/MS methodology or experimental conditions, overcoming a major obstacle of RT adaptability for use in lipid identification software. Furthermore, ReTimeML is completely automated, making it accessible to those with limited knowledge of lipid biochemistry, and, thus, translatable to a base of researchers who may have shied away from the complex analyses.

Encouragingly, ReTimeML did not require an excessive number of references to estimate unknowns with a high degree of accuracy, meaning that users are not faced with the considerable costs of purchasing a copious number of standards. Moreover, for most experiments (excluding isocratic analyses) n ≥ 4 reference RTs did not drastically improve the accuracy of unknowns, and in certain instances increased deviations from user RTs, presumably due to data overfitting (Fig. 6). We recommend using a minimum of n = 3 reference points per SL class, be employed, comprised of different structural variants, as this provided the most accurate RT estimations (less than 1% variance for gradient and between 2.5 to 5.3% for isocratic, from user-defined measures, Fig. 6). ReTimeML’s lower accuracy of RT estimations in isocratic systems was attributed to the lesser data available for model training, as well as increases in peak-broadening from using single solvent systems that make analyte detection more difficult, particularly when lipids of interest span a wide polarity range or require the separation of closely related species^47,48. Hence, isocratic measures are more suited to low numbers of SL analytes (n < 10) to reduce complexity but have the advantage of reduced run time with no requirement of column equilibration prior to subsequent measurements⁴⁹.

For most SL analyses, n = 3 reference RTs would be achievable by pooling together a given LC–MS/MS experiments routine internal controls per lipid class (e.g., Cer, SM), or the closest structural/chemical equivalence, that control for lipid extraction inconsistencies and LC–MS/MS normalisation across sample cohorts^50,51, alongside calibrators to externally quantify samples and/or QC mixture to control for instrument signal variability (e.g., ion suppression) and mitigate influences from matrix and batch effects during high throughput screening^52,53.

Importantly, the accuracy of ReTimeML estimations was also adaptable to the reference material employed (Fig. 6), indicating that if a particular reference for a structure of interest (e.g., d18:2/XX:0) is not available, an alternate standard sourced from the same lipid class would suffice, provided our guidelines (n = 3 references, including structures) are retained. If standards cannot be procured, and we do acknowledge that certain classes of SLs (e.g., SM) are limited in their commercial availability, users can still choose to enter experimentally determined values from samples, though we strongly advise caution using this approach and recommend limiting this to RTs of SLs that are well-established or when signal interference is negligible to unequivocally resolve the correct RT on the TIC.

Herein we would also like to highlight ReTimeML’s proficiency at annotating RTs in complex TICs, particularly where ions of similar transitions interfere with correct peak annotation (Figs. 2 and 7), with their mass differentiation (~ 20 ppm) not achievable at the resolution of a QqQ system (~ 0.1–0.2 Da). ReTimeML correctly assigned RTs to the peaks of SM isobars (d18:1/XX:1 vs. d18:2/XX:0) using only the m/z 184 transition (Fig. 7A–C). This is particularly notable given this diagnostic ion is assigned only to detect the presence of a choline headgroup, with SM analyses requiring a secondary m/z 262, 264 or 266 scan(s) to determine the correct sphingoid backbone. This can be problematic in LC–MS/MS experiments for SMs as the choline headgroup is highly sensitive and its signal (up to 100× stronger) can suppress the detection of the sphingoid ion⁵⁴, thereby limiting the structural information on SMs to the sum of its components (e.g., SM (36:2)) rather than at the fatty acyl/sphingoid base structure level (e.g., SM (d18:1/18:1))⁵⁵. SM peak determination was further complicated by the presence of HexCer isotopic distribution, whose sphingoid transitions overlapped with di-unsaturated SMs (Figs. 7G–l). In the absence of RT annotation using ReTimeML, such signals could easily be misinterpreted, particularly if the m/z 184 was not in the MRM experiment or had already been assigned to a particular isobar (Fig. 7A,B).

Although a significant improvement in the resolution of SL variants, ReTimeML remains bound by the user’s LC setup. Should LC conditions not facilitate appropriate separation, interfering SL ions may be indistinguishable on the TIC. This has been proven to potentially cause artificial inflation of SL levels³⁶, and represents a current constraint for ReTimeML to handle isomers of SLs (e.g., galactosyl vs glucosylceramide having been referred to as HexCer), given the chromatography conditions ReTimeML was trained on were incapable of their separation. Additionally, misidentifications from indistinguishable transitions of hydroxylated variants, even at trace levels, could artificiality inflate or cause miss-annotation (e.g., [M + H]⁺for Cer (d18:1/24:1) and [M + H–H₂O]⁺ for Cer (t18:1/24:0) share the same 648.6 → 264.3 transition). Though not a component of this study, resolution of these hydroxylated variants can be achieved under suitable reverse-phase conditions⁵⁶, nonetheless remain an important consideration when establishing LC conditions. Appropriate separation is also pivotal when considering the application of ReTimeML for processing SLs using high-resolution MS. As previously aforementioned, these untargeted measures provide no selective bias (i.e., MRM) towards SLs of interest, which could potentially increase the risk of overlapping or interfering ions from other lipid species. Updated versions of ReTimeML shall require training on additional setups that may circumvent these potential sources of interference, including normal-phase LC–MS/MS²⁵ and next-generation ion mobility MS employing RT with collision cross-section⁵⁷, capable of achieving additional SL class and structure interpretation.

ReTimeML was employed to aid the peak selection and structural annotation of Cer and SM lipids identified in HVs, providing both CSF and serum, allowing us to compare these body fluid profiles in the same subject. CSF is the closest anatomical fluid to the brain, likely to yield more applicable biomarkers for studying neuronal effects and conditions given its composition closely resembles that of the brain's extracellular space^58,59,60. However, preconceived notions towards the invasiveness of the procedure (lumbar puncture) and the resulting distress to subjects have severely influenced its inclusion in clinical trials and broader use as a diagnostic fluid^61,62. As SLs are enriched in the CNS and have exhibited a capacity to cross the blood–brain barrier⁶³, their peripheral concentrations have the potential to act as surrogate markers in neurological and neuropsychiatric disorders⁶⁴. To the best of our knowledge, this is the first reported CSF-serum SL comparison, and only the second paired blood-CSF lipid profiling investigation in HVs⁶⁵.

Although Cer and SM concentrations in the CSF were found to be considerably lower than in serum, their relative structure-distribution patterns remained conserved and positively associated between the two body fluids (Fig. 8A–D). This is consistent with recently published CSF-plasma data⁶⁵. Although Saito et al., reported contrasting results on overall lipid compositions, a closer inspection of their SM data revealed a similar conserved profile, with a positive correlation for the SM structural variants identified in their study (r = 0.811, average per lipid structure, n = 99; r = 0.760, individuals SL pairwise CSF-plasma pairings, n = 2,079; both p values < 0.0001, Supplementary Fig. 2). Their number of Cer identities was insufficient for an effective comparison (data not shown).

A handful of our SL identities exhibited highly stringent correlations, conserved between the CSF and serum (Q < 0.05, Fig. 8E). Interestingly, all the positively associated comparisons were categorised as belonging to the same SM structural variant, which included the presence of a sphingadiene backbone (d18:2, m/z 262), together with a mono-unsaturated fatty acyl chain. Though the presence of d18:2 on SLs was first identified in the late 60 s^66,67,68, it has taken major advancements in LC–MS/MS sensitivity to enable routine assessments of these noncanonical structures, reviewed in⁶⁹, meaning our understanding of their functional importance is largely undetermined. Sphingadiene backbones are exhibited in mouse kidney, brain, lung, and colon tissue SLs⁷⁰, are reported to be the second most abundant sphingoid base in human plasma⁷¹, and a common constituent in plants and fungi^72,73. Natural (soy) sphingadienes have been reported to inhibit intestinal tumoregensis in vivo, through disrupted Akt translocation⁷⁴, and reduced Wnt transcriptional activity in colon cancer cells⁷⁵. Clinical investigations have reported that d18:2 SLs may provide protection against the development of obesity and the risk of diabetes and cardiovascular disease^76,77. Our own research observed a marked accumulation of d18:2 SLs (unpublished), following the ablation of sphingosine kinase 2 (SphK2) in a mouse model for Alzheimer’s disease (J20), which led to severe defects in myelin integrity⁷⁸. However, we never resolved whether this shift towards sphingadiene-based SLs was a primary cause of myelin disruption. SphK2 is the major isoform for catalysing the phosphorylation of sphingosine into sphingosine 1-phosphate (S1P), the penultimate step in SL lysosomal catabolism via irreversible degradation by S1P lyase^1,2. This pathway has been reported to be less efficient for the clearance of d18:2 SLs, over their d18:1 counterparts in vitro^4,70, presumably a consequence of the angled nature of the cis-double bond⁷⁹. Furthermore, it has been recently shown that d18:2 predominately converts to SMs over glycosphingolipids (i.e., HexCer)⁷⁹. Hence, we speculate that stronger associations observed for SM (d18:2/XX:1) profiles between these peripheral systems may be rationalised by their preferential formation from sphingadiene precursors and the accompanied stability from cis-double bonds on both the sphingoid base and fatty acyl chain.

In this study, we set ourselves the objective of developing a freeware that could perform data transformation and feature engineering to estimate the RTs of Cer and SM identities from complex LC–MS/MS spectra, adaptable to the experimental conditions applied, and amenable to any level of LC–MS/MS experience and/or knowledge on the biochemistry of SLs. We believe that ReTimeML excelled at this objective, assisting with the identification process across multiple LC–MS/MS conditions, including structural annotation and the removal of interfering RTs from isobaric and isotopic measures. While we recognise the advantages from employing ReTimeML, we acknowledge the existence of unforeseen circumstances and have highlighted probable instances where ReTimeML may not be applicable to a user’s LC–MS/MS design. Hence, it is always advisable for users to conduct secondary analyses/measures to confirm their SL identities, particularly if the existence of isobaric compounds or ion interferences within the TIC are likely, albeit not directly observed.

Moving forward, our objective is the continued optimisation of ReTimeML, ensuring it grows with mass spectrometry development and refine its ability to guide RT annotations for further classes of SLs, including non-canonical variants. We also plan to expand ReTimeML assessments into other lipid classes, searching relevant data repositories (e.g., MetaboLights, Metabolomics Work Bench), and welcome support from users prepared to share their lipidomic data via the options provided (see ‘Data and Code availability’). In the long term, we envisage that with the incorporation of more machine-learned lipid class RT measures, ReTimeML could become an openly accessed and/or integrated function in current automated lipid identification software engines. In achieving routine RT annotations for SLs, we also drew attention to the physiological and pathophysiological importance of non-canonical SLs that are achievable with current mass spectrometric systems. It is hoped that our findings will further scrutinise their significance as molecular mediators in health and disease.

Materials and methods

Data integration

Regression models were fitted to Cer and SM data collated from our prior published material and that of the literature (Supplementary Data 2). Data incorporated met a minimum quantity of molecular and chromatography information, ensuring sufficient variance between descriptors to accurately fit with models. The information incorporated includes the sphingoid base/fatty-acyl naming for Cer and SM^55,80 and/or chemical formula of the sphingolipid, m/z of the [M + H]⁺ precursor ion together, with a minimum of one fragmentation ion to aid the structural characterisation (e.g., m/z 264 for the sphingosine backbone), the chromatography system applied, stationary-phase column used, solvent conditions (gradient vs isocratic) and nominated flow rate. For each dataset, Cer and SM species were broken down into RTs that were either “user-defined” or “known” (e.g., standard), per study. The selected nominations were defined by the experimental datasets chosen controls (internal and QC), compounds used for calibration or to optimise the LC–MS/MS parameters during their method development.

Retention time learning algorithms

Linear and non-linear regression algorithms were assessed and evaluated based on their ability to predict RTs, alongside the required number of training samples to learn and achieve appropriate levels of accuracy. Variables associated with precursor mass, precursor mass squared, square root and the log of the precursor mass for each molecular sample were calculated for every included data point. Adopting the sphingoid base and fatty acyl notation⁵⁵, a Python (version 3.10, Python Software Foundation) function was used to extract the molecular features, based on the nominated Cer/SM precursor mass and relevant fragmentation to deduce the carbon chain length, both on the sphingoid backbone and fatty acyl chain, the degree(s) of unsaturation, as well as specific modifications to the C1 head group (i.e., m/z 184 for choline phosphate of SM). Incorporated datasets were randomly divided into ‘training’ (70%) and ‘validation’ (30%), using Python’s inbuilt sklearn package, which was also used to train linear, lasso and ridge regression models. Python’s xgboost package was used to train an XGBoost regression algorithm. RMSE and R² were used to evaluate the performance of each regression model’s RT estimations. The number of training, and hence corresponding validation data points were sequentially varied (± 3 datasets) to assess the minimum training size required for a given regression model to forecast RTs accurately (R² > 0.95; RMSE < 0.25), with regression models ranked according to the number of training samples required to achieve these measurables. The complete list of regression models, along with R² and RMSE scores per training size, are summarised in Supplementary Data 1.

Ceramide and sphingomyelin data acquisition

Top-ranked regression models for Cer and SM were selected for secondary verification across four, independently performed, LC–MS/MS assessments spanning human fluids (CSF and serum), mouse liver and rat brain homogenates (all unpublished analyses). Human CSF and serum were provided by our biobank at the Central Institute of Mental Health, Mannheim, with both donated from the same healthy participants (n = 51), originally recruited at the Clinic and Outpatient Clinic of Psychiatry and Psychotherapy, University of Cologne. The Ethics Committee of the Medical Faculty Cologne, University of Cologne, Germany (00-053) approved the use of these samples for this research. Rat brain tissue was procured from our prior animal investigation exploring behavioural changes following different tetrahydrocannabinol preparations⁸¹, approved by the regional authority State Agency for Nature, Environment and Consumer Protection of the State North Rhine-Westphalia (LANIUV-NRW). Only tissue from placebo-administered rats was assessed. SLs extracted from C57BL/6 mouse liver were in accordance with protocols (#2019-033), approved by the Research Ethics and Governance Office, Royal Prince Alfred Hospital, Sydney, Australia.

SLs from human fluids were extracted using the conventional Bligh and Dyer method⁸², while rodent liver and brain tissue SLs were extracted using single-phase methanol/butanol (1:1 v/v⁸³) and two-phase methyl-tert-butyl ether (MTBE)/methanol/water (10:3:2.5, v/v/v⁴), respectively. Prior to extractions, all samples were loaded with Cer (d18:1/17:0) and SM (d18:1/12:0) as internal controls. All samples underwent MRM analysis performed on a TSQ Altis QqQ mass spectrometer (ThermoFisher), coupled to a Vanquish UHPLC system, as previous⁴. For SMs, secondary product ion scans for m/z 264 and 262 were included to help distinguish isobars (e.g., SM(d18:1/XX:1) and SM (d18:2/XX:0)) and the potentially conflicting phosphatidylcholine ions, which induce the same m/z 184 choline headgroup fragment. A complete list of Cer and SM lipids, together with their MRM transitions, are provided in Tables 1 and 2.

Chromatographic and stationary phase conditions were also varied between experiments. SLs extracted from human fluids were resolved on a 3 × 150 mm Agilent XDB-C8 column (5 μM pore size), using a modified Hejazi et al.³⁵ binary gradient as follows: 0 min, 20:80 A/B; 2 min, 20:80 A/B; 7 min, 13:87 A/B; 14 min, 0:100 A/B; 20.5 min, 0:100 A/B; 21 min, 20:80 A/B; 24 min, 20:80 A/B. Mobile phase ‘A’ consisted of 0.2% formic acid, 2 mM ammonium formate in water; Mobile phase ‘B’: 0.2% formic acid, 1 mM ammonium formate in methanol. Total run time was 24 min, at a flow rate of 0.2 mL/min. Extracted SLs from mouse liver were separated on the same 3 × 150 mm Agilent XDB-C8 stationary phase, with modified gradient solvents and conditions as follows: 0 min, 20:80 A/B; 1 min, 20:80 A/B; 9 min, 5:95 A/B; 11 min, 0:100 A/B; 17.5 min, 0:100 A/B; 17.6 min 20:80 A/B, 20.5 min 20:80 A/B. Solvent ‘A’ comprised of 0.1% formic acid, 2 mM Ammonium acetate in water; Solvent ‘B’ composition of 0.1% formic acid, 2 mM Ammonium acetate in methanol. Total run time was 20.5 min at a flow rate of 0.3 mL/min. SL analysis of brains (rat) were resolved on a 2.1 × 100 mm Waters Acquity C18 UPLC column (1.7 µm pore size) under isocratic conditions, as previous⁸⁴, using methanol with 0.2% formic acid as the mobile phase, at a flow rate of 0.25 mL/min for 15 min.

ReTimeML interface

ReTimeML’s pilot version is available as a free, open-source web interface powered by streamlit (https://mikeallwright23-retime-app-lipid3-021zpv.streamlit.app/). Users upload datasets in .csv format, consisting of the Cer/SM lipids of interest, alongside their precursor mass and whether the SL species included are a “Train” (reference with known RT included) or “Test” (unknown) value. Template .csv files are provided for users to test the interface (Supplementary Data 4 and 5), and can be adapted for their own SL analysis. Uploaded data (drag and drop option) triggers the automatic calculation of RTs, utilising the nominated RTs as points-of-reference (Train) to guide the extrapolated unknowns (Test), with users free to amend the number of train/test values. It is of note to mention a minimum of two ‘Train’ values is required for ReTimeML to extrapolate an output. Functions are applied within the web interface to automatically pre-process each data field, using regex functionality in Python, to feature engineer the number of carbon atoms and degree(s) of unsaturation for structural components on the sphingoid and fatty acyl chain, programming these as one hot encoded (ohe) variables, as well as the log of the mass, mass squared and square root of the mass. In addition, ReTimeML also provides an MRET profile output of estimations, annotated using second-order polynomial trendlines. The web interface also provides users the added option to voluntarily upload their own RT estimations, so that our team can evaluate and incorporate them into our working models.

Statistical analysis

Verified Human Cer and SM lipid datasets underwent peak integration using Xcalibur 4.4.16.14 software (ThermoFisher Scientific, San Jose, CA, USA), with Cer and SM species normalised as ratios to their class-specific internal control. A separate Cer/SM mixture, comprising various compounds (Supplementary Table 3), was run every 20 samples. This external mixture acted as a QC and provided additional RT references for Cer and SM. All Cer/SM mixtures were prepared in aqueous/organic proportions reflecting the starting conditions of the respective LC–MS/MS experiments. All Cer and SM values were first log-transformed (natural log) to obtain a normal distribution for Pearson correlations (r). For the assessment of individual lipids between CSF and serum, the resultant p values were adjusted for multiple comparisons using the Benjamini, Krieger, and Yekutieli false discovery rate (FDR) approach, with Q < 0.05 considered significant (GraphPad Prism software, Version 10.0.3, Dotmatics, Boston, MA, USA).

Data availability

All Cer and SM data supporting our findings has been provided in the manuscript and supplementary information. Datasets for ReTimeML training were collated from prior published material. Details for these studies, including the Cer and SM lipids assessed, has been provided in Supplementary Data 2. RT values for these studies can be made available by request to the corresponding authors. Source code for ReTimeML learned regression models is also available upon reasonable request. We encourage users of ReTimeML to voluntarily upload RT data for future lipid regression model development (.xls, .xlsx or .csv format), at either the following Google form https://forms.gle/p9fGuzuujZqDhcpD6 or Dropbox link: https://www.dropbox.com/request/kMOXwBe54IxGWeocE7WS.

References

Hannun, Y. A. & Obeid, L. M. Sphingolipids and their metabolism in physiology and disease. Nat. Rev. Mol. Cell Biol. 19(3), 175–191 (2018).
Article CAS PubMed Google Scholar
Merrill, A. H. Jr. Sphingolipid and glycosphingolipid metabolic pathways in the era of sphingolipidomics. Chem. Rev. 111(10), 6387–6422 (2011).
Article CAS PubMed PubMed Central Google Scholar
Giussani, P., Prinetti, A. & Tringali, C. The role of Sphingolipids in myelination and myelin stability and their involvement in childhood and adult demyelinating disorders. J. Neurochem. 156(4), 403–414 (2021).
Article CAS PubMed Google Scholar
Couttas, T. A. et al. A novel function of sphingosine kinase 2 in the metabolism of sphinga-4,14-diene lipids. Metabolites 10(6), 236 (2020).
Article CAS PubMed PubMed Central Google Scholar
Goni, F. M. & Alonso, A. Biophysics of sphingolipids I. Membrane properties of sphingosine, ceramides and other simple sphingolipids. Biochim. Biophys. Acta 1758(12), 1902–1921 (2006).
Article CAS PubMed Google Scholar
Hannun, Y. A. & Bell, R. M. Functions of sphingolipids and sphingolipid breakdown products in cellular regulation. Science 243(4890), 500–507 (1989).
Article ADS CAS PubMed Google Scholar
van Meer, G., Voelker, D. R. & Feigenson, G. W. Membrane lipids: Where they are and how they behave. Nat. Rev. Mol. Cell Biol. 9(2), 112–124 (2008).
Article PubMed PubMed Central Google Scholar
Fanani, M. L. & Maggio, B. The many faces (and phases) of ceramide and sphingomyelin I - single lipids. Biophys. Rev. 9(5), 589–600 (2017).
Article CAS PubMed PubMed Central Google Scholar
Taniguchi, M. & Okazaki, T. The role of sphingomyelin and sphingomyelin synthases in cell death, proliferation and migration-from cell and animal models to human disorders. Biochim. Biophys. Acta 1841(5), 692–703 (2014).
Article CAS PubMed Google Scholar
Maceyka, M. & Spiegel, S. Sphingolipid metabolites in inflammatory disease. Nature 510(7503), 58–67 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Kolesnick, R. The therapeutic potential of modulating the ceramide/sphingomyelin pathway. J. Clin. Investig. 110(1), 3–8 (2002).
Article CAS PubMed PubMed Central Google Scholar
Ryan, E. et al. Detailed structural characterization of sphingolipids via 193 nm ultraviolet photodissociation and ultra high resolution tandem mass spectrometry. J. Am. Soc. Mass Spectrom. 28(7), 1406–1419 (2017).
Article ADS CAS PubMed Google Scholar
Bielawski, J. et al. Simultaneous quantitative analysis of bioactive sphingolipids by high-performance liquid chromatography-tandem mass spectrometry. Methods 39(2), 82–91 (2006).
Article CAS PubMed Google Scholar
Couttas, T. A. et al. Age-dependent changes to sphingolipid balance in the human hippocampus are gender-specific and may sensitize to neurodegeneration. J. Alzheimers Dis. 63(2), 503–514 (2018).
Article CAS PubMed Google Scholar
Sullards, M. C. & Merrill, A. H. Jr. Analysis of sphingosine 1-phosphate, ceramides, and other bioactive sphingolipids by high-performance liquid chromatography-tandem mass spectrometry. Sci. STKE 2001(67), pl1 (2001).
Article CAS PubMed Google Scholar
Peng, B. et al. LipidCreator workbench to probe the lipidomic landscape. Nat. Commun. 11(1), 2057 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Koelmel, J. P. et al. LipidMatch: an automated workflow for rule-based lipid identification using untargeted high-resolution tandem mass spectrometry data. BMC Bioinform. 18(1), 331 (2017).
Article Google Scholar
Wong, J. W. et al. MMSAT: Automated quantification of metabolites in selected reaction monitoring experiments. Anal. Chem. 84(1), 470–474 (2012).
Article CAS PubMed Google Scholar
Merrill, A. H. Jr. & Sullards, M. C. Opinion article on lipidomics: Inherent challenges of lipidomic analysis of sphingolipids. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 1862(8), 774–776 (2017).
Article CAS PubMed Google Scholar
Zullig, T. & Kofeler, H. C. High resolution mass spectrometry in lipidomics. Mass Spectrom. Rev. 40(3), 162–176 (2021).
Article ADS PubMed Google Scholar
Duan, J. & Merrill, A. H. Jr. 1-Deoxysphingolipids encountered exogenously and made de novo: Dangerous mysteries inside an enigma. J. Biol. Chem. 290(25), 15380–15389 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sandhoff, R. Very long chain sphingolipids: Tissue expression, function and synthesis. FEBS Lett. 584(9), 1907–1913 (2010).
Article CAS PubMed Google Scholar
Merrill, A. H. Jr. et al. Sphingolipidomics: high-throughput, structure-specific, and quantitative analysis of sphingolipids by liquid chromatography tandem mass spectrometry. Methods 36(2), 207–224 (2005).
Article CAS PubMed Google Scholar
Sullards, M. C. et al. Analysis of mammalian sphingolipids by liquid chromatography tandem mass spectrometry (LC-MS/MS) and tissue imaging mass spectrometry (TIMS). Biochim. Biophys. Acta 1811(11), 838–853 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zama, K. et al. Simultaneous quantification of glucosylceramide and galactosylceramide by normal-phase HPLC using O-phtalaldehyde derivatives prepared with sphingolipid ceramide N-deacylase. Glycobiology 19(7), 767–775 (2009).
Article CAS PubMed Google Scholar
Rana, N. A. et al. Qualitative and quantitative measurements of sphingolipids by mass spectrometry. In Bioactive Sphingolipids in Cancer Biology and Therapy (eds Hannun, Y. A. et al.) 313–338 (Springer, 2015).
Chapter Google Scholar
Osipenko, S. et al. Machine learning to predict retention time of small molecules in nano-HPLC. Anal. Bioanal. Chem. 412(28), 7767–7776 (2020).
Article CAS PubMed Google Scholar
Stanstrup, J., Neumann, S. & Vrhovsek, U. PredRet: Prediction of retention time by direct mapping between multiple chromatographic systems. Anal. Chem. 87(18), 9421–9428 (2015).
Article CAS PubMed Google Scholar
D’Archivio, A. A. et al. Cross-column retention prediction in reversed-phase high-performance liquid chromatography by artificial neural network modelling. Anal. Chim. Acta 717, 52–60 (2012).
Article CAS PubMed Google Scholar
Fedorova, E. S. et al. Deep learning for retention time prediction in reversed-phase liquid chromatography. J. Chromatogr. A 1664, 462792 (2022).
Article CAS PubMed Google Scholar
Yang, Q. et al. Prediction of liquid chromatographic retention time with graph neural networks to assist in small molecule identification. Anal. Chem. 93(4), 2200–2206 (2021).
Article CAS PubMed Google Scholar
Aicheler, F. et al. Retention time prediction improves identification in nontargeted lipidomics approaches. Anal. Chem. 87(15), 7698–7704 (2015).
Article CAS PubMed Google Scholar
Bouwmeester, R., Martens, L. & Degroeve, S. Comprehensive and empirical evaluation of machine learning algorithms for small molecule LC retention time prediction. Anal. Chem. 91(5), 3694–3703 (2019).
Article CAS PubMed Google Scholar
Domingo-Almenara, X. et al. The METLIN small molecule dataset for machine learning-based retention time prediction. Nat. Commun. 10(1), 5811 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Hejazi, L. et al. Mass and relative elution time profiling: two-dimensional analysis of sphingolipids in Alzheimer’s disease brains. Biochem. J. 438(1), 165–175 (2011).
Article CAS PubMed Google Scholar
Huang, H. et al. LC–MS based sphingolipidomic study on A549 human lung adenocarcinoma cell line and its taxol-resistant strain. BMC Cancer 18(1), 799 (2018).
Article PubMed PubMed Central Google Scholar
Huynh, K. et al. High-throughput plasma lipidomics: Detailed mapping of the associations with cardiometabolic risk factors. Cell Chem. Biol. 26(1), 71–84 (2019).
Article CAS PubMed Google Scholar
Fonteh, A. N. et al. Sphingolipid metabolism correlates with cerebrospinal fluid beta amyloid levels in Alzheimer’s disease. PLoS ONE 10(5), e0125597 (2015).
Article PubMed PubMed Central Google Scholar
Hammad, S. M. et al. Plasma sphingolipid profile associated with subclinical atherosclerosis and clinical disease markers of systemic lupus erythematosus: potential predictive value. Front. Immunol. 12, 694318 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hanamatsu, H. et al. Altered levels of serum sphingomyelin and ceramide containing distinct acyl chains in young obese adults. Nutr. Diabetes 4(10), e141 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hammad, S. M. et al. Blood sphingolipidomics in healthy humans: Impact of sample collection methodology. J. Lipid Res. 51(10), 3074–3087 (2010).
Article CAS PubMed PubMed Central Google Scholar
Torretta, E. et al. Particular CSF sphingolipid patterns identify iNPH and AD patients. Sci. Rep. 8(1), 13639 (2018).
Article ADS PubMed PubMed Central Google Scholar
Ross, D. H. et al. Evaluating software tools for lipid identification from ion mobility spectrometry-mass spectrometry lipidomics data. Molecules 28(8), 3483 (2023).
Article CAS PubMed PubMed Central Google Scholar
Carreira, A. C. et al. Mammalian sphingoid bases: Biophysical, physiological and pathological properties. Prog. Lipid Res. 75, 100988 (2019).
Article CAS PubMed Google Scholar
White, J. B. et al. Equivalent carbon number and interclass retention time conversion enhance lipid identification in untargeted clinical lipidomics. Anal. Chem. 94(8), 3476–3484 (2022).
Article CAS PubMed Google Scholar
Marian, O. C. et al. Disrupted myelin lipid metabolism differentiates frontotemporal dementia caused by GRN and C9orf72 gene mutations. Acta Neuropathol. Commun. 11(1), 52 (2023).
Article CAS PubMed PubMed Central Google Scholar
Gritti, F. General theory of peak compression in liquid chromatography. J. Chromatogr. A 1433, 114–122 (2016).
Article CAS PubMed Google Scholar
Haidar Ahmad, I. A. Necessary analytical skills and knowledge for identifying, understanding, and performing HPLC troubleshooting. Chromatographia 80(5), 705–730 (2017).
Article CAS Google Scholar
Schellinger, A. P. & Carr, P. W. Isocratic and gradient elution chromatography: A comparison in terms of speed, retention reproducibility and quantitation. J. Chromatogr. A 1109(2), 253–266 (2006).
Article CAS PubMed Google Scholar
Holcapek, M., Liebisch, G. & Ekroos, K. Lipidomic analysis. Anal. Chem. 90(7), 4249–4257 (2018).
Article CAS PubMed Google Scholar
Kofeler, H. C. et al. Recommendations for good practice in MS-based lipidomics. J. Lipid Res. 62, 100138 (2021).
Article PubMed PubMed Central Google Scholar
Pitt, J. J. Principles and applications of liquid chromatography–mass spectrometry in clinical biochemistry. Clin. Biochem. Rev. 30(1), 19–34 (2009).
PubMed PubMed Central Google Scholar
Cheng, W. L. et al. Calibration practices in clinical mass spectrometry: Review and recommendations. Ann. Lab. Med. 43(1), 5–18 (2023).
Article PubMed Google Scholar
Hsu, F. F. & Turk, J. Structural determination of sphingomyelin by tandem mass spectrometry with electrospray ionization. J. Am. Soc. Mass Spectrom. 11(5), 437–449 (2000).
Article CAS PubMed Google Scholar
Liebisch, G. et al. Shorthand notation for lipid structures derived from mass spectrometry. J. Lipid Res. 54(6), 1523–1530 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hartler, J. et al. Automated annotation of sphingolipids including accurate identification of hydroxylation sites using MS(n) data. Anal. Chem. 92(20), 14054–14062 (2020).
Article CAS PubMed PubMed Central Google Scholar
Camunas-Alberca, S. M. et al. Integrating the potential of ion mobility spectrometry–mass spectrometry in the separation and structural characterisation of lipid isomers. Front. Mol. Biosci. 10, 1112521 (2023).
Article CAS PubMed PubMed Central Google Scholar
Blennow, K. et al. Cerebrospinal fluid and plasma biomarkers in Alzheimer disease. Nat. Rev. Neurol. 6(3), 131–144 (2010).
Article CAS PubMed Google Scholar
Lee, J. Cerebrospinal fluid biomarkers in various pediatric neurologic diseases. Clin. Exp. Pediatr. 65(2), 81–82 (2022).
Article PubMed PubMed Central Google Scholar
Paraskevas, G. P. The role of cerebrospinal fluid biomarkers in dementia and other related neurodegenerative disorders. Brain Sci 12(5), 627 (2022).
Article PubMed PubMed Central Google Scholar
Howell, J. C. et al. Research lumbar punctures among African Americans and Caucasians: Perception predicts experience. Front. Aging Neurosci. 8, 296 (2016).
Article PubMed PubMed Central Google Scholar
Day, G. S. et al. Deciphering the factors that influence participation in studies requiring serial lumbar punctures. Alzheimers Dement. (Amst.) 12(1), e12003 (2020).
PubMed Google Scholar
de la Monte, S. M. Triangulated mal-signaling in Alzheimer’s disease: Roles of neurotoxic ceramides, ER stress, and insulin resistance reviewed. J. Alzheimers Dis. 30(Suppl 2), S231–S249 (2012).
Article PubMed PubMed Central Google Scholar
van Kruining, D. et al. Sphingolipids as prognostic biomarkers of neurodegeneration, neuroinflammation, and psychiatric diseases and their emerging role in lipidomic investigation methods. Adv. Drug Deliv. Rev. 159, 232–244 (2020).
Article PubMed PubMed Central Google Scholar
Saito, K. et al. Profiling of cerebrospinal fluid lipids and their relationship with plasma lipids in healthy humans. Metabolites 11(5), 268 (2021).
Article CAS PubMed PubMed Central Google Scholar
Renkonen, O. & Hirvisalo, E. L. Structure of plasma sphingadienine. J. Lipid Res. 10(6), 687–693 (1969).
Article CAS PubMed Google Scholar
Polito, A. J., Akita, T. & Sweeley, C. C. Gas chromatography and mass spectrometry of sphingolipid bases. Characterization of sphinga-4,14-dienine from plasma sphingomyelin. Biochemistry 7(7), 2609–2614 (1968).
Article CAS PubMed Google Scholar
Panganamala, R. V., Geer, J. C. & Cornwell, D. G. Long-chain bases in the sphingolipids of atherosclerotic human aorta. J. Lipid Res. 10(4), 445–455 (1969).
Article CAS PubMed Google Scholar
Lam, B. W. S. et al. The noncanonical chronicles: Emerging roles of sphingolipid structural variants. Cell. Signal 79, 109890 (2021).
Article CAS PubMed Google Scholar
Jojima, K. et al. Biosynthesis of the anti-lipid-microdomain sphingoid base 4,14-sphingadiene by the ceramide desaturase FADS3. FASEB J. 34(2), 3318–3335 (2020).
Article CAS PubMed Google Scholar
Karsai, G. et al. FADS3 is a Delta14Z sphingoid base desaturase that contributes to gender differences in the human plasma sphingolipidome. J. Biol. Chem. 295(7), 1889–1897 (2020).
Article CAS PubMed Google Scholar
Sullards, M. C. et al. Structure determination of soybean and wheat glucosylceramides by tandem mass spectrometry. J. Mass Spectrom. 35(3), 347–353 (2000).
Article ADS CAS PubMed Google Scholar
Aida, K. et al. Prevention of aberrant crypt foci formation by dietary maize and yeast cerebrosides in 1, 2-dimethylhydrazine-treated mice. J. Oleo Sci. 54(1), 45–49 (2005).
Article CAS Google Scholar
Fyrst, H. et al. Natural sphingadienes inhibit Akt-dependent signaling and prevent intestinal tumorigenesis. Cancer Res. 69(24), 9457–9464 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kumar, A. et al. Chemopreventive sphingadienes downregulate Wnt signaling via a PP2A/Akt/GSK3beta pathway in colon cancer. Carcinogenesis 33(9), 1726–1735 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chew, W. S. et al. Large-scale lipidomics identifies associations between plasma sphingolipids and T2DM incidence. JCI Insight 5(13), e126925 (2019).
Article PubMed Google Scholar
Othman, A. et al. Plasma C20-Sphingolipids predict cardiovascular events independently from conventional cardiovascular risk factors in patients undergoing coronary angiography. Atherosclerosis 240(1), 216–221 (2015).
Article CAS PubMed Google Scholar
Lei, M. et al. Sphingosine kinase 2 potentiates amyloid deposition but protects against hippocampal volume loss and demyelination in a mouse model of Alzheimer’s disease. J. Neurosci. 39(48), 9645–9659 (2019).
Article CAS PubMed PubMed Central Google Scholar
Jojima, K. & Kihara, A. Metabolism of sphingadiene and characterization of the sphingadiene-producing enzyme FADS3. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 1868(8), 159335 (2023).
Article CAS PubMed Google Scholar
Fahy, E. et al. Update of the LIPID MAPS comprehensive classification system for lipids. J. Lipid Res. 50(Supp 1), S9-14 (2009).
Article PubMed PubMed Central Google Scholar
Rohleder, C. et al. Different pharmaceutical preparations of Delta(9)-tetrahydrocannabinol differentially affect its behavioral effects in rats. Addict. Biol. 25(3), e12745 (2020).
Article PubMed Google Scholar
Bligh, E. G. & Dyer, W. J. A rapid method of total lipid extraction and purification. Can. J. Biochem. Physiol. 37(8), 911–917 (1959).
Article CAS PubMed Google Scholar
Alshehry, Z. H. et al. An efficient single phase method for the extraction of plasma lipids. Metabolites 5(2), 389–403 (2015).
Article CAS PubMed PubMed Central Google Scholar
Turner, N. et al. A selective inhibitor of ceramide synthase 1 reveals a novel role in fat metabolism. Nat. Commun. 9(1), 3165 (2018).
Article ADS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was supported by the Moyira Elizabeth Vine Fund, awarded to both T.A.C. (2023) and F.M.L (2020) for research into schizophrenia; Deputy Vice Chancellor-Start Up Funds from the University of Sydney (F.M.L); Medical Research Futures Fund, Australia (GA268519, B.G.); Brain and Intelligence Science (BISA) Sydney-Fudan University accelerator grant (B.G.); National Health and Medical Research Council Grant, Australia (GNT1162545, Y.Q.); and The Stanley Medical Research Institute (03-NV-003, F.M.L.). A.E.H. and B.J. were supported by Australian Government Research Training Program (RTP) Scholarships. We gratefully acknowledge subsidised access to the LC-MS/MS systems at Sydney Mass Spectrometry, University of Sydney.

Author information

These authors contributed equally: Michael Allwright and Boris Guennewig.
These authors jointly supervised this work: F. Markus Leweke and Timothy A. Couttas.

Authors and Affiliations

ForeFront, Brain and Mind Centre, The University of Sydney, Sydney, Australia
Michael Allwright & Boris Guennewig
Translational Research Collective, Brain and Mind Centre, The University of Sydney, Sydney, NSW, 2006, Australia
Anna E. Hoffmann, Cathrin Rohleder, Beverly Jieu, Bruno F. Lemos Wimmer, F. Markus Leweke & Timothy A. Couttas
Endosane Pharmaceuticals GmbH, Berlin, Germany
Anna E. Hoffmann, Cathrin Rohleder & F. Markus Leweke
Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
Cathrin Rohleder & F. Markus Leweke
Centenary Institute, The University of Sydney, Sydney, Australia
Long H. Chung, Yingxin C. Jiang & Yanfei Qi
School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, Australia
Anthony S. Don

Authors

Michael Allwright
View author publications
You can also search for this author in PubMed Google Scholar
Boris Guennewig
View author publications
You can also search for this author in PubMed Google Scholar
Anna E. Hoffmann
View author publications
You can also search for this author in PubMed Google Scholar
Cathrin Rohleder
View author publications
You can also search for this author in PubMed Google Scholar
Beverly Jieu
View author publications
You can also search for this author in PubMed Google Scholar
Long H. Chung
View author publications
You can also search for this author in PubMed Google Scholar
Yingxin C. Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Bruno F. Lemos Wimmer
View author publications
You can also search for this author in PubMed Google Scholar
Yanfei Qi
View author publications
You can also search for this author in PubMed Google Scholar
Anthony S. Don
View author publications
You can also search for this author in PubMed Google Scholar
F. Markus Leweke
View author publications
You can also search for this author in PubMed Google Scholar
Timothy A. Couttas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.A.C. and M.A. conceived and designed the project. M.A. developed and uploaded the software tool under the supervision of B.G. Sample acquisition, analysis and interpretation of data was performed by A.E.H., B.J., L.H.C. and Y.C.J., under the supervision of F.M.L., Y.Q. and T.A.C. Writing of the manuscript was performed by T.A.C., with inputs from M.A., B.F.L.W. and A.E.H., C.R., A.S.D., B.G. and F.M.L. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Timothy A. Couttas.

Ethics declarations

Competing interests

The authors declare no competing interests; F.M.L. is a shareholder of curantis UG (Ltd.) and received an Investigator Initiated Trial grant from Endosane Pharmaceuticals GmbH; C.R. is a shareholder of lero bioscience UG (Ltd.); M.A. is the director of Alchem-AI (Ltd.) and a trustee of MyWorld Creative Projects (Charity number: 1197162); B.G. is a director of Pacific Analytics (Pty Ltd.) and SMRTR (Pty Ltd.); B.G. is a an editor at Scientific Reports but was not involved in the peer review or editorial decision-making process for this submission; A.E.H. and C.R. are employees of Endosane Pharmaceuticals GmbH. The remaining authors have nothing to disclose. All research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Supplementary Legends.

Dataset S1.

Dataset S2.

Dataset S3.

Dataset S4.

Dataset S5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Allwright, M., Guennewig, B., Hoffmann, A.E. et al. ReTimeML: a retention time predictor that supports the LC–MS/MS analysis of sphingolipids. Sci Rep 14, 4375 (2024). https://doi.org/10.1038/s41598-024-53860-0

Download citation

Received: 20 November 2023
Accepted: 06 February 2024
Published: 22 February 2024
DOI: https://doi.org/10.1038/s41598-024-53860-0

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.