Guiding the choice of informatics software and tools for lipidomics research applications

Ni, Zhixu; Wölk, Michele; Jukes, Geoff; Mendivelso Espinosa, Karla; Ahrends, Robert; Aimo, Lucila; Alvarez-Jarreta, Jorge; Andrews, Simon; Andrews, Robert; Bridge, Alan; Clair, Geremy C.; Conroy, Matthew J.; Fahy, Eoin; Gaud, Caroline; Goracci, Laura; Hartler, Jürgen; Hoffmann, Nils; Kopczyinki, Dominik; Korf, Ansgar; Lopez-Clavijo, Andrea F.; Malik, Adnan; Ackerman, Jacobo Miranda; Molenaar, Martijn R.; O’Donovan, Claire; Pluskal, Tomáš; Shevchenko, Andrej; Slenter, Denise; Siuzdak, Gary; Kutmon, Martina; Tsugawa, Hiroshi; Willighagen, Egon L.; Xia, Jianguo; O’Donnell, Valerie B.; Fedorova, Maria

doi:10.1038/s41592-022-01710-0

Download PDF

Perspective
Published: 21 December 2022

Guiding the choice of informatics software and tools for lipidomics research applications

Nature Methods volume 20, pages 193–204 (2023)Cite this article

14k Accesses
18 Citations
70 Altmetric
Metrics details

Subjects

Abstract

Progress in mass spectrometry lipidomics has led to a rapid proliferation of studies across biology and biomedicine. These generate extremely large raw datasets requiring sophisticated solutions to support automated data processing. To address this, numerous software tools have been developed and tailored for specific tasks. However, for researchers, deciding which approach best suits their application relies on ad hoc testing, which is inefficient and time consuming. Here we first review the data processing pipeline, summarizing the scope of available tools. Next, to support researchers, LIPID MAPS provides an interactive online portal listing open-access tools with a graphical user interface. This guides users towards appropriate solutions within major areas in data processing, including (1) lipid-oriented databases, (2) mass spectrometry data repositories, (3) analysis of targeted lipidomics datasets, (4) lipid identification and (5) quantification from untargeted lipidomics datasets, (6) statistical analysis and visualization, and (7) data integration solutions. Detailed descriptions of functions and requirements are provided to guide customized data analysis workflows.

A lipidome atlas in MS-DIAL 4

Article 15 June 2020

liputils: a Python module to manage individual fatty acid moieties from complex lipids

Article Open access 07 August 2020

Utilizing Skyline to analyze lipidomics data containing liquid chromatography, ion mobility spectrometry and mass spectrometry dimensions

Article 13 July 2022

Main

Lipidomics is a rapidly growing sub-area of metabolomics, reporting on the generation and metabolism of lipid species in relation to health and disease^1,2,3,4,5. There is increasing interest in the use of lipidomics to identify biomarkers and new targets for intervention in disease progression, as well as to delineate underpinning mechanisms^6,7,8. Over the last 10 years, there has been an explosion in the establishment and application of lipidomics mass spectrometry (MS) approaches for biomedical research. Newer generation MS instruments, particularly high-resolution time-of-flight and orbitrap configurations, enable the generation of large ‘omics’-type datasets that can report on literally thousands of lipids in a single analytical run. With the current drive in the field being to analyze large numbers of samples (for example, blood plasmas and tissue extracts), the amount of data generated experimentally is increasing exponentially. This has led to challenges in both data processing and downstream storage for later (open access) reuse that requires smart computational solutions.

Researchers have responded strongly to the emerging challenges of data analytics in lipidomics through developing new algorithms and tools that enable effective computational processing of data. These tools have already begun to enable the application of new lipidomics methods to the characterization of diverse biological processes, in many cases leading to notable discoveries, and some examples are listed here. Several tools have been applied to the profiling of plasma lipidomes, for example LipidXplorer, LipidFinder, and Lipid Data Analyser (LDA)^9,10. LDA has also contributed to a diverse range of biochemical studies, including adipocyte-derived extracellular vesicle characterization¹¹, determining the role of phosphatidylserine in autophagy¹², analysis of the role of lipids in flavivirus replication¹³, and how the lipid bilayer stabilizes the serotonin receptor¹⁴. Meanwhile, LipidFinder performed an extended clean-up of high-resolution MS data for the first report of the SARS-CoV2 envelope composition¹⁵. As further examples, the Lipid Ontology enrichment web tool, LION/web¹⁶ enabled the investigation of the role of lipids in bone marrow neutrophils during aging¹⁷ and the effect of sex and genetics on the metabolic response to calorie restriction¹⁸. Several of the tools described in this review, including LION/Web and XCMS have enabled the investigation of cellular metabolic states^19,20. Furthermore, XCMS elucidated a role for sphingolipids in neuropathic pain to be identified²¹. Although this is only a small illustrative list of studies using existing tools, the number and diversity of biological applications for lipidomics tools is increasing as more and more researchers enter the field. LIPID MAPS has around 72,000 users globally, with the LIPID MAPS Structure Database downloaded >4,600 times and viewed ~380,000 times in 2021, along with ~2,500 citations in publications during 2020–2021 (Google Scholar, Google Analytics data).

When it comes to the choice of data analysis approaches to use, researchers need to consider the underlying data structure and the research questions being asked. They also need to understand the underlying approaches used by algorithms to determine whether they will perform as expected for their particular data. However, making decisions on the most appropriate software is currently based on ad hoc processes, such as manual searching of the literature and testing packages individually. This is time consuming and inefficient since implementing tools requires extensive training and familiarization. Furthermore, the inappropriate use of software can lead to significant errors, for example, incorrect annotation of lipid identifications or erroneous interpretation of noise as peaks to suggest the presence of lipids in samples.

To address these issues and to support researchers with identification and testing of appropriate computational solutions for lipidomics, LIPID MAPS has generated a Lipidomics Tools Guide (https://www.lipidmaps.org/resources/tools?page=flow_chart), accessible through our home page. This comprises an interactive display that guides researchers towards appropriate solutions and provides detailed descriptions of key features and performance of individual tools, enabling informed decision making on processing pipelines (Fig. 1). To accompany the tool, in this review, we provide full details on the Lipidomics Tools Guide and listed software, together with practical advice from the individual developers relating to the primary and secondary applications of each tool. Additionally, two tutorials provided in the Supplementary Note illustrate the interoperability of different tools, exemplified for targeted and untargeted lipidomics experiments.

**Fig. 1: A screenshot of Lipidomics Tools Guide guiding the choice of the tools organized by the major tasks for lipidomics data processing.**

On the LIPID MAPS website, the tools are represented in the form of an interactive flow chart (https://www.lipidmaps.org/resources/tools?page=flow_chart), which covers available, open-access solutions supported by a graphical user interface (GUI) for different types of lipidomics-derived datasets. This is integrated into LIPID MAPS with links, descriptions, video tutorials, and contact details for software developers. The new tool comprehensively covers the seven major areas in lipidomics data processing, as follows: (1) lipid-oriented databases; (2) MS data repositories; (3) analysis of targeted lipidomics datasets; (4) lipid identification and (5) quantification from untargeted lipidomics datasets; (6) statistical analysis and visualization; and (7) data integration solutions (Fig. 2). To support informed decision making by lipidomics analysts, for each software, a short description is provided, highlighting the main functionalities and the areas of applications, followed by the specific features listed under ‘Technical information’ and ‘Task specific information’ tabs (Figs. 3 and 4). Additionally, users can review simplified, tabular representations of the available functions for each tool in a given section by using the ‘Tools Overview’ tab.

**Fig. 2: List of tools represented within LIPID MAPS Lipidomics Tools Guide assigned for each of the seven data processing categories.**

**Fig. 3: A screenshot example of individual tool details pages from LIPID MAPS Structure Database.**

**Fig. 4: A screenshot example of individual tool details pages LipidLynxX.**

In the ‘Technical information’ section, users can view the type of the license under which the tool is distributed, the availability of desktop and/or web-based interfaces, data input/output formats, and compatibility with different operating systems (for example, Windows, Linux, macOS). There is also information accessible via clickable links, which allow the downloading of the tool together with related documentation, user guides, and training datasets. Additional fields list how to use the tool through the command line or via API interfaces for advanced users wishing to construct their own customized pipelines. ‘Task specific information’ tabs navigate users to pages describing functionalities of the software for particular tasks covering the seven areas outlined above (Figs. 1 and 2). Some comprehensive tools have multiple functions integrated into one combined package and can be configured for a wide range of workflows. These tools are assigned to each task with associated descriptions accordingly, and the list of tools is shown in Table 1. In the next section, we provide more details about each area and its associated software and tools.

Table 1 The list of tools covered by the interactive LIPID MAP Lipidomics Tools Guide assigned with the corresponding task

Full size table

The categories of lipidomics tools

Lipid-oriented databases

Databases that curate individual lipid structures, both from historical and new publications into organized repositories are essential for researchers who aim to identify the specific molecules present in their biological samples. Databases also serve as a foundation for many data analysis pipelines as well as key knowledge bases for lipid research. Over the last 5–10 years, the size of lipidomics research datasets generated using MS and tandem MS (MS/MS) has increased massively, and their routine analysis requires automated programmatic approaches to enable database searching. To support the selection of the databases suitable for a particular application, the ‘Task specific information’ tabs within ‘Lipid Oriented Databases’ section provide an overview of the database functionalities, including the number of included lipid structures, structural ontology, covered lipid (sub)classes, levels of curation and annotation. Automated approaches to support data searchability and utility are described, including used identifiers, structural representation, availability of spectral libraries, and calculated physicochemical properties when available.

The most widely used lipid-specific databases are provided by LIPID MAPS and SwissLipids. LIPID MAPS hosts several databases in which lipid structures are cataloged according to the LIPID MAPS nomenclature and classification^22,23,24. Specific databases provide utility for different use-cases as follows. LIPID MAPS Structure Database (LMSD)²² contains over 47,000 lipids (August 2022) obtained from sources that include experimental work performed by the LIPID MAPS consortium, from other lipid databases, from the scientific literature, and also some that are computationally generated on the basis of commonly occurring fatty acid chains in mammalian lipids. LMSD can return either bulk (lipid species) annotations for MS data, on the basis of the shorthand nomenclature described by Liebisch et al.²⁴, or fully annotated names (structure-defined lipids), where users already have additional structural information, for example, from MS/MS experiments. LMSD has recently implemented a display of reaction data to link together lipid species by biochemical transformations. This was initially obtained from Rhea²⁵, WikiPathways²⁶, Reactome²⁷, and other sources and is now in place for many generic lipids. This is in the process of being cascaded down to individual lipid species. In the case of (high-resolution) MS experiments, the user may only have information on the m/z value of detected lipid ions. In this case, searching databases will provide information on elemental compositions and, using this, generate putative matches. It is recommended to use the BULK search tool on LIPID MAPS to perform this operation since this returns shorthand nomenclature as a first step. Putative matches on the basis of MS indicate the number of carbons in fatty acyl chains and double bonds or rings present, but not how these are distributed between or within acyl chains in the molecule. For some users, the LIPID MAPS Computationally Generated Bulk Lipids (COMP_DB)²² may be a more suitable resource to query. This database contains over 59,000 lipid species in shorthand format (in the major classes such as fatty acyls, glycero- and glycerophospholipids, sterols, and sphingolipids), computationally generated from a list of commonly occurring acyl and alkyl chains. Most entries in this database represent hierarchical structures that could map to many different specific annotations. The LIPID MAPS In Silico Structure Database (LMISSD)²² contains over 1,100,000 entries derived from the computational expansion of headgroups and chains for common lipid classes. These are provided as specific structural annotations but can also be provided as a hierarchy of sum composition and chain composition. Lastly, the Lipidomic Ion Mobility Database²² was developed using data from the McLean and Griffin labs^28,29,30 to provide collisional cross section measurements for drift tube MS experiments.

The SwissLipids knowledgebase³¹ was developed to aid lipidomics researchers in interpreting experimental datasets and integrating them with prior biological knowledge, allowing also for data exploration and hypothesis generation. In SwissLipids, experimentally characterized lipids are curated from peer-reviewed literature using the ChEBI³² ontology (https://www.ebi.ac.uk/chebi/). Lipid metabolism is described using the Rhea knowledgebase²⁵ for biochemical and transport reactions (https://www.rhea-db.org), which is itself based on ChEBI, while enzymes, transporters, and interacting proteins are described using the UniProt Knowledgebase UniProtKB³³ (https://www.uniprot.org), for which Rhea is the reference vocabulary for such annotation³⁴. As the number of experimentally characterized lipid structures represents only a small fraction of the possible structures that may exist in nature, expert-curated knowledge of lipid structures and metabolism in ChEBI, Rhea, and UniProt is used to design and create a library of all theoretically feasible lipid structures in silico, which is fully mapped to these three resources. The current version of the SwissLipids library contains almost 600,000 lipid structures from over 550 lipid classes, organized into two distinct hierarchical lipid classifications—one that parallels the structural classification of LIPID MAPS³⁵ and one that is based on the shorthand notation for MS data³⁶ that links lipid identifications from MS-based experiments to structures and biological knowledge.

MS data repositories

Raw and/or processed data deposition using free repositories services, although a standard task before publication of the results in the field of proteomics for many years, is only now finding its way into the lipidomics community³⁷. MS data repositories increase data transparency and reproducibility, allow reanalysis for new discoveries and data-driven hypothesis generation, as well as benchmarking of new software tools³⁸. Although numerous platforms for upload of raw MS datasets exist (for example, MassIVE (https://massive.ucsd.edu/), ProteomeXchange (http://www.proteomexchange.org/)), specific functionalities to support metadata, sample preparation protocols, and data matrices are necessary to improve the reusability of the deposited datasets following FAIR principles³⁹. To select the optimal solution for data upload or download, users would need to be informed about the types of stored raw, processed, and metadata, curation strategy, total number of available datasets, and species coverage.

Repositories tuned for metabolomics and lipidomics data, such as Metabolomics Workbench⁴⁰ and MetaboLights⁴¹ have the functionality to associate deposited data with compound query results to enhance the reusability of the datasets, allowing further interrogation. Each dataset is assigned a unique project accession ID, sufficient space to host the raw and/or processed data, supported by detailed information, including study design, associated metadata, details on sample preparation, and analysis protocols. Datasets can be browsed and searched by specific keywords, organisms of origin and reported compounds, and are usually associated with a source publication. MetaboLights has unique fields for data transformation and metabolite identification and provides an online viewer to review lipid identifiers, quantities, and corresponding structures, while Metabolomics Workbench is bundled with the RefMet⁴² data resource (containing over 160,000 annotated metabolite species, including a large collection of lipids) and a suite of online data analysis tools. MetaboLights and Metabolomics Workbench are accepted by mainstream journals as data repositories for publications of lipidomics datasets.

Analysis of targeted lipidomics datasets

Lipidomics data acquisition strategies can be generally subdivided into targeted and untargeted workflows. In targeted lipidomics, a predefined set of lipids with a known mass-to-charge ratio (m/z) of the precursor and fragment/product ion(s) need to be provided by the user before data acquisition. Moreover, optimization of ionization and MS parameters for each pair of precursor–product ions (so-called ‘transition’) must be performed to optimize the sensitivity of the method. Targeted analysis using single or multiple reaction monitoring (SRM, MRM) on triple quadrupole instruments and, more recently, parallel reaction monitoring (PRM) on orbitrap and quadrupole time-of-flight instruments are successfully applied to the quantification of selected sets of lipids as well as hundreds of lipids in large sample cohorts (for example, over 600 lipid species in one liquid chromatography (LC)–MS/MS analysis⁴³). However, to quantify a large number of lipids in a correspondingly large sample cohort, targeted lipidomics workflows should be quick to establish, and obtained results should be easy to inspect and validate. This process can be extremely time consuming and most often is not accessible to non-experts. Thus, specialized tools can be used to facilitate both method design and data processing steps. For software-assisted method design, the user should define the type of the targeted acquisition method planned (SRM/MRM or PRM) and lipid (sub)classes/species aimed to be covered. The selection of transitions can be done among experimentally validated or computationally optimized or can be even predicted on-the-fly on the basis of common knowledge of lipid subclass-specific gas phase fragmentation chemistry. The set of fragment ions and their yield will strongly depend on class, the number of double bonds and fatty acyl length and even the type of instrument on which data were acquired. For instance, with LipidCreator⁴⁴ the targeted assay can be generated in three steps. In brief, during step 1 the user would select the lipid category and class to work with and define fatty acyl, double bond, hydroxyl group, and adduct constraints (precursor selection) as well as the polarity mode to analyze lipids of interest. In step 2 the monitored fragments at the MS/MS level can be defined. In step 3 the designed molecules can be added to the target list, reviewed, and transferred to the MS instrument for data acquisition. METLIN-MRM⁴⁵ is another data-rich resource where users can choose from experimentally and/or computationally optimized transitions or even public repository transitions with links to corresponding DOIs.

Although method design requires careful optimization and is time consuming, post-acquisition data processing of targeted lipidomics datasets is relatively straightforward and follows general rules of LC–MS/MS-based targeted quantification accepted in both the proteomics and metabolomics communities. Indeed, several open-access tools originally developed for targeted analysis of peptides (Skyline) or metabolites (XCMS-MRM) have been adapted for lipidomics applications. Thus, LipidCreator is fully integrated with Skyline⁴⁶ for small molecules, making it a vendor-independent software. METLIN-MRM-assisted method development can be directly extended to post-acquisition data processing using the XCMS-MRM⁴⁵ platform. Both Skyline and XCMS-MRM tools provide automated solutions for peak integration, relative and absolute quantification, and data quality control.

Lipid identification from untargeted lipidomics datasets

A second analytical strategy commonly used in lipidomics relates to untargeted workflows on the basis of data-dependent (DDA) or data-independent acquisition (DIA). Here users perform MS analysis of a lipidome in so-called ‘discovery’ mode without prior knowledge of the exact set of lipids to be analyzed in the sample. Generally, the main aim of untargeted lipidomics is to analyze and ideally identify as many lipid species as possible (ultimately all ionizable constituents extracted from the sample). Both DDA and DIA experiments rely on the iteration of instrument cycles that include MS1 survey scans (usually acquired at high resolution to define the elemental composition of the lipid ions) and a number of MS/MS spectra in which lipid ions, selected on the basis of their abundance (DDA) or within a given m/z range (DIA), undergo collision-induced dissociation (CID). MS/MS information is then used to assign lipids to particular molecular species on the basis of their known gas phase fragmentation patterns. Thus, untargeted lipidomics experiments can support lipid identification at different levels of structural assignment with high-resolution MS spectra providing only elemental composition and thus the putative bulk composition of the lipid (for example, PC 36:4), but with additional MS/MS information supporting the identification of lipids at molecular species levels (for example, PC 16:0_20:4). Although this is possible by manually checking MS and corresponding MS/MS spectra, lipid identification requires automated solutions to support analysis throughput, as within commonly used LC–MS/MS DDA setups, thousands of individual MS/MS spectra are generated within a single analysis.

Owing to the high demand and popularity of untargeted lipidomics workflows, numerous tools have been developed to support this area. Thus, the section of the interactive chart for untargeted lipidomics is represented by nine software tools with open access for academic users. By clicking on the corresponding ‘Task specific information’ tabs, users can get familiar with the tools which support specific acquisition strategies only versus other tools which cover larger application areas. To support the selection of the optimal identification tool, the user can select between high-resolution MS applications (Lipid Data Analyzer (LDA)⁴⁷, LipidFinder⁴⁸, MS-DIAL⁴⁹, XCMS online⁵⁰), DDA (LDA^51,52, MS-DIAL, LipidHunter2⁵³, LipidXplorer⁵⁴, Lipostar2⁵⁵, and MZmine^56,57,58), DIA (MS-DIAL and Lipostar2), and even datasets acquired using ion mobility methods, which provide orthogonal to LC–MS/MS separation (MS-DIAL, MZmine, Lipostar2). Furthermore, analysis of epilipidomics datasets focusing on the identification of oxidized lipids can be supported by LDA⁵⁹, Lipostar2, LPPtiger⁶⁰, and MS-DIAL tools. For each particular application listed above, the ‘Task specific information’ tab provides information about the main principles of operation and scoring, and accuracy measures.

Lipid quantification from untargeted lipidomics datasets

The quantification of lipids provides their abundance (relative or absolute) in a biological sample, enabling comparison with other samples. Quantified values aid harmonization across lipidomics datasets. Quantitative analysis can be performed using data acquired from targeted and untargeted approaches regardless of whether they were acquired using Full-MS, DDA, or DIA modes. Untargeted lipidomics quantification can be subdivided into relative (for example, fold change between condition 1 and condition 2) and semi-absolute (for example, expressed in pmol μg⁻¹ of proteins). Owing to the extremely large diversity of lipid structures in natural lipidomes and relatively limited numbers of commercially available lipid standards, it is not feasible to perform absolute quantification at a true lipidome level^61,62. On the other hand, owing to the close similarity in ionization and MS behavior of lipids from the same subclass, the use of one or a small number of internal standards per subclass is currently considered as a compromise. Isotopic correction algorithms can be used during data processing to minimize the effect of structural differences between internal standards and individual lipid molecular species⁶³. Lipids present a particular challenge for accurate identification since there will be several hundreds of lipids distributed over a relatively narrow m/z range (for example, from 400 to 900 m/z), as well as a high number of isobaric and even isomeric species. Additionally, lipids are detected over a large dynamic range of concentrations in natural lipidomes. These issues result in significant challenges for accurate peak assignment and integration and downstream accurate quantification⁶². Tools for processing quantitative lipidomics datasets have benefited from previously developed software solutions designed for quantitative proteomics and metabolomics. However, owing to the special properties of lipids as outlined above, additional optimizations are necessary to ensure the accuracy of lipidomics data processing. For instance, data normalization using a preconfigured set of internal standards (for example, Lipostar2 and MS-DIAL) is introduced to simplify the normalization process and reduce the post-processing of the data matrix. Additionally, robust peak picking and peak boundary selection algorithms are critical for obtaining accurate peak areas for quantitative analysis. Though several robust peak picking algorithms are available, manual adjustment and re-integration is often required because of the high number of isobaric and isomeric species. Additional features integrated within data processing tools such as peak alignment and deconvolution are important to handle lipid species with multiple adducts types and to process DIA datasets. Current available quantification tools such as LDA, Lipostar2, MS-DIAL, MZmine, and XCMS online generally provide integrated pipelines from lipid identification up to quantification including essential normalization functions. For each tool, the ‘Task specific information’ section within the LIPID MAPS Lipidomics Tools Guide displays multiple features to guide the choice of the tool on the basis of user requirements, including details on quantification methods and accuracy measures.

Statistical analysis and visualization of lipidomics datasets

Lipidomics research generates large datasets, and the complexity of experimental design is also increasing. Therefore, a critical bottleneck in lipidomics data processing is often the statistical analysis, which requires extensive use of tailored approaches that take into account the specific characteristics of lipid data. Different methods are available for the analysis of lipidomics data, each one with its own advantages and pitfalls. The choice of statistical methods to be applied should be first guided by the aim of the lipidomic study. When testing for statistical significance between predefined groups is desired (for example, health versus disease), differences between groups of samples are usually evaluated by applying parametric (for example, t-test, ANOVA) or non-parametric (for example, Wilcoxon signed-rank test, Kruskal–Wallis) statistical hypothesis tests⁶⁴. With thousands of lipids being considered in some lipidomics experiments, the high number of variables increases the chance to find spurious correlated variables (false positives). Therefore, correction for multiple comparison testing is required. In addition, in lipidomics, variables (lipids) are usually not all truly independent (for example, one lipid can be represented by several ions or adducts), meaning that corrections commonly applied for genomics/transcriptomics, such as Bonferroni or Benjamini–Hochberg can significantly overcorrect. Here, softer corrections, such as sequential goodness of fit, represent an alternative that may be more appropriate⁶⁵.

Another consideration is that detected features might not always follow a normal distribution⁶⁶. Thus, multivariate statistical approaches, in which all the variables are considered simultaneously, often by assuming they are correlated and not fully independent, are extensively applied in lipidomics. For explorative purposes, principal component analysis (PCA)⁶⁷ represents the most widely used approach in omics, including lipidomics⁶⁸. Using PCA, the original dataset is represented in a lower-dimensional subspace that maintains most of the relevant information (variance). Being an unsupervised method, PCA does not require a priori knowledge of the dataset and can be used not only to explore clusters of samples eventually formed but also for interpretation without imposing any information on classification or cluster association. Hierarchical or non-hierarchical clustering methods aim at grouping samples by similarity, which is measured utilizing statistical distances or similarities between samples⁶⁹. Supervised regression algorithms for dimensionality reduction, such as linear discriminant analysis^70,71 or partial least squares discriminant analysis^72,73, are also available to evaluate and classify sample identity. In addition to partial least squares methods, other machine learning approaches have been also used in lipidomics applications. Among them, supervised methods like support vector machine⁷⁴ and random forest⁷⁵ were used for classification purposes and can also be used for feature selection. Despite the wide availability of statistical tools applied to lipidomics, several potential issues need to be considered. For example, in large studies, the so-called ‘batch effect’ can hamper statistical analysis, and correction with internal standards and/or quality controls must be made before the application of statistical tools. Also, missing data, which are the result of molecule concentrations below detection limits and very common in lipidomics, can be detrimental in model generation and interpretation, with some tools more sensitive than others⁶⁸. Nevertheless, several strategies for missing data imputation have been proposed⁷⁶.

Generally, the multi-functional tools described above for quantitative lipidomics all provide integrated platforms for statistical analysis and data visualization (LDA, Lipostar2, MS-DIAL, MZmine, and XCMS online). Additionally, several tools were specifically developed to support chemometrics analysis and result visualization of metabolomics and lipidomics data (LIPID MAPS Statistical Analysis Tools⁷⁷ and MetaboAnalyst 5.0⁷⁸). Integrated statistical analysis and visualization functions provide easy access to most common functions, including univariate (parametric and non-parametric testing) as well as multivariate (non-supervised and supervised) solutions with a close interactive connection to the corresponding lipid quantification data matrix and often bundled with data pretreatments including normalization, scaling, and visualization of filtered data subset. Dedicated tools (LIPID MAPS Statistical Analysis Tools and MetaboAnalyst 5.0) might require researchers to transform the quantification data according to specific templates for dataset import but can provide a more extensive set of statistical and visualization functions with detailed customizable configurations. MetaboAnalyst 5.0, for instance, has a dedicated utility for batch-effect correction, which contains nine methods well established in the field of metabolomics as well as eight methods for missing value imputation⁷⁹.

Data integration solutions

The ultimate aim of many lipidomics studies is to investigate biological relevance and mechanisms behind lipidome remodeling driven by the specific biological conditions. Considering the nature of ‘big data’ produced by lipidomics experiments, manual evaluation of the biological significance of obtained results would be extremely time consuming and require extensive knowledge in diverse areas of biochemistry and cell biology. Such advanced data integration goes well beyond single lipidomics data matrices and extends into related multiomics approaches using curated pathways or network analysis strategies. The combination and utilization of multiomics data from different sources require sophisticated data pretreatments, including manual curation and advanced bioinformatics solutions. This type of workflow can be generally divided into three steps: conversion of lipid annotations to their corresponding IDs within knowledge and ontology databases, lipid ontology enrichment, and advanced pathway/network analysis.

Tools that are capable of bridging lipid annotations supported by purely lipidomics software with the structural or functional IDs in data integration tools provide the first critical step towards systems biology integration of lipidomics datasets. To reduce the complexity of ID cross-validation and database queries, several tools are available to assist this conversion (Goslin⁸⁰, LipidLynxX⁸¹, and RefMet⁴²) and to link lipid identifiers to various databases (BridgeDb⁸², Goslin, LipidLynxX, and RefMet). For example, BridgeDb has mappings to other databases for almost 19,000 LIPID MAPS identifiers (https://doi.org/10.6084/m9.figshare.13550384.v1).

Biological interpretation of lipidomics data is often driven by the focus on individual lipids. Although this approach is useful in biomarker discovery, it obscures the possible effects of shared properties of molecules related to the biological phenomenon. A way to circumvent this is to manually curate lipid groups that share specific properties (for example, lipid class, level of unsaturation) and report aggregate statistics. However, the manual construction of these groups is often laborious owing to the ambiguity in lipid nomenclature and introduces a risk of cherry-picking. Ontologies, formalizations of concepts, and their relations have been successful in other omics fields to provide frameworks for constructing groups of molecules with shared biological properties. For lipidomics data, several ontologies, such as lipid ontology (LION/Web)¹⁶ and Lipid Mini-On⁸³, are useful in aiding in the biological interpretation. Currently, LION links over 50,000 lipids to chemical (for example, LIPID MAPS classification and fatty acid associations), physiochemical (for example, membrane fluidity and intrinsic curvature), and cell biological (for example, predominant subcellular localization) properties and Lipid Mini-On uses a text mining strategy to attribute Lipid Ontology structural terms to lipids.

Typically, ontology-derived groups of molecules (‘terms’) are analyzed using enrichment analysis approaches. In these analyses, a given term is enriched if the molecules belonging to the term are overrepresented in a target list or are higher ranked in a list of molecules ordered by a statistic (for example, fold change and P value) than expected by chance. Both LION/web and Lipid Mini-On are freely available online tools that perform ontology-term enrichment analysis of user-provided lipidomics data. LION/web allows specific LION-term categories to be included for analysis. After submission, LION/web reports descriptive matching statistics and enrichment analysis, as well as publication-ready figures. Traditionally, enrichment analyses compare two groups of samples. To analyze datasets with more sample groups, LION/web was recently expanded with the PCA-LION heat map module. This module generates a heat map showing the most dynamic LION-terms for all samples on the basis of the enrichment analysis of a given number of principle components. Lipid IDs of significantly enriched terms can be further mapped to available pathways and networks to investigate the changes at the systems level. Lipid Mini-On enables to generate a variety of visualization of lipid enrichment by structural characteristics. Lipids and their associated lipid ontology terms can be visualized as a network to hierarchize interpretations of the enrichment performed.

Several tools are available to support pathway and network analysis of lipidomics datasets, including integrated pathway graph analysis modules in Lipostar2 and stand-alone web application BioPAN⁸⁴, which allows the visualization of quantitative lipidomics data in the context of known biosynthetic pathways, as well as the central hub of community-driven pathways represented by the Lipid Portal on WikiPathways²⁶, in collaboration with LIPID MAPS. Though more advanced analysis can be performed with highly customized programs and scripts by experienced bioinformaticians, these tools provide simple interfaces for researchers to begin to map lipidomics data to obtain essential lipid centric analysis results from predefined pathways and networks in, for example, PathVisio⁸⁵ and Cytoscape⁸⁶. Furthermore, the pathways from WikiPathways can be easily converted to a network through the WikiPathways App⁸⁷, after which these networks can be extended with additional knowledge such as micro RNAs, transcription factors, or drugs through the CyTargetLinker⁸⁸ app.

Conclusion

Lipidomics is a fast-growing field that is increasingly supporting the analysis of ever larger datasets of high complexity. To assist with high-throughput data processing, many new software tools have been developed by academic researchers and are now openly available on developers’ websites. To guide the user and provide a point of contact for finding these tools, in this review, we provide detailed specifications on the most widely used software packages for lipidomics along with a complementary interactive Lipidomics Tools Guide available on LIPID MAPS. Two tutorials are provided as Supplementary Notes to exemplify the interoperability of the Guide and how to combine different tools for targeted and untargeted lipidomics experiments. This portal can help researchers to construct a complete lipidomics data analysis workflow starting with lipid identification and quantification till advanced visualization and data integration using open-access software solutions with the clickable graphical user interface. The Lipidomics Tools Guide will be regularly reviewed and updated to reflect new developments in the field as well as continuous support for the listed tools. Moreover, the Guide can be updated upon request by authors of the software within the scope of this resource. The LIPID MAPS interactive Lipidomics Tools Guide (https://www.lipidmaps.org/resources/tools?page=flow_chart) summarizes essential information about each tools to assist beginners in lipidomics as well as advanced data scientists in selecting the most suitable tool for each of the steps in the processing of MS-derived lipidomics data.

References

Wenk, M. R. The emerging field of lipidomics. Nat. Rev. Drug Discov. 4, 594–610 (2005).
Article CAS PubMed Google Scholar
Yang, K. & Han, X. Lipidomics: techniques, applications, and outcomes related to biomedical sciences. Trends Biochem. Sci. 41, 954–969 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wood, P. L. Lipidomics of Alzheimer’s disease: current status. Alzheimer’s Res. Ther. 4, 1–10 (2012).
Google Scholar
Meikle, P. J., Wong, G., Barlow, C. K. & Kingwell, B. A. Lipidomics: potential role in risk prediction and therapeutic monitoring for diabetes and cardiovascular disease. Pharmacol. Therapeutics 143, 12–23 (2014).
Article CAS Google Scholar
Yang, L. et al. Recent advances in lipidomics for disease research. J. Sep. Sci. 39, 38–50 (2016).
Article CAS PubMed Google Scholar
Watson, A. D. Thematic review series: systems biology approaches to metabolic and cardiovascular disorders. Lipidomics: a global approach to lipid analysis in biological systems. J. Lipid Res. 47, 2101–2111 (2006).
Article CAS PubMed Google Scholar
Hu, C. et al. Analytical strategies in lipidomics and applications in disease biomarker discovery. J. Chromatogr. B 877, 2836–2846 (2009).
Article CAS Google Scholar
Shevchenko, A. & Simons, K. Lipidomics: coming to grips with lipid diversity. Nat. Rev. Mol. Cell Biol. 11, 593–598 (2010).
Article CAS PubMed Google Scholar
Vvedenskaya, O. et al. Nonalcoholic fatty liver disease stratification by liver lipidomics. J. Lipid Res. 62, 100104–100105 (2021).
Article CAS PubMed PubMed Central Google Scholar
Vvedenskaya, O., Wang, Y., Ackerman, J. M., Knittelfelder, O. & Shevchenko, A. Analytical challenges in human plasma lipidomics: a winding path towards the truth. Trends Anal. Chem. 120, 115277 (2019).
Article Google Scholar
Durcin, M. et al. Characterisation of adipocyte-derived extracellular vesicle subtypes identifies distinct protein and lipid signatures for large and small extracellular vesicles. J. Extracell. Vesicles 6, 1305677 (2017).
Article PubMed PubMed Central Google Scholar
Durgan, J. et al. Non-canonical autophagy drives alternative ATG8 conjugation to phosphatidylserine. Mol. Cell 81, 2031–2040 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhuang, X. et al. The circadian clock components BMAL1 and REV-ERBα regulate flavivirus replication. Nat. Commun. 10, 1–13 (2019).
Article Google Scholar
Zhang, Y. et al. Asymmetric opening of the homopentameric 5-HT3A serotonin receptor in lipid bilayers. Nat. Commun. 12, 1–15 (2021).
Google Scholar
Saud, Z. et al. The SARS-CoV2 envelope differs from host cells, exposes procoagulant lipids, and is disrupted in vivo by oral rinses. J. Lipid Res. 63, 100208 (2022).
Article CAS PubMed PubMed Central Google Scholar
Molenaar, M. R. et al. LION/web: a web-based ontology enrichment tool for lipidomic data analysis. Gigascience 8, 1–10 (2019).
Article CAS Google Scholar
Lu, R. J. et al. Multi-omic profiling of primary mouse neutrophils predicts a pattern of sex- and age-related functional regulation. Nat. Aging 1, 715–733 (2021).
Article PubMed PubMed Central Google Scholar
Green, C. L. et al. Sex and genetic background define the metabolic, physiologic, and molecular response to protein restriction. Cell Metab. 34, 209–226 (2022).
Article CAS PubMed PubMed Central Google Scholar
Beyer, B. A. et al. Metabolomics-based discovery of a metabolite that enhances oligodendrocyte maturation. Nat. Chem. Biol. 14, 22–28 (2017).
Article PubMed PubMed Central Google Scholar
Rappez, L. et al. SpaceM reveals metabolic states of single cells. Nat. Methods 18, 799–805 (2021).
Article CAS PubMed PubMed Central Google Scholar
Patti, G. J. et al. Metabolomics implicates altered sphingolipids in chronic pain of neuropathic origin. Nat. Chem. Biol. 8, 232–234 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sud, M. et al. LMSD: LIPID MAPS structure database. Nucleic Acids Res. 35, D527–D532 (2007).
Article CAS PubMed Google Scholar
Fahy, E. et al. Update of the LIPID MAPS comprehensive classification system for lipids. J. Lipid Res. 50, S9–S14 (2009). vol.
Article PubMed PubMed Central Google Scholar
Liebisch, G. et al. Update on LIPID MAPS classification, nomenclature and shorthand notation for MS-derived lipid structures. J. Lipid Res. 61, 1539–1555 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bansal, P. et al. Rhea, the reaction knowledgebase in 2022. Nucleic Acids Res. 50, D693–D700 (2022).
Article CAS PubMed Google Scholar
Martens, M. et al. WikiPathways: connecting communities. Nucleic Acids Res. 49, D613–D621 (2021).
Article CAS PubMed Google Scholar
Gillespie, M. et al. The reactome pathway knowledgebase 2022. Nucleic Acids Res. 50, D687–D692 (2022).
Article CAS PubMed Google Scholar
Stow, S. M. et al. An interlaboratory evaluation of drift tube ion mobility-mass spectrometry collision cross section measurements. Anal. Chem. 89, 9048–9055 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hinz, C. et al. A comprehensive UHPLC ion mobility quadrupole time-of-flight method for profiling and quantification of eicosanoids, other oxylipins, and fatty acids. Anal. Chem. 91, 8025–8035 (2019).
Article CAS PubMed PubMed Central Google Scholar
Leaptrot, K. L., May, J. C., Dodds, J. N. & McLean, J. A. Ion mobility conformational lipid atlas for high confidence lipidomics. Nat. Commun. 10, 985 (2019).
Article PubMed PubMed Central Google Scholar
Aimo, L. et al. The SwissLipids knowledgebase for lipid biology. Bioinformatics 31, 2860–2866 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hastings, J. et al. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucleic Acids Res. 44, D1214–D1219 (2016).
Article CAS PubMed Google Scholar
Bateman, A. et al. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489 (2021).
Article Google Scholar
Morgat, A. et al. Enzyme annotation in UniProtKB using Rhea. Bioinformatics 36, 1896–1901 (2020).
CAS PubMed Google Scholar
Fahy, E. et al. A comprehensive classification system for lipids. J. Lipid Res. 46, 839–861 (2005).
Article CAS PubMed Google Scholar
Liebisch, G. et al. Shorthand notation for lipid structures derived from mass spectrometry. J. Lipid Res. 54, 1523–1530 (2013).
Article CAS PubMed PubMed Central Google Scholar
O’Donnell, V. B. et al. Steps toward minimal reporting standards for lipidomics mass spectrometry in biomedical research publications. Circulation: Genom. Precis. Med. 13, e003019 (2020).
Google Scholar
Tsugawa, H. et al. Mass spectrometry data repository enhances novel metabolite discoveries with advances in computational metabolomics. Metabolites 9, 119 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016).
Article PubMed PubMed Central Google Scholar
Sud, M. et al. Metabolomics Workbench: an international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools. Nucleic Acids Res. 44, D463–D470 (2016).
Article CAS PubMed Google Scholar
Haug, K. et al. MetaboLights: a resource evolving in response to the needs of its scientific community. Nucleic Acids Res. 48, D440–D444 (2020).
CAS PubMed Google Scholar
Fahy, E. & Subramaniam, S. RefMet: a reference nomenclature for metabolomics. Nat. Methods 17, 1173–1174 (2020).
Article CAS PubMed Google Scholar
Huynh, K. et al. High-throughput plasma lipidomics: detailed mapping of the associations with cardiometabolic risk factors. Cell Chem. Biol. 26, 71–84.e4 (2019).
Article CAS PubMed Google Scholar
Peng, B. et al. LipidCreator workbench to probe the lipidomic landscape. Nat. Commun. 11, 1–14 (2020).
Article Google Scholar
Domingo-Almenara, X. et al. XCMS-MRM and METLIN-MRM: a cloud library and public resource for targeted analysis of small molecules. Nat. Methods 15, 681–684 (2018).
Article CAS PubMed PubMed Central Google Scholar
Adams, K. J. et al. Skyline for small molecules: a unifying software package for quantitative metabolomics. J. Proteome Res. 19, 1447–1458 (2020).
Article CAS PubMed PubMed Central Google Scholar
Hartler, J. et al. Lipid Data Analyzer: unattended identification and quantitation of lipids in LC–MS data. Bioinformatics 27, 572–577 (2010).
Article PubMed Google Scholar
Fahy, E. et al. LipidFinder on LIPID MAPS: peak filtering, MS searching and statistical analysis for lipidomics. Bioinformatics 35, 685–687 (2018).
PubMed Central Google Scholar
Tsugawa, H. et al. A lipidome atlas in MS-DIAL 4. Nat. Biotechnol. 38, 1159–1163 (2020).
Article CAS PubMed Google Scholar
Tautenhahn, R., Patti, G. J., Rinehart, D. & Siuzdak, G. XCMS online: A web-based platform to process untargeted metabolomic data. Anal. Chem. 84, 5035–5039 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hartler, J. et al. Deciphering lipid structures based on platform-independent decision rules. Nat. Methods 14, 1171–1174 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hartler, J. et al. Automated annotation of sphingolipids including accurate identification of hydroxylation sites using MS nData. Anal. Chem. 92, 14054–14062 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ni, Z., Angelidou, G., Lange, M., Hoffmann, R. & Fedorova, M. LipidHunter identifies phospholipids by high-throughput processing of LC–MS and shotgun lipidomics datasets. Anal. Chem. 89, 8800–8807 (2017).
Article CAS PubMed Google Scholar
Herzog, R. et al. LipidXplorer: a software for consensual cross-platform lipidomics. PLoS One 7, e29851 (2012).
Article CAS PubMed PubMed Central Google Scholar
Goracci, L. et al. Lipostar, a comprehensive platform-neutral cheminformatics tool for lipidomics. Anal. Chem. 89, 6257–6264 (2017).
Article CAS PubMed Google Scholar
Pluskal, T., Castillo, S., Villar-Briones, A. & Orešič, M. MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinf. 11, 1–11 (2010).
Article Google Scholar
Korf, A. et al. Three-dimensional Kendrick mass plots as a tool for graphical lipid identification. Rapid Commun. Mass Spectrom. 32, 981–991 (2018).
Article CAS PubMed Google Scholar
Korf, A., Jeck, V., Schmid, R., Helmer, P. O. & Hayen, H. Lipid species annotation at double bond position level with custom databases by extension of the MZmine 2 open-source software package. Anal. Chem. 91, 5098–5105 (2019).
Article CAS PubMed Google Scholar
Krettler, C. A., Hartler, J. & Thallinger, G. G. Identification and quantification of oxidized lipids in LC–MS lipidomics data. Stud. Health Technol. Inform. 271, 39–48 (2020).
PubMed Google Scholar
Ni, Z., Angelidou, G., Hoffmann, R. & Fedorova, M. LPPtiger software for lipidome-specific prediction and identification of oxidized phospholipids from LC-MS datasets. Sci. Rep. 7, 15138 (2017).
Article PubMed PubMed Central Google Scholar
Wang, M., Wang, C. & Han, X. Selection of internal standards for accurate quantification of complex lipid species in biological extracts by electrospray ionization mass spectrometry—what, how and why? Mass Spectrom. Rev. 36, 693–714 (2017).
Article PubMed Google Scholar
Khoury, S. et al. Quantification of lipids: model, reality, and compromise. Biomolecules 8, 174 (2018).
Article PubMed PubMed Central Google Scholar
Lange, M. & Fedorova, M. Evaluation of lipid quantification accuracy using HILIC and RPLC MS on the example of NIST® SRM® 1950 metabolites in human plasma. Anal. Bioanal. Chem. 412, 3573–3584 (2020).
Article CAS PubMed PubMed Central Google Scholar
Miller, J. N. and Miller, J. C. Statistics and Chemometrics for Analytical Chemistry 4th edn, Ch. 7 (Pearson Education, 2000).
Carvajal-Rodríguez, A., de Uña-Alvarez, J. & Rolán-Alvarez, E. A new multitest correction (SGoF) that increases its statistical power when increasing the number of tests. BMC Bioinf. 10, 209 (2009).
Article Google Scholar
Griffin, J. L., Liggi, S. & Hall, Z. in Lipidomics (eds Griffiths, W. & Wang, Y.) 25–48 (RSC, 2020).
Wold, S., Esbensen, K. & Geladi, P. Principal component analysis. Chemom. Intell. Lab. Syst. 2, 37–52 (1987).
Article CAS Google Scholar
Checa, A., Bedia, C. & Jaumot, J. Lipidomic data analysis: tutorial, practical guidelines and applications. Anal. Chim. Acta 885, 1–16 (2015).
Article CAS PubMed Google Scholar
Kaya Gülağız, F. & Şahin, S. Comparison of hierarchical and non-hierarchical clustering algorithms. Int. J. Computer Eng. Inf. Technol. 9, 6–14 (2017).
Google Scholar
Mika, S., Ratsch, G., Weston, J., Scholkopf, B. & Muller, K. R. Fisher discriminant analysis with kernels. In Neural Networks for Signal Processing - Proceedings of the IEEE Workshop (IEEE, 1999).
Tharwat, A., Gaber, T., Ibrahim, A. & Hassanien, A. E. Linear discriminant analysis: a detailed tutorial. AI Commun. 30, 169–190 (2017).
Article Google Scholar
Wold, S., Sjöström, M. & Eriksson, L. PLS-regression: a basic tool of chemometrics. Chemom. Intell. Lab. Syst. 58, 109–130 (2001).
Article CAS Google Scholar
Lee, L. C., Liong, C. Y. & Jemain, A. A. Partial least squares-discriminant analysis (PLS-DA) for classification of high-dimensional (HD) data: a review of contemporary practice strategies and knowledge gaps. Analyst 143, 3526–3539 (2018).
Article CAS PubMed Google Scholar
Cortes, C., Vapnik, V. & Saitta, L. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Gromski, P. S. et al. Influence of missing values substitutes on multivariate analysis of metabolomics data. Metabolites 4, 433–452 (2014).
Article PubMed PubMed Central Google Scholar
Fahy, E., Sud, M., Cotter, D. & Subramaniam, S. LIPID MAPS online tools for lipid research. Nucleic Acids Res. 35, W606–W612 (2007).
Article PubMed PubMed Central Google Scholar
Pang, Z. et al. MetaboAnalyst 5.0: narrowing the gap between raw spectra and functional insights. Nucleic Acids Res. 49, W388–W396 (2021).
Article CAS PubMed PubMed Central Google Scholar
Stacklies, W., Redestig, H., Scholz, M., Walther, D. & Selbig, J. pcaMethods-a Bioconductor package providing PCA methods for incomplete data. Bioinformatics 23, 1164–1167 (2007).
Article CAS PubMed Google Scholar
Kopczynski, D., Hoffmann, N., Peng, B. & Ahrends, R. Goslin: a grammar of succinct lipid nomenclature. Anal. Chem. 92, 10957–10960 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ni, Z. & Fedorova, M. LipidLynxX: a data transfer hub to support integration of large scale lipidomics datasets. Preprint at bioRxiv https://doi.org/10.1101/2020.04.09.033894 (2020).
Ding, D. et al. The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services. BMC Bioinf. 11, 5 (2010).
Article Google Scholar
Clair, G. et al. Lipid Mini-On: mining and ontology tool for enrichment analysis of lipidomic data. Bioinformatics 35, 4507–4508 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gaud, C. et al. BioPAN: a web-based tool to explore mammalian lipidome metabolic pathways on LIPID MAPS. F1000Research 10, 4 (2021).
Article PubMed PubMed Central Google Scholar
Kutmon, M. et al. PathVisio 3: an extendable pathway analysis toolbox. PLoS Comput. Biol. 11, e1004085 (2015).
Article PubMed PubMed Central Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Kutmon, M., Lotia, S., Evelo, C. T. & Pico, A. R. WikiPathways app for Cytoscape: making biological pathways amenable to network analysis and visualization. F1000Research 3, 152 (2014).
Article PubMed PubMed Central Google Scholar
Kutmon, M., Ehrhart, F., Willighagen, E. L., Evelo, C. T. & Coort, S. L. CyTargetLinker app update: a flexible solution for network extension in Cytoscape. F1000Research 7, 743 (2019).
Article Google Scholar

Download references

Acknowledgements

This publication is based upon work from COST Action EpiLipidNET, Pan-European Network in Lipidomics and Epilipidomics (CA19105; https://www.epilipid.net), supported by COST (European Cooperation in Science and Technology). Funding from the Wellcome Trust is gratefully acknowledged for LIPID MAPS (203014/Z/16/Z). LIPID MAPS is grateful for sponsorship from Cayman Chemical, Merck and Avanti Polar Lipids. T.P. is supported by the Czech Science Foundation Grant 21-11563M. Funding from the FWF P33298-B and Human Frontiers Science Progam RGP0002/2022 is gratefully acknowledged. ‘Sonderzuweisung zur Unterstützung profilbestimmender Struktureinheiten 2021’ by the SMWK and Deutsche Forschungsgemeinschaft (FE 1236/5-1 to M.F.) are gratefully acknowledged. JSPS KAKENHI (21K18216 to H.T.), the National Cancer Center Research and Development Fund (2020-A-9, H.T.), AMED Japan Program for Infectious Diseases Research and Infrastructure (21wm0325036h0001 to H.T.), JST National Bioscience Database Center (NBDC to H.T.), JST ERATO ‘Arita Lipidome Atlas Project’ (JPMJER2101 to H.T.).

Author information

Authors and Affiliations

Center of Membrane Biochemistry and Lipid Research, Faculty of Medicine Carl Gustav Carus of TU Dresden, Dresden, Germany
Zhixu Ni, Michele Wölk & Maria Fedorova
Systems Immunity Research Institute, School of Medicine, Cardiff University, Cardiff, UK
Geoff Jukes, Karla Mendivelso Espinosa, Jorge Alvarez-Jarreta, Robert Andrews, Matthew J. Conroy & Valerie B. O’Donnell
Department of Analytical Chemistry, University of Vienna, Vienna, Austria
Robert Ahrends & Dominik Kopczyinki
Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva, Switzerland
Lucila Aimo & Alan Bridge
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Jorge Alvarez-Jarreta, Adnan Malik & Claire O’Donovan
Babraham Institute, Babraham Research Campus, Cambridge, UK
Simon Andrews, Caroline Gaud & Andrea F. Lopez-Clavijo
Biological Science Division, Pacific Northwest National Laboratory, Richland, WA, USA
Geremy C. Clair
Department of Bioengineering, University of California, San Diego, CA, USA
Eoin Fahy
Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy
Laura Goracci
Institute of Pharmaceutical Sciences, University of Graz, Graz, Austria
Jürgen Hartler
Field of Excellence BioHealthe—University of Graz, Graz, Austria
Jürgen Hartler
Center for Biotechnology, University of Bielefeld, Bielefeld, Germany
Nils Hoffmann
Bruker Daltonics GmbH & Co. KG, Bremen, Germany
Ansgar Korf
Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
Jacobo Miranda Ackerman & Andrej Shevchenko
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Martijn R. Molenaar
Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
Tomáš Pluskal
Department of Bioinformatics – BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
Denise Slenter, Martina Kutmon & Egon L. Willighagen
Scripps Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, La Jolla, CA, USA
Gary Siuzdak
Maastricht Centre for Systems Biology, Maastricht University, Maastricht, The Netherlands
Martina Kutmon
Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, Tokyo, Japan
Hiroshi Tsugawa
RIKEN Center for Sustainable Resource Science, Yokohama, Japan
Hiroshi Tsugawa
RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Hiroshi Tsugawa
Graduate School of Medical Life Science, Yokohama City University, Yokohama, Japan
Hiroshi Tsugawa
Institute of Parasitology, McGill University, Montreal, Canada
Jianguo Xia

Authors

Zhixu Ni
View author publications
You can also search for this author in PubMed Google Scholar
Michele Wölk
View author publications
You can also search for this author in PubMed Google Scholar
Geoff Jukes
View author publications
You can also search for this author in PubMed Google Scholar
Karla Mendivelso Espinosa
View author publications
You can also search for this author in PubMed Google Scholar
Robert Ahrends
View author publications
You can also search for this author in PubMed Google Scholar
Lucila Aimo
View author publications
You can also search for this author in PubMed Google Scholar
Jorge Alvarez-Jarreta
View author publications
You can also search for this author in PubMed Google Scholar
Simon Andrews
View author publications
You can also search for this author in PubMed Google Scholar
Robert Andrews
View author publications
You can also search for this author in PubMed Google Scholar
Alan Bridge
View author publications
You can also search for this author in PubMed Google Scholar
Geremy C. Clair
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J. Conroy
View author publications
You can also search for this author in PubMed Google Scholar
Eoin Fahy
View author publications
You can also search for this author in PubMed Google Scholar
Caroline Gaud
View author publications
You can also search for this author in PubMed Google Scholar
Laura Goracci
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Hartler
View author publications
You can also search for this author in PubMed Google Scholar
Nils Hoffmann
View author publications
You can also search for this author in PubMed Google Scholar
Dominik Kopczyinki
View author publications
You can also search for this author in PubMed Google Scholar
Ansgar Korf
View author publications
You can also search for this author in PubMed Google Scholar
Andrea F. Lopez-Clavijo
View author publications
You can also search for this author in PubMed Google Scholar
Adnan Malik
View author publications
You can also search for this author in PubMed Google Scholar
Jacobo Miranda Ackerman
View author publications
You can also search for this author in PubMed Google Scholar
Martijn R. Molenaar
View author publications
You can also search for this author in PubMed Google Scholar
Claire O’Donovan
View author publications
You can also search for this author in PubMed Google Scholar
Tomáš Pluskal
View author publications
You can also search for this author in PubMed Google Scholar
Andrej Shevchenko
View author publications
You can also search for this author in PubMed Google Scholar
Denise Slenter
View author publications
You can also search for this author in PubMed Google Scholar
Gary Siuzdak
View author publications
You can also search for this author in PubMed Google Scholar
Martina Kutmon
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Tsugawa
View author publications
You can also search for this author in PubMed Google Scholar
Egon L. Willighagen
View author publications
You can also search for this author in PubMed Google Scholar
Jianguo Xia
View author publications
You can also search for this author in PubMed Google Scholar
Valerie B. O’Donnell
View author publications
You can also search for this author in PubMed Google Scholar
Maria Fedorova
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The manuscript was written with contributions from all authors who have given approval to the final version of the manuscript. M.F. and V.B.O. conceived the original idea and supervised the project. Z.N., M.F., and V.B.O. contributed to review of the tools and data collection. M.F., Z.N, and M.W. contributed to figures. M.W. designed and tested the workflows and tutorials. G.J., K.M.E., and R.An. developed the website with the help from Z.N. and M.F. Software developers Z.N., R.Ah., L.A., J.A.J., S.A., A.B., G.C.C., M.J.C., E.F., C.G., L.G., J.H., N.H., D.K., A.K., A.F.L., A.M., J.M.A., M.R.M., C.O., T.P., A.S., D.S, G.S., M.K., H.T., E.L.W., and J.X. contributed to sections on software tools in both the manuscript and website.

Corresponding authors

Correspondence to Valerie B. O’Donnell or Maria Fedorova.

Ethics declarations

Competing interests

A.K. is employed at Bruker Daltonics GmbH & Co. KG, Bremen, Germany. Other authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Arunima Singh, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Note

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ni, Z., Wölk, M., Jukes, G. et al. Guiding the choice of informatics software and tools for lipidomics research applications. Nat Methods 20, 193–204 (2023). https://doi.org/10.1038/s41592-022-01710-0

Download citation

Received: 24 May 2022
Accepted: 02 November 2022
Published: 21 December 2022
Issue Date: February 2023
DOI: https://doi.org/10.1038/s41592-022-01710-0

This article is cited by

From big data to big insights: statistical and bioinformatic approaches for exploring the lipidome
- Jessie R. Chappel
- Kaylie I. Kirkwood-Donelson
- Erin S. Baker
Analytical and Bioanalytical Chemistry (2024)
Untargeted hair lipidomics: comprehensive evaluation of the hair-specific lipid signature and considerations for retrospective analysis
- Maria van de Lavoir
- Katyeny Manuela da Silva
- Adrian Covaci
Analytical and Bioanalytical Chemistry (2023)