Urinary metabolic phenotyping for Alzheimer’s disease

Kurbatova, Natalja; Garg, Manik; Whiley, Luke; Chekmeneva, Elena; Jiménez, Beatriz; Gómez-Romero, María; Pearce, Jake; Kimhofer, Torben; D’Hondt, Ellie; Soininen, Hilkka; Kłoszewska, Iwona; Mecocci, Patrizia; Tsolaki, Magda; Vellas, Bruno; Aarsland, Dag; Nevado-Holgado, Alejo; Liu, Benjamine; Snowden, Stuart; Proitsi, Petroula; Ashton, Nicholas J.; Hye, Abdul; Legido-Quigley, Cristina; Lewis, Matthew R.; Nicholson, Jeremy K.; Holmes, Elaine; Brazma, Alvis; Lovestone, Simon

doi:10.1038/s41598-020-78031-9

Download PDF

Article
Open access
Published: 10 December 2020

Urinary metabolic phenotyping for Alzheimer’s disease

Natalja Kurbatova¹,
Manik Garg¹,
Luke Whiley^2,3,
Elena Chekmeneva²,
Beatriz Jiménez²,
María Gómez-Romero²,
Jake Pearce²,
Torben Kimhofer⁴,
Ellie D’Hondt⁵,
Hilkka Soininen⁶,
Iwona Kłoszewska⁷,
Patrizia Mecocci⁸,
Magda Tsolaki⁹,
Bruno Vellas¹⁰,
Dag Aarsland¹¹,
Alejo Nevado-Holgado¹⁵,
Benjamine Liu¹⁵,
Stuart Snowden¹¹,
Petroula Proitsi¹¹,
Nicholas J. Ashton^11,12,13,14,
Abdul Hye¹¹,
Cristina Legido-Quigley¹¹,
Matthew R. Lewis²,
Jeremy K. Nicholson²^nAff16,
Elaine Holmes³^nAff16^nAff17,
Alvis Brazma¹ &
…
Simon Lovestone¹⁵^nAff18

Scientific Reports volume 10, Article number: 21745 (2020) Cite this article

4402 Accesses
26 Citations
19 Altmetric
Metrics details

Subjects

Abstract

Finding early disease markers using non-invasive and widely available methods is essential to develop a successful therapy for Alzheimer’s Disease. Few studies to date have examined urine, the most readily available biofluid. Here we report the largest study to date using comprehensive metabolic phenotyping platforms (NMR spectroscopy and UHPLC-MS) to probe the urinary metabolome in-depth in people with Alzheimer’s Disease and Mild Cognitive Impairment. Feature reduction was performed using metabolomic Quantitative Trait Loci, resulting in the list of metabolites associated with the genetic variants. This approach helps accuracy in identification of disease states and provides a route to a plausible mechanistic link to pathological processes. Using these mQTLs we built a Random Forests model, which not only correctly discriminates between people with Alzheimer’s Disease and age-matched controls, but also between individuals with Mild Cognitive Impairment who were later diagnosed with Alzheimer’s Disease and those who were not. Further annotation of top-ranking metabolic features nominated by the trained model revealed the involvement of cholesterol-derived metabolites and small-molecules that were linked to Alzheimer’s pathology in previous studies.

Bile acids targeted metabolomics and medication classification data in the ADNI1 and ADNIGO/2 cohorts

Article Open access 17 October 2019

Peripheral serum metabolomic profiles inform central cognitive impairment

Article Open access 20 August 2020

Unbiased proteomics and multivariable regularized regression techniques identify SMOC1, NOG, APCS, and NTN1 in an Alzheimer’s disease brain proteomic signature

Article Open access 06 July 2023

Introduction

Unmet medical need and the repeated failure of clinical trials in Alzheimer’s disease (AD) have together resulted in a surge of research seeking to understand disease mechanisms and generate novel therapeutic approaches. In order for such therapies to succeed it is widely accepted that trials will need to be performed early in the disease process^1,2. Currently, the optimal biomarkers used to detect AD processes early in the course of the disease are Positron Emission Tomography (PET) imaging and cerebrospinal fluid (CSF) markers³. However, PET imaging is not universally available and obtaining CSF is a relatively invasive procedure. Progress has been made in the attempt to supplement these relatively specific biomarkers with other biomarkers that might be more applicable to larger populations using, for example, blood biomarkers. Putative markers in blood have been identified using proteomics, transcriptomics, metabolic and lipidomic phenotyping platforms. Efforts are now underway to replicate these and to generate single markers or biomarker panels that might be used as part of a process identifying people with early disease^4,5,6.

There are relatively few studies examining the potential of urine as a biomarker fluid in AD^7,8,9,10, probably because being separated from the brain not only by the blood–brain barrier but also by glomerular filtration, urine seems inherently unlikely to possess a signature of neurodegeneration. Most of the studies reported to date are small both in terms of the numbers of molecular targets and numbers of individuals examined. However, urine is a complex fluid with metabolites that reflect a response to injury¹¹ and oxidative stress¹² amongst other biological events at the systems level and might, therefore, be useful as a target fluid in neurodegeneration and other brain diseases¹³. An added advantage is that urine carries information on the metabolites arising from the gut microbiome¹⁴, an area of research that is gaining greater focus in neurodegeneration and AD^15,16. In mouse models a range of methods to characterize the urine metabolome have been employed, with some success in identifying markers that differ between transgenic animals and controls^17,18,19,20, but these models do not encompass the full systems effect and metabolic progression of AD in humans. Therefore deep exploration of the urinary metabolome for biomarkers relevant to AD may yield valuable mechanistic information. We report here the application of in-depth urinary metabolic phenotyping in a large multi-centre study consisting of four groups of participants. One group consisted of individuals clinically diagnosed with AD. A second group consisted of individuals with no apparent clinical symptoms of cognitive decline or dementia (control group—CTL). The final two groups consisted of individuals diagnosed with Mild Cognitive Impairment (MCI) who either remained cognitively stable throughout the follow-up term of the study (sMCI) or converted to a clinical AD diagnosis at a later study visit (cMCI). All samples that underwent metabolic phenotyping were collected at a single time point (baseline assessment).

Metabolic phenotyping was completed using a complementary dual-platform approach consisting of both proton nuclear magnetic resonance spectroscopy (¹H-NMR) and ultra-high-performance liquid chromatography coupled with mass spectrometry (UHPLC-MS) to ensure comprehensive metabolite coverage. Critically, in order to address the challenge of rich metabolic datasets, where analytical variables far exceed the numbers of samples available to study, we applied a novel approach consisting of metabolomic dataset normalisation followed by a feature reduction process that only selected those metabolites which associated with a metabolic regulatory genetic element—a technique known as Quantitative Trait Loci for metabolites (mQTL). In this way, we identified a panel of metabolites that might accurately classify the participant groups. Of particular interest was the ability of the classification model to differentiate between baseline MCI participants, who later either remained cognitively stable (sMCI) or converted to clinical AD (cMCI) which heralds promise for non-invasive scoping for early biomarkers of AD.

The classification model was used to prioritise metabolites for annotation according to their importance to predict AD. The analysis of the annotated metabolites revealed multiple direct and a few indirect links to the etiopathological processes in AD.

Results

We report here the largest study to date of metabolic phenotyping analysis of urine as a potential marker of AD. Metabolic phenotyping was performed on urine obtained from the AddNeuroMed/Dementia Case Registry (ANM/DCR) cohorts^21,22,23 using two analytical platforms: UHPLC-MS (\({\hbox {n}}=561\) samples) and ¹H-NMR (\({\hbox {n}}=575\) samples). The cohort of participants used in the analysis includes participants with normal cognition (control group—CTL), stable mild cognitive impairment (sMCI), mild cognitive impairment converting to dementia (cMCI) and participants with Alzheimer’s disease (AD) (Table 1). The number of available samples and metabolic features is summarised in Table 2. Using an innovative approach to the challenge of very high dimensionality data analysis, we first prioritise a set of metabolites using mQTL mapping, ensuring that features with a degree of genetic regulation were selected. From this reduced feature selection, we were then able to select metabolites that are associated with AD and were able to predict conversion to dementia from MCI.

Table 1 Overview of study participants.

Full size table

Dimensionality reduction of metabolic features

Given that the number of metabolic features found in the cohort samples was orders of magnitude higher than the number of samples (specifically 55,675 features), as the first step we developed a novel feature reduction method. We utilised the availability of genetic data, in particular, 12 million SNPs obtained in the previous study²⁴ and looked for an association between these SNPs and the metabolic features, performing metabolic quantitative trait locus analysis. We hypothesized that an association between a metabolite and a disease state is more likely to be relevant to an etiopathological mechanism if this metabolite was also associated with a genetic variant previously linked to a relevant disease phenotype.

The mQTL analysis resulted in a total of 1542 individual metabolic features relating to either a chemical shift in the case of ¹H NMR (233 metabolic features) or a chromatographic retention time and mass to charge ratio (m/z) feature in the UHPLC-MS data (1309 features). The resultant metabolic features were associated with 6932 SNPs at a q-value \(< 0.01\) (Table 3, Fig. 1). Of these, 6047 unique SNPs were linked to features from the UHPLC-MS data, 876 SNPs to features from the ¹H NMR data, with 838 SNPs common to both. Given the 12 million SNPs tested, the probability of observing 838 or more SNPs in the intersection between UHPLC-MS and ¹H NMR results by chance is vanishingly small (p-value < 2.23E−308). Previously, a total of 276 metabolomic QTLs have been reported^{25,26,27,28,29,30}, and 83% of these SNPs were reproduced in the current study (Supplementary Materials Table S2), thereby, validating our pipeline.

Table 2 Summary of samples and metabolic features available for the analysis.

Full size table

Table 3 Metabolic QTL mapping results.

Full size table

Table 4 Performance of the final classification model.

Full size table

Ranking of metabolic features for annotation

Metabolite annotation remains the bottleneck and limitation of metabolic phenotyping studies^31,32. After reducing the number of metabolic features to 1542, we aimed to prioritise them according to their relative importance of correctly predicting the AD state of the sample. We first built and tested Random Forests classification models on different combinations of features and diagnostic classes (for details see “Methods”). Briefly, we hypothesised that genomics data in combination with metabolic features and sample covariates could improve the prediction quality and tried combinations of following features: (1) 1542 metabolic features after the mQTL filtering, (2) 6932 SNPs associated with metabolic features and (3) sample covariates—age, sex and study site. Also, to account for the imbalance between the diagnostic classes, with the prevalence of AD and CTL classes over sMCI and cMCI classes, we applied re-sampling. First, we analysed all data in relation to the four original diagnostic categories (AD/CTL/cMCI/sMCI). Then, hypothesising that cMCI would be most similar to AD and sMCI most similar to control, we performed binary over-sampling by creating AD + cMCI and CTL + sMCI groups. Finally, we applied the under-sampling by using AD and CTL groups only. As the model trained on 1542 metabolic features together with sample covariates and only two diagnostic classes (AD and CTL) gave the lowest prediction error, it was selected as the final model (Fig. 2).

Table 5 Annotated metabolites.

Full size table

Table 6 Annotated metabolites with mQTL results, phenotypic traits and literature findings.

Full size table

To further validate the performance of the final model, MCI samples were used. The model proved effective in separating MCI samples from individuals who subsequently converted to dementia (cMCI) from those who remained stable (sMCI), finding that 82.96% of cMCI samples were predicted as AD and 77.78% of sMCI samples as CTL (Fig. 3). This provided additional evidence that our feature reduction method resulted in a meaningful set of metabolites enabling the detection of early AD patients, along with the previous validation of 83% of associated SNPs in AD-related literature. The AUROC value of the model was 0.99 (Table 4 and Fig. 3), showing that the final model was quite robust.

Next, we used the developed Random Forest model for prioritisation of the metabolic features by computing the permutation importance score for each metabolic feature (Supplementary Materials Table S3). This method gave us 235 metabolic features with a score of at least \(10{^{-4}}\). Out of these, 32 features were successfully annotated by spectral interpretation. Since multiple analytical signals can correspond to the same metabolite, the annotation process resulted in a total list of 23 metabolites (Table 5).

Analysis of annotated metabolites

We found three broad groups of annotated metabolites differentially present in the disease state: (1) conjugated metabolites of cholesterol derived compounds, (2) small molecule metabolites and (3) metabolites of exogenous sources.

Following the annotation, we quantified the differences in the levels of assigned metabolites across study groups using ANOVA (Table 5). A heatmap demonstrating inter-annotation correlations is shown in Fig. 4.

Having assigned chemical identity, we then investigated whether the associated SNPs were linked to any previously reported GWAS traits to gain potential insight into the relationship between metabolomic and genomic variations. We mapped any polymorphisms associated with our annotated metabolites to the GWAS Catalog’s SNPs⁴⁸ by genomic regions ans summarised them in Table 6.

We speculate that all endogenous annotated metabolites are linked to AD processes through alteration of DNA methylation, gut microbiota malfunction and possibly altered metabolism of polyamines, cholesterol and sugar (Table 6, Fig. 5). The key observations from these analyses are reported in the sections below.

Cholesterol metabolism

There are three unique metabolites derived from cholesterol metabolism, all in the conjugated form: tauro(cheno)deoxycholic acid, pregnenolone and pregnanediol. These metabolites have higher levels in the AD and cMCI study groups (Table 5).

We found the two hormones, pregnenolone and pregnanediol, conjugated with sulfate and N-acetylglucosamine, both biotransformations were observed in human urine previously⁴⁹.

Annotated tauro(cheno)deoxycholic bile acid (an isomer of taurochenodeoxycholic or taurodeoxycholic acid) is conjugated with an N-acetylglucosamine, known biotransformation prior to bile acid renal excretion⁵⁰. There is an increased level of taurochenodeoxycholic or taurodeoxycholic acid conjugates in AD patients (see “Discussion” section).

Sugar metabolism and gut microbiota

We identified sucrose and galactose sugars with sucrose levels significantly higher in AD study group.

The following annotated metabolites are produced in or derived from gut microbiota processes: N,N,N-trimethyl-l-alanyl-l-proline betaine, 3-aminoisobutyrate, trimethylamine, lysine, butyrylcarnitine, and mentioned above, taurodeoxycholic acid. These metabolites have increased levels in AD patients (Table 5).

DNA methylation and polyamine metabolism

Evidence of DNA methylation in AD patients’ urine is found in the observation of two methylcytidine metabolites: 5-methylcytidine, and 2-O-methylcytidine. Another metabolite that shared the pattern of alterations with two methylated cytidine metabolites was annotated as “unknown nucleoside with adenosyl moiety”. We were not able to identify its structure definitively in this analysis but the mass spectrometric data provided the molecular formula C10H11N5O3, and fragmentation data indicated the presence of adenosyl moiety in the molecule (some details regarding structure elucidation effort are presented in the “Supplementary Document” “Metabolite_Annotation_details.docx”).

Another annotated metabolite is N-acetylisoputreanine-gamma-lactam, a catabolic product of spermidine. This metabolite levels alter not only in AD but also in cMCI and sMCI groups (Table 5).

Exogenous metabolites

Amongst exogenous metabolites, we found two prescribed medications, paracetamol and the calcium channel blocker diltiazem, in addition to the metabolite quinine. All three exogenous metabolites have associations with genetic variants (Table 6) and their levels significantly alter in study groups, as shown in Table 5.

Discussion

Previous studies have reported data suggesting that a panel of metabolites circulating in blood was able to predict incipient AD with very high degrees of accuracy^51,52, raising considerable hopes for finding pre-clinical AD biomarkers in blood. Subsequent studies using a similar, if not identical approach, in multiple, larger cohorts failed to replicate these findings^53,54,55. Other studies have reported metabolic and lipidomic differences in blood from people with disease compared to age-matched controls with various degrees of power, success and outcome^56,57,58,59. However, none of these studies have been unequivocally replicated. One of the limitations of studies with high dimensionality datasets and relatively small numbers of samples is susceptibility to over-fitting. Additionally, because of the inherent problems of heterogeneity in neurodegenerative disease and the diversity of analytical platforms used in metabolic profiling (¹H-NMR, GC-MS, UHPLC-MS, CE-MS etc.), it is perhaps not surprising that there has been relatively little replication of metabolic phenotyping studies that seek biomarkers of disease. Similar problems plagued early genetic studies seeking susceptibility factors, but these have been largely overcome by the introduction of studies based on tens of thousands of individuals. In genetics it was possible to combine data from different cohorts using imputation techniques. This approach is more challenging for metabolic phenotyping approaches, which often utilize heterogeneous technologies and independent assays, the results from which are more difficult to build a comprehensive picture of the metabolic landscape. With increased throughput and lower cost of metabolic assays, such larger studies will become possible in future.

In the absence of studies with large sample size, one approach to combat the limitation of high dimensionality in molecular studies is to reduce the dimensionality of the data. To achieve this, here we used the mQTL approach. In doing so, we provide for the possibility of a degree of validation in other, much larger dataset derived from genetic associations studies, also enabling the inference of a degree of causality when an association is discovered. While this work requires replication, this finding holds promise for biomarkers in urine—arguably the most readily available biomarker fluid. Using this mQTL targeting approach, we show a highly significant association of a relatively small set of 32 metabolic features with AD. A model generated from these features not only accurately predicts the disease state, but more importantly, the same model when applied to samples from participants with the clinical diagnosis of mild cognitive impairment (MCI), distinguishes those subjects that subsequently progress to dementia (cMCI) from those that remain stable (sMCI). Note that, MCI is commonly referred to as a prodromal condition, but in fact, is a highly heterogeneous state defined by impaired cognitive function relative to age-adjusted norms. An easy to perform test to identify people in the prodromal phase of AD, distinguishing the subjects with MCI that are likely to convert to AD from those remaining stable, would be an important advance for the field. Our results suggest that the analysis of urinary metabolites might provide such a test.

Given that the sample size in our study is small, in the absence of a larger replication cohort, it is important to provide as much additional evidence as possible showing that the method we developed is reliable. One such type of evidence comes from the analysis of the annotated metabolites. One set of metabolites that we found is related to cholesterol—a critical biological precursor, required for the biosynthesis of downstream metabolites such as bile acids, hormones and steroids, which are commonly found in their conjugated form in the urine. Hormones and related steroidal structures derived from cholesterol are known to have a role in brain function. Observed in this study pregnenolone sulfate is a known neuro-steroid species^60,61 reported to influence cognition in rodent models⁶² and patient studies, perhaps through its role in modulating gamma-aminobutyric acid subunit A (GABAA) and N-methyl d-aspartate (NMDA) receptors⁶¹. The other hormone we annotated—pregnanediol, a metabolite of pregnenolone, was previously reported to be lower in the urine of older men⁶², however, this has not been linked to AD. Concentrations of non-conjugated bile acids were observed to be altered in AD in human blood and brain samples and in transgenic models of the disease^58,63. A recent study on mouse models discovered that bile acids strongly inhibit the cysteine dioxygenase type-1-mediated (CDO1-mediated) cysteine catabolic pathway resulting in depletion of the free cysteine pool and reduction of the glutathione concentration⁶⁴. Here, we found an increased level of taurochenodeoxycholic or taurodeoxycholic acid conjugates in AD patients. Taurodeoxycholic acid, is the secondary bile acid that was previously observed to be increased in AD patients as the result of hypothesised gut microbiota malfunction⁶⁵. The need for further investigation of cholesterol related metabolites in AD pathology is strongly supported both by previous research and by our study results.

Sugar metabolism was previously implicated in AD pathophysiology linking dysregulation in glucose metabolism and insulin resistance⁵⁶. We found sucrose and galactose sugars to be important for the AD classification. Studies in mice suggest that sucrose disrupts mitochondrial activity and promotes amyloid deposition in the brains of transgenic AD mice⁶⁶. Additionally, treatment of ovariectomised rats using d-galactose leads to AD-like pathology and the development of AD in a rodent model⁶⁷. The researchers reported the observed AD pathology was prevented following injections of 17-\(\upbeta \) estradiol, suggesting a potential interlinked role for disruptions in sugar and sex hormone metabolism, alternations of both are reported in our results. However, it is necessary to highlight that we have very limited known covariates of this retrospective data cohort that was collected ten years ago. Original study participants exclusion criteria included other neurological or psychiatric diseases, significant unstable systemic illness or organ failure and alcohol or substance misuse²¹. That knowledge gives us some assurance that study participants did not have other diagnosed medical conditions like diabetes mellitus type 2 and renal disease.

Evidence of gut microbiota malfunction is a noticeable trait in our findings. We annotated trimethylamine, N,N,N-trimethyl-l-alanyl-l-proline betaine, 3-aminoisobutyrate, l-lysine and butyrylcarnitine metabolites with different levels of concentrations amongst study groups. We detected several metabolites closely linked to betaine, though not betaine directly. Betaine is a source for trimethylamine production and an alternate methyl source for converting homocysteine to methionine⁶⁸, increasing DNA methylation and altering gene expression⁶⁹. We detected trimethylamine, a metabolite released in gut microbiota from trimethylamine-containing dietary phospholipid components such as choline, lecithin, l-carnitine and mentioned above betaine. The oxidation of trimethylamine generates trimethylamine N-oxide that was reported to be elevated in Alzheimer’s patients¹⁵. In addition, observed here N,N,N-trimethyl-l-alanyl-l-proline betaine is a recently discovered plasma biomarker of kidney function. Plausibly this metabolite is a product of betaine and myosin light chain degradation⁷⁰, though this hypothesis has yet to be confirmed. Another detected product of gut microbiota processes is butyrylcarnitine—a product of l-carnitine processing in the human body. Previous research conducted in the mouse brain has shown that in old age, the AD genetic load significantly increase levels of butyrylcarnitine⁷¹. Changes in butyrylcarnitine concentrations together with the observation of elevated levels of lysine in AD patients suggest alterations of carnitine in AD, since lysine is one of the sources for carnitine production in humans. These complex metabolic processes in gut microbiota require further investigation.

We speculate that changes in DNA methylation during AD processes are verified through observed alterations of 5-methylcytidine and 2-O-methylcytidine. These results correlate with recent reports showing significant alterations in 5-methylcytidine in early stages of Alzheimer’s disease⁷². Alterations in the concentrations of N-acetylisoputreanine-gamma-lactam, a catabolic product of spermidine that is formed from N-actetylspermidine⁷³ support a potentially altered metabolism of polyamines in AD: the stimulation of ornithine decarboxylase in the AD process leads to increased levels of N-acetylspermidine and spermidine⁷⁴.

Exogenous metabolites found in this study and their linkage with genomic variants (paracetamol and quinine) or with possible dementia protective effect (diltiazem metabolomic profile) are significant findings for further investigation. Given that the prescription of quinine preparations is recommended only for malaria prophylaxis, this is an unlikely explanation for the quinine finding in this European population of older people. However, its prevalence in dietary sources may explain its presence in the population. This finding, along with the reporting of altered levels of diltiazem and paracetamol in AD is not obviously explicable, although reverse causality, where the disease state is resulting in a change in either prescription or compliance with medication, is an obvious possible explanation of the finding.

While the strengths of the study lie in the mQTL approach and the acquisition of a large amount of metabolic phenotyping data, there are undoubtedly limitations to consider. First, although large compared to previously reported urine studies, the analysis remains vulnerable to over-fitting and bias as the number of features is three times that of the number of samples, even following the mQTL based feature reduction. Clearly, larger and independent datasets are necessary to replicate the findings we report here. This could be achieved using a targeted quantitative mass spectrometry assay, specifically designed to quantitate the metabolic pathways of the key metabolites identified here. This would both help validate the findings and provide greater insight into pathway mechanisms and the networks involved. Secondly, the cohort used here, lacks specific diagnostic biomarkers (such as PET or CSF measures) indicative of pathological load and the diagnostic categorisation rests on experienced clinician assessment together with systematic assessment by a research worker, albeit using a widely tested and proven methodology. Ideally, data would also be collected on diet and life-style factors such as exercise and other environmental exposures together with detailed assessments of co-morbid and prior medical history and medication use. Such exploration of factors beyond diagnostic category that might influence the metabolic profile will be important topics for future studies. Lastly, by focusing on the metabolites with an associated genetic variant, we may exclude metabolites that have a significant association with AD but are largely influenced by the environment, lifestyle and not as strongly by genetics. The further research of exogenous metabolites, such as medicines and dietary compounds, as well as effects of environmental factors on metabolic processes in AD is necessary.

Methods

We confirm that: all experiments were performed in accordance with relevant guidelines and regulations; all methods were carried out in accordance with relevant guidelines and regulations; all experimental protocols were approved by the European Union AddNeuroMed program.

Study participants

The data and biospecimens were collected (AddNeuroMed/Dementia Case Registry (ANM/DCR) cohort) in European, multi-site study, public-private partnership^21,22,23. The cohort includes participants with established AD, MCI and normal cognition who were evaluated using well-established systematic interviews for diagnosis. Formally, the diagnoses are not pathologically confirmed or supported by specific biomarkers of pathology, but it has been shown that this diagnostic method is highly predictive using MRI scans²¹ and subsequent post-mortem classification⁷⁵. In addition, all disease categories, and conversion to dementia from MCI, were diagnosed by an experienced clinician according to criteria as described below. Briefly, the inclusion and exclusion criteria for study groups were as follows. Inclusion criteria for AD study group: (1) ADRDA/NINCDS and DSM-IV criteria for probable Alzheimer’s disease; (2) mini Mental State Examination score ranged from 12 to 28; (3) age 65 years or above. Inclusion criteria for MCI and CTL study groups: (1) mini Mental State Examination score range between 24 and 30; (2) geriatric Depression Scale score less than or equal to 5; (3) age 65 years or above; (4) medication stable; (5) good general health. The distinction between MCI and controls was based on two criteria: (1) subject scores 0 on Clinical Dementia Rating Scale = control; (2) Subject scores 0.5 on Clinical Dementia Rating scale = MCI. For the MCI subjects it was preferable that the subject and informant reported occurrence of memory problems. All AD subjects had a CDR score of 0.5 or above. The distinction between cMCI (converted MCI) and sMCI (stable MCI): based on follow-up interviews and tests. Exclusion criteria (all study groups): (1) significant neurological or psychiatric illness other than AD; (2) significant unstable systematic illness or organ failure; (3) alcohol or substance misuse.

All samples were collected under human participant research protocols, including informed consent containing the clause necessary for allowing bio-specimens and data sharing abroad. Blood and urine were collected at baseline and stored at \(-80^{\circ }{\hbox {C}}\). Genetic data from blood samples were obtained using the Illumina 610-Quad chip in two different batches as previously described²⁴, while the third batch was obtained using HumanOmniExpress 24 v1.1. For this study morning (non-first void) urine samples were collected at baseline assessment and were subjected to metabolic profiling analysis via UHLC-MS and by ¹H NMR spectroscopy (561 and 575 samples respectively) (Table 1).

¹H NMR metabolic phenotyping

Samples were analyzed by ¹H-NMR spectroscopy in-line with previously published standard protocols for the study of human urine samples⁷⁶. In brief, samples were prepared using a Gilson 215 liquid handling robot and transferred to 4 in. length \(\times \) 5 mm outer diameter NMR tubes in batches of 80 patient samples and four QC pooled samples. Racks of 96 prepared NMR tubes were transferred to a refrigerated SampleJet sample handler robot (Bruker Co) working at \(5^{\circ }\hbox {C}\). One dimensional ¹H-NMR general profile and two-dimensional J-resolved (Jres) experiments were acquired on a Bruker Avance III HD 600 spectrometer following the set up previously described⁷⁶. Experiments were acquired and processed in automation using TopSpin 3.2 and ICON NMR. Phasing, baseline correction and calibration to TSP were also carried out in automation after each acquisition. Spectra quality was assessed using an in-house developed bioinformatics tool nPYc⁷⁷ following the quality criteria previously described⁷⁶.

UHPLC-MS metabolic phenotyping

UHPLC-MS analysis of urine samples was performed as previously described⁷⁸ utilizing a combination of reversed-phase chromatography (RPC) and hydrophilic interaction chromatography (HILIC). UHPLC was performed using Waters Acquity UPLC systems (Waters Corp., Milford, MA, USA), coupled to Waters Xevo G2-S QTOF mass spectrometers (Waters Corp., Wilmslow, UK) via Z-spray electrospray ionization (ESI) sources. RPC separations were paired with both positive and negative ion mode detection (generating URPOS and URNEG datasets respectively), while the HILIC separation was paired with positive ion mode detection only (UHPOS). All UHPLC-MS datasets underwent feature extraction using Progenesis QI 2.1 software (Nonlinear Dynamics, Newcastle, UK) as previously described, and feature filtering was performed using previously described quality control materials⁷⁸. Preprocessing including batch and run order correction was performed using an in-house developed bioinformatics tool nPYc⁷⁷. Features were removed from the data sets where their analytical variation, assessed by repeated measurement of a pooled quality control sample (study reference), exceeded 30% (relative standard deviation) or where the analytical variation exceeded the total observed variation among all study samples. Features were also inspected for correlation between their observed intensity and sample dilution within a pooled QC dilution series and features with Pearson correlation coefficients less than 0.7 were removed.

Metabolic QTL mapping

The goal of QTL mapping is to identify associations between genetic markers and phenotypic variation⁷⁹, in the case of metabolic QTL, the genome-wide contribution of individual alleles to a metabolic feature concentration²⁷. The R package MatrixEQTL⁸⁰ was used for QTL mapping by modelling the effect of genotype as additive linear including covariates to account for: age, sex, data collection centre and cohort. The null-hypothesis, “there is no QTL effect for the metabolic feature concentration”, was tested against the alternative, “there is QTL effect between SNP and metabolic feature concentration”, for each pair of metabolic feature and SNP. We corrected results for multiple comparisons by calculating q-value using the Benjamini–Hochberg procedure⁸¹. The number of metabolic features and SNPs found to be associated using stringent q-value threshold 0.01 is presented in Table 3.

Data preparation For mQTL analysis

UHPLC-MS metabolic datasets UHPOS, URPOS and URNEG were normalised with EigenMS method⁸² that removes bias of unknown complexity from this type of metabolomics experiment at the same time preserving known bias, diagnoses in the case of this study. Subsequently, both EigenMS processed UHPLC-MS data, and raw one-dimensional ¹H NMR metabolic data were normalised using the quantile normalisation⁸³ method to make metabolomics data suitable for the QTL mapping.

Imputation and quality control of genomics data

After quality control using PLINK⁸⁴, genomics data from separate batches were assembled and remapped to “hg19-build37” reference genome. The imputation was performed with IMPUTE2 software⁸⁵. Since SNPs with low minor allele frequency are non-informative and have a potential of creating spurious findings, the imputation results were converted into matrix form and filtered using minor allele frequency threshold 10%. The final genomics matrix includes 12,105,785 SNPs. The results of population stratification indicated biases by cohort and data collection centres (Supplementary materials Figures S1 and S2).

Model selection

We used the Random Forests algorithm⁸⁶ to prioritise metabolic features that can potentially be used as Alzheimer’s disease biomarkers. The main focus of this study is on metabolic features. However, there are also 6923 SNPs that metabolic features are associated with and covariates available for the samples: age, gender and data collection centre. To find the best prediction model, we considered the following sets of features:

A.
Metabolic features only;
B.
Metabolic features and SNPs;
C.
Metabolic features, SNPs and covariates;
D.
Metabolic features and covariates.

As discussed earlier, we tested three different ways of classifying diagnostic groups: four original diagnostic categories (AD/CTL/cMCI/sMCI), binary over-sampling AD + cMCI/CTL + sMCI, and binary under-sampling AD/CTL by removing cMCI and sMCI data. We explored different classifications approaches and feature sets by using the R implementation of RF^87,88 to find the best combination. As the comparison criteria, we used the Out-Of-Bag (OOB) errors which is a standard approach for Random Forests. OOB error is a method of measuring the prediction error utilising bootstrap aggregation to subsample data used for training. It helps to avoid the need for an independent validation dataset⁸⁹. Results in the form of OOB errors of the RF models built for the different combinations of feature set and diagnostic groups are shown in Fig. 2. Each model was repeated ten times, using an increasing number of trees per run. The binary classification AD/CTL gives the best OOB errors for all four feature sets: the feature set A—0.0325, B—0.0404, C—0.0363 and D—0.0292, correspondingly. The feature set D, metabolic features and covariates, and binary classification AD/CTL, Alzheimer’s disease and healthy control samples only, after the tuning of RF parameters, gave the OOB predicted error 0.0241. The value of the area under the receiver operating characteristic curve (AUROC)⁹⁰ for the model is 0.99 (Table 4 and Fig. 3). This model is our final RF model used in further analysis.

Tuning of random forests parameters

There are two parameters to tune for the RF algorithm: the number of trees used in the forest—ntree, and the number of variables used in each tree-mtry. We applied the alternating iterative procedure to find the best possible parameters values for ntree and mtry. The results plotted on Supplementary materials Figure S3 shows that with the ntree value equal to 680 the out-of-bag error rate stabilises and reaches its minimum 0.0254 with standard deviation 0.0056. The best mtry value we found is 90 (Supplementary materials Figure S4). It gives minimal OOB error rate 0.0241 with standard deviation equals to 0.0046.

Ranking of metabolic features

We ranked metabolic features using the permutation importance score obtained from the final RF model. This score is based on the idea that if the feature is not essential, then rearranging the values of that variable does not degrade classification accuracy. The list of ranked metabolic features with calculated permutation importance score and added metabolite annotations are presented in the Supplementary materials Table S3.

Metabolite annotation

For metabolite identification features of interest derived from the UHPLC-MS datasets first underwent correlation analysis using an R script developed in-house. This enabled the observation of co-eluting adducts and in-source fragments that are characteristic of metabolites. Further structural data were obtained via the use of high-resolution accurate mass to charge ratio (m/z) values and collision-induced dissociation (CID) fragmentation patterns. CID experiments were completed at a range of collision voltages at 5 V step intervals (5–45 V). The front quadrupole of the QTOF MS system was engaged to select for the feature of interest prior to CID. Chromatographic retention time matching to in-house standards was also completed where the standard was available for purchase. Online databases such as METLIN⁹¹, and the Human Metabolome Database (HMDB)⁹² were also used to assist with metabolite identification. Where available, analytical standards were purchased and spiked into representative samples to increase confidence in the annotation.

Tentative annotation of features of interest derived from ¹H NMR analysis was completed by searching on literature and using in-house databases to match possible patterns of interest with the relevant spectra of standard compounds. The multiplicity of the signals of interest was confirmed using the corresponding Jres spectrum and a cassette of 2D spectra of representative samples including ¹H,¹H-COSY, ¹H,¹H-TOCSY, ¹H,¹³C-HSQC.

Data availability

Genomics data in European Nucleotide Archive PRJNA266531.

Code availability

Github repository.

References

Hampel, H. et al. Perspective on future role of biological markers in clinical therapy trials of Alzheimer’s disease: A long-range point of view beyond 2020. Biochem. Pharmacol. 88, 426–449. https://doi.org/10.1016/j.bcp.2013.11.009 (2014).
Article CAS PubMed Google Scholar
Blennow, K., Hampel, H. & Zetterberg, H. Biomarkers in Amyloid-\(\upbeta \) immunotherapy trials in Alzheimer’s disease. Neuropsychopharmacology 39, 189–201. https://doi.org/10.1038/npp.2013.154 (2014).
Article CAS PubMed Google Scholar
Lista, S. et al. CSF A\(\upbeta \) 1–42 combined with neuroimaging biomarkers in the early detection, diagnosis and prediction of Alzheimer’s disease. Alzheimer’s Dementia 10, 381–392. https://doi.org/10.1016/j.jalz.2013.04.506 (2014).
Article PubMed Google Scholar
O’Bryant, S. E. et al. Blood-based biomarkers in Alzheimer disease: Current state of the science and a novel collaborative paradigm for advancing from discovery to clinic. Alzheimer’s Dementia J. Alzheimer’s Assoc. 13, 45–58. https://doi.org/10.1016/j.jalz.2016.09.014 (2017).
Article Google Scholar
Olsson, B. et al. CSF and blood biomarkers for the diagnosis of Alzheimer’s disease: A systematic review and meta-analysis. Lancet Neurol. 15, 673–684. https://doi.org/10.1016/S1474-4422(16)00070-3 (2016).
Article CAS PubMed Google Scholar
Ashton, N. J. et al. Increased plasma neurofilament light chain concentration correlates with severity of post-mortem neurofibrillary tangle pathology and neurodegeneration. Acta Neuropathol. Commun. 7, 5. https://doi.org/10.1186/s40478-018-0649-3 (2019).
Article PubMed PubMed Central Google Scholar
Wang, C. Combining serum and urine biomarkers in the early diagnosis of mild cognitive impairment that evolves into Alzheimer’s disease in patients with the apolipoprotein E \(\epsilon \) 4 genotype. Biomarkers Biochem. Indicators Exposure Response Suscept. Chem. 20, 84–88. https://doi.org/10.3109/1354750X.2014.994036 (2015).
Article CAS Google Scholar
Ma, L. et al. The level of Alzheimer-associated neuronal thread protein in urine may be an important biomarker of mild cognitive impairment. J. Clin. Neurosci. Off. J. Neurosurg. Soc. Aust. 22, 649–652. https://doi.org/10.1016/j.jocn.2014.10.011 (2015).
Article CAS Google Scholar
Igarashi, K., Yoshida, M., Waragai, M. & Kashiwagi, K. Evaluation of dementia by acrolein, amyloid-\(\upbeta \) and creatinine. Clin. Chim. Acta Int. J. Clin. Chem. 450, 56–63. https://doi.org/10.1016/j.cca.2015.07.017 (2015).
Article CAS Google Scholar
Zengi, O. et al. Urinary 8-hydroxy-2’-deoxyguanosine level and plasma paraoxonase 1 activity with Alzheimer’s disease. Clin. Chem. Lab. Med. 50, 529–534. https://doi.org/10.1515/CCLM.2011.792 (2011).
Article CAS PubMed Google Scholar
Lindsay, A. & Costello, J. T. Realising the potential of urine and saliva as diagnostic tools in sport and exercise medicine. Sports Med. (Auckland, N.Z.) 47, 11–31. https://doi.org/10.1007/s40279-016-0558-1 (2017).
Article Google Scholar
Roszkowski, K. Oxidative DNA damage-the possible use of biomarkers as additional prognostic factors in oncology. Front. Biosci. (Landmark Edn.) 19, 808–817. https://doi.org/10.2741/4248 (2014).
Article CAS Google Scholar
An, M. & Gao, Y. Urinary biomarkers of brain diseases. Genom. Proteom. Bioinform. 13, 345–354. https://doi.org/10.1016/j.gpb.2015.08.005 (2015).
Article Google Scholar
Nicholson, J. K. et al. Host-gut microbiota metabolic interactions. Science 336, 1262–1267. https://doi.org/10.1126/science.1223813 (2012).
Article ADS CAS PubMed Google Scholar
Vogt, N. M. et al. Gut microbiome alterations in Alzheimer’s disease. Sci. Rep. 7, 1–11. https://doi.org/10.1038/s41598-017-13601-y (2017).
Article CAS Google Scholar
Giau, V. V. et al. Gut microbiota and their neuroinflammatory implications in Alzheimer’s disease. Nutrients 10, 1765. https://doi.org/10.3390/nu10111765 (2018).
Article CAS PubMed Central Google Scholar
Kimball, B. A., Wilson, D. A. & Wesson, D. W. Alterations of the volatile metabolome in mouse models of Alzheimer’s disease. Sci. Rep. 6, 19495. https://doi.org/10.1038/srep19495 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Peng, J. et al. Development of isotope labeling liquid chromatography mass spectrometry for mouse urine metabolomics: Quantitative metabolomic study of transgenic mice related to Alzheimer’s disease. J. Proteome Res. 13, 4457–4469. https://doi.org/10.1021/pr500828v (2014).
Article CAS PubMed Google Scholar
Fukuhara, K. et al. NMR-based metabolomics of urine in a mouse model of Alzheimer’s disease: Identification of oxidative stress biomarkers. J. Clin. Biochem. Nutr. 52, 133–138. https://doi.org/10.3164/jcbn.12-118 (2013).
Article CAS PubMed PubMed Central Google Scholar
Yu, J. et al. High-throughput metabolomics for discovering potential metabolite biomarkers and metabolic mechanism from the APPswe/PS1dE9 transgenic model of Alzheimer’s disease. J. Proteome Res. 16, 3219–3228. https://doi.org/10.1021/acs.jproteome.7b00206 (2017).
Article CAS PubMed Google Scholar
Simmons, A. et al. MRI measures of Alzheimer’s disease and the AddNeuroMed study. Ann. N. Y. Acad. Sci. 1180, 47–55. https://doi.org/10.1111/j.1749-6632.2009.05063.x (2009).
Article ADS PubMed Google Scholar
Lovestone, S. et al. AddNeuroMed-the European collaboration for the discovery of novel biomarkers for Alzheimer’s disease. Ann. N. Y. Acad. Sci. 1180, 36–46. https://doi.org/10.1111/j.1749-6632.2009.05064.x (2009).
Article ADS CAS PubMed Google Scholar
Hye, A. et al. Proteome-based plasma biomarkers for Alzheimer’s disease. Brain J. Neurol. 129, 3042–3050. https://doi.org/10.1093/brain/awl279 (2006).
Article CAS Google Scholar
Proitsi, P. et al. Genetic predisposition to increased blood cholesterol and triglyceride lipid levels and risk of Alzheimer disease: A mendelian randomization analysis. PLoS Med. 11, e1001713. https://doi.org/10.1371/journal.pmed.1001713 (2014).
Article CAS PubMed PubMed Central Google Scholar
Robinette, S. L. & Dumas, M. E. Genetic determinants of metabolism in health and disease: From biochemical genetics to genome-wide associations. Genome Med. https://doi.org/10.1186/gm329 (2012).
Article PubMed PubMed Central Google Scholar
Suhre, K. et al. A genome-wide association study of metabolic traits in human urine. Nat. Genet. 43, 565–569. https://doi.org/10.1038/ng.837 (2011).
Article CAS PubMed Google Scholar
Nicholson, G. et al. A genome-wide metabolic QTL analysis in Europeans implicates two loci shaped by recent positive selection. PLoS Genet. 7, e1002270. https://doi.org/10.1371/journal.pgen.1002270 (2011).
Article CAS PubMed PubMed Central Google Scholar
Rueedi, R. et al. Genome-wide association study of metabolic traits reveals novel gene-metabolite-disease links. PLoS Genet. 10, e1004132. https://doi.org/10.1371/journal.pgen.1004132 (2014).
Article CAS PubMed PubMed Central Google Scholar
Raffler, J. et al. Genome-wide association study with targeted and non-targeted NMR metabolomics identifies 15 novel loci of urinary human metabolic individuality. PLoS Genet. 11, e1005487. https://doi.org/10.1371/journal.pgen.1005487 (2015).
Article CAS PubMed PubMed Central Google Scholar
Paterson, A. D. et al. Genome-wide association identifies the ABO blood group as a major locus associated with serum levels of soluble E-selectin. Arteriosclerosis Thromb. Vasc. Biol. 29, 1958–1967. https://doi.org/10.1161/ATVBAHA.109.192971 (2009).
Article CAS Google Scholar
Whiley, L. et al. Systematic isolation and structure elucidation of urinary metabolites optimized for the analytical-scale molecular profiling laboratory. Analyt. Chem. 91, 8873–8882. https://doi.org/10.1021/acs.analchem.9b00241 (2019).
Article CAS Google Scholar
Chaleckis, R., Meister, I., Zhang, P. & Wheelock, C. E. Challenges, progress and promises of metabolite annotation for LC–MS-based metabolomics. Curr. Opin. Biotechnol. 55, 44–50. https://doi.org/10.1016/j.copbio.2018.07.010 (2019).
Article CAS PubMed Google Scholar
Reitz, C. et al. Independent and epistatic effects of variants in VPS10-d receptors on Alzheimer disease risk and processing of the amyloid precursor protein (APP). Transl. Psychiatry 3, e256. https://doi.org/10.1038/tp.2013.13 (2013).
Article CAS PubMed PubMed Central Google Scholar
Reitz, C. The role of intracellular trafficking and the VPS10d receptors in Alzheimer’s disease. Future Neurol. 7, 423–431. https://doi.org/10.2217/fnl.12.31 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lane, R. F. et al. Vps10 family proteins and the retromer complex in aging-related neurodegeneration and diabetes. J. Neurosci. 32, 14080–14086. https://doi.org/10.1523/JNEUROSCI.3359-12.2012 (2012).
Article CAS PubMed PubMed Central Google Scholar
Campion, D., Charbonnier, C. & Nicolas, G. SORL1 genetic variants and Alzheimer disease risk: A literature review and meta-analysis of sequencing data. Acta Neuropathol. 138, 173–186. https://doi.org/10.1007/s00401-019-01991-4 (2019).
Article CAS PubMed Google Scholar
Mez, J. et al. Two novel loci, COBL and SLC10a2, for Alzheimer’s disease in African Americans. Alzheimer’s Dementia J. Alzheimer’s Assoc. 13, 119–129. https://doi.org/10.1016/j.jalz.2016.09.002 (2017).
Article Google Scholar
Estep, J. A., Wong, W., Wong, Y.-C.E., Loui, B. M. & Riccomagno, M. M. The RacGAP \(\upbeta \)-Chimaerin is essential for cerebellar granule cell migration. Sci. Rep.https://doi.org/10.1038/s41598-017-19116-w (2018).
Article PubMed PubMed Central Google Scholar
Borin, M. et al. Rac1 activation links tau hyperphosphorylation and a\(\upbeta \) dysmetabolism in Alzheimer’s disease. Acta Neuropathol. Commun. 6, 61. https://doi.org/10.1186/s40478-018-0567-4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Newton, J. L. et al. Functional capacity is significantly impaired in primary biliary cirrhosis and is related to orthostatic symptoms. Eur. J. Gastroenterol. Hepatol. 23, 566–572. https://doi.org/10.1097/MEG.0b013e3283470256 (2011).
Article PubMed Google Scholar
Newton, J. L. et al. Cognitive impairment in primary biliary cirrhosis: Symptom impact and potential etiology. Hepatology 48, 541–549. https://doi.org/10.1002/hep.22371 (2008).
Article PubMed Google Scholar
Dastgheib, M., Dehpour, A. R., Heidari, M. & Moezi, L. The effects of intra-dorsal hippocampus infusion of pregnenolone sulfate on memory function and hippocampal BDNF mRNA expression of biliary cirrhosis-induced memory impairment in rats. Neuroscience 306, 1–9. https://doi.org/10.1016/j.neuroscience.2015.08.018 (2015).
Article CAS PubMed Google Scholar
Ruiz, J. C. & Bruick, R. K. F-box and leucine-rich repeat protein 5 (FBXL5): Sensing intracellular iron and oxygen. J. Inorg. Biochem. 133, 73–77. https://doi.org/10.1016/j.jinorgbio.2014.01.015 (2014).
Article CAS PubMed PubMed Central Google Scholar
Horowitz, M. P. & Greenamyre, J. T. Mitochondrial iron metabolism and its role in neurodegeneration. J. Alzheimer’s Dis. JAD 20(Suppl 2), S551-568. https://doi.org/10.3233/JAD-2010-100354 (2010).
Article CAS Google Scholar
Miyashita, A. et al. Genes associated with the progression of neurofibrillary tangles in Alzheimer’s disease. Transl. Psychiatry 4, e396. https://doi.org/10.1038/tp.2014.35 (2014).
Article CAS PubMed PubMed Central Google Scholar
Castillo, E. et al. Comparative profiling of cortical gene expression in Alzheimer’s disease patients and mouse models demonstrates a link between amyloidosis and neuroinflammation. Sci. Rep. 7, 17762. https://doi.org/10.1038/s41598-017-17999-3 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Öhrfelt, A. et al. The pre-synaptic vesicle protein synaptotagmin is a novel biomarker for Alzheimer’s disease. Alzheimer’s Res. Ther. 8, 41. https://doi.org/10.1186/s13195-016-0208-8 (2016).
Article CAS Google Scholar
Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012. https://doi.org/10.1093/nar/gky1120 (2019).
Article CAS PubMed Google Scholar
Meng, L. J., Griffiths, W. J. & Sjövall, J. The identification of novel steroid N-acetylglucosaminides in the urine of pregnant women. J. Steroid Biochem. Mol. Biol. 58, 585–598. https://doi.org/10.1016/0960-0760(96)00080-5 (1996).
Article CAS PubMed Google Scholar
Marschall, H. U. et al. Bile acid n-acetylglucosaminidation. in vivo and in vitro evidence for a selective conjugation reaction of 7 beta-hydroxylated bile acids in humans. J. Clin. Invest. 89, 1981–1987. https://doi.org/10.1172/JCI115806. (1992).
Article CAS PubMed PubMed Central Google Scholar
Mapstone, M. et al. Plasma phospholipids identify antecedent memory impairment in older adults. Nat. Med. 20, 415–418. https://doi.org/10.1038/nm.3466 (2014).
Article CAS PubMed PubMed Central Google Scholar
Fiandaca, M. S. et al. Plasma 24-metabolite panel predicts preclinical transition to clinical stages of Alzheimer’s disease. Front. Neurol. https://doi.org/10.3389/fneur.2015.00237 (2015).
Article PubMed PubMed Central Google Scholar
de Leeuw, F. A. et al. Blood-based metabolic signatures in Alzheimer’s disease. Alzheimer’s Dementia Diag. Assess. Disease Monit. 8, 196–207. https://doi.org/10.1016/j.dadm.2017.07.006 (2017).
Article Google Scholar
Voyle, N. et al. Blood metabolite markers of neocortical amyloid-\(\upbeta \) burden: Discovery and enrichment using candidate proteins. Transl. Psychiatry 6, e719–e719. https://doi.org/10.1038/tp.2015.205 (2016).
Article CAS PubMed PubMed Central Google Scholar
Casanova, R. et al. Blood metabolite markers of preclinical Alzheimer’s disease in two longitudinally followed cohorts of older individuals. Alzheimer’s Dementia J. Alzheimer’s Assoc. 12, 815–822. https://doi.org/10.1016/j.jalz.2015.12.008 (2016).
Article Google Scholar
An, Y. et al. Evidence for brain glucose dysregulation in Alzheimer’s disease. Alzheimer’s Dementia 14, 318–329. https://doi.org/10.1016/j.jalz.2017.09.011 (2018).
Article PubMed Google Scholar
Proitsi, P. et al. Association of blood lipids with Alzheimer’s disease: A comprehensive lipidomics analysis. Alzheimer’s Dementia 13, 140–151. https://doi.org/10.1016/j.jalz.2016.08.003 (2017).
Article PubMed Google Scholar
Pan, X. et al. Metabolomic profiling of bile acids in clinical and experimental samples of Alzheimer’s disease. Metabolites 7, 28. https://doi.org/10.3390/metabo7020028 (2017).
Article CAS PubMed Central Google Scholar
Han, X. et al. Metabolomics in early Alzheimer’s disease: Identification of altered plasma sphingolipidome using shotgun lipidomics. PLOS ONE 6, e21643. https://doi.org/10.1371/journal.pone.0021643 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Mayo, W., Le Moal, M. & Abrous, D. N. Pregnenolone sulfate and aging of cognitive functions: Behavioral, neurochemical, and morphological investigations. Hormones Behav. 40, 215–217. https://doi.org/10.1006/hbeh.2001.1677 (2001).
Article CAS Google Scholar
Rajagopal, L., Soni, D. & Meltzer, H. Y. Neurosteroid pregnenolone sulfate, alone, and as augmentation of lurasidone or tandospirone, rescues phencyclidine-induced deficits in cognitive function and social interaction. Behav. Brain Res. 350, 31–43. https://doi.org/10.1016/j.bbr.2018.05.005 (2018).
Article CAS PubMed Google Scholar
Romanoff, L. P., Thomas, A. W. & Baxter, M. N. Effect of age on pregnanediol excretion by men. J. Gerontol. 25, 98–101. https://doi.org/10.1093/geronj/25.2.98 (1970).
Article CAS PubMed Google Scholar
Marksteiner, J., Blasko, I., Kemmler, G., Koal, T. & Humpel, C. Bile acid quantification of 20 plasma metabolites identifies lithocholic acid as a putative biomarker in Alzheimer’s disease. Metabolomics.https://doi.org/10.1007/s11306-017-1297-5 (2018).
Article PubMed Google Scholar
Wang, Y. et al. Bile acids regulate cysteine catabolism and glutathione regeneration to modulate hepatic sensitivity to oxidative injury. JCI Insight. 3, e99676. https://doi.org/10.1172/jci.insight.99676 (2018).
Article PubMed Central Google Scholar
MahmoudianDehkordi, S. et al. Altered bile acid profile associates with cognitive impairment in Alzheimer’s disease-an emerging role for gut microbiome. Alzheimer’s Dementia J. Alzheimer’s Assoc. 15, 76–92. https://doi.org/10.1016/j.jalz.2018.07.217 (2019).
Article Google Scholar
Carvalho, C. et al. Metabolic alterations induced by sucrose intake and Alzheimer’s disease promote similar brain mitochondrial abnormalities. Diabetes 61, 1234–1242. https://doi.org/10.2337/db11-1186 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hua, X. et al. Long-term d-galactose injection combined with ovariectomy serves as a new rodent model for Alzheimer’s disease. Life Sci. 80, 1897–1905. https://doi.org/10.1016/j.lfs.2007.02.030 (2007).
Article CAS PubMed Google Scholar
Joseph, Loscalzo. Lipid metabolism by gut microbes and atherosclerosis. Circ. Res. 109, 127–129. https://doi.org/10.1161/RES.0b013e3182290620 (2011).
Article CAS Google Scholar
Aslibekyan, S. et al. Genome- and CD4+ T-cell methylome-wide association study of circulating trimethylamine-N-oxide in the genetics of lipid lowering drugs and diet network (GOLDN). J. Nutr. Intermediary Metab. 8, 1–7. https://doi.org/10.1016/j.jnim.2017.03.002 (2017).
Article Google Scholar
Velenosi, T. J. et al. Untargeted metabolomics reveals N,N,N-trimethyl-l-alanyl-l-proline betaine (TMAP) as a novel biomarker of kidney function. Sci. Rep. 9, 1–13. https://doi.org/10.1038/s41598-019-42992-3 (2019).
Article CAS Google Scholar
Dong, Y. & Brewer, G. J. Global metabolic shifts in age and Alzheimer’s disease mouse brains pivot at NAD+/NADH redox sites. J. Alzheimer’s Dis. 71, 119–140. https://doi.org/10.3233/JAD-190408 (2019).
Article CAS Google Scholar
Ellison, E. M., Abner, E. L. & Lovell, M. A. Multiregional analysis of global 5-methylcytosine and 5-hydroxymethylcytosine throughout the progression of Alzheimer’s disease. J. Neurochem. 140, 383–394. https://doi.org/10.1111/jnc.13912 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fitzgerald, B. L. et al. Elucidating the structure of N1-acetylisoputreanine: A novel polyamine catabolite in human urine. ACS Omega 2, 3921–3930. https://doi.org/10.1021/acsomega.7b00872 (2017).
Article CAS PubMed PubMed Central Google Scholar
Inoue, K. et al. Metabolic profiling of Alzheimer’s disease brains. Sci. Rep. 3, 1–9. https://doi.org/10.1038/srep02364 (2013).
Article Google Scholar
Foy, C. M. L. et al. Diagnosing Alzheimer’s disease-non-clinicians and computerised algorithms together are as accurate as the best clinical practice. Int. J. Geriatric Psychiatry 22, 1154–1163. https://doi.org/10.1002/gps.1810 (2007).
Article Google Scholar
Dona, A. C. et al. Precision high-throughput proton NMR spectroscopy of human urine, serum, and plasma for large-scale metabolic phenotyping. Analyt. Chem. 86, 9887–9894. https://doi.org/10.1021/ac5025039 (2014).
Article CAS Google Scholar
Sands, C. J. et al. The nPYc-toolbox, a python module for the pre-processing, quality-control and analysis of metabolic profiling datasets. Bioinformatics 35, 5359–5360. https://doi.org/10.1093/bioinformatics/btz566 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lewis, M. R. et al. Development and application of ultra-performance liquid chromatography-TOF MS for precision large scale urinary metabolic phenotyping. Analyt. Chem. 88, 9004–9013. https://doi.org/10.1021/acs.analchem.6b01481 (2016).
Article CAS Google Scholar
Miles, C. M. & Wayne, M. Quantitative trait locus (QTL) analysis. Nat. Educ. 1, (2008).
Shabalin, A. A. Matrix eQTL: Ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358. https://doi.org/10.1093/bioinformatics/bts163 (2012).
Article CAS PubMed PubMed Central Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Series B (Methodol.) 57, 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x (1995).
Article MathSciNet MATH Google Scholar
Karpievitch, Y. V., Nikolic, S. B., Wilson, R., Sharman, J. E. & Edwards, L. M. Metabolomics data normalization with EigenMS. PLoS One 9, e116221. https://doi.org/10.1371/journal.pone.0116221 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Bolstad, B. M., Irizarry, R. A., Astrand, M. & Speed, T. P. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19, 185–193. https://doi.org/10.1093/bioinformatics/19.2.185 (2003).
Article CAS PubMed Google Scholar
Purcell, S. et al. PLINK: A toolset for whole-genome association and population-based linkage analysis. Am. J. Hum. Genet. 81, 559–575. https://doi.org/10.1086/519795 (2007).
Article CAS PubMed PubMed Central Google Scholar
Howie, B., Marchini, J. & Stephens, M. Genotype imputation with thousands of genomes. G3 Genet. Genes Genom. 1, 457–470. https://doi.org/10.1534/g3.111.001198 (2011).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:1010933404324 (2001).
Article MATH Google Scholar
Ho, T. K. Random decision forests. in Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC 278–282. https://doi.org/10.1109/ICDAR.1995.598994 (1995).
Ho, T. K. The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell. 20, 832–844. https://doi.org/10.1109/34.709601 (1998).
Article Google Scholar
Gareth, J., Witten, D., Hastie, T. & Tibshirani, R. Introduction to Statistical Learning (Springer, New York, 2013).
MATH Google Scholar
Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 27, 861–874. https://doi.org/10.1016/j.patrec.2005.10.010 (2006).
Article Google Scholar
Smith, C. et al. METLIN: A metabolite mass spectral database. Therap. Drug Monit. 27, 747–751. https://doi.org/10.1097/01.ftd.0000179845.53213.39 (2005).
Article CAS Google Scholar
Wishart, D. S. et al. HMDB: The human metabolome database. Nucleic Acids Res. 35, D521–D526. https://doi.org/10.1093/nar/gkl923 (2007).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The work was supported by the Innovative Medicines Initiative (IMI) Joint Undertaking under EMIF Grant agreement No. 115372, resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in kind contribution, and by the Medical Research Council and National Institute for Health Research [Grant number MC_PC_12025], and infrastructure support was provided by the National Institute for Health Research (NIHR) Imperial Biomedical Research Centre (BRC). AddNeuroMed was a pilot project for the Innovative Medicines Initiative funded through the EU FP6 programme. The UK Dementia Research Institute (DRI) is an initiative funded by the Medical Council, Alzheimer’s Society and Alzheimer’s Research UK. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health or other funders. We thank Dr Aditya Nori and Laurence G. Bourn from Microsoft Research Lab Cambridge for advising in machine learning algorithms. Hilkka Soininen, Iwona Kloszewska, Patrizia Mecocci, Magda Tsolaki, Bruno Vellas, Dag Aarsland, Simon Lovestone—on behalf of AddNeuroMed consortium.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Jeremy K. Nicholson & Elaine Holmes
Present address: Health Futures Institute, Murdoch University, Perth, WA, 6105, Australia
Elaine Holmes
Present address: The Perron Institute for Neurological and Translational Science, Nedlands, WA, 6009, Australia
Simon Lovestone
Present address: Janssen-Cilag Ltd, High Wycombe, UK

Authors and Affiliations

European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK
Natalja Kurbatova, Manik Garg & Alvis Brazma
Department of Metabolism, Digestion and Reproduction, National Phenome Centre, Imperial College London, Hammersmith Campus, IRDB Building, London, UK
Luke Whiley, Elena Chekmeneva, Beatriz Jiménez, María Gómez-Romero, Jake Pearce, Matthew R. Lewis & Jeremy K. Nicholson
UK Dementia Research Institute, Hammersmith Hospital, Imperial College London, London, W12 0NN, UK
Luke Whiley & Elaine Holmes
Division of Systems Medicine, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK
Torben Kimhofer
IMEC, Leuven, Belgium
Ellie D’Hondt
Department of Neurology, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
Hilkka Soininen
Medical University of Lodz, Lodz, Poland
Iwona Kłoszewska
Institute of Gerontology and Geriatrics, University of Perugia, Perugia, Italy
Patrizia Mecocci
3rd Department of Neurology, Aristotle University, Thessaloniki, Greece
Magda Tsolaki
INSERM U 558, University of Toulouse, Toulouse, France
Bruno Vellas
King’s College London, Institute of Psychiatry, Psychology and Neuroscience, London, UK
Dag Aarsland, Stuart Snowden, Petroula Proitsi, Nicholas J. Ashton, Abdul Hye & Cristina Legido-Quigley
Department of Psychiatry and Neurochemistry, Institute of Neuroscience and Physiology, The Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Nicholas J. Ashton
Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, Sweden
Nicholas J. Ashton
NIHR Biomedical Research Centre for Mental Health and Biomedical Research Unit for Dementia at South London and Maudsley NHS Foundation, London, UK
Nicholas J. Ashton
Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
Alejo Nevado-Holgado, Benjamine Liu & Simon Lovestone

Authors

Natalja Kurbatova
View author publications
You can also search for this author in PubMed Google Scholar
Manik Garg
View author publications
You can also search for this author in PubMed Google Scholar
Luke Whiley
View author publications
You can also search for this author in PubMed Google Scholar
Elena Chekmeneva
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
María Gómez-Romero
View author publications
You can also search for this author in PubMed Google Scholar
Jake Pearce
View author publications
You can also search for this author in PubMed Google Scholar
Torben Kimhofer
View author publications
You can also search for this author in PubMed Google Scholar
Ellie D’Hondt
View author publications
You can also search for this author in PubMed Google Scholar
Hilkka Soininen
View author publications
You can also search for this author in PubMed Google Scholar
Iwona Kłoszewska
View author publications
You can also search for this author in PubMed Google Scholar
Patrizia Mecocci
View author publications
You can also search for this author in PubMed Google Scholar
Magda Tsolaki
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Vellas
View author publications
You can also search for this author in PubMed Google Scholar
Dag Aarsland
View author publications
You can also search for this author in PubMed Google Scholar
Alejo Nevado-Holgado
View author publications
You can also search for this author in PubMed Google Scholar
Benjamine Liu
View author publications
You can also search for this author in PubMed Google Scholar
Stuart Snowden
View author publications
You can also search for this author in PubMed Google Scholar
Petroula Proitsi
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas J. Ashton
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Hye
View author publications
You can also search for this author in PubMed Google Scholar
Cristina Legido-Quigley
View author publications
You can also search for this author in PubMed Google Scholar
Matthew R. Lewis
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy K. Nicholson
View author publications
You can also search for this author in PubMed Google Scholar
Elaine Holmes
View author publications
You can also search for this author in PubMed Google Scholar
Alvis Brazma
View author publications
You can also search for this author in PubMed Google Scholar
Simon Lovestone
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The study was conceived and designed by S.L., E.H. and J.K.N. carried out the metabolic profiling design, N.K. designed the analytical pipeline; experimental NMR profiling assays were carried out by B.J., UPLC-MS analysis was conducted by M.G.R., E.C., L.W. and M.R.L., Statistical analysis of the metabolomic data was carried out by T.K., J.P., A.N.H. and A.H.; Data integration and analysis was undertaken by N.K. M.G. repeated the analysis with updated inputs. Interpretation of the metabolomics data was carried out by E.H., J.K.N., L.W., S.S. and C.L.Q. A literature search and linkage of annotated metabolites to AD pathology was done by N.K. in conjunction with S.L. The manuscript was written by N.K. and S.L. in conjunction with A.B., M.G., L.W. All authors reviewed the manuscript.

Corresponding author

Correspondence to Natalja Kurbatova.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kurbatova, N., Garg, M., Whiley, L. et al. Urinary metabolic phenotyping for Alzheimer’s disease. Sci Rep 10, 21745 (2020). https://doi.org/10.1038/s41598-020-78031-9

Download citation

Received: 09 July 2020
Accepted: 18 November 2020
Published: 10 December 2020
DOI: https://doi.org/10.1038/s41598-020-78031-9

This article is cited by

Non-invasive Biomarkers for Early Detection of Alzheimer’s Disease: a New-Age Perspective
- Niyamat M. A. Chimthanawala
- Akash Haria
- Sadhana Sathaye
Molecular Neurobiology (2024)
Differential DNA methylation associated with delayed cerebral ischemia after aneurysmal subarachnoid hemorrhage: a systematic review
- Tomasz Klepinowski
- Bartłomiej Pala
- Leszek Sagan
Neurosurgical Review (2024)
Small molecule metabolites: discovery of biomarkers and therapeutic targets
- Shi Qiu
- Ying Cai
- Aihua Zhang
Signal Transduction and Targeted Therapy (2023)
Microbial-derived metabolites as a risk factor of age-related cognitive decline and dementia
- Emily Connell
- Gwenaelle Le Gall
- David Vauzour
Molecular Neurodegeneration (2022)
Urinary metabolomic changes and microbiotic alterations in presenilin1/2 conditional double knockout mice
- Jie Gao
- Nian Zhou
- Ying Xu
Journal of Translational Medicine (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Dimensionality reduction of metabolic features

Ranking of metabolic features for annotation

Analysis of annotated metabolites

Cholesterol metabolism

Sugar metabolism and gut microbiota

DNA methylation and polyamine metabolism

Exogenous metabolites

Discussion

Methods

Study participants

1H NMR metabolic phenotyping

UHPLC-MS metabolic phenotyping

Metabolic QTL mapping

Data preparation For mQTL analysis

Imputation and quality control of genomics data

Model selection

Tuning of random forests parameters

Ranking of metabolic features

Metabolite annotation

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links

¹H NMR metabolic phenotyping