Abstract
Early diagnosis of esophageal cancer (EC) is extremely challenging. The study presented herein aimed to assess whether urinary volatile organic compounds (VOCs) may be emerging diagnostic biomarkers for EC. Urine samples were collected from EC patients and healthy controls (HCs). Gas chromatography-ion mobility spectrometry (GC-IMS) was next utilised for volatile organic compound detection and predictive models were constructed using machine learning algorithms. ROC curve analysis indicated that an 8-VOCs based machine learning model could aid the diagnosis of EC, with the Random Forests having a maximum AUC of 0.874 and sensitivities and specificities of 84.2% and 90.6%, respectively. Urine VOC analysis aids in the diagnosis of EC.
Similar content being viewed by others
Introduction
Esophageal cancer (EC) is a malignant tumor originating from the epithelium of the esophagus. EC is mainly classified into squamous cell carcinoma and adenocarcinoma dependent on the type of pathology1. In 2020, there were an estimated 604,000 new cases and 544,000 deaths associated with EC worldwide2. In China, EC has the sixth highest incidence rate and the fifth highest mortality rate3. Although there has been a steady reduction in incidence in recent years, the overall disease burden is still high due to the high risk of EC to human health. According to statistics, over 90% of the Chinese population has squamous epithelial carcinoma, making it difficult to draw conclusions from research findings on adenocarcinoma in Western countries. In the early stages of its development, EC mainly presents as a choking sensation when swallowing food, a foreign body sensation, or difficulty in swallowing. As such, EC is often overlooked. Currently, the treatment of EC is a combination of surgery, radiotherapy and chemotherapy, but due to late diagnosis, most patients cannot undergo surgical excision4. Tumor markers play a significant role in monitoring and treating tumors. However, common biomarkers, such as CEA, CA 19–9 and CA 125, have low sensitivity and specificity5,6,7. Moreover, false positives can have bee reported due to nonspecific elevations in distinct digestive tract disorders8,9,10,11. The described issues often result in a poor prognosis for EC, with a 5-year survival rate of only 15–25%12. Thus, developing new biomarkers for EC represent an urgent unmet medical need.
Volatile organic compounds (VOCs) are an important component of human metabolites and can be detected by headspace analysis. As the occurrence of VOCs is primarily associated with oxidative stress, inflammation and changes in cellular metabolism, VOCs are considered to be a systemic and local biomarker that can provide unique information about ongoing biochemical processes and thus an individual’s health status13,14,15. Exploring changes in human VOCs has been reported to be useful in the diagnosis of tumors. VOCs detected in urine can be used in combination to construct an effective diagnostic model16,17,18,19,20,21,22 (Supplementary Table S1). Similarly, studies investigating diagnostic models based on VOCs detected in exhaled breath have been reported in lung cancer23. VOCs in bile and other samples have further been demonstrated to be useful for the diagnosis of tumors24.
The aim of the present study was to identify differential VOCs in patients with EC compared to healthy controls (HCs) and to subsequently develop a volatile biomarker model that could aid in the early diagnosis of EC. To our knowledge, we have revealed for the first time that VOC levels detected by using Gas chromatography-ion mobility spectrometry (GC-IMS) can be utilised as noninvasive biomarkers for the diagnosis of EC.
Results
Participant characteristics
The overall study design is shown in Fig. 1 In the discovery study, the headspace outputs from 241 urine samples were analyzed after 23 patients were excluded prior to surgery. Another 19 patients were excluded after post-operative pathological confirmation. In addition, 37 HCs were excluded; the specific reasons for these exclusions are shown in Supplementary Fig. S1A. Finally, sample data from 162 (EC = 87, HC = 75) patients were included in the analysis for collation (Table 1). Similarly, in the validation study (Supplementary Fig. S1B), the headspace outputs of 125 urine samples were initially analyzed. Nineteen patients were excluded before the surgery and an additional 14 patients were excluded after postoperative pathological confirmation. Furthermore, 22 HCs were excluded. Finally, data from 70 patient samples (EC = 38, HC = 32) were included in the analysis (Table 1). The final analyzed cohort is illustrated in Supplementary Fig. S1C.
VOC profile analysis in EC and HC patients
Similar to our previous studies, substances were characterized by the retention index of the molecule and the drift time of its ions before being quantified by the intensity of the signal peaks24. Therefore, for each sample to be measured, three-dimensional data (retention index, drift time, peak strength) was obtained (Fig. 2A). The data was analyzed by comparing the mean of a 2D spectrogram (top view of a 3D spectrogram, with colors indicating peak intensity), where each “dot” (Fig. 2B) was a signal peak. The 2D coordinates of the signal peak location were retrieved (retention index × drift time) to characterize the compound, and the integration region was boxed to integrate the signal peak to obtain the peak height. Thirty-seven VOC peaks were selected based on retention indices and drift times (Supplementary Table S2). See Supplementary Table S3 for specific peak height values.
Diagnostic performance of urine VOCs with machine learning algorithms
We used a machine learning approach to analyze heterogeneous VOC signals in EC patients and HCs. Combined with the peak height of the above VOCs, four popular machine learning algorithms (random forests (RF), neural network (NN), support vector machines (SVM), decision trees (DT)) were used to construct diagnostic models. Table 2 demonstrates the results of receiver operating characteristic (ROC) analysis of the four models from the urine validation study. The RF model had the highest area under the curve (AUC) of 0.863, with a sensitivity of 78.9% and a specificity of 93.8%. Note: The F1 value is the reconciled mean of the check accuracy and the check completeness rates, and measures the balance of precision and recall of the model. F1 = (2Precision × Recall)/(Precision + Recall).
Estimation of importance of the urinary VOCs using Random forests analysis
The results in Table 2 are obtained from the four machine learning models based on the analysis of all 37 VOCs. The RF algorithm showed the most superior results, so we used RF for the final model construction in the subsequent analysis. Through RF model analysis, the top eight VOCs with Gini coefficients were further analyzed (Fig. 3A). Detailed importance ranking of 37 VOCs are shown in Supplementary Fig. S2. Compared to HCs, five that were up-regulated (2,3-Butandiol, 2-Acetylfuran, Dimethyl trisulfide, 2-Methyl-butanoic acid methyl ester, Methyl decanoate) and three that were down-regulated ((E)-Ethyl-2-hexenoate, 2-Isopropyl-3-methoxy pyrazine, Cyclohexanone-D) (Fig. 3B). In combination with these eight VOCs, we used RF to build a new diagnostic model. The AUC area of 8-VOCs model reached 0.874 (Fig. 4A). In addition, HCs and four EC staging groups were compared using the 8-VOCs model, a diagnostic model that can well distinguish the different staging groups from controls (Fig. 4B and Supplementary Table S4).
Urine biomarkers and pathological parameters
As shown in Table 1, in the discovery study, the majority of patients exhibited squamous carcinomas (96.6%) while a minority presented with adenocarcinomas (3.4%). The proportions were 51.8% and 48.2% for early (I + II) and advanced (III + IV) stages, respectively. As clinical stage plays an extremely important role in the prognosis of patients with EC, we compared the levels of these eight VOCs in controls and patients with different stages. Controls had higher levels of (E)-Ethyl-2-hexenoate, 2-Isopropyl-3-methoxy pyrazine and lower levels of 2-Acetylfuran when compared to stage I, II, III EC patients. 2-Methyl-butanoic acid methyl ester was significantly higher in stages III and IV than in controls, whereas no significant differences were observed in stages I and II (Supplementary Table S5). Regarding comparisons of VOCs among different cancer stages (Supplementary Fig. S3), 2-Isopropyl-3-methoxy pyrazine provided a significantly higher peak height in patients at stage IV, but the difference in peak height between stages for the other VOCs was not statistically significant.
Correlation analysis of VOCs in urine
Correlation analyses demonstrated a significant differential enrichment of metabolically generated VOCs in urine when comparing levels in the EC and HC populations (Supplementary Fig. S4). These data suggest a different pattern of VOC associations between the two groups.
Discussion
In the present study, we explored the measurement of urinary VOCs for the early diagnosis of EC and subsequent data analysis resulted in several meaningful findings. Firstly, GC-IMS was efficient in differentiating between patients with EC and healthy subjects by the detection of VOCs. Secondly, a diagnostic model was constructed by machine learning to clarify the diagnostic value of VOCs in urine. Thirdly, eight different VOCs were identified in urine that may play a relevant role in the diagnosis of EC.
Although research regarding VOCs in EC metabolites has been ongoing for many years, there are currently no biomarkers for VOCs that can be utilized in clinical screening. Previous studies of VOCs in human metabolites in patients diagnosed with EC versus non-cancerous subjects have differed in many aspects25,26,27,28,29. Broadly, all relevant studies have achieved a diagnostic accuracy of over 80%, and strikingly, a diagnostic model AUC area of 0.97 was achieved in a study exploring exhaled breath VOCs in EC. The primary categories of differential VOCs reported so far are aldehydes, ketones, acids and phenols, however, differences in the specific compound categories remains an issue. The platforms most frequently used in previous studies were selected ion flow tube mass spectrometry (SIFT-MS) and proton transfer reaction mass spectrometer (PTR-MS). In this regard, an innovative method employing GC-IMS assays in our study led to similar results. Previously, teams have used GC-IMS to detect and characterise differences in VOCs between pancreatic, colorectal and liver cancer patients and their non-cancer control populations16,30,31. Of course, variation in results is present when comparing different metabolic platforms, sample types, and methodological procedures. These aspects will require refinement in the future by more extensive clinical studies in which sufficient sample data are collected, and by the development of standardized sample handling protocols.
A variety of metabolite VOCs have been extensively studied in previous reports. Exhaled breath is one of the earlier body fluid samples to be used for VOCs detection due to its ease of collection and other advantages. Kumar et al. clearly differentiated EC from patients with benign gastrointestinal disorders based on exhaled gas VOCs27,32. In addition, through a multicentre validation study, Markar et al. constructed a diagnostic model for 5-VOCs based on the composition of butyric acid, valeric acid, hexanoic acid, butyraldehyde, and decanal by analysing exhaled breath samples from esophagogastric cancer patients and control patients using SIFT-MS28. The exploration of urinary VOCs has mainly focused on urological tumors such as prostate cancer and bladder cancer. Gao et al. constructed an 11-VOCs model for differentiation between prostate cancer and non-cancer patients based on their urinary VOCs by GC–MS assay. Gao et al. constructed an 11-VOCs model for differentiation between prostate cancer and non-cancer patients based on their urinary VOCs by GC–MS assay17. Similarly, Tyagi's team examined urinary VOCs in patients with prostate and bladder cancer and, in addition to finding differences between the tumor and control groups, the VOCs were similarly different between patients with the two types of tumors22. There are also studies on bile, faeces and other VOCs, which have established new methods for the diagnosis of cholangiocarcinoma, pancreatic cancer and colorectal cancer, respectively33,34,35,36.
Unlike in previous studies, we first screened for potential diagnostic VOCs based on differences identified between the urine of cancer patients and of healthy patients. Four diagnostic models were used for the analysis. To avoid overfitting of the model, the feasibility of the model was subsequently confirmed by external validation. Afterwards, we selected the RF with the best performance as the diagnostic model for analysis, and finally selected eight VOCs for model construction. The effects of patient diet, medication and serious complications were overcome by stratification and exclusion criteria during the sample collection process. The effects of air exposure were limited by reducing the time taken for samples to travel from the ward to the laboratory. Storage times prior to final VOC testing were kept consistent to avoid differences in long-term storage loss at low temperatures. Distinct to previous studies, herein we have collated a urine metabolomics database consisting of 125 patients with EC and 107 HCs. To our knowledge, we have obtained the largest number of samples for the study of urinary VOCs in patients with EC.
Machine learning has been widely used in the medical field, and several medical-related algorithms are briefly described below37. Random forest is an idea of integrated learning, where data obtained by random sampling is fed into numerous weak learners and voted on to obtain the final output. Neural Networks, also known as Artificial Neural Networks, are algorithms that attempt to mimic the concepts of how the human brain interprets and draws conclusions from information using multiple layers of computation. The learning process of NNs can be supervised or unsupervised, and are essentially mathematical models designed to deal with complex and disparate information. Support Vector Machine is a supervised learning algorithm widely used in classification and regression tasks. The core idea of SVM is to find a hyperplane that maximises the spacing between two classes to achieve good classification. The decision tree algorithm is a method for approximating the value of a discrete function. It is a typical classification method, where the data is first processed, an inductive algorithm is used to generate readable rules and a decision tree, and then the new data is analysed using the decisions. Essentially decision tree is the process of classifying data by a set of rules.
Endogenous VOCs are primarily produced by human cells in the process of energy metabolism and oxidative stress. VOCs eventually reach alveolar cells, renal epithelial cells and intestinal epithelial cells via the blood to be excreted, and as such they directly reflect changes in human metabolism. Many studies have confirmed the validity of analysis of VOC in exhaled breath for tumor diagnosis38,39. However, VOCs in exhaled breath are easily influenced by other airborne substances, leading to confounding of analysis. The choice of urine as study samples is likely to avoid interference from external factors and to yield reliable conclusions.
For the known VOCs detected, we were able to explore their relationship with tumors. The model constructed on the basis of the eight VOCs could distinguish them well in the urine of patients when comparing between the early and late stages of tumor development. These findings may suggest that certain VOCs are associated with the proliferative migration of tumors.
Of the VOCs associated with EC discovered in this study, some have been demonstrated to have potential as diagnostic biomarkers for other diseases. For instance, 2-Acetylfuran was significantly reduced in plasma extracellular vesicles in Severe acute pancreatitis (SAP) patients40, and dimethyl trisulfide in exhaled breath and 4-Heptanone in urine samples are significantly different from breast cancer patients than from non-cancer patients41,42. Moreover, 2-Butanone has the potential to be a biomarker for tumors associated with breast43 and hepatocellular carcinoma30 cancers. In addition, levels of 1-Octen-3-ol and (E)-2-nonenal have been shown to be significantly altered in saliva samples from oral cancer patients compared to the non-cancer patient group44. Meanwhile, cyclohexanone has been shown to be differentially expressed in bile samples from gallbladder cancer patients24 and in exhaled breath samples from breast45 and colorectal cancer patients46.
There is a growing interest in investigation of metabolite VOCs in oncology patients. Metabolites such as: urine, bile, exhaled breath, serum and feces have been shown to have potential diagnostic value as tumor identifiers. Unfortunately, research on the mechanisms of production of metabolite VOCs is extremely limited. Metabolic regulation plays a key role in cancer adaptation to oxidative stress47, and tumor cells have been reported to promote cancer development and progression through the readjustment of metabolic processes. The relationship between VOCs in urine and oxidative stress has been demonstrated15. As such, dimethyl trisulfide has been reported to be associated with oxidative stress48. For substance-specific changes, tumor cells are usually accompanied by elevated aldehyde dehydrogenase (ALDH) activity49, and ALDHs are known to catalyze the oxidation of exogenous and endogenous aldehyde substrates to their corresponding carboxylic acids50. These findings explain the decrease in aldehyde levels in the EC patients in our study. Moreover, changes in ketone levels in metabolites may be associated with altered lipid metabolism in tumor cells51.
Currently, GC-IMS is not widely used for the detection of metabolomic markers and as such, the differential VOCs we identified in this study may not be reproducible by other detection platforms. Those platforms for detecting oncology VOCs, such as Gas chromatography-mass spectrometry (GC–MS), SIFT-MS, electronic-nose (eNose), etc. do have different advantages and disadvantages. For example, GC–MS can only be used for the separation and identification of low molecular weight (approx. 50–600 Da) and volatile compounds52. For the detection of polar, non-thermal and non-volatile metabolites, the use of chemical derivatization is required prior to analysis. The derivatization reaction utilised to produce volatile compounds increases the sensitivity and accuracy of the assay, enabling efficient analysis of hundreds of compounds53. Moreover, SIFT-MS facilitates real-time measurements, and while the instrument is less expensive to maintain and does not require a specialist to operate, it does separate fewer VOCs and consequently provides less information about VOC components54. The eNose assay is simple to operate, faster and cheaper, but is susceptible to interference from environmental factors and cannot separate single VOCs components55. Compared to the above detection platforms, GC-IMS not only has a high ability to separate complex components, but the ultra-sensitivity of ion mobility spectrometry allows it to detect very small concentrations of VOCs, while the samples do not need to be enriched and concentrated, maintaining their true flavour and making it suitable for rapid detection of large numbers of samples. In addition, the detection time of GC-IMS is considerably shorter and the accuracy of the results is significantly improved compared to other methods56.
Metabolomic testing in urine has proven to be an effective method for identifying biomarkers in EC. In the study presented herein, GC-IMS was employed to identify 8 urinary VOCs associated with EC that could accurately distinguish patients with EC from healthy individuals. This study provides an experimental basis for the application of VOC analysis in EC, allowing it to be used in the diagnosis of EC, which has extremely promising clinical applications.
Limitations of the study
The limitation of this study is that the metabolic pathways of some metabolites in the results have not been identified. Meanwhile, the metabolic relationship between the screened markers and esophageal cancer and their future value for clinical diagnosis still need to be further explored. In addition, the cohort size of the study needs to be further expanded and multi-centre validation needs to be added.
Methods
Study population
In the discovery study, a total of 162 patients, including 87 EC patients and 75 HCs, were enrolled in Qilu Hospital of Shandong University from October 2021 to June 2022. Urine samples were collected from all patients before they were tested using GC-IMS to identify candidate VOCs prior to the construction of diagnostic models. The validation cohort consisted of 38 EC patients and 32 HCs enrolled at Qilu Hospital of Shandong University to evaluate the diagnostic model. Inclusion criteria were set as follows: (1) No patients had a history of malignancy, previous radiotherapy or other treatment. (2) Patients could provide fresh urine samples and complete medical records. (3) Patients had undergone radical resection and were pathologically examined for EC. Simultaneously, healthy individuals took physical examinations in our hospital and were selected to serve as the HC group. All individuals in the control group had normal urine, liver and kidney function indicators. No tumors or other major diseases were identified in individuals in the HC group. The study was carried out in accordance with the Declaration of Helsinki. The Ethics Committee of Shandong University's Qilu Hospital approved this study, and each participant signed an informed consent form.
Ultimately, 162 urine samples were obtained for the discovery study (EC = 87, HC = 75), while 70 urine samples were collected for the validation study (EC = 38, HC = 32).
Sample preparation
The standards were purchased from Sinopharm Chemical Reagent Co., Ltd. for characterisation (including esters, acids, ketones, aldehydes, alkanes, ethers, alcohols, aromatic compounds and other compounds); the chromatographic column model was MXT-WAX, 15 m long, 0.53 mm ID, 1um film thickness, purchased from Restek, USA; and the GC-IMS equipment was purchased from GAS, Dortmund, Germany; High purity nitrogen, purity 99.999%.
Fasting urine samples were obtained from all subjects. Urine samples were collected in standard universal sterile specimen containers and frozen at -80 °C within 3 h. No chemicals were added to the urine prior to freezing. The urine was thawed in a laboratory refrigerator at 4 °C prior to analysis and 2 ml was added to a 20 mL glass sample bottle with a pressurised cap.
Analysis of the VOCs
VOCs were measured using GC-IMS (“FlavorSpec” brand, Dortmund, Germany). GC-IMS pre-separates complex VOC fractions in urine by GC, which is subsequently tandem with IMS, and then achieves secondary separation based on the mass of the ion to be measured and the one-dimensional collision cross-sectional area. Two-dimensional characterisation can be performed based on retention indices from GC and drift times from IMS, and quantification based on signal response intensity. For GC columns we use the strongly polarised column MXT-WAX. All samples were treated homogenously. Orthogonal experiments were performed to explore the experimental parameters: the optimal sample volume was derived to be 2 mL, incubated at 100 °C for 5 min, and injected with 1000 μL. Firstly, 2 mL of urine was placed in each headspace vial and incubated at 100 °C for 5 min. Subsequently, 1000 μL of gas was extracted from the headspace vial for analysis. Nitrogen was used as the carrier gas. IMS drift gas was maintained at 150 mL/min with the following carrier gas gradients: 0 min: 2 mL/min; 1 min: 2 mL/min; 8 min: 100 mL/min; 10 min: 150 mL/min; 15 min: 150 mL/min. Additional primary parameters were as follows: T1 drift tube temperature: 45 °C; T2 gas chromatography column temperature: 80 °C; T3 inlet temperature: 80 °C; T4 connection line 1: 80 °C and T5 connection line 2: 45 °C.
Statistical analysis
The GC retention indices and IMS relative drift times were cross-characterised from the NIST database and the drift time data from the self-built IMS database. The peak positions of the compounds were confirmed by comparison with the peak positions of the standards and the retention indices and drift times of the compounds to be tested had to be consistent with the data of the standards at the same time. All compounds were named from the purchased NIST library of spectra.
Continuous data were analyzed using the Mann–Whitney U-test. The R programme (× 64 4.2.0) and the “corrplot” package was used for correlation analysis. The Mann–Whitney U test was used to compare the levels of each VOC. The AUC was calculated on the ROC curves using MedCalc 9.3.9.0. A tenfold cross-validation was used in building the four diagnostic models and was used to select the optimal model parameters. Four machine learning models (RF, NN, SVM, DT) were applied to model construction. RF algorithm evaluated by Gini coefficient. NN uses the stochastic gradient descent (SGD) optimisation algorithm. SVM uses a nonlinear kernel function. The C4.5 algorithm was used for the DT. The error bars in scatter plots represent the median and interquartile range.
Ethics declarations
The studies involving human participants were reviewed and approved by the Ethics Committee of Qilu Hospital of Shandong University. The study was carried out in accordance with the Declaration of Helsinki. The patients/participants provided their written informed consent to participate in this study.
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Rice, T. W., Ishwaran, H., Ferguson, M. K., Blackstone, E. H. & Goldstraw, P. Cancer of the esophagus and esophagogastric junction: An eighth edition staging primer. J. Thorac. Oncol. 12, 36–42. https://doi.org/10.1016/j.jtho.2016.10.016 (2017).
Sung, H. et al. Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA 71, 209–249. https://doi.org/10.3322/caac.21660 (2021).
Zheng, R. et al. Cancer incidence and mortality in China, 2016. J. Natl. Cancer Center 2, 1–9. https://doi.org/10.1016/j.jncc.2022.02.002 (2022).
Smyth, E. C. et al. Oesophageal cancer. Nat. Rev. Dis. Primers 3, 17048. https://doi.org/10.1038/nrdp.2017.48 (2017).
Acharya, A., Markar, S. R., Matar, M., Ni, M. & Hanna, G. B. Use of tumor markers in gastrointestinal cancers: Surgeon perceptions and cost-benefit trade-off analysis. Ann. Surg. Oncol. 24, 1165–1173. https://doi.org/10.1245/s10434-016-5717-y (2017).
Kosugi, S. et al. Clinical significance of serum carcinoembryonic antigen, carbohydrate antigen 19–9, and squamous cell carcinoma antigen levels in esophageal cancer patients. World J. Surg. 28, 680–685. https://doi.org/10.1007/s00268-004-6865-y (2004).
Mealy, K. et al. Tumour marker detection in oesophageal carcinoma. Eur. J. Surg. Oncol. 22, 505–507. https://doi.org/10.1016/s0748-7983(96)92998-4 (1996).
Feng, F. et al. Diagnostic and prognostic value of cea, ca19-9, afp and ca125 for early gastric cancer. BMC Cancer 17, 737. https://doi.org/10.1186/s12885-017-3738-y (2017).
Gao, Y. et al. Evaluation of serum cea, ca19-9, ca72-4, ca125 and ferritin as diagnostic markers and factors of clinical parameters for colorectal cancer. Sci. Rep. 8, 2732. https://doi.org/10.1038/s41598-018-21048-y (2018).
Luo, G. et al. Roles of ca19-9 in pancreatic cancer: Biomarker, predictor and promoter. Biochim. Biophys. Acta 1875, 188409. https://doi.org/10.1016/j.bbcan.2020.188409 (2021).
Xu, X., Xiao, Y., Hong, B., Hao, B. & Qian, Y. Combined detection of ca19-9 and b7–h4 in the diagnosis and prognosis of pancreatic cancer. Cancer Biomark. 25, 251–257. https://doi.org/10.3233/CBM-190067 (2019).
Pennathur, A., Gibson, M. K., Jobe, B. A. & Luketich, J. D. Oesophageal carcinoma. Lancet 381, 400–412. https://doi.org/10.1016/S0140-6736(12)60643-6 (2013).
Janssens, E. et al. Headspace volatile organic compound profiling of pleural mesothelioma and lung cancer cell lines as translational bridge for breath research. Front. Oncol. 12, 851785. https://doi.org/10.3389/fonc.2022.851785 (2022).
Kwon, J. W., Park, H. W., Kim, W. J., Kim, M. G. & Lee, S. J. Exposure to volatile organic compounds and airway inflammation. Environ. Health 17, 65. https://doi.org/10.1186/s12940-018-0410-1 (2018).
Pal, V. K., Li, A. J., Zhu, H. & Kannan, K. Diurnal variability in urinary volatile organic compound metabolites and its association with oxidative stress biomarkers. Sci. Total Environ. 818, 151704. https://doi.org/10.1016/j.scitotenv.2021.151704 (2022).
Daulton, E. et al. Volatile organic compounds (vocs) for the non-invasive detection of pancreatic cancer from urine. Talanta 221, 121604. https://doi.org/10.1016/j.talanta.2020.121604 (2021).
Gao, Q. et al. Application of urinary volatile organic compounds (vocs) for the diagnosis of prostate cancer. Clin. Genitourinary Cancer 17, 183–190. https://doi.org/10.1016/j.clgc.2019.02.003 (2019).
Khalid, T. et al. Urinary volatile organic compounds for the detection of prostate cancer. PLoS ONE 10, e0143283. https://doi.org/10.1371/journal.pone.0143283 (2015).
Liu, Q. et al. Volatile organic compounds for early detection of prostate cancer from urine. Heliyon 9, e16686. https://doi.org/10.1016/j.heliyon.2023.e16686 (2023).
Navaneethan, U. et al. Volatile organic compounds in urine for noninvasive diagnosis of malignant biliary strictures: A pilot study. Dig. Dis. Sci. 60, 2150–2157. https://doi.org/10.1007/s10620-015-3596-x (2015).
Nissinen, S. I. et al. Detection of pancreatic cancer by urine volatile organic compound analysis. Anticancer Res. 39, 73–79. https://doi.org/10.21873/anticanres.13081 (2019).
Tyagi, H., Daulton, E., Bannaga, A. S., Arasaradnam, R. P. & Covington, J. A. Urinary volatiles and chemical characterisation for the non-invasive detection of prostate and bladder cancers. Biosensors. https://doi.org/10.3390/bios11110437 (2021).
Wang, P. et al. Identification of lung cancer breath biomarkers based on perioperative breathomics testing: A prospective observational study. EClinicalMedicine 47, 101384. https://doi.org/10.1016/j.eclinm.2022.101384 (2022).
Zhang, X. et al. A panel of bile volatile organic compounds servers as a potential diagnostic biomarker for gallbladder cancer. Front. Oncol. 12, 858639. https://doi.org/10.3389/fonc.2022.858639 (2022).
Bhatt, A. et al. Volatile organic compounds in plasma for the diagnosis of esophageal adenocarcinoma: A pilot study. Gastrointest. Endosc. 84, 597–603. https://doi.org/10.1016/j.gie.2015.11.031 (2016).
Huang, J. et al. Selected ion flow tube mass spectrometry analysis of volatile metabolites in urine headspace for the profiling of gastro-esophageal cancer. Anal. Chem. 85, 3409–3416. https://doi.org/10.1021/ac4000656 (2013).
Kumar, S. et al. Mass spectrometric analysis of exhaled breath for the identification of volatile organic compound biomarkers in esophageal and gastric adenocarcinoma. Ann. Surg. 262, 981–990. https://doi.org/10.1097/sla.0000000000001101 (2015).
Markar, S. R. et al. Assessment of a noninvasive exhaled breath test for the diagnosis of oesophagogastric cancer. JAMA Oncol. 4, 970–976. https://doi.org/10.1001/jamaoncol.2018.0991 (2018).
Zou, X. et al. Exhaled gases online measurements for esophageal cancer patients and healthy people by proton transfer reaction mass spectrometry. J. Gastroenterol. Hepatol. 31, 1837–1843. https://doi.org/10.1111/jgh.13380 (2016).
Bannaga, A. S., Tyagi, H., Daulton, E., Covington, J. A. & Arasaradnam, R. P. Exploratory study using urinary volatile organic compounds for the detection of hepatocellular carcinoma. Molecules. https://doi.org/10.3390/molecules26092447 (2021).
Mozdiak, E., Wicaksono, A. N., Covington, J. A. & Arasaradnam, R. P. Colorectal cancer and adenoma screening using urinary volatile organic compound (voc) detection: Early results from a single-centre bowel screening population (uk bcsp). Tech Coloproctol. 23, 343–351. https://doi.org/10.1007/s10151-019-01963-6 (2019).
Kumar, S. et al. Selected ion flow tube mass spectrometry analysis of exhaled breath for volatile organic compound profiling of esophago-gastric cancer. Anal. Chem. 85, 6121–6128. https://doi.org/10.1021/ac4010309 (2013).
de Meij, T. G. et al. Electronic nose can discriminate colorectal carcinoma and advanced adenomas by fecal volatile biomarker analysis: Proof of principle study. Int. J. Cancer 134, 1132–1138. https://doi.org/10.1002/ijc.28446 (2014).
Navaneethan, U. et al. Volatile organic compounds in bile can diagnose malignant biliary strictures in the setting of pancreatic cancer: A preliminary observation. Gastrointest. Endosc. 80, 1038–1045. https://doi.org/10.1016/j.gie.2014.04.016 (2014).
Navaneethan, U. et al. Volatile organic compounds in bile for early diagnosis of cholangiocarcinoma in patients with primary sclerosing cholangitis: A pilot study. Gastrointest. Endosc. 81, 943-949.e941. https://doi.org/10.1016/j.gie.2014.09.041 (2015).
Navaneethan, U. et al. Volatile organic compounds in bile can distinguish pancreatic cancer from chronic pancreatitis: A prospective observational study. Endoscopy 53, 732–736. https://doi.org/10.1055/a-1255-9169 (2021).
Handelman, G. S. et al. Edoctor: Machine learning and the future of medicine. J. Intern. Med. 284, 603–619. https://doi.org/10.1111/joim.12822 (2018).
Koureas, M. et al. Comparison of targeted and untargeted approaches in breath analysis for the discrimination of lung cancer from benign pulmonary diseases and healthy persons. Molecules. https://doi.org/10.3390/molecules26092609 (2021).
Zhang, J. et al. Breath volatile organic compound analysis: An emerging method for gastric cancer detection. J. Breath Res. https://doi.org/10.1088/1752-7163/ac2cde (2021).
Lou, D. et al. Quantitative metabolic analysis of plasma extracellular vesicles for the diagnosis of severe acute pancreatitis. J. Nanobiotechnol. 20, 52. https://doi.org/10.1186/s12951-022-01239-6 (2022).
Li, Z. et al. Real-time ultrasensitive vuv-pims detection of representative endogenous volatile markers in cancers. Cancer Biomark. 16, 477–487. https://doi.org/10.3233/CBM-160587 (2016).
Silva, C. L., Perestrelo, R., Capelinha, F., Tomas, H. & Camara, J. S. An integrative approach based on gc-qms and nmr metabolomics data as a comprehensive strategy to search potential breast cancer biomarkers. Metabolomics 17, 72. https://doi.org/10.1007/s11306-021-01823-1 (2021).
Kure, S. et al. A prediction model using 2-propanol and 2-butanone in urine distinguishes breast cancer. Sci. Rep. 11, 19801. https://doi.org/10.1038/s41598-021-99396-5 (2021).
Monedeiro, F., Monedeiro-Milanowski, M., Zmyslowski, H., De Martinis, B. S. & Buszewski, B. Evaluation of salivary voc profile composition directed towards oral cancer and oral lesion assessment. Clin. Oral Investig. 25, 4415–4430. https://doi.org/10.1007/s00784-020-03754-y (2021).
Wang, C. et al. Volatile organic metabolites identify patients with breast cancer, cyclomastopathy, and mammary gland fibroma. Sci. Rep. 4, 5383. https://doi.org/10.1038/srep05383 (2014).
Wang, C. et al. Noninvasive detection of colorectal cancer by analysis of exhaled breath. Anal. Bioanal. Chem. 406, 4757–4763. https://doi.org/10.1007/s00216-014-7865-x (2014).
Tang, Y. et al. Metabolic adaptation-mediated cancer survival and progression in oxidative stress. Antioxidants. https://doi.org/10.3390/antiox11071324 (2022).
Sa, C., Matos, D., Pires, A., Cardoso, P. & Figueira, E. Effects of volatile sulfur compounds on growth and oxidative stress of rhizobium leguminosarum e20–8 exposed to cadmium. Sci. Total Environ. 800, 149478. https://doi.org/10.1016/j.scitotenv.2021.149478 (2021).
Ahmed Laskar, A. & Younus, H. Aldehyde toxicity and metabolism: The role of aldehyde dehydrogenases in detoxification, drug resistance and carcinogenesis. Drug Metab. Rev. 51, 42–64. https://doi.org/10.1080/03602532.2018.1555587 (2019).
Jackson, B. et al. Update on the aldehyde dehydrogenase gene (aldh) superfamily. Hum Genom. 5, 283–303. https://doi.org/10.1186/1479-7364-5-4-283 (2011).
Huang, C. & Freter, C. Lipid metabolism, apoptosis and cancer therapy. Int J Mol Sci. 16, 924–949. https://doi.org/10.3390/ijms16010924 (2015).
Beale, D. J. et al. Review of recent developments in gc-ms approaches to metabolomics-based research. Metabolomics 14, 152. https://doi.org/10.1007/s11306-018-1449-2 (2018).
Poojary, M. M. & Passamonti, P. Improved conventional and microwave-assisted silylation protocols for simultaneous gas chromatographic determination of tocopherols and sterols: Method development and multi-response optimization. J. Chromatogr. A 1476, 88–104. https://doi.org/10.1016/j.chroma.2016.10.064 (2016).
Bosch, S., Berkhout, D. J., Ben Larbi, I., de Meij, T. G. & de Boer, N. K. Fecal volatile organic compounds for early detection of colorectal cancer: Where are we now? J. Cancer Res. Clin. Oncol. 145, 223–234. https://doi.org/10.1007/s00432-018-2821-3 (2019).
de Vries, R. & Sterk, P. J. Enose breathprints as composite biomarker for real-time phenotyping of complex respiratory diseases. J. Allergy Clin. Immunol. 146, 995–996. https://doi.org/10.1016/j.jaci.2020.07.022 (2020).
CaptainWeller, C. P. Non-targeted screening approaches for profiling of volatile organic compounds based on gas chromatography-ion mobility spectroscopy (gc-ims) and machine learning. Molecules 26, 5457. https://doi.org/10.3390/molecules26185457 (2021).
Acknowledgements
This work was supported by National Natural Science Foundation of China (81972005), Major Scientific and Technological Innovation Project of Shandong Province (2021CXGC010603, 2021CXGC011105), and Taishan scholar program of Shandong Province.
Author information
Authors and Affiliations
Contributions
Q.L.: Data curation, Formal analysis, Writing—original draft. S.L.: Writing—original draft, Visualization. Y.L.: Data curation, Investigation. L.Y.: Software, Validation. Y.X.Z.: Investigation, Methodology. Z.W.: Supervision, Writing—review & editing, Resources. Y.F., X.L., Y.W.: Investigation, Validation, Data curation. X.Z.: Supervision, Writing—review & editing. Y.Z.: Conceptualization, Funding acquisition, Project administration, Resources. All authors approved the final version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, Q., Li, S., Li, Y. et al. Identification of urinary volatile organic compounds as a potential non-invasive biomarker for esophageal cancer. Sci Rep 13, 18587 (2023). https://doi.org/10.1038/s41598-023-45989-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-45989-1
This article is cited by
-
Volatile-compound fingerprinting and discrimination of positional isomers in stamp-pad ink tracing using HS-GC-IMS combined with multivariate statistical analysis
Analytical and Bioanalytical Chemistry (2024)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.