Spectrochemical analysis of blood combined with chemometric techniques for detecting osteosarcopenia

da Silva, Tales Gomes; Morais, Camilo L. M.; Santos, Marfran C. D.; de Lima, Leomir A. S.; de Medeiros Freitas, Raysa Vanessa; Guerra, Ricardo Oliveira; Lima, Kássio M. G.

doi:10.1038/s41598-023-36834-6

Article
Open access
Published: 15 June 2023

Tales Gomes da Silva¹,
Camilo L. M. Morais¹,
Marfran C. D. Santos^1,2,
Leomir A. S. de Lima³,
Raysa Vanessa de Medeiros Freitas⁴,
Ricardo Oliveira Guerra^4,5,6 &
…
Kássio M. G. Lima¹

Scientific Reports volume 13, Article number: 9686 (2023)

Abstract

Among several complications related to physiotherapy, osteosarcopenia is one of the most frequent in elderly patients. This condition is limiting and harmful to the patient’s health because it impairs several basic musculoskeletal activities. Currently, the test to identify this health condition is complex. In this study, we use mid-infrared spectroscopy combined with chemometric techniques to identify osteosarcopenia based on blood serum samples. The purpose of this study was to evaluate the power of mid-infrared spectroscopy to detect osteosarcopenia in community-dwelling older women (n = 62; 30 patients with osteosarcopenia and 32 healthy controls). Feature reduction and selection techniques were employed in conjunction with discriminant analysis, and a principal component analysis with support vector machines (PCA–SVM) model achieved 89% accuracy in distinguishing the samples from patients with osteosarcopenia. This study shows the potential of using infrared spectroscopy of blood samples to identify osteosarcopenia in a simple, fast and objective way.

Introduction

Osteosarcopenia is defined by the European Working Group on Sarcopenia in Older People (EWGSOP) as a progressive and generalized musculoskeletal disorder that is related to physical disability, falls, fractures, and death¹. Osteosarcopenia is a clinical condition often present in people at domestic risk and is considered a factor for several independent health problems, such as difficulties in performing basic and instrumental activities of daily living. In addition, regardless of age, patients with osteosarcopenia incur significantly higher expenses when hospitalized, with costs up to 5 times higher than those of patients who do not have this condition².

The 2018 revision of the consensus proposed by the EWGSOP identifies the reduction of muscle strength, called dynapenia, as the primary parameter of osteosarcopenia, with the diagnosis confirmed by the presence of reduced muscle mass (muscle quantity) and/or reduced physical performance (muscle quality). The prevalence of osteosarcopenia according to these criteria varies widely owing to differences in the studied populations and in the methods employed to evaluate the diagnostic criteria³. The reference techniques employed for osteosarcopenia diagnosis are Magnetic Resonance Imaging (MRI) and Computed Tomography (CT) scans. However, both techniques are expensive, cause considerable discomfort to patients, and are often employed only for late diagnosis. Therefore, new approaches that simplify the diagnosis and allow early detection of osteosarcopenia are very welcome.

New analytical approaches employing biospectroscopy have played an important role in clinical diagnosis^4,5. These approaches make use of vibrational spectroscopy techniques to analyse biological materials, since most molecules formed by covalent bonds absorb infrared (IR) radiation. Among these molecules, there are organic compounds containing important features of biological interest. Attenuated total reflection Fourier transform infrared (ATR-FTIR) spectroscopy allows a fast and non-destructive analysis of tissues, cells or biofluids^4,6. For biofluids, a very small volume of sample is required for analysis, where microliters of sample can be used for measurement⁴. FTIR spectroscopy has been used to diagnose different types of cancer⁷, viruses⁸ and other conditions⁹.

Chemometric techniques have been widely used to analyse spectroscopy data. Feature selection and classification methods have been used to analyse biological datasets, which are highly complex owing to the large amount of information acquired by the equipment. Two of the algorithms employed to reduce these data are principal component analysis (PCA) and the successive projections algorithm (SPA). PCA is an unsupervised algorithm capable of reducing the original high-dimensional data to a small number of principal components (PCs), where each PC represents a part of the original data variance, while SPA deterministically selects the variables that best differentiate the groups by reducing the data multicollinearity¹⁰.

Multivariate classification techniques can be applied to distinguish samples based on their spectrochemical profiles, even in the presence of unknown sources of variation or subtle spectral differences between the samples. Among the supervised classification techniques, linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and support vector machines (SVM) are widely employed since they can discriminate highly complex data with a low risk of overfitting¹⁰. These algorithms are used here to differentiate between control and osteosarcopenia samples using the spectroscopic data collected for both groups in a case–control classification study.

Materials and methods

Samples

Blood samples (n = 62; 32 healthy controls and 30 patients with osteosarcopenia) were obtained from patients with informed consent. The patients were diagnosed based on Dual-energy X-ray Absorptiometry (DXA), which is a method recommended by the EWGSOP¹. The study was approved by the Research Ethics Committee of the Federal University of Rio Grande do Norte (UFRN) under number 2.368.206 following international and national standards (Resolution 466/12 of the National Council of Health) for research with human beings. Each elderly woman invited to participate in the research was informed about the objective and procedures to be adopted and was invited to sign the Informed Consent Form. The interviewers read the Informed Consent Form to the participants and clarified any doubts about all stages of the process.

All the patients in this study met the following eligibility criteria. The inclusion criteria were: (1) ability to walk alone for at least 400 m with or without auxiliary equipment, (2) absence of cognitive impairment (evaluated by the Leganés cognitive test with cut-off score above 22), (3) no history of cancer in the last 5 years, and (4) no acute inflammatory or immunological condition, such as rheumatoid arthritis or systemic lupus erythematosus. The exclusion criteria were: (1) orthopedic or neurological deficiencies that could interfere with test results, (2) lack of regular physical activity (less than 3 times a week), and (3) use of immunosuppressive drugs and/or corticoids in the last 3 months. The patients were screened against these eligibility criteria before sample collection, so only patients suitable for the study were considered.

The collected blood samples were centrifuged at 4000 rpm for 10 min to obtain the blood serum, which was stored in aliquots at −80 °C for further analysis. Before spectrometric analysis, all samples were thawed at room temperature for 30–40 min and then protein precipitation was performed. Precipitation was performed by adding 1.5 µL of 7 M perchloric acid to a 100 µL aliquot of serum. The aliquot was vortexed (FlexVortex 2, Loccus®) for 15 s, and centrifuged for 12 min at 12,000 rpm at 4 °C. The supernatant was then used for analysis (1 drop, approx. 50 μL). Although most of the proteins were precipitated, the sample may still contain protein residues and small proteins, such as myokines, which are fundamental for osteosarcopenia pathogenesis¹¹.

ATR-FTIR spectroscopy

The spectral acquisition was performed using an FTIR IRAffinity-1 spectrometer (Shimadzu Corporation, Japan) coupled to an ATR module containing a diamond crystal as the reflectance element. Measurements were made with 32 co-added scans and 4 cm⁻¹ spectral resolution. The spectral data were acquired in the 4000–600 cm⁻¹ wavenumber range. The samples (10 µL) were applied directly on top of the ATR crystal for measurement. At the beginning of the experiment, the ATR crystal was cleaned with a mixture of ethanol 70% v/v and acetone p.a. (1:1); before each new sample, the crystal was cleaned with ethanol 70% v/v only. A new background spectrum was acquired before each new sample. Samples were measured in triplicate.

Multivariate analysis

The spectral data were entirely processed in the MATLAB 2014b environment (MathWorks, Inc., USA) using the PLS Toolbox version 7.9.3 (Eigenvector Research, Inc., USA) and lab-made routines. First, the samples were divided into training (70%) and test (30%) sets using the Kennard-Stone (KS) algorithm¹². The training samples were used for model construction and cross-validation, while the test samples were used for final model evaluation. The spectral data were pre-processed by Savitzky-Golay smoothing (window of 5 points, 2nd order polynomial fitting) and automatic-weighted least squares baseline correction. Other pre-processing methods, including normalization procedures, were also tested but resulted in lower accuracies; only the best pre-processing is presented herein. The replicate pre-processed spectra were averaged for each sample, so the analysis was performed on a per-sample basis. The data were also mean-centered before analysis.
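The Kennard-Stone selection can be illustrated with a minimal Python sketch. It is not the MATLAB routine used in this work: the function name `kennard_stone`, the use of Euclidean distance, and the toy data are assumptions made for illustration only. The algorithm starts with the two most distant samples and then repeatedly adds the sample whose nearest already-selected neighbour is farthest away, so the training set covers the data space uniformly.

```python
import math

def kennard_stone(samples, n_train):
    """Return the indices of n_train samples chosen by Kennard-Stone selection."""
    n = len(samples)
    if not 2 <= n_train <= n:
        raise ValueError("n_train must be between 2 and the number of samples")
    # Pairwise Euclidean distances (an assumption; other metrics are possible).
    dist = [[math.dist(a, b) for b in samples] for a in samples]
    # Start with the two samples that are farthest apart.
    first, second = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda pair: dist[pair[0]][pair[1]],
    )
    selected = [first, second]
    remaining = [i for i in range(n) if i not in selected]
    while len(selected) < n_train:
        # Add the candidate whose nearest selected sample is farthest away.
        best = max(remaining, key=lambda i: min(dist[i][s] for s in selected))
        selected.append(best)
        remaining.remove(best)
    return selected

# Hypothetical one-variable "spectra", used only to show the selection order.
toy = [(0.0,), (1.0,), (2.0,), (3.0,), (10.0,)]
train_idx = kennard_stone(toy, 3)
test_idx = [i for i in range(len(toy)) if i not in train_idx]
```

In this toy case the two extremes are selected first and the remaining samples form the test set; for a 70%/30% split, `n_train` would be set to about 70% of the number of samples.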

The following classification algorithms based on PCA and SPA were used to analyse the pre-processed spectral data: PCA-LDA (principal component analysis with linear discriminant analysis), PCA-QDA (principal component analysis with quadratic discriminant analysis), PCA-SVM (principal component analysis with support vector machines), SPA-LDA (successive projections algorithm with linear discriminant analysis), SPA-QDA (successive projections algorithm with quadratic discriminant analysis), and SPA-SVM (successive projections algorithm with support vector machines).

PCA is one of the best-known methods of variable reduction for large volumes of data, where a large number of spectral variables are reduced to a small number of PCs, each composed of scores and loadings¹³. The scores reflect the variance found among the samples, while the loadings show the most important variables for the construction of the scores. The scores and loadings matrices are obtained from the PCA decomposition of the pre-processed spectral matrix as follows:

$$\mathbf{X}=\mathbf{T}{\mathbf{P}}^{\mathrm{T}}+\mathbf{E}$$

(1)

where $\mathbf{T}$ represents the scores matrix; $\mathbf{P}$ represents the loading matrix; and $\mathbf{E}$ represents the residual matrix for total reconstruction of the pre-processed spectral matrix $\mathbf{X}$. As the scores represent the samples in the PC space, they can be used as input data for classification algorithms as in LDA, QDA and SVM.
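As an illustration of Eq. 1 for a single component, the following Python sketch estimates the first PC by power iteration. It is only a sketch under stated assumptions, not the PLS Toolbox implementation used here: the function name `first_principal_component`, the number of iterations, the sign convention and the toy matrix are all hypothetical.

```python
import math

def first_principal_component(X, n_iter=200):
    """Return (scores, loading, explained_fraction) for PC1 of the rows of X."""
    n, p = len(X), len(X[0])
    # Mean-center each variable (column).
    means = [sum(row[j] for row in X) / n for j in range(p)]
    Xc = [[row[j] - means[j] for j in range(p)] for row in X]
    total_ss = sum(x * x for row in Xc for x in row)
    if total_ss == 0.0:
        raise ValueError("X has no variance")
    # Start from the centered row with the largest norm (arbitrary choice).
    v = max(Xc, key=lambda row: sum(x * x for x in row))
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(n_iter):
        # One power-iteration step on Xc^T Xc: t = Xc v, then w = Xc^T t.
        t = [sum(a * b for a, b in zip(row, v)) for row in Xc]
        w = [sum(Xc[i][j] * t[i] for i in range(n)) for j in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Sign convention (assumed): largest-magnitude loading is positive.
    if max(v, key=abs) < 0:
        v = [-x for x in v]
    scores = [sum(a * b for a, b in zip(row, v)) for row in Xc]
    explained = sum(s * s for s in scores) / total_ss
    return scores, v, explained

# Hypothetical collinear data, so PC1 captures essentially all the variance.
toy = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0]]
scores, loading, explained = first_principal_component(toy)
```

The returned scores correspond to one column of T, the loading vector to one column of P, and whatever is not reconstructed by their outer product is the residual E. The explained fraction is the quantity reported as captured variance for a PC.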

SPA, on the other hand, performs a discrete selection of variables, selecting those that best differentiate the groups by minimizing a cost function $\mathrm{G}$, represented below¹⁴:

$$\mathrm{G}=\frac{1}{\mathrm{Nv}} \sum_{\mathrm{n}=1}^{\mathrm{Nv}}{\mathrm{g}}_{\mathrm{n}}$$

(2)

where $\mathrm{Nv}$ is the number of validation samples and ${\mathrm{g}}_{\mathrm{n}}$ is defined as follows:

$${\mathrm{g}}_{\mathrm{n}}=\frac{{\mathrm{r}}^{2}({\mathrm{x}}_{\mathrm{n}},{\mathrm{m}}_{\mathrm{I}(\mathrm{n})})}{{\mathrm{min}}_{\mathrm{I}(\mathrm{m})\ne \mathrm{I}(\mathrm{n})}\,{\mathrm{r}}^{2}({\mathrm{x}}_{\mathrm{n}},{\mathrm{m}}_{\mathrm{I}(\mathrm{m})})}$$

(3)

The numerator of Eq. 3 is the squared Mahalanobis distance between the sample n, ${\mathrm{x}}_{\mathrm{n}}$, and the center of the true class (${\mathrm{m}}_{\mathrm{I}(\mathrm{n})}$); and, the denominator represents the squared Mahalanobis distance between the sample ${\mathrm{x}}_{\mathrm{n}}$ and the center of the closest wrong class (${\mathrm{m}}_{\mathrm{I}(\mathrm{m})}$).
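The cost function of Eqs. 2 and 3 can be sketched in Python as follows. The function names, the dictionary of class centers and the use of an identity matrix as the inverse covariance (which reduces the Mahalanobis distance to the Euclidean distance) are assumptions for illustration, not the routine used in this study.

```python
def mahalanobis_sq(x, center, cov_inv):
    """Squared Mahalanobis distance r^2(x, center) for a given inverse covariance."""
    d = [a - b for a, b in zip(x, center)]
    size = len(d)
    return sum(d[i] * cov_inv[i][j] * d[j] for i in range(size) for j in range(size))

def spa_cost(samples, labels, class_centers, cov_inv):
    """Average risk G over the validation samples (Eq. 2), with g_n from Eq. 3."""
    g_values = []
    for x, label in zip(samples, labels):
        # Numerator: distance to the center of the true class.
        own = mahalanobis_sq(x, class_centers[label], cov_inv)
        # Denominator: distance to the closest wrong class center.
        # (It is zero only if the sample coincides with a wrong class center.)
        nearest_wrong = min(
            mahalanobis_sq(x, center, cov_inv)
            for name, center in class_centers.items()
            if name != label
        )
        g_values.append(own / nearest_wrong)
    return sum(g_values) / len(g_values)

# Hypothetical two-variable example with an identity inverse covariance.
centers = {"control": [0.0, 0.0], "case": [4.0, 0.0]}
identity = [[1.0, 0.0], [0.0, 1.0]]
validation = [[1.0, 0.0], [3.0, 0.0]]
validation_labels = ["control", "case"]
G = spa_cost(validation, validation_labels, centers, identity)
```

Each toy sample lies three times closer to its own class center than to the other one, so every g value is 1/9. Smaller values of G indicate that the selected variables place the validation samples closer to their true classes.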

LDA and QDA are algorithms based on the Mahalanobis distance calculation between the samples. As the main difference between them, in LDA, it is assumed that all classes have well-defined and similar variance structures. In QDA, it is assumed that the classes do not have similar variance structures, thus, the covariance matrix is calculated individually for each analysed class¹⁵. The LDA (${\mathrm{L}}_{\mathrm{ik}}$) and QDA (${\mathrm{Q}}_{\mathrm{ik}}$) classification scores can be defined in a non-Bayesian form by the following Eqs.¹⁶:

$${\mathrm{L}}_{\mathrm{ik}}={\left({\mathbf{x}}_{\mathrm{i}}- {\overline{\mathbf{x}} }_{\mathrm{k}}\right)}^{\mathrm{T}}{\mathbf{C}}_{\mathrm{pooled}}^{-1}({\mathbf{x}}_{\mathrm{i}}- {\overline{\mathbf{x}} }_{\mathrm{k}})$$

(4)

$${\mathrm{Q}}_{\mathrm{ik}}={\left({\mathbf{x}}_{\mathrm{i}}-{\overline{\mathbf{x}} }_{\mathrm{k}}\right)}^{\mathrm{T}}{\mathbf{C}}_{\mathrm{k}}^{-1}({\mathbf{x}}_{\mathrm{i}}-{\overline{\mathbf{x}} }_{\mathrm{k}})$$

(5)

where ${\mathbf{x}}_{\mathrm{i}}$ is the response vector for a given i-th sample; ${\overline{\mathbf{x}} }_{\mathrm{k}}$ is the mean response vector for the k-th class; ${\mathbf{C}}_{\mathrm{pooled}}$ is the pooled covariance matrix; and ${\mathbf{C}}_{\mathrm{k}}$ is the covariance matrix calculated for the k-th class.
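For a single input variable (for example, one PC score per sample), the covariance matrices in Eqs. 4 and 5 reduce to variances, which allows a compact Python sketch. The function name, the pooled-variance estimator (weighted by degrees of freedom) and the toy scores below are assumptions for illustration.

```python
from statistics import mean, variance

def discriminant_scores(x, train_by_class):
    """Return (LDA scores, QDA scores) of a scalar x for each class.

    Univariate simplification of Eqs. 4 and 5: the sample is assigned
    to the class with the smallest score.
    """
    means = {k: mean(v) for k, v in train_by_class.items()}
    variances = {k: variance(v) for k, v in train_by_class.items()}
    n_total = sum(len(v) for v in train_by_class.values())
    n_classes = len(train_by_class)
    # Pooled variance weighted by degrees of freedom (assumed estimator).
    pooled = sum(
        (len(v) - 1) * variances[k] for k, v in train_by_class.items()
    ) / (n_total - n_classes)
    lda = {k: (x - means[k]) ** 2 / pooled for k in means}
    qda = {k: (x - means[k]) ** 2 / variances[k] for k in means}
    return lda, qda

# Hypothetical training scores: the "case" class is more spread out.
train = {"control": [-2.0, -1.0, 0.0], "case": [1.0, 3.0, 5.0]}
lda, qda = discriminant_scores(0.5, train)
lda_class = min(lda, key=lda.get)
qda_class = min(qda, key=qda.get)
```

With these toy values the two classifiers disagree: LDA, using a common variance, assigns the sample to the nearer class mean, whereas QDA accounts for the larger spread of the second class and assigns the sample to it. This is the practical consequence of calculating the covariance individually for each class.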

SVM is a supervised classification algorithm which transforms the original data into a new feature space using a kernel function that maximises, often non-linearly, the boundaries between the samples in their respective groups¹⁷. Among the main kernel functions, we have the radial basis function (RBF). The RBF function is calculated as follows¹⁸:

$$K\left({\mathbf{x}}_{\mathbf{i}},{\mathbf{z}}_{j}\right)=\mathrm{exp}\left(-\gamma {\Vert {\mathbf{x}}_{\mathbf{i}}-{\mathbf{z}}_{j}\Vert }^{2}\right)$$

(6)

where ${\mathbf{x}}_{\mathbf{i}}$ and ${\mathbf{z}}_{j}$ are sample observations and $\gamma $ is the parameter that determines the RBF width.
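As a worked illustration of the RBF kernel, consider two hypothetical two-variable observations and an arbitrarily chosen width parameter (these values are assumptions for illustration and are not taken from the study data):

$${\mathbf{x}}_{\mathbf{i}}=(1, 2),\quad {\mathbf{z}}_{j}=(2, 4),\quad \gamma =0.1$$

$${\Vert {\mathbf{x}}_{\mathbf{i}}-{\mathbf{z}}_{j}\Vert }^{2}={(1-2)}^{2}+{(2-4)}^{2}=5,\qquad K\left({\mathbf{x}}_{\mathbf{i}},{\mathbf{z}}_{j}\right)=\mathrm{exp}(-0.1\times 5)=\mathrm{exp}(-0.5)\approx 0.607$$

The kernel equals 1 for identical observations and decays towards 0 as the observations move apart. A larger $\gamma $ makes this decay faster, so each training sample influences a narrower neighbourhood.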

The SVM classification was performed using the best training parameters obtained from cross-validation (venetian blinds with 10 data splits). The SVM classification takes the form:

$$f\left(\mathrm{x}\right)=\mathrm{sign}\left(\sum_{\mathrm{i}=1}^{{N}_{\mathrm{SV}}}{\alpha }_{\mathrm{i}}{y}_{\mathrm{i}}K\left({\mathbf{x}}_{\mathbf{i}},{\mathbf{z}}_{j}\right)+b\right)$$

(7)

where ${N}_{\mathrm{SV}}$ is the number of support vectors, ${\alpha }_{\mathrm{i}}$ is the Lagrange multiplier, ${y}_{\mathrm{i}}$ is the training class membership (± 1), $K\left({\mathbf{x}}_{\mathbf{i}},{\mathbf{z}}_{j}\right)$ is the kernel function, and $b$ is the bias parameter.
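The decision rule of Eq. 7 can be sketched in Python for an already trained model. The support vectors, multipliers, bias and $\gamma $ below are hypothetical values chosen for illustration; in practice they result from training and cross-validation, which this sketch does not perform.

```python
import math

def rbf_kernel(x, z, gamma):
    """RBF kernel: exp(-gamma * squared Euclidean distance between x and z)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))

def svm_decision(z, support_vectors, alphas, labels, bias, gamma):
    """Class (+1 or -1) predicted for observation z by a trained SVM (Eq. 7)."""
    total = sum(
        alpha * y * rbf_kernel(sv, z, gamma)
        for sv, alpha, y in zip(support_vectors, alphas, labels)
    )
    # A value of exactly zero is assigned to +1 here (arbitrary convention).
    return 1 if total + bias >= 0 else -1

# Hypothetical model with one support vector per class on a single PC score.
support_vectors = [(-1.0,), (1.0,)]
alphas = [1.0, 1.0]
labels = [-1, 1]
prediction_a = svm_decision((0.8,), support_vectors, alphas, labels, bias=0.0, gamma=0.5)
prediction_b = svm_decision((-0.8,), support_vectors, alphas, labels, bias=0.0, gamma=0.5)
```

Each new observation is compared only with the support vectors, and the closest ones dominate the sum because the kernel decays with distance.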

Model validation

The models were validated based on quality parameters calculated for the test samples. The accuracy, sensitivity, specificity, F-Score and G-Score were calculated as follows¹⁹:

$$\text{Accuracy }\left(\mathrm{AC}\right)=\left(\frac{\mathrm{TP}+\mathrm{TN}}{\mathrm{TP}+\mathrm{FP}+\mathrm{TN}+\mathrm{FN}}\right)\times 100$$

(8)

$$\text{Sensitivity }\left(\mathrm{SENS}\right)=\left(\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}\right)\times 100$$

(9)

$$\text{Specificity }\left(\mathrm{SPEC}\right)=\left(\frac{\mathrm{TN}}{\mathrm{TN}+\mathrm{FP}}\right)\times 100$$

(10)

$$\text{F-Score }\left(\mathrm{FS}\right)=\left(\frac{2\times \mathrm{SENS}\times \mathrm{SPEC}}{\mathrm{SENS}+\mathrm{SPEC}}\right)$$

(11)

$$\text{G-Score }\left(\mathrm{GS}\right)= \sqrt{\mathrm{SENS}\times \mathrm{SPEC}}$$

(12)

where FP stands for false positive, FN for false negative, TP stands for true positive, and TN for true negative.
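Eqs. 8 to 12 can be computed directly from the four confusion-matrix counts, as in the Python sketch below. The function name and the counts in the example are hypothetical and do not correspond to the results of this study.

```python
import math

def figures_of_merit(tp, tn, fp, fn):
    """Accuracy, sensitivity, specificity (in %), F-Score and G-Score (Eqs. 8-12)."""
    accuracy = (tp + tn) / (tp + fp + tn + fn) * 100
    sensitivity = tp / (tp + fn) * 100
    specificity = tn / (tn + fp) * 100
    # F-Score and G-Score combine sensitivity and specificity, as in Eqs. 11-12.
    f_score = 2 * sensitivity * specificity / (sensitivity + specificity)
    g_score = math.sqrt(sensitivity * specificity)
    return {
        "AC": accuracy,
        "SENS": sensitivity,
        "SPEC": specificity,
        "FS": f_score,
        "GS": g_score,
    }

# Hypothetical test-set counts, for illustration only.
fom = figures_of_merit(tp=8, tn=9, fp=1, fn=2)
```

For these counts the accuracy is 85%, the sensitivity 80% and the specificity 90%. The F-Score and G-Score both fall between the sensitivity and specificity and drop when the two are unbalanced.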

Results

In this study, 62 samples were analysed, including 32 healthy controls and 30 samples from patients with osteosarcopenia. The ATR-FTIR technique was used to obtain spectra from the blood serum of these patients. The spectra were analysed in the biofingerprint region (1800–900 cm⁻¹), in which there are many absorption bands related to important biomolecules, for example, the amide I peak (~1650 cm⁻¹), which is related to proteins⁴. The raw and pre-processed average spectra for the dataset are shown in Fig. 1A and B, respectively.

The spectral data were pre-processed by Savitzky-Golay smoothing and baseline correction, followed by mean-centering. The pre-processed spectral data were subjected to chemometric analysis by various classification techniques (PCA-LDA, PCA-QDA, PCA-SVM, SPA-LDA, SPA-QDA, SPA-SVM). Data processing is applied as a strategy to extract important spectral information to differentiate healthy controls from osteosarcopenia samples.
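The smoothing and mean-centering steps can be sketched in Python as follows. For a 5-point window and a 2nd order polynomial, the Savitzky-Golay smoothing coefficients are (−3, 12, 17, 12, −3)/35. The function names are hypothetical, the two points at each edge are left unsmoothed as a simplifying assumption, and the baseline correction step is not reproduced.

```python
# Savitzky-Golay smoothing coefficients: 5-point window, 2nd order polynomial.
SG_COEFFS = (-3 / 35, 12 / 35, 17 / 35, 12 / 35, -3 / 35)

def savgol_5pt(spectrum):
    """Smooth one spectrum; the two points at each edge are left unchanged."""
    smoothed = list(spectrum)
    for i in range(2, len(spectrum) - 2):
        window = spectrum[i - 2 : i + 3]
        smoothed[i] = sum(c * y for c, y in zip(SG_COEFFS, window))
    return smoothed

def mean_center(spectra):
    """Subtract the mean spectrum (column means) from every spectrum (row)."""
    n = len(spectra)
    means = [sum(column) / n for column in zip(*spectra)]
    return [[y - m for y, m in zip(row, means)] for row in spectra]

# Hypothetical inputs: a quadratic curve and a single-point spike.
quadratic = [float(x * x) for x in range(7)]
smoothed_quadratic = savgol_5pt(quadratic)
smoothed_spike = savgol_5pt([0.0, 0.0, 1.0, 0.0, 0.0])
centered = mean_center([[1.0, 2.0], [3.0, 4.0]])
```

A quadratic curve passes through the filter unchanged, because the filter fits a 2nd order polynomial in each window, whereas an isolated spike is attenuated to 17/35 of its height.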

For model construction, the pre-processed spectral data were divided into sets where 70% of the samples were used for training and 30% for testing using the KS uniform sample selection algorithm. Figures of merit (accuracy, sensitivity, specificity, F-Score and G-Score) were calculated to evaluate the performance of the model in predicting the samples in the test set. Accuracy represents the proportion of all samples that are correctly classified, considering both true positives and true negatives; sensitivity represents the proportion of positives that are correctly classified; and specificity represents the proportion of negatives that are correctly classified. The statistical results calculated for the prediction set are shown in Table 1, with the best model in bold. The best results after evaluating the test samples were obtained using the PCA-SVM model. The discriminant function that demonstrates the classes’ separation can be seen in Fig. 2.

Table 1 Figures of merit (FOM) for the tested models, where AC stands for accuracy, SENS for sensitivity and SPEC for specificity. The best model is highlighted in bold.

In terms of classification, the PCA-based models were built with a single PC capturing a variance of approximately 43% of the dataset. When only one score component was calculated for each sample with its pre-processed spectrum, it was possible to obtain, through the SVM classification algorithm, 89% for all figures of merit analysed, which is a relevant and important value for the distinction between the groups.

The loadings on the first PC, from the PCA-SVM model, were used to identify the most important variables for differentiation of the classes. The loading peaks, selected as the regions of greatest importance, were identified at 1711, 1661, 1574, 1510, 1398, 1273, 1225, 1107, and 906.5 cm⁻¹. The loadings graph is shown in Fig. 3. The tentative assignment of these variables was based on the study by Movasaghi et al.²⁰ and is summarized in Table 2. Here, the assignments were performed considering the regions of maximum response and their respective maximum points in the loadings graph seen in Fig. 3.

Table 2 Main selected wavenumbers based on the PCA loadings on PC1, used to distinguish osteosarcopenia samples from healthy controls.

Discussion

The use of spectroscopy in the detection and screening of diseases with complex diagnosis has become common in recent years, with excellent results being achieved when combined with multivariate data analysis. Examples include the differentiation of breast cancer patients²¹ and gestational diabetes mellitus²², and even cases related to physical therapy, such as the detection of fibromyalgia²³, all with satisfactory accuracy values. A genetic algorithm with linear discriminant analysis model, used to differentiate control patients from those diagnosed with fibromyalgia, obtained an accuracy of 84.2%, with a sensitivity of 89.5% and a specificity of 79%²³. An SPA-SVM model used for the detection of breast cancer obtained an accuracy of approximately 93%²¹. In this application to differentiate patients with osteosarcopenia, the model used is a combination of a supervised classifier (SVM) with a dimensionality reduction model (PCA), capable of transforming a large number of wavenumbers into a few factors built through linear combinations of the original variables, facilitating the interpretation of the results and their visualization through the use of the PCs. The supervised SVM model is useful and effective in many cases of difficult separation between classes, presenting satisfactory results in several examples of biospectroscopic applications, such as the classification of patients with prostate cancer²⁴ and breast cancer²¹ based on the mid-infrared (MIR) spectral region.

The main wavenumbers responsible for classification are shown in Table 2. Slightly higher absorbance intensities are observed for the healthy control spectra at 1660 cm⁻¹, between 1580 and 1400 cm⁻¹, and around 1100 cm⁻¹ (Fig. 1). These are associated with Amide I of proteins (1661 cm⁻¹), C=N of adenine (1574 cm⁻¹), Amide II of proteins (1510 cm⁻¹), and ring polysaccharides (1107 cm⁻¹) vibrations (Table 2). Higher absorbance intensities are observed for the osteosarcopenia group between 1400 and 1200 cm⁻¹ and below 1000 cm⁻¹, which are associated with CH₃ symmetric deformation (1398 cm⁻¹), CH $\alpha $ rocking (1273 cm⁻¹), collagen and asymmetric stretching of phosphate groups of phosphodiester linkages in DNA and RNA (1225 cm⁻¹), and amino acids related to the left-handed helix DNA in Z form (906.5 cm⁻¹).

There are several factors associated with osteosarcopenia, including nutrition, lifestyle and genetics. In addition, many biochemical changes in the bone-muscle crosstalk contribute to the development of osteosarcopenia²⁵. These include growth hormone/insulin-like growth factor-1 (GH/IGF-1), gonadal sex hormones and vitamin D, whose age-related decrease contributes to the development of osteosarcopenia^25,26. Patients with osteosarcopenia have insufficient intake of proteins^25,27, where reduced levels of protein intake, vitamin D, calcium and reduction in physical activity are correlated with declining muscle strength, thus being key factors for osteosarcopenia²⁷. Among these proteins, myokines are small proteins (5–20 kDa) which are fundamental for osteosarcopenia pathogenesis, where altered levels of these proteins disturb the balance between anabolic and catabolic effects, with consequent age-related muscle atrophy¹¹. In addition, genetic polymorphisms of various genes, such as androgen receptor, oestrogen receptor, catechol-O-methyltransferase, IGF-1, vitamin D receptor and low-density-lipoprotein receptor-related protein, contribute to the pathogenesis of osteosarcopenia²⁵.

The IR spectra can contain such biochemical contributions in a complex matrix, for which the use of multivariate analysis enables the distinction of case and control groups. For example, changes in protein and amino acid absorptions, such as Amide I and Amide II, and in DNA/RNA absorptions may be directly associated with the reduction of protein levels and genetic alterations in patients with osteosarcopenia. However, deeper studies are necessary to understand the biochemical pathways of this disease, which may include chromatographic and mass spectrometric techniques, since the FTIR spectra can only provide clues about the functional groups associated with the disease appearance, and do not provide sufficient information to identify specific metabolites or molecular markers associated with the disease. In addition, whilst the results reported herein are promising, ideally such a study should be expanded and tested against a larger population of patients to ensure the method can be applied more generally.

Conclusion

In this study, we were able to distinguish patients with osteosarcopenia from healthy controls based on their blood serum. PCA-SVM results reached 89% accuracy, sensitivity and specificity to distinguish both groups in an external sample test set compared to the gold-standard method. The results are promising and demonstrate the potential of spectroscopic techniques in conjunction with multivariate data analysis for osteosarcopenia diagnosis.

References

1. Cruz-Jentoft, A. J. et al. Sarcopenia: Revised European consensus on definition and diagnosis. Age Ageing 48, 16–31 (2019).
2. Sousa, A. S. et al. Financial impact of sarcopenia on hospitalization costs. Eur. J. Clin. Nutr. 70, 1046–1051 (2016).
3. Fielding, R. A. et al. Sarcopenia: An undiagnosed condition in older adults. current consensus definition: prevalence, etiology, and consequences. International working group on sarcopenia. J. Am. Med. Dir. Assoc. 12, 249–256 (2011).
4. Baker, M. J. et al. Using Fourier transform IR spectroscopy to analyze biological materials. Nat. Protoc. 9, 1771–1791 (2014).
5. Morais, C. L. M. et al. Standardization of complex biologically derived spectrochemical datasets. Nat. Protoc. 14, 1546–1577 (2019).
6. Martin, F. L. et al. Distinguishing cell types or populations based on the computational analysis of their infrared spectra. Nat. Protoc. 5, 1748–1760 (2010).
7. Siqueira, L. F. S. & Lima, K. M. G. MIR-biospectroscopy coupled with chemometrics in cancer studies. Analyst 141, 4833–4847 (2016).
8. Santos, M. C. D., Morais, C. L. M., Nascimento, Y. M., Araujo, J. M. G. & Lima, K. M. G. Spectroscopy with computational analysis in virological studies: A decade (2006–2016). Trends Analyt. Chem. 97, 244–256 (2017).
9. Baker, M. J. et al. Clinical applications of infrared and Raman spectroscopy: State of play and future challenges. Analyst 143, 1735–1757 (2018).
10. Morais, C. L. M., Lima, K. M. G., Singh, M. & Martin, F. L. Tutorial: multivariate classification for vibrational spectroscopy in biological samples. Nat. Protoc. 15, 2143–2162 (2020).
11. Ladang, A. et al. Biochemical markers of musculoskeletal health and aging to be assessed in clinical trials of drugs aiming at the treatment of sarcopenia: Consensus paper from an expert group meeting organized by the European society for clinical and economic aspects of osteoporosis, osteoarthritis and musculoskeletal diseases (ESCEO) and the centre Académique de Recherche et d’Expérimentation en Santé (CARES SPRL), under the auspices of the world health organization collaborating center for the epidemiology of musculoskeletal conditions and aging. Calcif. Tissue Int. 112, 197–217 (2023).
12. Kennard, R. W. & Stone, L. A. Computer aided design of experiments. Technometrics 11, 137–148 (1969).
13. Bro, R. & Smilde, A. K. Principal component analysis. Anal. Methods 6, 2812–2831 (2014).
14. Theophilou, G. et al. Synchrotron- and focal plane array-based Fourier-transform infrared spectroscopy differentiates the basalis and functionalis epithelial endometrial regions and identifies putative stem cell regions of human endometrial glands. Anal. Bioanal. Chem. 410, 4541–4554 (2018).
15. Morais, C. L. M. & Lima, K. M. G. Principal component analysis with linear and quadratic discriminant analysis for identification of cancer samples based on mass spectrometry. J. Braz. Chem. Soc. 29, 472–481 (2018).
16. Dixon, S. J. & Brereton, R. G. Comparison of performance of five common classifiers represented as boundary methods: Euclidean distance to centroids, linear discriminant analysis, quadratic discriminant analysis, learning vector quantization and support vector machines, as dependent on data structure. Chemometr. Intell. Lab. Syst. 95, 1–17 (2009).
17. Cortes, C., Vapnik, V. & Saitta, L. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
18. Morais, C. L. M., Costa, F. S. L. & Lima, K. M. G. Variable selection with a support vector machine for discriminating: Cryptococcus fungal species based on ATR-FTIR spectroscopy. Anal. Methods 9, 2964–2970 (2017).
19. Morais, C. L. M. & Lima, K. M. G. Comparing unfolded and two-dimensional discriminant analysis and support vector machines for classification of EEM data. Chemometr. Intell. Lab. Syst. 170, 1–12 (2017).
20. Movasaghi, Z., Rehman, S. & Rehman, I. U. Fourier transform infrared (FTIR) spectroscopy of biological tissues. Appl. Spectrosc. Rev. 43, 134–179 (2008).
21. Freitas, D. L. D. et al. Spectrochemical analysis of liquid biopsy harnessed to multivariate analysis towards breast cancer screening. Sci. Rep. 10, 12818 (2020).
22. Bernardes-Oliveira, E. et al. Spectrochemical differentiation in gestational diabetes mellitus based on attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy and multivariate analysis. Sci. Rep. 10, 19259 (2020).
23. Passos, J. O. S. et al. Spectrochemical analysis in blood plasma combined with subsequent chemometrics for fibromyalgia detection. Sci. Rep. 10, 11769 (2020).
24. Siqueira, L. F. S., Morais, C. L. M., Araújo Júnior, R. F., de Araújo, A. A. & Lima, K. M. G. SVM for FT-MIR prostate cancer classification: An alternative to the traditional methods. J. Chemom. 32, e3075 (2018).
25. Paintin, J., Cooper, C. & Dennison, E. Osteosarcopenia. Br. J. Hosp. Med. 79, 253–258 (2018).
26. Girgis, C. M., Mokbel, N. & Digirolamo, D. J. Therapies for musculoskeletal disease: can we treat two birds with one stone?. Curr. Osteoporos. Rep. 12, 142–153 (2014).
27. Maghbooli, Z. et al. The lower basal metabolic rate is associated with increased risk of osteosarcopenia in postmenopausal women. BMC Women’s Health 22, 171 (2022).
28. Polito, A., Barnaba, L., Ciarapica, D. & Azzini, E. Osteosarcopenia: A narrative review on clinical studies. Int. J. Mol. Sci. 23, 5591 (2022).

Author information

Authors and Affiliations

Institute of Chemistry, Biological Chemistry and Chemometrics, Federal University of Rio Grande do Norte, Natal, 59075-970, Brazil
Tales Gomes da Silva, Camilo L. M. Morais, Marfran C. D. Santos & Kássio M. G. Lima
Federal Institute of Education, Science and Technology of Sertão Pernambucano, Floresta, 56400-000, Brazil
Marfran C. D. Santos
Estácio de Sá Goiás, North Regional, Goiânia, GO, 74063-010, Brazil
Leomir A. S. de Lima
Postgraduation Program in Health Sciences, Federal University of Rio Grande do Norte, Natal, 59075-970, Brazil
Raysa Vanessa de Medeiros Freitas & Ricardo Oliveira Guerra
Postgraduation Program in Physiotherapy, Federal University of Rio Grande do Norte, Natal, 59075-970, Brazil
Ricardo Oliveira Guerra
Department of Physiotherapy, Federal University of Rio Grande do Norte, Natal, 59075-970, Brazil
Ricardo Oliveira Guerra

Contributions

T.G.S. performed the experimental part and analyzed the data and wrote the manuscript; D.L.D.F. analyzed the data and contributed to the writing of the first version of the paper; C.L.M.M. provided chemometrics support and text reviewing; M.C.D.S. investigation, software and writing-review and editing; L.A.S.L. conceptualization, planning and writing of the manuscript; R.V.M.F. designed and conducted experimental work; R.O.G. designed and provided clinical insight; K.M.G.L. designed, analyzed data and finalized the manuscript.

Corresponding author

Correspondence to Kássio M. G. Lima.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

da Silva, T.G., Morais, C.L.M., Santos, M.C.D. et al. Spectrochemical analysis of blood combined with chemometric techniques for detecting osteosarcopenia. Sci Rep 13, 9686 (2023). https://doi.org/10.1038/s41598-023-36834-6

Received: 24 January 2023
Accepted: 10 June 2023
Published: 15 June 2023
DOI: https://doi.org/10.1038/s41598-023-36834-6
