Abstract
Objective. The infrapatellar fat pad (IPFP) has been associated with knee osteoarthritis onset and progression. This study uses machine learning (ML) approaches to predict serum levels of some adipokines/related inflammatory factors and their ratios on knee IPFP volume of osteoarthritis patients.Methods. Serum and MRI were from the OAI at baseline. Variables comprised the 3 main osteoarthritis risk factors (age, gender, BMI), 6 adipokines, 3 inflammatory factors, and their 36 ratios. IPFP volume was assessed on MRI with a ML methodology. The best variables and models were identified in Total-cohort (n = 678), High-BMI (n = 341) and Low-BMI (n = 337), using a selection approach based on ML methods. Results. The best model for each group included three risk factors and adipsin/C-reactive protein combined for Total-cohort, adipsin/chemerin; High-BMI, chemerin/adiponectin HMW; and Low-BMI, interleukin-8. Gender separation improved the prediction (13–16%) compared to the BMI-based models. Reproducibility with osteoarthritis patients from a clinical trial was excellent (R: female 0.83, male 0.95). Pseudocodes based on gender were generated.Conclusion. This study demonstrates for the first time that the combination of the serum levels of adipokines/inflammatory factors and the three main risk factors of osteoarthritis could predict IPFP volume with high reproducibility, with the superior performance of the model accounting for gender separation.
Similar content being viewed by others
Introduction
Osteoarthritis (OA) is known as one of the most common musculoskeletal disorders and occurs in the about half of the population over 65 years old1,2. According to the World Population Data Sheet3, older adults (>65); comprising about 5% of the global population in 1960, increased to 9% in 2018 and are projected to rise to 16% by 2050. With this population shift, the social and economic effects of medical care related to OA will be significant; however, current treatments are only symptomatic and no cure yet exists. One of the hurdles in OA drug discovery, as well as for the improvement of therapeutic approaches, is the early identification of patients who will progress. It is therefore crucial to find efficient and reliable means of screening OA progressors. Although the main risk factors, age, gender and body mass index (BMI), are important, they alone are poor predictors. However, serum molecules could be potential biomarkers for predicting knee OA progression.
Recent studies suggest that alteration in the infrapatellar fat pad (IPFP), an adipose tissue and the largest soft tissue structure in the knee joint4, is a potential early marker of OA disease incidence or progression5,6,7,8,9. Some inflammatory factors and adipokines produced by the IPFP have been identified as having a role in OA progression10,11. Furthermore, ratios of adipokines/inflammatory factors have also been demonstrated in OA and other diseases to provide a better prediction assessment than individual factors12,13,14,15,16,17,18,19.
In a first step toward finding early reliable predictors of OA progressors, this study aimed to determine, in OA individuals, the optimum combination of serum levels of adipokines, related inflammatory factors, their ratios, and the three main OA risk factors for predicting knee OA IPFP volume. Moreover, as adipokines are adipose tissue-derived mediators and some are involved in obesity, we also investigated if differences occurred between obese and non-obese OA subjects. To this end, we used a comprehensive machine learning approach which provides an excellent means for such predictive assessment.
Patients and Methods
Study population
The Osteoarthritis Initiative (OAI) is a longitudinal, multi-center study of knee OA of 4,796 individuals aged 45–79. The target knee of the participants (details in Supplementary Materials) were selected from the Progression subcohort of the OAI database (https://oai.nih.gov) and magnetic resonance imaging (MRI) were obtained from the baseline time point of the OAI cohort as previously described12. The knees of 678 participants were selected for this study and participant characteristics are described in Supplementary Table S1a. Participants were further divided according to their BMI: obese (BMI ≥ 30 kg/m2, High (H)-BMI; n = 341) and non-obese (BMI < 30 kg/m2, Low (L)-BMI; n = 337) and further by gender (female, n = 290, male n = 388).
Serum samples and biomarker determination
The serum samples were obtained from the baseline time point of the OAI cohort as previously described12. Serum samples were received aliquoted and frozen. Upon reception, they were stored at −80 °C and thawed to 4 °C before use. In brief, morning blood specimens were collected after an overnight fast using a uniform protocol. Additional details on specimen collection and processing methods can be found in the OAI operations manuals (https://oai.nih.gov).
The markers included the three main OA risk factors (age, gender and BMI), 9 serum biomarkers (6 adipokines: adiponectin high [H] and low [L] molecular weight [MW], adipsin, chemerin, leptin, visfatin, and three related inflammatory factors: C-reactive protein [CRP], interleukin [IL]-8, monocyte chemoattractant protein-1 [MCP-1]), and their 36 ratios. All biomarkers were determined with specific assays according to the manufacturers’ specifications as previously described (for details about the methodology for Biomarker determination, refer to Supplementary Materials).
All OAI participants provided written informed consent for participation in the OAI. Ethics approval was obtained by each OAI clinical site (University of Maryland Baltimore—Institutional Review Board, Ohio State University’s Biomedical Sciences Institutional Review Board, University of Pittsburgh Institutional Review Board, and Memorial Hospital of Rhode Island Institutional Review Board) and the OAI coordinating center (Committee on Human Research at University of California, San Francisco, CA, USA).
The Institutional Ethics Committee Board of the University of Montreal Hospital Research Centre approved the use of the human serum.
All methods, including serum measurements, were performed in accordance with the relevant guidelines and regulations.
Infrapatellar fat pad (IPFP) volume assessment using MRI
Knee MRI acquisitions were performed at the four OAI imaging centers using a 3 T apparatus (Magnetom Trio, Siemens, Erlangen, Germany). The MR coronal intermediate weighted (COR IW) 2D TSE sequence as defined by the OAI protocol20 was used for segmentation. The automated IPFP segmentation was done using a convolutional neural network (CNN). In brief, it implements a multi-atlas segmentation21 approach based on a multi-label pixel wise CNN22 trained on 2 labels: the first representing the IPFP and the second the knee joint bones including femur, tibia, patella and fibula as a single class. The core of the technology is a U-net architecture implemented in the MxNet framework which includes 5 convolution + pooling layers followed by 5 deconvolution + pooling layers23. The IPFP volume is expressed in mm3. Validation done on 38 individuals (Supplementary Table S1b) and comparing the manual segmentation with the developed CNN methodology revealed an excellent correlation coefficient, R = 0.90. Detailed methods are in Supplementary Materials.
Determination of the best variable combinations for IPFP determination
As a first step, the correlation coefficient (normalized version of the covariance) values are performed in order to measure the significance between the factors under study. Although the values of such a measure show both the tendency and strength of the linear relationship among two variables, when there are more than two interventions, the interaction between all variables should be evaluated with a more appropriate method. The latter should take into account the relationship and interactions between two or more factors. Therefore, and in order to consider all possible combinations between the factors under study on IPFP volume, we further used an evolutionary-based variable selection method for selecting the best variable combinations in IPFP volume prediction from 48 variables. The best variable combinations were determined using the evolutionary algorithm particle swarm optimization (PSO), which is combined with an artificial neural network (ANN) and Monte-Carlo simulation (MCS).
Although artificial intelligence techniques have been developed to address different practical needs in different areas, each of these methods has different pros and cons. For this study, 2.81E + 14 sub-variables should be analyzed to identify the best ones (Supplementary Table S2). Therefore, in feature selection, ANN was chosen as it could address our main goal, which was to develop an easy approach for implementation via the ability of classification, and pattern recognition (detailed method in Supplementary Materials). Moreover, as the dataset may possess some redundant or irrelevant features, the extraction of coherent information requires a comprehensive search over the sample space, while evolutionary algorithms accelerate the learning process to solve the problem. In order to find the best variable among models for IPFP volume determination with the sub-variables, a PSO-based variable selection algorithm was encoded as such technology has been successfully applied to various areas including feature selection24,25,26,27. A PSO algorithm was employed to select the most effective features as it can balance exploration and exploitation in an optimal manner by combining local and global search methods through self and neighboring experiences28. Other advantages of PSO that were needed in the present study are the fast search speed, simple algorithm structure with few adjustable parameters, and possessing memory.
To find the optimal variables (dimensional [12 dimensional: three risk factors and the serum levels of 9 adipokines/inflammatory factors], ratios, or both) for IPFP volume prediction models, PSO-based variable selection method was used and was calculated as follows:
where C is the number of possible combinations, N is the number of desired variables and M is the number of all sub-variables. According to the above equation, the number of sub-variables of 1 to 48 variables, as in the Supplementary Table S2, is 2.81E + 14 sub-variables that should be analyzed to identify the best ones.
For each group, 70% of the values were randomly assigned for use in training and the remaining (30%) for testing. k-fold cross-validation was employed to detect over-fitting and to verify the generalizability of the proposed models. In k-fold cross-validation, all samples were randomly divided into k categories. In each modeling period, k-1 categories were deemed training samples and the remaining sub-data were used as the testing samples. This process was repeated k times. Therefore, all samples were applied at least once as test samples. The testing samples were randomly selected from all samples. Of note, the testing dataset had no role in modeling development and is completely different from the training data. Here, by considering k = 4, the data set was split into 4 folds.
Considering all input combinations, a PSO-based variable selection method was developed (detailed methods in Supplementary Materials) to identify the most important sub-variables through proposed variable selection approach; Supplementary Fig. S1 illustrates the flow chart of the PSO-based variable selection method. In addition, the Monte Carlo Selection is employed to overcome the uncertainty in modeling results. Forty-eight (48) models were then selected.
Next, the adaptive neuro-fuzzy inference system embedded with fuzzy c-means clustering (ANFIS-FCM)29 was employed to predict IPFP volume. ANFIS is a multi-layer feedforward system that uses neural network learning algorithms and fuzzy logic to map an input space onto an output space (detailed method in Supplementary Materials and flowchart in Supplementary Fig. S2). Accordingly, all 48 variables were called, along with IPFP volume as the target variable, and all samples were divided into two categories of training (70% of the database) and testing (remaining 30%). The number of variables was then determined, and as in Supplementary Fig. S3 can vary from 1 to 48. After determining the variation range of the number of desired variables, and in order to evaluate the model performance in different stages, the weight of the training and testing errors was determined. This weight is proportional to the intended percentage for each stage (i.e. training and testing).
Uncertainty analysis
The uncertainty of the ANFIS-FCM prediction of the desired variables was applied as described30 and included the standard deviation of the forecasting error (SDFE) and the 95% confidence band width of uncertainty band (WUB) (detailed formulas in Supplementary Materials). Although it is well-known that performing these analyses might benefit the established ANFIS-FCM model, it will also make a reasonable comparison to expose the prediction vulnerabilities of the different models when compared to actual data.
Performance evaluation criteria
The different statistical indices for validating the performance of the developed ANFIS-FCM in IPFP volume prediction were the scatter index (SI), correlation coefficient (R), mean absolute percentage error (MAPE), and root mean squared relative error (RMSRE) (detailed formulas in Supplementary Materials).
Reproducibility of the proposed model
Reproducibility of the ANFIS-FCM code was done using another cohort; patients with symptomatic knee OA selected from a multicenter, randomized, double-blind study evaluating the effect of Licofelone (a lipoxygenase/cyclooxygenase inhibitor) versus Naproxen (a cyclooxygenase 2 inhibitor)31. Eighty (80) OA patients (female, n = 57; male, n = 23) were used. Patients’ characteristics are described in the Supplementary Table S1a. The original study was approved by the respective local ethics committees and all patients gave their oral and written informed consent to participate, including permission for the use of serum to be collected for biomarker assessment. Samples from this cohort were used only for validation and had no role in the modeling development.
Results
Selecting the best sub-variables combination based on PSO-based variable selection (PSOBVS)
Firstly, the correlation coefficient was performed (Supplementary Table S3) between 48 variables (defined as dimensional, the three risk factors and the serum levels of nine adipokines/inflammatory factors; and their 36 ratios) and the IPFP volume. Data indicated that the correlation coefficient between the three confounding variables (age, gender, BMI) were very low. For the other factors tested and the confounding variables, the highest correlation coefficient were between gender and IPFP volume (0.58). For the adipokines, the highest correlation coefficient were found between the adiponectin HMW and adiponectin LMW (0.58); this is not surprising as both are from the same molecule32,33,34.
Next and to find the best sub-variables in IPFP volume prediction of the 48 variables, the PSOBVS was implemented through an iterative process for the Total cohort, High-BMI and Low-BMI datasets. The results in the Supplementary Table S4 represents the models with 1 to 48 variables and were obtained by considering all sub-variables related to the parametric models with 1–48 variables (Supplementary Materials, Equation 2). For each of the three groups (Total cohort, High-BMI and Low-BMI), a total of 1,712,304 models (Supplementary Table S2) were verified. The Akaike Information Criterion (AIC) index (Supplementary Materials, Equation 5) was applied to select the best sub-variables, such that the model with the lowest AIC value was deemed the best model. According to Supplementary Table S2, the total sub-variables to be investigated and that are needed to identify the top 48 sub-variables are 2.81E + 14. Since this study investigates three groups (Total cohort, Low-BMI and High-BMI), 8.44E + 14 different sub-variables were investigated and 48 sub-variables were selected (Supplementary Table S4).
Data revealed that the best variables selected corresponded for each group of the model #5 (Supplementary Table S4) consisting of 5 variables: age, gender, BMI, adipsin/CRP for each group, combined for the Total cohort with adipsin/chemerin; High-BMI, chemerin/adiponectin HMW; and Low-BMI, IL-8.
By using ANFIS-FCM, the performance of the variables selected for each group was determined using scatter plot (Fig. 1) and statistical indices (Fig. 2). Data showed that for the Total cohort (Fig. 1a), High-BMI (Fig. 1b) and Low-BMI (Fig. 1c) in testing stage, each model estimates the IPFP volume prediction in most samples with low relative error, so that only 85% of the samples having a relative error of less than 15% and more than 75% of the samples estimate IPFP volume with relative error less than 10%. The comparison of the statistical indices (Fig. 2) for training and testing stages indicates that the developed model performs similarly in each stage, but the Total cohort shows a slightly higher performance in testing stage compared to the training one, for all the statistical indices.
Evaluation of the inverse ratio of the selected combinations
For the validation of the selected ratios, we further investigated the performance of ANFIS-FCM in IPFP volume prediction using their inverse forms. More specifically, we examined whether the inverse of the selected ratios for the IPFP volume modeling will have an impact on the results. The statistical indices for each group studied revealed (Supplementary Fig. S4) that compared to the ratio of the main form (form used during the development), the inverse form of the ratio within each model showed, for both training and testing stages, small differences in statistical indices in favor of the main form. Accordingly, using the inverse form of the ratios rather than the main form results in slightly inferior performance, confirming the superior performance of the main form for each group.
Impact of each variable of the main form
Next, we investigated the impact of each variable for each group on the IPFP volume prediction. To this end, 25 models with 2 to 5 sub-variables of the model #5 were evaluated for each group: Total cohort, High-BMI and Low-BMI. As illustrated in Table 1, the models (M) M1, M2-M6, M7-M15, M16-M25 correspond to 5, 4, 3 and 2 sub-variables respectively and M2-M25 were compared with the selected model with 5 variables, M1 including age, gender, BMI, adipsin/CRP in addition to the Total cohort of adipsin/chemerin; High-BMI, chemerin/adiponectin HMW; and Low-BMI, IL-8.
Data showed for the Total cohort (Fig. 3 Total Cohort a) within the models with 4 variables (M2-M6), that the lowest performance was related to the model that comprises BMI, age, adipsin/chemerin and adipsin/CRP. This means that gender was the most important among the 5 selected variables. After M1, the best result was obtained for M4, which shows that age has the lowest effect. Among those with 3 sub-variables (M7-M15) (Fig. 3 Total Cohort b), the best included the variables BMI, gender and adipsin/CRP, while not using gender and adipsin/CRP (two of three variables in the best selected ones) led to the worst model. Among the models with 2 sub-variables (M16-M25) (Fig. 3 Total Cohort c), the best included BMI and gender and the worst adipsin/CRP and adipsin/chemerin. A comparison of M1 with those consisting of 2 sub-variables (Fig. 3 Total Cohort d) indicated that fewer variables result in weaker model performance. Thus, the most effective variables were gender and BMI for this group.
For the High-BMI, the results at Fig. 3 indicated that gender is the most effective variable in IPFP volume prediction. Hence, not using this variable leads to the worst model (Fig. 3 High-BMI a). The lowest effect is from age, as the model including BMI, gender, adipsin/CRP, chemerin/adiponectin HMW performed the best with four variables (Fig. 3 High-BMI a). From the models with 3 variables (Fig. 3 High-BMI b), removing gender and adipsin/CRP result in the highest error in IPFP volume prediction. Adipsin/CRP was selected as the most effective variable in the models with 3 variables (Fig. 3 High-BMI b) and using only BMI and adipsin/CRP results in the worst model (Fig. 3 High-BMI b). As shown in Fig. 3 (High-BMI c), not using risk factors as variables leads to the worst model. Therefore, it can be concluded that although adipsin/CRP is important, gender demonstrated a higher significance. Similar to the data of the Total cohort, the High-BMI showed that less variables lead to higher error in IPFP volume prediction (Fig. 3 High-BMI d).
Likewise, for the Low-BMI group (Fig. 3 Low-BMI), gender was selected as the most effective variable in IPFP volume prediction. Accordingly, the combination of BMI and gender comprises the most effective sub-variables of models with 2 variables (Fig. 3 Low-BMI c), and not using these variables simultaneously resulted in the worst model with 3 sub-variables (Fig. 3 Low-BMI b). In models with 2 variables (Fig. 3 Low-BMI c), gender and adipsin/CRP were selected as the best sub-variables. As for the other groups, less variables resulted in lower performance.
Altogether, these data revealed that for each of the three groups studied, gender was the most effective, followed by adipsin/CRP.
Impact of gender separation on IPFP volume prediction
As previous literature indicates that IPFP volume is significantly greater in male compared to female individuals35,36,37, and the previous section demonstrates that gender is the most important variable, we thus examined the impact of gender. Each group was further separated by female and male and their respective model #5 (Supplementary Table S4) was tested for IPFP volume prediction using the ANFIS-FCM. As illustrated in Fig. 4, the effect of gender separation for Total cohort, High-BMI and Low-BMI represented in Taylor diagrams confirm for all groups the superior performance of the model that considers gender separation compared to the main one comprising female and male together. Figure 5 illustrates the MAPE results of the modeling, taking gender. Data showed that gender separation yields superior performance compared to the original model for the Total cohort and High-BMI. In the Low-BMI, both gender and BMI-based was about similar for the females, but for the males, the BMI-based model made for a slightly better performance in IPFP volume prediction. The MAPE values were improved 13% and 16% for the Total cohort, 16% and 14% for the High-BMI for female and male respectively.
As illustrated in Supplementary Table S5, the WUB demonstrated that the Total cohort had the best statistical indices with gender separation (i.e. lower of each of these indices), indicating that gender separation leads to a model superior to the original for reliable forecasting of IPFP volume value.
ANFIS-FCM based pseudocodes (algorithms) for IPFP volume prediction based on gender separation were then generated for the Total cohort (Supplementary Materials).
Reproducibility of the proposed model
In order to evaluate the reproducibility of the developed ANFIS-FCM gender-based models, a dataset of OA patients from a clinical trial for Licofelone/Naproxen, was employed31. Patients from this cohort, compared to the OAI participants, had a higher percentage of females and higher WOMAC subscores (Supplementary Table S1a).
According to the above data in which the Total cohort with gender separation had better WUB values than those with using BMI separation, the Licofelone/Naproxen cohort was separated according to gender.
Fig. 6 shows the scatter plot and statistical indices for the predicted IPFP volume vs. observed data for the Naproxen/Licofelone female and male cohorts using the described pseudocodes (Supplementary Materials). Data revealed that most data are located around the exact line and in the ±15% error range (Fig. 6), in addition to both genders having a high R (female, 0.83; male, 0.95) and low RMSRE (0.15 and 0.07, respectively) (Fig. 7). Therefore, the proposed ANFIS-FCM pseudocodes for the IPFP volume prediction are highly reliable and generalizable.
Discussion
Early detection of OA progressors is crucial but remains a challenge. This study is the first step aiming toward this goal. Hence, as IPFP is now considered an early marker of OA incidence or progression5,6,7,8,9, being able to predict this tissue volume with serum biomarkers could enable a next step in which this information will be used to predict/discriminate OA progressors from non-progressors.
Here, a comprehensive approach employing a machine learning method was carried out with some adipokines/related inflammatory factors and their ratios to predict the IPFP volume of OA patients. Such a technique was chosen due to the high amount of sub-variables, as it would have been difficult to investigate all the possible input combinations with traditional statistics modeling. Findings demonstrate that the combination of the serum levels of some adipokines/inflammatory factors and the three main OA risk factors predicted with high accuracy and reproducibility the IPFP volume. More specifically, for each group studied, the best model included only 5 variables for predicting the IPFP volume in which, in addition to age, gender, BMI and adipsin/CRP, the factor/ratio for each group was for Total cohort, adipsin/chemerin; High-BMI, chemerin/adiponectin HMW; and L-BMI, IL-8. The fact that age, gender and BMI were included in the best models for each group is not surprising, as in addition to being the three main risk factors for OA, each of them was also found to be associated with IPFP size35,38,39. However, further investigation of the impact of each variable on IPFP volume prediction in each group indicated that gender was the most effective variable and data separation based on gender improved the prediction results compared to the BMI-based models. For each gender, we then generated an ANFIS-FCM pseudocode with the 5 variables for predicting IPFP volume, in the form of an evolutionary computation equation. Notably, and of high clinical significance, the reproducibility experiment performed with symptomatic OA patients from a clinical trial and representing OA patients with a higher disease severity (higher WOMAC), then a real life scenario, demonstrated excellent coefficient correlation and statistical indices for each gender.
The finding that each model comprises at least a ratio of the serum levels of adipokines/inflammatory factors corroborates the usefulness of ratios as valuable and reliable predictive tools. Although the selected models should be used as a combination of the 5 indicated variables, it is interesting to note that, for each group studied; the adipsin/CRP ratio was included and is the most important one after gender. This indicates that this particular ratio is a suitable marker for the disease and not only for a particular sub-group.
Moreover, it is suggested that a ratio is a better predictor when both factors have different roles in a given disease process. This is in line with the ratios chosen as the best model for each group. Hence, in OA, adipsin and CRP were reported to present distinct roles. Adipsin is a structural component of the alternative complement pathway, involved in energy metabolism, associated in OA with cartilage volume loss and suggested to be involved in this tissue’s catabolic process12,40,41,42. In contrast, CRP is a systemic marker of inflammation. The other factors selected as a ratio for the Total cohort, adipsin/chemerin, also have a different role, as chemerin appears to be an inflammatory factor43,44 in addition to being found associated with the disease activity of an inflammatory (auto-immune) arthritis disease: rheumatoid arthritis45. Likewise, for chemerin/adiponectin HMW selected in the High-BMI group, although chemerin is considered an inflammatory factor, adiponectin demonstrated a protective role in OA articular tissues46. In addition, these two adipokines were found to contribute reciprocally to the development of metabolic syndrome47, a syndrome which suggested to contribute to some extent to the development of OA48,49.
IL-8 was the only individual factor included in a model and was found for the Low-BMI group. This interleukin is considered a pro-inflammatory mediator and acts as a chemoattractant for neutrophils50,51, with an increased expression in OA52,53. Interestingly, IL-8 has recently been associated with signal intensity alteration in the IPFP of OA patients54, although its specific role in such alteration remains to be identified.
Data also revealed that the main form of the ratio used for the model was justified, as the use of the inverse form slightly decreased the performance of the model in both training and testing stages.
In addition to the models and the pseudocodes generated for each gender, this study has two other major strengths. First, as no direct artificial intelligence technologies exist for the amount of the sub-variables, 2.81E + 14, to be tested to identify the best model for each group, we developed a selection approach based on PSO to find the best sub-variables for IPFP volume. Second, for the validation of the models, a cohort of symptomatic knee OA patients from a Canadian clinical trial was used, establishing the generalizability of our prediction models, at least in North America, as the participants used to generate the pseudocodes were from the United States.
By using the models for each gender and the pseudocodes for OA patients provided in this study, the next step will be to develop a predictive IPFP volume tool for OA progressors. Such a prediction model will be of importance not only in identifying disease status, but also for optimal patient management, as well as for the stratification of OA patients for drug discovery.
As in all studies, there are limitations. First, we chose a given panel of adipokines and related inflammatory factors that have been associated with OA. However, many other adipokines/related inflammatory factors have been reported and could be further tested. One could question the use of adipokines as biomarkers. Although there are several reports and many biomarkers being tested for OA progression, to date, none has been found sufficiently discriminating for an accurate diagnostic or predict prognosis; therefore, we elected to choose factors other those that are usually tested. However, adipokines were not chosen randomly, but selected, since their major source is the IPFP. Thus, upon production, these factors could be released early into the joint as well as in the circulation. Another limitation of this study is that we could have selected other articular structures such as the cartilage degradation as the outcome instead of the IPFP size. The reason for not having chosen the cartilage is that such an association, with the factors under study, was already done by our group12, but more importantly, this tissue alteration does not occur early during the disease process. Other articular structures could have been selected; however, as with the cartilage, for the majority of them their alterations were not occurring at early stages. In contrast, the IPFP has been reported to be associated not only with the progression of OA, but also with its onset; thus, before measurable cartilage degeneration and many of other articular structure alteration. Finally, among all other IPFP parameters that could have been used as an outcome instead of the volume, were the tissue area and signal intensity (hypointense/hyperintense)5,7,37,55,56,57. However, to date the methods used for the evaluation of those parameters have not yet been fully standardized nor a consensus established. In regard to the IPFP volume, as determine in the present work, this measure is quantitative and from the perspective of creating a diagnosis/prognostic tool, the volume can be assessed from the images of the T2 weighted fat-saturated sequence commonly included in a standard diagnostic MRI exam of the knee.
In conclusion, we have developed an evolutionary-based variable method for selecting the best variables combinations in IPFP volume prediction, using a panel of adipokines/related inflammatory factors, state-of-the-art machine learning methodologies in which the IPFP size determination can be obtained with high accuracy and based on only 5 variables, which was confirmed with an external cohort from a clinical trial. Moreover, this study also provides pseudocodes of IPFP volume prediction for each gender.
Data availability
Data used for this study were from the OAI database, which is publicly available online (https://nda.nih.gov/oai/). The data sets generated and/or analyzed during the current study are included in this published article or available from the corresponding author on reasonable request.
References
Martel-Pelletier, J. et al. Osteoarthritis. Nat Rev Dis Primers 2, 16072, https://doi.org/10.1038/nrdp.2016.72 (2016).
Arden, N. & Nevitt, M. C. Osteoarthritis: epidemiology. Best Pract Res Clin Rheumatol 20, 3–25 (2006).
2018 World Population Data Sheet, https://www.prb.org/wp-content/uploads/2018/08/2018_WPDS.pdf (2018).
Stephen, J. M. et al. The infrapatellar fat pad is a dynamic and mobile structure, which deforms during knee motion, and has proximal extensions which wrap around the patella. Knee Surg Sports Traumatol Arthrosc 26, 3515–3524, https://doi.org/10.1007/s00167-018-4943-1 (2018).
Davis, J. E. et al. Effusion-synovitis and infrapatellar fat pad signal intensity alteration differentiate accelerated knee osteoarthritis. Rheumatology (Oxford) 58, 418–426, https://doi.org/10.1093/rheumatology/key305 (2019).
Masaki, T. et al. Volume change in infrapatellar fat pad is associated not with obesity but with cartilage degeneration. J Orthop Res 37, 593–600, https://doi.org/10.1002/jor.24201 (2019).
Harkey, M. S. et al. Early pre-radiographic structural pathology precedes the onset of accelerated knee osteoarthritis. BMC Musculoskelet Disord 20, 241, https://doi.org/10.1186/s12891-019-2624-y (2019).
Wang, K. et al. Signal intensity alteration within infrapatellar fat pad predicts knee replacement within 5 years: data from the Osteoarthritis Initiative. Osteoarthritis Cartilage 26, 1345–1350, https://doi.org/10.1016/j.joca.2018.05.015 (2018).
Ruhdorfer, A. et al. Between-group differences in infra-patellar fat pad size and signal in symptomatic and radiographic progression of knee osteoarthritis vs non-progressive controls and healthy knees - data from the FNIH Biomarkers Consortium Study and the Osteoarthritis Initiative. Osteoarthritis Cartilage 25, 1114–1121, https://doi.org/10.1016/j.joca.2017.02.789 (2017).
Martel-Pelletier, J., Raynauld, J. P., Dorais, M., Abram, F. & Pelletier, J. P. The levels of the adipokines adipsin and leptin are associated with knee osteoarthritis progression as assessed by MRI and incidence of total knee replacement in symptomatic osteoarthritis patients: a post hoc analysis. Rheumatology (Oxford) 55, 680–688, https://doi.org/10.1093/rheumatology/kev408 (2016).
Conde, J. et al. Differential expression of adipokines in infrapatellar fat pad (IPFP) and synovium of osteoarthritis patients and healthy individuals. Ann Rheum Dis 73, 631–633, https://doi.org/10.1136/annrheumdis-2013-204189 (2014).
Martel-Pelletier, J. et al. The ratio adipsin/MCP-1 is strongly associated with structural changes and CRP/MCP-1 with symptoms in obese knee osteoarthritis subjects: data from the Osteoarthritis Initiative. Osteoarthritis Cartilage 28, 1163–1173, https://doi.org/10.1016/j.joca.2019.04.016 (2019).
Sarray, S., Madan, S., Saleh, L. R., Mahmoud, N. & Almawi, W. Y. Validity of adiponectin-to-leptin and adiponectin-to-resistin ratios as predictors of polycystic ovary syndrome. Fertil Steril 104, 460–466, https://doi.org/10.1016/j.fertnstert.2015.05.007 (2015).
Gandhi, R., Takahashi, M., Smith, H., Rizek, R. & Mahomed, N. N. The synovial fluid adiponectin-leptin ratio predicts pain with knee osteoarthritis. Clin Rheumatol 29, 1223–1228, https://doi.org/10.1007/s10067-010-1429-z (2010).
Rueda-Clausen, C. F. et al. The presence of abdominal obesity is associated with changes in vascular function independently of other cardiovascular risk factors. Int J Cardiol 139, 32–41, https://doi.org/10.1016/j.ijcard.2008.09.005 (2010).
Galluccio, E. et al. Hyperinsulinemia and impaired leptin-adiponectin ratio associate with endothelial nitric oxide synthase polymorphisms in subjects with in-stent restenosis. Am J Physiol Endocrinol Metab 294, E978–986, https://doi.org/10.1152/ajpendo.00003.2008 (2008).
Oda, N. et al. The ratio of leptin to adiponectin can be used as an index of insulin resistance. Metabolism 57, 268–273, https://doi.org/10.1016/j.metabol.2007.09.011 (2008).
Norata, G. D. et al. Leptin:adiponectin ratio is an independent predictor of intima media thickness of the common carotid artery. Stroke 38, 2844–2846, https://doi.org/10.1161/STROKEAHA.107.485540 (2007).
Satoh, N. et al. Leptin-to-adiponectin ratio as a potential atherogenic index in obese type 2 diabetic patients. Diabetes Care 27, 2488–2490 (2004).
Peterfy, C. G., Schneider, E. & Nevitt, M. The osteoarthritis initiative: report on the design rationale for the magnetic resonance imaging protocol for the knee. Osteoarthritis Cartilage 16, 1433–1441, https://doi.org/10.1016/j.joca.2008.06.016 (2008).
Iglesias, J. E. & Sabuncu, M. R. Multi-atlas segmentation of biomedical images: A survey. Medical image analysis 24, 205–219, https://doi.org/10.1016/j.media.2015.06.012 (2015).
Long, J., Shelhamer, E. & Darrell, T. Fully convolutional networks for semantic segmentation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, 2015, pp. 3431–3440., https://doi.org/10.1109/CVPR.2015.7298965 (2015).
Ronneberger, O., Fischer, P. & Brox, T. In Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015. LNCS Vol. 9351 (eds N. Navab, J. Hornegger, W.M. Wells, & A.F. Frangi) 234-241 (Springer, 2015).
Xiong, B. L., Huang, Y., Shi, M., Du, W. & Yang, M. Y. Feature selection of input variables for intelligence joint moment prediction based on binary particle swarm optimization. IEEE Access 7, 182289–182295 (2019).
Sakri, S. B. R. N.B.A.; Zain, Z.M. Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access 6, 29637–29647 (2018).
Shilaskar, S. G. & Chatur, A. P. Medical decision support system for extremely imbalanced datasets. Inform Sciences 384, 205–219 (2017).
Sharkawy, R. M. I., Salama, K. & Bartnikas, M. M. A. R. Particle swarm optimization feature selection for the classification of conducting particles in transformer oil. IEEE Trans Dielectr Electr Insul 18, 1897–1907 (2011).
Zhu, Q. Q. L., Li, Y. & Zhu, S. An improved particle swarm optimization algorithm for vehicle routing problem with time windows. IEEE International Conference on Evolutionary Computation, 1386–1390 (2006).
Moradi, F. et al. Abutment scour depth modeling using neuro-fuzzy-embedded techniques. Mar Georesour Geotec 37, 190–200, https://doi.org/10.1080/1064119X.2017.1420113 (2019).
Azimi, H., Bonakdari, H. & Ebtehaj, I. A highly efficient gene expression programming model for predicting the discharge coefficient in a side weir along a trapezoidal canal. Irrig. Drain 66, 655–666, https://doi.org/10.1002/ird.2127 (2017).
Raynauld, J. P. et al. Protective effects of licofelone, a 5-lipoxygenase and cyclo-oxygenase inhibitor, versus naproxen on cartilage loss in knee osteoarthritis: a first multicentre clinical trial using quantitative MRI. Ann Rheum Dis 68, 938–947, https://doi.org/10.1136/ard.2008.088732 (2009).
Krysiak, R., Handzlik-Orlik, G. & Okopien, B. The role of adipokines in connective tissue diseases. Eur J Nutr 51, 513–528, https://doi.org/10.1007/s00394-012-0370-0 (2012).
Shehzad, A., Iqbal, W., Shehzad, O. & Lee, Y. S. Adiponectin: regulation of its production and its role in human diseases. Hormones (Athens) 11, 8–20 (2012).
Neumeier, M. et al. Different effects of adiponectin isoforms in human monocytic cells. J Leukoc Biol 79, 803–808, https://doi.org/10.1189/jlb.0905521 (2006).
Diepold, J. et al. Sex-differences of the healthy infra-patellar (Hoffa) fat pad in relation to intermuscular and subcutaneous fat content–data from the Osteoarthritis Initiative. Ann Anat 200, 30–36, https://doi.org/10.1016/j.aanat.2014.12.004 (2015).
Teichtahl, A. J. et al. A large infrapatellar fat pad protects against knee pain and lateral tibial cartilage volume loss. Arthritis Res Ther 17, 318, https://doi.org/10.1186/s13075-015-0831-y (2015).
Han, W. et al. Infrapatellar fat pad in the knee: is local fat good or bad for knee osteoarthritis? Arthritis Res Ther 16, R145, https://doi.org/10.1186/ar4607 (2014).
Burda, B. et al. Variance in infra-patellar fat pad volume: Does the body mass index matter?-Data from osteoarthritis initiative participants without symptoms or signs of knee disease. Ann Anat 213, 19–24, https://doi.org/10.1016/j.aanat.2017.04.004 (2017).
Chuckpaiwong, B., Charles, H. C., Kraus, V. B., Guilak, F. & Nunley, J. A. Age-associated increases in the size of the infrapatellar fat pad in knee osteoarthritis as measured by 3T MRI. J Orthop Res 28, 1149–1154, https://doi.org/10.1002/jor.21125 (2010).
Valverde-Franco, G. et al. High in vivo levels of adipsin lead to increased knee tissue degradation in osteoarthritis: data from humans and animal models. Rheumatology (Oxford) 57, 1851–1860, https://doi.org/10.1093/rheumatology/key181 (2018).
Lo, J. C. et al. Adipsin is an adipokine that improves beta cell function in diabetes. Cell 158, 41–53, https://doi.org/10.1016/j.cell.2014.06.005 (2014).
White, R. T. et al. Human adipsin is identical to complement factor D and is expressed at high levels in adipose tissue. J Biol Chem 267, 9210–9213 (1992).
Mariani, F. & Roncucci, L. Chemerin/chemR23 axis in inflammation onset and resolution. Inflamm Res 64, 85–95, https://doi.org/10.1007/s00011-014-0792-7 (2015).
Berg, V. et al. Human articular chondrocytes express ChemR23 and chemerin; ChemR23 promotes inflammatory signalling upon binding the ligand chemerin(21-157). Arthritis Res Ther 12, R228, https://doi.org/10.1186/ar3215 (2010).
Tolusso, B. et al. Chemerin and PEDF Are Metaflammation-Related Biomarkers of Disease Activity and Obesity in Rheumatoid Arthritis. Front Med (Lausanne) 5, 207, https://doi.org/10.3389/fmed.2018.00207 (2018).
Chen, T. H. et al. Evidence for a protective role for adiponectin in osteoarthritis. Biochim Biophys Acta 1762, 711–718, https://doi.org/10.1016/j.bbadis.2006.06.008 (2006).
Chu, S. H. et al. Chemerin and adiponectin contribute reciprocally to metabolic syndrome. PLoS One 7, e34710, https://doi.org/10.1371/journal.pone.0034710 (2012).
Lee, B. J., Yang, S., Kwon, S., Choi, K. H. & Kim, W. Association between metabolic syndrome and knee osteoarthritis: A cross-sectional nationwide survey study. J Rehabil Med 51, 464–470, https://doi.org/10.2340/16501977-2561 (2019).
Pan, F., Tian, J., Mattap, S. M., Cicuttini, F. & Jones, G. Association between metabolic syndrome and knee structural change on MRI. Rheumatology (Oxford), https://doi.org/10.1093/rheumatology/kez266 (2019).
Matsukawa, A. et al. Neutrophil accumulation and activation by homologous IL-8 in rabbits. IL-8 induces destruction of cartilage and production of IL-1 and IL-1 receptor antagonist in vivo. J Immunol 154, 5418–5425 (1995).
Baggiolini, M., Walz, A. & Kunkel, S. L. Neutrophil-activating peptide-1/interleukin 8, a novel cytokine that activates neutrophils. J Clin Invest 84, 1045–1049, https://doi.org/10.1172/JCI114265 (1989).
Hulejova, H. et al. Increased level of cytokines and matrix metalloproteinases in osteoarthritic subchondral bone. Cytokine 38, 151–156, https://doi.org/10.1016/j.cyto.2007.06.001 (2007).
Kaneko, S. et al. Interleukin-6 and interleukin-8 levels in serum and synovial fluid of patients with osteoarthritis. Cytokines Cell Mol Ther 6, 71–79, https://doi.org/10.1080/13684730050515796 (2000).
Ruan, G. et al. Associations between serum IL-8 and knee symptoms, joint structures, and cartilage or bone biomarkers in patients with knee osteoarthritis. Clin Rheumatol 38, 3609–3617, https://doi.org/10.1007/s10067-019-04718-8 (2019).
Han, W. et al. Association Between Quantitatively Measured Infrapatellar Fat Pad High Signal-Intensity Alteration and Magnetic Resonance Imaging-Assessed Progression of Knee Osteoarthritis. Arthritis Care Res (Hoboken) 71, 638–646, https://doi.org/10.1002/acr.23713 (2019).
Han, W. et al. Hypointense signals in the infrapatellar fat pad assessed by magnetic resonance imaging are associated with knee symptoms and structure in older adults: a cohort study. Arthritis Res Ther 18, 234, https://doi.org/10.1186/s13075-016-1130-y (2016).
Han, W. et al. Signal intensity alteration in the infrapatellar fat pad at baseline for the prediction of knee symptoms and structure in older adults: a cohort study. Ann Rheum Dis 75, 1783–1788, https://doi.org/10.1136/annrheumdis-2015-208360 (2016).
Acknowledgements
The authors would like to thank the OAI participants and Coordinating Center for their work in generating the clinical and radiological data of the OAI cohort and for making them publicly available. The OAI is a public-private partnership comprised of five contracts (N01-AR-2-2258; N01-AR-2-2259; N01-AR-2-2260; N01-AR-2-2261; N01-AR-2-2262) funded by the National Institutes of Health, a branch of the Department of Health and Human Services, and conducted by the OAI Study Investigators. Private funding partners include Merck Research Laboratories; Novartis Pharmaceuticals Corporation, GlaxoSmithKline; and Pfizer, Inc. Private sector funding for the OAI is managed by the Foundation for the National Institutes of Health. This manuscript was prepared using an OAI public use data set and does not necessarily reflect the opinions or views of the OAI investigators, the NIH, or the private funding partners. The authors are grateful to the investigators from the Canadian Licofelone Study Group for their collaboration on the clinical trial. The authors wish to thank André Pelletier for the MRI reading and Thomas Moser, MD, expert musculoskeletal radiologist at the University of Montreal with several years of experience in MRI reading, for training and supervision in IPFP manual MRI reading. They also thank François Mineau and Frédéric Paré for biomarker determination, Joëlle Rousseau Trépanier, statistician, for her assistance in the analysis of participant characteristics and for the validation tests, and Jacqueline Brunet for her assistance with the manuscript preparation. Dr Pelletier and Dr Martel-Pelletier’s work was funded by the Chair in Osteoarthritis of the University of Montreal and the Osteoarthritis Research Unit of the University of Montreal Hospital Research Centre. Dr Bonakdari and Dr Tardif’s work was funded by the Groupe de recherches des maladies rhumatismales du Québec and Dr Abram’s work was funded by ArthroLab Inc.
Author information
Authors and Affiliations
Contributions
H.B., G.T., F.A., J.P.P. and J.M.P. designed the study. F.A., G.T. and J.M.P. acquired the data. H.B., G.T., F.A., J.P.P. and J.M.P. analyzed and interpreted the data. All authors were involved in drafting the article or revising it critically for important intellectual content, and all authors approved the final version to be published. J.M.P. had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Corresponding author
Ethics declarations
Competing interests
J.P.P. and J.M.P. are shareholders in ArthroLab Inc. and F.A. is an employee of the company. H.B. and G.T. have no conflict of interest.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bonakdari, H., Tardif, G., Abram, F. et al. Serum adipokines/related inflammatory factors and ratios as predictors of infrapatellar fat pad volume in osteoarthritis: Applying comprehensive machine learning approaches. Sci Rep 10, 9993 (2020). https://doi.org/10.1038/s41598-020-66330-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-66330-0
This article is cited by
-
Impacts of a 12-week aerobic, resistance, and combined exercise training on serum FAM19A5, glucose homeostasis, and novel cardiovascular risk factors among adults with obesity
International Journal of Diabetes in Developing Countries (2024)
-
An Open Debate on the Morphological Measurement Methodologies of the Infrapatellar Fat Pad to Determine Its Association with the Osteoarthritis Process
Current Rheumatology Reports (2022)
-
Circulating levels of FAM19A5 are inversely associated with subclinical atherosclerosis in non-alcoholic fatty liver disease
BMC Endocrine Disorders (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.