Abstract
Laboratory data interpretation for the assessment of complex biological systems remains a great challenge, as occurs in mitochondrial function research studies. The classical biochemical data interpretation of patients versus reference values may be insufficient, and in fact the current classifications of mitochondrial patients are still done on basis of probability criteria. We have developed and applied a mathematic agglomerative algorithm to search for correlations among the different biochemical variables of the mitochondrial respiratory chain in order to identify populations displaying correlation coefficients >0.95. We demonstrated that coenzyme Q10 may be a better biomarker of mitochondrial respiratory chain enzyme activities than the citrate synthase activity. Furthermore, the application of this algorithm may be useful to re-classify mitochondrial patients or to explore associations among other biochemical variables from different biological systems.
Similar content being viewed by others
Introduction
Mitochondrial disease diagnosis is a complex process that relies on clinical, biochemical, neuroimaging, histological, and molecular data. Skeletal muscle is considered the most suitable tissue for the diagnosis of these disorders because of its availability and high metabolic rate. Biochemical measures of mitochondrial respiratory chain (MRC) enzyme activities are crucial for clinical diagnosis. The activities of MRC complexes (I–IV) are assayed spectrophotometrically, and the results are normalized to the total muscle protein content or to the activity of the mitochondrial matrix enzyme citrate synthase (CS), which is commonly used as an index of mitochondrial abundance1. Normalization to CS activity will facilitate the detection of partial enzymatic defects in diseases with compensatory mitochondrial proliferation2, while normalization to total proteins could unmask an MRC enzymatic defect in some cases of mitochondrial DNA depletion syndromes that may be associated with low-normal CS activities3,4,5,6.
Coenzyme Q10 (CoQ) is a lipid with a key role in mitochondrial oxidative phosphorylation because it is essential for electron transport from complex I and II to complex III of the MRC. The link between CoQ deficiency and dysfunction of complexes I + III and II + III is expected because low CoQ availability in mitochondria would impair the electron transfer essential for ATP production7,8,9,10. Muscle CoQ concentrations have been demonstrated to be associated with CS activity10, 11, as well as with other mitochondrial dysfunction biomarkers, such as the percentage of subsarcolemmal mitochondrion aggregates10. Furthermore, it has been suggested that total muscle CoQ is the best predictor of an MRC abnormality12. These findings strongly support the hypothesis that routine quantitative evaluation of muscle CoQ might be a new tool for both estimating MRC enzyme activities in muscle biopsies and diagnosing CoQ deficiency states10, 12.
With this background, our aim was to develop and apply an exploratory statistical procedure to assess muscle CoQ content and CS activity as biomarkers of mitochondrial activity evaluated by the analysis of MRC enzyme activities. After the initial statistical assessment, subpopulations of individuals displaying a high linear correlation coefficient among the different biochemical variables were identified.
Material and Methods
Patients
During the last 15 years, we have studied 448 muscular biopsies from patients suspected of mitochondrial disorders (age range 1 month-16 years; mean: 3.6 years). Results of both CoQ and CS were available in 447 samples. Of this population, 179 showed normal results for CoQ levels and all MRC enzymes and citrate synthase activities. Data were compared with those of a previously reported control population (N = 37; age range 2–16 years; average 9.2 years)11.
Ethical issues
The study was approved by the ethical committee of Hospital Sant Joan de Déu. Patients or their parents signed informed consent. All methods, including the obtaining of tissue samples from patients and controls, were carried out according to the Helsinki Declaration of 1964, as revised in 2001.
Biochemical studies
Muscle biopsies were taken and prepared according to standard procedures. NADH:cytochrome c oxidoreductase (complex I + III), succinate:cytochrome c reductase (complex II + III), succinate dehydrogenase (complex II), ubiquinol-cytochrme C oxidoreductase (complex III), cytochrome C oxidase (complex IV) and CS activities were determined using described spectrophotometric methods13, 14. Enzyme activity results were expressed as nmol/min* mg of protein and mUnits/CS Units. Total muscle CoQ levels were determined by reverse-phase high-pressure liquid chromatography (HPLC, Waters, MA, USA) with electrochemical detection (Coulochem II, ESA, MA, USA) (Montero et al., 2008). The CoQ values were expressed as nmol/gram of total protein content measured by the Lowry method15.
Statistical methods
Pearson linear correlation coefficients were initially computed between MRC enzyme activities, CS activity and CoQ content in muscle homogenates from patients. Statistical significance was evaluated using p < 0.01. Calculations were performed using the R program (version 3.2.3). See, for example Ugarte et al. 16.
Two statistical procedures were developed to further explore the correlations detected amongst the different biochemical variables. In particular, two algorithms were implemented to identify subpopulations of individuals in which a high correlation was reached (r > 0.95).
-
1)
Agglomerative procedure: Initial linear axes were found using the robust algorithm developed by García-Escudero et al. 17, which was specifically designed to detect linear clusters. Then, an iterative procedure was implemented by adding individuals to the initial axes until a fixed high correlation was achieved. The procedure was specifically constructed as follows.
Let X and Y be the variables of interest (that initially showed linear association), and let M (0) be the set consisting of the three nearest points to the simple regression line Yi = β 0 + β 1X1 + ε i (calculated by ordinary least squares) fitted over the individuals selected by the linear clustering method implemented by García-Escudero et al. 17; i.e., the three individuals with the smallest residuals \(|\hat{\varepsilon }|=|{Y}_{i}-{\hat{Y}}_{i}|\) are selected. Next, a “correlation loss measure” between the set M (0) and any other individual i is defined as
$$CLM({M}^{(0)},i)=\rho ({X}_{{M}^{(0)}},{Y}_{{M}^{(0)}})-\rho ({X}_{{M}^{(0)}\cup \{i\}},{Y}_{{M}^{(0)}\cup \{i\}}),$$(1)where \(\rho ({X}_{{M}^{(0)}},{Y}_{{M}^{(0)}})\) is the Pearson’s correlation coefficient between the variables X and Y within the set M (0).
1. Let \({M}^{(1)}={M}^{(0)}\cup \{{i}^{\ast }\}\) be a new set where i * is the individual that verifies
$$CLM({M}^{(0)},i\ast )\le CLM({M}^{(0)},i),\,\,{\rm{for}}\,{\rm{all}}\,i$$(2)i.e., i * is the individual with the smallest correlation loss.
2. Step 1 is repeated while \(\rho ({X}_{{M}^{(k)}}{,}_{{M}^{(k)}})\ge {r}^{\ast }\), where r * is the desired correlation to be reached.
-
2)
Divisive procedure: Unlike the agglomerative procedure, there is no need to define an initial axis because all of the individuals are considered as starting points in this method. Then, an iterative procedure is implemented by deleting one case (individual) at each step until a fixed high correlation is achieved.
More explicitly, the procedure can be described as follows:
Let M (0) be the set of all individuals in our target population. We define the “correlation gain measure” between the set M (0) and the individual i as,
where \(\rho ({X}_{{M}^{(0)}-\{i\}},{Y}_{{M}^{(0)}-\{i\}})\) is the Pearson’s correlation coefficient between the variables X and Y within the set M (0) without the i individual.
1. Let M (1) = M (0) − {i *} be a new set where where i * is the individual that verifies
that is, the individual with the largest correlation gain.
2.Step 1 is repeated k times until \(\rho ({X}_{{M}^{(k)}},{Y}_{{M}^{(k)}})\ge {r}^{\ast }\), where r * is the desired correlation to be reached.
Both algorithms were implemented in R (version 3.2.3). After the application of the two algorithms, subpopulations displaying high correlations were identified.
Results
Biochemical results in muscle biopsies in the entire cohort of patients together with our reference values are stated in Table 1. Primary data about CoQ values, CS and CIII activities are stated in Supplementary Table 1.
Pearson correlation coefficients and significance values of primary data are stated in Tables 2 and 3 (data were normalized to muscle total protein concentration, CoQ values and CS activity). CoQ correlation coefficients with all the MRC enzyme activities were higher compared with those obtained between CS and the other MRC enzymes (Table 2). When the CoQ concentrations and MRC enzyme activities were normalized to CS activity values, the highest correlation coefficient was observed between CoQ and complex II + III (Table 3). Then, in the whole population, we used CoQ as a normalizer of the CS and MRC activities. We saw that the MRC complexes that correlate more with CS activity were II, III and IV (Table 3).
Agglomerative and divisive algorithms
From the whole population (n = 447), observations with missing data and potential outliers were removed (ranging from 1 to 7 values depending on the variable). When applying the agglomerative procedure, three initial linear axes were used to find the corresponding clusters of individuals with a high correlation (r > 0.95) between the variables CoQ and CS. The cluster with the highest number of individuals was chosen. The divisive method identified essentially the same subpopulation (98% of individuals in common, n = 214) (Fig. 1).
We started with MRC enzyme activities and their association with either CS or CoQ. In the agglomerative method, three initial linear axes were considered, except for CS and complexes I + III and II + III, in which the agglomerative algorithm did not provide any sensible results and only the divisive method was applied. We finally selected the population with the highest number of individuals and compared it with those cases selected by the divisive method, which offered only one solution. Most MRC enzyme data showed that, when correlated with CoQ, populations with a higher number of individuals were detected (Table 4), especially for CoQ-dependent enzymes (CI + III and CII + III). The percentage of cases sharing the same correlation (r > 0.95) using agglomerative and divisive methods was also higher for CoQ when compared with CS, except for complex III (Table 4).
Discussion
This is the first report to analyze CoQ and other MRC biomarkers in a large cohort of samples. We have developed a statistical algorithm to assess the feasibility of using both CoQ and CS as biomarkers for MRC activities.
After applying the first statistical approach to the different biochemical variables (Pearson single correlation test) across the whole cohort of patients, several observations were made: 1) Correlation of CoQ values with CS was high. 2) Correlation of CoQ values and MRC was strongest than correlation between CS activity and MRC (data normalized to total protein content). 3) By using other normalization strategies for MRC (with either CS or CoQ values), as expected, CoQ was highly correlated with CII + III when these activities were normalized with CS activity, and CS showed a high correlation coefficient with both CIII and IV activities when they were normalized to CoQ content. Thus, differences between the association of CoQ and CS with the different MRC activities were evident.
After these preliminary observations, the next step was to develop the algorithms to further explore these associations. We did not consider age as a potential confounding variable because it has been suggested that it is not related to the activities of most MRC enzymes and CS activities18.
The initial step of the agglomerative method looked for robust linear clusters to determine initial linear directions to start the iterative steps. Although it only provided a single linear cluster as the optimal solution, we decided to explore three different linear cluster solutions and applied the iterative steps to these three potential solutions. The cluster with the highest number of individuals was chosen. To further validate the agglomerative algorithm, we compared the final results with a divisive method, which offered a unique solution from the whole cohort of patients. Interestingly, the degree of agreement between the solutions provided by the two algorithms was very high (see Table 3). The algorithm was then able to provide subpopulations of individuals with a high linear correlation coefficient between the variables of interest. In our case, a single subpopulation seemed to be the most reasonable solution.
We chose a correlation coefficient value of 0.95 because it is remarkably high when we consider biological variables of this complexity. Notably, the number of cases where this correlation was detected was high, especially for CoQ and CS correlation, supporting the hypothesis that CoQ may be employed as a marker for MRC activity normalization. Therefore, the normalization of MRC activities to CoQ seems advisable for a better classification of mitochondrial patients and it would be a good predictor of MRC alterations10.
From the data shown in Table 4, we observed, in terms of number of individuals displaying a correlation >0.95, that CoQ was a better marker than CS for CI + II, CII + III and CII activities, while CS was better for CIII and similar for CIV activities. Furthermore, in all cases (except for CS, CI + III and II + III), the degree of agreement between agglomerative and divisive methods was very high, supporting the usefulness of this new statistical approach. One explanation for the fact that agglomerative algorithm did not provide any sensible results only for CS and complex I + III and II + III activities is probably because both MRC complexes need CoQ for a proper electron transfer. Moreover, the measurement of these complexes, especially that of CI + III, is technically complex.
Although a biological explanation is difficult, the proposed supramolecular organization of MRC could illustrate why CoQ may be a better mitochondrial biomarker than CS. The individual MRC complexes (except complex II) can assemble into different supercomplexes. Until now, the proposed supercomplexes consisted of the respirasome (complexes I, III, and IV), complexes I and III and complexes III and IV. Supercomplexes are presumed to be functional entities, which follow a fluidity model, and it is proposed to modulate respiratory chain efficiency and reactive oxygen species production19. By genetic modulation of interactions between complexes I and III and between complexes III and IV, it has been shown that these associations define a dedicated CoQ pool and organizes the electron flow to optimize the use of available substrates20. Additionally, defective mitochondrial enzymes that reduce CoQ such as Electron Transfer Flavoprotein Dehydrogenases (ETFDH) or depletion of mtDNA that induces defective respiratory complexes cause CoQ deficiencies21. It is then expected a high correlation between CoQ levels and respiratory complexes I, II and III.
In conclusion, we have developed a new algorithm for exploring associations among complex biological variables. In this specific example, we demonstrate that CoQ may be used as biomarker for MRC activities. Hence, its routine determination in the research of mitochondrial diseases seems advisable. Potential applications of the agglomerative method may be the re-classification of patients according to CoQ values, which might lead to a better understanding of mitochondrial disorders. Furthermore, this algorithm may be employed to identify populations displaying high correlations among other biological variables.
References
Reisch, A. S. & Elpeleg, O. Biochemical assays for mitochondrial activity: assays of TCA cycle enzymes and PDHc. Methods. Cell. Biol. 80, 199–222 (2007).
Wen, B. et al. Increased muscle coenzyme Q10 in riboflavin responsive MADD with ETFDH gene mutations due to secondary mitochondrial proliferation. Mol. Genet. Metab. 109, 154–160 (2013).
Spinazzi, M. et al. Optimization of respiratory chain enzymatic assays in muscle for the diagnosis of mitochondrial disorders. Mitochondrion. 11, 893–904 (2011).
Sarzi, E. et al. Mitochondrial DNA depletion is a prevalent cause of multiple respiratory chain deficiency in childhood. J. Pediatr. 150, 531–5344, 534.e1-6 (2007).
Tesarova, M. et al. Mitochondrial DNA depletion in Alpers syndrome. Neuropediatrics 35, 217–223 (2004).
Montero, R. et al. Coenzyme Q10 Deficiency Study Group. Coenzyme Q10 deficiency in mitochondrial DNA depletion syndromes. Mitochondrion. 13, 337–341 (2013).
Lamperti, C. et al. Cerebellar ataxia and coenzyme Q10 deficiency. Neurology 60, 1206–1208 (2003).
Taylor, R. W., Schaefer, A. M., Barron, M. J., McFarland, R. & Turnbull, D. M. The diagnosis of mitochondrial muscle disease. Neuromuscul. Disord. 14, 237–245 (2004).
Lalani, S. R. et al. Isolated mitochondrial myopathy associated with muscle coenzyme Q10 deficiency. Arch. Neurol. 62, 317–320 (2005).
Miles, M. V. et al. Systematic evaluation of muscle coenzyme Q10 content in children with mitochondrial respiratory chain enzyme deficiencies. Mitochondrion. 8, 170–180 (2008).
Montero, R. et al. Analysis of coenzyme Q10 in muscle and fibroblasts for the diagnosis of CoQ10 deficiency syndromes. Clin. Biochem. 41, 697–700 (2008).
Miles, L. et al. Ubiquinol: a potential biomarker for tissue energy requirements and oxidative stress. Clin. Chim. Acta. 360, 87–96 (2005).
Rustin, P. et al. Biochemical and molecular investigations in respiratory chain deficiencies. Clin. Chim. Acta. 228, 35–51 (1994).
Grazina, M. M. Mitochondrial respiratory chain: biochemical analysis and criterion for deficiency in diagnosis. Methods. Mol. Biol. 837, 73–91 (2012).
Lowry, O. H., Rosebrough, N. J., Farr, A. L. & Randall, R. J. Protein measurement with the Folin phenol reagent. J. Biol. Chem. 193, 265–275 (1951).
Ugarte, M. D., Militino, A. F. & Arnholt, A. T. Probability and Statistics with R. Second Edition, (CRC Press (Taylor & Francis Group) by Chapman and Hall: Boca Raton, London, New York, 2015).
García-Escudero, L. A., Gordaliza, A., San Martin, R., Van Aelst, S. & Zamar, R. Robust linear clustering. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 71, 301–318 (2009).
Lefai, E. et al. Enzymatic activities of mitochondrial respiratory complexes from children muscular biopsies. Age-related evolutions. Biochim. Biophys. Acta. 1228, 43–50 (1995).
Genova, M. L. & Lenaz, G. The Interplay Between Respiratory Supercomplexes and ROS in Aging. Antioxid. Redox. Signal. 23, 208–238 (2015).
Lapuente-Brun, E. et al. Supercomplex assembly determines electron flux in the mitochondrial electron transport chain. Science 340, 1567–1570 (2013).
Alcázar-Fabra, M., Navas, P. & Brea-Calvo, G. Coenzyme Q biosynthesis and its role in the respiratory chain structure. Biochim. Biophys. Acta. Pii, S0005-2728(16)30057-3 (2016).
Acknowledgements
This work was supported by grants from the Spanish Ministerio de Economia y Competitividad (Explora Ciencia SAF2013-50139-EXP, FIS PI14/00005, PI14/00028) and FEDER Funding Program from the European Union. The “Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER) is an initiative of the Instituto de Salud Carlos III. We are in debt to the Biobanc de l’Hospital Infantil Sant Joan de Déu per a la Investigació, part of the Spanish Biobank Network of ISCIII for sample procurement.
Author Contributions
M D.U. and A.R. participated in the design of the study. D.Y., R.M. and A.R. performed all the initial statistical evaluation and posterior data interpretation. A.A. and M.D.U. developed the mathematical algorithms and did the initial data exploration. D.Y, R.M., J.M., L.G., A.R., C.J-M., C.J. and R.A. provided all of the biochemical, hystopathological and molecular data. A.G-C., A.N. and M.M.O’C. were in charge of the patients and provided data about neurological explorations. All authors contributed to the critical revision of the manuscript.
Competing Interests
The authors declare no competing financial interests.
Author information
Authors and Affiliations
Corresponding author
Additional information
Additional Information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Yubero, D., Adin, A., Montero, R. et al. A statistical algorithm showing coenzyme Q10 and citrate synthase as biomarkers for mitochondrial respiratory chain enzyme activities. Sci Rep 6, 15 (2016). https://doi.org/10.1038/s41598-016-0008-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-016-0008-1
This article is cited by
-
Quantitative SUMO proteomics identifies PIAS1 substrates involved in cell migration and motility
Nature Communications (2020)
-
Electrically tunable valley polarization in Weyl semimetals with tilted energy dispersion
Scientific Reports (2019)
-
The effect of statin treatment on circulating coenzyme Q10 concentrations: an updated meta-analysis of randomized controlled trials
European Journal of Medical Research (2018)
-
Klein tunneling in Weyl semimetals under the influence of magnetic field
Scientific Reports (2016)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.